Difference between revisions of "TextAnalysis"
Jump to navigation
Jump to search
Line 31: | Line 31: | ||
** [http://dclure.org/logs/tuning-textplot/ Blog post explaining parameters] | ** [http://dclure.org/logs/tuning-textplot/ Blog post explaining parameters] | ||
* [http://www.laurenceanthony.net/software/antconc/ Laurence Anthony's AntConc], a GUI concordancing and text analysis toolkit | * [http://www.laurenceanthony.net/software/antconc/ Laurence Anthony's AntConc], a GUI concordancing and text analysis toolkit | ||
+ | |||
+ | =Corpus building= | ||
+ | |||
+ | ==Some places to get text== | ||
+ | ===Plain text=== | ||
+ | *[https://www.gutenberg.org/Free ebooks by Project Gutenberg - Gutenberg] | ||
+ | *[http://eebo.chadwyck.com/home Early English Books Online (EEBO)] | ||
+ | *[http://omekasites.northeastern.edu/ECDA/ Early Caribbean Digital Archive (ECDA)] | ||
+ | |||
+ | ====Encoded==== | ||
+ | *[http://www.wwp.northeastern.edu/wwo/ Women Writers Online] | ||
+ | *[http://docsouth.unc.edu/ UNC's ''Documenting the American South'' Project] |
Revision as of 03:38, 24 February 2016
Resources for Exploring Text Analysis
R
- Matthew Jockers, Text Analysis With R for Students of Literature (PDF available for download via the NEU Library)
- Download and install R
- Download and install RStudio
- RSeek (search tool for finding resources on R)
- Simple data types in R
Topic Modeling
- Megan R. Brett's "Basic Introduction"
- Scott Weingart's "Guided Tour"
- Ben Schmidt's post about Topic Modeling's limitations
- MALLET
- GUI Tools
- Google's Topic Modeling Tool (GUI instance of MALLET)
- Stanford Topic Modeling Toolbox
- Serendip
word2vec
- Ben Schmidt's Blog Post on Vector Space Models
- Links to his R wrapper package for word2vec
Miscellaneous text analysis tools
- David McClure's TextPlot (produces force-directed Gephi network visualization of words in a text, based on estimated kernel densities)
- Laurence Anthony's AntConc, a GUI concordancing and text analysis toolkit
Corpus building
Some places to get text
Plain text
- ebooks by Project Gutenberg - Gutenberg
- Early English Books Online (EEBO)
- Early Caribbean Digital Archive (ECDA)