Difference between revisions of "TextAnalysis"
Line 26:
==Miscellaneous text analysis tools==
+ * [http://voyant-tools.org/ Voyant Tools]
+ * [http://www.laurenceanthony.net/software/antconc/ Laurence Anthony's AntConc], a GUI concordancing and text analysis toolkit
* David McClure's TextPlot (produces force-directed network of words in a text, based on estimated kernel densities)
** [http://dclure.org/essays/mental-maps-of-texts/ Blog post explaining concept]
** [http://dclure.org/tutorials/textplot-refresh/ Blog post to download and set up]
** [http://dclure.org/logs/tuning-textplot/ Blog post explaining parameters]
−
=Corpus building=
Revision as of 03:50, 24 February 2016
Resources for Exploring Text Analysis
R
- Matthew Jockers, Text Analysis With R for Students of Literature (PDF available for download via the NEU Library)
- Download and install R
- Download and install RStudio
- RSeek (search tool for finding resources on R)
- Simple data types in R
Topic Modeling
- Megan R. Brett's "Basic Introduction"
- Scott Weingart's "Guided Tour"
- Ben Schmidt's post about Topic Modeling's limitations
- MALLET
- GUI Tools
  - Google's Topic Modeling Tool (GUI instance of MALLET)
  - Stanford Topic Modeling Toolbox
  - Serendip
word2vec
- Ben Schmidt's Blog Post on Vector Space Models
  - which links to his R wrapper package for word2vec
Miscellaneous text analysis tools
- Voyant Tools
- Laurence Anthony's AntConc, a GUI concordancing and text analysis toolkit
- David McClure's TextPlot (produces force-directed network of words in a text, based on estimated kernel densities)
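
To make the phrase "estimated kernel densities" in the TextPlot entry more concrete, here is a minimal Python sketch of the general idea: estimate where in a text each word tends to occur, then treat words with similar distributions as candidates for a network edge. This is not TextPlot's own code. The function names, the Gaussian kernel, the bandwidth value, and the overlap measure are all assumptions chosen for illustration, and the force-directed layout step is left out entirely.

```python
import math
import re
from collections import defaultdict


def tokenize(text):
    """Split text into lowercase word tokens (a deliberately crude tokenizer)."""
    return re.findall(r"[a-z']+", text.lower())


def word_offsets(tokens):
    """Map each word to the list of token positions at which it occurs."""
    offsets = defaultdict(list)
    for position, token in enumerate(tokens):
        offsets[token].append(position)
    return offsets


def density(positions, length, bandwidth=5.0, samples=50):
    """Gaussian kernel density of a word's positions, sampled at evenly
    spaced points across the text and normalised to sum to 1."""
    step = length / samples
    grid = [(i + 0.5) * step for i in range(samples)]
    values = [
        sum(math.exp(-0.5 * ((x - p) / bandwidth) ** 2) for p in positions)
        for x in grid
    ]
    total = sum(values)
    if total == 0:
        return values  # no measurable density at any sample point
    return [v / total for v in values]


def overlap(a, b):
    """Shared area of two normalised densities: close to 1 when two words
    occur in the same parts of the text, close to 0 when they do not."""
    return sum(min(x, y) for x, y in zip(a, b))


# Toy text: the first half is about the sea, the second about a garden.
text = "whale sea ship " * 20 + "garden rose tea " * 20
tokens = tokenize(text)
offsets = word_offsets(tokens)
densities = {w: density(p, len(tokens)) for w, p in offsets.items()}

print("whale / sea :", round(overlap(densities["whale"], densities["sea"]), 3))
print("whale / rose:", round(overlap(densities["whale"], densities["rose"]), 3))
```

In this toy text, "whale" and "sea" occupy the same stretch and score a high overlap, while "whale" and "rose" barely overlap. A network builder could keep only the word pairs whose overlap exceeds some threshold and hand those edges to a force-directed layout; the blog posts listed under TextPlot describe the measures and parameters the tool itself uses.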