Difference between revisions of "TextAnalysis"
Line 15: | Line 15: | ||
*[http://www.scottbot.net/HIAL/?p=19113 Scott Weingart's "Guided Tour"] | *[http://www.scottbot.net/HIAL/?p=19113 Scott Weingart's "Guided Tour"] | ||
*[http://journalofdigitalhumanities.org/2-1/words-alone-by-benjamin-m-schmidt/ Ben Schmidt's post about Latent Dirichlet allocation's (LDA's) limitations] | *[http://journalofdigitalhumanities.org/2-1/words-alone-by-benjamin-m-schmidt/ Ben Schmidt's post about Latent Dirichlet allocation's (LDA's) limitations] | ||
− | *[http://mallet.cs.umass.edu/topics.php MALLET] | + | *[http://mallet.cs.umass.edu/topics.php MALLET] (An open-source Java-based LDA package) |
* GUI Tools | * GUI Tools | ||
− | ** [https://code.google.com/p/topic-modeling-tool/ Google's Topic Modeling Tool] ( | + | ** [https://code.google.com/p/topic-modeling-tool/ Google's Topic Modeling Tool] (MALLET) |
+ | ** [http://vep.cs.wisc.edu/serendip/ Serendip] (MALLET) | ||
** [http://nlp.stanford.edu/software/tmt/tmt-0.4/ Stanford Topic Modeling Toolbox] | ** [http://nlp.stanford.edu/software/tmt/tmt-0.4/ Stanford Topic Modeling Toolbox] | ||
==word2vec== | ==word2vec== |
Revision as of 03:58, 24 February 2016
Resources for Exploring Text Analysis
R
- Matthew Jockers, Text Analysis With R for Students of Literature (PDF available for download via the NEU Library)
- Download and install R
- Download and install RStudio
- RSeek (search tool for finding resources on R)
- Simple data types in R
Topic Modeling
- Megan R. Brett's "Basic Introduction"
- Scott Weingart's "Guided Tour"
- Ben Schmidt's post about the limitations of Latent Dirichlet allocation (LDA)
- MALLET (an open-source, Java-based LDA package)
- GUI Tools
  - Google's Topic Modeling Tool (MALLET)
  - Serendip (MALLET)
  - Stanford Topic Modeling Toolbox
word2vec
- Ben Schmidt's blog post on vector space models, which links to his R wrapper package for word2vec
Miscellaneous text analysis tools
- Voyant Tools
- Laurence Anthony's AntConc, a GUI concordancing and text analysis toolkit
- David McClure's TextPlot (produces a force-directed network of the words in a text, based on kernel density estimates of where each word occurs)
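
The kernel-density idea behind the TextPlot entry can be illustrated with a small Python sketch. This is not TextPlot's code or API: the function names (`tokenize`, `word_offsets`, `density`, `overlap`, `build_edges`), the Gaussian kernel, the bandwidth, and the histogram-overlap similarity are all assumptions chosen for illustration. The sketch estimates where each word tends to occur in a text, scores pairs of words by how much those distributions overlap, and returns weighted edges; the force-directed layout itself would be handled by a separate graph tool.

```python
import math
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def word_offsets(tokens):
    """Map each word to the list of token positions where it occurs."""
    offsets = defaultdict(list)
    for position, word in enumerate(tokens):
        offsets[word].append(position)
    return offsets


def density(positions, length, bandwidth=5.0, samples=50):
    """Gaussian kernel density estimate of a word's positions.

    The estimate is sampled on an evenly spaced grid over the text
    (length must be at least 2) and normalized so the samples sum to 1.
    """
    grid = [i * (length - 1) / (samples - 1) for i in range(samples)]
    values = [
        sum(math.exp(-0.5 * ((x - p) / bandwidth) ** 2) for p in positions)
        for x in grid
    ]
    total = sum(values)
    # Guard against an all-zero estimate (very small bandwidth, long text).
    return [v / total for v in values] if total > 0 else values


def overlap(a, b):
    """Similarity of two normalized densities: near 1 when they coincide,
    near 0 when the words occur in different parts of the text."""
    return sum(min(x, y) for x, y in zip(a, b))


def build_edges(text, min_count=2, neighbors=2):
    """Link each sufficiently frequent word to its most similar words."""
    tokens = tokenize(text)
    densities = {
        word: density(positions, len(tokens))
        for word, positions in word_offsets(tokens).items()
        if len(positions) >= min_count
    }
    edges = []
    for word, dens in densities.items():
        scored = sorted(
            ((overlap(dens, other), candidate)
             for candidate, other in densities.items() if candidate != word),
            reverse=True,
        )
        edges.extend((word, candidate, score)
                     for score, candidate in scored[:neighbors])
    return edges
```

Calling `build_edges(text)` on a plain-text string returns `(word, neighbor, score)` tuples, which could then be loaded into any graph library that offers a force-directed layout.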