Tao do Senra

Research Bookmarks

Where to publish?

Potential Journals

Potential Conferences

See where these articles about topic modeling

Resources

= Experiments =

It bundles tools for data retrieval (Google + Twitter + Wikipedia API, web spider, HTML DOM parser), text analysis (rule-based shallow parser, WordNet interface, syntactical + semantical n-gram search algorithm, tf-idf + cosine similarity + LSA metrics), clustering and classification (k-means, KNN, SVM), and data visualization (graph networks)

== Statistical Validation ==

= Tools =

There exist several mature toolkits which deal with Vector Space Modelling. These include: - NLTK (Bird and Loper, 2004) - http://www.natlang.com/introduction-to-named-entity-recognition - Apache’s UIMA and ClearTK (Ogren et al.,2008), - Weka (Frank et al., 2005), - OpenNLP (Baldridge et al., 2002), - Mallet (McCallum, 2002), - MDP (Zito et al., 2008), - Nieme (Maes, 2009), - Gate (Cunningham, 2002), - Orange (Demsar et al., 2004)

== Web Storage and Data Sharing ==

= Related Systems =

== Cortical Algorithms ==

== Machine Learning in Python ==

== Delicious ==

== Data Sources == * Electronic Data Gathering, Analysis, and Retrieval system * Enron e-mail dataset * USGS Data Sources * Unesco SKOS Taxonomy * http://dados.gov.br/ * http://www.data.gov/ * [LOD Cloud] (http://www4.wiwiss.fu-berlin.de/lodcloud/state/) * Mathematics Subject Classification MSC2010 * Stanford Graph Datasources * NASA SKOS datasources * Library of Congress Classification Outline * Freebase - An entity graph of people, places and things, built by a community that loves open data. * CIA - The World FactBook * http://www.datawrangling.com/some-datasets-available-on-the-web * GeoNames * http://www.geonames.org/ontology/documentation.html * DBPedia * http://wiki.dbpedia.org/Downloads37 * FreeBase * http://sw.opencyc.org/ * YAGO2 * Wiki Call4Papers * Wipkipedia Topics: * http://www.sccs.swarthmore.edu/users/08/ajb/tmve/wiki100k/browse/topic-list.html * 100 Wikipedia topics * JSTOR * http://www-users.york.ac.uk/~pml1/bayes/data.htm * TREC Data * ClueWeb2009 * LinkedData Datasets * Universal Decimal Classification * Emails * pdfs * Bookmarks (delicious) - exportados

== Visualization ==

== Topic Modeling ==

= On-Going Research about Semantics and Organizations =

TODO