API Reference
Modules:
interfaces – Core gensim interfaces
utils – Various utility functions
matutils – Math utils
corpora.bleicorpus – Corpus in Blei’s LDA-C format
corpora.csvcorpus – Corpus in CSV format
corpora.dictionary – Construct word<->id mappings
corpora.hashdictionary – Construct word<->id mappings
corpora.indexedcorpus – Random access to corpus documents
corpora.lowcorpus – Corpus in List-of-Words format
corpora.malletcorpus – Corpus in Mallet format of List-Of-Words
corpora.mmcorpus – Corpus in Matrix Market format
corpora.sharded_corpus – Corpus stored in separate files
corpora.svmlightcorpus – Corpus in SVMlight format
corpora.textcorpus – Building corpora with dictionaries
corpora.ucicorpus – Corpus in UCI bag-of-words format
corpora.wikicorpus – Corpus from a Wikipedia dump
models.ldamodel – Latent Dirichlet Allocation
models.ldamulticore – Parallelized Latent Dirichlet Allocation
models.lsimodel – Latent Semantic Indexing
models.ldaseqmodel – Dynamic Topic Modeling in Python
models.tfidfmodel – TF-IDF model
models.rpmodel – Random Projections
models.hdpmodel – Hierarchical Dirichlet Process
models.logentropy_model – LogEntropy model
models.normmodel – Normalization model
models.lsi_dispatcher – Dispatcher for distributed LSI
models.lsi_worker – Worker for distributed LSI
models.lda_dispatcher – Dispatcher for distributed LDA
models.lda_worker – Worker for distributed LDA
models.word2vec – Deep learning with word2vec
models.doc2vec – Deep learning with paragraph2vec
models.phrases – Phrase (collocation) detection
models.wrappers.ldamallet – Latent Dirichlet Allocation via Mallet
models.wrappers.dtmmodel – Dynamic Topic Models (DTM) and Dynamic Influence Models (DIM)
models.wrappers.ldavowpalwabbit – Latent Dirichlet Allocation via Vowpal Wabbit
similarities.docsim – Document similarity queries
similarities.index – Fast Approximate Nearest Neighbor Similarity with Annoy package
topic_coherence.aggregation – Aggregation module
topic_coherence.direct_confirmation_measure – Direct confirmation measure module
topic_coherence.indirect_confirmation_measure – Indirect confirmation measure module
topic_coherence.probability_estimation – Probability estimation module
topic_coherence.segmentation – Segmentation module
scripts.glove2word2vec – Convert glove format to word2vec
scripts.make_wikicorpus – Convert articles from a Wikipedia dump to vectors
scripts.word2vec_standalone – Train word2vec on text file CORPUS
parsing.porter – Porter Stemming Algorithm
parsing.preprocessing – Functions to preprocess raw text
summarization.bm25 – BM25 ranking function
summarization.commons – Common graph functions
summarization.graph – TextRank graph
summarization.keywords – Keywords for TextRank summarization algorithm
summarization.pagerank_weighted – Weighted PageRank algorithm
summarization.summarizer – TextRank Summariser
summarization.syntactic_unit – Syntactic Unit class
summarization.textcleaner – Summarization pre-processing