API Reference
Modules:
interfaces – Core gensim interfaces
utils – Various utility functions
matutils – Math utils
corpora.bleicorpus – Corpus in Blei’s LDA-C format
corpora.csvcorpus – Corpus in CSV format
corpora.dictionary – Construct word<->id mappings
corpora.hashdictionary – Construct word<->id mappings
corpora.indexedcorpus – Random access to corpus documents
corpora.lowcorpus – Corpus in List-of-Words format
corpora.malletcorpus – Corpus in Mallet format of List-Of-Words
corpora.mmcorpus – Corpus in Matrix Market format
corpora.sharded_corpus – Corpus stored in separate files
corpora.svmlightcorpus – Corpus in SVMlight format
corpora.textcorpus – Building corpora with dictionaries
corpora.ucicorpus – Corpus in UCI bag-of-words format
corpora.wikicorpus – Corpus from a Wikipedia dump
models.ldamodel – Latent Dirichlet Allocation
models.ldamulticore – Parallelized Latent Dirichlet Allocation
models.lsimodel – Latent Semantic Indexing
models.ldaseqmodel – Dynamic Topic Modeling in Python
models.tfidfmodel – TF-IDF model
models.rpmodel – Random Projections
models.hdpmodel – Hierarchical Dirichlet Process
models.logentropy_model – LogEntropy model
models.normmodel – Normalization model
models.lsi_dispatcher – Dispatcher for distributed LSI
models.lsi_worker – Worker for distributed LSI
models.lda_dispatcher – Dispatcher for distributed LDA
models.lda_worker – Worker for distributed LDA
models.word2vec – Deep learning with word2vec
models.doc2vec – Deep learning with paragraph2vec
models.phrases – Phrase (collocation) detection
models.wrappers.ldamallet – Latent Dirichlet Allocation via Mallet
models.wrappers.dtmmodel – Dynamic Topic Models (DTM) and Dynamic Influence Models (DIM)
models.wrappers.ldavowpalwabbit – Latent Dirichlet Allocation via Vowpal Wabbit
similarities.docsim – Document similarity queries
similarities.index – Fast Approximate Nearest Neighbor Similarity with Annoy package
topic_coherence.aggregation – Aggregation module
topic_coherence.direct_confirmation_measure – Direct confirmation measure module
topic_coherence.indirect_confirmation_measure – Indirect confirmation measure module
topic_coherence.probability_estimation – Probability estimation module
topic_coherence.segmentation – Segmentation module
scripts.glove2word2vec – Convert GloVe format to word2vec
scripts.make_wikicorpus – Convert articles from a Wikipedia dump to vectors
scripts.word2vec_standalone – Train word2vec on text file CORPUS
parsing.porter – Porter Stemming Algorithm
parsing.preprocessing – Functions to preprocess raw text
summarization.bm25 – BM25 ranking function
summarization.commons – Common graph functions
summarization.graph – TextRank graph
summarization.keywords – Keywords for TextRank summarization algorithm
summarization.pagerank_weighted – Weighted PageRank algorithm
summarization.summarizer – TextRank Summariser
summarization.syntactic_unit – Syntactic Unit class
summarization.textcleaner – Summarization pre-processing