Package: faSTM 0.0.0.9000

faSTM: Fast Structural Topic Models

A modern implementation of the Structural Topic Model. faSTM fits the logistic-normal STM (with prevalence and content covariates) via a multithreaded Rust core, with an opt-in stochastic-variational path for large corpora. It is self-contained: text preparation is read from 'quanteda' or 'tidytext' objects, model inspection (labelTopics with FREX/lift/score, findThoughts, semantic coherence, exclusivity, topic correlations) and an estimateEffect() (method-of-composition posterior propagation) are built in. The fitted object is structurally compatible with 'stm' so existing analyses migrate with minimal changes.

Authors:Neal Caren [aut, cre], Margaret Roberts [cph], Brandon Stewart [cph], Dustin Tingley [cph]

faSTM_0.0.0.9000.tar.gz
faSTM_0.0.0.9000.zip(r-4.7)faSTM_0.0.0.9000.zip(r-4.6)faSTM_0.0.0.9000.zip(r-4.5)
faSTM_0.0.0.9000.tgz(r-4.6-x86_64)faSTM_0.0.0.9000.tgz(r-4.6-arm64)faSTM_0.0.0.9000.tgz(r-4.5-x86_64)faSTM_0.0.0.9000.tgz(r-4.5-arm64)
faSTM_0.0.0.9000.tar.gz(r-4.7-arm64)faSTM_0.0.0.9000.tar.gz(r-4.7-x86_64)faSTM_0.0.0.9000.tar.gz(r-4.6-arm64)faSTM_0.0.0.9000.tar.gz(r-4.6-x86_64)
manual.pdf |manual.html✨
DESCRIPTION |NEWS
card.svg |card.png
faSTM/json (API)

# Install 'faSTM' in R:

install.packages('faSTM', repos = c('https://nealcaren.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/nealcaren/fastm/issues

Pkgdown/docs site:https://nealcaren.github.io

Datasets:

congress - U.S. Congressional Speeches
poliblog - CMU 2008 Political Blog Corpus

On CRAN:

rust cargo

3.78 score 20 scripts 71 exports 4 dependencies

Last updated from:b4ac5c1f0d. Checks:11 WARNING, 1 OK, 1 FAIL. Indexed: yes.

Target	Result	Time
linux-devel-arm64	WARNING	273
linux-devel-x86_64	WARNING	294
source / vignettes	OK	531
linux-release-arm64	WARNING	275
linux-release-x86_64	WARNING	289
macos-release-arm64	WARNING	262
macos-release-x86_64	WARNING	396
macos-oldrel-arm64	WARNING	268
macos-oldrel-x86_64	WARNING	344
windows-devel	WARNING	612
windows-release	WARNING	626
windows-oldrel	WARNING	295
wasm-release	FAIL	167

Exports:align_corpus alignCorpus ame as_corpus asSTMCorpus augment calcfrex calclift calcscore check_residuals checkBeta checkResiduals cloud coherence content_topics convertCorpus effect_estimates estimateEffect eval_heldout eval.heldout exclusivity find_thoughts find_topic findThoughts findTopic fit_new_documents fitNewDocuments frex_scores from_tidy glance label_topics labelTopics make_dt make_heldout make.dt make.heldout makeDesignMatrix many_topics manyTopics multi_stm optimizeDocument permutation_test plot_topic_network plotModels plotQuote posterior_theta_samples read_ldac readLdac s sage_labels sageLabels search_k searchK select_best select_model selectModel semantic_coherence semanticCoherence stm thetaPosterior tidy toLDAvis topic_corr_graph topic_correlation topic_lasso topic_proportions topic_terms topicCorr topicQuality write_ldac writeLdac

Dependencies:generics lattice MASS Matrix

Validation: parity with stm, and fit quality

Same model, same numbers | Topic labels (probability, FREX, lift, score) | Semantic coherence and exclusivity | Different fit, comparable quality | What this means for your analysis

Rendered fromvalidation.Rmdusingknitr::knitr

Last update: 2026-06-19
Started: 2026-06-19

Beyond stm: faSTM's extensions

Rendered frombeyond-stm.Rmdusingknitr::knitr

Last update: 2026-06-19
Started: 2026-06-19

faSTM: the stm vignette, run on faSTM

Rendered fromfaSTM.Rmdusingknitr::knitr

Last update: 2026-06-19
Started: 2026-06-18

Help page	Topics
Align a new corpus to a fitted model's vocabulary	align_corpus
Align a new corpus to a reference vocabulary (stm-compatible)	alignCorpus
Average marginal effects from an estimateEffect fit	ame
Build a faSTM corpus from prepared text	as_corpus
Convert search_k diagnostics to long form for plotting	as.data.frame.faSTM_searchk
Coerce inputs into an stm-style corpus (stm-compatible)	asSTMCorpus
Augment: most-likely topic for each document-term token	augment.faSTM
stm-compatible label scorers (FREX / lift / score)	calcfrex calclift calcscore
Residual dispersion check (is K large enough?)	check_residuals
Flag words that load almost entirely on one topic	checkBeta
Topic coherence (Mimno / NPMI / c_v)	coherence
U.S. Congressional Speeches (Party x Chamber, 1987-2011)	congress
Marginal content words by one content covariate	content_topics
Convert documents/vocab between corpus formats (stm-compatible)	convertCorpus
Extract estimateEffect estimates as a tidy data.frame (no plotting)	effect_estimates
Estimate covariate effects on topic prevalence (method of composition)	estimateEffect
Evaluate held-out log-likelihood of a fit on a held-out set	eval_heldout
Topic exclusivity (FREX-summary, frexw default 0.7)	exclusivity
Representative documents for each topic	find_thoughts
Find topics whose top words include given words	find_topic
Infer topic proportions for new documents	fit_new_documents
Fit a structural topic model and return its raw arrays.	fit_stm
Infer topics for new documents (stm-compatible signature)	fitNewDocuments
FREX scores for every word and topic	frex_scores
Build a faSTM corpus from a tidy (long) term-count table	from_tidy
One-row model summary for a faSTM fit	glance.faSTM
Out-of-sample topic inference: for each new document, run the variational E-step against fixed globals (β, μ, Σ⁻¹) and return θ. Documents are passed sparse — 'words' are 0-based ids into the _fitted model's_ vocabulary (out-of-vocabulary terms dropped by the R caller) with their 'counts', concatenated, plus per-document term counts 'doc_nterms'.	infer_theta_new
Label topics by top words (prob, FREX, lift, score)	label_topics
LDA topic-word matrix via topica's CVB0 (deterministic collapsed variational Bayes), to seed a "replicate stm's LDA init" STM fit. Mirrors stm's collapsed-Gibbs LDA initialization; the result is fed back as 'init_beta'. Returns K*V row-major topic-word probabilities.	lda_init_beta
Document-topic proportions as a data frame	make_dt
Create a held-out version of a corpus for document-completion validation	make_heldout
Build a (sparse) design matrix for new data (stm-compatible)	makeDesignMatrix
Select models across a range of K	many_topics
Cross-run topic stability	multi_stm
Per-document variational E-step (stm-compatible)	optimizeDocument
Permutation test for a binary covariate's effect on topics	permutation_test
Topic correlation network	plot_topic_network
Plot a fitted model	plot.faSTM
Plot estimated covariate effects on topic prevalence	plot.faSTM_effect
Plot search_k diagnostics	plot.faSTM_searchk
CMU 2008 Political Blog Corpus (poliblog5k)	poliblog
Draw from the per-document topic-proportion posterior	posterior_theta_samples
Predict topic proportions for new documents	predict.faSTM
Read/write a corpus in LDA-C (Blei) sparse format	read_ldac write_ldac
Spline term for prevalence formulas	s
Labels for a content (SAGE) model	sage_labels
Search over the number of topics K	search_k
Pick one model from a 'select_model' run	select_best
Fit several models and keep the ones on the quality frontier	select_model
Semantic coherence (Mimno et al. 2011)	semantic_coherence
Fit a structural topic model (fast Rust backend, stm-compatible object)	stm
Tidy a faSTM fit (topic-term or document-topic distributions)	tidy.faSTM
Tidy an estimateEffect fit (one row per term per topic)	tidy.faSTM_effect
Topic-correlation network as an igraph graph	topic_corr_graph
Topic correlation graph (positive correlations of topic proportions)	topic_correlation
Predict a document-level outcome from topic proportions (lasso)	topic_lasso
Expected topic proportions (the numbers behind the summary plot)	topic_proportions
Top terms per topic, with their numeric scores (tidy)	topic_terms

Package: faSTM 0.0.0.9000

faSTM: Fast Structural Topic Models

Citation

Development and contributors

Readme and manuals

Help Manual

Usage by other packages (reverse dependencies)