New paper - NLP 4Ms in NH Texts Predict Avoidable Transfers

2025-12-12

Our paper, “Using Novel Natural Language Processing Approaches to Examine Age-Friendly Communication about Nursing Nome Residents with Dementia | The Gerontologist | Oxford Academic” has been published in The Gerontologist. This work builds directly on the [4M taxonomy](Development and evaluation of a 4M taxonomy from nursing home staff text messages using a fine-tuned generative language model - PubMed) I developed last year from fine-tuning a local LLM on NH staff texts. That one ended 2024 on a high note, teaching me volumes about embeddings, clustering, and reproducible NLP pipelines. We’ve taken those extracted 4Ms (what matters, medications, mentation, mobility) and asked: do they predict avoidable NH-to-hospital transfers, especially for folks with Alzheimer’s Disease and Related Dementias (ADRD)?

Nursing homes are a critical domain for aging America’s challenges. Nearly half of residents have ADRD, and avoidable transfers to hospitals can lead to poorer outcomes. We merged 30k+ text messages from 16 Missouri NHs in the MOQI CMS demo (2016-2020) with 3,687 transfer events. Focused on 1,031 observations timed pre-transfer (0-28+ days out).

Models compared:

Supervised ML: Our 4M-EE pipeline with possibilistic KNN on embeddings
Fine-tuned Gemma 2 LLM: Local, secure, taxonomy builder.

Validated against expert gold standard, picked the best model via AIC.

Findings:

Factor	Key Association with Avoidable Transfer
Late ADRD	↑ (OR 16.03)[1]
Full Code CPR	↓ (OR 0.17)
Rural NH	↓ (OR 0.0008)
Mentation (all msgs)	↓ per mention (OR 0.91)
Mobility (<14 days)	↑ per mention (OR 1.21)

Caveats: Missouri-only, transfers-only sample, ADRD staging limits. This is the kind of Nightingale-Lovelace fusion I love to work on. Grateful for the team, NIA funding (R01AG078281), and Mizzou’s support.