Package: TextAnalysisR 0.1.4

TextAnalysisR: A Text Mining Workflow Tool
Provides a text mining and natural language processing workflow for documents. Includes preprocessing via 'quanteda', lexical analysis (term frequency-inverse document frequency, log-odds ratios, lexical diversity) via 'tidytext', topic modeling via 'stm' and the 'BERTopic' approach, semantic similarity and document clustering on transformer representations, an interactive 'Shiny' interface with 'ggplot2' visualization, optional 'spaCy' preprocessing, and local 'sentence-transformers' or web-based ('OpenAI', 'Gemini') model providers for retrieval-augmented generation, as described in Shin et al. (2026) <doi:10.1177/07319487251412879>.
Authors:
TextAnalysisR_0.1.4.tar.gz
TextAnalysisR_0.1.4.zip(r-4.7)TextAnalysisR_0.1.4.zip(r-4.6)TextAnalysisR_0.1.4.zip(r-4.5)
TextAnalysisR_0.1.4.tgz(r-4.6-any)TextAnalysisR_0.1.4.tgz(r-4.5-any)
TextAnalysisR_0.1.4.tar.gz(r-4.7-any)TextAnalysisR_0.1.4.tar.gz(r-4.6-any)
TextAnalysisR_0.1.4.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
DESCRIPTION |NEWS
card.svg |card.png
TextAnalysisR/json (API)
| # Install 'TextAnalysisR' in R: |
| install.packages('TextAnalysisR', repos = c('https://mshin77.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/mshin77/textanalysisr/issues
Pkgdown/docs site:https://mshin77.github.io
- acronym - Acronym List
- SpecialEduTech - Special education technology bibliographic data
- stm_15 - An example structure of a structural topic model
- stopwords_list - Stopwords List
bert-modellexiconollamapythonsemantictext-miningtopic-modelingword-networks
Last updated from:8d75b3d589. Checks:9 OK. Indexed: yes.
| Target | Result | Time | Files | Syslog |
|---|---|---|---|---|
| linux-devel-x86_64 | OK | 240 | ||
| source / vignettes | OK | 312 | ||
| linux-release-x86_64 | OK | 240 | ||
| macos-release-arm64 | OK | 114 | ||
| macos-oldrel-arm64 | OK | 164 | ||
| windows-devel | OK | 160 | ||
| windows-release | OK | 175 | ||
| windows-oldrel | OK | 165 | ||
| wasm-release | OK | 164 |
Exports:%>%analyze_document_clusteringanalyze_semantic_evolutionanalyze_sentimentanalyze_sentiment_llmanalyze_similarity_gapsassess_embedding_stabilityauto_tune_embedding_topicscalculate_assignment_consistencycalculate_clustering_metricscalculate_cosine_similaritycalculate_cross_similaritycalculate_dispersion_metricscalculate_document_similaritycalculate_keyword_stabilitycalculate_lexical_dispersioncalculate_log_odds_ratiocalculate_semantic_driftcalculate_similarity_robustcalculate_text_readabilitycalculate_topic_probabilitycalculate_topic_stabilitycalculate_weighted_log_oddscalculate_word_frequencycall_llm_apicheck_python_envcheck_vision_modelsclear_lexdiv_cachecluster_embeddingsdescribe_imagedetect_multi_wordsdetect_pdf_content_typedetect_pdf_content_type_pyexport_document_clusteringextract_cross_category_similaritiesextract_keywords_keynessextract_keywords_tfidfextract_morphologyextract_named_entitiesextract_noun_chunksextract_pos_tagsextract_subjects_objectsextract_tables_from_pdf_pyextract_topic_terms_dffind_optimal_kfind_similar_wordsfind_topic_matchesfit_embedding_modelfit_embedding_topicsfit_semantic_modelfit_temporal_modelfit_topic_prevalence_modelgenerate_cluster_labelsgenerate_cluster_labels_autogenerate_embeddingsgenerate_topic_contentgenerate_topic_labelsget_available_dfmget_available_tokensget_best_embeddingsget_content_type_promptget_content_type_user_templateget_sentencesget_sentiment_colorget_sentiment_colorsget_spacy_model_infoget_topic_prevalenceget_topic_termsget_topic_textsget_word_similarityidentify_topic_trendsimport_filesinit_spacy_nlplemmatize_tokenslexical_diversity_analysislexical_frequency_analysisplot_cluster_termsplot_cross_category_heatmapplot_document_sentiment_trajectoryplot_emotion_radarplot_entity_frequenciesplot_keyness_keywordsplot_keyword_comparisonplot_lexical_dispersionplot_lexical_diversity_distributionplot_log_odds_ratioplot_model_comparisonplot_morphology_featureplot_mwe_frequencyplot_ngram_frequencyplot_pos_frequenciesplot_quality_metricsplot_readability_by_groupplot_readability_distributionplot_semantic_vizplot_sentiment_boxplotplot_sentiment_by_categoryplot_sentiment_distributionplot_sentiment_violinplot_similarity_heatmapplot_term_trends_continuousplot_tfidf_keywordsplot_top_readability_documentsplot_topic_effects_categoricalplot_topic_effects_continuousplot_topic_probabilityplot_weighted_log_oddsplot_word_frequencyplot_word_probabilityprep_textsprocess_pdf_unifiedreduce_dimensionsrender_displacy_deprender_displacy_entrun_apprun_neural_topics_internalrun_rag_searchrun_text_workflowsemantic_document_clusteringsemantic_similarity_analysissentiment_embedding_analysissentiment_lexicon_analysissetup_python_envshow_web_bannerspacy_extract_entitiesspacy_has_vectorsspacy_initializedspacy_lemmatizespacy_parse_fullsummarize_morphologyunite_colsvalidate_cross_modelsvalidate_semantic_coherenceword_co_occurrence_networkword_correlation_network
Dependencies:backportsbase64encbroombslibcachemclicommonmarkcpp11crosstalkdigestdplyrDTevaluatefarverfastmapfastmatchfontawesomefsgenericsggplot2gluegtablehighrhtmltoolshtmlwidgetshttpuvigraphisobandISOcodesjaneaustenrjquerylibjsonliteknitrlabelinglaterlatticelazyevallifecyclemagrittrMatrixmemoisemimensyllableotelpillarpkgconfigplyrpromisesproxyCpurrrquantedaquanteda.textstatsR6rappdirsRColorBrewerRcppRcppArmadilloreshape2rlangrmarkdownS7sassscalesshinySnowballCsourcetoolsstopwordsstringistringrtibbletidyrtidyselecttidytexttinytextokenizersutf8vctrsviridisLitewidyrwithrxfunxml2xtableyaml
Last update: 2026-06-03
Started: 2025-11-30
Last update: 2026-06-02
Started: 2025-11-30
Last update: 2026-06-02
Started: 2025-12-29
Last update: 2026-06-02
Started: 2025-11-30
Last update: 2026-06-02
Started: 2025-11-30
Last update: 2026-06-02
Started: 2025-11-30
Last update: 2026-06-02
Started: 2025-11-30
Last update: 2026-06-02
Started: 2025-11-30
Last update: 2026-06-02
Started: 2025-11-30
Last update: 2026-05-28
Started: 2025-11-30
Last update: 2026-05-27
Started: 2025-11-30
