Package: RTextTools 1.4.3

RTextTools: Automatic Text Classification via Supervised Learning

A machine learning package for automatic text classification that makes it simple for novice users to get started with machine learning, while allowing experienced users to easily experiment with different settings and algorithm combinations. The package includes eight algorithms for ensemble classification (svm, slda, boosting, bagging, random forests, glmnet, decision trees, neural networks), comprehensive analytics, and thorough documentation.

Authors:Timothy P. Jurka, Loren Collingwood, Amber E. Boydstun, Emiliano Grossman, Wouter van Atteveldt

RTextTools_1.4.3.tar.gz
RTextTools_1.4.3.zip(r-4.5)RTextTools_1.4.3.zip(r-4.4)RTextTools_1.4.3.zip(r-4.3)
RTextTools_1.4.3.tgz(r-4.5-x86_64)RTextTools_1.4.3.tgz(r-4.5-arm64)RTextTools_1.4.3.tgz(r-4.4-x86_64)RTextTools_1.4.3.tgz(r-4.4-arm64)RTextTools_1.4.3.tgz(r-4.3-x86_64)RTextTools_1.4.3.tgz(r-4.3-arm64)
RTextTools_1.4.3.tar.gz(r-4.5-noble)RTextTools_1.4.3.tar.gz(r-4.4-noble)
RTextTools_1.4.3.tgz(r-4.4-emscripten)RTextTools_1.4.3.tgz(r-4.3-emscripten)
RTextTools.pdf |RTextTools.html✨
RTextTools/json (API)

# Install 'RTextTools' in R:

install.packages('RTextTools', repos = c('https://lorenc5.r-universe.dev', 'https://cloud.r-project.org'))

Datasets:

NYTimes - A sample dataset containing labeled headlines from The New York Times.
USCongress - A sample dataset containing labeled bills from the United State Congress.

On CRAN:

This package does not link to any Github/Gitlab/R-forge repository. No issue tracker or development information is available.

3.84 score 1 stars 772 scripts 608 downloads 9 mentions 18 exports 44 dependencies

Last updated 5 years agofrom:95f6a1b20b. Checks:12 OK. Indexed: yes.

Target	Result	Latest binary
Doc / Vignettes	OK	Mar 12 2025
R-4.5-win-x86_64	OK	Mar 12 2025
R-4.5-mac-x86_64	OK	Mar 12 2025
R-4.5-mac-aarch64	OK	Mar 12 2025
R-4.5-linux-x86_64	OK	Mar 12 2025
R-4.4-win-x86_64	OK	Mar 12 2025
R-4.4-mac-x86_64	OK	Mar 12 2025
R-4.4-mac-aarch64	OK	Mar 12 2025
R-4.4-linux-x86_64	OK	Mar 12 2025
R-4.3-win-x86_64	OK	Mar 12 2025
R-4.3-mac-x86_64	OK	Mar 12 2025
R-4.3-mac-aarch64	OK	Mar 12 2025

Exports:classify_model classify_models create_analytics create_container create_ensembleSummary create_matrix create_precisionRecallSummary create_scoreSummary cross_validate getStemLanguages print_algorithms read_data recall_accuracy summary.analytics summary.analytics_virgin train_model train_models wordStem

Dependencies:BH bitops caTools class cli codetools data.table diagram digest e1071 foreach future future.apply glmnet globals ipred iterators KernSmooth lattice lava listenv MASS Matrix NLP nnet numDeriv parallelly prodlim progressr proxy randomForest Rcpp RcppEigen rlang rpart shape slam SparseM SQUAREM survival tau tm tree xml2

Help page	Topics
an S4 class containing the analytics for a classified set of documents.	analytics_virgin-class
an S4 class containing the analytics for a classified set of documents.	analytics-class
converts a tm DocumentTermMatrix or TermDocumentMatrix into a matrix.csr representation.	as.compressed.matrix
makes predictions from a train_model() object.	classify_model
makes predictions from a train_models() object.	classify_models
creates an object of class analytics given classification results.	create_analytics
creates a container for training, classifying, and analyzing documents.	create_container
creates a summary with ensemble coverage and precision.	create_ensembleSummary
creates a document-term matrix to be passed into create_container().	create_matrix
creates a summary with precision, recall, and F1 scores.	create_precisionRecallSummary
creates a summary with the best label for each document.	create_scoreSummary
used for cross-validation of various algorithms.	cross_validate
Query the languages supported in this package	getStemLanguages
an S4 class containing the training and classification matrices.	matrix_container-class
a sample dataset containing labeled headlines from The New York Times.	NYTimes
prints available algorithms for train_model() and train_models().	print_algorithms
reads data from files into an R data frame.	read_data
calculates the recall accuracy of the classified data.	recall_accuracy
summarizes the 'analytics-class' class	summary.analytics
summarizes the 'analytics_virgin-class' class	summary.analytics_virgin
makes a model object using the specified algorithm.	train_model
makes a model object using the specified algorithms.	train_models
a sample dataset containing labeled bills from the United State Congress.	USCongress
Get the common root/stem of words	wordStem

Package: RTextTools 1.4.3

RTextTools: Automatic Text Classification via Supervised Learning

Citation

Readme and manuals

Help Manual

Usage by other packages (reverse dependencies)