Package: lsa 0.73.3

lsa: Latent Semantic Analysis

The basic idea of latent semantic analysis (LSA) is, that text do have a higher order (=latent semantic) structure which, however, is obscured by word usage (e.g. through the use of synonyms or polysemy). By using conceptual indices that are derived statistically via a truncated singular value decomposition (a two-mode factor analysis) over a given document-term matrix, this variability problem can be overcome.

Authors:Fridolin Wild

lsa_0.73.3.tar.gz
lsa_0.73.3.zip(r-4.5)lsa_0.73.3.zip(r-4.4)lsa_0.73.3.zip(r-4.3)
lsa_0.73.3.tgz(r-4.4-any)lsa_0.73.3.tgz(r-4.3-any)
lsa_0.73.3.tar.gz(r-4.5-noble)lsa_0.73.3.tar.gz(r-4.4-noble)
lsa_0.73.3.tgz(r-4.4-emscripten)lsa_0.73.3.tgz(r-4.3-emscripten)
lsa.pdf |lsa.html
lsa/json (API)

# Install 'lsa' in R:
install.packages('lsa', repos = c('https://fwild.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Datasets:
  • alnumx - Regular expression for removal of non-alphanumeric characters
  • corpus_essays - Corpora
  • corpus_scores - Corpora
  • corpus_training - Corpora
  • specialchars - List of special character html entities and their character replacement
  • stopwords_ar - Stopwordlists in German, English, Dutch, French, Polish, and Arab
  • stopwords_de - Stopwordlists in German, English, Dutch, French, Polish, and Arab
  • stopwords_en - Stopwordlists in German, English, Dutch, French, Polish, and Arab
  • stopwords_fr - Stopwordlists in German, English, Dutch, French, Polish, and Arab
  • stopwords_nl - Stopwordlists in German, English, Dutch, French, Polish, and Arab
  • stopwords_pl - Stopwordlists in German, English, Dutch, French, Polish, and Arab

On CRAN:

This package does not link to any Github/Gitlab/R-forge repository. No issue tracker or development information is available.

6.16 score 23 packages 1.1k scripts 3.7k downloads 5 mentions 26 exports 1 dependencies

Last updated 3 years agofrom:31ba5c16be. Checks:OK: 1 NOTE: 6. Indexed: yes.

TargetResultDate
Doc / VignettesOKNov 15 2024
R-4.5-winNOTENov 15 2024
R-4.5-linuxNOTENov 15 2024
R-4.4-winNOTENov 15 2024
R-4.4-macNOTENov 15 2024
R-4.3-winNOTENov 15 2024
R-4.3-macNOTENov 15 2024

Exports:as.textmatrixassociatecosinedelTripledimcalc_kaiserdimcalc_ndocsdimcalc_rawdimcalc_shareentropyfold_ingetTriplegw_entropygw_gfidfgw_idfgw_normalisationlsalw_bintflw_logtflw_tfprint.textmatrixquerysample.textmatrixsetTriplesummary.textmatrixtextmatrixtextvector

Dependencies:SnowballC