diff options
author | Nikos Giotis <nikos.giotis@gmail.com> | 2017-03-05 11:02:08 +0700 |
---|---|---|
committer | Willy Sudiarto Raharjo <willysr@slackbuilds.org> | 2017-03-05 11:02:08 +0700 |
commit | 66f974475ff3c17432c32653626a225cade03294 (patch) | |
tree | c5a9b2e4dc70d7995d20dc3e6c921fee720c84cc /python/PyStemmer/README | |
parent | 847c66a1c5036590b10db27f936220a5f6cdcdf8 (diff) | |
download | slackbuilds-66f974475ff3c17432c32653626a225cade03294.tar.gz |
python/PyStemmer: Added (Snowball stemming algorithms).
Signed-off-by: Willy Sudiarto Raharjo <willysr@slackbuilds.org>
Diffstat (limited to 'python/PyStemmer/README')
-rw-r--r-- | python/PyStemmer/README | 18 |
1 files changed, 18 insertions, 0 deletions
diff --git a/python/PyStemmer/README b/python/PyStemmer/README new file mode 100644 index 0000000000..161b13c630 --- /dev/null +++ b/python/PyStemmer/README @@ -0,0 +1,18 @@ +Snowball stemming algorithms, for information retrieval + +Stemming algorithms + +PyStemmer provides access to efficient algorithms for calculating a "stemmed" +form of a word. This is a form with most of the common morphological endings +removed; hopefully representing a common linguistic base form. This is most +useful in building search engines and information retrieval software; +for example, a search with stemming enabled should be able to find a document +containing "cycling" given the query "cycles". + +PyStemmer provides algorithms for several (mainly european) languages, by +wrapping the libstemmer library from the Snowball project in a Python module. + +It also provides access to the classic Porter stemming algorithm for english: +although this has been superceded by an improved algorithm, the original +algorithm may be of interest to information retrieval researchers wishing +to reproduce results of earlier experiments. |