Package: jiebaRS 0.1.0
jiebaRS: Chinese Text Segmentation, POS Tagging, and Keyword Extraction for R
Provides fast Chinese text segmentation, keyword extraction via 'TF-IDF' and 'TextRank', and part-of-speech tagging, powered by a 'Rust' backend ('jieba-rs'). Supports custom dictionaries, user words, stop words, IDF files, and HMM models, with parallel batch processing of multiple strings. Serves as a modern, maintained replacement for the 'jiebaR' package.
Authors:
jiebaRS_0.1.0.tar.gz
jiebaRS_0.1.0.zip(r-4.7)jiebaRS_0.1.0.zip(r-4.6)jiebaRS_0.1.0.zip(r-4.5)
jiebaRS_0.1.0.tgz(r-4.6-x86_64)jiebaRS_0.1.0.tgz(r-4.6-arm64)jiebaRS_0.1.0.tgz(r-4.5-x86_64)jiebaRS_0.1.0.tgz(r-4.5-arm64)
jiebaRS_0.1.0.tar.gz(r-4.7-arm64)jiebaRS_0.1.0.tar.gz(r-4.7-x86_64)jiebaRS_0.1.0.tar.gz(r-4.6-arm64)jiebaRS_0.1.0.tar.gz(r-4.6-x86_64)
jiebaRS_0.1.0.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
DESCRIPTION |NEWS
card.svg |card.png
jiebaRS/json (API)
| # Install 'jiebaRS' in R: |
| install.packages('jiebaRS', repos = c('https://yousa-mirage.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/yousa-mirage/jiebars/issues
Pkgdown/docs site:https://yousa-mirage.github.io
Last updated from:9662148258 (on v0.1.0). Checks:12 OK, 1 ERROR. Indexed: yes.
| Target | Result | Time | Files | Syslog |
|---|---|---|---|---|
| linux-devel-arm64 | OK | 254 | ||
| linux-devel-x86_64 | OK | 208 | ||
| source / vignettes | OK | 309 | ||
| linux-release-arm64 | OK | 200 | ||
| linux-release-x86_64 | OK | 218 | ||
| macos-release-arm64 | OK | 250 | ||
| macos-release-x86_64 | OK | 334 | ||
| macos-oldrel-arm64 | OK | 254 | ||
| macos-oldrel-x86_64 | OK | 556 | ||
| windows-devel | OK | 237 | ||
| windows-release | OK | 212 | ||
| windows-oldrel | ERROR | 243 | ||
| wasm-release | OK | 209 |
Exports:add_wordcount_ngramsfilter_segmentfreqget_idfget_tuplekeywordskeywords_dfnew_user_wordsegmentsegment_batchtaggingtagging_batchtextranktextrank_dfworker
Readme and manuals
Help Manual
| Help page | Topics |
|---|---|
| Count n-grams from segmented text | count_ngrams |
| Filter segmentation results | filter_segment |
| The frequency of words | freq |
| Generate IDF dict | get_idf |
| Compatibility wrapper for 'jiebaR::get_tuple()' | get_tuple |
| Extract keywords from text | keywords |
| Extract keywords as a data frame | keywords_df |
| Add user word | add_word new_user_word |
| Segment text with a jieba worker | segment |
| Segment a batch of strings | segment_batch |
| Tag text with a jiebaRS worker | tagging |
| Tag a batch of strings | tagging_batch |
| Extract TextRank keywords from text | textrank |
| Extract TextRank keywords as a data frame | textrank_df |
| Initialize a jiebaRS worker | worker |
