Package: jiebaRS 0.1.0

jiebaRS: Chinese Text Segmentation, POS Tagging, and Keyword Extraction for R

Provides fast Chinese text segmentation, keyword extraction via 'TF-IDF' and 'TextRank', and part-of-speech tagging, powered by a 'Rust' backend ('jieba-rs'). Supports custom dictionaries, user words, stop words, IDF files, and HMM models, with parallel batch processing of multiple strings. Serves as a modern, maintained replacement for the 'jiebaR' package.

Authors:Hao Cheng [aut, cre, cph]

jiebaRS_0.1.0.tar.gz
jiebaRS_0.1.0.zip(r-4.7)jiebaRS_0.1.0.zip(r-4.6)jiebaRS_0.1.0.zip(r-4.5)
jiebaRS_0.1.0.tgz(r-4.6-x86_64)jiebaRS_0.1.0.tgz(r-4.6-arm64)jiebaRS_0.1.0.tgz(r-4.5-x86_64)jiebaRS_0.1.0.tgz(r-4.5-arm64)
jiebaRS_0.1.0.tar.gz(r-4.7-arm64)jiebaRS_0.1.0.tar.gz(r-4.7-x86_64)jiebaRS_0.1.0.tar.gz(r-4.6-arm64)jiebaRS_0.1.0.tar.gz(r-4.6-x86_64)
jiebaRS_0.1.0.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
DESCRIPTION |NEWS
card.svg |card.png
jiebaRS/json (API)

# Install 'jiebaRS' in R:

install.packages('jiebaRS', repos = c('https://yousa-mirage.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/yousa-mirage/jiebars/issues

Pkgdown/docs site:https://yousa-mirage.github.io

On CRAN:

jieba rust cargo

2.18 score 1 stars 5 scripts 16 exports 2 dependencies

Last updated from:9662148258 (on v0.1.0). Checks:12 OK, 1 ERROR. Indexed: yes.

Target	Result	Time
linux-devel-arm64	OK	254
linux-devel-x86_64	OK	208
source / vignettes	OK	309
linux-release-arm64	OK	200
linux-release-x86_64	OK	218
macos-release-arm64	OK	250
macos-release-x86_64	OK	334
macos-oldrel-arm64	OK	254
macos-oldrel-x86_64	OK	556
windows-devel	OK	237
windows-release	OK	212
windows-oldrel	ERROR	243
wasm-release	OK	209

Exports:add_word count_ngrams filter_segment freq get_idf get_tuple keywords keywords_df new_user_word segment segment_batch tagging tagging_batch textrank textrank_df worker

Dependencies:cli rlang

Citation

Development and contributors

Readme and manuals

Help Manual

Help page	Topics
Count n-grams from segmented text	count_ngrams
Filter segmentation results	filter_segment
The frequency of words	freq
Generate IDF dict	get_idf
Compatibility wrapper for 'jiebaR::get_tuple()'	get_tuple
Extract keywords from text	keywords
Extract keywords as a data frame	keywords_df
Add user word	add_word new_user_word
Segment text with a jieba worker	segment
Segment a batch of strings	segment_batch
Tag text with a jiebaRS worker	tagging
Tag a batch of strings	tagging_batch
Extract TextRank keywords from text	textrank
Extract TextRank keywords as a data frame	textrank_df
Initialize a jiebaRS worker	worker