Project Icon

jcseg

Optimized Chinese Text Segmentation with Keyword and Summary Extraction

Product DescriptionJcseg provides a comprehensive solution for Chinese text segmentation, based on the efficient mmseg algorithm and offering seven distinct modes. It integrates TextRank for extracting keywords, keyphrases, and key sentences and utilizes BM25 for automatic text summarization. The high-performance server module enables RESTful API access and seamless HTTP integration across different languages. Users can customize word libraries, use Simplified/Traditional Chinese word lists, and add synonyms and pinyin. The latest interfaces for Lucene, Solr, Elasticsearch, and OpenSearch are supported, with flexible configuration through jcseg.properties.
Project Details