Friso
Friso is a high-performance Chinese tokenizer that supports both GBK and UTF-8 character sets. It is based on the MMSEG algorithm and can be easily embedded in other programs like MySQL, PHP, and PostgreSQL.
About
Friso is an open-source Chinese tokenizer written in ANSI C and based on the MMSEG algorithm. It supports both GBK and UTF-8 text. Its modular design allows it to be embedded in other applications, including MySQL, PostgreSQL, and PHP. Four segmentation modes are available (simple, complex, detect, and maximum), which trade speed against accuracy. Friso also provides keyword, key phrase, and key sentence extraction based on the TextRank algorithm, custom dictionaries, simplified/traditional Chinese conversion, and recognition of mixed English/Chinese words. Plugins are available for PHP5, PHP7, OCaml, and Lua.
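To give a feel for what the simple mode of an MMSEG-style tokenizer does, here is a minimal Python sketch of forward maximum matching: at each position it takes the longest dictionary word that starts there and falls back to a single character. This is not Friso's code or API; the function name simple_segment, the max_word_len parameter, and the toy dictionary are all hypothetical and chosen only for illustration.

```python
def simple_segment(text, dictionary, max_word_len=4):
    """Forward maximum matching (illustrative, not Friso's implementation).

    At each position, emit the longest dictionary word starting there;
    if nothing matches, emit a single character.
    """
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking down to one character.
        longest = min(max_word_len, len(text) - i)
        for size in range(longest, 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in dictionary:
                tokens.append(candidate)
                i += size
                break
    return tokens


# Toy dictionary (hypothetical); a real tokenizer loads a large lexicon.
TOY_DICT = {"研究", "研究生", "生命", "起源"}

print(simple_segment("研究生命起源", TOY_DICT))
# prints ['研究生', '命', '起源']
```

The greedy match picks 研究生 ("graduate student") and strands 命, although 研究 / 生命 / 起源 is the more natural reading. Ambiguities of this kind are what the complex mode in MMSEG is meant to reduce, by comparing candidate chunks of up to three words before committing to one.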
Pricing & Plans
Open Source · Free
Free