Download the Brochure

Contact Us

Release

Sudachi GitHub Sponsorship Opportunities Begin

2022/10/07

 (Head Office: Chiyoda-ku, Tokyo; CEO: Osamu Hata) announced today that the Works Tokushima NLP Laboratory for Artificial Intelligence, an AI research institute of Works Applications Group (hereinafter "Works"), will start accepting Sudachi development sponsors through the GitHub sponsorship system from October 7. We are pleased to announce that we will start accepting Sudachi development sponsors from October 7, 2012.


Sudachi is one of the open source software (OSS) released free of charge by Works and is a morphological analyzer*1 essential for natural language processing.SudachiPy, the Python version of Sudachi, released as OSS in June 2019 and surpassed 6 million downloads in September 2022. This number is one of the top downloads among OSS for Japanese language processing released by companies or research institutes*2. Due to its high processing accuracy, SudachiPy has been installed in various search engines such as medical, plant, shopping mall, and SNS. Works will achieve sustainable development of various OSS by seeking sponsors.

1.SudachPy downloads exceed 6 million

Works opened the Works Tokushima Artificial Intelligence NLP Laboratory in Tokushima Prefecture in 2017, specializing in natural language processing. Natural language processing is a technology in which artificial intelligence (AI) analyzes natural language, and is used in chatbots and AI-OCR developed by Works. In addition, a number of OSS for Japanese language processing have been released under commercially available licenses, and are widely used by many companies and research institutes for search engines, text mining, and other applications.

 

Sudachi, one of the publicly available OSS, has the largest registered vocabulary in Japan at 2.9 million words (as of May 2022), and is a morphological analyzer that can withstand commercial use with flexibility and accuracy that are lacking in existing morphological analyzers. In addition to handling a wide variety of Japanese notational quirks, it incorporates the latest words in the "dictionary" necessary for morphological analysis, and ensures high-quality analysis performance through regular updates and maintenance, making it a focus of attention among OSS morphological analyzer engines.

 

<Sudachi External Evaluation Case Study>
■2018 Adopted by the world's largest conference on language resources
"The 11th International Conference on Language Resources and Evaluation (LREC/ELREC)".

■2021 Awarded the Outstanding Research Award (Best Paper Award) by the Institute of Electronics, Information and Communication Engineers (IEICE)
Research Group on Language Understanding and Communication

<The main OSS for Japanese language processing released by Works>
■ Japanese morphological analyzer :Sudachi, SudachiPy
https://github.com/WorksApplications/Sudachi
https://github.com/WorksApplications/SudachiPy

■ Large-scale Japanese eigenexpression and morphological dictionary :SudachiDict
https://github.com/WorksApplications/SudachiDict

■ Large-scale Japanese word vector: chiVe
https://github.com/WorksApplications/chiVe

■ Large-scale Japanese pre-training model: chiTra
https://github.com/WorksApplications/sudachiTra

2.Sudachi GitHub Sponsor Recruitment Begins

In order to strengthen Sudachi's research and development capabilities and realize sustainable development as OSS, we are now accepting applications for Sudachi GitHub Sponsors. The "GitHub Sponsor" is a GitHub released in 30 countries around the world in 2019. *3 and aims to financially support developers and teams of open source projects.

 

Applications for Sudachi GitHub Sponsorship are now being accepted from both individuals and organizations. The amount of the sponsorship fee and the frequency of support can be set as low as $1 per sponsorship. Sponsors will be rewarded with a sponsor badge on the Github page, a logo on the website, access to the development roadmap, participation in the sponsor chat space, priority bug handling, workshops, and more.

 

For more information on sponsorship opportunities, please visit the following website.
https://github.com/sponsors/WorksApplications

*1: Morphological analysis
It is a part of natural language processing (NLP) and is usually Morphological analysis is a part of natural language processing (NLP), and is a technique for segmenting "natural language," which is the language we commonly use in our daily lives, into morphemes (the smallest unit of a word that has a meaning).

*2:Number of downloads of packages registered in the Python Package Index
Source: awesome-japanese-nlp-resources

*3:GitHub is used by 40 million developers and is a central development platform for software development from open source projects to business use

About Works Applications Group

Since its establishment in 1996, Works Applications Group has been providing Products / Services mainly to major Japanese companies as Japan's first packaged software company for business applications. Based on the corporate philosophy of changing the concept of "work," making work more creative, increasing corporate productivity, and expanding corporate value, Works Applications Group aims for further development as a solution provider centered on ERP, to be a partner in promoting DX for small, medium, and start-up companies in addition to major corporations. We will continue to develop further as a solution provider with ERP at its core.

*Company names, product names and service names are trademarks or registered trademarks of their respective companies.
*The information in this release is current as of the date of publication, and is subject to change or withdrawal without notice. Please be aware that the forecasts and other forward-looking information in this release are based on uncertainties and may differ from actual results.

For inquiries regarding this article, please contact

Public Relations, Works Applications Corporation
TEL : 03-3512-1400 03-3512-1400
FAX : 03-3512-1401
E-mail: [email protected]