
[Multilingual Processing] ictclas_Source_Code

Introduction to ICTCLAS, the Chinese lexical analysis system from the Institute of Computing Technology. A word is the smallest meaningful unit of language that can be used independently. Chinese, however, is written character by character, with no explicit delimiters between words, so word segmentation is the foundation and key step of Chinese information processing. Building on years of research, the Institute of Computing Technology of the Chinese Academy of Sciences spent a year developing ICTCLAS (Institute of Computing Technology, Chinese Lexical Analysis System). The system provides Chinese word segmentation, part-of-speech tagging, and unknown-word recognition. Segmentation accuracy exceeds 97%, recall for every class of unknown word is above 90%, recall for Chinese personal names approaches 98%, and processing speed is 31.5Kbytes/s. ICTCLAS can also output multiple high-probability results on request, offers several output formats, and supports both the Peking University part-of-speech tag set and the tag set defined by the 973 expert group. The system has been well received by experts, and several papers on it have been published in China and abroad. ICTCLAS also ships with a complete dynamic-link library, ICTCLAS.dll, and the corresponding probability dictionary, so developers can ignore the details of Chinese lexical analysis and call ICTCLAS directly from their own systems. The output format can be customized, and developers can build higher-level applications on top of the segmentation and part-of-speech tagging results. (2006-03-03, C/C++, 110KB, 745 downloads)

http://www.pudn.com/Download/item/id/150241.html

[Multilingual Processing] The greatest code of the last century

Note: this program (omni.com) won first prize in the Mekka '97 4K Intro competition in 1997. The whole program is 4,095 bytes long, including a 133-byte self-extractor (RAR-like compression); the uncompressed program is 4,782 bytes long. The 3D scene contains 144 cubes, 367 faces, 362 vertices, and 15 different 64*64 textures. (2004-12-15, DOS, 7KB, 57 downloads)

http://www.pudn.com/Download/item/id/1103083334194067.html

[Multilingual Processing] ganzhiji

A perceptron-based Chinese word segmentation program. It implements basic text segmentation with an accuracy of 97% or higher. (2010-07-09, Python, 5036KB, 43 downloads)

http://www.pudn.com/Download/item/id/1236933.html

[Multilingual Processing] fanjianzhuanhua

Code for converting Chinese characters between Simplified and Traditional forms. (2005-01-15, WINDOWS, 256KB, 21 downloads)

http://www.pudn.com/Download/item/id/1105758369214988.html

[Multilingual Processing] ICTCLAS_2009_API_DOC

ICTCLAS, the Chinese lexical analysis system from the Institute of Computing Technology. Segmentation accuracy reaches 97.58% (as evaluated by the 973 expert group), recall for every class of unknown word is above 90%, recall for Chinese personal names approaches 98%, and processing speed is 31.5Kbytes/s. ICTCLAS can also output multiple high-probability results on request, offers several output formats, and supports both the Peking University part-of-speech tag set and the tag set defined by the 973 expert group. This is the latest version of the API documentation, with detailed examples. (2009-07-13, Visual C++, 54KB, 19 downloads)

http://www.pudn.com/Download/item/id/841937.html

[Multilingual Processing] c_code.aspx

Code for converting Chinese text between Simplified and Traditional forms. Quick, convenient, and flexible to use! (2006-03-14, C/C++, 256KB, 10 downloads)

http://www.pudn.com/Download/item/id/154016.html

[Multilingual Processing] xmind

Chief Security Officer (https://cncso.com) is dedicated to building a leading think tank of Internet security experts, providing expert security advisory services for the development of the digital economy and for industrial upgrading. (2023-10-25, Others, 0KB, 0 downloads)

http://www.pudn.com/Download/item/id/1698296239720301.html

[Multilingual Processing] defoldfonts

In 1994, with LaTeX2ε, the old font commands \rm, \sf, \tt, \bf, \it, \sl, and \sc became obsolete. This package defines them, along with the deprecated KOMA-Script command \sfb. (2023-08-04, TeX, 0KB, 0 downloads)

http://www.pudn.com/Download/item/id/1691190525337775.html

[Multilingual Processing] do_IconFont

A font (icon) library that supports TTF-format icons downloaded from the http://www.iconfont.cn/plus website. (2017-11-03, Java, 0KB, 0 downloads)

http://www.pudn.com/Download/item/id/1688872742218733.html
Total: 9