没有合适的资源?快使用搜索试试~ 我知道了~
温馨提示
Aiming at the characteristics of Naxi language, a method is proposed for Naxi sentence similarity calculation. First, according to the characteristics of Naxi language that verbs set back, and nouns and verbs appear in chunks. Naxi NP and VP chunks are defined and chunk rule is extracted. According to the rules of the Naxi sentence chunking, extracts NP and VP chunks as so on. Then, by using the Naxi-Chinese dictionary, Naxi word is mapped to the Chinese word. By using the Chinese word similarit
资源推荐
资源详情
资源评论
Vol. , No. , 2013 1
Copyright © 2013 Inderscience Enterprises Ltd.
Naxi Sentence Similarity
Calculating Based on Improved
Chunking Edit-Distance
Huihui Zhang
School of Information Engineering and Automation
Kunming University of Science and Technology, Kunming
Key Laboratory of Intelligent Information Processing
Kunming University of Science and Technology, China
E-mail: glitter_zhang@163.com
Zhengtao Yu*
School of Information Engineering and Automation
Kunming University of Science and Technology, Kunming
Key Laboratory of Intelligent Information Processing
Kunming University of Science and Technology, China
E-mail: ztyu@hotmail.com
*Corresponding authors
Longhua Shen,
China Research and Development Academy of Machinery Equipment, Beijing
E-mail: lhsheng@liip.cn
Jianyi Guo, Cunli Mao
The School of Information Engineering and Automation
Kunming University of Science and Technology, China
Key Laboratory of Intelligent Information Processing
Kunming University of Science and Technology, China
E-mail:
gjade86@hotmail.com, mcl@163.com
Abstract: Aiming at the characteristics of Naxi language, a method is proposed for Naxi
sentence similarity calculation. First, according to the characteristics of Naxi language that
verbs set back, and nouns and verbs appear in chunks. Naxi NP and VP chunks are defined
and chunk rule is extracted. According to the rules of the Naxi sentence chunking, extracts
NP and VP chunks as so on. Then, by using the Naxi-Chinese Dictionary, Naxi word is
mapped to the Chinese word. By using the Chinese word similarity, Naxi words semantic
similarity is calculated. Similarity of chunks is calculated by the combination of Chinese
word similarity. Chunks similarity is defined as the replacement cost of chunk that edits
operation, and Naxi sentence similarity is computed according to replacement cost. At last,
experiment is done to calculate Naxi sentence similarity. Experimental result shows that
proposed method is better than other methods, and chunk exchange method can effectively
improve the accuracy of the Naxi sentence similarity.
Keywords: Naxi; Sentence similarity;Chunk; Edit-distance.
1 INTRODUCTION
Dongba is also called Naxi pictograph, which currently is
the only living pictograph in the world and is widesp -read
concerned by researchers around the world (Lei Shi, 2005;
Yu Sui-sheng, 2008). Naxi sentence similarity calculation is
the foundation of Naxi and Chinese bilingual retrieval and
bilingual learning. In domestic, respecting to the Chinese
sentence similarity comput ing research, Zhifang Sui and
Shiwen Yu proposed the Skeletal-Dependency-Tree-Based
Computational Model for the Sentence Similarity for the
machine translation(Zhifang Sui and Shiwen YU, 1998);
Sujian Li proposed relevance quantitative calculation model
which base on HowNet and Cilin(Su-jian Li, 2002);
Xueqiang Lv consider the two factors of word-form and
word-order similarity, and proposed sentence similarity
model and the most similar sentence search
algorithm(Xueqiang Lv, Feiliang Ren Huangzhi Dan and
Tianshun Yao, 2003); Wanxiang Che used Similar Chinese
Sentence Retrieval Based on Improved Edit-Distance(Bing
资源评论
weixin_38621365
- 粉丝: 7
- 资源: 906
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
最新资源
- 智能笔项目源代码全套技术资料.zip
- 在线考试系统项目源代码全套技术资料.zip
- 高等数学学习资料合集 高等数学(工本)mind
- 西门子V90效率倍增-伺服驱动功能库详解简易循环功能库之Homing-V90PN.mp4
- 自考04741计算机网络原理真题及答案及课件
- 基于STM32芯片开发 安防系统 完整作品
- 4_base.apk.1
- 学生导师双选系统项目源代码全套技术资料.zip
- 自考02318《计算机组成原理》试题及答案 2014-2018及课件
- 图书管理系统,仅供参考
- 数据科学与大数据毕业设计系统项目源代码全套技术资料.zip
- 全国自考02197概率论与数理统计(二)试题及答案2014-2019
- CHGCOLOR压缩包
- 多轮自动红队方法提升大语言模型安全性
- python语言kssp爬虫程序代码XQZQ.txt
- 亲测源码云赏V7.0微信视频打赏系统源码已测试完整无错版
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功