基于词性信息自动识别和标注非分句 Automatic identification and labeling of non-clauses based on part of speech期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

基于词性信息自动识别和标注非分句

引用本文：	李琼,李志.基于词性信息自动识别和标注非分句[J].长春工程学院学报(社会科学版),2011,12(1):77-80.

作者姓名：	李琼李志

作者单位：	华中师范大学国际文化交流学院,武汉,430079;华中师范大学国际文化交流学院,武汉,430079

基金项目：	教育部人文社会科学研究青年项目的研究成果(项目编号:09YJC740032);华中师范大学“丹桂计划”项目

摘要：	在完成自动分词和词性标注工作的基础上,进行分句层次和关系的自动划分和标注,以期建设一个面向中文信息处理的大规模复句"精加工"语料库.可以利用词性信息制定一系列规则去实现部分非分句的自动识别和标注,同时建设一个短语库,把短语语言片段收录其中.
关键词：	词性短语库词性标注
Automatic identification and labeling of non-clauses based on part of speech

Institution:	LI Qiong,et al.(School of International Culture Exchanges CCNU,Wuhan 430079,China)

Abstract:	In order to build a ＂finishing＂ compound-sentence corpus for Chinese Information Process,automatic word segmentation and POS tagging work should be completed first of all.On this basis,automatic classification and labeling of levels and relationship between clauses should be conducted.We can use the POS information to develop a set of rules to achieve some non-clause of automatic identification and labeling,but also can build a phrase library,which includes the phrase language fragments.

Keywords:	part of speech phrase library rules
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏