首页 | 本学科首页   官方微博 | 高级检索  
     检索      

核心词自动分阶的一种计算模型——以纳西族玛丽玛萨话为例
引用本文:陈保亚,李子鹤,汪锋,杜兆金,张静芬.核心词自动分阶的一种计算模型——以纳西族玛丽玛萨话为例[J].云南民族大学学报(哲学社会科学版),2012,29(5):121-126.
作者姓名:陈保亚  李子鹤  汪锋  杜兆金  张静芬
作者单位:1. 北京大学中文系北京大学中国语言学研究中心,北京,100871
2. 北京大学中文系
基金项目:教育部人文社会科学重点研究基地重大项目“基于系统语音对应的核心词分阶及建模研究”(项目编号:11JJD740004);韩国:POSCO TJ Park Foudantion;四川省凉山彝族自治州社科联项目“彝语文本解读和华夏文明起源研究”阶段成果
摘    要:核心词分阶是判定同源关系的必要步骤.基于大规模语音对应数据库,我们提出并讨论一种算法模型,该模型计算核心词的核心程度,自动调整高阶核心词集和低阶核心词集,使得两阶词集在已知为同源关系的语言中,其分布与已知为接触关系的语言显著不同,即通过算法调整核心词集,使得有阶分布的显著性增加.这个算法模型的基本思路分为两个密切相关的部分:核心程度算法和两阶核心词调整算法.

关 键 词:语源关系  核心词  自动分阶  算法模型  玛丽玛萨话

An Algorithm Model of Automatic Ranking of Basic Words: the Case of Malimasa Variety of the Naxi Nationality
CHEN Bao-ya,LI Zi-he.An Algorithm Model of Automatic Ranking of Basic Words: the Case of Malimasa Variety of the Naxi Nationality[J].Journal of Yunnan Nationalities University:Social Sciences,2012,29(5):121-126.
Authors:CHEN Bao-ya  LI Zi-he
Institution:(Department of Chinese Language and Literature/Centre for Chinese Linguistics,Peking University,Beijing 100871,China)
Abstract:This research,based on a large database of sound correspondence among languages in China,aims at proposing an algorithm model to work out the importance of each basic word,and then adjust the basic word between the high-rank set and the low-rank set automatically.The result will be that when the languages in question are genetically related,the distribution of basic words in the two sets differs obviously from that when the languages in question are in contact relationship.That is,through the algorithm of adjusting the two set of basic words,the obviousness of ranking will increase.This algorithm model can be divided into two interrelated parts: counting to what degree a word being basic,and adjusting the word between high-rank set and low-rank set.
Keywords:genetic relationship  basic words  automatic ranking  algorithm model  Malimasa
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号