Liu Jianyi (刘建毅)
- Name: Liu Jianyi (刘建毅)
- Discipline: Computer Science and Technology
- Research interests: disaster backup, network information content security, network content processing, natural language processing.
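Liu Jianyi, male, Ph.D., is an associate professor at the School of Computer Science, Beijing University of Posts and Telecommunications. He graduated from Xi'an Institute of Posts and Telecommunications in 2000 and received his Ph.D. in Signal and Information Processing from Beijing University of Posts and Telecommunications in 2005. His current research interests are disaster backup, network information content security, network content processing, and natural language processing.
He has completed 7 national key research projects and a number of industry cooperation projects, applied for 4 computer software copyrights, and published more than 40 papers, of which more than 10 are indexed by SCI or EI. He has also contributed to 2 published books. E-mail: liujy@bupt.edu.cn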
Upload date
2008-03-21
Jianyi Liu and Cong Wang
A natural user interface (NUI), in which a user can type or speak a request, is a good complement to the familiar graphical user interface (GUI). Accurately extracting user intent from such typed or spoken queries is very difficult. Statistical and knowledge-based methods are the two opposing kinds of approach, and each has advantages and disadvantages. This paper presents a mixed approach to spoken language understanding that tries to make the best use of both. The method was tested on real user data and achieved a task error rate of 1.94% and a semantic concept error rate of 5.73%.
spoken language understanding, statistical classification, grammar-based parsing
Upload date
2008-03-21
Jianyi Liu, Jinghua Wang
In this paper, we introduce the language network and describe three kinds of networks. Keyword extraction is an important technology in many areas of document processing. In particular, we propose a keyword extraction algorithm based on the language network and PageRank. First, a semantic network is built for a single document; then PageRank is applied to the network to determine the importance of each word; finally, the top-ranked words are selected as the keywords of the document. The algorithm is tested on the CISTR corpus, and the experimental results show that it is practical and effective.
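The three steps named in this abstract (build a word network for one document, run PageRank on it, keep the top-ranked words) can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract builds a semantic network, whereas the sketch substitutes a simple co-occurrence window as a stand-in, and the function names, the window size, and the damping factor of 0.85 are hypothetical choices, not details taken from the paper.

```python
from collections import defaultdict


def build_cooccurrence_graph(tokens, window=2):
    """Link words that appear within `window` positions of each other.

    Assumption: a co-occurrence graph stands in for the paper's semantic network.
    """
    graph = defaultdict(set)
    for i, word in enumerate(tokens):
        for j in range(i + 1, min(i + window + 1, len(tokens))):
            other = tokens[j]
            if other != word:
                graph[word].add(other)
                graph[other].add(word)
    return graph


def pagerank(graph, damping=0.85, iterations=50):
    """Plain power-iteration PageRank on an undirected graph."""
    nodes = list(graph)
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        new_rank = {}
        for node in nodes:
            # Each neighbour passes on an equal share of its current rank.
            incoming = sum(rank[nb] / len(graph[nb]) for nb in graph[node])
            new_rank[node] = (1 - damping) / n + damping * incoming
        rank = new_rank
    return rank


def extract_keywords(tokens, top_k=3, window=2):
    """Return the `top_k` highest-ranked words (ties broken alphabetically)."""
    rank = pagerank(build_cooccurrence_graph(tokens, window))
    return sorted(rank, key=lambda w: (-rank[w], w))[:top_k]


if __name__ == "__main__":
    text = "network keyword network pagerank network document"
    print(extract_keywords(text.split(), top_k=2, window=1))
```

In practice the token list would come from a segmented and stop-word-filtered document; the sketch skips that preprocessing.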
Upload date
2008-03-21
Jianyi Liu, Yixin Zhong
This paper proposes a hypothesis reordering technique, based on a newly established theory called comprehensive information theory, to improve the accuracy of speech recognition in a man-machine dialog system. For each hypothesis, we calculate the amount of comprehensive information it provides and then reorder the N-best hypotheses according to that amount. Experimental results show the effectiveness of the technique.
Upload date
2008-03-21
Liu Jian-yi, Wang Jing-hua, Wang Cong
The Journal of China Universities of Posts and Telecommunications, Volume 13, Issue 3, September 2006
New word identification is a difficult problem in Chinese word segmentation. In the automatic segmentation of large Chinese texts, new words can cause segmentation errors. The paper defines new word identification as a binary classification problem (whether a character sequence in a given context is a new word or not) and uses two statistical learning approaches, based on support vector machines (SVM) and C4.5. We then investigate various linguistic and statistical features, including the independent word probability of the former character, the independent word probability of the latter character, the front-position in-word probability of the former character, the back-position in-word probability of the latter character, mutual information, and frequency. In the PK-close test of the 1st Special Interest Group for Chinese Language Processing (SIGHAN) Bakeoff, this approach achieves high precision and recall.
new word identification, support vector machine, decision tree
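Two of the statistical features listed in this abstract, frequency and mutual information, can be illustrated with a short sketch. It assumes the common pointwise definition MI(x, y) = log2(P(xy) / (P(x) P(y))) estimated from raw corpus counts; the abstract does not give the exact formula, so that choice, the function names, and the toy corpus are all hypothetical. The resulting feature dictionary is the kind of input a binary classifier such as an SVM or C4.5 would consume.

```python
import math
from collections import Counter


def char_statistics(corpus):
    """Count single characters and adjacent character pairs in a list of strings."""
    unigrams, bigrams = Counter(), Counter()
    for sentence in corpus:
        unigrams.update(sentence)
        bigrams.update(sentence[i:i + 2] for i in range(len(sentence) - 1))
    return unigrams, bigrams


def mutual_information(pair, unigrams, bigrams):
    """Pointwise mutual information (base 2) of a two-character candidate.

    Assumption: probabilities are plain relative frequencies, without smoothing.
    """
    if bigrams[pair] == 0:
        return float("-inf")  # the pair never occurs in the corpus
    p_xy = bigrams[pair] / sum(bigrams.values())
    p_x = unigrams[pair[0]] / sum(unigrams.values())
    p_y = unigrams[pair[1]] / sum(unigrams.values())
    return math.log2(p_xy / (p_x * p_y))


def candidate_features(pair, unigrams, bigrams):
    """Feature dictionary for one candidate; only two of the listed features are shown."""
    return {
        "frequency": bigrams[pair],
        "mutual_information": mutual_information(pair, unigrams, bigrams),
    }


if __name__ == "__main__":
    unigrams, bigrams = char_statistics(["abab", "cd"])
    print(candidate_features("ab", unigrams, bigrams))
```

The remaining features (independent word probability and in-word position probabilities) need a segmented training corpus and are left out of the sketch.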
Upload date
2008-03-21
Jianyi Liu, Jinghua Wang, and Cong Wang
Text representation is the basis of text processing. Most current text representation models do not consider the relations between words, which results in the loss of the text's structural information, and that information is important for understanding the text. This paper proposes a novel text representation model that uses a lexical network to represent the text and retains its structure. According to the different levels of word relations, co-occurrence, syntactic, and semantic networks are introduced. The text network representation was applied to text classification to measure the representational ability of the model. The experimental results show that our text network representation is superior to the vector space model.
Upload date
2008-03-21
Jianyi Liu, Shenjin Sun, Qing Guo, Cong Wang, Yixin Zhong
In this paper, we present a spoken dialog system for a sports event guide, which can provide users with information on items such as schedule, venue, team, and athlete according to their preferences. It is one of the key applications in the ongoing project “Multi-lingual Intelligent Information Services Network System”, which will serve the Beijing 2008 Olympic Games. In addition to describing the system architecture and individual modules, we focus mainly on two important functions: natural language understanding (NLU) and dialog management (DM). The system was evaluated on 884 dialogues and achieved 85.9% transaction success.
Upload date
2008-03-21
Liu Jianyi (刘建毅), Zhang Pengfei (张鹏飞), Wang Cong (王枞), Guo Yanhui (郭燕慧), Li Yun (李赟)
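This paper designs a high-performance e-mail filtering system. The system adopts a natural language understanding methodology based on comprehensive information and filters mail at three levels: syntactic (keyword filtering), semantic (topic filtering), and pragmatic (tendency filtering), so as to minimize both the misclassification of legitimate mail and the missed detection of illegitimate mail.
e-mail filtering, topic filtering, tendency filtering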
Upload date
2008-03-21
Liu Jianyi (刘建毅), Wang Cong (王枞), Guo Yanhui (郭艳慧), Zhong Yixin (钟义信)
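Spoken natural language understanding is the most important component of a spoken dialog system. This paper proposes a spoken language understanding method that combines semantic analysis based on a semantic grammar with a statistical classifier. The method has two steps: first, semantic analysis yields the semantic concepts of the sentence; then a statistical classifier yields the task of the sentence. Applied to an intelligent public transit system, the method achieved good experimental results.
spoken natural language understanding, semantic grammar, statistical classifier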
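The two-step method described in this abstract (semantic analysis for concepts, then a statistical classifier for the task) can be sketched in miniature as below. Everything concrete here is an assumption made for illustration: regular expressions stand in for the semantic grammar, a small naive Bayes model stands in for the unspecified statistical classifier, and the concept names, task labels, and English bus-query utterances are invented examples rather than data from the paper.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical semantic-grammar rules: concept name -> pattern capturing its value.
CONCEPT_RULES = {
    "ORIGIN": re.compile(r"\bfrom (\w+)"),
    "DESTINATION": re.compile(r"\bto (\w+)"),
    "ROUTE": re.compile(r"\b(?:bus|route) (\d+)"),
}


def extract_concepts(utterance):
    """Step 1: rule-based semantic analysis, returning {concept: value}."""
    concepts = {}
    for name, pattern in CONCEPT_RULES.items():
        match = pattern.search(utterance.lower())
        if match:
            concepts[name] = match.group(1)
    return concepts


class NaiveBayesTaskClassifier:
    """Step 2: multinomial naive Bayes (a stand-in classifier) with add-one smoothing."""

    def __init__(self):
        self.task_counts = Counter()
        self.feature_counts = defaultdict(Counter)
        self.vocabulary = set()

    @staticmethod
    def features(utterance):
        # Features are the words plus the names of the concepts found in step 1.
        return utterance.lower().split() + sorted(extract_concepts(utterance))

    def train(self, examples):
        for utterance, task in examples:
            self.task_counts[task] += 1
            for feature in self.features(utterance):
                self.feature_counts[task][feature] += 1
                self.vocabulary.add(feature)

    def predict(self, utterance):
        total = sum(self.task_counts.values())
        best_task, best_score = None, float("-inf")
        for task, count in self.task_counts.items():
            score = math.log(count / total)
            denom = sum(self.feature_counts[task].values()) + len(self.vocabulary)
            for feature in self.features(utterance):
                score += math.log((self.feature_counts[task][feature] + 1) / denom)
            if score > best_score:
                best_task, best_score = task, score
        return best_task


def understand(utterance, classifier):
    """Combine both steps into one result."""
    return {"task": classifier.predict(utterance), "concepts": extract_concepts(utterance)}


# Invented training utterances and task labels, for illustration only.
TRAINING = [
    ("how do i get from xidan to wangfujing", "plan_trip"),
    ("take me from dongzhimen to xizhimen", "plan_trip"),
    ("where does bus 44 stop", "route_info"),
    ("tell me the stops of route 300", "route_info"),
]

if __name__ == "__main__":
    clf = NaiveBayesTaskClassifier()
    clf.train(TRAINING)
    print(understand("from jishuitan to beijingzhan please", clf))
```

Feeding the concept names into the classifier is one simple way to let the two steps share information; the abstract does not say how the paper couples them.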
Upload date
2008-03-21
Liu Jianyi (刘建毅), Wang Jinghua (王菁华), Wang Cong (王枞)
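This paper proposes a statistics-based algorithm for semi-automatically acquiring a semantic grammar from an unannotated corpus. The algorithm repeatedly applies temporal clustering and spatial clustering to a domain-specific corpus: temporal clustering discovers the grammatical structure of language fragments, and spatial clustering discovers their semantic categories. Iterating this loop produces a rough grammar. Finally, the extracted rough grammar is manually proofread to obtain the semantic grammar for the new domain. The experimental results show that the method is effective and feasible.
dialog system, semantic grammar, K-L distance, mutual information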
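The keywords mention K-L distance, which suggests one way the spatial clustering step might decide that two words belong to the same semantic category: compare the distributions of the words that surround them. The sketch below is an assumption-laden illustration, not the paper's algorithm. It assumes that only the immediately following word is used as context, that add-one smoothing is applied, and that the distance is the symmetric sum of the two Kullback-Leibler divergences; the function names and the toy weather sentences are hypothetical.

```python
import math
from collections import Counter


def context_distribution(word, sentences, vocabulary, smoothing=1.0):
    """Smoothed distribution over the words that immediately follow `word`."""
    counts = Counter()
    for tokens in sentences:
        for current, following in zip(tokens, tokens[1:]):
            if current == word:
                counts[following] += 1
    total = sum(counts.values()) + smoothing * len(vocabulary)
    return {v: (counts[v] + smoothing) / total for v in vocabulary}


def kl_divergence(p, q):
    """Kullback-Leibler divergence D(p || q) for distributions over the same keys."""
    return sum(p[v] * math.log(p[v] / q[v]) for v in p)


def kl_distance(word_a, word_b, sentences):
    """Symmetric K-L distance between the right-context distributions of two words."""
    vocabulary = sorted({token for tokens in sentences for token in tokens})
    p = context_distribution(word_a, sentences, vocabulary)
    q = context_distribution(word_b, sentences, vocabulary)
    return kl_divergence(p, q) + kl_divergence(q, p)


# Toy corpus, invented for illustration.
SENTENCES = [
    ["weather", "in", "beijing", "today"],
    ["weather", "in", "shanghai", "today"],
    ["show", "weather", "please"],
]

if __name__ == "__main__":
    # Words with similar contexts get a small distance and are merge candidates.
    print(kl_distance("beijing", "shanghai", SENTENCES))
    print(kl_distance("beijing", "weather", SENTENCES))
```

A clustering loop would merge the pair with the smallest distance into one category, rewrite the corpus with the category label, and repeat; that loop and the temporal clustering step are omitted here.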
Upload date
2008-03-21
Liu Jianyi (刘建毅), Ma Li (马莉), Li Chengcheng (李成城)
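With the growth and wide dissemination of network information, spoken dialog systems have attracted the attention of more and more researchers. This paper mainly describes the design and implementation of a weather forecast spoken dialog system, which supports weather queries for 150 cities in China and abroad. The system performed well in a laboratory environment.
spoken dialog system, semantic analysis, dialog management, natural language generation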