1
文本自动标引与自动分类研究
1.3.2.4 参考文献

参考文献

[1]曾元显.关键词自动提取技术与相关词反馈[J].中国图书馆学会会报,1997,59:59-64.

[2]王强军,李芸,张普.信息技术领域术语提取的初步研究[J].术语标准化与信息技术,2003,1:32-33,37.

[3]Xun E,Huang C,Zhou M.A Unified Statistical Model for the Identi-fication of English BaseNP[A].In:Proceedings of 4th ACM Conference on Digital Libraries[C].Berkeley,CA,USA,2000:254-255.

[4]李素建,王厚峰,俞士汶,辛乘胜.关键词自动标引的最大熵模型应用研究[J].计算机学报,2004,27(9):1192-1197.

[5]张燕飞.信息组织的主题语言[M].武汉:武汉大学出版社,2005:226.

[6]Allan J,Carbonell J,Doddington G,Yamron J,Yang Y.Topic Detection and Tracking Pilot Study:Final Report[A].In:Proceedings of DARPA Broadcast News Transcription and Understanding Workshop [C].Lansdowne,Virginia,USA,1998:194-218.

[7]侯汉清,马张华.主题法导论[M].北京:北京大学出版社,1991.

[8]刘华.基于关键短语的文本内容标引研究[D].北京语言大学博士学位论文.2005:11-13.

[9]戚雨春,董达武,许以理,陈光磊.语言学百科词典[M].上海:上海辞书出版社,1993.

[10]Lahtinen T.Automatic Indexing:an Approach Using an Index Term Corpus and Combining Linguistic and Statistical Methods[R].Academic Dissertation,University of Helsinki,Finland,2000:34.

[11]Harter S P.Online Information Retrieval:Concepts,Principles and Techniques[M].Orlando,Florida:Academic Press,Inc.,1986:42.

[12]Luhn H P.A Statistical Approach to Mechanized Encoding and Searching of Literary Information[J].IBM Journal of Research and Development,1957,1(4):309-317.

[13]Luhn H P.The Automatic Creation of Literature Abstracts[J].IBM Journal of Research and Development.1958.2(2):159-165.

[14]Baxendale P E.Machine-made Index for Technical Literature—an Experiment[J].IBM.Journal of Research and Development,1958,2 (4):354-361.

[15]Edmundson H P,Oswald V A.Automatic Indexing and Abstracting of the Contents of Documents[R].Planning Research Corp, Document PRC R-126,ASTIA AD No.231606,Los Angeles,1959:1-142.

[16]Maron M E,Kuhns J L.On Relevance,Probabilistic Indexing and Information Retrieval[J].Journal of the Association for Computer Machinery,1960,7(3):216-244.

[17]Edmundson H P.New Methods in Automatic Abstracting Extracting [J].Journal of the Association for Computing Machinery,1969,16 (2):264-285.

[18]Lois L E.Experiments in Automatic Indexing and Extracting[J].Information Storage and Retrieval,1970,6:313-334.

[19]Salton G,Yang C S.On the Specification of Term Values in Automatic Indexing[J]..Journal of Documentation,1973,29(4):351-72.

[20]Salton G,Wong A,Yang C S.A Vector Space Model for Automatic Indexing[J].Communications of ACM,1975,18(11):613-620.

[21]Dillon M,Gray A S.FASIT:A Fully Automated Syntactically Based Indexing System[J].Journal of the American Society for Information Science,1983,34(2):99-108.

[22]Devadason F J.Computerization of Deep Structure Based Indexes[J].International Classification,1985,12(2):87-94.

[23]Deerwester S,Dumais S T,Landauer T K,Furnas G W,Harshman R A.Indexing by Latent Semantic Analysis[J]..Journal of the American Society for Information Science,1990,41(6):391-407.

[24]Silva W T,MiliDiu R L.Belief Function Model for Information Retrieval[J].Journal of the American Society for Information Science,1993,44(1):10-18.

[25]Cohen J D.Highlights:Language and Domain-independent Automatic Indexing Terms for Abstracting[J].Journal of the American Society for Information Science,1995,46(3):162-174.

[26]Chien L F.PAT-tree-based Keyword Extraction for Chinese Information Retrieval[A].In:Proceedings of the 20th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval(SIGIR1997)[C].Philadelphia,PA,USA,1997:50-59.

[27]Frank E,Paynter G W,Witten I H.Domain-Specific Keyphrase Extraction[A].In:Proceedings of the 16th International Joint Conference on Artificial Intelligence[C].Stockholm,Sweden,Morgan Kaufmann,1999:668-673.

[28]Turney P D.Learning to Extract Keyphrases from Text[R].NRC Technical Report ERB-1057,National Research Council,Canada.1999:1-43.

[29]Anjewierden A,Kabel S.Automatic Indexing of Documents with Ontologies[A].In:Proceedings of the 13th Belgian/Dutch Conference on Artificial Intelligence(BNAIC-01)[C].Amsterdam,Netherlands,2001:23-30.

[30]Tomokiyo T,Hurst M.A language Model Approach to Keyphrase Extraction[A].In:Proceedings of the ACL Workshop on Multiword Expressions:Analysis,Acquisition &Treatment[C].Sapporo,Japan,2003:33-40.

[31]Hulth A.Improved Automatic Keyword Extraction Given More Linguistic Knowledge[A].In:Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing[C].Sapporo,Japan,2003:216-223.

[32]Zhang K,Xu H,Tang J,Li J Z.Keyword Extraction Using Support Vector Machine[A].In:Proceedings of the Seventh International Conference on Web-Age Information Management(WAIM2006)[C].Hong Kong,China,2006:85-96.

[33]Ercan G,Cicekli I.Using Lexical Chains for Keyword Extraction[J].Information Processing and Management,2007,43(6):1705-1714.

[34]Zhang Chengzhi,Wang Huilin,Yao Liu,Wu Dan,et.al.Automatic Keyword Extraction from Documents Using Conditional Random Fields[J].Journal of Computational Information Systems,2008,4 (3):1169-1180.

[35]章成志.自动标引研究的回顾与展望[J].现代图书情报技术,2007,(11):33-39.

[36]韩客松,王永成.中文全文标引的主题词标引和主题概念标引方法[J].情报学报,2001,20(2):212-216.

[37]索红光,刘玉树,曹淑英.一种基于词汇链的关键词抽取方法[J].中文信息学报,2006,20(6):25-30.

[38]Dennis S F.The Design and Testing of a Fully Automatic Indexingsearching System for Documents Consisting of Expository Text[M].In:G.Schecter eds.Information Retrieval:a Critical Review,Washington D.C.:Thompson Book Company,1967:67-94.

[39]Salton G,Buckley C.Automatic Text Structuring and Retrieval-Experiments in Automatic Encyclopedia Searching[A].In:Proceedings of the Fourteenth SIGIR Conference[C].New York:ACM,1991:21-30.

[40]Salton G,Yang C S,Yu C T.A Theory of Term Importance in Automatic Text Analysis[J].Journal of the American society for Information Science,1975,26(1):33-44.

[41]马颖华,王永成,苏贵洋,张宇萌.一种基于字同现频率的汉语文本主题抽取方法[J].计算机研究与发展,2004,40(6):874-878.

[42]Matsuo Y,Ishizuka M.Keyword Extraction from a Single Document Using Word Co-ocuurrence Statistical Information[J]..International Journal on Artificial Intelligence Tools,2004,13(1):157-169.

[43]Witten I H,Paynter G W,Frank E,Gutwin C,Nevill-Manning C G.KEA:Practical Automatic Keyphrase Extraction[A].In:Proceedings of the 4th ACM Conference on Digital Library(DL’99)[C].Berkeley,CA,USA,1999:254-26.

[44]张庆国,薛德军,张振海,张君玉.海量数据集上基于特征组合的关键词自动抽取[J].情报学报,2006,25(5):587-593.

[45]Keith Humphreys J B.Phraserate:An Html Keyphrase Extractor [R].Technical Report,University of California,Riverside,2002:1-16.

[46]侯汉清,章成志,郑红.Web概念挖掘中标引源加权方案初探[J].情报学报,24(1):87-92.

[47]Boris L,Andreas H.Automatic Multi-label Subject Indexing in a Multilingual Environment[A].In:Proceedings of 7th European Conference in Research and Advanced Technology for Digital Libraries (ECDL 2003)[C].Trondheim,Norway,2003:140-151.

[48]苏新宁.信息检索理论与技术[M].北京:科学技术文献出版社,2004.

[49]苏新宁.情报检索理论与技术[M].北京:科学技术文献出版社,2004.

[50]苏金树,张博锋,徐昕.基于机器学习的文本分类技术研究进展[J].软件学报,2006,17(9):1848-1859.

[51]Yaakov H-K.Automatic Extraction of Keywords from Abstracts [A].In:Proceedings of the 7th International Conference on Knowledge-Based Intelligent Information and Engineering Systems (KES2003)[C],Oxford,UK,2003:843-946.

[52]Leouski A V,Croft W B.An Evaluation of Techniques for Clustering Search Results[R].Technical Report IR—76,Department of Computer Science,University of Massachusetts,Amherst,1996:1-19.

[53]储荷婷.索引自动化:自动标引的主要方法[J].情报学报,1993,12 (3):218-229.

[54]Medelyna O.Automatic Keyphrase Indexing with a Domain-Specific Thesaurus[D].Master Thesis,University of Freiburg,Germany,2005:23-26.

[55]战学钢,林鸿飞,姚天顺.中文文献的层次分类方法[J].中文信息学报,1999,13(6):20-25.

[56]刘开瑛,郑家恒.基于《金融档案分类表》的自动分类算法研究[J].情报学报,1997,16(5):346-353.

[57]庞剑锋,卜东波,白硕.基于向量空模型的文本自动分类系统的研究与实现[J].计算机应用研究,2001,18(9):23-26.

[58]黄萱青,吴立德,石崎洋之,徐国伟.独立于语种的文本分类方法[J].中文信息学报,2000,14(6):1-7.

[59]刘挺,秦兵,张宇,车万翔.信息检索系统导论[M].北京:机械工业出版社,2008.

[60]朱巧明,李培峰,吴娴等.中文信息处理教程[M].北京:清华大学出版社,2005.

[61]Luhn,H.P.Keyword-in-context index for technical literature (KWIC index)[R].Yorktown Heights,NY:IBM Advanced System Development Division,RC-127.1959.

[62]Maron,M.E.Mechanized documentation:the logic behind a probabilistic interpretation[A].In M.E.Stevens,V.E.,Giuliano,&L.B.Heilprin(Eds.),Statistical association methods for mechanized documentation[C],Washington,D.C.:National Bureau of Standards.1965:9-13.

[63]Hayes,P.J.,Andersen,P.M.,Nirenburg,I.B.,and Schmandt,L.M.1990.Tcs:a Shell for Content-based Text Categorization [A].In:Proceedings of CAIA-90,6th IEEE Conference on Artificial Intelligence Applications[C],Santa Barbara,USA,1990:320-326.

[64]Chidanand Apté,Fred Damerau,Sholom M.Weiss.Automated learning of decision rules for text categorization[J].ACM Transactions on Information Systems.1994,12(3):233-251.

[65]Cohen,W.W.,Singer,Y.Context-sensitive learning methods for text categorization[A].In:proceeding of the 19th ACM International Conference on Research and Development in Information Retrieval [C].USA:ACM Press,1996:307-315.

[66]David D.Lewis,Robert E.Schapire,James P.Callan,Ron Papka.Training algorithms for linear text classifiers[A].In:Proceedings of the 19th ACM International Conference on Research and Development in Information Retrieval[C].USA:ACM Press,1996:298-306.

[67]Hwee Tou Ng,Wei Boon Goh,Kok Leong Low.Low Feature selection,perception learning,and a usability case study for text categorization[A].In:Proceedings of the 20th ACM International Conference on Research and Development in Information Retrieval[C].USA:ACM Press,1997:67-73.

[68]Yang Y and Pedersen J O.A comparative study on feature selection in text categorization[A].In:Proceedings of 14th International Conference on Machine Learning(ICML-97)[C],Nashville,USA,1997:412-420.

[69]Oh-Woog Kwon.Text categorization based on k-nearest neighbor approach for Web site classification[J].Information Processing and Management.2003,39:250-44.

[70]Ludovic Denoyer.Bayesian network model for semi-structured document classification[J].Information Processing and Management.2004,40:807-827.

[71]Jyh-Jong Tsay.Improving linear classifier for Chinese text categorization[J].Information Processing and Management.2004,40:223-237.

[72]Ali Selamat.Web page feature selection and classification using neural networks[J].Information Sciences.2004,158:69-88.

[73]Jihe,Ah-HweeTan,Chew Lim Tan.A Comparative Study on Chinese Text Categorization Methods[J].PRICAI Workshop on Text and Web Mining.2000:24-35.

[74]岳喜才,吴晓宇,郑崇勋,叶大田.一种大类别数分类的神经网络方法[J].计算机研究与发展,2000(3):278-283.

[75]Salton,Gerard.Introduction to Modern Information Retrieval[M].McGraw-Hill,1983.

[76]Lewis,D.,Ringuette,M.A Comparison of Two Learning Algorithms for Text Categorization[A].In:Proceedings of the Third Annual Symposium on Document Analysis and Information Retrieval (SDAIR94)[C],Las Vegas,NV,USA,1994:81-93.

[77]William W.Cohen(1996):Learning Rules that Classify E-Mail[A].In:Proceedings of the AAAI Spring Symposium on Machine Learning in Information Access[C],Stanford,USA,1996:18-25.

[78]T.Joachims,Text Categorization with Support Vector Machines:Learning with Many Relevant Features[R].LS8-Report 23,Universit?t Dortmund,LS VIII-Report,1997.

[79]D.Koller and M.Sahami.Hierarchically classifying documents using very few words[A].In:Proceedings of the 14th International Conference on Machine Learning(ICML97)[C].Nashville,TN,USA,1997:170-178.

[80]Rainbow.http://www.cs.cmu.edu/~mccallum/bow/rainbow/.Accessed:2008-1-1.

[81]Shafer Keith.Scorpion Helps Catalog the Web.Bulletin of the American Society for Information Science,1997,24(1):28-29.

[82]Scorpion[OCLC—Software].http://www.oclc.org/research/software/scorpion/default.htm.Accessed:2008-1-1.

[83]Kamal Nigam,John Lafferty,and Andrew McCallum.Using maximum entropy for text classification[A].In:Proceedings of the IJCAI-99 Workshop on Machine Learning for Information Filtering[C],Stockholm,Sweden,1999:61-67.

[84]Desire|Research:Deliverables:D3.1.http://www.desire.org/html/research/deliverables/D3.6/.2008-1-1.

[85]German Harvest Automated Retrieval and Directory.http://www.gerhard.de/.2008-1-1.

[86]侯汉清,黄刚.电子计算机与文献分类.计算机与图书馆,1982(1):5-14.

[87]中文网页自动分类竞赛.http://net.pku.edu.cn/~sewm/contest.htm.2005-1-1.

[88]侯汉清,薛鹏军.基于知识库的网页自动标引和自动分类系统的设计[J].大学图书馆学报,2004,22(1):50-55,64.

[89]李渝勤,孙丽华.基于规则的自动分类在文本分类中的应用[J].中文信息学报,2004,18(4):9-14.

[90]周孟霞.基于规则学习的中医药文献自动标引系统[D].浙江大学硕士学位论文,2004.

[91]张雪英.基于粗糙集理论的文本自动分类研究[D].南京理工大学博士学位论文,2005.

[92]许增福,梁静国,田晓宇.基于加权模糊推理网络的文本自动分类方法[J].哈尔滨工程大学学报,2004,25(4):504-508.

[93]辛明海.个性化信息服务中的本体论自动分类和多Agent技术[D].华侨大学硕士学位论文,2002.

[94]莫少强.计算机辅助图书分类系统的设计与试验[J].计算机与图书馆,1984(1):29-35.

[95]朱兰娟,王永成.中文文献的自动分类[J].中文信息,1986,(4):26-28.

[96]张炳恒,刘金芝.微机图书分类编目自动化系统[J].图书馆工作与研究,1989,(4):13-19.

[97]苏新宁.档案自动分类算法研究[J].情报学报,1995,14(3):194-200.

[98]叶新明.基于《中图法》的中文文献自动分类[J].情报学报,1995,14 (6):423-433.

[99]吴军,王作英.汉语语料的自动分类[J].中文信息学报,1995,9 (4):25-32.

[100]王永成,张坤.中文文献自动分类研究[J].情报学报,1997,16 (5):354-359.

[101]邹涛,王继成,黄源,张福炎.中文文档自动分类系统的设计与实现[J].中文信息学报,1999,13(03):26-32.

[102]李晓黎,刘继敏,史忠植.概念推理网及其在文本分类中的应用[J].计算机研究与发展,2000,37(09):1032-1038.

[103]中文文本分类演示系统.http://mtgroup.ict.ac.cn/class/index.html.Accessed:2006-1-1.

[104]林鸿飞.基于示例的文本标题分类机制[J].计算机研究与发展,2001,38(09):1132-1136.

[105]TRS.http://www.trs.com.cn/products/textmine/trsckm/.Accessed:2008-1-1.

[106]李荣陆,王建会,陈晓云,陶晓鹏,胡运发.使用最大熵模型进行中文文本分类[J].计算机研究与发展,2005,42(1):94-101.

[107]苏金树,张博锋,徐昕.基于机器学习的文本分类技术研究进展[J].软件学报,2006,17(9):1848-1859.

[108]张雪英.基于机器学习的文本自动分类研究进展[J].情报学报,2006,25(6):730-739.

[109]王军.数字图书馆的知识组织系统——从理论到实践[M].北京:北京大学出版社,2009.