Updated on 2024/02/02

 
TAKEUCHI Koichi
 
Organization
Faculty of Environmental, Life, Natural Science and Technology Associate Professor
Position
Associate Professor
Profile

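1998.4 - 2000.3   Research Associate, Research and Development Department, National Center for Science Information Systems (NACSIS)
2000.4 - 2003.11   Research Associate, Information Management Section, Human and Social Information Research Division, National Institute of Informatics
2002.10 - 2003.5   Part-time Researcher, INRIA Lorraine, France
2003.12 -   Lecturer, Intelligent Information Engineering, Department of Information Engineering, Faculty of Engineering, Okayama University
2005.4 -   Lecturer, Pattern Informatics, Graduate School of Natural Science and Technology, Okayama University
2021.4 -   Associate Professor, Pattern Informatics, Faculty of Natural Science and Technology, Institute of Academic and Research, Okayama University
2023.4 -   Associate Professor, Pattern Informatics, Faculty of Environmental, Life, Natural Science and Technology, Institute of Academic and Research, Okayama University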


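Engaged in research on natural language processing using statistical learning models, the development of predicate dictionaries based on linguistic insights, and the development of automated essay scoring methods.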

Degree

  • Ph.D. in Engineering ( 1998.3   Nara Institute of Science and Technology )

Research Interests

  • Natural Language Processing

  • Lexical Semantics

  • terminology

  • Computational Linguistics

  • essay grading

Research Areas

  • Informatics / Intelligent informatics

  • Informatics / Theory of informatics

  • Humanities & Social Sciences / Linguistics

Education

  • Nara Institute of Science and Technology   Graduate School of Information Science   Information Processing

    - 1998.3

    Country: Japan

    Notes: Completed

Research History

  • Okayama University   The Graduate School of Natural Science and Technology   Associate Professor

    2021.4

  • National Institute of Informatics   Visiting Associate Professor

    2007.4 - 2010.3

  • Okayama University   The Graduate School of Natural Science and Technology   Senior Research Assistant

    2005.4 - 2021.3

  • National Institute of Informatics   Visiting Associate Professor

    2004.12 - 2007.3

  • Okayama University   Faculty of Engineering   Lecturer

    2003.12 - 2005.3

  • INRIA Lorraine   Guest Researcher

    2002.10 - 2003.5

    Country:France

  • National Institute of Informatics   Human and Social Information Research Division, Information Management Section   Assistant Professor

    2000.4 - 2003.11

  • National Center for Science Information Systems (NACSIS)   Research Associate

    1998.4 - 2000.3

Professional Memberships

  • Information Processing Society of Japan

  • Institute of Electronics, Information and Communication Engineers

  • The Association for Natural Language Processing

  • Association for Computing Machinery

Committee Memberships

  • 2023 7th International Conference on Natural Language Processing and Information Retrieval   Conference Committee  

    2023.10 - 2023.12   

    Committee type:Other

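  • Information Processing Society of Japan, Special Interest Group on Information Fundamentals and Access Technologies   Secretary, Steering Committee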

    2022.5   

    Committee type:Academic society

  • The Association for Natural Language Processing   Local Chair of Annual Meeting of Association for Natural Language Processing  

    2017 - 2018   

    Committee type:Academic society

  • The Institute of Electronics, Information and Communication Engineers   Committee Member of Technical Committee on Natural Language Understanding and Models of Communication  

    2016.6   

    Committee type:Academic society

  • Computational Terminology Workshop   Organizer  

    2016   

    Committee type:Academic society

  • The Institute of Electronics, Information and Communication Engineers   Chair of Technical Committee on Natural Language Understanding and Models of Communication  

    2014 - 2016   

    Committee type:Academic society

  • The Institute of Electronics, Information and Communication Engineers   Vice Chair of Technical Committee on Natural Language Understanding and Models of Communication  

    2012 - 2013   

    Committee type:Academic society

  • The Institute of Electronics, Information and Communication Engineers   Secretary of Technical Committee on Natural Language Understanding and Models of Communication  

    2010 - 2011   

    Committee type:Academic society

  • The Institute of Electronics, Information and Communication Engineers   Assistant Secretary of Technical Committee on Natural Language Understanding and Models of Communication  

    2008 - 2009   

    Committee type:Academic society

 

Papers

  • Semantic Role Labeling for Japanese Using Span-Based Models Reviewed

    Callum Tulloch, Koichi Takeuchi

    Proceedings of 7th International Conference on Natural Language Processing and Information Retrieval   2023.12

    Authorship:Last author   Language:English   Publishing type:Research paper (international conference proceedings)

  • Estimating Task Priority in Japanese Disaster Chronology Logs Reviewed

    Shinji Koju, Koichi Takeuchi, Akihiro Watanabe, Takahiro Hirayama, Hiroyuki Nakao

    Proceedings of 7th International Conference on Natural Language Processing and Information Retrieval   2023.12

    Language:English   Publishing type:Research paper (international conference proceedings)

  • Dependence of perception of vocabulary difficulty on contexture Reviewed

    Parisa Supitayakul, Rika Kuramitsu, Zeynep Yucel, Akito Monden, Koichi Takeuchi

    Proceedings of the 15th International Congress on Advanced Applied Informatics   2023.12

    Language:English   Publishing type:Research paper (international conference proceedings)

  • A Platform for Searching Texts for Desired Expressions in a User-Editable Pattern Matching Environment for Language Learning Reviewed

    Tatsuya Katsura, Koichi Takeuchi

    Proceedings of 14th International Congress on Advanced Applied Informatics   2023.7

    Authorship:Last author   Language:English   Publishing type:Research paper (international conference proceedings)

  • Statistical Learning Models for Japanese Essay Scoring Toward One-shot Learning Reviewed

    Chihiro Ejima, Koichi Takeuchi

    12th International Congress on Advanced Applied Informatics (IIAI-AAI)   313 - 318   2022.7

    Authorship:Last author   Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:IEEE

    DOI: 10.1109/iiaiaai55812.2022.00070

  • Data Augmentation for Question Answering Using Transformer-based VAE with Negative Sampling Reviewed

    Wataru Kano, Koichi Takeuchi

    2022 12th International Congress on Advanced Applied Informatics (IIAI-AAI)   467 - 470   2022.7

    Authorship:Last author   Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:IEEE

    DOI: 10.1109/iiaiaai55812.2022.00097

  • Development of Essay Scoring Methods Based on Reference Texts with Construction of Research-Available Japanese Essay Data Reviewed

    Koichi Takeuchi, Masayuki Ohno, Kouta Motojin, Masahiro Taguchi, Yoshihiko Inada, Masaya Iizuka, Tatsuhiko Abo, Hitoshi Ueda

    Transactions of Information Processing Society of Japan   62 ( 9 )   1586 - 1604   2021.9

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper (scientific journal)

    J-GLOBAL

  • Constructing Web-Accessible Semantic Role Labels and Frames for Japanese as Additions to the NPCMJ Parsed Corpus Reviewed

    Koichi Takeuchi, Alastair Butler, Iku Nagasaki, Takuya Okamura, Prashant Pardeshi

    Proceedings of The 12th Language Resources and Evaluation Conference   3153 - 3161   2020

    Authorship:Lead author   Language:English   Publishing type:Research paper (international conference proceedings)

  • Using Neural Networks to Construct a Japanese Semantic Role Labeling Model Reviewed

    Takuya Okamura, Koichi Takeuchi, Yasuhiro Ishihara

    Transactions of Information Processing Society of Japan   60 ( 11 )   2063 - 2074   2019.11

    Authorship:Corresponding author   Language:Japanese   Publishing type:Research paper (scientific journal)

    J-GLOBAL

  • Evaluation of Embedded Vectors for Lexemes and Synsets Toward Expansion of Japanese WordNet Reviewed

    Daiki Ko, Koichi Takeuchi

    Proceedings of The 16th International Conference of the Pacific Association for Computational Linguistics   79 - 87   2019.10

    Authorship:Last author   Language:English   Publishing type:Research paper (international conference proceedings)

  • Improving Japanese Semantic-Role-Labeling Performance with Transfer Learning as Case for Limited Resources of Tagged Corpora on Aggregated Language Reviewed

    Takuya Okamura, Koichi Takeuchi, Yasuhiro Ishihara, Masahiro Taguchi, Yoshihiko Inada, Masaya Iizuka, Tatsuhiko Abo, Hitoshi Ueda

    Proceedings of The 32nd International Conference of the Pacific Association for Computational Linguistics   2018

    Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:Association for Computational Linguistics

    Other Link: https://dblp.uni-trier.de/conf/paclic/2018

  • Construction of Open Essay Writing Data and Automatic Essay Scoring System for Japanese

    Masayuki Ohno, Koichi Takeuchi, Kota Motojin, Masahiro Taguchi, Yoshihiko Inada, Masaya Iizuka, Tatsuhiko Abo, Hitoshi Ueda

    Proceedings of The 15th International Conference of the Pacific Association for Computational Linguistics   215 - 220   2017

  • Construction of Japanese Semantic Role Labeling System Using Hierarchical Tag Context Trees Extracted from Tail Expressions of Dependency Elements

    57 ( 7 )   1611 - 1626   2016

  • A Method of Augmenting Bilingual Terminology by Taking Advantage of the Conceptual Systematicity of Terminologies

    Miki Iwai, Koichi Takeuchi, Kyo Kageura, Kazuya Ishibashi

    Proceedings of the 5th International Workshop on Computational Terminology   2016

  • Automatic Evaluation Methods of Trainee's Answers to Develop a 4R Risk Prediction Training System

    Hirotsugu Minowa, Hiromi Fujimoto, Koichi Takeuchi

    2015 IIAI 4TH INTERNATIONAL CONGRESS ON ADVANCED APPLIED INFORMATICS (IIAI-AAI)   283 - 286   2015

    Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:IEEE  

    The 4 Rounds (4R) training method is practiced at industrial office sites to reduce accidents caused by human factors. The 4R method raises workers' hazard-prediction capability, such as coping and decision-making to avoid dangerous situations. Workers, as trainees, train on their own by finding the hazards that lurk in a hazard prediction training (KYT in Japanese) sheet. However, a major problem is that a single trainee cannot train alone using the 4R method, because the training requires instruction from a human expert.
    To solve this problem, we aim to develop a hazard prediction training system. The advantage of this system is that it enables trainees to train by themselves anytime and anywhere using the 4R method. In this paper, we report our proposed training system and the development of a subsystem, based on machine learning, that evaluates whether a trainee's answer is correct, and we report the results of an evaluation experiment, which showed an average accuracy of 63.0 +/- 22.9 [%].

    DOI: 10.1109/IIAI-AAI.2015.301

    Web of Science

  • A Mechanical Method for Evaluating Trainee Answers in a Risk Prediction Training System Based on the 4R Method

    Hirotsugu Minowa, Hiromi Fujimoto, Koichi Takeuchi

    Information Engineering Express   1 ( 3 )   59 - 68   2015

    Publishing type:Research paper (scientific journal)   Publisher:International Institute of Applied Informatics

    DOI: 10.52731/iee.v1.i3.49

  • Japanese Semantic Role Labeling with Hierarchical Tag Context Trees

    Yasuhiro Ishihara, Koichi Takeuchi

    Proceedings of PACLING2015   184 - 189   2015

  • The use of corpus evidence and human introspection to create idiom variations

    Rei Miyata, Ryoko Adachi, Ulrich Apel, Iris Vogel, Wolfgang Fanderl, Ryo Murayama, Koichi Takeuchi, Kyo Kageura

    Proceedings of the 2nd Asia Pacific Corpus Linguistic Conference   2014

  • A Simple Platform for Defining Idiom Variation Matching Rules

    Koichi Takeuchi, Ulrich Apel, Rei Miyata, Ryo Murayama, Ryoko Adachi, Wolfgang Fanderl, Iris Vogel, Kyo Kageura

    Proceedings of the XVI EURALEX International Congress: The User in Focus   399 - 404   2014

  • Development and use of a platform for defining idiom variation rules

    Ryoko Adachi, Koichi Takeuchi, Ryo Murayama, Wolfgang Fanderl, Rei Miyata, Iris Vogel, Ulrich Apel, Kyo Kageura

    Proceedings of the 5th International Language Learning Conference (ILLC 2013)   2013

  • Terminology-driven Augmentation of Bilingual Terminologies

    Koichi Sato, Koichi Takeuchi, Kyo Kageura

    Proceedings of the XIV Machine Translation Summit   3 - 10   2013

  • Enhancing Multi-word Term Extraction for Designated Theme Embedded in a Domain Corpus

    Teruo Koyama, Koichi Takeuchi

    Proceedings of International Conference on Terminology and Artificial Intelligence   2011

  • VERB SENSE DISAMBIGUATION BASED ON THESAURUS OF PREDICATE-ARGUMENT STRUCTURE An Evaluation of Thesaurus of Predicate-argument Structure for Japanese Verbs

    Koichi Takeuchi, Suguru Tsuchiyama, Masato Moriya, Yuuki Moriyasu, Koichi Satoh

    KEOD 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE ENGINEERING AND ONTOLOGY DEVELOPMENT   208 - 213   2011

    Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:INSTICC-INST SYST TECHNOLOGIES INFORMATION CONTROL & COMMUNICATION  

    This paper presents a system for word sense disambiguation based on a manually constructed thesaurus of predicate-argument structure, which is an ontology on the linguistic side providing essential information for mapping from texts to verb concepts. This system can be effective for word sense disambiguation even though the target word sense system is different from the thesaurus. We applied the proposed word sense disambiguation system to the test corpus of SemEval-2010 Japanese tasks. Experimental results showed that the thesaurus-based disambiguation system outperformed a CRFs-based system in recall rates of verb sense disambiguation. From the results of verb sense disambiguation, we clarified that the abstracted verb classes (709 types) in our proposed system were effective sets for verb sense disambiguation.

    Web of Science

  • Brains, not brawn: The use of "smart" comparable corpora in bilingual terminology mining

    Emmanuel Morin, Béatrice Daille, Koichi Takeuchi, Kyo Kageura

    ACM Transactions on Speech and Language Processing   7 ( 1 )   2010.8

    Language:English   Publishing type:Research paper (scientific journal)  

    Current research in text mining favors the quantity of texts over their representativeness. But for bilingual terminology mining, and for many language pairs, large comparable corpora are not available. More importantly, as terms are defined vis-à-vis a specific domain with a restricted register, it is expected that the representativeness rather than the quantity of the corpus matters more in terminology mining. Our hypothesis, therefore, is that the representativeness of the corpus is more important than the quantity and ensures the quality of the acquired terminological resources. This article tests this hypothesis on a French-Japanese bilingual term extraction task. To demonstrate how important the type of discourse is as a characteristic of the comparable corpora, we used a state-of-the-art multilingual terminology mining chain composed of two extraction programs, one in each language, and an alignment program. We evaluated the candidate translations using a reference list, and found that taking discourse type into account resulted in candidate translations of a better quality even when the corpus size was reduced by half. © 2010 ACM.

    DOI: 10.1145/1839478.1839479

    Scopus

  • An ontology-driven system for detecting global health events

    Nigel Collier, Reiko Matsuda Goodwin, John McCrae, Son Doan, Ai Kawazoe, Mike Conway, Asanee Kawtrakul, Koichi Takeuchi, Dinh Dien

    Proceedings of The 23rd International Conference on Computational Linguistics   132 - 134   2010

  • A Thesaurus of Predicate-Argument Structure for Japanese Verbs to Deal with Granularity of Verb Meanings

    Koichi Takeuchi, Kentaro Inui, Nao Takeuchi, Atsushi Fujita

    Proceedings of The 8th Workshop on Asian Language Resources   1 - 8   2010

  • Co-clustering with Recursive Elimination for Verb Synonym Extraction from Large Text Corpus

    Koichi Takeuchi, Hideyuki Takahashi

    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS   E92D ( 12 )   2334 - 2340   2009.12

    Language:English   Publishing type:Research paper (scientific journal)   Publisher:IEICE-INST ELECTRONICS INFORMATION COMMUNICATIONS ENG  

    The extraction of verb synonyms is a key technology for building a verb dictionary as a language resource. This paper presents a co-clustering-based verb synonym extraction approach that increases the number of extracted meanings of polysemous verbs from a large text corpus. For verb synonym extraction with a clustering approach, dealing with polysemous verbs can be problematic because each polysemous verb should be categorized into different clusters depending on each of its meanings; thus there is a high possibility of failing to extract some of the meanings of polysemous verbs. Our proposed approach can extract the different meanings of polysemous verbs by recursively eliminating the extracted clusters from the initial data set. The experimental results of verb synonym extraction show that the proposed approach increases the correct verb clusters by about 50% with a 0.9% increase in precision and a 1.5% increase in recall over the previous approach.

    DOI: 10.1587/transinf.E92.D.2334

    Web of Science

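  • Dictionaries and Lexical Conceptual Structure in Language Information Processing (言語情報処理における辞書と語彙概念構造) Invited

    Koichi Takeuchi

    Goi no Imi to Bunpo (Lexical Meaning and Grammar), Kurosio Publishers   105 - 119   2009.2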

    Publishing type:Research paper (scientific journal)

  • Bio-medical Term Extraction on Simple Rule Language

    Koichi Takeuchi, Takashi Shinnou, Nigel Collier

    Proceedings of The 3rd International Symposium on Lang   132 - 134   2009

  • Pattern Based Term Extraction Using ACABIT System Reviewed

    Koichi Takeuchi, Kyo Kageura, Teruo Koyama, Béatrice Daille, Laurent Romary

    CoRR   abs/0907.2452   2009

  • BioCaster: detecting public health rumors with a Web-based text mining system Reviewed

    Nigel Collier, Son Doan, Ai Kawazoe, Reiko Matsuda Goodwin, Mike Conway, Yoshio Tateno, Quoc-Hung Ngo, Dinh Dien, Asanee Kawtrakul, Koichi Takeuchi, Mika Shigematsu, Kiyosu Taniguchi

    BIOINFORMATICS   24 ( 24 )   2940 - 2941   2008.12

    Language:English   Publishing type:Research paper (scientific journal)   Publisher:OXFORD UNIV PRESS  

    BioCaster is an ontology-based text mining system for detecting and tracking the distribution of infectious disease outbreaks from linguistic signals on the Web. The system continuously analyzes documents reported from over 1700 RSS feeds, classifies them for topical relevance and plots them onto a Google map using geocoded information. The background knowledge for bridging the gap between layman's terms and formal-coding systems is contained in the freely available BioCaster ontology which includes information in eight languages focused on the epidemiological role of pathogens as well as geographical locations with their latitudes/longitudes. The system consists of four main stages: topic classification, named entity recognition (NER), disease/location detection and event recognition. Higher order event analysis is used to detect more precisely specified warning signals that can then be notified to registered users via email alerts. Evaluation of the system for topic recognition and entity identification is conducted on a gold standard corpus of annotated news articles.

    DOI: 10.1093/bioinformatics/btn534

    Web of Science

  • Extraction of Verb Synonyms using Co-clustering Approach Reviewed

    Koichi Takeuchi

    PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON UNIVERSAL COMMUNICATION   173 - 178   2008

    Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:IEEE COMPUTER SOC  

    This paper shows that a graph-based co-clustering approach is suitable for extracting verb synonyms from large-scale texts. The proposed bipartite graph algorithm can produce clusters of verb synonyms as well as noun synonyms, taking into account word co-occurrence between a verb and its arguments. Experimental results show that the co-clustering approach achieves higher accuracy than a vector-based single clustering approach of the kind usually used for thesaurus construction.

    DOI: 10.1109/ISUC.2008.66

    Web of Science

  • Bilingual Terminology Mining - Using Brain, not brawn comparable corpora

    Emmanuel Morin, Beatrice Daille, Koichi Takeuchi, Kyo Kageura

    Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics   664-671   2007

  • Flexible automatic look-up of English idiom entries in dictionaries

    K. Takeuchi, T. Kanehila, K. Hilao, T. Abekawa, K. Kageura

    MT Summit XI   451 - 458   2007

  • Bilingual Terminology Mining - Using Brain, not brawn comparable corpora. Reviewed

    Emmanuel Morin, Béatrice Daille, Koichi Takeuchi, Kyo Kageura

    ACL 2007, Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, June 23-30, 2007, Prague, Czech Republic   664 - 671   2007

    Publisher:The Association for Computational Linguistics  

    CiNii Article

    Other Link: http://dblp.uni-trier.de/db/conf/acl/acl2007.html#conf/acl/MorinDTK07

  • A multilingual ontology for infectious disease surveillance: rationale, design and challenges

    Nigel Collier, Ai Kawazoe, Lihua Jin, Mika Shigematsu, Dinh Dien, Roberto A. Barrero, Koichi Takeuchi, Asanee Kawtrakul

    LANGUAGE RESOURCES AND EVALUATION   40 ( 3-4 )   405 - 413   2006.12

    Language:English   Publishing type:Research paper (scientific journal)   Publisher:SPRINGER  

    A lack of surveillance system infrastructure in the Asia-Pacific region is seen as hindering the global control of rapidly spreading infectious diseases such as the recent avian H5N1 epidemic. As part of improving surveillance in the region, the BioCaster project aims to develop a system based on text mining for automatically monitoring Internet news and other online sources in several regional languages. At the heart of the system is an application ontology which serves the dual purpose of enabling advanced searches on the mined facts and of allowing the system to make intelligent inferences for assessing the priority of events. However, it became clear early on in the project that existing classification schemes did not have the necessary language coverage or semantic specificity for our needs. In this article we present an overview of our needs and explore in detail the rationale and methods for developing a new conceptual structure and multilingual terminological resource that focusses on priority pathogens and the diseases they cause. The ontology is made freely available as an online database and downloadable OWL file.

    DOI: 10.1007/s10579-007-9019-7

    Web of Science

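  • Describing Syntactic and Semantic Properties of Japanese Verbs Based on Lexical Conceptual Structure (語彙概念構造に基づく日本語動詞の統語・意味特性の記述) Reviewed

    Koichi Takeuchi, Kentaro Inui, Atsushi Fujita

    Lexicon Forum   2006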

  • Bio-medical entity extraction using support vector machines

    Koichi Takeuchi, Nigel Collier

    Artificial Intelligence in Medicine   33 ( 2 )   125 - 137   2005

    Language:English   Publishing type:Research paper (scientific journal)   Publisher:Elsevier  

    Objective: Support vector machines (SVMs) have achieved state-of-the-art performance in several classification tasks. In this article we apply them to the identification and semantic annotation of scientific and technical terminology in the domain of molecular biology. This illustrates the extensibility of the traditional named entity task to special domains with large-scale terminologies such as those in medicine and related disciplines. Methods and materials: The foundation for the model is a sample of text annotated by a domain expert according to an ontology of concepts, properties and relations. The model then learns to annotate unseen terms in new texts and contexts. The results can be used for a variety of intelligent language processing applications. We illustrate SVMs capabilities using a sample of 100 journal abstracts texts taken from the {human, blood cell, transcription factor} domain of MEDLINE. Results: Approximately 3400 terms are annotated and the model performs at about 74% F-score on cross-validation tests. A detailed analysis based on empirical evidence shows the contribution of various feature sets to performance. Conclusion: Our experiments indicate a relationship between feature window size and the amount of training data and that a combination of surface words, orthographic features and head noun features achieve the best performance among the feature sets tested. © 2004 Elsevier B.V. All rights reserved.

    DOI: 10.1016/j.artmed.2004.07.019

    Scopus

    PubMed

  • Construction of Grammar Based Term Extraction Model for Japanese

    K. Takeuchi, K. Kageura, T. Koyama, B. Daille, L. Romary

    Proceedings of the 3rd International Workshop on Computational Terminology   2004

  • Paraphrasing of Japanese Light-verb Constructions Based on Lexical Conceptual Structure

    A. Fujita, K. Furihata, K. Inui, Y. Matsumoto, K. Takeuchi

    Proceedings of the 2nd ACL Workshop on Multiword Expression   2004

  • Comparison of Character-level and Part of Speech Features for Name Recognition in Bio-medical Texts Reviewed

    N. Collier, K. Takeuchi

    Journal of Biomedical Informatics, Elsevier Science   2004

  • Open Ontology Forge: A Tool for Ontology Creation and Text Annotation Applied to the Biomedical Domain Reviewed

    Kawazoe Ai, Mullen Tony, Takeuchi Koichi, Wattarujeekrit Tuangthong, Collier Nigel

    GI   14   677 - 678   2003.12

    Language:English   Publishing type:Research paper (scientific journal)   Publisher:Japanese Society for Bioinformatics  

    DOI: 10.11234/gi1990.14.677

    CiNii Article

  • Bio-Medical Entity Extraction using Support Vector Machines Reviewed

    K. Takeuchi, N. Collier

    Proceedings of the ACL2003 Workshop on Natural Language Processing in Biomedicine   2003

    Authorship:Lead author

  • Deverbal Compound Noun Analysis Based on Lexical Conceptual Structure

    K. Takeuchi, K. Kageura, T. Koyama

    Proceedings of Poster/Demo session in 41st Annual Meeting of the Association for Computational Linguistics (ACL03)   2003

  • Building Disambiguation System for Compound Noun Analysis Based on Lexical Conceptual Structure

    K. Takeuchi, K. Kageura, T. Koyama

    Proceedings of the second International Workshop on Generative Approaches to the Lexicon   2003

  • Use of Support Vector Machines in Extended Named Entity Recognition

    K. Takeuchi, N. Collier

    In 6th Conference on Natural Language Learning 2002 (CoNLL-2002)   2002

  • An LCS-based Approach for Analyzing Japanese Compound Nouns with Deverbal Heads

    K. Takeuchi, K. Kageura, T. Koyama

    In Proceedings of the 2nd International Workshop on Computational Terminology (COMPUTERM2002)   2002

  • Categorising deverbal nouns based on lexical conceptual structure for analysing Japanese compounds

    K Takeuchi, K Uchiyama, M Yoshioka, K Kageura, T Koyama

    2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5   904 - 909   2002

    Language:English   Publishing type:Research paper (international conference proceedings)   Publisher:IEEE  

    This paper proposes a method to describe conceptual meanings of Japanese deverbal nouns for the purpose of analysing Japanese compound nouns. The lexical meaning of each constituent word is very important since it deeply influences the word formation of compound words. Therefore we intend to use the lexical meanings of constituent words to build a Japanese compound noun analyser. As a first step to describe lexical meanings of words, we have established a simple semantic structure based on lexical conceptual structure (we refer to the structure we establish as TLCS in this paper). After analysing 250 deverbal nouns that appeared in technical terms, we found that only 14 types of TLCS cover various deverbal nouns. We will explain in this paper the types of TLCS and also show the procedure of assigning TLCSes to deverbals. In order to evaluate the efficacy of TLCS, we made use of the accuracy of a simple compound noun analyser for analysing technical terms. The result was very good, which shows that our proposed method is very promising.

    Web of Science

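  • Analysis of Dependency Relations within Compound Nouns Using Lexical Conceptual Structure (語彙概念構造を利用した複合名詞内の係り関係の解析)

    Koichi Takeuchi, Kiyoko Uchiyama, Masaharu Yoshioka, Kyo Kageura, Teruo Koyama

    Transactions of Information Processing Society of Japan   2002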

  • Analysis of Relations between Noun and Deverbal Nouns in Japanese Compounds Based on Lexical Conceptual Structure

    K. Takeuchi, K. Uchiyama, M. Yoshioka, K. Kageura, T. Koyama

    Proceedings of Pacific Association for Computational Linguistics (PACLING'01)   2001

  • Defining Principled but Practically Manageable Lexical Units in Japanese Textual Corpora

    M. Okada, K. Takeuchi, M. Yoshioka, K. Kageura, T. Koyama

    Proceedings of the Workshop of Language Resources in Asia (6th Natural Language Processing Pacific Rim Symposium Post-Conference Workshop)   2001

  • Categorising deverbal nouns based on lexical conceptual structure for analysing Japanese compounds Reviewed

    Koichi Takeuchi, Kiyoko Uchiyama, Masaharu Yoshioka, Kyo Kageura, Teruo Koyama

    Proceedings of the IEEE International Conference on Systems, Man and Cybernetics   2   904 - 909   2001

    Publishing type:Research paper (scientific journal)  

    This paper proposes a method to describe conceptual meanings of Japanese deverbal nouns for the purpose of analysing Japanese compound nouns. The lexical meaning of each constituent word is very important since it deeply influences the word formation of compound words. Therefore we intend to use the lexical meanings of constituent words to build a Japanese compound noun analyser. As a first step to describe lexical meanings of words, we have established a simple semantic structure based on lexical conceptual structure (we refer to the structure we establish as TLCS in this paper). After analysing 250 deverbal nouns that appeared in technical terms, we found that only 14 types of TLCS cover various deverbal nouns. We will explain in this paper the types of TLCS and also show the procedure of assigning TLCSes to deverbals. In order to evaluate the efficacy of TLCS, we made use of the accuracy of a simple compound noun analyser for analysing technical terms. The result was very good, which shows that our proposed method is very promising.

    DOI: 10.1109/ICSMC.2001.973032

    Scopus

    researchmap

  • Recent advances in automatic term recognition: Experiences from the NTCIR workshop on information retrieval and term recognition Reviewed

    Kyo Kageura, Masaharu Yoshioka, Koichi Takeuchi, Teruo Koyama, Keita Tsuji, Fuyuki Yoshikane

    Terminology   6 ( 2 )   151 - 173   2000

    Language:English   Publishing type:Research paper (scientific journal)  

    This article provides basic background information on the articles included in this special issue on Japanese term extraction, by (i) clarifying the basic background of research into automatic term recognition, (ii) explaining briefly the ‘contest’-style workshop we organised in 1999, and (iii) briefly summarising the ATR methodologies proposed in the articles, and positioning their ideas, philosophies and methodologies within ATR from a unified perspective. Through this information, we intend to consolidate the contributions of the NTCIR TMREC workshop, and, hopefully, clarify a basic framework for discussion which different researchers can use to constructively communicate with each other about automatic term extraction and beyond. © 2000 John Benjamins Publishing Co.

    DOI: 10.1075/term.6.2.03kag

    Scopus

    researchmap

  • Japanese OCR Error Correction Using Stochastic Morphological Analyzer and Probabilistic Word N-gram Model

    International Journal of Computer Processing of Oriental Language   13 ( 1 )   69 - 82   2000

  • 統計的言語モデルを用いたOCR誤り訂正システムの構築

    竹内孔一, 松本裕治

    情報処理学会論文誌   40 ( 6 )   2679 - 2689   1999

  • A Grammatical Framework for Analysing Japanese Nominal Compounds with Special Reference to Specialised Terms

    K. Uchiyama, K. Takeuchi, M. Yoshioka, K. Kageura, T. Koyama

    Proceedings of the 12th World Congress of Applied Linguistics (AILA'99)   1999

  • Overview of TMREC Tasks. Reviewed

    Kyo Kageura, Masaharu Yoshioka, Koichi Takeuchi, Teruo Koyama, Keita Tsuji, Fuyuki Yoshikane, Maho Okada

    Proceedings of the First NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition, NTCIR-1, Tokyo, Japan, August 30 - September 1, 1999   1999

    Publisher:National Center for Science Information Systems (NACSIS)  

    CiNii Article

    researchmap

    Other Link: http://dblp.uni-trier.de/db/conf/ntcir/ntcir1999.html#conf/ntcir/KageuraYTKTYO99

  • Evaluation of the Term Recognition Task. Reviewed

    Kyo Kageura, Masaharu Yoshioka, Keita Tsuji, Fuyuki Yoshikane, Koichi Takeuchi, Teruo Koyama

    Proceedings of the First NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition, NTCIR-1, Tokyo, Japan, August 30 - September 1, 1999   417 - 434   1999

    Publisher:National Center for Science Information Systems (NACSIS)  

    CiNii Article

    researchmap

    Other Link: http://dblp.uni-trier.de/db/conf/ntcir/ntcir1999.html#conf/ntcir/KageuraYTYTK99

  • Evaluation of the Keyword Extraction Task. Reviewed

    Koichi Takeuchi, Masaharu Yoshioka, Teruo Koyama, Kyo Kageura

    Proceedings of the First NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition, NTCIR-1, Tokyo, Japan, August 30 - September 1, 1999   1999

    Publisher:National Center for Science Information Systems (NACSIS)  

    researchmap

    Other Link: http://dblp.uni-trier.de/db/conf/ntcir/ntcir1999.html#conf/ntcir/TakeuchiYKK99

  • Evaluation of the Role Analysis Task. Reviewed

    Teruo Koyama, Masaharu Yoshioka, Koichi Takeuchi, Kyo Kageura

    Proceedings of the First NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition, NTCIR-1, Tokyo, Japan, August 30 - September 1, 1999   1999

    Publisher:National Center for Science Information Systems (NACSIS)  

    researchmap

    Other Link: http://dblp.uni-trier.de/db/conf/ntcir/ntcir1999.html#conf/ntcir/KoyamaYTK99

  • 隠れマルコフモデルによる日本語形態素解析のパラメータ推定 Reviewed

    竹内孔一, 松本裕治

    情報処理学会論文誌   38 ( 3 )   500 - 509   1997


Books

  • コーパスと自然言語処理

    朝倉出版  2017 

  • Thesaurus with Predicate-Argument Structure to Provide Base Framework to Determine States, Actions, and Change-of-States

    IGI Global  2016  ( ISBN:152250432X )

MISC

  • ChatGPTによる意味役割付与システムの構築

    大岡 史明, 竹内 孔一

    研究報告情報基礎とアクセス技術(IFAT)   2023-IFAT-153 ( 1 )   1 - 5   2023.12

    Authorship:Last author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    researchmap

  • 医療事故報告書に対するテキストマイニングシステム

    松村 崇光, 竹内 孔一

    第 22 回情報科学技術フォーラム (FIT 2023)   387 - 390   2023.9

    Authorship:Last author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    researchmap

  • 災害発生時のクロノロジーに対する優先度推定

    孝壽 真治, 竹内 孔一, 渡邉 暁洋, 平山 隆浩, 中尾 博之

    第 22 回情報科学技術フォーラム (FIT 2023)   第2分冊   33 - 36   2023.9

    Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    researchmap

  • ブラウザ上でユーザが編集可能な言語パターンマッチシステムの構築

    桂 辰弥, 竹内 孔一

    第 22 回情報科学技術フォーラム (FIT2023)   287 - 290   2023.9

    Authorship:Last author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    researchmap

  • BERTを利用したチャットボットのQ&Aデータ自動作成

    小原孝介, 美尾樹, 多村新平, 今上博司, 佐藤功一, 滝澤一樹, 竹内孔一

    言語処理学会第29回年次大会発表論文集   519 - 521   2023.3

    Authorship:Last author   Language:Japanese   Publishing type:Internal/External technical report, pre-print, etc.  

    researchmap

  • 舶用ディーゼル機関に関する問い合わせ対応用チャットボットのための類義語辞書の自動生成

    美尾樹, 小原孝介, 多村新平, 今上博司, 佐藤功一, 滝澤一樹, 竹内孔一

    言語処理学会第29回年次大会発表論文集   516 - 518   2023.3

    Authorship:Last author   Language:Japanese   Publishing type:Internal/External technical report, pre-print, etc.  

    researchmap

  • 深層学習を利用した PropBank 形式の日本語意味役割付与モデル

    タロック カラム, 竹内 孔一, バトラー アラステア, 長崎 郁, パルデシ プラシャント

    言語処理学会第29回年次大会発表論文集   2419 - 2421   2023.3

    Language:Japanese   Publishing type:Internal/External technical report, pre-print, etc.  

    researchmap

  • 災害医療におけるクロノロジーの優先度識別

    孝壽真治, 竹内孔一, 渡邉暁洋, 平山隆浩, 中尾博之

    情報処理学会研究報告第149回 情報基礎とアクセス技術研究会 (SIG-IFAT)   2023.2

    Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    researchmap

  • 事前学習済みモデルを利用した日本語小論文採点手法の構築

    藩 宇偉, 竹内 孔一

    第21回情報科学技術フォーラム(FIT2022)   2022.9

    Authorship:Last author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    researchmap

  • ブロック形式を利用したパターンマッチシステムの構築

    竹内孔一, 小笠原崇, 岡田魁人, 今田将也

    言語処理学会 第28回年次大会発表論文集   E1-3   2022.3

    Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    researchmap

  • 機械学習を利用した日本語小論文採点手法の比較

    堀江遼河, 竹内孔一

    言語処理学会第28回年次大会発表論文集   G2-2   2022.3

    Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    researchmap

  • 模範答案のみを利用した日本語小論文採点支援システム

    江島知優, 竹内孔一

    言語処理学会第28回年次大会発表論文集   E4-5   2022.3

    Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    researchmap

  • 災害医療におけるクロノロジーの分析

    竹内孔一, 山崎瑶, 渡邉暁洋, 平山隆浩, 中尾博之

    電子情報通信学会信学技報   121(415) NLC2021-31   19 - 23   2022.3

    Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    researchmap

  • 述語の概念フレームとPropBank形式の意味役割を付与したNPCMJ-PTの構築

    竹内孔一, アラステアバトラー, 長崎郁, プラシャントパルデシ

    言語処理学会第28回年次大会発表論文集   E1-2   2022.3

    Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    researchmap

  • Sentence-BERTを利用したFAQ検索におけるデータ拡張手法

    加納渉, 竹内孔一

    言語処理学会第28回年次大会発表論文集   C7-1   2022.3

    Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    researchmap

  • Annotation Results of Semantic Role Labels and Frames in NPCMJ-PT and Estimation of Semantic Role Labels

    IPSJ SIG Technical Report   2022-IFAT-146 ( 3 )   1 - 5   2022.3

    Authorship:Lead author   Language:Japanese   Publishing type:Internal/External technical report, pre-print, etc.  

    researchmap

  • Construction of Japanese Essay Data for Evaluating Automatic Essay Scoring System

    竹内孔一, 田口雅弘, 稲田佳彦, 飯塚誠也, 阿保達彦, 上田均

    IEICE Technical Report   121 ( 178(NLC2021-15) )   40 - 44   2021.9

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    J-GLOBAL

    researchmap

  • NPCMJへのPropBank形式の意味役割と概念フレームの付与の進捗報告

    竹内孔一, バトラー アラステア, 長崎郁, パルデシ プラシャント

    言語処理学会年次大会発表論文集   27th   2021.3

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    J-GLOBAL

    researchmap

  • Blocklyを利用したタグ付きコーパス検索パタン構築ツール

    岡田魁人, 竹内孔一

    言語処理学会年次大会発表論文集   27th   2021.3

    Authorship:Last author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    J-GLOBAL

    researchmap

  • 意味役割付与テキストに対するPrologベースの探索木による言語パタンマッチシステム構築

    小笠原崇, 竹内孔一

    言語処理学会年次大会発表論文集   27th   2021.3

    Authorship:Last author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    J-GLOBAL

    researchmap

  • BERTを利用した日本語小論文採点支援システムの検討

    江島知優, 堀江遼河, 竹内孔一

    言語処理学会年次大会発表論文集   27th   2021.3

    Authorship:Last author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    J-GLOBAL

    researchmap

  • Building a Japanese Semantic Role Labeling Using Bayesian Inference

    岸本廉, 竹内孔一

    19th   2020.9

    Authorship:Last author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    J-GLOBAL

    researchmap

  • PropBank形式を考慮したNPCMJに対する意味役割付与~態の違いと経験者の付与~

    竹内孔一, バトラー アラステア, 長崎郁, パルデシ プラシャント

    言語処理学会年次大会発表論文集   26th   633 - 636   2020

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    J-GLOBAL

    researchmap

  • Annotating Semantic Role Labels of Both PropBank Style Roles and Conventional Named Roles to NPCMJ

    竹内孔一

    The Special Interest Group Technical Reports of IPSJ   2020-IFAT-138 ( 4 )   2020

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    J-GLOBAL

    researchmap

  • ニューラルネットワークを利用した日本語小論文の自動採点の検討

    清野光雄, 竹内孔一

    情報科学技術フォーラム講演論文集   18th   2019.9

    Authorship:Last author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    J-GLOBAL

    researchmap

  • 日本語WordNetにおける語義・概念の分散表現獲得

    國府大輝, 竹内孔一

    情報科学技術フォーラム講演論文集   18th   2019.9

    Authorship:Last author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    J-GLOBAL

    researchmap

  • 日英対訳データとニューラルネットワーク機械翻訳を利用した類語表現の抽出

    徳原生輝, 竹内孔一, 村上仁一, 徳久雅人

    情報科学技術フォーラム講演論文集   18th   2019.9

    Authorship:Corresponding author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    J-GLOBAL

    researchmap

  • NPCMJに対する述語項構造シソーラスの意味役割と概念フレームの付与

    竹内 孔一, Alastair Butler, 長崎 郁, Prashant Pardeshi

    情報処理学会研究報告   2019-NL-241 ( 4 )   2019.8

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    researchmap

  • Development of a parsed corpus and its application to linguistic research and education

    Prashant Pardeshi, Kei Yoshimoto, Susanne Miyata, Koichi Takeuchi, Hideki Kishimoto

    The Japanese Society for Language Sciences 2019 Conference Handbook   11 - 13   2019.7

    Language:English   Publishing type:Research paper, summary (international conference)  

    researchmap

  • 構文や語彙意味論の分析成果をプログラムとして具現化する言語パターンマッチAPIの可能性 Invited

    竹内孔一

    情報処理学会研究報告   2019 ( NL-239 )   2019.3

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    J-GLOBAL

    researchmap

  • 小論文採点支援システムにおける文字誤り検出モジュールの構築

    小畑友也, 竹内孔一, 大野雅幸, 泉仁宏太, 田口雅弘, 稲田佳彦, 飯塚誠也, 阿保達彦, 上田均

    言語処理学会年次大会発表論文集   25th   807 - 810   2019.3

    Authorship:Corresponding author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    researchmap

  • 述語項構造シソーラスの構築と概念体系を利用した専門用語の処理

    竹内孔一

    情報の科学と技術   69 ( 9 )   421 - 426   2019

    Authorship:Lead author   Language:Japanese   Publishing type:Article, review, commentary, editorial, etc. (trade magazine, newspaper, online media)  

    researchmap

  • PropBankスタイルの意味役割タグを導入した述語項構造シソーラスとNPCMJへの付与計画

    竹内孔一, バトラー アラステア, 長崎郁, ホーン スティーブンライト

    言語処理学会年次大会発表論文集   25th   2019

    Authorship:Lead author   Language:Japanese   Publishing type:Research paper, summary (national, other academic conference)  

    J-GLOBAL

    researchmap

  • Computational terminology and filtering of terminological information: Introduction to the special issue

    Patrick Drouin, Natalia Grabar, Thierry Hamon, Kyo Kageura, Koichi Takeuchi

    Terminology   24 ( 1 )   1 - 6   2018

    Language:English   Publishing type:Book review, literature introduction, etc.   Publisher:John Benjamins Publishing Company  

    DOI: 10.1075/term.00010.dro

    Scopus

    researchmap

  • Construction of a Bilingual Term Extension System

    116 ( 451 )   13 - 17   2017.2

  • Construction of Object-Oriented Language Interface

    49   23 - 27   2015.9

  • 形態素解析の系統的誤りと用語抽出

    小山 照夫, 竹内 孔一

    研究報告自然言語処理(NL)   2015 ( 6 )   1 - 4   2015.1

    Language:Japanese   Publisher:一般社団法人情報処理学会  

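    Japanese term extraction requires a morphological analyzer and a morpheme dictionary, but when documents from a specialized field are analyzed with an existing analyzer and dictionary, limits on analysis accuracy tend to lower term extraction performance. Some analysis errors, however, appear to be systematic, and in some cases the correct result can be inferred. Correcting such errors in the analysis output after the fact and then extracting terms from the corrected output can be expected to improve extraction performance. In this report, we show that several systematic error patterns occur when analyzing documents in the information processing field, and report that extracting terms from the corrected results improves term extraction performance.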

    CiNii Article

    CiNii Books

    researchmap

  • Annotating Semantic Role Information to Japanese Balanced Corpus

    Koichi Takeuchi, Masayuki Ueno, Nao Takeuchi

    Proceedings of MAPLEX 2015   2015

  • D-14-2 Adapting a speech recognition engine Julius to recognize local area names

    Nishikawa Takaya, Takeuchi Koichi

    Proceedings of the IEICE General Conference   2014 ( 1 )   129 - 129   2014.3

    Language:Japanese   Publisher:The Institute of Electronics, Information and Communication Engineers  

    CiNii Article

    CiNii Books

    researchmap

  • テキストマイニングシンポジウム

    竹内孔一

    電子情報通信学会 情報・システムソサイエティ誌   19 ( 3 )   17 - 18   2014

  • 用語管理システムの開発

    小山照夫, 竹内孔一, 濱田宏平

    研究報告自然言語処理(NL)   2013 ( 2 )   1 - 4   2013.7

    Language:Japanese   Publisher:一般社団法人情報処理学会  

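    Although the importance of organizing and managing terminology has been pointed out in many research fields, it is hard to say that this has been adequately addressed in practice. Reasons include that frameworks for managing terminology are themselves underdeveloped, and that carrying out terminology management continuously requires substantial human resources. In this study, we report an attempt to build, for each appropriately selected research area, a database for comprehensive terminology management, and to develop a system with a term registration support function that uses automatic term candidate extraction to reduce the effort involved in terminology management.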

    CiNii Article

    CiNii Books

    researchmap

  • Construction of Verb-Adjectival Verb Dictionary toward Identifying Completion State of Verb's Activity

    ISHIHARA Yasuhiro, TAKEUCHI Koichi

    Technical report of IEICE. Thought and language   111 ( 227 )   73 - 78   2011.10

    Language:Japanese   Publisher:The Institute of Electronics, Information and Communication Engineers  

    In this paper we construct a verb-adjectival verb dictionary to deal with paraphrase relations between verbs and adjectival verbs. This is because we found that some adjectival verbs can express the completion state of some verbs; e.g., the adjectival verb 'healthy' can express the completion state of 'He is cured'. For this purpose, we elaborate a verb-adjectival verb dictionary in which we collect adjectival verbs that can express completion states of verbs. To check the quality of our dictionary, we build a paraphrase system that takes a verb phrase as input and outputs expressions using adjectival verbs. From this construction, we found several phrasal patterns for paraphrasing verb expressions. In preliminary paraphrase experiments, we evaluate the quality of our proposed dictionary and paraphrasing system.

    CiNii Article

    CiNii Books

    researchmap

  • Analyzing Deverbal Compounds Based on Verb Semantics and Noun Categories

    MORIYASU Yuuki, TAKEUCHI Koichi

    Technical report of IEICE. Thought and language   111 ( 227 )   51 - 56   2011.10

    Language:Japanese   Publisher:The Institute of Electronics, Information and Communication Engineers  

    This paper presents a Japanese deverbal compound analyzer which disambiguates senses of deverbal nouns and identifies their arguments based on the Thesaurus of Predicate Argument Structure. In Japanese there appear a lot of complex compound nouns such as 'koukuuki-tsuiraku-jiko-boushi-sisutemu' (airplane-crash-incident-prevention-system), i.e., a system for the prevention of airplane crashes. The proposed system identifies deverbal nouns such as 'prevention' and 'crash', and identifies their arguments 'incident' and 'airplane'. To identify senses of deverbal nouns, we utilize example sentences in the Japanese Verb Thesaurus as well as a manually constructed noun category. The preliminary evaluation of deverbal-sense disambiguation shows that the proposed noun category is effective.

    CiNii Article

    CiNii Books

    researchmap

  • Japanese Idiom Identification System Based on Variation Patterns Focused on Exhaustive Detection

    MORIYA Masato, TAKEUCHI Koichi

    Technical report of IEICE. Thought and language   111 ( 227 )   45 - 50   2011.10

    Language:Japanese   Publisher:The Institute of Electronics, Information and Communication Engineers  

    Identifying idioms in sentences is essential to structure text, especially to identify verb semantics. This task is, however, difficult because an idiom has not only different semantics depending on the relation between a verb and its modifier nouns, but also different paraphrases. To solve this, we have constructed an idiom identification system using grammatical rules based on user-editable dictionaries. This system can detect idioms exhaustively, absorbing morphological variants such as replacement of particles or transformation of idiom construction caused by insertion of words. It can also cooperate with the Argument Structure Annotator to identify verb semantics, dealing with idiom paraphrases and adopting semantic concepts from the Thesaurus of Predicate Argument Structure. Furthermore, it allows users to easily register idiom rules in the user-editable dictionaries in XML without word segmentation or POS information. In this paper, we propose specifications of the user-editable dictionaries and an exhaustive detection method to absorb morphological variants. The experimental results of the idiom identification task reveal that the proposed method works well and is effective for semantic disambiguation of idioms.

    CiNii Article

    CiNii Books

    researchmap

  • Comparison between statistical-learning-based system and rule-based system on biomedical term extraction task

    MINAMOTO Shosaburo, TAKEUCHI Koichi

    Technical report of IEICE. Thought and language   111 ( 227 )   33 - 37   2011.10

    Language:Japanese   Publisher:The Institute of Electronics, Information and Communication Engineers  

    We compare term extraction systems that automatically extract technical terms from texts. The methods compared are a statistical-learning-based model and a rule-based model. Since text data annotated with terms on infection exists, we use this data as correct answer data for the comparison. For the statistical-learning-based model, we build a term extraction system by training a CRF on the correct answer data. For the rule-based model, we use an extraction system using SRL as a rule-based word extraction language. We conducted term extraction experiments and showed that it is preferable to perform term extraction with the statistical-learning-based model when there is a large amount of correct answer data, and with the rule-based model when texts depend on particular fields.

    CiNii Article

    CiNii Books

    researchmap

  • Construction of Japanese Semantic Role Label Identification System Based on Very Small Correct Examples

    TSUCHIYAMA Suguru, TAKEUCHI Koichi

    Technical report of IEICE. Thought and language   111 ( 227 )   69 - 72   2011.10

    Language:Japanese   Publisher:The Institute of Electronics, Information and Communication Engineers  

    In this paper, we propose a method to identify semantic role labels utilizing a few correctly label-annotated examples. The similarity between an input sentence and correctly labeled example sentences is evaluated by a manually defined similarity function. This is because a statistical learning model does not work well without many correct examples. We develop a Japanese semantic role label identification system based on the proposed method. Moreover, a deverbal compound analyzer and an idiom identification system can be added to this system. We also report a comparison between the proposed method and CRFs in the experiment.

    CiNii Article

    CiNii Books

    researchmap

  • Extraction of Verb Synonyms Using Graph-Based Clustering

    TAKEUCHI Koichi, TAKAHASHI Hideyuki, KOBAYASHI Daisuke

    IEICE technical report   110 ( 244 )   13 - 18   2010.10

    Language:Japanese   Publisher:The Institute of Electronics, Information and Communication Engineers  

    This manuscript describes evaluation results of the Kernel K-means clustering approach compared with a modified Aizawa co-clustering approach on the verb synonym extraction task. Kernel K-means is a state-of-the-art vector-based clustering method which can divide vector spaces with non-linear boundaries by incorporating the kernel method. Besides, the mathematical framework of Kernel K-means can cover Spectral Graph Clustering. In this manuscript, however, we show that Aizawa's co-clustering approach outperforms Kernel K-means on the verb synonym extraction task (bi-graph clustering) in Japanese. From these results we argue that the equivalence between graph and vector space in the Kernel K-means approach can be limited, and hence that Kernel K-means loses accuracy in our verb synonym extraction.

    CiNii Article

    CiNii Books

    researchmap

  • Construction of Annotated Corpus of Verb Meanings and Semantic Role Labels Based on Verb Thesaurus

    TAKEUCHI Koichi, MORIMOTO Maiko

    IEICE technical report   109 ( 234 )   13 - 18   2009.10

    Language:Japanese   Publisher:The Institute of Electronics, Information and Communication Engineers  

    Argument structure is widely recognized as an interface for mapping from the grammatical structure of a sentence to a shallow semantic structure. In English, several large-scale language resources such as FrameNet, PropBank, and Dorr's LCS have been proposed; each of them defines a kind of argument structure, and some of them provide annotated corpora. These annotated corpora are very useful for building a statistical annotation system for semantic role labels. In Japanese, EDR provides a large-scale annotated corpus of semantic role labels; however, the annotated sentences are not collected on the basis of verbs, so it is hard to use the annotated corpus as a training corpus for a statistical semantic role labeling system. Thus we propose another annotated corpus of argument structure on the basis of the Japanese Verb Thesaurus provided in previous work. Currently we have annotated 1483 sentences for 120 verbs. In this manuscript we examine the issues of argument structure annotation, the current annotation scheme, the development of a tool, and the quality of the annotated corpus.

    CiNii Article

    CiNii Books

    researchmap

  • Term Extraction based on the Forward and Backward Connectivities of Candidates

    KOYAMA TERUO, TAKEUCHI KOICHI

    193   M1 - M6   2009.9

  • Extraction of Verb Synonyms Using Co-clustering Approach with Active Extraction for Polysemous Verbs

    TAKAHASHI Hideyuki, TAKEUCHI Koichi

    IEICE technical report   108 ( 408 )   37 - 42   2009.1

    Language:Japanese   Publisher:The Institute of Electronics, Information and Communication Engineers  

    In previous work we showed that a co-clustering approach is suitable for verb synonym extraction because of the polysemy of both nouns and verbs. The proposed approach is promising; however, it has not succeeded in initial graphs of verb-noun pairs. Thus in this paper we present a modified co-clustering approach that recursively extracts other possible verb groups from the same verb-noun pairs; this enables us to actively obtain other meanings of each verb. The experimental results of verb synonym extraction from a Japanese newspaper corpus and a balanced corpus show that the modified approach outperformed the previous one in precision and recall rates.

    CiNii Article

    CiNii Books

    researchmap

  • Extraction of Noun Synonyms and Other Related Words Using Dense-Subclusters

    KANEMOTO Masaya, TAKEUCHI Koichi

    IEICE technical report   108 ( 408 )   31 - 35   2009.1

    Language:Japanese   Publisher:The Institute of Electronics, Information and Communication Engineers  

    In this paper we propose a noun clustering approach on the basis of CBC proposed by Pantel. CBC is a clustering approach that carefully extracts clusters by finding sub-clusters, regarded as committees with the same meanings, and tries to extract unknown clusters from the remaining elements. In preliminary experiments on Japanese noun clustering, however, we found that CBC does not work well with respect to the measurement of basic similarity between words with context vectors and the scoring method that decides whether to merge sub-clusters. To address these problems, in this paper we propose applying the Jensen-Shannon formula as the measurement, together with a new scoring method. In experimental results on constructing sub-clusters of Japanese nouns from newspaper articles, we show that our proposed approaches outperform the approaches in CBC in clustering accuracy.

    CiNii Article

    CiNii Books

    researchmap

  • An evaluation of document set similarity based on morpheme occurrence patterns

    KOYAMA Teruo, TAKEUCHI Koichi

    IPSJ SIG Notes   188   51 - 56   2008.11

    Language:Japanese   Publisher:Information Processing Society of Japan (IPSJ)  

    We can assume that two different document sets may show similar morpheme occurrence patterns if both sets discuss similar topics in similar manners. In this paper, the authors show that the occurrence patterns of morphemes indeed indicate the similarity of the sets. The authors also show that differences in the patterns between the sets indicate differences in topic or manner of discussion between the sets. The authors also show how to find key morphemes that indicate the similarity or difference of the sets.

    CiNii Article

    CiNii Books

    researchmap

  • Annotating Semantic Role Labels and Verb Categories to Japanese News Corpus

    TAKEUCHI Koichi, KOYAMA Teruo

    IEICE technical report   108 ( 283 )   19 - 22   2008.11

    Language:Japanese   Publishing type:Article, review, commentary, editorial, etc. (scientific journal)   Publisher:The Institute of Electronics, Information and Communication Engineers  

    In this manuscript we discuss how we set the design policy for semantic role labels and predicate argument structure. In previous work on constructing language resources such as FrameNet and VerbNet, semantic role labels have been proposed in practice; however, there is still room for discussion about the definition, functionality and design policy of SRLs from the viewpoint of their use in natural language processing. In this research we are annotating SRLs for a Japanese news corpus on the basis of a free verb thesaurus of argument structure. In this manuscript, from the viewpoint of event description in natural language, we establish 1) the position of predicate argument structure description, 2) that semantic role labels contain three functions, and 3) that SRLs should be annotated on the basis of not what happened but the speaker's expression.

    CiNii Article

    CiNii Books

    researchmap

  • Identification of Research Sub-Domain and Term Classification Based on Term Clustering

    KOYAMA Teruo, TAKEUCHI Koichi

    IPSJ SIG Notes   89   87 - 92   2008.1

    Language:Japanese   Publisher:Information Processing Society of Japan (IPSJ)  

    Term classification associated with research sub-domains is an important approach for the systematic classification of term candidates extracted from text corpora. The authors have developed a method which identifies some of the important research sub-domains in research abstract corpora. The authors also proved that relatively frequent term candidates extracted from the corpus can be related to the identified sub-domains.

    CiNii Article

    CiNii Books

    researchmap

  • 言語処理を指向した動詞項構造シソーラス

    竹内孔一

    月刊言語   58 - 64   2008

  • Toward Construction of Verb Thesaurus for Paraphrasing Metaphor Expressions on the Basis of Metaphor Analysis

    ICHINOSE Mitsuru, TAKEUCHI Koichi

    IPSJ SIG Notes   182   31 - 38   2007.11

    Language:Japanese   Publisher:Information Processing Society of Japan (IPSJ)  

    Metaphor expressions, especially those consisting of a noun and a verb, are irregular and thus not easy to deal with in natural language applications such as machine translation, summarization, and document understanding. We believe it is important that metaphor expressions be paraphrased into normalized basic expressions in order to extract the fact embedded in the expression: for example, the metaphor expression "the wind plays" can be paraphrased into the basic expression "the wind blows". To realize this task, in this paper we propose a verb thesaurus organized on the basis of semantic coherency for verbs. From the manual construction of a thesaurus for 50 verbs, we discuss the feasibility of our approach.

    CiNii Article

    CiNii Books

    researchmap

  • Construction of Semantic Verb Class Using Graph-Based Co-clustering Approach

    TAKEUCHI Koichi

    IPSJ SIG Notes   182   39 - 44   2007.11

    Language:Japanese   Publisher:Information Processing Society of Japan (IPSJ)  

    This paper presents our ongoing research on clustering Japanese verbs to construct a Japanese verb lexicon founded on the theory of lexical conceptual structure (LCS). The key issue of this research is how to extract a core cluster of Japanese verbs with a highly related cluster of nouns, because not only verbs but also nouns are polysemous words. In this paper we apply a co-clustering approach based on graph structure to the task of clustering verbs and nouns, and present experimental results on Japanese Verb-Case-Noun data from both a large Web corpus and the Mainichi newspaper corpus from 1991 to 1998.

    CiNii Article

    CiNii Books

    researchmap

  • Hierarchical Structurization of Japanese Composite Terms based on Nesting Relations

    KOYAMA Teruo, TAKEUCHI Koichi

    IPSJ SIG Notes   180   49 - 54   2007.7

     More details

    Language:Japanese   Publisher:Information Processing Society of Japan (IPSJ)  

    We introduce a method for structurizing term candidates extracted from a Japanese domain corpus, based on nesting relations between the candidates. From the nesting relations, we can infer hypernym-hyponym relations and related-term relations. Arranging the two kinds of relations separately yields clearer hierarchical relations.

    CiNii Article

    CiNii Books

    researchmap

  • Hierarchical Structurization of Japanese Composite Terms based on Nesting Relations

    KOYAMA Teruo, TAKEUCHI Koichi

    IEICE technical report   107 ( 158 )   49 - 54   2007.7

     More details

    Language:Japanese   Publisher:The Institute of Electronics, Information and Communication Engineers  

    We introduce a method for structurizing term candidates extracted from a Japanese domain corpus, based on nesting relations between the candidates. From the nesting relations, we can infer hypernym-hyponym relations and related-term relations. Arranging the two kinds of relations separately yields clearer hierarchical relations.

    CiNii Article

    CiNii Books

    researchmap

  • Building an Event Ontology for Textual Entailment Computation

    INUI Kentaro, TAKEUCHI Koichi, FUJITA Atsushi

    IEICE technical report   106 ( 518 )   13 - 18   2007.1

     More details

    Language:Japanese   Publisher:The Institute of Electronics, Information and Communication Engineers  

    CiNii Article

    CiNii Books

    researchmap

  • A Method for Extracting Composite Terms from Japanese Domain Corpora

    KOYAMA Teruo, KAGEURA Kyo, TAKEUCHI Koichi

    IPSJ SIG Notes   176   55 - 60   2006.11

     More details

    Language:Japanese   Publisher:Information Processing Society of Japan (IPSJ)  

    Term extraction is one of the most important applications of natural language processing technologies. Statistical criteria are widely adopted to evaluate the termhood of extracted candidates. However, it is difficult to evaluate the termhood of less frequent candidates. In this study we propose a method for Japanese composite term extraction in which improper morpheme patterns are eliminated. Using the new method, high precision can be attained for Japanese composite term extraction.

    CiNii Article

    CiNii Books

    researchmap

  • Web Search Result Clustering Based on Structure of Compound Nouns

    HIRAO Kazuki, TAKEUCHI Koichi

    IPSJ SIG Notes   84   35 - 42   2006.9

     More details

    Language:Japanese   Publisher:Information Processing Society of Japan (IPSJ)  

    In this paper, we propose a clustering method based on the structure of compound nouns. Japanese compound nouns usually denote concrete concepts, so we assume that they are a good clue for indexing documents in Web document clustering. Another benefit of compound nouns is their compositional structure, which can be divided into sub-concepts. This suggests that hierarchical Web document clusters can be constructed from that structure. In experiments constructing hierarchical clusters from the results of a Web search engine, we obtained clearly clustered Web documents with understandable hierarchical indexes of compound nouns.

    CiNii Article

    CiNii Books

    researchmap

  • Construction of rule-based model for semantic role labeling with the EDR thesaurus and an LCS dictionary

    Shimomura Takuya, Takeuchi Koichi

    IPSJ SIG Notes   2006 ( 94 )   13 - 20   2006.9

     More details

    Language:Japanese   Publisher:Information Processing Society of Japan (IPSJ)  

    We present a rule-based model for semantic role labeling, because semantic role labels should be assigned by means of semantic relationships between nouns and verbs. The purpose of this research is to reveal the restriction and preference rules between nouns and verbs. Our model uses the EDR thesaurus for noun semantics and a lexical conceptual structure dictionary for verb semantics. Experimental results show that systematic nominal categorization is the key to improving the accuracy of semantic role labeling.

    CiNii Article

    CiNii Books

    researchmap

  • Construction of Compositional Lexical Database Based on Lexical Conceptual Structure

    Takeuchi Koichi, Inui Kentaro, Fujita Atsushi, Takeuchi Nao, Abe Shuya

    IPSJ SIG Notes   2005 ( 94 )   123 - 130   2005.9

     More details

    Language:Japanese   Publisher:Information Processing Society of Japan (IPSJ)  

    This paper presents our ongoing research for constructing a Japanese verb lexicon which is founded on the theory of lexical conceptual structure (LCS). LCS provides a framework for capturing the relation between syntactic behaviors of lexical items and their semantic properties, which is useful for a range of NLP tasks including translation, paraphrasing and summarization. We discuss design issues involved in LCS dictionary development, and present an overview of the current specification of the lexicon, which is designed to allow successive future refinements.

    CiNii Article

    CiNii Books

    researchmap

  • Pattern Based Term Extraction Using ACABIT System

    TAKEUCHI Koichi, KAGEURA Kyo, KOYAMA Teruo, DAILLE Beatrice, ROMARY Laurent

    IEICE technical report. Natural language understanding and models of communication   103 ( 280 )   31 - 36   2003.8

     More details

    Language:English   Publisher:The Institute of Electronics, Information and Communication Engineers  

    In this paper, we propose a pattern-based term extraction approach for Japanese, applying the ACABIT system originally developed for French. The proposed approach evaluates termhood using morphological patterns of basic terms and term variants. After extracting term candidates, the ACABIT system filters out non-terms from the candidates based on log-likelihood. This approach is suitable for Japanese term extraction because most Japanese terms are compound nouns or simple phrasal patterns.

    CiNii Article

    CiNii Books

    researchmap

  • Analysis of Dative Case Relations between Constituents of Japanese Deverbal Nouns Using Lexical Conceptual Structure

    TAKEUCHI Koichi, KAGEURA Kyo, KOYAMA Teruo

    IPSJ SIG Notes   150   133 - 140   2002.7

     More details

    Language:Japanese   Publisher:Information Processing Society of Japan (IPSJ)  

    In this paper we propose a method to analyze relationships between constituents of Japanese compound nouns, especially the dative case relationship in deverbal noun compounds. The relations in noun-deverbal noun compounds can be roughly classified into two kinds: in one the noun is an argument of the head verb, and in the other the noun is an adjunct of the head verb. The dative case relation we take up is a kind of argument relation, but it has not been investigated carefully. We analyze these relationships and examine how these linguistic phenomena can be explained on the basis of lexical semantics theory. We introduce lexical conceptual structure (LCS) as a lexical semantic representation and show the possibility of building a compound noun analyzer based on LCS to deal with dative case relations.

    CiNii Article

    CiNii Books

    researchmap

  • Analysis of Dative Case Relations between Constituents of Japanese Deverbal Nouns Using Lexical Conceptual Structure

    TAKEUCHI Koichi, KAGEURA Kyo, KOYAMA Teruo

    IEICE technical report. Natural language understanding and models of communication   102 ( 200 )   35 - 42   2002.7

     More details

    Language:Japanese   Publisher:The Institute of Electronics, Information and Communication Engineers  

    In this paper we propose a method to analyze relationships between constituents of Japanese compound nouns, especially the dative case relationship in deverbal noun compounds. The relations in noun-deverbal noun compounds can be roughly classified into two kinds: in one the noun is an argument of the head verb, and in the other the noun is an adjunct of the head verb. The dative case relation we take up is a kind of argument relation, but it has not been investigated carefully. We analyze these relationships and examine how these linguistic phenomena can be explained on the basis of lexical semantics theory. We introduce lexical conceptual structure (LCS) as a lexical semantic representation and show the possibility of building a compound noun analyzer based on LCS to deal with dative case relations.

    CiNii Article

    CiNii Books

    researchmap

  • 言語学を利用した複合名詞解析モデルの構築

    竹内孔一

    月刊日本語学12月号   pp. 28-35   2001

     More details

  • Building Japanese Compound Analyzer Using Lexical Restriction

    Takeuchi Koichi, Uchiyama Kiyoko, Yoshioka Masaharu, Kageura Kyo, Koyama Teruo

    IPSJ SIG Notes   2000 ( 29 )   71 - 78   2000.3

     More details

    Language:Japanese   Publisher:Information Processing Society of Japan (IPSJ)  

    The aim of our research is to build a Japanese compound analyzer using grammatical constraints. Analyzing compounds headed by verbal nouns means revealing the relationship between the incorporated nouns and the head verbal nouns. The relationship is either adjunct or internal argument of the head verb. In our previous research we categorized verbs based on their argument structure and lexical conceptual structure (LCS), but nouns must also be categorized to reveal the relationship between a noun and a head verb in compounds. In this paper we therefore propose a method to categorize nouns based on the argument structure and LCS of head verbs. For the categorization we also use the qualia structure of the generative lexicon. After constructing restriction hypotheses over the categorized nouns, we conducted an experiment analyzing technical terms (103 words) in the domain of information processing. The results show that 92 words (89%) can be analyzed using our restrictions, and we discuss the limitations of our model.

    CiNii Article

    CiNii Books

    researchmap

  • 文法的制約を用いた複合語解析モデルの作成

    竹内孔一, 内山清子, 吉岡真治, 影浦峡, 小山照夫

    学術情報センター紀要   第12号 pp. 7-15   2000

     More details

  • Analysis of Japanese Compound Words Based on Rich Lexical Knowledge

    TAKEUCHI Koichi, UCHIYAMA Kiyoko, YOSHIOKA Masaharu, KAGEURA Kyo, KOYAMA Teruo

    IEICE technical report. Natural language understanding and models of communication   99 ( 387 )   7 - 14   1999.10

     More details

    Language:Japanese   Publisher:The Institute of Electronics, Information and Communication Engineers  

    It is a difficult problem to determine how to prepare lexical knowledge for analyzing Japanese compound words. In this paper, we analyze technical terms that consist of two verbal nouns by means of rich lexical knowledge: a level ordering assumption, argument structure, and lexical conceptual structure as used in morphology and syntax. Our analysis of compound words aims to reveal the relations between the constituent words of compound words. The results show that our method can explain some of the phenomena concerning impossible compound words and the conversion of verbal nouns into nouns in compound words. The lexical knowledge we assume is aimed at describing the details of verbal nouns. While analyzing verbal nouns, we also clarify how to describe the lexical knowledge of nouns.

    CiNii Article

    CiNii Books

    researchmap

  • Grammatical Categories for Constituent Elements of Japanese Nominal Compounds with Special Reference to Technical Terms

    Uchiyama Kiyoko, Takeuchi Koichi, Yoshioka Masaharu, Kageura Kyo, Koyama Teruo

    Research bulletin of the National Center for Science Information Systems   11   49 - 57   1999.3

     More details

    Language:Japanese   Publisher:National Institute of Informatics  

    Grammatical information on the constituent elements of nominal compounds is needed for analyzing nominal compounds, especially those in technical fields. In this research we analyzed the grammatical categories suitable for nominal compound analysis, as it is difficult to apply conventional grammatical categories defined with respect to words in sentences to the analysis of nominal compounds.

    CiNii Article

    CiNii Books

    researchmap

  • Japanese OCR Error Correction Using Stochastic Language Models Trained on OCR Outputs

    K. Takeuchi, Y. Matsumoto

    Proceedings of the 5th Natural Language Processing Pacific Rim Symposium 1999(NLPRS'99)   1999

     More details


Presentations

  • Building a Japanese Semantic Role Labeling System for Predicate-Argument Extraction Invited

    Koichi Takeuchi

    2023.12.16 

     More details

    Event date: 2023.12.15 - 2023.12.17

    Language:English   Presentation type:Oral presentation (invited, special)  

    researchmap

  • 小論文自動採点データ構築と理解力および妥当性評価手法の構築

    言語処理学会第24回年次大会  2018 

     More details

  • 統計的学習モデルを利用した日本語慣用句の意味的曖昧性解消

    情報処理学会第79回全国大会  2017 

     More details

  • 専門用語辞書拡張システムの構築

    電子情報通信学会言語理解とコミュニケーション研究会  2017 

     More details

  • 異なる言語間における専門語彙の体系性の対応の分析

    言語処理学会第23回年次大会  2017 

     More details

  • HTML内の並列構造を利用したWebページ上のイベント情報抽出

    情報処理学会第79回全国大会  2017 

     More details

  • 述語項構造による言語処理

    意味と理解の研究会  2017 

     More details

  • 名詞項構造付与データの構築

    言語資源活用ワークショップ  2017 

     More details

  • 小論文採点支援のための関連文書取得法の考察

    電子情報通信学会言語理解とコミュニケーション研究会  2017 

     More details

  • 論述採点支援システム構築のための模擬試験データの構築

    多様なデータに対する情報縮約・クラスタリングと情報表現に関する研究  2017 

     More details

  • コピュラ文を考慮した述語項構造解析器による含意認識

    電子情報通信学会言語理解とコミュニケーション研究会  2017 

     More details

  • 小論文の自動採点に向けたオープンな基本データの構築および現段階での自動採点手法の評価

    言語処理学会第23回年次大会  2017 

     More details

  • 医療分野における形態素解析のための基本単語付与システムの構築

    第36回中国四国医療情報学研究会  2016 

     More details

  • 医療分野における形態素解析システム構築に向けて

    第37回中国四国医療情報学研究会  2016 

     More details

  • テキストマイニング・シンポジウムでの発表内容と言語処理技術

    言語処理学会第22回年次大会ワークショップ言語処理の応用  2016 

     More details

  • デジタル 4R 訓練システム構築のための成否判定システムの最適化手法の研究

    第15回, 情報科学技術フォーラム(FIT2016)  2016 

     More details

  • BACTを利用した日本語慣用句意味曖昧性解消

    言語処理学会第22回年次大会  2016 

     More details

  • ノンパラメトリックベイズによる意味役割推定

    言語処理学会第21回年次大会  2015 

     More details

  • 日本語イディオム異形規則の構築

    言語処理学会第21回年次大会  2015 

     More details

  • 外来語の扱いを考慮した日本語専門文書からの用語抽出

    言語処理学会第21回年次大会  2015 

     More details

  • 名詞の項構造データの構築

    第8回コーパスワークショップ  2015 

     More details

  • 述語項構造による意味記述の可能性

    意味と理解の研究会  2015 

     More details

  • 述語項構造を意識した名詞データの構築

    第7回コーパスワークショップ  2015 

     More details

  • デジタル危険予知訓練システム開発のための回答文の正否判定システムの開発

    第14回, 情報科学技術フォーラム(FIT2015)  2015 

     More details

  • オブジェクト指向に基づく言語インターフェース

    ことば工学研究会  2015 

     More details

  • BCCWJへの述語項構造シソーラスの付与による意味役割の検討

    第2回自然言語処理シンポジウム  2015 

     More details

  • 質問応答における言語的な知識と一般的な知識の飛躍

    人工知能学会全国大会  2015 

     More details

  • WebページのHTML構文構造を考慮した地域イベント情報の抽出

    言語理解とコミュニケーション研究会  2014 

     More details

  • 述語項構造シソーラスを意識した名詞の意味構造アノテーションのための名詞意味構造の検討

    第6回コーパスワークショップ  2014 

     More details

  • 述語項構造シソーラスによる述語と名詞の構造化

    人工知能学会全国大会  2014 

     More details

  • 含意認識における主題に着目した文の比較手法の検討

    言語処理学会第20回年次大会  2014 

     More details

  • 言語学の知見に基づく関数オブジェクトを利用した言語理解システムの構成

    言語処理学会第20回年次大会  2014 

     More details

  • Construction of Predicate Argument Structure Annotator Based on Event Type Thesaurus

    Annual Meeting of Japanese Society of Artificial Intelligence 2012  2012 

     More details

  • 述語の分析に基づく文書解析の考察

    自然言語処理研究会  2012 

     More details

  • 述語概念をベースとした抽象名詞を含む文の意味構造アノテーション

    テキストアノテーションワークショップ・コンテスト  2012 

     More details

  • 動詞語義及び意味役割付与作業システムの構築

    第2回コーパス日本語学ワークショップ  2012 

     More details

  • 統計的学習モデルとルールベースモデルに基づく用語抽出システムの比較

    言語理解とコミュニケーション研究会  2011 

     More details

  • 動詞項構造シソーラスの構築

    2011年度人工知能学会全国大会  2011 

     More details

  • 網羅的な検出を重視した異形パターンに基づく日本語慣用句同定システム

    言語理解とコミュニケーション研究会  2011 

     More details

  • サ変名詞を含む複合名詞の語義解析システム及び名詞辞書の構築

    言語理解とコミュニケーション研究会  2011 

     More details

  • 動詞とその結果状態を関係付ける結果状態辞書の構築

    言語理解とコミュニケーション研究会  2011 

     More details

  • 少数正解事例に基づく動詞語義及び名詞意味役割付与システム

    言語理解とコミュニケーション研究会  2011 

     More details

  • グラフに基づくクラスタリングによる動詞類義語の獲得

    言語理解とコミュニケーション研究会  2010 

     More details

  • 類似した動作や状況を検索するための意味役割及び動詞語義付与システムの構築

    言語理解とコミュニケーション研究会  2010 

     More details

  • Web上の兄弟ページを利用した対訳文書からの段落アラインメント

    言語処理学会第15回年次大会  2009 

     More details

  • 小規模な用語リストを利用した画像読影レポートからの用語抽出

    言語処理学会第15回年次大会  2009 

     More details

  • 動詞項構造シソーラスに基づく動詞語義ならびに意味役割付与データの構築

    言語理解とコミュニケーション研究会  2009 

     More details

  • 類似度の高いサブクラスタに基づく名詞クラスタリング

    言語理解とコミュニケーション研究会  2009 

     More details

  • 候補の接続関係を考慮した複合語用語抽出

    自然言語処理研究会 NL-193  2009 

     More details

  • SRLを利用した規則ベースの感染症用語抽出

    言語理解とコミュニケーション研究会  2009 

     More details

  • 多義性を考慮した同時共起クラスタリングによる動詞の類語抽出

    言語理解とコミュニケーション研究会  2009 

     More details

  • 動詞語義を推定するための語義付与コーパスの作成

    言語処理学会第14回年次大会  2008 

     More details

  • 用語クラスタリングに基づく部分研究領域推定と用語分類

    情報処理学会、自然言語処理研究会  2008 

     More details

  • 項関係にある名詞との共起を考慮した動詞のクラスタリング

    言語処理学会第14回年次大会  2008 

     More details

  • WordNetと同音異義語を利用した異形イディオム検索

    言語処理学会第14回年次大会  2008 

     More details

  • 語彙意味論に基づく言語資源の構築

    言語処理学会第14回年次大会  2008 

     More details

  • 意味の包含関係に基づく動詞項構造の細分類

    言語処理学会第14回年次大会  2008 

     More details

  • 医学用語辞書で学習した分類器による放射線読影レポート用語の分類

    言語処理学会第14回年次大会  2008 

     More details

  • 語彙概念構造に基づく事態上位オントロジーの構築

    言語処理学会第13回年次大会  2007 

     More details

  • 含意関係計算のための事態オントロジーの開発に向けて

    電子情報通信学会 言語理解とコミュニケーション研究会  2007 

     More details

  • 言語処理を指向した語彙概念構造辞書の構築

    大阪外国語大学多言語自然研究会  2007 

     More details

  • 置換・挿入を考慮した異形イディオム検索システムの構築

    言語処理学会第13回年次大会  2007 

     More details

  • 統計的手法を利用した伝染病検索システムの構築に向けて

    言語処理学会第13回年次大会  2007 

     More details

  • 日本語複合語用語の入れ子関係に基づく階層的体系化

    情報処理学会、自然言語処理研究会  2007 

     More details

  • メタファ分析に基づく置換可能な動詞カテゴリの作成

    情報処理学会、自然言語処理研究会  2007 

     More details

  • グラフ構造に基づく同時クラスタリングを利用した動詞の属性クラスの抽出

    情報処理学会、自然言語処理研究会  2007 

     More details

  • 英語イディオムの異形を整理する

    言語処理学会第12回年次大会  2006 

     More details

  • 名詞の概念体系を利用した規則に基づく意味役割付与システムの構築

    情報処理学会,自然言語処理研究会  2006 

     More details

  • 語彙概念構造に基づく動詞意味辞書の設計

    語彙資源の深化とNLP新時代  2006 

     More details

  • 翻訳者支援のための言語レファレンス・ツール高度化方針

    言語処理学会第12回年次大会  2006 

     More details

  • イディオムの異形規則を利用したイディオム検索システムの構築

    言語処理学会第12回年次大会  2006 

     More details

  • 複合名詞に着目したWeb検索結果のクラスタリング

    情報処理学会,自然言語処理研究会  2006 

     More details

  • 日本語専門分野テキストコーパスからの複合語用語抽出

    情報処理学会,自然言語処理研究会  2006 

     More details

  • 語彙意味論に基づく動詞語彙概念構造辞書の構築

    名古屋大学 COE社会情報基盤のための音声映像の知的統合 招待講演  2006 

     More details

  • 分類の根拠を明示した動詞語彙概念構造の構築

    自然言語処理研究会  2005 

     More details

  • 多言語専門用語抽出モデルの構築

    自然言語処理学会年次大会  2005 

     More details

  • Web上のQAデータの構造の抽出と利用

    第11回言語処理学会年次大会  2005 

     More details

  • 言語処理を意識した語彙概念構造の構築

    東京大学21世紀 COE「心とことば」シンポジウム「語彙概念構造辞書の構築と応用」  2005 

     More details

  • 語彙概念構造を用いた機能動詞結合の言い換え

    第10回言語処理学会年次大会  2004 

     More details

  • 語彙意味論とコンピュータへの応用

    日本英語学会ワークショップ  2002 

     More details

  • 生物学文献からの専門用語抽出における機械学習モデルの検討

    情報処理学会自然言語処理研究会  2002 

     More details

  • 語彙概念構造を利用した「に」に関する複合名詞の分析

    情報処理学会自然言語処理研究会  2002 

     More details

  • 語彙の研究を考慮した専門用語コーパスの作成

    言語処理学会第7回年次大会発表論文集  2001 

     More details

  • 複合名詞解析モデルにおける動詞に対する語彙概念構造の付与法

    言語処理学会第7回年次大会発表論文集  2001 

     More details

  • Building Japanese Compound Words Analyzer Based on Grammatical Constraint

    International Symposium on Advanced Informatics  2000 

     More details

  • 語彙の制約を考慮した複合語解析モデルの構築

    情報処理学会自然言語処理研究会  2000 

     More details

  • Evaluation of the Keyword Extraction Task

    NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition  1999 

     More details

  • Evaluation of the Term Recognition Task

    NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition  1999 

     More details

  • 統計的形態素解析と文字 N-gram を利用したOCR誤り訂正

    情報処理学会 自然言語処理研究会  1999 

     More details

  • 専門分野における複合名詞の語構成要素の品詞相当カテゴリーに関する一考察

    言語処理学会第5回年次大会  1999 

     More details

  • 語基の詳細な特徴を考慮した複合語解析モデル

    電子情報通信学会 言語理解とコミュニケーション研究会  1999 

     More details

  • Evaluation of the Role Analysis Task

    NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition  1999 

     More details

  • Overview of the TMREC Tasks

    NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition  1999 

     More details


Works

  • GSK2021-B 日本語小論文データ

    阿保達彦, 飯塚誠也, 稲田佳彦, 上田均, 田口雅弘, 竹内孔一

    2021.8

     More details

    Work type:Database science  

    researchmap

Awards

  • テレコム学際研究賞

    2023.3   電気通信普及財団   「研究利用可能な小論文データに基づく参照文書を利用した小論文採点手法の開発」

    竹内 孔一, 大野 雅幸, 泉仁 宏太, 田口 雅弘, 稲田 佳彦, 飯塚 誠也, 阿保 達彦, 上田 均

     More details

  • Best Teacher Award

    2023.3   Faculty of Engineering, Okayama University  

    Koichi Takeuchi

     More details

  • 教育貢献賞

    2021.3   岡山大学工学部  

     More details

  • 社会貢献賞

    2020.3   岡山大学工学部  

     More details

  • 教育貢献賞

    2017.3   岡山大学工学部  

     More details

  • ベストティーチャー賞

    2010.3   岡山大学工学部  

     More details

  • ベストティーチャー賞

    2008.3   岡山大学工学部  

     More details

  • 言語処理学会第10回年次大会優秀発表賞

    2004  

     More details

    Country:Japan

    researchmap


Research Projects

  • Development of a method for synonymous expressions based on annotated predicate-argument graph data and its application to automatic essay scoring

    Grant number:22K00530  2022.04 - 2025.03

    Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research Grant-in-Aid for Scientific Research (C)  Grant-in-Aid for Scientific Research (C)

    竹内 孔一

      More details

    Grant amount:¥3510000 ( Direct expense: ¥2700000, Indirect expense: ¥810000 )

    researchmap

  • AIを用いた2D超音波プローブのフリーハンド3D化システムの開発

    Grant number:19K09624  2019.04 - 2023.03

    日本学術振興会  科学研究費助成事業  基盤研究(C)

    中原 龍一, 那須 義久, 竹内 孔一

      More details

    Grant amount:¥4420000 ( Direct expense: ¥3400000, Indirect expense: ¥1020000 )

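    (1) Navigation: A navigation system was attached to the 2D probe of an ultrasound device to acquire global three-dimensional coordinates, and 3D images were reconstructed. This works when image quality is low, but with a high-quality 2D probe the image resolution exceeds the accuracy of the sensor, so it again became clear that additional correction is important. The problem might be solved by improving the hardware performance of the 3D sensor, but the more accurate the sensor, the more expensive it becomes, and AI-based correction is ultimately required in any case. We therefore decided to aim for 3D reconstruction using only software-based correction by AI, without hardware-based correction.
    (2) Ultrasound image analysis using AI: To perform AI-based 3D analysis, we built a system that processes images collected as digital data in real time. Because AI processing is the rate-limiting step of the real-time pipeline, we aimed to speed it up. We had been using CNN-type networks, but because training time tends to increase as models become more capable, we developed a method using UMAP. We aimed to analyze each image directly by treating the entire image as one huge number. We found that a pseudo-3D reconstruction is possible by applying corrections that preserve continuity in the space reduced in dimension by UMAP. Although the ultimate goal is 3D reconstruction, rapid detection of the same cross-section may have greater clinical value, so we are developing an application for same-cross-section detection.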

    researchmap

  • Construction of Japanese Predicate-Argument Structure Dictionary for Natural Language Processing and Linguistic Analysis with Concordancer

    Grant number:19K00552  2019.04 - 2022.03

    Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research Grant-in-Aid for Scientific Research (C)  Grant-in-Aid for Scientific Research (C)

    Takeuchi Koichi

      More details

    Grant amount:¥2600000 ( Direct expense: ¥2000000, Indirect expense: ¥600000 )

    First, we incorporated the semantic role system defined in PropBank into the Predicate-argument structure Thesaurus. This extension allows us to capture the semantic relations between predicates and their arguments even in different constructions, such as the causative, passive and adversative passive forms, by describing both numbered arguments and named arguments (e.g., Agent and Theme). The dictionary entries have also been maintained and published on the website. Second, to construct an automatic annotation system for predicate-argument structures, we applied various kinds of models, such as deep learning models and models based on Bayes' theorem, and identified approaches that improve performance. Finally, we developed a concordancer using an automatic annotation system for predicate-argument structures. In the concordancer, users can combine block-based patterns to extract matching texts.

    researchmap

  • Extending predicate-argument thesaurus with large-scale language resources for applying to natural language processing and linguistic analysis

    Grant number:26370485  2014.04 - 2017.03

    Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research Grant-in-Aid for Scientific Research (C)  Grant-in-Aid for Scientific Research (C)

    TAKEUCHI Koichi

      More details

    Grant amount:¥3120000 ( Direct expense: ¥2400000, Indirect expense: ¥720000 )

    We developed and published a new framework of nominal argument structure for Japanese. The key technique for describing arguments is number-based semantic roles, which enable us to identify each argument between paraphrased sentences in nominal predicates. We constructed 5,000 example sentences of nominal predicates; those of 13000 sentences have semantic role labels and links to the corresponding examples in the predicate thesaurus. We also constructed paraphrase data between words of different parts of speech and incorporated the paraphrase data into an argument structure analyzer. In addition, we improved the performance of the argument structure analyzer by applying a statistical learning method, and published the results in IPSJ Journal and the proceedings of the international conference PACLING 2015. We also showed that the argument structure analyzer with nominal predicates improves performance on the Japanese textual entailment task.

    researchmap

  • Research on a Term management Support System

    Grant number:24500303  2012.04 - 2015.03

    Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research Grant-in-Aid for Scientific Research (C)  Grant-in-Aid for Scientific Research (C)

    KOYAMA Teruo, TAKEUCHI Koichi

      More details

    Grant amount:¥3510000 ( Direct expense: ¥2700000, Indirect expense: ¥810000 )

    In research fields, the administration of terms is an essential task for the advancement of research. In this study we implemented a term management support system with a term recommendation utility on a term database management system. The system was evaluated as useful for term management tasks.
    We also modified our term extraction algorithm so that extraction performance is substantially improved.

    researchmap

  • Augmenting Terminologies through Proactive Extraction of Term Translation Pairs from the Web

    Grant number:24650122  2012.04 - 2015.03

    Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research Grant-in-Aid for Challenging Exploratory Research  Grant-in-Aid for Challenging Exploratory Research

    KAGEURA Kyo, TAKEUCHI Koichi

      More details

    Grant amount:¥3900000 ( Direct expense: ¥3000000, Indirect expense: ¥900000 )

    We examined how native and borrowed constituent elements contribute to the construction of technical terminology and how these elements are used as the terminology grows. By defining a terminological network (with terms as vertices and shared constituents as edges) and a constituent network (with constituent elements as vertices and co-occurrence in terms as edges), we defined indices to evaluate the consistency and coherency of terminology. Using these observations, we developed a method of producing bilingual new term pair candidates from existing terminologies and validating them through monolingual and comparable domain corpora obtained from the web. Experiments have shown that the performance of bilingual term crawling is at least comparable with existing corpus-based extraction methods, and complementary in the sense that the two extract different types of pairs, which are more relevant to existing terminologies. Theoretical implications of this work were clarified in terms of lexicographic issues.

    researchmap

  • Developing an integrated translation-aid site that provides comprehensive reference resources for translators

    Grant number:21240021  2009 - 2012

    Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research Grant-in-Aid for Scientific Research (A)  Grant-in-Aid for Scientific Research (A)

    KAGEURA Kyo, ABEKAWA Takeshi, UTIYAMA Masao, SATO Satoshi, UTSURO Takehito, TAKEUCHI Koichi, AIZAWA Akiko, TODA Shinichi

      More details

    Grant amount:¥44980000 ( Direct expense: ¥34600000, Indirect expense: ¥10380000 )

    This research (1) clarified the concept of “comprehensiveness” for reference tools and concrete factors that should be incorporated into reference tools; (2) developed a comprehensive bilingual terminology crawler and a parallel and comparable archive constructor and constructed reference resources for translators; and (3) developed and made publicly available the integrated translation-aid environment Minna no Hon’yaku (http://trans-aid.jp/).

    researchmap

  • Structurized Term Extraction from Academic Text Corpora

    Grant number:19500135  2007 - 2009

    Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research Grant-in-Aid for Scientific Research (C)  Grant-in-Aid for Scientific Research (C)

    KOYAMA Teruo, TAKEUCHI Koichi

      More details

    Grant amount:¥4160000 ( Direct expense: ¥3200000, Indirect expense: ¥960000 )

    In this study, we established a method for comprehensive term extraction from domain text corpora with high precision. The method is based on two new principles: one is the reconsideration and modification of Japanese morpheme classification, and the other is evaluating the certainty of composite borders. We also developed methods to structurize terms from two points of view, namely nesting relations between composites and the relationships of terms to various research subdomains.

    researchmap

  • Construction of verb sense disambiguation model on the basis of context and noun attribute toward information summarization

    Grant number:19500122  2007 - 2008

    Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research Grant-in-Aid for Scientific Research (C)  Grant-in-Aid for Scientific Research (C)

    TAKEUCHI Kouichi

      More details

    Grant amount:¥3770000 ( Direct expense: ¥2900000, Indirect expense: ¥870000 )

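    We constructed the examples needed to resolve verb sense ambiguity, together with a set of verb and noun definitions for describing sense ambiguity. We also built an automatic annotation system that determines verb senses on the basis of the examples. For the examples, we annotated about 1,500 sentences from newspaper articles, covering about 120 verbs, with verb senses, noun senses (Nihongo Goi-Taikei), and the semantic roles of nouns (to be released after cleanup). Furthermore, on the basis of the examples, we built a system that automatically assigns verb senses and noun semantic roles using a statistical learning model.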

    researchmap

  • Japanese semantic analysis using balanced corpus of contemporary Written Japanese

    Grant number:18061003  2006 - 2010

    Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research Grant-in-Aid for Scientific Research on Priority Areas  Grant-in-Aid for Scientific Research on Priority Areas

    OKUMURA Manabu, SHIRAI Kiyoaki, SHINNOU Hiroyuki, TAKAMURA Hiroya, TAKEUCHI Kouichi, SASAKI Minoru, NAKAMURA Makoto

      More details

    Grant amount:¥84700000 ( Direct expense: ¥84700000 )

    1) We constructed a corpus with word-sense annotation, based on the Balanced Corpus of Contemporary Written Japanese.
    2) We organized the SemEval-2 Japanese Word Sense Disambiguation (WSD) task using the corpus constructed in 1). Nine systems from four organizations participated in the task.
    3) We showed that, when domain adaptation is performed for WSD, the most effective domain adaptation method varies with the properties of the source data and the target data. We also presented a way to select the most effective domain adaptation method from these properties using decision tree learning. The average WSD accuracy improved significantly when the automatically selected method was applied to each case, compared with applying any one of the original methods uniformly to all cases.
    4) We proposed a supervised WSD system that uses features obtained from the results of clustering word instances. Our approach is novel in that we employ semi-supervised clustering that controls the fluctuation of the centroid of a cluster, select seed instances by considering the frequency distribution of word senses, and exclude outliers when introducing "must-link" constraints between seed instances. In addition, we improved supervised WSD accuracy by using features computed from the word instances in the clusters generated by the semi-supervised clustering.
    5) We proposed a method of detecting new word senses in a corpus. It consists of two procedures: (A) clusters of word instances are constructed so that instances of the same sense are merged, and (B) the similarity between each cluster and each sense in a dictionary is measured to determine the senses of the instances in the cluster.
    6) We proposed a method to detect peculiar examples of a target word in a corpus. The method combines the density-based Local Outlier Factor (LOF) and One-Class SVM, which are representative outlier detection methods in the data mining field. The combined method improved on the precision and recall of LOF and One-Class SVM used individually. We also showed, using the noun 'midori (green)', that the method can detect new meanings.
    7) We presented a co-clustering-based verb synonym extraction approach that increases the number of meanings of polysemous verbs extracted from a large text corpus. The approach extracts the different meanings of polysemous verbs by recursively eliminating the extracted clusters from the initial data set. Experimental results on verb synonym extraction show that the proposed approach increases the number of correct verb clusters by about 50%, with a 0.9% increase in precision and a 1.5% increase in recall over the previous approach.

    researchmap

  • Construction of online multilingual reference tools for aiding translators

    Grant number:17200018  2005 - 2008

    Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research Grant-in-Aid for Scientific Research (A)  Grant-in-Aid for Scientific Research (A)

    KAGEURA Kyo

    Grant amount: ¥49,140,000 (Direct expense: ¥37,800,000; Indirect expense: ¥11,340,000)

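    In recent years, there has been a rapid increase in the translation and introduction of online documents by volunteer translators, and in online multilingual information dissemination by NGOs, NPOs, and similar groups. To support the online translators engaged in these activities, this project (1) built a large-scale, high-quality, highly functional set of reference resources (dictionaries and encyclopedias) from existing high-quality dictionaries and the Web, and (2) developed an online translation-aid environment in which translators can work while consulting those resources in an integrated way. For the reference resources (QRlex), we used Sanseido's Grand Concise English-Japanese Dictionary as the general-vocabulary bilingual dictionary and developed advanced, flexible ways of using it, and we automatically built large-scale bilingual dictionaries of proper names and technical terms from online resources. We also developed a mechanism for collecting and consulting pairs of related documents that have already been translated. As the translation-aid environment, we built QRedit, a system that gives integrated access to these reference resources and seamlessly covers everything from dictionary lookup to searching online resources. At the close of the project, with the cooperation of Sanseido, the Language Translation Group of the National Institute of Information and Communications Technology and the principal investigator's laboratory jointly developed the "Minna no Hon'yaku" site, which incorporates QRedit and is publicly available to online translators.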

    researchmap

  • Building resources and a model for computing paraphrase based on lexical semantics

    Grant number:17300047  2005 - 2007

    Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research Grant-in-Aid for Scientific Research (B)  Grant-in-Aid for Scientific Research (B)

    INUI Kentaro, TAKEUCHI Koichi, FUJITA Atsushi, NAKATANI Kentaro

    Grant amount: ¥15,860,000 (Direct expense: ¥14,600,000; Indirect expense: ¥1,260,000)

    Aiming to build a computational model and computational resources for computing paraphrases at the level of predicate-argument structure, this research project obtained the following results:
    (i) For paraphrase knowledge, a large-scale hierarchical lexicon of predicate-argument structure was built. The lexicon organizes about 4,000 basic Japanese verbs (about 7,000 senses in total) with predicate-argument structure information in a fine-grained semantic hierarchy, so that lexical entries in the same semantic class can be regarded as near synonyms. To augment this knowledge base, additional knowledge about event relations was extracted from glosses in a Japanese dictionary intended for human use. Over 35,000 relations were extracted and classified into 8 relation types, all of which are considered useful for recognizing paraphrase or textual entailment.
    (ii) To scale up the basic paraphrase knowledge above, automatic acquisition of semantic relations between events from a large corpus was also explored. We proposed several extensions to a state-of-the-art method originally designed for entity relation extraction and reported the results of our experiments on a Japanese Web corpus. The results show that (a) there are indeed specific cooccurrence patterns useful for event relation acquisition, (b) the use of cooccurrence samples involving verbal nouns has a positive impact on both recall and precision, and (c) over five thousand relation instances are acquired from a 500M-sentence Web corpus with a precision of about 66% for action-effect relations.
    (iii) To build a computational model of paraphrase, we explored the regularity underlying these classes of paraphrases, focusing on the paraphrasing of Japanese light-verb constructions (LVCs). We proposed a paraphrasing model for LVCs that is based on transforming the Lexical Conceptual Structures (LCSs) of verbal elements. We also proposed a refinement of an existing LCS dictionary. Experimental results show that our LCS-based paraphrasing model characterizes some of the semantic features of the verbs that are required for generating paraphrases, such as the direction of an action and the relationship between arguments and surface cases.

    researchmap

  • Construction of lexical conceptual structures for Japanese compound word analysis

    Grant number:14780313  2002 - 2003

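    Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research Grant-in-Aid for Young Scientists (B)  Grant-in-Aid for Young Scientists (B)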

    TAKEUCHI Koichi

    Grant amount: ¥2,900,000 (Direct expense: ¥2,900,000)

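    The first year's results showed that lexical conceptual structures can be assigned to verbs according to a fixed set of criteria, and that these structures bear on the dependency relations between words inside compound nouns. This fiscal year, we first summarized the accuracy of a compound noun analysis model that uses the latest lexical conceptual structure dictionary created in the first year, and presented it at two international conferences. Next, while continuing the annotation work, we sorted out the open problems and reexamined the theoretical background for constructing lexical conceptual structures. As a result, we were able to make clear that a lexical conceptual structure carries three kinds of information: the argument structure of the verb, its lexical classification according to aspectual analysis, and a semantic schema based on cognitive semantics. In the second half of the fiscal year, based on this review, we reconsidered the verbs that previously could not be annotated well and summarized the results in the proceedings of the March meeting of the Association for Natural Language Processing. As a further major outcome, the natural language processing laboratory at Nara Institute of Science and Technology asked to use the lexical conceptual structure dictionary, still under construction, in other language processing work. The request came because the dictionary was valued for annotating argument structure that is directly tied to syntactic structure, together with semantic relation information. The laboratory uses the dictionary for paraphrasing expressions for translation; through our exchange of views, it became clear where the conceptual structures work well and where they do not, which gave us material for further study. This outcome will also be presented in the proceedings of the March meeting of the Association for Natural Language Processing, in a paper whose first author is Furihata of Nara Institute of Science and Technology. The dictionary covers about 1,300 Japanese sahen nouns (verbal nouns) and will be published on the Web from this March and updated thereafter. Although the number of entries is small, the joint research on processing models with Nara Institute of Science and Technology showed that frequent words and light verbs are important, so in the final stage of organizing the dictionary we concentrated on annotating light verbs.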

    researchmap

  • A study of knowledge lifecycle for communication support on the network

    Grant number:11480078  1999 - 2001

    Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research Grant-in-Aid for Scientific Research (B)  Grant-in-Aid for Scientific Research (B)

    TAKEDA Hideaki, TAKEUCHI Koichi, UENO Atsushi, NISHIDA Toyoaki

    Grant amount: ¥15,200,000 (Direct expense: ¥15,200,000)

    In this research, we proposed the concept of "knowledge as media" to realize a knowledge lifecycle for communication support on the network. In the traditional view of artificial intelligence, knowledge exists within agents such as humans and computer programs. In contrast, we regard knowledge as existing among agents, i.e., knowledge is a medium that enables communication among agents. Under this view, we investigated knowledge from its most abstract form to its most practical one. Ontology is the most abstract form of knowledge, knowledge for communities and knowledge for environments are intermediate, and embodied knowledge is the most practical. We obtained three major results. The first is a community-support system called TelMeA, in which interface agents acting as participants form a field of communication. We tested this system in a psychological experiment and showed that using interface agents in this way is effective for communication. The second is a system for capturing personal knowledge. The system, called MindHeap, gathers knowledge through WWW browsing. When a user picks out sentences on the WWW, the system memorizes the sentences and their contexts, and the sentences are connected to each other through their contexts. The user can automatically retrieve related memorized sentences and even summarize WWW pages based on the memorized information. The third is embodiment-based learning, in which embodied knowledge is captured through actions. The agent can learn about its own body by moving around and sensing the environment.

    researchmap

  • Study on International Sharing of Japanese Scholarly Information

    Grant number:10044018  1998 - 2000

    Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research Grant-in-Aid for Scientific Research (A)  Grant-in-Aid for Scientific Research (A)

    NAITO Eisuke, GOTODA Hironobu, KAGEURA Kyo, MIYAZAWA Akira, KIM Yong won, TAKEUCHI Koichi

    Grant amount: ¥25,700,000 (Direct expense: ¥25,700,000)

    The aim of the study is "to organize databases, either real or virtual, of Japanese scholarly information being collected, processed, stored and used abroad, for international sharing." In FY 2000, 13 experts in 3 groups were invited and 12 NII researchers were commissioned to conduct overseas surveys.
    Regarding Japanese collections acquired by scholarly libraries in Europe (especially Germany), the USA, China, and Korea, we gathered information on the technical and administrative aspects of using NII/NACSIS facilities to register their acquisitions in the Union Catalog Database. We also held discussions with existing participating libraries to promote more effective use.
    Based on these activities, fact-finding was carried out at the institutions visited on scholarly policy, scholarly information policy, and scholarly information use behavior. Experts from abroad were invited to share knowledge with the research team.
    A demonstration, suggested by the first- and second-year results, was held in Berlin. Technical and administrative coordination was carried out with US scholarly information utilities. An evaluation workshop on the use of and demand for Japanese scholarly information was organized with experts from Thailand.

    researchmap

 

Class subject in charge

  • Seminar in Pattern Information Processing (2023 academic year) Year-round  - Other

  • Seminar in Pattern Information Processing (2023 academic year) Other  - Other

  • Seminar in Pattern Information Processing (2023 academic year) Year-round  - Other

  • Pattern Recognition and Learning (2023 academic year) Third semester  - Wed 1-2

  • Pattern Recognition and Learning (2023 academic year) Third semester  - Wed 1-2

  • Media Information Processing (2023 academic year) Late  - Thu 1-2

  • Media Information Processing (2023 academic year) Late  - Thu 1-2

  • English Engineering (2023 academic year) 3rd and 4th semester  - Thu 7-8

  • Technical English (2023 academic year) 3rd and 4th semester  - Thu 7-8

  • Engineering English (2023 academic year) Late  - Other

  • Engineering English (2023 academic year) Late  - Other

  • Advanced Study (2023 academic year) Other  - Other

  • Knowledge Engineering (2023 academic year) 1st semester  - Mon 1-2, Thu 1-2

  • Knowledge Engineering (2023 academic year) 1st semester  - Mon 1-2, Thu 1-2

  • Natural Language Processing (2023 academic year) Late  - Other

  • Natural Language Processing (2023 academic year) Late  - Other

  • Technical Writing 1 (2023 academic year) Early  - Other

  • Technical Writing 2 (2023 academic year) Late  - Other

  • Technical Writing (2023 academic year) Early  - Other

  • Technical Presentation (2023 academic year) Late  - Other

  • Natural Language Processing (2023 academic year) Second semester  - Wed 1-2

  • Natural Language Processing (2023 academic year) Second semester  - Wed 1-2

  • Natural Language Processing (2023 academic year) Second semester  - Wed 1-2

  • Specific Research of Electronics and Information Systems Engineering (2023 academic year) Year-round  - Other

  • (L19) Media Information Processing (2023 academic year) Special  - Other

  • Undergraduate Research Experience 3 (2023 academic year) Special  - Other

  • Internship (2022 academic year) Summer concentration  - Other

  • Seminar in Pattern Information Processing (2022 academic year) Year-round  - Other

  • Knowledge Engineering (2022 academic year) 1st semester  - Wed 1-2, Fri 1-2

  • Knowledge Engineering (2022 academic year) 1st semester  - Wed 1-2, Fri 1-2

  • Natural Language Processing (2022 academic year) Second semester  - Wed 1-2

  • Topics in Electronic and Information Systems Engineering (2022 academic year) Early  - Fri 1, Fri 2

  • Internship (2021 academic year) Summer concentration  - Other

  • Seminar in Pattern Information Processing (2021 academic year) Year-round  - Other

  • Pattern Recognition and Learning (2021 academic year) Third semester  - Wed 1, Wed 2

  • Pattern Recognition and Learning (2021 academic year) Third semester  - Wed 1, Wed 2

  • Media Information Processing (2021 academic year) Late  - Thu 1, Thu 2

  • English Engineering (2021 academic year) 1st semester  - Tue 5, Tue 6, Thu 1, Thu 2

  • Engineering English (2021 academic year) Late  - Other

  • Knowledge Engineering (2021 academic year) 1st semester  - Wed 1, Wed 2, Fri 1, Fri 2

  • Knowledge Engineering (2021 academic year) 1st semester  - Wed 1, Wed 2, Fri 1, Fri 2

  • Knowledge Engineering (2021 academic year) 1st semester  - Wed 1, Wed 2, Fri 1, Fri 2

  • Knowledge Engineering (2021 academic year) 1st semester  - Wed 1, Wed 2, Fri 1, Fri 2

  • Technical Writing (2021 academic year) Early  - Other

  • Technical Presentation (2021 academic year) Late  - Other

  • Natural Language Processing (2021 academic year) Second semester  - Wed 1, Wed 2

  • Natural Language Processing (2021 academic year) Second semester  - Wed 1, Wed 2

  • Specific Research of Electronics and Information Systems Engineering (2021 academic year) Year-round  - Other

  • Seminar in Pattern Information Processing (2020 academic year) Year-round  - Other

  • Pattern Recognition and Learning (2020 academic year) Third semester  - Wed 1, Wed 2

  • Pattern Recognition and Learning (2020 academic year) Third semester  - Wed 1, Wed 2

  • Media Information Processing (2020 academic year) Late  - Thu 1, Thu 2

  • English Engineering (2020 academic year) 1st semester  - Tue 5, Tue 6, Thu 1, Thu 2

  • English Engineering (2020 academic year) 1st semester  - Tue 5, Tue 6, Thu 1, Thu 2

  • Engineering English (2020 academic year) Late  - Other

  • Knowledge Engineering (2020 academic year) 1st semester  - Wed 1, Wed 2, Fri 1, Fri 2

  • Knowledge Engineering (2020 academic year) 1st semester  - Wed 1, Wed 2, Fri 1, Fri 2

  • Knowledge Engineering (2020 academic year) 1st semester  - Wed 1, Wed 2, Fri 1, Fri 2

  • Knowledge Engineering (2020 academic year) 1st semester  - Wed 1, Wed 2, Fri 1, Fri 2

  • Language Media (2020 academic year) Late  - Other

  • Technical Writing (2020 academic year) Early  - Other

  • Technical Presentation (2020 academic year) Late  - Other

  • Natural Language Processing (2020 academic year) Second semester  - Wed 1, Wed 2

  • Natural Language Processing (2020 academic year) Second semester  - Wed 1, Wed 2

  • Specific Research of Electronics and Information Systems Engineering (2020 academic year) Year-round  - Other

  • Topics in Electronic and Information Systems Engineering (2020 academic year) Early  - Fri 1, Fri 2
