0% found this document useful (0 votes)
51 views6 pages

Demos 000

proceedings 1

Uploaded by

music2850
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
51 views6 pages

Demos 000

proceedings 1

Uploaded by

music2850
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

COLING 2012

24th International Conference on Computational Linguistics Proceedings of COLING 2012: Demonstration Papers

Program chairs: Martin Kay and Christian Boitet 8-15 December 2012 Mumbai, India

Diamond sponsors
Tata Consultancy Services Linguistic Data Consortium for Indian Languages (LDC-IL)

Gold Sponsors
Microsoft Research Beijing Baidu Netcon Science Technology Co. Ltd.

Silver sponsors
IBM, India Private Limited Crimson Interactive Pvt. Ltd. Yahoo Easy Transcription & Software Pvt. Ltd.

Proceedings of COLING 2012: Demonstration Papers Martin Kay and Christian Boitet (eds.) Preprint edition Published by The COLING 2012 Organizing Committee Mumbai, 2012 This volume c 2012 The COLING 2012 Organizing Committee. Licensed under the Creative Commons Attribution-Noncommercial-Share Alike 3.0 Nonported license.

http://creativecommons.org/licenses/by-nc-sa/3.0/
Some rights reserved.

Contributed content copyright the contributing authors. Used with permission. Also available online in the ACL Anthology at http://aclweb.org

ii

Table of Contents

Complex Predicates in Telugu: A Computational Perspective Rahul Balusu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 Heloise - An Ariane-G5 Compatible Rnvironment for Developing Expert MT Systems Online Vincent Berment and Christian Boitet . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 Keyphrase Extraction in Scientic Articles: A Supervised Approach Pinaki Bhaskar, Kishorjit Nongmeikapam and Sivaji Bandyopadhyay . . . . . . . . . . . . . . . . . 17 IKAR: An Improved Kit for Anaphora Resolution for Polish Bartosz Broda, ukasz Burdka and Marek Maziarz . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 Intention Analysis for Sales, Marketing and Customer Service Cohan Sujay Carlos and Madhulika Yalamanchi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33 Authorship Identication in Bengali Literature: a Comparative Analysis Tanmoy Chakraborty . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41 Word Root Finder: a Morphological Segmentor Based on CRF Joseph Z. Chang and Jason S. Chang. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51 An Efcient Technique for De-Noising Sentences using Monolingual Corpus and Synonym Dictionary Sanjay Chatterji, Diptesh Chatterjee and Sudeshna Sarkar . . . . . . . . . . . . . . . . . . . . . . . . . . . 59 An Example-Based Japanese Proofreading System for Offshore Development Yuchang Cheng and Tomoki Nagase. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67 DomEx: Extraction of Sentiment Lexicons for Domains and Meta-Domains Ilia Chetviorkin and Natalia Loukachevitch . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77 On the Romanian Rhyme Detection Alina Ciobanu and Liviu P . Dinu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87 Hierarchical Dialogue Policy Learning using Flexible State Transitions and Linear Function Approximation Heriberto Cuayhuitl, Ivana Kruijff-Korbayov and Nina Dethlefs . . . . . . . . . . . . . . . . . . . . 95 Automated Paradigm Selection for FSA based Konkani Verb Morphological Analyzer Shilpa Desai, Jyoti Pawar and Pushpak Bhattacharyya . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103 Hindi and Marathi to English NE Transliteration Tool using Phonology and Stress Analysis Manikrao Dhore, Shantanu Dixit and Ruchi Dhore . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111 Dealing with the Grey Sheep of the Romanian Gender System, the Neuter Liviu P . Dinu, Vlad Niculae and Maria Sulea . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119 Authorial Studies using Ranked Lexical Features Liviu P . Dinu and Sergiu Nisioi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125

iii

ScienQuest: a Treebank Exploitation Tool for non NLP-Specialists Achille Falaise, Olivier Kraif, Agns Tutin and David Rouquet . . . . . . . . . . . . . . . . . . . . . . . 131 An In-Context and Collaborative Software Localisation Model Amel Fraisse, Christian Boitet and Valrie Bellynck . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 141 Efcient Feedback-based Feature Learning for Blog Distillation as a Terabyte Challenge Dehong Gao, Wenjie Li and Renxian Zhang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147 Beyond Twitter Text: A Preliminary Study on Twitter Hyperlink and its Application Dehong Gao, Wenjie Li and Renxian Zhang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 155 Rule Based Hindi Part of Speech Tagger Navneet Garg, Vishal Goyal and Suman Preet . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 163 Fangorn: A System for Querying very large Treebanks Sumukh Ghodke and Steven Bird . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 175 CRAB Reader: A Tool for Analysis and Visualization of Argumentative Zones in Scientic Literature Yufan Guo, Ilona Silins, Roi Reichart and Anna Korhonen . . . . . . . . . . . . . . . . . . . . . . . . . . 183 Automatic Punjabi Text Extractive Summarization System Vishal Gupta and Gurpreet Lehal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 191 Complete Pre Processing Phase of Punjabi Text Extractive Summarization System Vishal Gupta and Gurpreet Lehal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 199 Revisiting Arabic Semantic Role Labeling using SVM Kernel Methods Laurel Hart, Hassan Alam and Aman Kumar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 207 fokas: Formerly Known As A Search Engine Incorporating Named Entity Evolution Helge Holzmann, Gerhard Gossen and Nina Tahmasebi . . . . . . . . . . . . . . . . . . . . . . . . . . . . 215 An Annotation System for Development of Chinese Discourse Corpus Hen-Hsen Huang and Hsin-Hsi Chen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 223 Modeling Pollyanna Phenomena in Chinese Sentiment Analysis Ting-Hao Huang, Ho-Cheng Yu and Hsin-Hsi Chen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 231 Eating Your Own Cooking: Automatically Linking Wordnet Synsets of Two Languages Salil Joshi, Arindam Chatterjee, Arun Karthikeyan Karra and Pushpak Bhattacharyya. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 239 I Can Sense It: a Comprehensive Online System for WSD Salil Joshi, Mitesh M. Khapra and Pushpak Bhattacharyya . . . . . . . . . . . . . . . . . . . . . . . . . . 247 Collaborative Computer-Assisted Translation Applied to Pedagogical Documents and Literary Works Ruslan Kalitvianski, Christian Boitet and Valrie Bellynck . . . . . . . . . . . . . . . . . . . . . . . . . . 255 Discrimination-Net for Hindi Diptesh Kanojia, Arindam Chatterjee, Salil Joshi and Pushpak Bhattacharyya . . . . . . . 261

iv

Rule Based Urdu Stemmer Rohit Kansal, Vishal Goyal and Gurpreet Singh Lehal. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 267 JMaxAlign: A Maximum Entropy Parallel Sentence Alignment Tool Max Kaufmann . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 277 MIKE: An Interactive Microblogging Keyword Extractor using Contextual Semantic Smoothing Osama Khan and Asim Karim . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 289 Domain Based Classication of Punjabi Text Documents Nidhi Krail and Vishal Gupta . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 297 Open Information Extraction for SOV Language Based on Entity-Predicate Pair Detection Woong-Ki Lee, Yeon-Su Lee, Hyoung-Gyu Lee, Won-Ho Ryu and Hae-Chang Rim . . . 305 An Omni-Font Gurmukhi to Shahmukhi Transliteration System Gurpreet Singh Lehal, Tejinder Singh Saini and Savleen Kaur Chowdhary . . . . . . . . . . 313 THUTR: A Translation Retrieval System Chunyang Liu, Qi Liu, Yang Liu and Maosong Sun . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 321 Recognition of Named-Event Passages in News Articles Luis Marujo, Wang Ling, Anatole Gershman, Jaime Carbonell, Joo P . Neto and David Matos . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 329 Nonparametric Model for Inupiaq Word Segmentation ThuyLinh Nguyen and Stephan Vogel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 337 Stemming Tigrinya Words for Information Retrieval Omer Osman and Yoshiki Mikami. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 345 OpenWordNet-PT: An Open Brazilian Wordnet for Reasoning Valeria de Paiva, Alexandre Rademaker and Gerard de Melo . . . . . . . . . . . . . . . . . . . . . . . 353 WordNet Website Development And Deployment using Content Management Approach Neha Prabhugaonkar, Apurva Nagvenkar, Venkatesh Prabhu and Ramdas Karmali . . 361 A Demo for Constructing Domain Ontology from Academic Papers Feiliang Ren. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 369 A Practical Chinese-English ON Translation Method Based on ONs Distribution Characteristics on the Web Feiliang Ren. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 377 Elissa: A Dialectal to Standard Arabic Machine Translation System Wael Salloum and Nizar Habash . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 385 Domain Based Punjabi Text Document Clustering Saurabh Sharma and Vishal Gupta . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 393 Open source multi-platform NooJ for NLP Max Silberztein, Tams Vradi and Marko Tadi c . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 401

Punjabi Text-To-Speech Synthesis System Parminder Singh and Gurpreet Singh Lehal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 409 EXCOTATE: An Add-on to MMAX2 for Inspection and Exchange of Annotated Data Tobias Stadtfeld and Tibor Kiss . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 417 Bulgarian Inectional Morphology in Universal Networking Language Velislava Stoykova. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 423 Central and South-East European Resources in META-SHARE Marko Tadi c and Tams Vradi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 431 Markov Chains for Robust Graph-Based Commonsense Information Extraction Niket Tandon, Dheeraj Rajagopal and Gerard de Melo . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 439 Visualization on Financial Terms via Risk Ranking from Financial Reports Ming-Feng Tsai and Chuan-Ju Wang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 447 UNL Explorer Hiroshi Uchida, Meiying Zhu and Md. Anwarus Salam Khan . . . . . . . . . . . . . . . . . . . . . . . 453 An SMT-driven Authoring Tool Sriram Venkatapathy and Shachar Mirkin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 459 Generating Questions from Web Community Contents Baoxun Wang, Bingquan Liu, Chengjie Sun, Xiaolong Wang and Deyuan Zhang. . . . . 467 Demo of iMAG Possibilities: MT-postediting, Translation Quality Evaluation, Parallel Corpus Production Ling Xiao Wang, Ying Zhang, Christian Boitet and Valrie Bellynck . . . . . . . . . . . . . . . . . 475 Jane 2: Open Source Phrase-based and Hierarchical Statistical Machine Translation Joern Wuebker, Matthias Huck, Stephan Peitz, Malte Nuhn, Markus Freitag, Jan-Thorsten Peter, Saab Mansour and Hermann Ney . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 483 Automatic Extraction of Turkish Hypernym-Hyponym Pairs From Large Corpus Savas Yildirim and Tugba Yildiz. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 493 Chinese Web Scale Linguistic Datasets and Toolkit Chi-Hsin Yu and Hsin-Hsi Chen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 501 Developing and Evaluating a Computer-Assisted Near-Synonym Learning System Liang-Chih Yu and Kai-Hsiang Hsu. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 509 Arabic Morphological Analyzer with Agglutinative Afx Morphemes and Fusional Concatenation Rules Fadi Zaraket and Jad Makhlouta . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 517 SMR-Cmp: Square-Mean-Root Approach to Comparison of Monolingual Contrastive Corpora HuaRui Zhang, Chu-Ren Huang and Francesca Quattri . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 527 A Machine Learning Approach to Convert CCGbank to Penn Treebank Xiaotian Zhang, Hai Zhao and Cong Hui . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 535

vi

You might also like