CICLing 2016 Accepted Papers
Notes:
-
The title, authors, and abstract are as registered in the system and may differ from the actual data in the printed paper.
-
The papers are listed in no particular order.
LNCS
Santanu Pal, Sudip Naskar and Josef van Genabith. Forest to String Based Statistical Machine Translation with Hybrid Word Alignments Wuying Liu and Lin Wang. Fast-Syntax-Matching-based Japanese-Chinese Limited Machine Translation
Arjun Mukherjee. Extracting Aspect Specific Sentiment Expressions implying Negative Opinions
Geli Fei, Arjun Mukherjee, Zhiyuan Chen and Bing Liu. Discovering Correspondence of Sentiment Words and Aspects
Mohamed Dermouche, Julien Velcin, Rémi Flicoteaux, Sylvie Chevret and Namik Taright. Supervised Topic Models for Diagnosis Code Assignment to Discharge Summaries
Adiel Mittmann, Aldo von Wangenheim and Alckmar Dos Santos. Aoidos: A System for the Automatic Scansion of Poetry Written in Portuguese
Batuer Aisha. Uyghur Shallow Parsing using Part-of-Speech Features
Aneeta Niazi. Morphological Analysis of Urdu Verbs
Ella Rabinovich, Shuly Wintner and Ofek Luis Lewinsohn. A Parallel Corpus of Translationese
Elnaz Davoodi, Leila Kosseim, Felix-Herve Bachand, Majid Laali and Emmanuel Argollo. Classification of Textual Genres using Discourse Information Fériel Ben Fraj Trabelsi, Chiraz Ben Othmane Zribi and Saoussen Mathlouthi. Arabic Anaphora resolution using Markov decision process
Calkin Suero Montero, Hatem Haddad, Maxim Mozgovoy and Chedi Bechikh Ali. Detecting the Likely Causes behind the Emotion Spikes of Influential Twitter Users
Mourad Gridach. Deep Learning Approach for Arabic Named Entity Recognition Jie Yang, Zhiyang Teng, Meishan Zhang and Yue Zhang. Combining Discrete and Neural Features for Sequence Labeling
İlknur Dönmez and Eşref Adalı. Turkish Document Classification with Coarse-grained Semantic Matrix Zijian Győző Yang, László János Laki and Borbála Siklósi. Quality Estimation for English-Hungarian with Optimized Semantic Features and the HuQ corpus Joan Byamugisha, C. Maria Keet and Langa Khumalo. Pluralising Nouns in isiZulu and Related Languages Ameur Douib, David Langlois and Kamel Smaïli. Genetic-based decoder for statistical machine translation
Mohamed Amine Menacer, Abdelfetah Boumerdas, Chahnez Zakaria and Kamel Smaili. A new language model based on possibility theory
Viet Tran Hong, Huyen Vu Thuong, Vinh Nguyen Van and Minh Nguyen Le. A Classifier-based Preordering Approach for English-Vietnamese Statistical Machine Translation
Mahdi Mohseni, Javad Ghofrani and Heshaam Faili. Persianp: a Persian Text Processing Toolbox
Daniela Gîfu and Radu Simionescu. Tracing Language Variation for Romanian Ugur Sopaoglu and Gonenc Ercan. Evaluation of Semantic Relatedness Measures for Turkish Language Aslı Eyecioğlu Özmutlu and Bill Keller. Constructing A Turkish Corpus for Paraphrase Identification and Semantic Similarity
Lyndon White, Roberto Togneri, Wei Liu and Mohammed Bennamoun. Generating Bags of Words from the Sums of their Word Embeddings Sachin Pawar, Pushpak Bhattacharyya and Girish Palshikar. End-to-End Relation Extraction using Markov Logic Networks Megala Uthayakumar, Pranavan Theivendiram, Nilusija Nadarasamoorthy, Mokanarangan Thayaparan, Sanath Jayasena, Gihan Dias and Surangika Ranathunga. Named-Entity-Recognition (NER) for Tamil Language Using Margin-Infused Relaxed Algorithm (MIRA)
Gozde Gul Sahin. Verb Sense Annotation For Turkish PropBank via Crowdsourcing Sachin Pawar, Nitin Ramrakhiyani, Swapnil Hingmire and Girish Palshikar. Topics and Label Propagation: Best of Both Worlds for Weakly Supervised Text Classification
Vladislav Kubon, Marketa Lopatkova and Jiří Mírovský. Analysis of Word Order in Multiple Treebanks
Begoña Altuna, María Jesús Aranzabe and Arantza Díaz de Ilarraza. Adapting TimeML to Basque: Event annotation
Vigneshwaran Muralidaran and Dipti Sharma. Construction Grammar Approach for Tamil Dependency Parsing Agnivo Saha and Sudeshna Sarkar. Enhancing Neural Network based Dependency Parsing Using Morphological Information for Hindi Kovida Nelakuditi, Divya Sai Jitta and Radhika Mamidi. Part-of-Speech Tagging for Code mixed English-Telugu Social media data
Seniz Demir, Murat Tan and Berkay Topcu. Turkish Normalization Lexicon for Social Media
Chahira Lhioui, Anis Zouaghi and Mounir Zrigui. Knowledge Extraction with NooJ Using a syntactico-Semantic Approach for the Arabic Utterances Understanding
Sobha Lalitha Devi and Pattabhi Rk Rao. Mining of Social Networks from Literary Texts of Resource Poor Languages
Burcu Can, Ahmet Üstün and Murathan Kurfalı. Turkish PoS Tagging by Reducing Sparsity with Morpheme Tags in Small Datasets
Vasiliki Simaki, Iosif Mporas and Vasileios Megalooikonomou. Age Identification of Twitter Users: Classification Methods and Sociolinguistic Analysis
Borbála Siklósi. Using embedding models for lexical categorization in morphologically rich languages
Ladislav Lenc and Pavel Král. Deep Neural Networks for Czech Multi-label Document Classification Md Shad Akhtar, Asif Ekbal and Pushpak Bhattacharyya. Aspect Based Sentiment Analysis: Category Detection and Sentiment Classifcation for Hindi
Silpa Kanneganti, Himani Chaudhry and Dipti Mishra Sharma. Comparative Error Analysis Of Parser Outputs On Telugu Dependency Treebank Data
Malek Lhioui, Kais Haddar and Laurent Romary. Algebraic specification for interoperability between data formats: Application on Arabic lexical data
Łukasz Kobyliński and Witold Kieraś. Part of Speech Tagging for Polish: State of the Art and Future Perspectives
Marco Dinarelli and Isabelle Tellier. New Recurrent Neural Network Variants for Sequence Labeling Shiva Taslimipoor, Ruslan Mitkov, Gloria Corpas Pastor and Afsaneh Fazly. Bilingual Contexts from Comparable Corpora to Mine for Translations of Collocations
Wafa Wali, Bilel Gargouri and Abdelmajid Ben Hamadou. Using sentence semantic similarity to improve LMF standardized Arabic dictionary quality
Souha Mezghani Hammami and Lamia Hadrich Belguith. Arabic Pronominal Anaphora Resolution Based on New Set of Features
Wafa Neifar, Thierry Hamon, Pierre Zweigenbaum, Mariem Ellouze Khemakhem and Lamia Hadrich Belguith. Adaptation of a term extractor to Arabic specialised texts : first experiments and limits Rihab Bouchlaghem, Aymen Elkhelifi and Rim Faiz. Sentiment analysis in Arabic Twitter posts using supervised methods with combined features Ameni Bouaziz, Christel Dartigues, Célia Da Costa Pereira and Frédéric Precioso. Introducing Semantics in Short Text Classification Rakesh Verma, Vasanthi Vuppuluri, Arjun Mukherjee, An Nguyen, Ghita Mammar, Reed Armstrong and Shahryar Baki. Mining the Web for Collocations: IR Models of Term Associations Shonosuke Ishiwatari, Naoki Yoshinaga, Masashi Toyoda and Masaru Kitsuregawa. Instant Translation-Model Adaptation by Projecting Word Semantic Representations
Pierre Marchal and Thierry Poibeau. A Continuous-based Model of Lexical Acquisition João Sequeira, Teresa Gonçalves, Paulo Quaresma, Amália Mendes and Iris Hendrickx. Using syntactic and semantic features for classifying modal values in the Portuguese language Hanen Ameur, Salma Jamoussi and Abdelmajid Ben Hamadou. A New Emotional Vector Representation For Sentiment Analysis
Bahar Karaoğlan, Tarik Kisla and Senem Kumova Metin. DESCRIPTION OF TURKISH PARAPHRASE CORPUS STRUCTURE AND GENERATION METHOD
Prasha Shrestha, Arjun Mukherjee and Thamar Solorio. Large Scale Authorship Attribution of Online Reviews Nabil Khoufi, Chafik Aloulou and Lamia Hadrich Belguith. A Corpus Based System for Language Resource Construction and Syntactic Analysis: Case of Arabic
Saket Kumar and Omar El Ariss. Word Sense Disambiguation Using Swarm Intelligence: A Bee Colony Optimization Approach
Miran Shahine and Mohamed Sakre. Hybrid Feature Selection Approach for Arabic Named Entity Recognition
Ahmad Musleh, Nadir Durrani, Irina Temnikova, Preslav Nakov, Stephan Vogel and Osama Alsaad. Enabling Medical Translation for Low-Resource Languages Fahad Al-Obaidli, Stephen Cox and Preslav Nakov. Bi-Text Alignment of Movie Subtitles for Spoken English-Arabic Statistical Machine Translation Hiram Calvo and Omar Juárez Gambino. Cascading Classifiers for Twitter Sentiment Analysis with Emotion Lexicons Adèle Désoyer, Frédéric Landragin, Isabelle Tellier, Anaïs Lefeuvre, Jean-Yves Antoine and Marco Dinarelli. Coreference Resolution for French Oral Data: Machine Learning Experiments with ANCOR
Alawya Alawami. Aspect Terms Extraction of Arabic Dialects for Opinion Mining Using Conditional Random Fields
Svetlana Toldova and Max Ionov. Features for discourse-new referent detection in Russian
Orna Almogi, Lena Dankin, Nachum Dershowitz, Yair Hoffman, Dimitri Pauls, Dorji Wangchuk and Lior Wolf. Stemming and Segmentation for Classical Tibetan
CyS
Chenggang Mi, Yating Yang, Xi Zhou, Lei Wang, Xiao Li and Tonghai Jiang. Exploiting Bishun to Predict the Pronunciation of Chinese
Sandipan Dandapat and Andy Way. Improved Named Entity Recognition using Machine Translation-based Cross-lingual Information
Kwang-Yong Jeong and Kyung-Soon Lee. Follower Behavior Analysis via Influential Transmitters on Social Issues in Twitter
Tomas Hercig, Tomas Brychcin, Lukas Svoboda, Michal Konkol and Josef Steinberger. Unsupervised methods to improve aspect-based sentiment analysis in Czech
Nouha Othman and Rim Faiz. Question Answering Passage Retrieval and Re-ranking Using N-grams and SVM
Ahmed Nabhan and Khaled Shaalan. A Graph-based Approach to Text Genres Analysis Yu Zhao, Sheng Gao, Patrick Gallinari and Jun Guo. A Novel Multimodal Deep Neural Network Framework for Extending Knowledge Base
Jin-Xia Huang, Kyung Soon Lee, Key-Sun Choi and Young-Kil Kim. Extract Reliable Relations from Wikipedia Texts for Practical Ontology Construction
Kunal Chakma and Amitava Das. CMIR:A Corpus for Evaluation of Code Mixed Information Retrieval of Hindi-English Tweets
Katrin Prikrylova, Vladislav Kubon and Katerina Veselovska. The Role of Conjunctions in Adjective Polarity Analysis in Czech
Irvin Vargas-Campos and Fernando Alva-Manchego. SciEsp: Structural Analysis of Abstracts Written in Spanish
Sandeep Kumar Dash, Dr. Partha Pakray and Alexander Gelbukh. Natural Language Text to Virtual Action Malek Hajjem and Chiraz Latiri. Thematic Clustering For Comparable Tweet corpora Building
Laurent Jakubina and Philippe Langlais. A Comparison of Methods for Identifying the Translation of Words in a Comparable Corpus: Recipes and Limits
Luz Marina Sierra, Carlos Cobos and Juan Carlos Corrales. Tokenizer adapted for nasa yuwe language Edmundo Pavel Soriano Morales, Julien Ah-Pine and Sabine Loudcher. Using a Heterogeneous Linguistic Network for Word Sense Induction and Disambiguation
Polibits
Nouri Nouha and Talel Ladhari. An Efficient Iterated Greedy Algorithm for the Makespan Blocking Flow Shop Scheduling Problem
Zhixuan Yang, Chong Ruan, Caihua Li and Junfeng Hu. Optimize Hierarchical Softmax with Word Similarity Knowledge
Henning Wold, Linn Vikre, Özlem Özgöbek and Jon Atle Gulla. Hybrid Entity Driven News Detection on Twitter
Andrea Vanzo, Danilo Croce, Emanuele Bastianelli, Roberto Basili and Daniele Nardi. Robust Spoken Language Understanding for House Service Robots Attila Novák. Improving corpus annotation quality using word embedding models King Ip Lin and Yang Zhou. Twitter Sentiment Analysis -- Comparison between Lexicon Based vs. Machine Learning Based approaches
Mohammad Golam Sohrab, Makoto Miwa and Yutaka Sasaki. IN-DEDUCTIVE and DAG-Tree Approaches for Large-Scale Extreme Multi-label Hierarchical Text Classification
Jiaqiang Chen, Niket Tandon, Charles Darwis Hariman and Gerard de Melo. WebBrain: Joint Neural Learning of Large-Scale Common Sense Doru Anastasiu Popescu, Nicolae Bold and Daniel Nijloveanu. A method based on genetic algorithms for generating assessment tests used for learning
IJCLA
Aytuğ Onan, Serdar Korukoğlu and Hasan Bulut. LDA-based topic modelling in text sentiment classification: an empirical analysis
Masaki Murata, Shunsuke Tsudo, Masato Tokuhisa and Qing Ma. Correcting Redundant Japanese Sentences Using Patterns and Machine Learning for the Development of Writing Support Systems
Shirley Anugrah Hayati, Alfan Farizki Wicaksono and Mirna Adriani. Short Text Classification on Complaint Documents
Mahran Farhat and Gammoudi Mohamed Mohsen. Enhanced metric for comparability analysis of multilingual documents
Achraf Ben Romdhane, Salma Jamoussi, Abdelmajid Ben Hamadou and Kamel Smaili. Phrase-Based Language Model in Statistical Machine Translation Ilnar Salimzianov and Ozlem Cetinoglu. Dependency-based Sentence Simplification for Large-scale LFG Parsing: Selecting Simplified Candidates for Efficiency and Coverage
Cédric Maigrot, Sandra Bringay and Jérôme Azé. Concept drift vs suicide : How one can help prevent the other?
Vandan Mujadia, Palash Gupta and Dipti Sharma. Pronominal reference type identification and event anaphora resolution for Hindi Diana Inkpen, T.Sima Paribakht, Farahnaz Faez and Ehsan Amjadian. Term Evaluator: A Tool for Terminology Annotation and Evaluation Attila Novák and Borbála Siklósi. Grapheme-to-phoneme transcription in Hungarian Fernando Antônio Asevedo Nóbrega and Thiago Pardo. Improving Content Selection for Update Summarization with Subtopic-Enriched Sentence Ranking Functions Roy Khristopher Bayot and Teresa Gonçalves. Multilingual author profiling using SVMs and linguistic features Sapan Shah, Dhwani Vora, B P Gautham and Sreedhar Reddy. A Domain Specific Search Engine for Material Science Literature
Jan Motl and Wei Nie. What Makes a Fairy Tale
ACLing
Ertuğrul Yilmaz, İlknur Durgar El-Kahlout and Coşkun Mermer. Effects of Pre/Post-processing Techniques on Improving Egyptian Arabic to English SMT
Ahmad Abd Al-Aziz, Mervat Gheith and Ahmed Sharaf Eldin. Toward Building Arabizi Sentiment Lexicon based on Orthographic Variants Identification Hussein Khalil, Taha Osman, Paul Bowden and Mohammad Mlidan Mlitam. Extracting Arabic Composite Names using a knowledge Driven Approach
Mallat Souheyl, Emna Hkiri, Mohsen Maraoui, Anis Zouaghi and Zrigui Mounir. Statistical Approach to Semantic Indexing in Multilingual Documents
Mallat Souheyl, Emna Hkiri, Maraoui Mohsen, Anis Zouaghi and Mounir Zrigui. Hybrid model of query lexical expansion and translation for multilingual information retrieval on the web
Jihene Younes, Emna Souissi and Hadhémi Achour. A Hidden Markov Model for Automatic Transliteration of Romanized Tunisian Dialect
Fatma Ben Mesmia, Kais Haddar, Denis Maurel and Nathalie Friburger. Recognition and TEI annotation of Arabic Events Using Transducers Aymen Trigui, Naim Terbeh, Mohsen Maraoui and Mounir Zrigui. Statistical approach for spontaneous Arabic Speech understanding based on stochastic speech recognition module
Chihebeddine Ammar, Kais Haddar and Laurent Romary. TMF Normalization of Arabic Technical and Scientific Terms Nadia Ghezaiel Hammouda and Kais Haddar. A rule-Based lexical disambiguation for Arabic corpora
Nadia Soudani, Ibrahim Bounhas and Yahya Slimani. Experimenting an unsupervised LSA-based Approach for Arabic queries SEmantic Disambiguation Mohamed Aly Fall Seideh, Héla Fehri and Kais Haddar. Toward building a bilingual lexicon for Arabic and French named entities of Herbalism
Lin Kassem, Caroline Sabty, Nada Sharaf, Menna Bakry and Slim Abdennadher. tashkeelWAP: A Game With A Purpose For Digitizing Arabic Diacritics
Ayoub Kadim, Azzeddine Lazrek and Yahya Ould Mohamed Elhadj. An improved version of the Nemlar Arabic written corpus used in a developed Arabic POS Tagger
Caroline Sabty, Mirna Yacout, Mohamed Sameh and Slim Abdennadher. Gamified Collection of Arabic Named Entity Recognition Data Mourad Mars. Spell Checking Arabic text: From error detection to error correction
Karim Sayadi, Marcus Liwicki, Marc Bui and Rolf Ingold. Sentiment analysis on Tunisian dialect : case study of the Tunisian election context
Haithem Afli, Walid Aransa, Pintu Lohar and Andy Way. From Arabic User-Generated Content to Machine Translation: Integrating Automatic Error Correction
Abdulrahman Alosaimy and Eric Atwell. Ensemble Morphosyntactic Analyser for Classical Arabic Khaled Dahmri, Hassina Aliane and Hamid Azzoune. Enhancing HeidelTime for Time expression Annotation in Arabic News
Yousef Alotaibi, Yasser M. Seddiq, Ali Meftah, Sid-Ahmed Selouani and Mansour M. Alghamdi. Distinctive Phonetic Features of Arabic Dialects: Comparative Study
Asmaa Mountassir, Houda Benbrahim and Ilham Berrada. Building and Annotating an Arabic Corpus for Opinion Mining (ACOM)
Rami Ayadi. A survey of Arabic Text Representation and Classification Methods
TurCLing
Batuer Aisha. A Uyghur Lemmatization and Part-Of-Speech Tagging
Zhenisbek Assylbekov, Jonathan N. Washington, Francis M. Tyers, Assulan Nurkas, Aida Sundetova, Aidana Karibayeva, Balzhan Abduali and Dina Amirova. A free/open-source hybrid morphological disambiguation tool for Kazakh Kadir Yalcin and Ilyas Cicekli. PlagDS: A Plagiarism Detection System Based on Document Similarity Dilara Torunoğlu Selamet, Eren Bekar, Tugay İlbay and Gülşen Eryiğit. Exploring Spelling Correction Approaches for Turkish
Yeşim Aksan, S. Ayşe Özel, Hakan Yılmazer and Umut Demirhan. The Turkish National Corpus (TNC): Comparing the Architectures of v1 and v2
Serkan Kumyol, Burcu Can and Cem Bozşahin. Using Allomorphs in Turkish Morphological Segmentation Reduces Sparsity Umut Sulubacak, Gülşen Eryiğit and Tuğba Pamay. A Revisited Turkish Dependency Treebank Dilara Torunoğlu Selamet, Tuğba Pamay and Gülşen Eryiğit. Simplification of Turkish Sentences
RCS
Heba Ismail, Saad Harous and Boumediene Belkhouche. A Comparative Analysis of Machine Learning Classifiers for Twitter Sentiment Analysis C. Alberto Ochoa-Zezatti. Identifying consumption patterns in Twitter using text mining to classify trends in shopping
Sukhada and Dipti Sharma. Analyzing English Phrases from Paninian Perspective
Rishabh Srivastava and Soma Paul. Hindi Question Answering system using PurposeNet-based Ontology
Zeineb Neji, Lamia Belguith and Marieme Ellouze. Question Answering Based on Temporal Inference
Athira U and Sabu M.Thampi. A Psychometric Approach to Authorship Analysis of Short Documents
Sinan Polat, Merve Selcuk-Simsek and Ilyas Cicekli. A Modified Earley Parser for Huge Natural Language Grammars
Malarkodi C.S., Elisabeth Lex and Sobha Lalitha Devi. Named Entity Recognition for the Agricultural Domain
Sindhuja Gopalan, Paolo Rosso and Sobha Lalitha Devi. Discourse Connective - A Marker for Identifying Featured Articles in Biological Wikipedia
Firas Hmida, Emmanuel Morin and Beatrice Daille. Aligned Knowledge-Rich Contexts from Specialized Comparable Corpora
Biswanath Barik, Erwin Marsi and Pinar Öztürk. Event Causality Extraction from Natural Science Literature Václav Rajtmajer and Pavel Král. Event Detection in Czech Twitter Ivan Garrido Marquez, Jorge Garcia Flores, François Lévy and Adeline Nazarenko. Blog annotation: from corpus analysis to automatic tag suggestion Tereza Pařilová, Filip Mrváň, Bruno Mižík and Eva Hladká. Emerging Technology Enabling Dyslexia Users To Read and Perceive Written Text Correctly
Sarra Zrigui, Anis Zouaghi, Rami Ayadi, Salah Zrigui and Mounir Zrigui. An opinions analysis system for Arabic
Khaireddine Bacha. Contribution to the achievement of a spellchecker for Arabic
Octavio Augusto Sánchez Velázquez and Gerardo Eugenio Sierra Martínez. Let's agree to disagree: Measuring agreement between annotators for opinion mining task
Víctor Mijangos, Gerardo Sierra and Abel Herrera. A word embeddings model for sentence similarity Amit Mishra and Sanjay Kumar Jain. Computing Sentiment Polarity of Questioners Asking Why Type Questions in Opinion Question Answering
Raheem Sarwar and Sarana Nutanong. The Key Factors and Their Influence in Authorship Attribution : Systematic Literature Review Protocol and Preliminary Results
Seifeddine Mechti, Maher Jaoua, Rim Faiz, Heni Bouhamed and Lamia Hadrich Belguith. Author profiling: Age prediction based on advanced Bayesian networks
Rajendra Prasath and Pinar Ozturk. Learning to Extract Information from Scientific Articles - A Case Based Reasoning Approach Alibek Barlybayev. Development of an automated recognition system of Kazakh speech in Smart University Seifeddine Mechti, Maher Jaoua, Rim Faiz and Lamia Hadrich Belguith. An Analysis Framework for Hybrid Authorship Verification
Bagdat Myrzakhmetov and Aibek Makazhanov. Initial Experiments on Russian to Kazakh SMT
Poster
Samar Anbarkhan. Extracting semantic associations between obesity, weight loss and herbal medicine features