Building a multi-domain comparable corpus using a learning to rank method

Our paper "Building a multi-domain comparable corpus using a learning to rank method", with Razieh Rahimi, Azadeh Shakery, Javid Dadashkarimi, Mozhdeh Ariannezhad, and Hossein Nasr Esfahani has been published at the Journal of Natural Language Engineering. \o/ Multilingual nature of the Web makes interlingual translation a crucial requirement for information management applications. Bilingual humanly constructed […]

Building a Domain-Based Persian-English Comparable Corpus

This project is one of the Persian Corpus Creation projects in the IIS Lab, University of Tehran. Comparable corpora have been identified as a key resource for obtaining translation knowledge. Domain specificity of available comparable corpora, demands more attentions to create multi-domain corpora. This project seeks to construct a Persian-English comparable corpus using Web data, based […]

Persian Information Retrieval / Persian-English Cross-Language Information Retrieval

Persian text information processing is one of the main research lines at IIS Lab, University of Tehran. As one of the members of this lab, I am involved in research projects that focus on leveraging different available resources for Persian text processing and evaluating variation of information retrieval models for Persian information retrieval. On the […]