当前位置: X-MOL 学术Found. Trends Inf. Ret. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Arabic Information Retrieval
Foundations and Trends in Information Retrieval ( IF 10.4 ) Pub Date : 2014-2-4 , DOI: 10.1561/1500000031
Kareem Darwish

In the past several years, Arabic Information Retrieval (IR) has garnered significant attention. The main research interests have focused on retrieval of formal language, mostly in the news domain, with ad hoc retrieval, OCR document retrieval, and cross-language retrieval. The literature on other aspects of retrieval continues to be sparse or non-existent, though some of these aspects have been investigated by industry. Others aspects of Arabic retrieval that have received attention include document image retrieval, speech search, social media and web search, and filtering. However, efforts on different aspects of Arabic retrieval continue to be deficient and severely lacking behind efforts in other languages. The survey covers: 1) general properties of the Arabic language; 2) some of the aspects of Arabic that affect retrieval; 3) Arabic processing necessary for effective Arabic retrieval; 4) Arabic retrieval in public IR evaluations; 5) specialized retrieval problems, namely Arabic-English CLIR, Arabic Document Image Retrieval, Arabic Social Search, Arabic Web Search, Question Answering, Image retrieval, and Arabic Speech Search; 6) Arabic IR and NLP resources; and 7) open IR problems that require further attention.



中文翻译:

阿拉伯语信息检索

在过去的几年中,阿拉伯语信息检索(IR)受到了广泛关注。主要研究兴趣集中在正式语言的检索上,主要是在新闻领域,包括即席检索,OCR文档检索和跨语言检索。关于检索的其他方面的文献仍然很少或不存在,尽管其中一些方面已经由业界进行了调查。阿拉伯语检索的其他方面已引起关注,包括文档图像检索,语音搜索,社交媒体和网络搜索以及过滤。但是,在阿拉伯文检索的不同方面的努力仍然不足,严重缺乏其他语言的努力。该调查涵盖:1)阿拉伯语的一般属性;2)阿拉伯语影响检索的某些方面;3)有效进行阿拉伯语检索所需的阿拉伯语处理;4)在公共关系评估中检索阿拉伯语;5)专门的检索问题,即阿拉伯语-英语CLIR,阿拉伯语文档图像检索,阿拉伯语社交搜索,阿拉伯语Web搜索,问答,图像检索和阿拉伯语语音搜索;6)阿拉伯国家关系和自然语言资源;7)需要进一步关注的开放IR问题。

更新日期:2014-02-04
down
wechat
bug