当前位置: X-MOL 学术Lang. Resour. Eval. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
LDC-IL: The Indian repository of resources for language technology
Language Resources and Evaluation ( IF 2.7 ) Pub Date : 2021-01-03 , DOI: 10.1007/s10579-020-09523-3
Narayan Choudhary

This paper introduces the Government of India Initiative on linguistic data creation in Indian languages. The Linguistic Data Consortium for Indian Languages (LDC-IL) is a fully funded Government of India scheme established in 2007 to cater to the needs of linguistic resources required for the development of language technology in Indian languages. LDC-IL worked silently for more than a decade with a team of around regular fifty people and involving thousands of resource persons, covering twenty major languages of India. Part of the output of LDC-IL was launched in April 2019 by the Hon’ble Vice President of India. This paper, the first introductory paper in an academic journal, aims to give a brief of the works done by LDC-IL and how these works are crucial in the development of language technology for Indian languages. Within a short span of eight months of its release, the language resources released by LDC-IL have been procured and utilized by scores of industry and academic bodies and individual researchers, including major industry leaders like Microsoft, Google, Samsung etc.



中文翻译:

LDC-IL:印度语言技术资源库

本文介绍了印度政府关于以印度语言创建语言数据的倡议。印度语言语言数据联盟(LDC-IL)是由印度政府资助的全额计划,成立于2007年,旨在满足开发印度语言技术所需的语言资源需求。LDC-IL与大约50名常规人员组成的团队默默工作了十多年,涉及数千名专家,涉及印度的二十种主要语言。印度Hon'ble副总裁于2019年4月启动了LDC-IL的部分输出。本文是学术期刊上的第一篇介绍性论文,旨在简要介绍LDC-IL所做的工作以及这些工作在印度语言技术发展中如何发挥关键作用。

更新日期:2021-01-04
down
wechat
bug