LDC-IL: The Indian repository of resources for language technology,Language Resources and Evaluation

当前位置： X-MOL 学术 › Lang. Resour. Eval. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

LDC-IL: The Indian repository of resources for language technology
Language Resources and Evaluation ( IF 2.7 ) Pub Date : 2021-01-03 , DOI: 10.1007/s10579-020-09523-3
Narayan Choudhary

This paper introduces the Government of India Initiative on linguistic data creation in Indian languages. The Linguistic Data Consortium for Indian Languages (LDC-IL) is a fully funded Government of India scheme established in 2007 to cater to the needs of linguistic resources required for the development of language technology in Indian languages. LDC-IL worked silently for more than a decade with a team of around regular fifty people and involving thousands of resource persons, covering twenty major languages of India. Part of the output of LDC-IL was launched in April 2019 by the Hon’ble Vice President of India. This paper, the first introductory paper in an academic journal, aims to give a brief of the works done by LDC-IL and how these works are crucial in the development of language technology for Indian languages. Within a short span of eight months of its release, the language resources released by LDC-IL have been procured and utilized by scores of industry and academic bodies and individual researchers, including major industry leaders like Microsoft, Google, Samsung etc.

中文翻译：

LDC-IL：印度语言技术资源库

本文介绍了印度政府关于以印度语言创建语言数据的倡议。印度语言语言数据联盟（LDC-IL）是由印度政府资助的全额计划，成立于2007年，旨在满足开发印度语言技术所需的语言资源需求。LDC-IL与大约50名常规人员组成的团队默默工作了十多年，涉及数千名专家，涉及印度的二十种主要语言。印度Hon'ble副总裁于2019年4月启动了LDC-IL的部分输出。本文是学术期刊上的第一篇介绍性论文，旨在简要介绍LDC-IL所做的工作以及这些工作在印度语言技术发展中如何发挥关键作用。

更新日期：2021-01-04

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>