{"data": [{"paragraphs": [{"context": "According to the Indian census of 2001, there were 30,803,747 speakers of Malayalam in Kerala, making up 93.2% of the total number of Malayalam speakers in India, and 96.7% of the total population of the state. There were a further 701,673 (2.1% of the total number) in Karnataka, 557,705 (1.7%) in Tamil Nadu and 406,358 (1.2%) in Maharashtra. The number of Malayalam speakers in Lakshadweep is 51,100, which is only 0.15% of the total number, but is as much as about 84% of the population of Lakshadweep. In all, Malayalis made up 3.22% of the total Indian population in 2001. Of the total 33,066,392 Malayalam speakers in India in 2001, 33,015,420 spoke the standard dialects, 19,643 spoke the Yerava dialect and 31,329 spoke non-standard regional variations like Eranadan. As per the 1991 census data, 28.85% of all Malayalam speakers in India spoke a second language and 19.64% of the total knew three or more languages. Large numbers of Malayalis have settled in Bangalore, Mangalore, Delhi, Coimbatore, Hyderabad, Mumbai (Bombay), Ahmedabad, Pune, and Chennai (Madras). A large number of Malayalis have also emigrated to the Middle East, the United States, and Europe. Accessed November 22, 2014.</ref> including a large number of professionals. There were 7,093 Malayalam speakers in Australia in 2006. The 2001 Canadian census reported 7,070 people who listed Malayalam as their mother tongue, mostly in the Greater Toronto Area and Southern Ontario. In 2010, the Census of Population of Singapore reported that there were 26,348 Malayalees in Singapore. The 2006 New Zealand census reported 2,139 speakers. 134 Malayalam speaking households were reported in 1956 in Fiji. There is also a considerable Malayali population in the Persian Gulf regions, especially in Bahrain, Muscat, Doha, Dubai, Abu Dhabi, Kuwait and European region mainly in London. World Malayalee Council, the organisation working with the Malayali diaspora across the Globe has embarked upon a project for making a data bank of the diaspora. CANNOTANSWER", "qas": [{"followup": "m", "yesno": "x", "question": "Where is Malayali located?", "answers": [{"text": "30,803,747 speakers of Malayalam in Kerala, making up 93.2% of the total number of Malayalam speakers in India,", "answer_start": 51}], "id": "C_69758fcdfc1f46baba0e92c0f3b0919c_1_q#0", "orig_answer": {"text": "30,803,747 speakers of Malayalam in Kerala, making up 93.2% of the total number of Malayalam speakers in India,", "answer_start": 51}}, {"followup": "n", "yesno": "x", "question": "What other languages are spoken there?", "answers": [{"text": "33,015,420 spoke the standard dialects, 19,643 spoke the Yerava dialect and 31,329 spoke non-standard regional variations like Eranadan.", "answer_start": 640}], "id": "C_69758fcdfc1f46baba0e92c0f3b0919c_1_q#1", "orig_answer": {"text": "33,015,420 spoke the standard dialects, 19,643 spoke the Yerava dialect and 31,329 spoke non-standard regional variations like Eranadan.", "answer_start": 640}}, {"followup": "n", "yesno": "x", "question": "What else is this place known for?", "answers": [{"text": "World Malayalee Council, the organisation working with the Malayali diaspora across the Globe has embarked upon a project for making a data bank of the diaspora.", "answer_start": 1862}], "id": "C_69758fcdfc1f46baba0e92c0f3b0919c_1_q#2", "orig_answer": {"text": "World Malayalee Council, the organisation working with the Malayali diaspora across the Globe has embarked upon a project for making a data bank of the diaspora.", "answer_start": 1862}}, {"followup": "n", "yesno": "x", "question": "Were they ever successful in doing this?", "answers": [{"text": "CANNOTANSWER", "answer_start": 2024}], "id": "C_69758fcdfc1f46baba0e92c0f3b0919c_1_q#3", "orig_answer": {"text": "CANNOTANSWER", "answer_start": 2024}}, {"followup": "n", "yesno": "x", "question": "Do they produce anything from here?", "answers": [{"text": "CANNOTANSWER", "answer_start": 2024}], "id": "C_69758fcdfc1f46baba0e92c0f3b0919c_1_q#4", "orig_answer": {"text": "CANNOTANSWER", "answer_start": 2024}}, {"followup": "n", "yesno": "y", "question": "Is this population still growing?", "answers": [{"text": "In 2010, the Census of Population of Singapore reported that there were 26,348 Malayalees in Singapore.", "answer_start": 1461}], "id": "C_69758fcdfc1f46baba0e92c0f3b0919c_1_q#5", "orig_answer": {"text": "In 2010, the Census of Population of Singapore reported that there were 26,348 Malayalees in Singapore.", "answer_start": 1461}}, {"followup": "n", "yesno": "x", "question": "Is the country thriving?", "answers": [{"text": "CANNOTANSWER", "answer_start": 2024}], "id": "C_69758fcdfc1f46baba0e92c0f3b0919c_1_q#6", "orig_answer": {"text": "CANNOTANSWER", "answer_start": 2024}}], "id": "C_69758fcdfc1f46baba0e92c0f3b0919c_1"}], "section_title": "Geographic distribution and population", "background": "The Malayali people or Keralite people (also spelt Malayalee, Malayalam script: mlyaalli and keerlliiy[?]) are an Indian ethnic group originating from the present-day state of Kerala, located in South India. They are identified as native speakers of the Malayalam language, which is classified as part of the Dravidian family of languages. As they primarily live in Kerala, the word Keralite is used as an alternative to Malayali. According to the Indian census of 2011, there are approximately 33 million Malayalis in Kerala, making up 96.7% of the total population of the state.", "title": "Malayali"}, {"paragraphs": [{"context": "Malayalam is the language spoken by the Malayalis. Malayalam is derived from old Tamil and Sanskrit in the 6th century. For cultural purposes Malayalam and Sanskrit formed a language known as Manipravalam, where both languages were used in an alternating style. Malayalam is the only among the major Dravidian languages without diglossia. This means, that the Malayalam which is spoken does not differ from the written variant. Malayalam is written using the Malayalam script. Malayalam literature is ancient in origin. The oldest literature works in Malayalam, distinct from the Tamil tradition, is dated between the 9th century and 11th century. Malayalam literature includes the 14th century Niranam poets (Madhava Panikkar, Sankara Panikkar and Rama Panikkar), whose works mark the dawn of both modern Malayalam language and indigenous Keralite poetry. The Triumvirate of poets (Kavithrayam: Kumaran Asan, Vallathol Narayana Menon and Ulloor S. Parameswara Iyer) are recognized for moving Keralite poetry away from archaic sophistry and metaphysics and towards a more lyrical mode. In 19th century Chavara Kuriakose Elias, the founder of Carmelites of Mary Immaculate and Congregation of Mother of Carmel congregations, contribute different streams in the Malayalam Literature. All his works are written between 1829 and 1870. Chavara's contribution to Malayalam literature includes, Chronicles, Poems - athmanuthapam (compunction of the soul), Maranaveettil Paduvanulla Pana (Poem to sing in the bereaved house) and Anasthasiayude Rakthasakshyam - and other Literary works . In the second half of the 20th century, Jnanpith awardees like G. Sankara Kurup, S. K. Pottekkatt, Thakazhi Sivasankara Pillai and M. T. Vasudevan Nair and non Jnanpith awardees like Vaikom Muhammad Basheer have made valuable contributions to the Malayalam literature. Later, such Keralite writers as O. V. Vijayan, Kamaladas, M. Mukundan, and Booker Prize winner Arundhati Roy, whose 1996 semi-autobiographical bestseller The God of Small Things is set in the Kottayam town of Ayemenem, have gained international recognition. Kerala remains a fascinating riddle for the Indian diaspora, especially the younger generations - World Malayali Council with its sister organisation, International Institute for Scientific and Academic Collaboration (IISAC) has come out with a comprehensive book on Kerala titled 'Introduction to Kerala Studies,' specially intended fo
没有合适的资源?快使用搜索试试~ 我知道了~
机器学习(大模型):机器阅读理解和问答任务的数据集
共2个文件
json:2个
0 下载量 195 浏览量
2024-11-24
16:45:52
上传
评论
收藏 17.59MB ZIP 举报
温馨提示
数据集是一个用于模拟、理解和参与信息寻求对话的资源。它包含了14,000个信息寻求型的问答对话,总共涉及100,000个问题。这些对话由两名众包工作者进行:(1)一名学生,他们提出一系列自由格式的问题,以尽可能多地了解一个隐藏的维基百科文本;(2)一名教师,他们通过提供文本的简短摘录来回答这些问题。QuAC数据集引入了现有机器理解数据集中未发现的挑战:其问题通常是开放式的、无答案的,或者只在对话上下文中才有意义。 QuAC的特点在于它的问题高度依赖上下文,有时甚至是无法回答的。这要求参与对话的模型不仅要理解单个问题,还要能够跟踪整个对话的历史,以便在对话的上下文中生成合适的回答。这种类型的问答任务比传统的问答任务更具挑战性,因为它需要模型具备更强的上下文理解能力和对话管理能力。 此外,QuAC数据集的问题类型多样,包括事实性问题、推理问题和开放性问题,这增加了模型理解和生成准确答案的难度。对话的动态性和不确定性也使得数据标注和模型训练变得复杂。QuAC数据集的发布为研究者提供了一个标准化的评估平台,极大地推动了对话系统在教育、客户服务等领域的应用,并促进了自然语言处理技术的发展。
资源推荐
资源详情
资源评论
收起资源包目录
机器阅读理解和问答任务的数据集.zip (2个子文件)
val_v0.2 QuaC.json 8.52MB
train_v0.2 QuaC.json 64.96MB
共 2 条
- 1
资源评论
庄小焱
- 粉丝: 2838
- 资源: 82
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
最新资源
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功