Commit c08dbdd

Merge pull request keon#227 from saitros/master
Add Multiple Korean Datasets
2 parents 7929aa9 + f304d11 commit c08dbdd

1 file changed: +4 -0 lines changed

README.md

Lines changed: 4 additions & 0 deletions
@@ -316,6 +316,10 @@ NLP as API with higher level functionality such as NER, Topic tagging and so on
 - [KAIST Corpus](http://semanticweb.kaist.ac.kr/home/index.php/KAIST_Corpus) - A corpus from the Korea Advanced Institute of Science and Technology in Korean.
 - [Naver Sentiment Movie Corpus in Korean](https://github.com/e9t/nsmc/)
 - [Chosun Ilbo archive](http://srchdb1.chosun.com/pdf/i_archive/) - dataset in Korean from one of the major newspapers in South Korea, the Chosun Ilbo.
+- [Chat data](https://github.com/songys/Chatbot_data) - Chatbot data in Korean
+- [Petitions](https://github.com/akngs/petitions) - Collect expired petition data from the Blue House National Petition Site.
+- [Korean Parallel corpora](https://github.com/j-min/korean-parallel-corpora) - Neural Machine Translation(NMT) Dataset for **Korean to French** & **Korean to English**
+- [KorQuAD](https://korquad.github.io/) - Korean SQuAD dataset with Wiki HTML source. Mentions both v1.0 and v2.1 at the time of adding to Awesome NLP

 ## NLP in Arabic
