Natural language processing (NLP) is a subfield of artificial intelligence and machine learning that involves transforming or extracting useful information from natural language data.
Questions tagged [nlp]
151 questions
2
votes
1 answer
Is there a publicly available dataset of Bangla handwriting?
I want to work on a natural language processing project that involves learning from handwritten Bangla text, so I am looking for a publicly available dataset of handwritten Bangla text.
Sanjay
- 123
- 3
1
vote
0 answers
Are there NLP datasets that label each word's type?
I'm looking for NLP datasets that contain word types.
For example,
There is an apple on the table.
They have to table a motion for debate at the next Party Conference.
Clearly, the word table is used as a noun in the first sentence and as a verb in…
MoneyBall
- 111
- 2
1
vote
1 answer
What measure can I use for rank order comparison of most-common n-grams in two sets of data?
I have a real dataset of sequences of events and a "fake" dataset generated using an LSTM model. The two datasets share the same vocabulary but differ in length. I'm putting together an evaluation script to assess how similar the…
Boris
- 121
- 3
1
vote
0 answers
Conversation logs open data
Is there any open data on conversation logs?
I am looking for open data on conversation logs between individual people, ideally private chat logs. I am considering using a bot to scrape a website like Omegle, which has an "ask a question" mode…
FX_NINJA
- 111
- 2
0
votes
0 answers
Reliable Spanish word embeddings (word2vec output format)
I'm looking for Spanish word vectors (preferably in word2vec output format) to use in a Scala NLP application. I've tried the ones on https://github.com/uchile-nlp/spanish-word-embeddings, but they are either in the wrong language, missing headers (not…
Boris
- 121
- 3