Natural language processing in java
Hi,
Sorry if i might go on a tangent with this one. I'm not a java coder however i want to get a program built that can analyse text data using NLP and return useful insights out it (essentially text classification). Can anyone help me to figure out the following questions:
1. Is it possible to create one such program on one's self?
2. How difficult/easy is it?
3. How long would it take?
4. Does it require a coder with significant mathematical skill or a good coder with some outside help would do?
5. Are there any softwares in the market that do much the same things and how long would training on such a program take?
I need to get this software in place in about 2 weeks. Quick answered would be appreciated! :)
Re: Natural language processing in java
1. Natural language processing is still an open area of research (see Wikipedia: Natural Language Processing). So, it is extremely hard to create (even with a dozen really smart computer scientists and linguists).
2. See the answer to number 1
3. Hmm.. I'll let you know when someone creates a "good" natural language processor (there are a ton of average ones, but sometimes these can be sufficient.)
4. Depending on the application, you will need a strong background in logic and/or statistics, as well as computer science.
5. Stanford University has a natural language processor licenced under GPL, you might want to take a look at it. Link: The Stanford NLP. There's also a ton of research papers published in the subject of Natural Language processing, a quick search on Google Scholar might point you in the right direction.
Re: Natural language processing in java
@Helloworld922: Thanks for the help! I've used the tool called ProAnalyst to certain degree of consistancy, however, it doesnt give me great results which I feel can come out of programs based on NLP. Are you aware of any commercial softwares that might help me with this? I've heard of IBM's SPSS tools, but dont know much about them.
Re: Natural language processing in java
Can anyone supply the coding of this project?
Re: Natural language processing in java
how to compute relative frequency of ngrams?
it is give that P(am|eating) = P (am & eating) / P (eating)
i know values P(am) and P(eating) . what is P(am & eating)?
Re: Natural language processing in java
how to compute relative frequency of ngrams?
it is give that P(am|eating) = P (am & eating) / P (eating)
i know values P(am) and P(eating) . what is P(am & eating)?