Apr 29, 2008

Initial discussions

The first two discussions with Shalini Urs:
1st Discussion: Identify the problem. I brought the following candidate topics with me: patent mapping using text mining techniques, collaborative (collective) intelligence, open science, and open innovation.

We settled on the topic "Text mining" because it seems doable and we know experts in the area whom we can contact for guidance. The important thing is to develop a new method/algorithm/technology and use a corpus to demonstrate it. That corpus may be patents or any other collection.

Additional target: one paper per quarter

2nd Discussion:
Implementation:
- Download patents and time the downloads to gauge the pace (to estimate how much time is required to download all the files)
- Build a test database out of these downloaded patents
- Install the text mining software from the book by Manu K. and see how it works for the test database.
- Results could be published as a paper
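
The timing step in the list above could be sketched roughly as follows. This is only a minimal illustration, not a chosen tool: the names estimate_download_seconds and fake_fetch are hypothetical, fake_fetch stands in for whatever real download routine is eventually used, and the estimate assumes download time scales linearly with the number of patents.

```python
import time
from typing import Callable, Iterable


def estimate_download_seconds(
    fetch: Callable[[str], bytes],
    sample_ids: Iterable[str],
    total_count: int,
) -> float:
    """Time fetch() over a small sample and extrapolate linearly to total_count."""
    ids = list(sample_ids)
    if not ids:
        raise ValueError("need at least one sample id")
    start = time.perf_counter()
    for patent_id in ids:
        fetch(patent_id)
    elapsed = time.perf_counter() - start
    # Average seconds per patent, scaled up to the full corpus size.
    return elapsed / len(ids) * total_count


def fake_fetch(patent_id: str) -> bytes:
    """Hypothetical stand-in for a real patent download; it only sleeps briefly."""
    time.sleep(0.01)
    return b"placeholder"


if __name__ == "__main__":
    seconds = estimate_download_seconds(fake_fetch, ["p1", "p2", "p3"], 300)
    print(f"Estimated time for 300 patents: {seconds:.1f} s")
```

Replacing fake_fetch with the real download function and using a larger sample should give a more reliable estimate, since network speed varies from one request to the next.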

Other than the practical implementation, I will continue to:
- Advance my knowledge of text mining techniques by reading books and papers
- Summarize the different techniques used for text mining
