Learning to Link with Wikipedia – I
I hope to maintain a log of the project I’m working on for my Data Mining course this quarter. I find blogging makes me feel more accountable on a day-to-day basis, and I could really use any help that comes my way on this.
So now to the problem:
Identifying which terms in a Wikipedia article need to be linked to other articles.
I have a dataset to work with. It has information about labels on the data and the words present in each document. I’m now trying to extract which words are linked.
So, yeah, still stuck in preprocessing.
I’ll post the python script after I’m done with it. Which should happen in the next few hours. Till then, I’m offline 🙂
Posted on February 15, 2010, in internet, machine learning, python, text mining and tagged data mining, dataset, naive bayes, prerprocessing, python, wikipedia. Bookmark the permalink. Leave a comment.