What I learned by information retrieval in one week

October 19th, 2008

It has been about a week since I began doing a deeper study of infor­ma­tion retrieval. Actu­ally, every­thing just began with a new course at my uni­ver­sity about that and I just fallen in love almost imme­di­ately. The fact is that this thing really got me inter­ested, and I began doing some exper­i­ments (one involves django as well, keep read­ing to know more).

In this week I learned a lot of things about infor­ma­tion retrieval, text cat­e­go­riza­tion, nat­ural lan­guage pro­cess­ing and machine learn­ing. But the most rel­e­vant thing is: the prin­ci­ples are easy, their imple­men­ta­tion is not. The fact is that most of the tech­niques are rel­a­tively simple but you usu­ally have to deal with very large datasets and this could be chal­leng­ing, since one of the main require­ments about infor­ma­tion retrieval is time. It’s really much more impor­tant that you give less results in one second rather than giving better results in one hour. No one will ever care to use your system if it takes an hour to get some result. And if you’re con­sid­er­ing to store your data in a data­base forget about nor­mal­iza­tion, it wouldn’t really take you anywhere.

(Con­tinue reading…)

4 Comments, tagged with Coding, Django, Python

Running Django with fastcgi

October 8th, 2008

Running django with fastcgi is not a dif­fi­cult task, also because of the excel­lent doc­u­men­ta­tion pro­vided. Anyway the doc pro­vides a very basic script to autom­a­tize the start/stop fcgi process, so today I had to write my own so I don’t have to man­u­ally fix things if some­thing goes wrong since I let my script handle the var­i­ous situations.

(Con­tinue reading…)

0 Comments, tagged with Django

Microblogging

March 10th

twitter (feed #2)
headache. [krat]
7:17pm via Twitter

March 9th

twitter (feed #2)
I hate scribd. [krat]
7:58pm via Twitter
twitter (feed #2)
Drawing fancy charts for my thesis. For some definitions of "fancy". [krat]
4:34pm via Twitter

March 8th

twitter (feed #2)
it's probably better to have a break now, my eyes feel quite tired [krat]
5:45pm via Twitter

March 7th

twitter (feed #2)
cleaning dead RSS feeds from google reader. Apparently, more than half my feeds are dead. [krat]
9:58am via Twitter

March 6th

twitter (feed #2)
I forget things lately. A lot. Damn stressful life. [krat]
4:28pm via Twitter

March 5th

twitter (feed #2)
Another reason to love LaTeX is that you can put your text under version control [krat]
7:24pm via Twitter

March 4th

twitter (feed #2)
Focaccia and beer as study lunch: absolutely priceless. Only downside is that now it's kinda difficult to stay awake. [krat]
2:21pm via Twitter

March 3rd

twitter (feed #2)
I just decided to buy "Flatland" by Edwin Abbot. Only problem is that I won't have time to read it 'til after my graduation [krat]
3:14pm via Twitter
twitter (feed #2)
I'm probably not gonna make this year's #pycon-it. Awful. [krat]
11:34am via Twitter

March 2nd

twitter (feed #2)
God bless \LaTeX [krat]
6:27pm via Twitter

March 1st

twitter (feed #2)
just wrote almost ten pages for my thesis, I guess I'm on a good rhythm [krat]
7:02pm via Twitter

February 26th

twitter (feed #2)
my thesis writing is interspersed by short killing rounds at sauerbraten. That's a good way to get stressed even more. [krat]
5:26pm via Twitter

Powered by Lifestream.

Search

« Authored by Giuliani Vito Ivan »