Difference between revisions of "Extract events from teletext"

From Master Projects
Jump to: navigation, search
Line 1: Line 1:
 
{{Projectproposal
 
{{Projectproposal
 
|Contact person=Valentina Maccatrozzo
 
|Contact person=Valentina Maccatrozzo
|Contact person2=Lora Aroyo
 
 
|Project page=http://www.vista-tv.eu/
 
|Project page=http://www.vista-tv.eu/
 
|Fulfilled=No
 
|Fulfilled=No
 
}}
 
}}
 
Teletext services are either disappearing or moving to the web. Those that are moving to the web, update their content, making it clickable and navigable, i.e. standard html content. Besides the standard program guide information, they expose also news. This project has the objective of extracting events from teletext content and matching those news with newspapers articles. Practically speaking you need to build a crawler to extract the content of the teletext website, extract the news from the content and search for those news in other newspapers, for instance using the NYTimes API (http://developer.nytimes.com) or LexisNexis (accessible only from the VU http://academic.lexisnexis.nl/).
 
Teletext services are either disappearing or moving to the web. Those that are moving to the web, update their content, making it clickable and navigable, i.e. standard html content. Besides the standard program guide information, they expose also news. This project has the objective of extracting events from teletext content and matching those news with newspapers articles. Practically speaking you need to build a crawler to extract the content of the teletext website, extract the news from the content and search for those news in other newspapers, for instance using the NYTimes API (http://developer.nytimes.com) or LexisNexis (accessible only from the VU http://academic.lexisnexis.nl/).

Revision as of 09:49, 30 October 2012


About Extract events from teletext

Description

Teletext services are either disappearing or moving to the web. Those that are moving to the web, update their content, making it clickable and navigable, i.e. standard html content. Besides the standard program guide information, they expose also news. This project has the objective of extracting events from teletext content and matching those news with newspapers articles. Practically speaking you need to build a crawler to extract the content of the teletext website, extract the news from the content and search for those news in other newspapers, for instance using the NYTimes API (http://developer.nytimes.com) or LexisNexis (accessible only from the VU http://academic.lexisnexis.nl/).