Automatic web-scale information extraction

Philip Bohannon, Nilesh Dalvi, Yuval Filmus, Nori Jacoby, Sathiya Keerthi and Alok Kirpal

In this demonstration, we showcase the technologies that we are building at Yahoo! for web-scale information extraction. Given any new website containing semi-structured information about a pre-specified set of schemas, we show how to populate objects in the corresponding schema by automatically extracting information from the website.

Work done while I was a summer intern at Yahoo! Tel Aviv.


 author = {Philip Bohannon and Nilesh Dalvi and Yuval Filmus and Nori
 Jacoby and Sathiya Keerthi and Alok Kirpal},
 title = {Automatic web-scale information extraction},
 booktitle = {Proceedings of the 2012 {ACM} {SIGMOD} International Conference on Management of Data},
 year = {2012},
 pages = {609--612}