This is an exciting vision for reordering how end-users retrieve and organize digital information. Once information is encoded in a database, it could be organized into a taxonomy or searched over by textual attribute or feature. This stands as a vast improvement over the usual search protocol: index content and query full-text documents by keyword. IE is an attempt to convert information from various text documents into database entries, which plays a key role in improving online knowledge discovery.
The two main methods of information extraction technology are
§ Natural Language Processing
§ Wrapper Induction