
This chapter is organized as follows. We
continue with some background (next section) on
information extraction in general and information
extraction from blogs in particular. We outline the
history of information extraction. In a subsequent
section we consider the different steps in an information
extraction task and focus on particular
issues when dealing with blog data. We discuss
tokenization and lexical analysis, natural language
processing and finally information extraction.
In the latter part of the chapter we go deeper
into a few specific applications: topic and thread
detection, opinion mining, and argumentation
detection. Wherever possible, we illustrate our
findings with our own research experiences. We conclude with a number of prospects for further
research.