Information Extraction from Blogs

Marie-Francine Moens

Zu finden in: Handbook of Research on Web Log Analysis (Seite 469 bis 487), 2008 local

Diese Seite wurde seit 3 Jahren inhaltlich nicht mehr aktualisiert. Unter Umständen ist sie nicht mehr aktuell.

Zusammenfassungen

This chapter introduces information extraction from blog texts. It argues that the classical techniques for information extraction that are commonly used for mining well-formed texts lose some of their validity in the context of blogs. This finding is demonstrated by considering each step in the information extraction process and by illustrating this problem in different applications. In order to tackle the problem of mining content from blogs, algorithms are developed that combine different sources of evidence in the most flexible way. The chapter concludes with ideas for future research.

Von Marie-Francine Moens im Buch Handbook of Research on Web Log Analysis (2008) im Text Information Extraction from Blogs

This chapter is organized as follows. We continue with some background (next section) on information extraction in general and information extraction from blogs in particular. We outline the history of information extraction. In a subsequent section we consider the different steps in an information extraction task and focus on particular issues when dealing with blog data. We discuss tokenization and lexical analysis, natural language processing and finally information extraction. In the latter part of the chapter we go deeper into a few specific applications: topic and thread detection, opinion mining, and argumentation detection. Wherever possible, we illustrate our findings with our own research experiences. We conclude with a number of prospects for further research.

Von Marie-Francine Moens im Buch Handbook of Research on Web Log Analysis (2008) im Text Information Extraction from Blogs

Dieser Text erwähnt ...

Begriffe
KB IB clear

machine learning ,

Phishing ,

Weblogs

blogging

Dieser Text erwähnt vermutlich nicht ...

Nicht erwähnte Begriffe

Weblogs in education

Anderswo finden

Volltext dieses Dokuments

Information Extraction from Blogs: Article als Fulltext-PDF (IGI-Global) ( lokal

, 430 kByte; WWW

Link unterbrochen? Letzte Überprüfung: 2021-03-21 Letzte erfolgreiche Überprüfung: 2015-02-28)

Anderswo suchen

Beat und dieser Text

Beat hat Dieser Text während seiner Zeit am Institut für Medien und Schule (IMS) ins Biblionetz aufgenommen. Beat besitzt kein physisches, aber ein digitales Exemplar. Eine digitale Version ist auf dem Internet verfügbar (s.o.). Aufgrund der wenigen Einträge im Biblionetz scheint er es nicht wirklich gelesen zu haben. Es gibt bisher auch nur wenige Objekte im Biblionetz, die dieses Werk zitieren.

Beats Biblionetz - Texte