Subject-based extraction of a latent blog community

Seok Ho Yoon, Jung Hwan Shin, Sang Wook Kim, Sunju Park, Jae Bum Lee

Research output: Contribution to journalArticle

5 Citations (Scopus)

Abstract

In the blogosphere, there exist posts relevant to a particular subject and blogs that show interest in the subject. In this paper, we define a set of such posts and blogs as a blog community and propose a method for extracting the blog community associated with a particular subject. The proposed method is based on the idea that the blogs who have performed actions (e.g., read, comment, trackback, scrap) to the posts of a particular subject are the ones with interest in the subject, and that the posts that have received actions from such blogs are the ones that contain the subject. The proposed method starts with a small number of manually-selected seed posts containing the subject. Then, the method selects the blogs that have performed actions to the seed posts over some threshold and the posts that have received actions over some threshold. Repeating these two steps gradually expands the blog community. This paper presents various techniques to improve the accuracy of the proposed method. The experimental results show that the proposed method exhibits a higher level of accuracy than the methods proposed in prior research. This paper also discusses business applications of the extracted community, such as target marketing, market monitoring, improving search results, finding power bloggers, and revitalization of the blogosphere.

Original languageEnglish
Pages (from-to)215-229
Number of pages15
JournalInformation sciences
Volume184
Issue number1
DOIs
Publication statusPublished - 2012 Feb 1

Fingerprint

Blogs
Seed
Community
Expand
Marketing
Monitoring
Target
Experimental Results

All Science Journal Classification (ASJC) codes

  • Software
  • Control and Systems Engineering
  • Theoretical Computer Science
  • Computer Science Applications
  • Information Systems and Management
  • Artificial Intelligence

Cite this

Yoon, Seok Ho ; Shin, Jung Hwan ; Kim, Sang Wook ; Park, Sunju ; Lee, Jae Bum. / Subject-based extraction of a latent blog community. In: Information sciences. 2012 ; Vol. 184, No. 1. pp. 215-229.
@article{62d67a4532bf447aad768382e0286544,
title = "Subject-based extraction of a latent blog community",
abstract = "In the blogosphere, there exist posts relevant to a particular subject and blogs that show interest in the subject. In this paper, we define a set of such posts and blogs as a blog community and propose a method for extracting the blog community associated with a particular subject. The proposed method is based on the idea that the blogs who have performed actions (e.g., read, comment, trackback, scrap) to the posts of a particular subject are the ones with interest in the subject, and that the posts that have received actions from such blogs are the ones that contain the subject. The proposed method starts with a small number of manually-selected seed posts containing the subject. Then, the method selects the blogs that have performed actions to the seed posts over some threshold and the posts that have received actions over some threshold. Repeating these two steps gradually expands the blog community. This paper presents various techniques to improve the accuracy of the proposed method. The experimental results show that the proposed method exhibits a higher level of accuracy than the methods proposed in prior research. This paper also discusses business applications of the extracted community, such as target marketing, market monitoring, improving search results, finding power bloggers, and revitalization of the blogosphere.",
author = "Yoon, {Seok Ho} and Shin, {Jung Hwan} and Kim, {Sang Wook} and Sunju Park and Lee, {Jae Bum}",
year = "2012",
month = "2",
day = "1",
doi = "10.1016/j.ins.2011.08.004",
language = "English",
volume = "184",
pages = "215--229",
journal = "Information Sciences",
issn = "0020-0255",
publisher = "Elsevier Inc.",
number = "1",

}

Subject-based extraction of a latent blog community. / Yoon, Seok Ho; Shin, Jung Hwan; Kim, Sang Wook; Park, Sunju; Lee, Jae Bum.

In: Information sciences, Vol. 184, No. 1, 01.02.2012, p. 215-229.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Subject-based extraction of a latent blog community

AU - Yoon, Seok Ho

AU - Shin, Jung Hwan

AU - Kim, Sang Wook

AU - Park, Sunju

AU - Lee, Jae Bum

PY - 2012/2/1

Y1 - 2012/2/1

N2 - In the blogosphere, there exist posts relevant to a particular subject and blogs that show interest in the subject. In this paper, we define a set of such posts and blogs as a blog community and propose a method for extracting the blog community associated with a particular subject. The proposed method is based on the idea that the blogs who have performed actions (e.g., read, comment, trackback, scrap) to the posts of a particular subject are the ones with interest in the subject, and that the posts that have received actions from such blogs are the ones that contain the subject. The proposed method starts with a small number of manually-selected seed posts containing the subject. Then, the method selects the blogs that have performed actions to the seed posts over some threshold and the posts that have received actions over some threshold. Repeating these two steps gradually expands the blog community. This paper presents various techniques to improve the accuracy of the proposed method. The experimental results show that the proposed method exhibits a higher level of accuracy than the methods proposed in prior research. This paper also discusses business applications of the extracted community, such as target marketing, market monitoring, improving search results, finding power bloggers, and revitalization of the blogosphere.

AB - In the blogosphere, there exist posts relevant to a particular subject and blogs that show interest in the subject. In this paper, we define a set of such posts and blogs as a blog community and propose a method for extracting the blog community associated with a particular subject. The proposed method is based on the idea that the blogs who have performed actions (e.g., read, comment, trackback, scrap) to the posts of a particular subject are the ones with interest in the subject, and that the posts that have received actions from such blogs are the ones that contain the subject. The proposed method starts with a small number of manually-selected seed posts containing the subject. Then, the method selects the blogs that have performed actions to the seed posts over some threshold and the posts that have received actions over some threshold. Repeating these two steps gradually expands the blog community. This paper presents various techniques to improve the accuracy of the proposed method. The experimental results show that the proposed method exhibits a higher level of accuracy than the methods proposed in prior research. This paper also discusses business applications of the extracted community, such as target marketing, market monitoring, improving search results, finding power bloggers, and revitalization of the blogosphere.

UR - http://www.scopus.com/inward/record.url?scp=80055025795&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=80055025795&partnerID=8YFLogxK

U2 - 10.1016/j.ins.2011.08.004

DO - 10.1016/j.ins.2011.08.004

M3 - Article

AN - SCOPUS:80055025795

VL - 184

SP - 215

EP - 229

JO - Information Sciences

JF - Information Sciences

SN - 0020-0255

IS - 1

ER -