A model for evaluating the quality of user-created documents

Linh Hoang, Jung Tae Lee, Young In Song, Hae-Chang Rim

Research output: Chapter in Book/Report/Conference proceedingConference contribution

11 Citations (Scopus)

Abstract

In this paper, we propose a model for evaluating the quality of general user-created documents. The model is based on supervised classification approach, in which output scores are considered as quality of given document. In order to utilize both textual and non-textual attributes of documents, we incorporated a number of objectively measurable, real-valued features selected upon predefined criteria for quality. Experiments on two datasets of real world documents show that textual features are stable indicators for evaluating documents' quality. Some features are inferred to be effective for general kinds of documents.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages496-501
Number of pages6
Volume4993 LNCS
DOIs
Publication statusPublished - 2008 Jun 25
Event4th Asia Information Retrieval Symposium, AIRS 2008 - Harbin, China
Duration: 2008 Jan 152008 Jan 18

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4993 LNCS
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other4th Asia Information Retrieval Symposium, AIRS 2008
CountryChina
CityHarbin
Period08/1/1508/1/18

Fingerprint

Supervised Classification
Model
Experiments
Attribute
Datasets
Output
Experiment

ASJC Scopus subject areas

  • Computer Science(all)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Theoretical Computer Science

Cite this

Hoang, L., Lee, J. T., Song, Y. I., & Rim, H-C. (2008). A model for evaluating the quality of user-created documents. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4993 LNCS, pp. 496-501). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4993 LNCS). https://doi.org/10.1007/978-3-540-68636-1_54

A model for evaluating the quality of user-created documents. / Hoang, Linh; Lee, Jung Tae; Song, Young In; Rim, Hae-Chang.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 4993 LNCS 2008. p. 496-501 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4993 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Hoang, L, Lee, JT, Song, YI & Rim, H-C 2008, A model for evaluating the quality of user-created documents. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 4993 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 4993 LNCS, pp. 496-501, 4th Asia Information Retrieval Symposium, AIRS 2008, Harbin, China, 08/1/15. https://doi.org/10.1007/978-3-540-68636-1_54
Hoang L, Lee JT, Song YI, Rim H-C. A model for evaluating the quality of user-created documents. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 4993 LNCS. 2008. p. 496-501. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-540-68636-1_54
Hoang, Linh ; Lee, Jung Tae ; Song, Young In ; Rim, Hae-Chang. / A model for evaluating the quality of user-created documents. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 4993 LNCS 2008. pp. 496-501 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{01763c809cd740a1aedf23d5c8b5fee9,
title = "A model for evaluating the quality of user-created documents",
abstract = "In this paper, we propose a model for evaluating the quality of general user-created documents. The model is based on supervised classification approach, in which output scores are considered as quality of given document. In order to utilize both textual and non-textual attributes of documents, we incorporated a number of objectively measurable, real-valued features selected upon predefined criteria for quality. Experiments on two datasets of real world documents show that textual features are stable indicators for evaluating documents' quality. Some features are inferred to be effective for general kinds of documents.",
author = "Linh Hoang and Lee, {Jung Tae} and Song, {Young In} and Hae-Chang Rim",
year = "2008",
month = "6",
day = "25",
doi = "10.1007/978-3-540-68636-1_54",
language = "English",
isbn = "3540686339",
volume = "4993 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "496--501",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - A model for evaluating the quality of user-created documents

AU - Hoang, Linh

AU - Lee, Jung Tae

AU - Song, Young In

AU - Rim, Hae-Chang

PY - 2008/6/25

Y1 - 2008/6/25

N2 - In this paper, we propose a model for evaluating the quality of general user-created documents. The model is based on supervised classification approach, in which output scores are considered as quality of given document. In order to utilize both textual and non-textual attributes of documents, we incorporated a number of objectively measurable, real-valued features selected upon predefined criteria for quality. Experiments on two datasets of real world documents show that textual features are stable indicators for evaluating documents' quality. Some features are inferred to be effective for general kinds of documents.

AB - In this paper, we propose a model for evaluating the quality of general user-created documents. The model is based on supervised classification approach, in which output scores are considered as quality of given document. In order to utilize both textual and non-textual attributes of documents, we incorporated a number of objectively measurable, real-valued features selected upon predefined criteria for quality. Experiments on two datasets of real world documents show that textual features are stable indicators for evaluating documents' quality. Some features are inferred to be effective for general kinds of documents.

UR - http://www.scopus.com/inward/record.url?scp=45449086664&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=45449086664&partnerID=8YFLogxK

U2 - 10.1007/978-3-540-68636-1_54

DO - 10.1007/978-3-540-68636-1_54

M3 - Conference contribution

AN - SCOPUS:45449086664

SN - 3540686339

SN - 9783540686330

VL - 4993 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 496

EP - 501

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

ER -