An improved kNN learning based Korean text classifier with heuristic information

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

5 Citations (Scopus)

Abstract

Automatic text categorization is the problem of assigning predefined categories to free-text documents based on the likelihood suggested by a training set of labelled texts. The kNN learning based text classifier is a well-known statistical approach, and its algorithm is quite simple. While the method has been applied to many systems and has shown relatively good performance, a thorough evaluation of the method has rarely been done. Several parameters play important roles in the performance of the method: the decision function, the k value of kNN, and the size of the feature set. This paper focuses on a method for improving a kNN learning based Korean text classifier by using heuristic information found experimentally. Our results show that a kNN method with carefully chosen parameters significantly improves performance and decreases the size of the feature set.
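As a rough illustration of the three parameters named in the abstract, the following is a minimal Python sketch of a kNN text classifier, not the paper's implementation. Everything in it is an assumption made for illustration: whitespace tokenization (which would not suit Korean, where morphological analysis is normally needed), feature selection by document frequency, cosine similarity, and the two decision functions ("majority" and "weighted"). The function names and the toy training set are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: whitespace tokenization, for illustration only.
    return text.lower().split()


def select_features(train, size):
    # Keep the `size` terms with the highest document frequency
    # (ties broken alphabetically so the result is deterministic).
    df = Counter()
    for text, _ in train:
        df.update(set(tokenize(text)))
    ranked = sorted(df, key=lambda t: (-df[t], t))
    return set(ranked[:size])


def vectorize(text, features):
    # Term-frequency vector restricted to the selected feature set.
    return Counter(t for t in tokenize(text) if t in features)


def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a if t in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def knn_classify(query, train, k=3, feature_size=50, decision="weighted"):
    features = select_features(train, feature_size)
    q = vectorize(query, features)
    # Rank training documents by similarity and keep the k nearest.
    neighbours = sorted(
        ((cosine(q, vectorize(text, features)), label) for text, label in train),
        key=lambda pair: pair[0],
        reverse=True,
    )[:k]
    votes = Counter()
    for sim, label in neighbours:
        # Decision function: similarity-weighted vote or plain majority vote.
        votes[label] += sim if decision == "weighted" else 1
    # Ties are broken alphabetically by category name.
    return max(sorted(votes), key=lambda c: votes[c])


# Hypothetical toy training set (English, for readability).
train = [
    ("stock market prices rise", "economy"),
    ("bank raises interest rates", "economy"),
    ("market interest in bank stock", "economy"),
    ("team wins football match", "sports"),
    ("football player scores goal", "sports"),
    ("coach praises team after match", "sports"),
]

print(knn_classify("football team scores", train, k=3))
print(knn_classify("bank stock prices", train, k=3, decision="majority"))
```

Varying `k`, `feature_size`, and `decision` on a held-out set is one simple way to explore the kind of parameter sensitivity the abstract refers to.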

Original language: English
Title of host publication: ICONIP 2002 - Proceedings of the 9th International Conference on Neural Information Processing: Computational Intelligence for the E-Age
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 731-735
Number of pages: 5
Volume: 2
ISBN (Print): 9810475241, 9789810475246
DOIs: https://doi.org/10.1109/ICONIP.2002.1198154
Publication status: Published - 2002
Externally published: Yes
Event: 9th International Conference on Neural Information Processing, ICONIP 2002 - Singapore, Singapore
Duration: 2002 Nov 18 to 2002 Nov 22

Other

Other: 9th International Conference on Neural Information Processing, ICONIP 2002
Country: Singapore
City: Singapore
Period: 02/11/18 to 02/11/22

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Information Systems
  • Signal Processing

Cite this

Lim, H. S. (2002). An improved kNN learning based Korean text classifier with heuristic information. In ICONIP 2002 - Proceedings of the 9th International Conference on Neural Information Processing: Computational Intelligence for the E-Age (Vol. 2, pp. 731-735). [1198154] Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICONIP.2002.1198154

An improved kNN learning based Korean text classifier with heuristic information. / Lim, Heui Seok.

ICONIP 2002 - Proceedings of the 9th International Conference on Neural Information Processing: Computational Intelligence for the E-Age. Vol. 2 Institute of Electrical and Electronics Engineers Inc., 2002. p. 731-735 1198154.

Lim, HS 2002, An improved kNN learning based Korean text classifier with heuristic information. in ICONIP 2002 - Proceedings of the 9th International Conference on Neural Information Processing: Computational Intelligence for the E-Age. vol. 2, 1198154, Institute of Electrical and Electronics Engineers Inc., pp. 731-735, 9th International Conference on Neural Information Processing, ICONIP 2002, Singapore, Singapore, 02/11/18. https://doi.org/10.1109/ICONIP.2002.1198154
Lim HS. An improved kNN learning based Korean text classifier with heuristic information. In ICONIP 2002 - Proceedings of the 9th International Conference on Neural Information Processing: Computational Intelligence for the E-Age. Vol. 2. Institute of Electrical and Electronics Engineers Inc. 2002. p. 731-735. 1198154 https://doi.org/10.1109/ICONIP.2002.1198154
Lim, Heui Seok. / An improved kNN learning based Korean text classifier with heuristic information. ICONIP 2002 - Proceedings of the 9th International Conference on Neural Information Processing: Computational Intelligence for the E-Age. Vol. 2 Institute of Electrical and Electronics Engineers Inc., 2002. pp. 731-735
@inproceedings{e04f3d2ff6104e2a96384a2b4b6770f4,
title = "An improved kNN learning based Korean text classifier with heuristic information",
abstract = "Automatic text categorization is the problem of assigning predefined categories to free-text documents based on the likelihood suggested by a training set of labelled texts. The kNN learning based text classifier is a well-known statistical approach, and its algorithm is quite simple. While the method has been applied to many systems and has shown relatively good performance, a thorough evaluation of the method has rarely been done. Several parameters play important roles in the performance of the method: the decision function, the k value of kNN, and the size of the feature set. This paper focuses on a method for improving a kNN learning based Korean text classifier by using heuristic information found experimentally. Our results show that a kNN method with carefully chosen parameters significantly improves performance and decreases the size of the feature set.",
author = "Lim, {Heui Seok}",
year = "2002",
doi = "10.1109/ICONIP.2002.1198154",
language = "English",
isbn = "9810475241",
volume = "2",
pages = "731--735",
booktitle = "ICONIP 2002 - Proceedings of the 9th International Conference on Neural Information Processing: Computational Intelligence for the E-Age",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - An improved kNN learning based Korean text classifier with heuristic information

AU - Lim, Heui Seok

PY - 2002

Y1 - 2002

N2 - Automatic text categorization is the problem of assigning predefined categories to free-text documents based on the likelihood suggested by a training set of labelled texts. The kNN learning based text classifier is a well-known statistical approach, and its algorithm is quite simple. While the method has been applied to many systems and has shown relatively good performance, a thorough evaluation of the method has rarely been done. Several parameters play important roles in the performance of the method: the decision function, the k value of kNN, and the size of the feature set. This paper focuses on a method for improving a kNN learning based Korean text classifier by using heuristic information found experimentally. Our results show that a kNN method with carefully chosen parameters significantly improves performance and decreases the size of the feature set.

UR - http://www.scopus.com/inward/record.url?scp=84964546317&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84964546317&partnerID=8YFLogxK

U2 - 10.1109/ICONIP.2002.1198154

DO - 10.1109/ICONIP.2002.1198154

M3 - Conference contribution

SN - 9810475241

SN - 9789810475246

VL - 2

SP - 731

EP - 735

BT - ICONIP 2002 - Proceedings of the 9th International Conference on Neural Information Processing: Computational Intelligence for the E-Age

PB - Institute of Electrical and Electronics Engineers Inc.

ER -