TY - GEN
T1 - An efficient string searching algorithm based on occurrence frequency and pattern of vowels and consonants in a pattern
AU - Chung, Kwang Sik
AU - Kim, Soo Young
AU - Yu, Heon Chang
N1 - Funding Information:
This work was supported by 2014 Korea National Open University Research Fund.
Publisher Copyright:
© Springer International Publishing Switzerland 2015.
PY - 2015
Y1 - 2015
N2 - Information and communication technologies enable people to access to various documentations and information. Huge documents and information in the Internet or storage disks have made search time more important. Especially as the volume size and the number of documents on the Internet increase, string search times and costs increase have become big burden to search service. But, most of string searching algorithms have not consider lexical structures nor vowels’ occurrence frequency. Formal documents (articles, news, novels, etc.) have important characteristic that is ‘well-formed written’ English. And words of formal documents have ‘limit number of words and alphabets’ that are listed in a dictionary. The ‘limit number of words and alphabets’ has predictable occurrence probability in real world’s documentations. We try to use the alphabet occurrence probability as first search condition. We analyze all the words in the dictionaries (dictionary of free dictionary project, scrabblehelper–Revision 20, Winedit dictionary) and calculate each alphabet occurrence probability of repeated vowels, repeated consonants, not-repeated vowels and not-repeated consonants. In this paper, we define and propose the search rules and string searching algorithm, based on occurrence frequency and patterns of vowels and consonants. We use only the occurrence patterns and repeated positions of vowel and consonant in a text. Therefore, in the real world, proposed string searching algorithm (OFRP algorithm) is based on occurrence frequency and repetition pattern of vowels and consonants and is usefully and effectively applied to string search service and web search engine.
AB - Information and communication technologies enable people to access to various documentations and information. Huge documents and information in the Internet or storage disks have made search time more important. Especially as the volume size and the number of documents on the Internet increase, string search times and costs increase have become big burden to search service. But, most of string searching algorithms have not consider lexical structures nor vowels’ occurrence frequency. Formal documents (articles, news, novels, etc.) have important characteristic that is ‘well-formed written’ English. And words of formal documents have ‘limit number of words and alphabets’ that are listed in a dictionary. The ‘limit number of words and alphabets’ has predictable occurrence probability in real world’s documentations. We try to use the alphabet occurrence probability as first search condition. We analyze all the words in the dictionaries (dictionary of free dictionary project, scrabblehelper–Revision 20, Winedit dictionary) and calculate each alphabet occurrence probability of repeated vowels, repeated consonants, not-repeated vowels and not-repeated consonants. In this paper, we define and propose the search rules and string searching algorithm, based on occurrence frequency and patterns of vowels and consonants. We use only the occurrence patterns and repeated positions of vowel and consonant in a text. Therefore, in the real world, proposed string searching algorithm (OFRP algorithm) is based on occurrence frequency and repetition pattern of vowels and consonants and is usefully and effectively applied to string search service and web search engine.
KW - Occurrence frequency of consonants
KW - Occurrence frequency of vowels
KW - Repetition pattern of consonants
KW - Repetition pattern of vowels
KW - String search
KW - Vowel and consonant-based string search
UR - http://www.scopus.com/inward/record.url?scp=84945948004&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-23862-3_27
DO - 10.1007/978-3-319-23862-3_27
M3 - Conference contribution
AN - SCOPUS:84945948004
SN - 9783319238616
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 277
EP - 286
BT - Intelligence Science and Big Data Engineering
A2 - Zhou, Zhi-Hua
A2 - Fu, Baochuan
A2 - Hu, Fuyuan
A2 - Zhang, Zhancheng
A2 - Liu, Zhi-Yong
A2 - Zhang, Yanning
A2 - He, Xiaofei
A2 - Gao, Xinbo
PB - Springer Verlag
T2 - 5th International Conference on Intelligence Science and Big Data Engineering, IScIDE 2015
Y2 - 14 June 2015 through 16 June 2015
ER -