Data Set A is a Pattern Matching Problem

Jens Kohlmorgen, Klaus Muller

Research output: Contribution to journal › Article

2 Citations (Scopus)


Several data sets have been proposed for benchmarking in time series prediction. A popular one is Data Set A from the Santa Fe Competition, which has been analyzed in many papers. In this note, it is shown that predicting the continuation of Data Set A is nothing other than a pattern matching problem. Looking at studies of this data set, it is remarkable that most of the very good forecasts of Data Set A used upsampled training data. We explain why upsampling is crucial for this data set. Finally, it is demonstrated that simple pattern matching performs as well as sophisticated prediction methods on Data Set A.
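
To make the idea in the abstract concrete, the following is a minimal sketch of forecasting by pattern matching. It is not the authors' procedure, which this record does not describe. The function names, the linear interpolation used for upsampling, the squared Euclidean distance, and the `window` and `horizon` parameters are all assumptions chosen for illustration.

```python
def upsample_linear(series, factor):
    """Insert factor-1 linearly interpolated points between neighbours.

    Assumption: linear interpolation stands in for whatever upsampling
    scheme a real study would use.
    """
    out = []
    for a, b in zip(series, series[1:]):
        for k in range(factor):
            out.append(a + (b - a) * k / factor)
    out.append(series[-1])
    return out


def pattern_match_forecast(series, window, horizon):
    """Forecast by copying what followed the best-matching past segment.

    The latest `window` values form the query. Every earlier segment that
    still has `horizon` values after it is a candidate; the candidate with
    the smallest squared Euclidean distance to the query wins.
    """
    query = series[-window:]
    best_start, best_dist = None, float("inf")
    # Candidate segments must leave room for `horizon` following values.
    last_start = len(series) - window - horizon
    for start in range(last_start + 1):
        segment = series[start:start + window]
        dist = sum((s - q) ** 2 for s, q in zip(segment, query))
        if dist < best_dist:
            best_start, best_dist = start, dist
    if best_start is None:
        raise ValueError("series too short for this window and horizon")
    follow = best_start + window
    return series[follow:follow + horizon]
```

Under these assumptions, a caller would pass the training series through `upsample_linear` first, run `pattern_match_forecast` on the result, and then keep every `factor`-th predicted value to return to the original sampling rate.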

Original language: English
Pages (from-to): 43-47
Number of pages: 5
Journal: Neural Processing Letters
Issue number: 1
Publication status: Published - 1998 Dec 1
Externally published: Yes


  • Benchmarking
  • Pattern matching
  • Santa Fe Competition
  • Time series prediction

ASJC Scopus subject areas

  • Artificial Intelligence
  • Neuroscience(all)

  • Cite this

    Kohlmorgen, J., & Muller, K. (1998). Data Set A is a Pattern Matching Problem. Neural Processing Letters, 7(1), 43-47.