stree_datasets/data/tanveer/hill-valley/Hill-Valley.names

1. Title: Hill-Valley Dataset

2. Source Information

   a)	Creators:

	Lee Graham (lee@stellaralchemy.com)

	Franz Oppacher (oppacher@scs.carleton.ca)
	Carleton University, Department of Computer Science
	Intelligent Systems Research Unit
	1125 Colonel By Drive, Ottawa, Ontario, Canada, K1S5B6

   c) Date of release: March 2008

3. Past Usage:

   (a) Non-published. Evaluation of dataset by various learning algorithms in the Waikato Environment for Knowledge Analysis (WEKA).


4. Relevant Information:

   Each record represents 100 points on a two-dimensional graph. When plotted in order (from 1 through 100) as the Y co-ordinate, the points will create either a Hill (a “bump” in the terrain) or a Valley (a “dip” in the terrain).

   There are six files, as follows:

   (a) Hill_Valley_without_noise_Training.data
   (b) Hill_Valley_without_noise_Testing.data

   These first two datasets (without noise) are a training/testing set pair where the hills or valleys have a smooth transition.

   (c) Hill_Valley_with_noise_Training.data
   (d) Hill_Valley_with_noise_Testing.data

   These next two datasets (with noise) are a training/testing set pair where the terrain is uneven, and the hill or valley is not as obvious when viewed closely.

   (e) Hill_Valley_sample_arff.text

   The sample ARFF file is useful for setting up experiments, but is not necessary.

   (f) Hill_Valley_visual_examples.jpg

   This graphic file shows two example instances from the data.


5. Number of Instances: 606 for each training and testing set

6. Number of Attributes: 100 predictive attributes, 1 goal attribute

7. Attribute Information:
   1-100: Labeled “X##”. Floating point values (numeric)
   101: Labeled “class”. Binary {0, 1} representing {valley, hill}

8. Missing Attribute Values: None

   There is no class noise. The “noisy” datasets are named as such because it more accurately represents the terrain.

9. Class Distribution:

   Hill_Valley_with_noise_Training.data (307 / 299)
   Hill_Valley_with_noise_Testing.data  (299 / 307)

   Hill_Valley_without_noise_Training.data (305 / 301)
   Hill_Valley_without_noise_Testing.data  (295 / 311)