mirror of
https://github.com/Doctorado-ML/Stree_datasets.git
synced 2025-08-18 00:46:03 +00:00
Commit Inicial
This commit is contained in:
64
data/tanveer/hill-valley/Hill-Valley.names
Executable file
64
data/tanveer/hill-valley/Hill-Valley.names
Executable file
@@ -0,0 +1,64 @@
|
||||
1. Title: Hill-Valley Dataset
|
||||
|
||||
2. Source Information
|
||||
|
||||
a) Creators:
|
||||
|
||||
Lee Graham (lee@stellaralchemy.com)
|
||||
|
||||
Franz Oppacher (oppacher@scs.carleton.ca)
|
||||
Carleton University, Department of Computer Science
|
||||
Intelligent Systems Research Unit
|
||||
1125 Colonel By Drive, Ottawa, Ontario, Canada, K1S5B6
|
||||
|
||||
c) Date of release: March 2008
|
||||
|
||||
3. Past Usage:
|
||||
|
||||
(a) Non-published. Evaluation of dataset by various learning algorithms in the Waikato Environment for Knowledge Analysis (WEKA).
|
||||
|
||||
|
||||
4. Relevant Information:
|
||||
|
||||
Each record represents 100 points on a two-dimensional graph. When plotted in order (from 1 through 100) as the Y co-ordinate, the points will create either a Hill (a <20>bump<6D> in the terrain) or a Valley (a <20>dip<69> in the terrain).
|
||||
|
||||
There are six files, as follows:
|
||||
|
||||
(a) Hill_Valley_without_noise_Training.data
|
||||
(b) Hill_Valley_without_noise_Testing.data
|
||||
|
||||
These first two datasets (without noise) are a training/testing set pair where the hills or valleys have a smooth transition.
|
||||
|
||||
(c) Hill_Valley_with_noise_Training.data
|
||||
(d) Hill_Valley_with_noise_Testing.data
|
||||
|
||||
These next two datasets (with noise) are a training/testing set pair where the terrain is uneven, and the hill or valley is not as obvious when viewed closely.
|
||||
|
||||
(e) Hill_Valley_sample_arff.text
|
||||
|
||||
The sample ARFF file is useful for setting up experiments, but is not necessary.
|
||||
|
||||
(f) Hill_Valley_visual_examples.jpg
|
||||
|
||||
This graphic file shows two example instances from the data.
|
||||
|
||||
|
||||
5. Number of Instances: 606 for each training and testing set
|
||||
|
||||
6. Number of Attributes: 100 predictive attributes, 1 goal attribute
|
||||
|
||||
7. Attribute Information:
|
||||
1-100: Labeled <20>X##<23>. Floating point values (numeric)
|
||||
101: Labeled <20>class<73>. Binary {0, 1} representing {valley, hill}
|
||||
|
||||
8. Missing Attribute Values: None
|
||||
|
||||
There is no class noise. The <20>noisy<73> datasets are named as such because it more accurately represents the terrain.
|
||||
|
||||
9. Class Distribution:
|
||||
|
||||
Hill_Valley_with_noise_Training.data (307 / 299)
|
||||
Hill_Valley_with_noise_Testing.data (299 / 307)
|
||||
|
||||
Hill_Valley_without_noise_Training.data (305 / 301)
|
||||
Hill_Valley_without_noise_Testing.data (295 / 311)
|
Reference in New Issue
Block a user