Commit Inicial

2025-08-18 00:46:03 +00:00 · 2020-11-20 11:23:40 +01:00
commit 5611e5bc01
2914 changed files with 2625178 additions and 0 deletions
--- a/data/tanveer/hill-valley/Hill-Valley.names
+++ b/data/tanveer/hill-valley/Hill-Valley.names
@@ -0,0 +1,64 @@
+1. Title: Hill-Valley Dataset 
+
+2. Source Information
+
+   a)	Creators:
+
+	Lee Graham (lee@stellaralchemy.com)
+
+	Franz Oppacher (oppacher@scs.carleton.ca)
+	Carleton University, Department of Computer Science
+	Intelligent Systems Research Unit
+	1125 Colonel By Drive, Ottawa, Ontario, Canada, K1S5B6
+
+   c) Date of release: March 2008
+ 
+3. Past Usage:
+
+   (a) Non-published. Evaluation of dataset by various learning algorithms in the Waikato Environment for Knowledge Analysis (WEKA).
+
+
+4. Relevant Information:
+
+   Each record represents 100 points on a two-dimensional graph. When plotted in order (from 1 through 100) as the Y co-ordinate, the points will create either a Hill (a <20>bump<6D> in the terrain) or a Valley (a <20>dip<69> in the terrain).
+
+   There are six files, as follows:
+
+   (a) Hill_Valley_without_noise_Training.data
+   (b) Hill_Valley_without_noise_Testing.data
+
+   These first two datasets (without noise) are a training/testing set pair where the hills or valleys have a smooth transition.
+
+   (c) Hill_Valley_with_noise_Training.data
+   (d) Hill_Valley_with_noise_Testing.data
+
+   These next two datasets (with noise) are a training/testing set pair where the terrain is uneven, and the hill or valley is not as obvious when viewed closely. 
+
+   (e) Hill_Valley_sample_arff.text
+
+   The sample ARFF file is useful for setting up experiments, but is not necessary.
+
+   (f) Hill_Valley_visual_examples.jpg
+
+   This graphic file shows two example instances from the data.
+
+
+5. Number of Instances: 606 for each training and testing set
+
+6. Number of Attributes: 100 predictive attributes, 1 goal attribute
+
+7. Attribute Information:
+   1-100: Labeled <20>X##<23>. Floating point values (numeric)
+   101: Labeled <20>class<73>. Binary {0, 1} representing {valley, hill}
+
+8. Missing Attribute Values: None
+
+   There is no class noise. The <20>noisy<73> datasets are named as such because it more accurately represents the terrain.
+
+9. Class Distribution:
+
+   Hill_Valley_with_noise_Training.data (307 / 299)
+   Hill_Valley_with_noise_Testing.data  (299 / 307)
+
+   Hill_Valley_without_noise_Training.data (305 / 301)
+   Hill_Valley_without_noise_Testing.data  (295 / 311)