# mdlp

Discretization algorithm based on the paper by Fayyad & Irani, [Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning](https://www.ijcai.org/Proceedings/93-2/Papers/022.pdf).

The implementation tries to mitigate the problem of samples that share the same value of the variable but have different labels:

- It sorts the values of the variable, using the label values as a tie-breaker.
- Once a valid candidate for the split is found, it checks whether the previous value is the same as the current one, and tries to take the previous one, or the next one if the former is not possible.

The algorithm returns the cut points for the variable.
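To make the idea concrete, here is a minimal Python sketch of the entropy-based recursion with the MDLP stopping criterion from the Fayyad & Irani paper. It is not this repository's code: the function names (`entropy`, `best_cut`, `mdlp_accepts`, `cut_points`) are hypothetical, cut points are assumed to be midpoints between neighbouring values, and ties are handled in a simpler way than described above, by only considering boundaries between two different values.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (base 2) of a non-empty sequence of class labels."""
    n = len(labels)
    return -sum(c / n * math.log2(c / n) for c in Counter(labels).values())


def best_cut(pairs):
    """Return (index, weighted_entropy) of the best boundary, or None.

    `pairs` is a sorted list of (value, label). Only boundaries between two
    different values are considered, so equal values are never separated
    (a simplification; the repository handles ties differently).
    """
    labels = [y for _, y in pairs]
    n = len(pairs)
    best = None
    for i in range(1, n):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # cannot cut between identical values
        e = (i * entropy(labels[:i]) + (n - i) * entropy(labels[i:])) / n
        if best is None or e < best[1]:
            best = (i, e)
    return best


def mdlp_accepts(pairs, i, split_entropy):
    """MDLP criterion: accept the cut only if the gain pays for its cost."""
    labels = [y for _, y in pairs]
    left, right = labels[:i], labels[i:]
    n = len(labels)
    k, k1, k2 = len(set(labels)), len(set(left)), len(set(right))
    ent = entropy(labels)
    gain = ent - split_entropy
    delta = math.log2(3 ** k - 2) - (
        k * ent - k1 * entropy(left) - k2 * entropy(right)
    )
    return gain > (math.log2(n - 1) + delta) / n


def cut_points(values, labels):
    """Return the sorted cut points for one variable (hypothetical helper)."""
    # Tuples compare element by element, so the label acts as a tie-breaker.
    pairs = sorted(zip(values, labels))
    cuts = []

    def recurse(part):
        found = best_cut(part)
        if found is None:
            return
        i, e = found
        if not mdlp_accepts(part, i, e):
            return
        recurse(part[:i])
        # Assumption: the cut is placed midway between the two neighbours.
        cuts.append((part[i - 1][0] + part[i][0]) / 2)
        recurse(part[i:])

    recurse(pairs)
    return cuts
```

With this sketch, `cut_points([1, 2, 3, 4, 10, 11, 12, 13], [0, 0, 0, 0, 1, 1, 1, 1])` returns `[7.0]`, and a variable whose values are all identical yields no cut points regardless of the labels.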

## Sample

To run the sample, execute the following commands:

```bash
cd sample
mkdir build
cd build
cmake ..
make
./sample iris
```