141 Commits

Author SHA1 Message Date
3bae1fe390 Fix attribute name extraction in ArffFiles 2023-07-06 16:57:25 +02:00
Ricardo Montañana Gómez
a7098a907e Merge pull request #7 from Doctorado-ML/hiperparameters
Add hyperparameters to discretized

Hyperparameters added:

- min_length: int, default=3: The minimum length of an interval to be considered to be discretized.
- max_depth: int, default=1e6: The maximum depth of the discretization process.
- max_cuts: float, default=0: The maximum number of cut points to be computed for each feature. Compute all cutpoints and select the ones that produce less entropy
v0.9.4
2023-04-25 18:05:12 +02:00
6e17548563 Add url to pyproject and comment to mdlp 2023-04-25 17:53:36 +02:00
dd42e186d5 Reformat Arfffiles.h 2023-04-25 17:16:04 +02:00
3a100bbba7 Add mdlp version to Makefile
Refactor sample.py
2023-04-25 17:11:40 +02:00
17a66858f8 Update version number to 0.9.4 2023-04-25 16:58:23 +02:00
3ed491cd34 Update mdlp version
Add mimimun mdlp version test
Update sample.cpp
2023-04-25 12:05:52 +02:00
878cd379ee Change arff library to sample.py 2023-04-14 11:20:48 +02:00
25d341aee5 Update samples and Readme 2023-04-12 17:40:25 +02:00
fa8c4a221d Remove duplicated lines 2023-04-11 19:45:37 +02:00
947d54202d Update hyperparams info 2023-04-11 19:35:39 +02:00
d04cb389c0 Update tests and module mdlp version 2023-04-11 19:33:57 +02:00
0768d68a36 add join_fit target info 2023-04-08 12:22:03 +02:00
e44bca0420 Move limits include to CPPFImldp header 2023-03-22 18:21:52 +01:00
c2294613df Move limits include to CPPFImldp header 2023-03-22 18:19:01 +01:00
1069fc8ff4 Add last mdlp version and update sample.cpp 2023-03-21 10:18:51 +01:00
95bc29c7f2 Remove trailing space in attribute type of Arff 2023-03-20 20:27:47 +01:00
da9db322da Fix sklearn requirement 2023-03-20 18:58:55 +01:00
e3c329b2e5 Add min_length as percentage of # samples 2023-03-20 18:57:26 +01:00
7368dd9ff4 Refactor ArffFiles in main project 2023-03-20 17:45:58 +01:00
b5c6a49e19 Add last version of mdlp 2023-03-19 19:22:07 +01:00
c2a0d33604 Add last mdlp version 2023-03-19 19:14:32 +01:00
e6a56e3140 Update samples 2023-03-14 11:47:30 +01:00
ccce9725b3 Add max_cuts hyperparamter as in mdlp 2023-03-13 18:14:56 +01:00
aa55d3a340 New version of library and tests 2023-02-26 17:59:08 +01:00
900cccf76b Update discretizer to new library 2023-02-25 18:52:21 +01:00
f65efe3dfd Update the c++ sources with new version v0.9.3 2023-02-24 11:04:06 +01:00
e9d19d41da Add changed submodule 2023-02-22 11:56:39 +01:00
6450ccb9bd Add changed submodule 2023-02-22 11:34:27 +01:00
5d2f32bb0e Add needed header file to MANIFEST 2023-02-22 11:33:26 +01:00
Ricardo Montañana Gómez
8249e55b0c Merge pull request #6 from Doctorado-ML/joinfeatures
- Add a join_fit feature that can update a fitted discretizer. Making it possible to discretize a variable by taking into account the label and a list of other features of the dataset. Used in local discretization with bayesian estimators.
- Add factorize method to be able to simulate the pandas factorize method.
- Remove the algorithm hyperparameter as it is no longer needed
- Add get_states_feature method to obtain a list of states of any feature based on the number of cut points computed while fitting the discretizer
2023-02-22 10:44:43 +01:00
40871f128d Add 1.1.0 version of mdlp 2023-02-22 10:15:33 +01:00
718c9d0e63 make static methods factorize and test_sklrn_trans 2023-02-20 20:12:36 +01:00
e0b7cae9a0 Remove algorithm hyperparameter in discretizer 2023-02-20 18:26:51 +01:00
31d79a77fa Add get_states_feature method 2023-02-13 17:34:50 +01:00
2d495293bb Add range_features method 2023-02-13 16:15:50 +01:00
9899781640 Complete join_fit and remove MultiDiscretizer 2023-02-05 00:30:03 +01:00
f20496203e refactor Multidiscretizer to use one per column 2023-02-04 19:23:15 +01:00
cf09d92ccc add MultiDiscretizer 2023-02-04 17:45:36 +01:00
1186e4ad53 chore: 🔖 Upgrade version number to 0.9.3 2023-01-28 19:15:26 +01:00
7913f5151e Add version command to Makefile 2023-01-28 19:14:32 +01:00
050b923631 feat: Add factorize method to transformer 2023-01-28 10:35:07 +01:00
29fc88cecc test: Add scikit learn compatibility check_estimator test 2023-01-26 23:20:51 +01:00
16b31ec293 test: Complete join_transform test 2023-01-26 11:17:10 +01:00
ca7d158ac8 feat: ⚗️ Add join_transform method and cpp factorize 2023-01-26 10:47:27 +01:00
Ricardo Montañana Gómez
34cd54f77e feat: ♻️ Add Classic algorithm as number 2 to compare performance 2023-01-13 11:47:01 +01:00
70bf03155c Add scikit-learn as requirement 2022-12-23 14:07:36 +01:00
77b571af71 Update README to include link to pypi v0.9.2 2022-12-22 19:41:55 +01:00
ff7a91a7ec build: 🚀 2022-12-22 19:39:05 +01:00
621c19d00c style: 🎨 Remove unused variable in c++ module 2022-12-22 11:02:16 +01:00