Shortcuts | Code | Contributions
This is the collection of my open source contributions to scikit-learn, a Python module for machine learning. It has its code base maintained on GitHub, with over 2500 contributors.
I am currently a core developer of scikit-learn and also a member of its documentation team. You may check out the people behind scikit-learn, my blog interview on board, and my current contributor ranking.
Partial Contributions
Here I will list some of my non-trivial contributions and experiences that I want to share.
-
neighbors.KernelDensity
may change its behavior in a future version! Check out this blog post to see why. - I led the rework of the scikit-learn main website, deployed since version 1.5. Check out this blog post for more details.
List of Merged PRs
You may also check my merged PRs and conversations involved on GitHub.
- #29863 MAINT filename typo
test_monotonic_contraints
->constraints
- #29848 DOC fix all unintentional block quotes in docs
- #29843 MAINT set commit status during lock-files update in PR
- #29700 DOC update sphinx-gallery 0.17.1 and remove our workaround
- #27516 DOC
GradientBoosting*
will not implement monotonic constraints, useHistGradientBoosting*
instead - #29701 MAINT fix bot for lock files update on comment
- #29505 MAINT trigger lock file update in PR with a comment
- #29544 DOC fix links in the machine learning map
- #29412 DOC reorganize the maintainer info page
- #29158 ENH improve HTML display of
FunctionTransformer
- #29493 DOC fix dropdown anchor and collapse-all button in
sphinx-design==0.6.0
- #29393 DOC fix the caption of installation tabs
- #29160 DOC tweak appearance of installation instructions
- #29124 DEPR deprecate
copy
parameter inBirch
- #29134 DOC bump
pydata-sphinx-theme
to 0.15.3 - #29123 DOC fix mistakes in the machine learning map
- #29125 DOC reflect deprecations in 1.5 on API page
- #29126 DOC improve contributing docs
- #29090 DOC temporary fix for search summary
- #29059 DOC fix APIs marked incorrectly as deprecated
- #29037 DOC final cleanup for switching to
pydata-sphinx-theme
- #28623 DOC [PST] fix styling/simplify structure of testimonial and about pages
- #28630 DOC [PST] FIX/RFC machine learning map
- #28401 DOC [PST] fix dropdowns
- #28722 ENH HTML repr show best estimator in
*SearchCV
whenrefit=True
- #28428 DOC [PST] refactor API structure and improve display
- #28448 DOC [PST] tune FAQ page styling
- #28512 DOC [PST] disable gallery link tweaks when
html-noplot
to avoid warnings - #28347 DOC [PST] version switcher and warning banner
- #28521 FIX safe indexing for polars
Series
- #28101 FIX make ‘array-like’ reject sparse containers in parameter validation
- #28511 DOC [PST] use jinja2 template for rst generation
- #28331 DOC [PST] landing page
- #28457 DOC fix jupyterlab and binder could not find
polars
- #28415 DOC [PST] better integrate gallery with
pydata-sphinx-theme
- #28409 DOC [PST] fix changelog badges
- #28408 DOC [PST] tune toctree-related styles
- #28336 DOC [PST] install page
- #28387 MAINT remove unneeded
robots.txt
- #28379 DOC [PST] fix lock files and add
sphinx-design
- #28353 MAINT [PST] set up live preview
- #28376 MAINT create
robots.txt
for setting uppydata-sphinx-theme
preview - #28132 DOC [PST] conf, setup, general styling
- #28315 DOC fix a sphinx warning and a rendering issue
- #28271 DOC fix some hooks that fail to capture the titles in pydata-sphinx-theme setup
- #28272 DOC nitpicks on the FAQ page
- #28255 DOC restructure changelog (in particular for switching to pydata-sphinx-theme)
- #28121 FIX
AffinityPropagation
assigning multiple clusters for equal points - #28133 MAINT fix
update_environments_and_lock_files
for non-posix systems - #28134 DOC solve some sphinx errors when updating to
pydata-sphinx-theme
- #28128 DOC make up for errors in #26410
- #26410 FIX
KNeighborsClassifier
raise when all neighbors of some sample have zero weights - #28120 DOC fix the confusing ordering of
whats_new/v1.5.rst
- #28090 FIX improve error message in
check_array
when getting aSeries
and expecting a 2D container - #28107 DOC fix wrong indentations in the documentation that lead to undesired blockquotes
- #28111 FIX dump svmlight when data is read-only
- #27969 MNT Work-around sphinx-gallery
UnicodeDecodeError
in recommender system - #27970 DOC update doc build sphinx link to by matching regex in lock file
- #27954 CLN avoid nested conftests
- #27723 TST Extend tests for
scipy.sparse.*array
insklearn/svm/tests/test_sparse
- #27790 DOC minor fixes of splitter docstrings (from #26423)
- #27240 TST Extend tests for
scipy.sparse/*array
insklearn/manifold/tests/test_spectral_embedding
- #27468 FIX make dataset fetchers accept
os.Pathlike
fordata_home
- #27472 DOC fix return type of
make_sparse_spd_matrix
- #27438 ENH
make_sparse_spd_matrix
use sparse memory layout - #27250 TST Extend tests for
scipy.sparse/*array
insklearn/neighbors/tests/test_neighbors
- #27277 TST Extend tests for
scipy.sparse/*array
insklearn/impute/tests/test_common
- #27219 TST Extend tests for
scipy.sparse/*array
insklearn/feature_extraction/tests/test_text
- #27216 TST Extend tests for
scipy.sparse/*array
insklearn/ensemble/tests/test_forest
- #27217 TST Extend tests for
scipy.sparse/*array
insklearn/ensemble/tests/test_gradient_boosting
- #27218 TST Extend tests for
scipy.sparse/*array
insklearn/ensemble/tests/test_iforest
- #27261 TST Extend tests for
scipy.sparse/*array
insklearn/tree/tests/test_tree
- #27253 TST Extend tests for
scipy.sparse/*array
insklearn/preprocessing/tests/test_data
- #27225 TST Extend tests for
scipy.sparse/*array
insklearn/linear_model/tests/test_base
- #27226 TST Extend tests for
scipy.sparse/*array
insklearn/linear_model/tests/test_coordinate_descent
- #27237 TST Extend tests for
scipy.sparse/*array
insklearn/linear_model/tests/test_sparse_coordinate_descent
- #27222 TST Extend tests for
scipy.sparse/*array
insklearn/feature_selection/tests/test_variance_threshold
- #27235 TST Extend tests for
scipy.sparse/*array
insklearn/linear_model/tests/test_ridge
- #27228 TST Extend tests for
scipy.sparse/*array
insklearn/linear_model/tests/test_quantile
- #27233 TST Extend tests for
scipy.sparse/*array
insklearn/linear_model/tests/test_ransac
- #27254 TST Extend tests for
scipy.sparse/*array
insklearn/preprocessing/tests/test_function_transformer
- #27252 TST Extend tests for
scipy.sparse/*array
insklearn/neural_network/tests/test_rbm
- #27241 TST Extend tests for
scipy.sparse/*array
insklearn/metrics/cluster/tests/test_unsupervised
- #27246 TST Extend tests for
scipy.sparse/*array
insklearn/model_selection/tests/test_split
- #27262 TST Extend tests for
scipy.sparse/*array
insklearn/utils/tests/test_extmath
- #27276 TST Extend tests for
scipy.sparse/*array
insklearn/utils/tests/test_testing
- #27274 TST Extend tests for
scipy.sparse/*array
insklearn/utils/tests/test_multiclass
- #26602 FIX ravel prediction of
PLSRegression
when fitted on 1dy
- #26759 CLN
v1.4.rst
entries are not sorted - #26748 FIX
mutual_info_regression
whenX
is of integer dtype - #26661 DOC show usage of
__
inPipeline
andFeatureUnion
- #26682 MAINT Parameters validation for sklearn.utils.gen_even_slices
- #26250 MAINT Parameters validation for sklearn.linear_model.ridge_regression
- #26125 MAINT Parameters validation for sklearn.metrics.pairwise_distances_chunked
- #26610 DOC search link to sphinx version
- #26400 FIX
PowerTransformer
raise when “box-cox” has nan column - #26124 MAINT Parameters validation for sklearn.metrics.pairwise_distances_argmin
- #26019 ENH PrecisionRecallDisplay add option to plot chance level
- #26337 FIX KernelPCA inverse transform when gamma is not given
- #26034 MAINT Parameters validation for sklearn.tree.export_graphviz
- #26289 FIX
export_text
andexport_graphviz
accepts feature and class names as array-like - #26122 MAINT Parameters validation for sklearn.metrics.pairwise.manhattan_distances
- #26227 MAINT Parameters validation for sklearn.model_selection.learning_curve
- #26229 MAINT Parameters validation for sklearn.model_selection.validation_curve
- #26230 MAINT Parameters validation for sklearn.model_selection.permutation_test_score
- #26161 MAINT Parameters validation for sklearn.datasets.fetch_species_distributions
- #26165 MAINT Parameters validation for sklearn.datasets.load_breast_cancer
- #26166 MAINT Parameters validation for sklearn.datasets.load_diabetes
- #26126 MAINT Parameters validation for sklearn.datasets.fetch_rcv1
- #26072 MAINT Parameters validation for sklearn.metrics.pairwise.sigmoid_kernel
- #26071 MAINT Parameters validation for sklearn.metrics.pairwise.rbf_kernel
- #26070 MAINT Parameters validation for sklearn.metrics.pairwise.polynomial_kernel
- #26075 MAINT Parameters validation for sklearn.metrics.pairwise.paired_cosine_distances
- #26074 MAINT Parameters validation for sklearn.metrics.pairwise.paired_manhattan_distances
- #26073 MAINT Parameters validation for sklearn.metrics.pairwise.paired_euclidean_distances
- #26046 MAINT Parameters validation for sklearn.metrics.pairwise.cosine_distances
- #26049 MAINT Parameters validation for sklearn.metrics.pairwise.linear_kernel
- #26048 MAINT Parameters validation for sklearn.metrics.pairwise.laplacian_kernel
- #26047 MAINT Parameters validation for sklearn.metrics.pairwise.haversine_distances
- #26036 MAINT Parameters validation for sklearn.preprocessing.scale
- #25987 ENH RocCurveDisplay add option to plot chance level
- #26018 DOC fix SplineTransformer include_bias docstring
- #25973 FIX SequentialFeatureSelector throws IndexError when cv is a generator
- #25931 FIX Remove spurious feature names warning in IsolationForest