.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "auto_examples/plot_gpr.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download_auto_examples_plot_gpr.py>`
        to download the full example code

.. rst-class:: sphx-glr-example-title

.. _sphx_glr_auto_examples_plot_gpr.py:

.. _l-gpr-example:

Discrepancies with GaussianProcessRegressor: use of double
==========================================================

The `GaussianProcessRegressor
<https://scikit-learn.org/stable/modules/generated/sklearn.gaussian_process.GaussianProcessRegressor.html>`_
involves many matrix operations which may require double precision.
*sklearn-onnx* uses single floats by default but, for this particular
model, it is better to use doubles. Let's see how to create
an ONNX file using them.

.. contents::
    :local:

Train a model
+++++++++++++

A very basic example using *GaussianProcessRegressor*
on the diabetes dataset.

.. GENERATED FROM PYTHON SOURCE LINES 27-47

.. code-block:: default

    import pprint
    import numpy
    import sklearn
    from sklearn.datasets import load_diabetes
    from sklearn.gaussian_process import GaussianProcessRegressor
    from sklearn.gaussian_process.kernels import DotProduct, RBF
    from sklearn.model_selection import train_test_split
    import onnx
    import onnxruntime as rt
    import skl2onnx
    from skl2onnx.common.data_types import FloatTensorType, DoubleTensorType
    from skl2onnx import convert_sklearn

    dataset = load_diabetes()
    X, y = dataset.data, dataset.target
    X_train, X_test, y_train, y_test = train_test_split(X, y)
    gpr = GaussianProcessRegressor(DotProduct() + RBF(), alpha=1.)
    gpr.fit(X_train, y_train)
    print(gpr)

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    somewhere/.local/lib/python3.9/site-packages/sklearn/gaussian_process/kernels.py:430: ConvergenceWarning: The optimal value found for dimension 0 of parameter k1__sigma_0 is close to the specified upper bound 100000.0. Increasing the bound and calling fit again may find a better value.
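The warning above shows the optimizer pushing ``k1__sigma_0`` toward its
upper bound of 100000, so the kernel matrix mixes coefficients whose
magnitudes differ by many orders. The snippet below is a minimal,
illustrative sketch of why single precision cannot keep both scales at
once; the magnitudes ``1e10`` (roughly ``sigma_0 ** 2``) and ``1.0`` are
assumptions for illustration, not values read from the trained model.

```python
import numpy

# float32 carries about 7 significant decimal digits, float64 about 16.
# Adding a term near 1e10 to a term near 1 loses the small term entirely
# in float32, while float64 preserves it.
big, small = 1e10, 1.0
print(numpy.float32(big) + numpy.float32(small) - numpy.float32(big))  # 0.0
print(numpy.float64(big) + numpy.float64(small) - numpy.float64(big))  # 1.0
```

This magnitude mismatch is the reason the conversion with double
precision shown later in this example tracks *scikit-learn*'s
predictions so much more closely.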
      warnings.warn(
    somewhere/.local/lib/python3.9/site-packages/sklearn/gaussian_process/kernels.py:420: ConvergenceWarning: The optimal value found for dimension 0 of parameter k2__length_scale is close to the specified lower bound 1e-05. Decreasing the bound and calling fit again may find a better value.
      warnings.warn(
    GaussianProcessRegressor(alpha=1.0,
                             kernel=DotProduct(sigma_0=1) + RBF(length_scale=1))

.. GENERATED FROM PYTHON SOURCE LINES 48-53

First attempt to convert a model into ONNX
++++++++++++++++++++++++++++++++++++++++++

The documentation suggests the following way to convert
a model into ONNX.

.. GENERATED FROM PYTHON SOURCE LINES 53-65

.. code-block:: default

    initial_type = [('X', FloatTensorType([None, X_train.shape[1]]))]
    onx = convert_sklearn(gpr, initial_types=initial_type,
                          target_opset=12)

    sess = rt.InferenceSession(onx.SerializeToString())
    try:
        pred_onx = sess.run(
            None, {'X': X_test.astype(numpy.float32)})[0]
    except RuntimeError as e:
        print(str(e))

.. GENERATED FROM PYTHON SOURCE LINES 66-78

Second attempt: variable dimensions
+++++++++++++++++++++++++++++++++++

Unfortunately, even though the conversion went well, the runtime
fails to compute the prediction. The previous snippet imposes a fixed
dimension on the input, which lets the runtime assume that every node
output also has fixed dimensions, and that is not the case for this
model. We need to disable these checks by replacing the fixed
dimensions with an empty value (see next line).

.. GENERATED FROM PYTHON SOURCE LINES 78-91

.. code-block:: default

    initial_type = [('X', FloatTensorType([None, None]))]
    onx = convert_sklearn(gpr, initial_types=initial_type,
                          target_opset=12)

    sess = rt.InferenceSession(onx.SerializeToString())
    pred_onx = sess.run(
        None, {'X': X_test.astype(numpy.float32)})[0]

    pred_skl = gpr.predict(X_test)
    print(pred_skl[:10])
    print(pred_onx[0, :10])

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    [145.31738281 157.765625   201.18457031 136.91113281 176.2734375
     146.81738281 173.46679688 125.90527344 173.24609375 102.34277344]
    [-32768.]

.. GENERATED FROM PYTHON SOURCE LINES 92-95

The differences seem quite significant. Let's confirm that
by looking at the biggest ones.

.. GENERATED FROM PYTHON SOURCE LINES 95-101

.. code-block:: default

    diff = numpy.sort(numpy.abs(numpy.squeeze(pred_skl) -
                                numpy.squeeze(pred_onx)))[-5:]
    print(diff)
    print('min(Y)-max(Y):', min(y_test), max(y_test))

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    [32967.69335938 32968.28027344 32969.18457031 32970.25195312
     32980.12988281]
    min(Y)-max(Y): 25.0 346.0

.. GENERATED FROM PYTHON SOURCE LINES 102-118

Third attempt: use of double
++++++++++++++++++++++++++++

The model performs several matrix computations, and the matrices
involved have coefficients with very different orders of magnitude.
It is difficult to approximate the prediction made with *scikit-learn*
if the converted model sticks to floats: double precision is needed.
The previous code requires one change: declaring the inputs as
``DoubleTensorType``. With double inputs, the conversion function
dumps every real constant matrix, such as the trained coefficients,
as doubles and not as floats anymore.

.. GENERATED FROM PYTHON SOURCE LINES 118-128

.. code-block:: default

    initial_type = [('X', DoubleTensorType([None, None]))]
    onx64 = convert_sklearn(gpr, initial_types=initial_type,
                            target_opset=12)

    sess64 = rt.InferenceSession(onx64.SerializeToString())
    pred_onx64 = sess64.run(None, {'X': X_test})[0]

    print(pred_onx64[0, :10])

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    [145.3137207]

.. GENERATED FROM PYTHON SOURCE LINES 129-130

The new differences look much better.

.. GENERATED FROM PYTHON SOURCE LINES 130-136

.. code-block:: default

    diff = numpy.sort(numpy.abs(numpy.squeeze(pred_skl) -
                                numpy.squeeze(pred_onx64)))[-5:]
    print(diff)
    print('min(Y)-max(Y):', min(y_test), max(y_test))

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    [0.00585938 0.00592041 0.00604248 0.00714111 0.00848389]
    min(Y)-max(Y): 25.0 346.0

.. GENERATED FROM PYTHON SOURCE LINES 137-143

Size increase
+++++++++++++

As a result, the ONNX model is almost twice as big because every
coefficient is stored as a double and not as a float anymore.

.. GENERATED FROM PYTHON SOURCE LINES 143-149

.. code-block:: default

    size32 = len(onx.SerializeToString())
    size64 = len(onx64.SerializeToString())
    print("ONNX with floats:", size32)
    print("ONNX with doubles:", size64)

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    ONNX with floats: 29814
    ONNX with doubles: 57694

.. GENERATED FROM PYTHON SOURCE LINES 150-161

return_std=True
+++++++++++++++

`GaussianProcessRegressor
<https://scikit-learn.org/stable/modules/generated/sklearn.gaussian_process.GaussianProcessRegressor.html>`_
is one model which defines an additional parameter for the predict
function. If called with ``return_std=True``, the class returns one
more result, and that needs to be reflected in the generated ONNX
graph. The converter needs to know that an extended graph is required.
That's done through the option mechanism (see :ref:`l-conv-options`).

.. GENERATED FROM PYTHON SOURCE LINES 161-170

.. code-block:: default

    initial_type = [('X', DoubleTensorType([None, None]))]
    options = {GaussianProcessRegressor: {'return_std': True}}
    try:
        onx64_std = convert_sklearn(gpr, initial_types=initial_type,
                                    options=options, target_opset=12)
    except RuntimeError as e:
        print(e)

.. GENERATED FROM PYTHON SOURCE LINES 171-175

This error highlights the fact that *scikit-learn* computes internal
variables on the first call to method predict. The converter needs
them to be initialized: call method predict at least once,
then convert again.

.. GENERATED FROM PYTHON SOURCE LINES 175-185

.. code-block:: default

    gpr.predict(X_test[:1], return_std=True)
    onx64_std = convert_sklearn(gpr, initial_types=initial_type,
                                options=options, target_opset=12)

    sess64_std = rt.InferenceSession(onx64_std.SerializeToString())
    pred_onx64_std = sess64_std.run(None, {'X': X_test[:5]})

    pprint.pprint(pred_onx64_std)

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    [array([[145.3137207 ],
           [157.76715088],
           [201.1854248 ],
           [136.90722656],
           [176.27374268]]),
     array([  0.        , 329.36523916, 103.58507287, 396.68282841,
           271.86538202])]

.. GENERATED FROM PYTHON SOURCE LINES 186-187

Let's compare with *scikit-learn* prediction.

.. GENERATED FROM PYTHON SOURCE LINES 187-190

.. code-block:: default

    pprint.pprint(gpr.predict(X_test[:5], return_std=True))

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    (array([145.31738281, 157.765625  , 201.18457031, 136.91113281,
           176.27539062]),
     array([1.00585084, 1.00662422, 1.01830701, 1.00510059, 1.01611786]))

.. GENERATED FROM PYTHON SOURCE LINES 191-192

It looks good. Let's run a more thorough check.

.. GENERATED FROM PYTHON SOURCE LINES 192-202

.. code-block:: default

    pred_onx64_std = sess64_std.run(None, {'X': X_test})
    pred_std = gpr.predict(X_test, return_std=True)

    diff = numpy.sort(numpy.abs(numpy.squeeze(pred_onx64_std[1]) -
                                numpy.squeeze(pred_std[1])))[-5:]
    print(diff)

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    [749.89538989 763.27326573 775.65189487 807.24411217 883.30602091]

.. GENERATED FROM PYTHON SOURCE LINES 203-206

There are some discrepancies but it seems reasonable.

**Versions used for this example**

.. GENERATED FROM PYTHON SOURCE LINES 206-212

.. code-block:: default

    print("numpy:", numpy.__version__)
    print("scikit-learn:", sklearn.__version__)
    print("onnx: ", onnx.__version__)
    print("onnxruntime: ", rt.__version__)
    print("skl2onnx: ", skl2onnx.__version__)

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    numpy: 1.23.5
    scikit-learn: 1.2.2
    onnx:  1.13.1
    onnxruntime:  1.14.1
    skl2onnx:  1.14.0

.. rst-class:: sphx-glr-timing

**Total running time of the script:** ( 0 minutes 1.397 seconds)

.. _sphx_glr_download_auto_examples_plot_gpr.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example

    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: plot_gpr.py <plot_gpr.py>`

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: plot_gpr.ipynb <plot_gpr.ipynb>`

.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_