.. _onnxsklearncustomrst:

====================================
Convert custom transformer into ONNX
====================================


.. only:: html

    **Links:** :download:`notebook <onnx_sklearn_custom.ipynb>`, :downloadlink:`html <onnx_sklearn_custom2html.html>`, :download:`PDF <onnx_sklearn_custom.pdf>`, :download:`python <onnx_sklearn_custom.py>`, :downloadlink:`slides <onnx_sklearn_custom.slides.html>`, :githublink:`GitHub|_doc/notebooks/2019/sklearn/onnx_sklearn_custom.ipynb|*`


The notebook explains how to create a converter for a custom transformer
following `scikit-learn <https://scikit-learn.org/stable/>`__ API.

**Xavier Dupré** - Senior Data Scientist at Microsoft - Computer Science
Teacher at `ENSAE <http://www.ensae.fr/>`__,
`github/xadupre <https://github.com/xadupre>`__,
`github/sdpython <https://github.com/sdpython>`__.

.. code:: 

    from jyquickhelper import add_notebook_menu
    add_notebook_menu(last_level=2)


.. contents::
    :local:


.. code:: 

    import numpy as np
    from pyquickhelper.helpgen import NbImage
    from sklearn.pipeline import Pipeline
    from sklearn.datasets import load_iris
    from sklearn.linear_model import LogisticRegression
    from jupytalk.talk_examples.sklearn2019 import (
        profile_fct_graph, onnx2str, onnx2dotnb)
    from sklearn.base import TransformerMixin
    from sklearn.preprocessing import MinMaxScaler
    from skl2onnx import to_onnx
    from onnxruntime import InferenceSession
    %matplotlib inline

.. code:: 

    from logging import getLogger
    logger = getLogger('skl2onnx')
    logger.disabled = True

Many functions are implemented in
`sklearn2019.py <https://github.com/sdpython/jupytalk/blob/master/src/jupytalk/talk_examples/sklearn2019.py>`__.

Custom converter
----------------

Let’s implement a converter which applies of
`MinMaxScaler <https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.MinMaxScaler.html>`__
and then applies a logarithm.

.. code:: 

    class MinMaxLogScaler(TransformerMixin):
        
        def __init__(self, feature_range=(1, 10), op_version=None):
            self.feature_range = feature_range
            self.op_version = op_version
        
        def fit(self, X, y=None):
            self.estimator_ = MinMaxScaler(feature_range=self.feature_range)
            self.estimator_.fit(X)
            return self
        
        def transform(self, X):
            X2 = self.estimator_.transform(X)
            return np.log(X2)
    
        def __repr__(self):
            return "{0}(feature_range={1})".format(self.__class__.__name__, 
                                                   self.feature_range)
    
    
    X = np.array([[0, 1, 2], 
                  [-1, 0, 100], 
                  [1, 0, 3], 
                  [4, 4, 4]], dtype=np.float64)
    
    tr = MinMaxLogScaler()
    tr.fit(X)
    tr.transform(X)


.. parsed-literal::
    array([[ 1.02961942e+00,  1.17865500e+00,  0.00000000e+00],
           [-2.22044605e-16,  0.00000000e+00,  2.30258509e+00],
           [ 1.52605630e+00,  0.00000000e+00,  8.78613558e-02],
           [ 2.30258509e+00,  2.30258509e+00,  1.68622712e-01]])


Custom conversion based on OnnxOperatorMixin
--------------------------------------------

Let’s rewrite the previous class by inheriting from *OnnxOperatorMixin*.
We need two operators: \*
`Scaler <https://github.com/onnx/onnx/blob/master/docs/Operators-ml.md#ai.onnx.ml.Scaler>`__
\*
`Log <https://github.com/onnx/onnx/blob/master/docs/Operators.md#Log>`__

.. code:: 

    from skl2onnx.algebra.onnx_operator_mixin import OnnxOperatorMixin
    from skl2onnx.algebra.onnx_ops import OnnxScaler, OnnxLog
    
    target_opset = 12
    
    
    class MinMaxLogScalerOnnx(MinMaxLogScaler, OnnxOperatorMixin):
        
        def to_onnx_operator(self, inputs=None, outputs=('Y', )):
            if inputs is None:
                raise RuntimeError("inputs should contain one name")
    
            op = self.estimator_
            i0 = self.get_inputs(inputs, 0)
            return OnnxLog(OnnxScaler(i0, scale=op.scale_, 
                                      offset = -op.min_ / (op.scale_ + 1e-8),
                                      op_version=self.op_version),
                           output_names=outputs,
                           op_version=self.op_version)
        
    X = np.array([[0, 1, 2], 
                  [-1, 0, 100], 
                  [1, 0, 3], 
                  [4, 4, 4]], dtype=np.float64)
    
    tr = MinMaxLogScalerOnnx(op_version=target_opset)
    tr.fit(X)
    
    try:
        tr.to_onnx(X.astype(np.float32))
    except Exception as e:
        print(e)


.. parsed-literal::
    Shape of output 'Y' cannot be infered. onnx_shape_calculator must be overriden and return a shape calculator.


*onnx* cannot always infer the output shape so a new method must be
added to return this information (design might evolve in the future).

.. code:: 

    from skl2onnx.common.data_types import FloatTensorType
    
    
    class MinMaxLogScalerOnnx(MinMaxLogScaler, OnnxOperatorMixin):
        
        def to_onnx_operator(self, inputs=None, outputs=('Y', )):
            if inputs is None:
                raise RuntimeError("inputs should contain one name")
    
            op = self.estimator_
            i0 = self.get_inputs(inputs, 0)
            return OnnxLog(
                    OnnxScaler(
                        i0, scale=op.scale_.astype(np.float32), 
                        offset=(-op.min_ / (op.scale_ + 1e-8)).astype(np.float32),
                        op_version=self.op_version),
                    output_names=outputs,
                    op_version=self.op_version)
        
        def onnx_shape_calculator(self):
            def shape_calculator(operator):
                operator.outputs[0].type = FloatTensorType(shape=operator.inputs[0].type.shape)
            return shape_calculator    
    
    
    X = np.array([[0, 1, 2], 
                  [-1, 0, 100], 
                  [1, 0, 3], 
                  [4, 4, 4]], dtype=np.float64)
    
    tr = MinMaxLogScalerOnnx(op_version=target_opset)
    tr.fit(X)
    
    model_onnx = tr.to_onnx(X.astype(np.float32))
    onnx2dotnb(model_onnx)


.. raw:: html

    <div id="M5bbd8c0813894d90b7b7d69819feeb87-cont"><div id="M5bbd8c0813894d90b7b7d69819feeb87" style="width:100%;height:100%;"></div></div>
    <script>

    require(['http://www.xavierdupre.fr/js/vizjs/viz.js'], function() { var svgGraph = Viz("digraph aded3c906aee4f018c6d214682dd322e {\nrankdir=LR;\n\"Sc_Scaler/Scaler (op#0)\n input0 X\n output0 Sc_Y0\" [URL=\"javascript:alert('')\", color=yellow, fillcolor=yellow, style=filled];\n\"X0\" [label=\"X\", shape=octagon];\n\"X0\" -> \"Sc_Scaler/Scaler (op#0)\n input0 X\n output0 Sc_Y0\";\n\"Sc_Scaler/Scaler (op#0)\n input0 X\n output0 Sc_Y0\" -> \"Sc_Y00\";\n\"Lo_Log/Log (op#1)\n input0 Sc_Y0\n output0 Y\" [URL=\"javascript:alert('')\", color=yellow, fillcolor=yellow, style=filled];\n\"Sc_Y00\" [label=\"Sc_Y0\", shape=octagon];\n\"Sc_Y00\" [label=\"Sc_Y0\", shape=octagon];\n\"Sc_Y00\" -> \"Lo_Log/Log (op#1)\n input0 Sc_Y0\n output0 Y\";\n\"Y0\" [label=\"Y\", shape=octagon];\n\"Lo_Log/Log (op#1)\n input0 Sc_Y0\n output0 Y\" -> \"Y0\";\n}\n");
    document.getElementById('M5bbd8c0813894d90b7b7d69819feeb87').innerHTML = svgGraph; });

    </script>


Comparison with raw outputs
~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. code:: 

    sess = InferenceSession(model_onnx.SerializeToString())
    
    inputs = {'X': X.astype(np.float32)}
    sess.run(None, inputs)[0]


.. parsed-literal::
    array([[ 1.0296195e+00,  1.1786550e+00, -5.9604645e-08],
           [ 0.0000000e+00,  0.0000000e+00,  2.3025851e+00],
           [ 1.5260563e+00,  0.0000000e+00,  8.7861314e-02],
           [ 2.3025849e+00,  2.3025851e+00,  1.6862264e-01]], dtype=float32)


.. code:: 

    from skl2onnx.helpers.investigate import compare_objects
    compare_objects(tr.transform(X), sess.run(None, inputs)[0])

Everything is ok.

Custom transformer in a pipeline
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. code:: 

    data = load_iris()
    X, y = data.data, data.target

.. code:: 

    pipe = Pipeline([('scaler', MinMaxLogScalerOnnx(op_version=target_opset)),
                     ('lr', LogisticRegression(multi_class="auto"))])
    pipe.fit(X, y)


.. parsed-literal::
    Pipeline(steps=[('scaler', MinMaxLogScalerOnnx(feature_range=(1, 10))),
                    ('lr', LogisticRegression())])


.. code:: 

    pipe_onnx = to_onnx(pipe, X.astype(np.float32), target_opset=target_opset)
    onnx2dotnb(pipe_onnx)


.. raw:: html

    <div id="M8f000989f6a1486c813c5b859422b8d9-cont"><div id="M8f000989f6a1486c813c5b859422b8d9" style="width:100%;height:100%;"></div></div>
    <script>

    require(['http://www.xavierdupre.fr/js/vizjs/viz.js'], function() { var svgGraph = Viz("digraph \"ONNX(Pipeline)\" {\nrankdir=LR;\n\"Sc_Scaler/Scaler (op#0)\n input0 X\n output0 Sc_Y0\" [URL=\"javascript:alert('')\", color=yellow, fillcolor=yellow, style=filled];\n\"X0\" [label=\"X\", shape=octagon];\n\"X0\" -> \"Sc_Scaler/Scaler (op#0)\n input0 X\n output0 Sc_Y0\";\n\"Sc_Scaler/Scaler (op#0)\n input0 X\n output0 Sc_Y0\" -> \"Sc_Y00\";\n\"Lo_Log/Log (op#1)\n input0 Sc_Y0\n output0 Y\" [URL=\"javascript:alert('')\", color=yellow, fillcolor=yellow, style=filled];\n\"Sc_Y00\" [label=\"Sc_Y0\", shape=octagon];\n\"Sc_Y00\" [label=\"Sc_Y0\", shape=octagon];\n\"Sc_Y00\" -> \"Lo_Log/Log (op#1)\n input0 Sc_Y0\n output0 Y\";\n\"Lo_Log/Log (op#1)\n input0 Sc_Y0\n output0 Y\" -> \"Y0\";\n\"LinearClassifier/LinearClassifier (op#2)\n input0 Y\n output0 label\n output1 probability_tensor\" [URL=\"javascript:alert('')\", color=yellow, fillcolor=yellow, style=filled];\n\"Y0\" [label=\"Y\", shape=octagon];\n\"Y0\" [label=\"Y\", shape=octagon];\n\"Y0\" -> \"LinearClassifier/LinearClassifier (op#2)\n input0 Y\n output0 label\n output1 probability_tensor\";\n\"LinearClassifier/LinearClassifier (op#2)\n input0 Y\n output0 label\n output1 probability_tensor\" -> \"label0\";\n\"LinearClassifier/LinearClassifier (op#2)\n input0 Y\n output0 label\n output1 probability_tensor\" -> \"probability_tensor0\";\n\"Normalizer/Normalizer (op#3)\n input0 probability_tensor\n output0 probabilities\" [URL=\"javascript:alert('')\", color=yellow, fillcolor=yellow, style=filled];\n\"probability_tensor0\" [label=\"probability_tensor\", shape=octagon];\n\"probability_tensor0\" [label=\"probability_tensor\", shape=octagon];\n\"probability_tensor0\" -> \"Normalizer/Normalizer (op#3)\n input0 probability_tensor\n output0 probabilities\";\n\"Normalizer/Normalizer (op#3)\n input0 probability_tensor\n output0 probabilities\" -> \"probabilities0\";\n\"Cast/Cast (op#4)\n input0 label\n output0 output_label\" [URL=\"javascript:alert('')\", color=yellow, fillcolor=yellow, style=filled];\n\"label0\" [label=\"label\", shape=octagon];\n\"label0\" [label=\"label\", shape=octagon];\n\"label0\" -> \"Cast/Cast (op#4)\n input0 label\n output0 output_label\";\n\"output_label0\" [label=\"output_label\", shape=octagon];\n\"Cast/Cast (op#4)\n input0 label\n output0 output_label\" -> \"output_label0\";\n\"ZipMap/ZipMap (op#5)\n input0 probabilities\n output0 output_probability\" [URL=\"javascript:alert('')\", color=yellow, fillcolor=yellow, style=filled];\n\"probabilities0\" [label=\"probabilities\", shape=octagon];\n\"probabilities0\" [label=\"probabilities\", shape=octagon];\n\"probabilities0\" -> \"ZipMap/ZipMap (op#5)\n input0 probabilities\n output0 output_probability\";\n\"output_probability0\" [label=\"output_probability\", shape=octagon];\n\"ZipMap/ZipMap (op#5)\n input0 probabilities\n output0 output_probability\" -> \"output_probability0\";\n}\n");
    document.getElementById('M8f000989f6a1486c813c5b859422b8d9').innerHTML = svgGraph; });

    </script>


Separate converter
------------------

The previous design requires the operator to inherit from
*OnnxOperatorMixin*. The conversion involves: \* *scope*: every node in
the graph must have a unique name, the scope ensures that it is. \*
*container*: internal container for all nodes added during the
conversion

The converter uses an *operator*. This refers to ONNX operator, like a
placeholder with named inputs and outputs which receives the ONNX nodes.
When the converter is implemented, it needs to be registered so that
*skl2onnx* can use it when needed in a pipeline.

.. code:: 

    from skl2onnx.common.data_types import FloatTensorType
    
    
    def convert_sklearn_minxmaxlog_scaler(scope, operator, container):
        # operator = placeholder for the converted scikit-learn operator
        # operator.inputs = defined inputs
        # operator.outputs : defined outputs
        # The conversion is independant from any other converted models in the pipeline.
        X = operator.inputs[0]
        out = operator.outputs[0]
        opv = container.target_opset
        
        # The raw operator is the scikitl-learn model to be converted.
        op = operator.raw_operator
    
        # The ONNX definition of the new operator which links
        # X to out.
        # X.onnx_name is the unique name of X.
        # out.onnx_name is the unique name of out.
        onnx_op = OnnxLog(OnnxScaler(X.onnx_name, scale=op.estimator_.scale_, 
                                     offset = -op.estimator_.min_ / (op.estimator_.scale_ + 1e-8),
                                     op_version=opv),
                          output_names=out.onnx_name,
                          op_version=opv)
        
        # Let's finally add this subgraph to the container
        # by adding the final node.
        onnx_op.add_to(scope, container)
    
        
    def shape_sklearn_minxmaxlog_scaler(operator):
        # The shape calculator defines the dimension of
        # every output.
        op_input = operator.inputs[0]
        op = operator.raw_operator
        N = op_input.type.shape[0]
        C = op_input.type.shape[1]
        
        # This line tells the first output is a float matrix
        # which has the same dimension as the input.
        operator.outputs[0].type = FloatTensorType([N, C])
    
    
    from skl2onnx import update_registered_converter
    
    # registration of the converter.
    update_registered_converter(
                MinMaxLogScaler,
                "MinMaxLogScaler",
                shape_sklearn_minxmaxlog_scaler,
                convert_sklearn_minxmaxlog_scaler)

.. code:: 

    pipe = Pipeline([('scaler', MinMaxLogScaler()),
                     ('lr', LogisticRegression(multi_class="auto"))])
    pipe.fit(X, y)


.. parsed-literal::
    Pipeline(steps=[('scaler', MinMaxLogScaler(feature_range=(1, 10))),
                    ('lr', LogisticRegression())])


.. code:: 

    pipe_onnx = to_onnx(pipe, X.astype(np.float32), target_opset=target_opset)
    onnx2dotnb(pipe_onnx)


.. raw:: html

    <div id="M90ddad59c0e84993aaf67317e1b30079-cont"><div id="M90ddad59c0e84993aaf67317e1b30079" style="width:100%;height:100%;"></div></div>
    <script>

    require(['http://www.xavierdupre.fr/js/vizjs/viz.js'], function() { var svgGraph = Viz("digraph \"ONNX(Pipeline)\" {\nrankdir=LR;\n\"Sc_Scaler/Scaler (op#0)\n input0 X\n output0 Sc_Y0\" [URL=\"javascript:alert('')\", color=yellow, fillcolor=yellow, style=filled];\n\"X0\" [label=\"X\", shape=octagon];\n\"X0\" -> \"Sc_Scaler/Scaler (op#0)\n input0 X\n output0 Sc_Y0\";\n\"Sc_Scaler/Scaler (op#0)\n input0 X\n output0 Sc_Y0\" -> \"Sc_Y00\";\n\"Lo_Log/Log (op#1)\n input0 Sc_Y0\n output0 variable\" [URL=\"javascript:alert('')\", color=yellow, fillcolor=yellow, style=filled];\n\"Sc_Y00\" [label=\"Sc_Y0\", shape=octagon];\n\"Sc_Y00\" [label=\"Sc_Y0\", shape=octagon];\n\"Sc_Y00\" -> \"Lo_Log/Log (op#1)\n input0 Sc_Y0\n output0 variable\";\n\"Lo_Log/Log (op#1)\n input0 Sc_Y0\n output0 variable\" -> \"variable0\";\n\"LinearClassifier/LinearClassifier (op#2)\n input0 variable\n output0 label\n output1 probability_tensor\" [URL=\"javascript:alert('')\", color=yellow, fillcolor=yellow, style=filled];\n\"variable0\" [label=\"variable\", shape=octagon];\n\"variable0\" [label=\"variable\", shape=octagon];\n\"variable0\" -> \"LinearClassifier/LinearClassifier (op#2)\n input0 variable\n output0 label\n output1 probability_tensor\";\n\"LinearClassifier/LinearClassifier (op#2)\n input0 variable\n output0 label\n output1 probability_tensor\" -> \"label0\";\n\"LinearClassifier/LinearClassifier (op#2)\n input0 variable\n output0 label\n output1 probability_tensor\" -> \"probability_tensor0\";\n\"Normalizer/Normalizer (op#3)\n input0 probability_tensor\n output0 probabilities\" [URL=\"javascript:alert('')\", color=yellow, fillcolor=yellow, style=filled];\n\"probability_tensor0\" [label=\"probability_tensor\", shape=octagon];\n\"probability_tensor0\" [label=\"probability_tensor\", shape=octagon];\n\"probability_tensor0\" -> \"Normalizer/Normalizer (op#3)\n input0 probability_tensor\n output0 probabilities\";\n\"Normalizer/Normalizer (op#3)\n input0 probability_tensor\n output0 probabilities\" -> \"probabilities0\";\n\"Cast/Cast (op#4)\n input0 label\n output0 output_label\" [URL=\"javascript:alert('')\", color=yellow, fillcolor=yellow, style=filled];\n\"label0\" [label=\"label\", shape=octagon];\n\"label0\" [label=\"label\", shape=octagon];\n\"label0\" -> \"Cast/Cast (op#4)\n input0 label\n output0 output_label\";\n\"output_label0\" [label=\"output_label\", shape=octagon];\n\"Cast/Cast (op#4)\n input0 label\n output0 output_label\" -> \"output_label0\";\n\"ZipMap/ZipMap (op#5)\n input0 probabilities\n output0 output_probability\" [URL=\"javascript:alert('')\", color=yellow, fillcolor=yellow, style=filled];\n\"probabilities0\" [label=\"probabilities\", shape=octagon];\n\"probabilities0\" [label=\"probabilities\", shape=octagon];\n\"probabilities0\" -> \"ZipMap/ZipMap (op#5)\n input0 probabilities\n output0 output_probability\";\n\"output_probability0\" [label=\"output_probability\", shape=octagon];\n\"ZipMap/ZipMap (op#5)\n input0 probabilities\n output0 output_probability\" -> \"output_probability0\";\n}\n");
    document.getElementById('M90ddad59c0e84993aaf67317e1b30079').innerHTML = svgGraph; });

    </script>


Node in ONNX may have multiple outputs. *skl2onnx* defines a default
number of outputs for transformers, regressor, classifier but this
default number can be changed by defining a default parser.

Appendix
--------

.. code:: 

    import onnx, skl2onnx, sklearn, onnxruntime
    mods = [onnx, skl2onnx, onnxruntime, sklearn]
    for m in mods:
        print(m.__name__, m.__version__)


.. parsed-literal::
    onnx 1.7.105
    skl2onnx 1.7.1076
    onnxruntime 1.3.996
    sklearn 0.24.dev0