.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "auto_tutorial/plot_gbegin_transfer_learning.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download_auto_tutorial_plot_gbegin_transfer_learning.py>`
        to download the full example code

.. rst-class:: sphx-glr-example-title

.. _sphx_glr_auto_tutorial_plot_gbegin_transfer_learning.py:


Transfer Learning with ONNX
===========================

.. index:: transfer learning, deep learning

Transfer learning is common with deep learning.
A deep learning model is used as preprocessing before
the output is sent to a final classifier or regressor.
It is not quite easy in this case to mix framework,
:epkg:`scikit-learn` with :epkg:`pytorch`
(or :epkg:`skorch`), the Keras API for Tensorflow,
`tf.keras.wrappers.scikit_learn
<https://www.tensorflow.org/api_docs/python/tf/
keras/wrappers/scikit_learn>`_. Every combination
requires work. ONNX reduces the number of platforms to
support. Once the model is converted into ONNX,
it can be inserted in any :epkg:`scikit-learn` pipeline.

.. contents::
    :local:

Retrieve and load a model
+++++++++++++++++++++++++

We download one model from the :epkg:`ONNX Zoo` but the model
could be trained and produced by another converter library.

.. GENERATED FROM PYTHON SOURCE LINES 31-75

.. code-block:: default

    import sys
    from io import BytesIO
    import onnx
    from mlprodict.sklapi import OnnxTransformer
    from sklearn.decomposition import PCA
    from sklearn.pipeline import Pipeline
    from mlinsights.plotting.gallery import plot_gallery_images
    import matplotlib.pyplot as plt
    from skl2onnx.tutorial.imagenet_classes import class_names
    import numpy
    from PIL import Image
    from onnxruntime import InferenceSession
    from onnxruntime.capi.onnxruntime_pybind11_state import InvalidArgument
    import os
    import urllib.request


    def download_file(url, name, min_size):
        if not os.path.exists(name):
            print("download '%s'" % url)
            with urllib.request.urlopen(url) as u:
                content = u.read()
            if len(content) < min_size:
                raise RuntimeError(
                    "Unable to download '{}' due to\n{}".format(
                        url, content))
            print("downloaded %d bytes." % len(content))
            with open(name, "wb") as f:
                f.write(content)
        else:
            print("'%s' already downloaded" % name)


    model_name = "squeezenet1.1-7.onnx"
    url_name = ("https://github.com/onnx/models/raw/main/vision/"
                "classification/squeezenet/model")
    url_name += "/" + model_name
    try:
        download_file(url_name, model_name, 100000)
    except RuntimeError as e:
        print(e)
        sys.exit(1)


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    download 'https://github.com/onnx/models/raw/main/vision/classification/squeezenet/model/squeezenet1.1-7.onnx'
    downloaded 4956208 bytes.


.. GENERATED FROM PYTHON SOURCE LINES 76-77

Loading the ONNX file and use it on one image.

.. GENERATED FROM PYTHON SOURCE LINES 77-83

.. code-block:: default


    sess = InferenceSession(model_name)

    for inp in sess.get_inputs():
        print(inp)


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    NodeArg(name='data', type='tensor(float)', shape=[1, 3, 224, 224])


.. GENERATED FROM PYTHON SOURCE LINES 84-86

The model expects a series of images of size
`[3, 224, 224]`.

.. GENERATED FROM PYTHON SOURCE LINES 88-90

Classifying an image
++++++++++++++++++++

.. GENERATED FROM PYTHON SOURCE LINES 90-100

.. code-block:: default


    url = ("https://upload.wikimedia.org/wikipedia/commons/d/d2/"
           "East_Coker_elm%2C_2.jpg")
    img = "East_Coker_elm.jpg"
    download_file(url, img, 100000)

    im0 = Image.open(img)
    im = im0.resize((224, 224))
    # im.show()


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    download 'https://upload.wikimedia.org/wikipedia/commons/d/d2/East_Coker_elm%2C_2.jpg'
    downloaded 712230 bytes.


.. GENERATED FROM PYTHON SOURCE LINES 101-102

Image to numpy and predection.

.. GENERATED FROM PYTHON SOURCE LINES 102-117

.. code-block:: default


    def im2array(im):
        X = numpy.asarray(im)
        X = X.transpose(2, 0, 1)
        X = X.reshape(1, 3, 224, 224)
        return X


    X = im2array(im)
    out = sess.run(None, {'data': X.astype(numpy.float32)})
    out = out[0]

    print(out[0, :5])


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    [145.59464   55.067673  60.599747  46.29393   37.98244 ]


.. GENERATED FROM PYTHON SOURCE LINES 118-119

Interpretation

.. GENERATED FROM PYTHON SOURCE LINES 119-124

.. code-block:: default


    res = list(sorted((r, class_names[i]) for i, r in enumerate(out[0])))
    print(res[-5:])


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    [(205.84172, 'Samoyed, Samoyede'), (212.0366, 'park bench'), (225.50684, 'lakeside, lakeshore'), (232.90251, 'fountain'), (258.10968, 'geyser')]


.. GENERATED FROM PYTHON SOURCE LINES 125-130

Classifying more images
+++++++++++++++++++++++

The initial image is rotated,
the answer is changing.

.. GENERATED FROM PYTHON SOURCE LINES 130-156

.. code-block:: default


    angles = [a * 2. for a in range(-6, 6)]
    imgs = [(angle, im0.rotate(angle).resize((224, 224)))
            for angle in angles]


    def classify(imgs):
        labels = []
        for angle, img in imgs:
            X = im2array(img)
            probs = sess.run(None, {'data': X.astype(numpy.float32)})[0]
            pl = list(sorted(
                ((r, class_names[i]) for i, r in enumerate(probs[0])),
                reverse=True))
            labels.append((angle, pl))
        return labels


    climgs = classify(imgs)
    for angle, res in climgs:
        print("angle={} - {}".format(angle, res[:5]))


    plot_gallery_images([img[1] for img in imgs],
                        [img[1][0][1][:15] for img in climgs])


.. image-sg:: /auto_tutorial/images/sphx_glr_plot_gbegin_transfer_learning_001.png
   :alt: plot gbegin transfer learning
   :srcset: /auto_tutorial/images/sphx_glr_plot_gbegin_transfer_learning_001.png
   :class: sphx-glr-single-img


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    angle=-12.0 - [(247.06139, 'obelisk'), (238.95375, 'car mirror'), (235.27644, 'flagpole, flagstaff'), (231.51715, 'window screen'), (230.90665, 'picket fence, paling')]
    angle=-10.0 - [(254.24683, 'car mirror'), (251.51355, 'obelisk'), (235.1051, 'groom, bridegroom'), (234.5295, 'picket fence, paling'), (232.13913, 'church, church building')]
    angle=-8.0 - [(235.56947, 'obelisk'), (226.59702, 'car mirror'), (226.46767, 'picket fence, paling'), (221.46799, 'groom, bridegroom'), (220.8851, 'fountain')]
    angle=-6.0 - [(265.50803, 'geyser'), (243.6862, 'obelisk'), (238.92964, 'fountain'), (226.73685, 'pedestal, plinth, footstall'), (226.11945, 'Great Pyrenees')]
    angle=-4.0 - [(287.74472, 'geyser'), (255.25311, 'fountain'), (236.8495, 'obelisk'), (223.02892, 'Great Pyrenees'), (222.80464, 'church, church building')]
    angle=-2.0 - [(267.63535, 'geyser'), (251.4896, 'fountain'), (214.64238, 'obelisk'), (214.56233, 'mobile home, manufactured home'), (213.12416, 'flagpole, flagstaff')]
    angle=0.0 - [(258.10968, 'geyser'), (232.90251, 'fountain'), (225.50684, 'lakeside, lakeshore'), (212.0366, 'park bench'), (205.84172, 'Samoyed, Samoyede')]
    angle=2.0 - [(222.7483, 'geyser'), (213.38457, 'fountain'), (212.24373, 'obelisk'), (198.37137, 'beacon, lighthouse, beacon light, pharos'), (197.43808, 'picket fence, paling')]
    angle=4.0 - [(221.34749, 'geyser'), (209.60358, 'fountain'), (207.06915, 'American egret, great white heron, Egretta albus'), (201.63094, 'obelisk'), (198.75664, 'Great Pyrenees')]
    angle=6.0 - [(230.98729, 'American egret, great white heron, Egretta albus'), (216.63416, 'fountain'), (212.7324, 'groom, bridegroom'), (209.60928, 'flagpole, flagstaff'), (209.46211, 'swimming trunks, bathing trunks')]
    angle=8.0 - [(253.32701, 'American egret, great white heron, Egretta albus'), (222.69963, 'golf ball'), (222.50493, 'groom, bridegroom'), (222.36345, 'sulphur-crested cockatoo, Kakatoe galerita, Cacatua galerita'), (217.73135, 'swimming trunks, bathing trunks')]
    angle=10.0 - [(244.30115, 'solar dish, solar collector, solar furnace'), (239.57332, 'flagpole, flagstaff'), (234.92137, 'picket fence, paling'), (230.62117, 'car mirror'), (221.87946, 'screen, CRT screen')]

    array([[<Axes: >, <Axes: >, <Axes: >, <Axes: >],
           [<Axes: >, <Axes: >, <Axes: >, <Axes: >],
           [<Axes: >, <Axes: >, <Axes: >, <Axes: >]], dtype=object)


.. GENERATED FROM PYTHON SOURCE LINES 157-163

Transfer learning in a pipeline
+++++++++++++++++++++++++++++++

The proposed transfer learning consists
using a PCA to projet the probabilities
on a graph.

.. GENERATED FROM PYTHON SOURCE LINES 163-181

.. code-block:: default


    with open(model_name, 'rb') as f:
        model_bytes = f.read()

    pipe = Pipeline(steps=[
        ('deep', OnnxTransformer(
            model_bytes, runtime='onnxruntime1', change_batch_size=0)),
        ('pca', PCA(2))
    ])

    X_train = numpy.vstack(
        [im2array(img) for _, img in imgs]).astype(numpy.float32)
    pipe.fit(X_train)

    proj = pipe.transform(X_train)
    print(proj)


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    [[-676.57605  -203.35477 ]
     [-570.66583  -208.09705 ]
     [-339.812     -86.33975 ]
     [ -14.555829 -168.44807 ]
     [ 357.22385  -157.6135  ]
     [ 596.3859    -90.21095 ]
     [ 918.8612    -26.340424]
     [ 499.87146   128.27252 ]
     [ 306.686     156.42966 ]
     [-125.911835  119.21939 ]
     [-446.60458   342.4585  ]
     [-504.90277   194.0244  ]]


.. GENERATED FROM PYTHON SOURCE LINES 182-184

Graph for the PCA
-----------------

.. GENERATED FROM PYTHON SOURCE LINES 184-196

.. code-block:: default


    fig, ax = plt.subplots(1, 1, figsize=(5, 5))
    ax.plot(proj[:, 0], proj[:, 1], 'o')
    ax.set_title("Projection of classification probabilities")
    text = ["%1.0f-%s" % (el[0], el[1][0][1]) for el in climgs]
    for label, x, y in zip(text, proj[:, 0], proj[:, 1]):
        ax.annotate(
            label, xy=(x, y), xytext=(-10, 10), fontsize=8,
            textcoords='offset points', ha='right', va='bottom',
            bbox=dict(boxstyle='round,pad=0.5', fc='yellow', alpha=0.5),
            arrowprops=dict(arrowstyle='->', connectionstyle='arc3,rad=0'))


.. image-sg:: /auto_tutorial/images/sphx_glr_plot_gbegin_transfer_learning_002.png
   :alt: Projection of classification probabilities
   :srcset: /auto_tutorial/images/sphx_glr_plot_gbegin_transfer_learning_002.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 197-203

Remove one layer at the end
---------------------------

The last is often removed before the model is
inserted in a pipeline. Let's see how to do that.
First, we need the list of output for every node.

.. GENERATED FROM PYTHON SOURCE LINES 203-211

.. code-block:: default


    model_onnx = onnx.load(BytesIO(model_bytes))
    outputs = []
    for node in model_onnx.graph.node:
        print(node.name, node.output)
        outputs.extend(node.output)


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    squeezenet0_conv0_fwd ['squeezenet0_conv0_fwd']
    squeezenet0_relu0_fwd ['squeezenet0_relu0_fwd']
    squeezenet0_pool0_fwd ['squeezenet0_pool0_fwd']
    squeezenet0_conv1_fwd ['squeezenet0_conv1_fwd']
    squeezenet0_relu1_fwd ['squeezenet0_relu1_fwd']
    squeezenet0_conv2_fwd ['squeezenet0_conv2_fwd']
    squeezenet0_relu2_fwd ['squeezenet0_relu2_fwd']
    squeezenet0_conv3_fwd ['squeezenet0_conv3_fwd']
    squeezenet0_relu3_fwd ['squeezenet0_relu3_fwd']
    squeezenet0_concat0 ['squeezenet0_concat0']
    squeezenet0_conv4_fwd ['squeezenet0_conv4_fwd']
    squeezenet0_relu4_fwd ['squeezenet0_relu4_fwd']
    squeezenet0_conv5_fwd ['squeezenet0_conv5_fwd']
    squeezenet0_relu5_fwd ['squeezenet0_relu5_fwd']
    squeezenet0_conv6_fwd ['squeezenet0_conv6_fwd']
    squeezenet0_relu6_fwd ['squeezenet0_relu6_fwd']
    squeezenet0_concat1 ['squeezenet0_concat1']
    squeezenet0_pool1_fwd ['squeezenet0_pool1_fwd']
    squeezenet0_conv7_fwd ['squeezenet0_conv7_fwd']
    squeezenet0_relu7_fwd ['squeezenet0_relu7_fwd']
    squeezenet0_conv8_fwd ['squeezenet0_conv8_fwd']
    squeezenet0_relu8_fwd ['squeezenet0_relu8_fwd']
    squeezenet0_conv9_fwd ['squeezenet0_conv9_fwd']
    squeezenet0_relu9_fwd ['squeezenet0_relu9_fwd']
    squeezenet0_concat2 ['squeezenet0_concat2']
    squeezenet0_conv10_fwd ['squeezenet0_conv10_fwd']
    squeezenet0_relu10_fwd ['squeezenet0_relu10_fwd']
    squeezenet0_conv11_fwd ['squeezenet0_conv11_fwd']
    squeezenet0_relu11_fwd ['squeezenet0_relu11_fwd']
    squeezenet0_conv12_fwd ['squeezenet0_conv12_fwd']
    squeezenet0_relu12_fwd ['squeezenet0_relu12_fwd']
    squeezenet0_concat3 ['squeezenet0_concat3']
    squeezenet0_pool2_fwd ['squeezenet0_pool2_fwd']
    squeezenet0_conv13_fwd ['squeezenet0_conv13_fwd']
    squeezenet0_relu13_fwd ['squeezenet0_relu13_fwd']
    squeezenet0_conv14_fwd ['squeezenet0_conv14_fwd']
    squeezenet0_relu14_fwd ['squeezenet0_relu14_fwd']
    squeezenet0_conv15_fwd ['squeezenet0_conv15_fwd']
    squeezenet0_relu15_fwd ['squeezenet0_relu15_fwd']
    squeezenet0_concat4 ['squeezenet0_concat4']
    squeezenet0_conv16_fwd ['squeezenet0_conv16_fwd']
    squeezenet0_relu16_fwd ['squeezenet0_relu16_fwd']
    squeezenet0_conv17_fwd ['squeezenet0_conv17_fwd']
    squeezenet0_relu17_fwd ['squeezenet0_relu17_fwd']
    squeezenet0_conv18_fwd ['squeezenet0_conv18_fwd']
    squeezenet0_relu18_fwd ['squeezenet0_relu18_fwd']
    squeezenet0_concat5 ['squeezenet0_concat5']
    squeezenet0_conv19_fwd ['squeezenet0_conv19_fwd']
    squeezenet0_relu19_fwd ['squeezenet0_relu19_fwd']
    squeezenet0_conv20_fwd ['squeezenet0_conv20_fwd']
    squeezenet0_relu20_fwd ['squeezenet0_relu20_fwd']
    squeezenet0_conv21_fwd ['squeezenet0_conv21_fwd']
    squeezenet0_relu21_fwd ['squeezenet0_relu21_fwd']
    squeezenet0_concat6 ['squeezenet0_concat6']
    squeezenet0_conv22_fwd ['squeezenet0_conv22_fwd']
    squeezenet0_relu22_fwd ['squeezenet0_relu22_fwd']
    squeezenet0_conv23_fwd ['squeezenet0_conv23_fwd']
    squeezenet0_relu23_fwd ['squeezenet0_relu23_fwd']
    squeezenet0_conv24_fwd ['squeezenet0_conv24_fwd']
    squeezenet0_relu24_fwd ['squeezenet0_relu24_fwd']
    squeezenet0_concat7 ['squeezenet0_concat7']
    squeezenet0_dropout0_fwd ['squeezenet0_dropout0_fwd']
    squeezenet0_conv25_fwd ['squeezenet0_conv25_fwd']
    squeezenet0_relu25_fwd ['squeezenet0_relu25_fwd']
    squeezenet0_pool3_fwd ['squeezenet0_pool3_fwd']
    squeezenet0_flatten0_reshape0 ['squeezenet0_flatten0_reshape0']


.. GENERATED FROM PYTHON SOURCE LINES 212-213

We select one of the last one.

.. GENERATED FROM PYTHON SOURCE LINES 213-217

.. code-block:: default


    selected = outputs[-3]
    print("selected", selected)


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    selected squeezenet0_relu25_fwd


.. GENERATED FROM PYTHON SOURCE LINES 218-221

And we tell *OnnxTransformer* to use that
specific one and to flatten the output
as the dimension is not a matrix.

.. GENERATED FROM PYTHON SOURCE LINES 221-235

.. code-block:: default


    pipe2 = Pipeline(steps=[
        ('deep', OnnxTransformer(
            model_bytes, runtime='onnxruntime1', change_batch_size=0,
            output_name=selected, reshape=True)),
        ('pca', PCA(2))
    ])

    try:
        pipe2.fit(X_train)
    except InvalidArgument as e:
        print("Unable to fit due to", e)


.. GENERATED FROM PYTHON SOURCE LINES 236-241

We check that it is different.
The following values are the shape of the
PCA components. The number of column is the number
of dimensions of the outputs of the transfered
neural network.

.. GENERATED FROM PYTHON SOURCE LINES 241-245

.. code-block:: default


    print(pipe.steps[1][1].components_.shape,
          pipe2.steps[1][1].components_.shape)


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    (2, 1000) (2, 169000)


.. GENERATED FROM PYTHON SOURCE LINES 246-247

Graph again.

.. GENERATED FROM PYTHON SOURCE LINES 247-260

.. code-block:: default


    proj2 = pipe2.transform(X_train)

    fig, ax = plt.subplots(1, 1, figsize=(5, 5))
    ax.plot(proj2[:, 0], proj2[:, 1], 'o')
    ax.set_title("Second projection of classification probabilities")
    text = ["%1.0f-%s" % (el[0], el[1][0][1]) for el in climgs]
    for label, x, y in zip(text, proj2[:, 0], proj2[:, 1]):
        ax.annotate(
            label, xy=(x, y), xytext=(-10, 10), fontsize=8,
            textcoords='offset points', ha='right', va='bottom',
            bbox=dict(boxstyle='round,pad=0.5', fc='yellow', alpha=0.5),
            arrowprops=dict(arrowstyle='->', connectionstyle='arc3,rad=0'))


.. image-sg:: /auto_tutorial/images/sphx_glr_plot_gbegin_transfer_learning_003.png
   :alt: Second projection of classification probabilities
   :srcset: /auto_tutorial/images/sphx_glr_plot_gbegin_transfer_learning_003.png
   :class: sphx-glr-single-img


.. rst-class:: sphx-glr-timing

   **Total running time of the script:** ( 0 minutes  13.228 seconds)


.. _sphx_glr_download_auto_tutorial_plot_gbegin_transfer_learning.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example


    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: plot_gbegin_transfer_learning.py <plot_gbegin_transfer_learning.py>`

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: plot_gbegin_transfer_learning.ipynb <plot_gbegin_transfer_learning.ipynb>`


.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_