{"cells": [{"cell_type": "markdown", "metadata": {}, "source": ["# 2A.ml - Classification binaire avec features textuelles - correction\n", "\n", "Ce notebook propose de voir comment incorporer des features pour voir l'am\u00e9lioration des performances sur une classification binaire. "]}, {"cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [{"data": {"text/html": ["
\n", ""], "text/plain": ["\n", " | code | \n", "url | \n", "creator | \n", "created_t | \n", "created_datetime | \n", "last_modified_t | \n", "last_modified_datetime | \n", "product_name | \n", "generic_name | \n", "quantity | \n", "... | \n", "collagen-meat-protein-ratio_100g | \n", "cocoa_100g | \n", "chlorophyl_100g | \n", "carbon-footprint_100g | \n", "nutrition-score-fr_100g | \n", "nutrition-score-uk_100g | \n", "glycemic-index_100g | \n", "water-hardness_100g | \n", "hasE | \n", "s100 | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "1.008255e+10 | \n", "http://world-fr.openfoodfacts.org/produit/0010... | \n", "usda-ndb-import | \n", "1489064583 | \n", "2017-03-09T13:03:03Z | \n", "1489064583 | \n", "2017-03-09T13:03:03Z | \n", "Golden Island, Pork Jerky, Grilled Barbecue | \n", "NaN | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "23.0 | \n", "23.0 | \n", "NaN | \n", "NaN | \n", "False | \n", "17.0 | \n", "
1 | \n", "1.182204e+10 | \n", "http://world-fr.openfoodfacts.org/produit/0011... | \n", "usda-ndb-import | \n", "1489070197 | \n", "2017-03-09T14:36:37Z | \n", "1489070197 | \n", "2017-03-09T14:36:37Z | \n", "Big Fizz, Soda, Orange | \n", "NaN | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "True | \n", "7.0 | \n", "
2 | \n", "2.548401e+10 | \n", "http://world-fr.openfoodfacts.org/produit/0025... | \n", "usda-ndb-import | \n", "1489052024 | \n", "2017-03-09T09:33:44Z | \n", "1489052024 | \n", "2017-03-09T09:33:44Z | \n", "Tofubaked Marinated Baked Tofu, Sesame Ginger | \n", "NaN | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "-2.0 | \n", "-2.0 | \n", "NaN | \n", "NaN | \n", "True | \n", "17.0 | \n", "
3 | \n", "1.229250e+10 | \n", "http://world-fr.openfoodfacts.org/produit/0012... | \n", "usda-ndb-import | \n", "1489133493 | \n", "2017-03-10T08:11:33Z | \n", "1489133493 | \n", "2017-03-10T08:11:33Z | \n", "Milk Chocolate Eggs | \n", "NaN | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "23.0 | \n", "23.0 | \n", "NaN | \n", "NaN | \n", "True | \n", "17.0 | \n", "
4 | \n", "1.115054e+10 | \n", "http://world-fr.openfoodfacts.org/produit/0011... | \n", "usda-ndb-import | \n", "1489052892 | \n", "2017-03-09T09:48:12Z | \n", "1489052892 | \n", "2017-03-09T09:48:12Z | \n", "Fresh Polish Sausage | \n", "NaN | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "22.0 | \n", "22.0 | \n", "NaN | \n", "NaN | \n", "True | \n", "17.0 | \n", "
5 rows \u00d7 165 columns
\n", "