XD blog


~technical


2015-12-11 Automated machine learning

Instead of trying to fit the best model to your dataset yourself, what if you tried to learn a model that does it for you? There is a workshop dedicated to this, AutoML workshop @ ICML'15, and a module, auto-sklearn.

And, as always, awesome-machine-learning.

2015-12-01 A few blog posts, Rodeo, TensorFlow, Tableau, Autoreload, RLPy

Rodeo makes it easier to write reports with equations, code and graphs. The report is converted to markdown and PDF: Rodeo 1.1 - Markdown, Autoupdates, Feedback.

When a module is updated, the changes are not picked up automatically in a notebook. The module has to be reloaded. An extension does that for you: Autoreload des modules sous iPython.
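In IPython, the extension is enabled with the magics %load_ext autoreload followed by %autoreload 2. To see what it spares you, here is a minimal sketch of the manual equivalent in plain Python, based on importlib.reload. The module name demo_mod_autoreload and its content are made up for the illustration.

```python
import importlib
import sys
import tempfile
from pathlib import Path


def demo_reload():
    """Write a throwaway module, import it, edit it on disk, reload it by hand."""
    name = "demo_mod_autoreload"  # hypothetical module name, only for this demo
    with tempfile.TemporaryDirectory() as folder:
        path = Path(folder) / (name + ".py")
        path.write_text("VALUE = 1\n")
        sys.path.insert(0, folder)
        try:
            importlib.invalidate_caches()
            module = importlib.import_module(name)
            before = module.VALUE
            # The new source has a different size, so the cached bytecode
            # is considered stale and the file is recompiled on reload.
            path.write_text("VALUE = 2  # edited on disk\n")
            module = importlib.reload(module)
            after = module.VALUE
        finally:
            sys.path.remove(folder)
            sys.modules.pop(name, None)
    return before, after
```

Without the call to importlib.reload, the second read would still see the value loaded at import time, which is the behaviour the extension works around.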

Tableau is an application, free in some cases, that makes it easy to build dashboards and quickly visualize data with animated charts.

A module for reinforcement learning: RLPy: A Value-Function-Based Reinforcement Learning Framework for Education and Research. Also worth reading: Batch Learning from Logged Bandit Feedback through Counterfactual Risk Minimization.

An article about TensorFlow: What you wanted to know about TensorFlow.

2015-11-17 Building xgboost on Python 3.5

Go to Build xgboost on Python 3.5.

2015-11-12 Python 3.5, Scipy and scikit-learn on Windows

With Python 3.5 on Windows, the module scipy.sparse requires Visual Studio 2015 Community Edition; otherwise it displays an error message: Issue with Scipy on Windows.

2015-10-27 Travis, Appveyor, PyPI

I was surprised to see that a module I develop to produce my teaching materials (pyquickhelper) was downloaded 14k times last month on PyPI. That seems like a lot, but Travis downloads it every time I commit something to GitHub, and I made 50 commits last week. I would say that the number of times I download this module manually is not significant compared to the number of times it gets downloaded automatically. It seems difficult to make sense of the counts reported by PyPI.

2015-09-26 Python 3.5 and virtual environments on Windows

I began to test my modules against Python 3.5. As I use virtual environments, I discovered the following issue: virtualenv fails with Python 3.5 on Windows. Surprisingly, it works on another machine, probably because a different set of software is installed on it. I guess I'll wait a little bit before digging into it or trying to fix it on my own.

2015-09-25 MOOC Big Data

Three days ago, Le Monde published the article Treize MOOC à suivre en cette rentrée. The first one on the list, Comprendre le Big Data à travers les films de cinéma, is produced by the ENSAE-ENSAI-CEPE group.

2015-09-07 A few applications of Big Data

A few glimpses of our daily life a few years from now:

2015-09-04 IoT in Python

I have not had time to give it a try, but homeassistant looks promising judging by its website: Home Assistant.

2015-08-24 Open the notebook with a different browser

I was looking for an easy way to launch the notebook server with a browser other than the default one. I created a batch file (for Windows but easily adaptable to Linux):

set PATH=%PATH%;C:\Python34_x64;C:\Python34_x64\Scripts
set PYTHONPATH=<extra_path>;%PYTHONPATH%
set BROWSER="C:\Program Files (x86)\Google\Chrome\Application\chrome.exe"
jupyter-notebook --notebook-dir=<your_folder_for_notebooks> --port=XXXX

The notebook opens in Chrome at the following URL: http://localhost:XXXX.
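The same idea can be sketched in Python, which avoids maintaining one script per platform. This is only a sketch: it assumes, as the batch file does, that the notebook server honours the BROWSER environment variable. The function names notebook_command and launch_notebook are made up for the illustration, and the folder, port and browser are placeholders to fill in.

```python
import os
import subprocess


def notebook_command(notebook_dir, port, browser, extra_path=None, environ=None):
    """Build the command line and the environment, without starting anything."""
    env = dict(os.environ if environ is None else environ)
    env["BROWSER"] = browser  # same variable as in the batch file
    if extra_path:
        previous = env.get("PYTHONPATH")
        # Prepend the extra path, as the batch file does.
        env["PYTHONPATH"] = (
            extra_path if not previous else extra_path + os.pathsep + previous
        )
    cmd = [
        "jupyter-notebook",
        "--notebook-dir=" + notebook_dir,
        "--port=" + str(port),
    ]
    return cmd, env


def launch_notebook(notebook_dir, port, browser, extra_path=None):
    """Start the notebook server; requires jupyter-notebook on the PATH."""
    cmd, env = notebook_command(notebook_dir, port, browser, extra_path)
    return subprocess.Popen(cmd, env=env)
```

Calling launch_notebook with the notebook folder, the port and the path to the browser should then behave like the batch file above. Separating notebook_command from the launch makes it possible to inspect the command before running it.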

2015-08-23 Building xgboost on Windows for Python

I'm unlucky. The day I decided to deal with xgboost on Windows, I saw, a couple of hours later, a commit which does just that. xgboost is now on PyPI even if the version for Python 3 is not ready yet (Missing parentheses in call to 'print').
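That error message is what Python 3 reports when it meets a Python 2 print statement. The small sketch below reproduces it with the built-in compile function; the helper name compiles_on_python3 is made up for the illustration and has nothing to do with xgboost itself.

```python
def compiles_on_python3(source):
    """Return (True, "") if the source compiles, else (False, error message)."""
    try:
        compile(source, "<example>", "exec")
        return True, ""
    except SyntaxError as exc:
        return False, exc.msg


# Python 2 style: print is a statement, rejected by Python 3.
old_style = compiles_on_python3("print 'hello'")
# Python 3 style: print is a function.
new_style = compiles_on_python3("print('hello')")
```

The exact wording of the message varies slightly between Python 3 versions, but it starts with the text quoted above.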


2015-08-17 Github, Gitlab, Gogs

I use GitHub for my open source projects and GitLab at my school. But I prefer GitHub's interface, so maybe I'll try to switch to Gogs. See Gogs, an alternative to Gitlab: it seems quite efficient and works on all platforms even if it does not implement all features yet.

2015-08-14 IPython becomes Jupyter

It was announced on their blog: the module IPython was split into many modules. Here is the list I needed to install while upgrading to Jupyter 4.0.

ipykernel
ipython
ipython_genutils
ipywidgets
jupyter
jupyter_client
jupyter_core
jupyter-console
nbformat
nbconvert
notebook
path.py
pickleshare
qtconsole
simplegeneric
traitlets

With Anaconda, you get all these updates by typing: conda install jupyter.
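To check which of these modules are still missing after an upgrade, a short script can look each of them up without importing it. This is a sketch: the mapping from package names to import names is my assumption (three names in the list differ from the name used in an import statement), and the function name missing_modules is made up for the illustration.

```python
from importlib.util import find_spec

# Package name -> import name. Assumed mapping: ipython, jupyter-console
# and path.py are imported under a different name than the one in the list.
PACKAGES = {
    "ipykernel": "ipykernel",
    "ipython": "IPython",
    "ipython_genutils": "ipython_genutils",
    "ipywidgets": "ipywidgets",
    "jupyter": "jupyter",
    "jupyter_client": "jupyter_client",
    "jupyter_core": "jupyter_core",
    "jupyter-console": "jupyter_console",
    "nbformat": "nbformat",
    "nbconvert": "nbconvert",
    "notebook": "notebook",
    "path.py": "path",
    "pickleshare": "pickleshare",
    "qtconsole": "qtconsole",
    "simplegeneric": "simplegeneric",
    "traitlets": "traitlets",
}


def missing_modules(packages=PACKAGES):
    """Return the sorted package names whose import name cannot be found."""
    return sorted(
        package
        for package, module in packages.items()
        if find_spec(module) is None
    )
```

Calling missing_modules() returns the packages that still need to be installed in the current environment.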

2015-08-02 Which version of R to install for bigmemory?

While trying to install a package for R on Windows, I realized not all of them are available for the latest version. The package bigmemory does not seem to exist on Windows (click on the link). However, after a couple of web searches, you can find that the following instruction might work:

install.packages("bigmemory", repos="http://R-Forge.R-project.org")

But it does not work on the latest version, only on R-3.1. That's what this page indicates: R Development Page (click on this link).

2015-07-28 PyData 2015 in Seattle

I attended my first PyData conference in Seattle and I must say I learned a lot. I discovered much more than I could ever have found by searching the Internet for a library for a precise need. It was really worth taking a plane to attend. Most of all, I found people very passionate, constantly looking for improvement. So passionate that I would definitely recommend Python over R as a first choice for a machine learning language. R seems to grow only through the number of available packages, but Python is catching up. And its ecosystem is also expanding through various initiatives to improve plotting or the handling of very big datasets.

I would not be surprised if a language named Rython pops up one day.


Xavier Dupré