Getting started

When it comes to statistics, the Python language is far from being as complete as R, which was designed for that purpose. Several dozen modules have to be added to it. This is when one discovers the subtleties of the different operating systems, compilers, dependencies, and the command line. This section describes one way to install Python on the three main systems (Windows, OS X, Linux) together with the modules needed for this course. The first section explains how to install Python quickly; read the following ones if your needs go beyond that.

Notes

The vast majority of the examples and notebooks on this site are tested once a week. The distribution used varies but relies on the latest versions of the modules. A smaller share is also tested at every change on github/sdpython on Windows, Linux (Ubuntu) and Linux (Debian).

2018-2019

For the 2018-2019 year, tests now run on Linux Debian 9 with the standard Python 3.7.0 distribution. Setting up this machine is rather tedious. The installation steps are recorded in this blog post: Install Python 3.7 and many packages on Linux Debian 9. A few problems remain, such as TensorFlow, which is not yet compatible with Python 3.7 (August 2018), but this pull request should solve the problem.

2017-2018

All modules were tested on Windows 10 with a standard 64-bit Python distribution, first Python 3.6.4 and then Python 3.6.5.

In short: Anaconda

In short, the most common advice given to those who want to install Python is to use the Anaconda distribution. It is the equivalent of R. Without any extra step, it covers matrix computation (numpy), plotting (matplotlib), data manipulation (pandas) and machine learning (scikit-learn). Most of the exercises on this site need nothing more than what this standard distribution provides. For the others, the instructions below work on Windows, Linux and Mac.

  • Install Anaconda (64-bit Python)

  • Update the distribution with conda update --all (from the command line).

To install the module implemented for this course:

  • pip install ensae_teaching_cs
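Once the installation is done, one way to confirm that the four packages named above are visible to the interpreter is sketched below. This is only an illustration, not part of the course tooling: the helper name missing_modules is hypothetical, and the list uses import names (scikit-learn is imported as sklearn).

```python
import importlib.util

# Import names of the four packages mentioned above.
# Note: scikit-learn is imported under the name "sklearn".
CORE_MODULES = ["numpy", "matplotlib", "pandas", "sklearn"]


def missing_modules(names):
    """Return the names that the import system cannot locate."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(CORE_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All core modules are available.")
```

The check only locates the modules without importing them, so it stays fast even for large packages.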

Windows

Some modules require a C++ compilation. Anaconda provides most of them. For the others, go to Unofficial Windows Binaries for Python Extension Packages. Some modules do not exist in precompiled form unless you build them yourself, which is what I did for a few modules such as dlib.

pip install pymyinstall

Linux from the command line / MacOS

See Install Miniconda through SSH connection. This works with the Ubuntu 14.04 and Ubuntu 16.04 distributions. You can, for example, follow the instructions used to test and build this site, available in the file config.yml. The outputs available on the CircleCI site can help you understand errors if you run into any.

The same instructions should work on MacOS from a terminal window because that system is Unix-based, like Linux.

IDE

An IDE is an integrated development environment. Since not everything is done from a notebook, you need to pick one.

Text editor

If you are on a train and running low on battery, go back to basics: a text editor.

Installing a module

pip, python and the command line

The Python language comes with a distribution system for modules (or packages) that is easily accessible from the command line. On Windows, the command line is started with the cmd command. On Linux or OS/X, it is a terminal window. Then simply move to the Python installation directory:

cd c:\Python370_x64\Scripts

Or:

cd c:\Anaconda3\Scripts

Then type:

pip install <module>

On Linux or OS/X (Apple), the command line is called the terminal. Since Python is already installed in version 2.7, I recommend installing the Anaconda distribution in version 3.6, which makes it easier for several Python versions to coexist. The procedure is the same:

cd /home/<alias>/anaconda3/bin

Then:

pip install <module>

To make sure this matches the intended Python version, simply ask for the installed version:

pip --version
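When several Python versions coexist, a common way to avoid ambiguity is to call pip through the interpreter itself (python -m pip) rather than through whichever pip script comes first on the PATH. The sketch below only builds and prints such a command; the helper name pip_command is hypothetical.

```python
import sys


def pip_command(*args):
    """Build a pip command line bound to the interpreter running this script."""
    return [sys.executable, "-m", "pip", *args]


if __name__ == "__main__":
    print("Interpreter:", sys.executable)
    print("Python version:", sys.version.split()[0])
    # Equivalent of "pip --version", but tied to this exact interpreter.
    print("Command:", " ".join(pip_command("--version")))
    # To actually run it:
    # import subprocess
    # subprocess.run(pip_command("--version"), check=True)
```

With this approach there is no need to change directory to Scripts or bin first, since the interpreter locates its own pip.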

On Windows, when adding a single module, if the instruction pip install <module> fails, it is most likely because the module contains a C++ part. In that case, check on the site Unofficial Windows Binaries for Python Extension Packages whether it is available. If it is not, installing the module is best left to experts.

conda or pip

Anaconda maintains its own versions of Python libraries. For every module in the Anaconda Package List, use conda install <module>. For the others, use pip install <module>. This approach only works with the Anaconda distribution.

On Windows, pip install often fails when a module is implemented in Python and C++ and no precompiled build is available, because the C++ part then has to be compiled locally. This is why installing a precompiled version is preferable.
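To know whether conda is an option at all, it helps to detect whether the current interpreter belongs to a conda installation. A minimal sketch, assuming the usual layout in which conda environments contain a conda-meta directory at their root; the function name is hypothetical and this is a heuristic rather than an official API.

```python
import os
import sys


def is_conda_environment(prefix=None):
    """Heuristic: a conda environment has a "conda-meta" directory at its root."""
    prefix = sys.prefix if prefix is None else prefix
    return os.path.isdir(os.path.join(prefix, "conda-meta"))


if __name__ == "__main__":
    if is_conda_environment():
        print("conda environment detected: prefer conda install for packaged modules.")
    else:
        print("No conda environment detected: use pip install.")
```

The check does not tell whether a given module appears in the Anaconda Package List; that still has to be looked up.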

Dependencies

By default, installing a module also installs its dependencies, which can be avoided:

pip install <module> --no-deps
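Before skipping dependencies, it can be useful to see which ones an installed module declares. The sketch below relies on importlib.metadata from the standard library (Python 3.8 or later, which is more recent than the versions mentioned on this page); the helper name declared_dependencies is hypothetical.

```python
from importlib import metadata


def declared_dependencies(distribution):
    """Return the requirement strings declared by an installed distribution.

    An empty list is returned when the distribution is not installed
    or declares no dependency.
    """
    try:
        requirements = metadata.requires(distribution)
    except metadata.PackageNotFoundError:
        return []
    # requires() returns None when no dependency is declared.
    return list(requirements or [])


if __name__ == "__main__":
    for requirement in declared_dependencies("pandas"):
        print(requirement)
```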

Installing a module with pymy_install

The pymyinstall module looks for the best way to install a module whatever your installation. To install or update it:

pip install pymyinstall --upgrade

Installing the module creates two scripts: pymy_install3 to install a module and pymy_update3 to update one. The module can also install a set of modules. The following line installs a module developed for this course together with its dependencies:

pymy_install3 --set=pyensae

Uninstalling the modules implemented for this course

The modules installed for these courses can be uninstalled simply:

    pip uninstall -y _benchmarks
    pip uninstall -y _check_python_install
    pip uninstall -y actuariat_python
    pip uninstall -y botadi
    pip uninstall -y code_beatrix
    pip uninstall -y cpyquickhelper
    pip uninstall -y csharpy
    pip uninstall -y csharpyml
    pip uninstall -y ensae_projects
    pip uninstall -y ensae_teaching_cs
    pip uninstall -y ensae_teaching_dl
    pip uninstall -y jupytalk
    pip uninstall -y jyquickhelper
    pip uninstall -y lecture_citation
    pip uninstall -y lightmlboard
    pip uninstall -y lightmlrestapi
    pip uninstall -y machinelearningext
    pip uninstall -y manydataapi
    pip uninstall -y mathenjeu
    pip uninstall -y mlinsights
    pip uninstall -y mlprodict
    pip uninstall -y mlstatpy
    pip uninstall -y myblog
    pip uninstall -y pandas_streaming
    pip uninstall -y papierstat
    pip uninstall -y pyenbc
    pip uninstall -y pyensae
    pip uninstall -y pymlbenchmark
    pip uninstall -y pymmails
    pip uninstall -y pymyinstall
    pip uninstall -y pyquickhelper
    pip uninstall -y pyrsslocal
    pip uninstall -y pysqllike
    pip uninstall -y python3_module_template
    pip uninstall -y sparkouille
    pip uninstall -y teachpyx
    pip uninstall -y tkinterquickhelper
    pip uninstall -y wrapclib
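Since pip uninstall accepts several package names at once, the list above can be reduced to a single command restricted to the modules that are actually installed. A minimal sketch, assuming Python 3.8 or later for importlib.metadata; the helper names are hypothetical and COURSE_MODULES is only an excerpt of the list above.

```python
from importlib import metadata

# Excerpt of the list above; extend it as needed.
COURSE_MODULES = ["ensae_teaching_cs", "pyensae", "pymyinstall", "pyquickhelper"]


def installed_among(names):
    """Keep only the names that correspond to an installed distribution."""
    installed = []
    for name in names:
        try:
            metadata.version(name)
        except metadata.PackageNotFoundError:
            continue
        installed.append(name)
    return installed


def uninstall_command(names):
    """Build a single "pip uninstall -y" command for all the given names."""
    return ["pip", "uninstall", "-y", *names]


if __name__ == "__main__":
    found = installed_among(COURSE_MODULES)
    if found:
        print(" ".join(uninstall_command(found)))
    else:
        print("None of the listed modules is installed.")
```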

Configuration for these courses

The notebooks use the pyensae module developed for these courses. To install its dependencies, use the pymyinstall module. The dependencies are installed as follows:

pymy_install3 --set=ensae_teaching_cs
pip install ensae_teaching_cs

Some notebooks require additional tools:

Distributions

  • Anaconda (Windows, Linux, Mac). On Linux or OS/X, the distribution does not interfere with the existing system distribution, which is often a different version. This is a very valuable point. The modules in the distribution are not all up to date. Remember to update them with the command conda install <module> from the Anaconda3/Scripts directory (conda install cvxopt for example). A different version exists: miniconda. Its list of missing packages will probably be different. Typing conda update --all on the command line updates all modules.

  • WinPython (Windows). It sometimes includes R or Julia (these versions are not as up to date as the main version), which makes it easy to switch from Python to R or Julia within the same notebook. Available on Windows only, this installation has the advantage of not requiring administrator rights. It does not modify the registry and can be copied as is onto a USB stick and then onto another computer. You can also prepare your own version: How To Make WinPython.

  • Official Python distribution. Many modules must then be installed (see Exhaustive list of Python modules) to obtain a distribution equivalent to the two above.

  • miniconda is a lightweight version of Anaconda without all these packages. It can be installed from the command line, over an SSH connection. See Install Anaconda through SSH connection.

The package lists of WinPython and Anaconda are excellent ways to discover interesting new modules.

Essential modules for a data scientist

The essential modules are included in the Anaconda and WinPython distributions.

The essentials

  • dask: distributed dataframes able to handle large volumes of data (> 5 GB)

  • Jupyter: notebook management (blank pages mixing code, equations, plots)

  • matplotlib: scientific plots

  • numpy: matrix computation

  • pandas: DataFrame handling

  • Scipy: scientific computing

  • scikit-learn: machine learning, descriptive statistics

  • statsmodels: time series

Visualization

See 10 plotting libraries at PyData 06/14/2016 in Paris.

Games

  • pygame + thorpy

  • kivy: for building games or applications for tablets and phones

For labs and projects at ENSAE

For machine learning without programming

Tools and resources for development

Developing a computer program takes time, and it is important to be comfortable. One major difficulty when programming is working with several people on the same project: you have to stay synchronized. Fortunately, the problem has long been known, and there are many open source tools one would be wrong to do without, as well as services that are free under certain conditions and make archiving easier.

Miscellaneous

Remote source control

Git

git is a source control tool. It has supplanted all the others, and knowing it is essential today. One does not always remember the commands, but a search engine quickly provides the answer. See also Cheat Sheet.

Remote archiving

  • hubiC (25 GB free - August 2015)

  • OneDrive (15 GB free - August 2015)

These are not the only ones; you will find other options here: cloud-gratuit. However, be careful with sensitive personal data. It is not always possible to choose where the data is stored, and each country has different legislation. Even if your data is password-protected and not public, passwords do sometimes get intercepted.

File comparison

Sharing notes and ideas

Text editor

  • SciTE: the simplest; no file explorer, no installer, distracting autocompletion

  • TextWrangler (macOS only - Apple)

  • SublimeText: configuration required before running a Python script

  • Notepad++: configuration required before running a Python script

IDE

Python and home automation

Browser

Browsers matter when using notebooks. I recommend either Firefox or Chrome. These two browsers are essential if you insert javascript into the notebooks. Chrome’s debugger is the most convenient one when it comes to digging into style sheets or watching javascript execute.

Documentation

The documentation and the unit tests require the modules classified in the SPHINX and TEACH categories (see the table below). Some practical sessions use data from this site. It can easily be downloaded with these modules:

  • pyquickhelper: this module builds this course

  • pyensae: assorted tools for ENSAE students

  • pymyinstall: easy installation of modules on Windows

To be built, the documentation also requires:

Continuous build

Compression

  • 7zip: to compress and decompress all formats

Resources

Setup for developing this course (Windows)

For each version of Python, install pymyinstall, run pymy_install, then remove the modules you want to build and test (see Uninstalling the modules implemented for this course).

A few special modules: no longer well maintained but sometimes useful and sometimes modified

pip install https://github.com/sdpython/pyPdf/archive/trunk.zip

In addition:

For Jenkins, a few extensions:

For Jupyter:

pip install widgetsnbextension
jupyter nbextension enable --py --sys-prefix widgetsnbextension

A local server must be started; the command line looks like this:

c:\Python370_x64\Scripts\pypi-server.exe -u -p 8067 --disable-fallback ..\..\local_pypi\local_pypi_server
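Once such a server is running, pip can be pointed at it. The sketch below builds the corresponding command, assuming the server listens on localhost on port 8067 (the port used above) and exposes the usual /simple/ index path of pypiserver. The helper names are hypothetical, and --extra-index-url is chosen here so that PyPI remains available for packages the local server does not hold.

```python
def local_index_url(port=8067, host="localhost"):
    """URL of the simple index served by the local server (assumed layout)."""
    return f"http://{host}:{port}/simple/"


def pip_install_from_local(module, port=8067):
    """Build a pip command that also searches the local index."""
    return ["pip", "install", "--extra-index-url", local_index_url(port), module]


if __name__ == "__main__":
    print(" ".join(pip_install_from_local("pyensae")))
```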

If the Jenkins server uses passwords stored with keyring, which is the case for several modules used in these courses, the Jenkins server must run as an authenticated service. On Windows, search for services.msc and fill in the credentials. For some projects (such as building pywin32), the Windows SDK must be installed.

Keeping your Python distribution up to date

Handling data is different from knowing how to program. Although programming is necessary for handling data, it is unthinkable today to ignore what other programmers have made freely available to everyone. All the modules suggested below are used by many people and are well suited to data manipulation. As a result, they benefit from rapid development and from a robustness that would take a good programmer about a year to reach with a tool of their own over the same range of features (spending 10 to 20% of their time on it).

I have tried to gather the tools that let an engineer, statistician or data scientist handle data easily, from a few kilobytes to a few gigabytes. As a data scientist, I very regularly pick items from the first seven chapters. The next seven are only useful from time to time, especially when the data is larger than 250 MB.

The point is not to do everything in Python; the point is to be agile, to spend as little time as possible on implementation and as much time as possible on the data.

Lists of modules

A few articles

Links, blogs to follow

Articles, books, videos

Exhaustive list of Python modules

The following modules are part of the setup offered to students (see below).

  • usage: category; the most important one, DATA/ML, groups the most important modules for machine learning

  • name: name of the module

  • kind: how to install the module on Windows; wheel means that the module includes a C++ part that is best retrieved already compiled from the site Unofficial Windows Binaries for Python Extension Packages.

  • version: the version to install, because others may cause conflicts

  • license: license of the module; not all of them allow commercial use, see choose a license, licences commentées

  • purpose: more detailed description
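To illustrate how these columns can be used, the sketch below groups a few rows by their kind column, which separates the modules installable with pip from those best retrieved as precompiled wheels. The rows are copied from the table of modules on this page; the dictionary layout and the function name are assumptions made for the example and do not reflect the output format of rst_table_modules.

```python
from collections import defaultdict

# A few rows copied from the table of modules (version is empty in the source).
ROWS = [
    {"usage": "", "name": "abcpmc", "kind": "pip", "license": "GPLv3+"},
    {"usage": "ML", "name": "ad3", "kind": "wheel", "license": "GPLv3"},
    {"usage": "SPHINX", "name": "alabaster", "kind": "pip", "license": "BSD"},
    {"usage": "", "name": "astropy", "kind": "wheel", "license": "BSD"},
]


def names_by_kind(rows):
    """Group module names by installation kind (pip, wheel, ...)."""
    groups = defaultdict(list)
    for row in rows:
        groups[row["kind"]].append(row["name"])
    return dict(groups)


if __name__ == "__main__":
    for kind, names in names_by_kind(ROWS).items():
        print(kind, "->", ", ".join(names))
```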

<<<

from ensae_teaching_cs.automation import rst_table_modules
print(rst_table_modules())

>>>

index

usage

name

kind

version

license

purpose

0

abcpmc

pip

GPLv3+

Monte Carlo and ABC methods

1

absl-py

pip

Apache 2.0

Collection of Python library code for building Python applications.

2

TEACH

actuariat_python

pip

MIT

teachings, insurance examples

3

ML

ad3

wheel

GPLv3

AD3 (approximate MAP decoder with Alternating Direction Dual Decomposition)

4

adal

pip

MIT

The ADAL for Python library makes it easy for python application to authenticate to Azure Active Directory (AAD) in order to access AAD protected web resources.

5

aenum

pip

Advanced Enumerations (compatible with Python’s stdlib Enum), NamedTuples, and NamedConstants

6

SPHINX

alabaster

pip

BSD

A configurable sidebar-enabled Sphinx theme

7

alembic

pip

A database migration tool for SQLAlchemy.

8

OPTIM

algopy

pip

ALGOPY: Taylor Arithmetic Computation and Algorithmic Differentiation

9

amqp

pip

LGPL

Low-level AMQP client for Python (fork of amqplib) (for celery)

10

ansi2html

pip

GPLv3+

Convert text with ANSI color codes to HTML

11

ansiconv

pip

MIT

A Python module for converting ANSI coded text and converts it to either plain text or HTML.

12

antlr4-python3-runtime

pip

BSD

antlr4 runtime, grammar parser

13

anyjson

pip

BSD

Wraps the best available JSON implementation available in a common interface (for celery)

14

AnyQt

pip

GPLv3

PyQt4/PyQt5 compatibility layer.

15

apache-libcloud

pip

A standard Python library that abstracts away differences among multiple cloud provider APIs.

16

appdirs

pip

MIT

A small Python module for determining appropriate platform-specific dirs

17

apscheduler

pip

MIT

to schedule the execution of jobs, tasks

18

arrow

pip

Better dates and times for Python

19

asciitree

pip

MIT

Draws ASCII trees.

20

asn1crypto

pip

MIT

Fast ASN.1 parser and serializer with definitions for private keys, public keys, certificates, CRL, OCSP, CMS, PKCS#3, PKCS#7, PKCS#8, PKCS#12, PKCS#5, X.509 and TSP

21

asteval

pip

Safe, minimalistic evaluator of python expression using ast module

22

astor

pip

BSD-3-Clause

astor is designed to allow easy manipulation of Python source via the AST.

23

astroid

pip

LGPL

A abstract syntax tree for Python with inference support.

24

astropy

wheel

BSD

Community-developed python astronomy tools

25

async_generator

pip

for jupyterhup

26

atomicwrites

pip

MIT

Atomic file writes.

27

attrs

pip

MIT

Attributes without boilerplate.

28

autograd

pip

MIT

Efficiently computes derivatives of numpy code.

29

Automat

pip

MIT

Self-service finite-state machines for the programmer on the go.

30

STYLE

autopep8

pip

Expat License

apply pep8 on a script

31

autopy3

wheel

MIT

A simple, cross-platform GUI automation toolkit for Python 3 (issue on Linux and Anaconda)

32

AZURE

azure

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

33

AZURE

azure-batch

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

34

AZURE

azure-common

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

35

AZURE

azure-datalake-store

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

36

AZURE

azure-graphrbac

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

37

AZURE

azure-keyvault

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

38

AZURE

azure-mgmt

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

39

AZURE

azure-mgmt-authorization

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

40

AZURE

azure-mgmt-batch

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

41

AZURE

azure-mgmt-cdn

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

42

AZURE

azure-mgmt-cognitiveservices

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

43

AZURE

azure-mgmt-commerce

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

44

AZURE

azure-mgmt-common

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

45

AZURE

azure-mgmt-compute

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

46

AZURE

azure-mgmt-containerregistry

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

47

AZURE

azure-mgmt-datalake-analytics

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

48

AZURE

azure-mgmt-datalake-nspkg

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

49

AZURE

azure-mgmt-datalake-store

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

50

AZURE

azure-mgmt-devtestlabs

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

51

AZURE

azure-mgmt-dns

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

52

AZURE

azure-mgmt-documentdb

pip

MIT

Python wrapper for Azure API (HDInsight, Blog Storage)

53

AZURE

azure-mgmt-iothub

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

54

AZURE

azure-mgmt-keyvault

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

55

AZURE

azure-mgmt-logic

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

56

AZURE

azure-mgmt-monitor

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

57

AZURE

azure-mgmt-network

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

58

AZURE

azure-mgmt-notificationhubs

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

59

AZURE

azure-mgmt-nspkg

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

60

AZURE

azure-mgmt-powerbiembedded

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

61

AZURE

azure-mgmt-rdbms

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

62

AZURE

azure-mgmt-redis

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

63

AZURE

azure-mgmt-resource

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

64

AZURE

azure-mgmt-scheduler

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

65

AZURE

azure-mgmt-sql

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

66

AZURE

azure-mgmt-storage

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

67

AZURE

azure-mgmt-trafficmanager

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

68

AZURE

azure-mgmt-web

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

69

AZURE

azure-nspkg

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

70

AZURE

azure-servicebus

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

71

AZURE

azure-servicefabric

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

72

AZURE

azure-servicemanagement-legacy

pip

Apache License 2.0

Python wrapper for Azure API (HDInsight, Blog Storage)

73

AZURE

azure-storage

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

74

AZURE

azure-storage-nspkg

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

75

AZURE

azure_applicationinsights

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

76

AZURE

azure_cosmosdb_nspkg

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

77

AZURE

azure_cosmosdb_table

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

78

AZURE

azure_eventgrid

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

79

AZURE

azure_loganalytics

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

80

AZURE

azure_mgmt_advisor

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

81

AZURE

azure_mgmt_applicationinsights

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

82

AZURE

azure_mgmt_batchai

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

83

AZURE

azure_mgmt_billing

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

84

AZURE

azure_mgmt_consumption

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

85

AZURE

azure_mgmt_containerinstance

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

86

AZURE

azure_mgmt_containerservice

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

87

AZURE

azure_mgmt_cosmosdb

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

88

AZURE

azure_mgmt_datafactory

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

89

AZURE

azure_mgmt_datamigration

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

90

AZURE

azure_mgmt_devspaces

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

91

AZURE

azure_mgmt_eventgrid

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

92

AZURE

azure_mgmt_eventhub

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

93

AZURE

azure_mgmt_hanaonazure

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

94

AZURE

azure_mgmt_iotcentral

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

95

AZURE

azure_mgmt_iothubprovisioningservices

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

96

AZURE

azure_mgmt_loganalytics

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

97

AZURE

azure_mgmt_machinelearningcompute

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

98

AZURE

azure_mgmt_managementgroups

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

99

AZURE

azure_mgmt_managementpartner

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

100

AZURE

azure_mgmt_maps

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

101

AZURE

azure_mgmt_marketplaceordering

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

102

AZURE

azure_mgmt_media

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

103

AZURE

azure_mgmt_msi

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

104

AZURE

azure_mgmt_policyinsights

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

105

AZURE

azure_mgmt_recoveryservices

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

106

AZURE

azure_mgmt_recoveryservicesbackup

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

107

AZURE

azure_mgmt_relay

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

108

AZURE

azure_mgmt_reservations

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

109

AZURE

azure_mgmt_search

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

110

AZURE

azure_mgmt_servermanager

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

111

AZURE

azure_mgmt_servicebus

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

112

AZURE

azure_mgmt_servicefabric

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

113

AZURE

azure_mgmt_signalr

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

114

AZURE

azure_mgmt_subscription

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

115

AZURE

azure_storage_blob

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

116

AZURE

azure_storage_common

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

117

AZURE

azure_storage_file

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

118

AZURE

azure_storage_queue

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

119

azureml

pip

MIT

Microsoft Azure Machine Learning Python client library

120

SPHINX

babel

pip

BSD

Internationalization utilities, version 2.0 has bugs

121

JUPYTER

backcall

pip

BSD

Specifications for callback functions passed in to an API (IPython, Jupyter)

122

backports.functools_lru_cache

pip

MIT

backports.functools_lru_cache

123

backports.lzma

wheel

Backport of Python 3.3’s “lzma” module for XZ/LZMA compressed files.

124

JUPYTER

backports.shutil-get-terminal-size

pip

needed for Jupyter

125

JUPYTER

backports_abc

pip

Python Software Foundation License

A backport of recent additions to the “collections.abc” module

126

DATA/ML

bayespy

pip

bayesian modelling and computation

127

DATA/ML

bcolz

wheel

BSD

compressed dataframe, in memory or on disk

128

bcrypt

pip

Apache License, Version 2.0

Modern password hashing for your software and your servers

129

beautifulsoup4

pip

MIT

Beautiful Soup sits atop an HTML or XML parser, providing Pythonic idioms for iterating, searching, and modifying the parse tree.

130

bigfloat

wheel

LGPLv3

big float (issue with Linux and Anaconda)

131

billiard

pip

BSD

Python multiprocessing fork with improvements and bugfixes (for celery)

132

BIO

biopython

wheel

~MIT

The Biopython Project is an international association of developers of freely available Python tools for computational molecular biology.

133

VIZ

bkcharts

pip

High level chart types built on top of Bokeh

134

WEB

bleach

pip

Apache Software License

An easy whitelist-based HTML-sanitizing tool.

135

blist

wheel

BSD

a list-like type with better asymptotic performance and similar performance on small lists

136

blockdiag

pip

Apache License 2.0

blockdiag generates block-diagram image from text

137

blosc

wheel

https://opensource.org/licenses/BSD-3-Clause

Blosc (http://blosc.org) is a high performance compressor optimized for binary data.

138

blz

wheel

BSD

blz: a compressed data container

139

VIZ

bokeh

pip

BSD-3-Clause

interactive graphs, zoomable, javascript

140

boto

pip

MIT

Amazon Web Services Library

141

AWS

boto3

pip

Apache License 2.0

A Python interface to Amazon Web Services

142

AWS

botocore

pip

Apache License 2.0

A low-level interface to a growing number of Amazon Web Services. The botocore package is the foundation for the AWS CLI as well as boto3.

143

DATA/ML

bottleneck

wheel

Simplified BSD

pandas depenencies, faster arrays

144

VIZ

bqplot

pip

Apache

interactive graphs, zoomable, d3.js for notebooks

145

VIZ

branca

pip

Copyright (C) 2013, Martin Journois

146

brewer2mpl

pip

MIT

Connect colorbrewer2.org color maps to Python and matplotlib

147

Brotli

wheel

Brotli is a generic-purpose lossless compression algorithm that compresses data using a combination of a modern variant of the LZ77 algorithm, Huffman coding and 2nd order context modeling, with a compression ratio comparable to the best currently available general-purpose compression methods. It is similar in speed with deflate but offers more dense compression.

148

brythonmagic

pip

MIT

Magics to use brython in Jupyter notebook.

149

ALGO

BTrees

wheel

This package contains a set of persistent object containers built around a modified BTree data structure.

150

bz2file

pip

Apache License, Version 2.0

process bz2 files

151

CacheControl

pip

Apache Software License

httplib2 caching for requests

152

cached_property

pip

BSD

A decorator for caching properties in classes (for luigi)

153

cachetools

pip

MIT

Extensible memoizing collections and decorators

154

cairocffi

wheel2

BSD

cairocffi is a CFFI-based drop-in replacement for Pycairo, a set of Python bindings and object-oriented API for cairo

155

cairosvg

pip

LGPLv3+

Convert your SVG files to PDF and PNG.

156

VIZ

Cartopy

wheel

LGPLv3

Cartopy is a Python package designed to make drawing maps for data analysis and visualisation as easy as possible (issue on Linux and Anaconda)

157

DATA/ML

catboost

pip

Apache License, Version 2.0

CatBoost is a machine learning method based on gradient boosting over decision trees.

158

ML

category_encoders

pip

BSD

A set of scikit-learn-style transformers for encoding categorical variables into numeric by means of different techniques.

159

celery

pip

BSD

Celery is an asynchronous task queue/job queue based on distributed message passing.

160

certifi

pip

MPL-2.0

Certifi is a carefully curated collection of Root Certificates for validating the trustworthiness of SSL certificates while verifying the identity of TLS hosts.

161

C++

cffi

wheel

MIT

Foreign Function Interface for Python calling C code.

162

cftime

wheel

Time-handling functionality from netcdf4-python.

163

DATA/DML

chainer

pip

MIT License

A flexible framework of neural networks (GPU)

164

DATA/DML

chainercv

pip

MIT

ChainerCV is a deep learning based computer vision library built on top of Chainer.

165

chalmers

pip

MIT

Chalmers is an application that allows its users to monitor and control a number of processes on any operating system (Posix and Win32 included)

166

WEB

chardet

pip

LGPL

Universal encoding detector.

167

cheroot

pip

Highly-optimized, pure-python HTTP server

168

CherryPy

pip

BSD

create web application, needed by Spyre

169

chest

pip

BSD

Simple on-disk dictionary

170

citeproc-py

pip

2-clause BSD License

citeproc-py is a CSL processor for Python. It aims to implement the CSL 1.0.1 specification. citeproc-py can output styled citations and bibliographies in a number of different output formats. Currently supported are plain text, reStructuredText and HTML. Other formats can be added easily.

171

click

pip

BSD

A simple wrapper around optparse for powerful command line utilities.

172

click-plugins

pip

New BSD

An extension module for click to enable registering CLI commands via setuptools entry-points.

173

cligj

pip

BSD

Click params for command line interfaces to GeoJSON

174

climate-toolbox

pip

Command-line utilities (turn function into command line)

175

SPHINX

cloud_sptheme

pip

BSD

sphinx theme

176

cloudpickle

pip

BSD 3-Clause License

Extended pickling support for Python objects

177

clyent

pip

Command line client Library for windows and posix

178

DATA/ML

cntk

wheel

Deep Learning from Microsoft, see https://docs.microsoft.com/en-us/cognitive-toolkit/setup-windows-python

179

cobble

pip

BSD

Cobble is a Python library that allows easy creation of data objects, including implementations of common methods such as __eq__ and __repr__.

180

cobra

wheel

Constraint-based reconstruction and analysis in python.

181

TEACH

code_beatrix

pip

MIT

teaching programming to kids, lesenfantscodaient.fr

182

codecov

pip

http://www.apache.org/licenses/LICENSE-2.0

submit coverage report to codecov

183

colander

pip

A simple schema-based serialization and deserialization library

184

SPHINX

colorama

pip

BSD

Cross-platform colored terminal text.

185

colorcet

pip

CC-BY License

colorcet is a collection of perceptually uniform colormaps for use with Python plotting programs like bokeh, matplotlib, holoviews, and datashader.

186

colorlog

pip

MIT License

Log formatting with colors!

187

VIZ

colorlover

pip

MIT

Color scales for IPython notebook

188

VIZ

colormap

pip

LGPL

Utilities to ease manipulation of matplotlib colormaps and color codecs (e.g., hex2rgb)

189

colormath

pip

Color math and conversion library.

190

colorspacious

pip

MIT

A powerful, accurate, and easy-to-use Python library for doing colorspace conversions (for viscm)

191

CommonMark

pip

BSD-3-Clause

Python parser for the CommonMark Markdown spec

192

constantly

pip

Symbolic constants in Python

193

contextlib2

pip

Python

Backports and enhancements for the contextlib module

194

convertdate

pip

for fbprophet

195

coverage

pip

Apache 2.0

measure the coverage of unit tests

196

TEACH

cpyquickhelper

wheel

MIT

Experiments with C++ and Python

197

cryptography

pip

BSD or Apache License, Version 2.0

cryptography is a package which provides cryptographic recipes and primitives to Python developers.

198

TEACH

csharpy

wheel

Python + C# + C

199

cssselect

pip

BSD

cssselect parses CSS3 Selectors and translates them to XPath 1.0

200

cssselect2

pip

BSD

CSS selectors for Python ElementTree

201

cubes

pip

MIT

Cubes is a light-weight Python framework and set of tools for development of reporting and analytical applications, Online Analytical Processing (OLAP), multidimensional analysis and browsing of aggregated data. It is part of Data Brewery.

202

cuda4py

pip

BSD

Python cffi CUDA bindings and helper classes

203

VIZ

cufflinks

pip

MIT

Productivity Tools for Plotly + Pandas

204

DATA/DML

cupy

wheel

CuPy : NumPy-like API accelerated with CUDA

205

custom_inherit

pip

A Python package that provides tools for inheriting docstrings in customizable ways.

206

OPTIM

CVXcanon

wheel

?

A low-level library to perform the matrix building step in cvxpy, a convex optimization modeling software.

207

OPTIM

cvxopt

wheel

GNU GPL version 3

linear, quadratic optimization

208

OPTIM

cvxpy

pip

GNU

linear, quadratic optimization, depends on cvxopt

209

VIZ

cycler

pip

BSD

dependency for matplotlib

210

cymem

pip

MIT

Manage calls to calloc/free through Cython

211

cyordereddict

wheel

MIT

Cython implementation of Python’s collections.OrderedDict

212

Cython

wheel

Apache

pseudo C++ in python

213

DATA/ML

cytoolz

wheel

BSD

Cython implementation of Toolz: High performance functional utilities

214

DATA/ML

dask

pip

BSD

parallelization of operations on dataframes

215

DataProperty

pip

Python library for extracting properties from data.

216

VIZ

datashader

pip

New BSD

Datashader is a graphics pipeline system for creating meaningful representations of large amounts of data.

217

datashape

pip

BSD

A data description language.

218

VIZ

dataspyre

pip

MIT

create simple web application to visualize data

219

ALGO

datrie

wheel

LGPL v2.1

Fast, efficiently stored Trie for Python.

220

db.py

pip

BSD

db.py is an easier way to interact with your databases. It makes it easier to explore tables, columns, views, etc. It puts the emphasis on user interaction, information display, and providing easy to use helper functions.

221

dbf

pip

access DBase format

222

dbfread

pip

MIT

access DBase format

223

deap

pip

LGPL

evolutionary computation framework (Distributed Evolutionary Algorithms in Python)

224

decorator

pip

new BSD License

Better living through Python with decorators

225

defusedxml

pip

PSFL

XML bomb protection for Python stdlib modules

226

depinfo

pip

A utility Python package intended for other library packages. Provides a function that when called with your package name, will print platform and dependency information.

227

deprecation

pip

A library to handle automated deprecations

228

descartes

pip

BSD

Use Shapely or GeoJSON-like geometric objects as matplotlib paths and patches

229

dialite

pip

for flexx

230

dill

pip

3-clause BSD

serialize all of python (almost), Dill extends python’s pickle module for serializing and de-serializing python objects to the majority of the built-in python types.

231

discretize

wheel

A python package for finite volume discretization.

232

django

pip

Django

233

django-audiotracks

pip

MIT

read audio with django

234

django-celery

pip

BSD

localshop dependency, Old django celery integration project.

235

django-configurations

pip

BSD

localshop dependency, A helper for organizing Django settings.

236

WEB

django-contrib-comments

pip

BSD

Django used to include a comments framework; since Django 1.6 it’s been separated to a separate project. This is that project.

237

django-environ

pip

MIT

localshop dependency, Django-environ allows you to utilize 12factor inspired environment variables to configure your Django application.

238

django-guardian

pip

BSD

localshop dependency, Implementation of per object permissions for Django 1.2 or later.

239

django-model-utils

pip

BSD

localshop dependency, Django model mixins and utilities.

240

django-storages

pip

BSD

localshop dependency, django-storages is a collection of custom storage backends for Django.

241

django-userena

pip

BSD

localshop dependency, Accounts for Django made beautifully simple

242

django-uuidfield

pip

~BSD

localshop dependency, UUIDField in Django

243

ML

dlib

pip

boost

A toolkit for making real world machine learning and data analysis applications

244

WEB

dnspython

pip

~MIT

dnspython is a DNS toolkit for Python. It supports almost all record types. It can be used for queries, zone transfers, and dynamic updates. It supports TSIG authenticated messages and EDNS0.

245

STYLE

docformatter

pip

Expat License

Formats docstrings to follow PEP 257.

246

docopt

pip

Pythonic argument parser, that will make you smile

247

SPHINX

docrepr

pip

Modified BSD

docrepr renders Python docstrings in HTML

248

SPHINX

docutils

pip

public domain, Python, 2-Clause BSD, GPL 3 (see COPYING.txt)

interpret RST format

249

docx2txt

pip

MIT

A pure python-based utility to extract text and images from docx files.

250

dominate

pip

LGPLv3

Dominate is a Python library for creating and manipulating HTML documents using an elegant DOM API.

251

VIZ

drawtree

pip

Apache License 2.0

Draw binary tree in plain text

252

duecredit

pip

2-clause BSD License

Publications (and donations) tracer

253

dukpy

pip

MIT

DukPy is a simple javascript interpreter for Python built on top of duktape engine without any external dependency. It comes with a bunch of common transpilers built-in for convenience.

254

dynd

wheel

BSD

DyND-Python, a component of the Blaze project, is the Python exposure of the DyND dynamic multi-dimensional array library.

255

easy-thumbnails

pip

BSD

Easy thumbnails for Django

256

easydev

pip

new BSD

Common utilities to ease the development of Python packages

257

ecdsa

pip

ECDSA cryptographic signature library (pure python)

258

OPTIM

ecos

wheel

ECOS is a numerical software for solving convex second-order cone programs (SOCPs)

259

DATA/ML

edward

pip

MIT

Edward is a Python library for probabilistic modeling, inference, and criticism

260

elasticsearch

pip

Apache License 2.0

Python client for Elasticsearch

261

emcee

pip

Kick ass affine-invariant ensemble MCMC sampling

262

empyrical

pip

empyrical is a Python library with performance and risk statistics commonly used in quantitative finance

263

TEACH

ensae_projects

pip

MIT

single use code

264

TEACH

ensae_teaching_cs

pip

MIT

teachings, introduction to programming, machine learning, map/reduce

265

JUPYTER

entrypoints

pip

MIT

Discover and load entry points from installed packages.

266

envoy

pip

~MIT

Simple API for running external processes.

267

ephem

pip

for fbprophet

268

et_xmlfile

pip

MIT

et_xmlfile is a low memory library for creating large XML files (for openpyxl).

269

VIZ

ete3

pip

GPLv3

tree visualisation, ete3 does not work with PyQt4 (2017-11).

270

expressions

pip

Arithmetic expression parser library. Embed customized expression evaluation into your application or library.

271

fabric

pip

BSD

Fabric is a Python library and command-line tool for streamlining the use of SSH for application deployment or systems administration tasks.

272

Fabric3

pip

BSD

Fabric is a simple, Pythonic tool for remote execution and deployment (py2.7/py3.4+ compatible fork)

273

ML

fairtest

wheel

FairTest enables developers or auditing entities to discover and test for unwarranted associations between an algorithm’s outputs and certain user subpopulations identified by protected features.

274

NETWORK

falcon

pip

Apache 2.0

Falcon is a very fast, very minimal Python web framework for building microservices, app backends, and higher-level frameworks.

275

NETWORK

falcon-auth

pip

A falcon middleware + authentication backends that adds an authentication layer to your app/API service.

276

fast-histogram

wheel2

BSD

Fast 1D and 2D histogram functions in Python.

277

fastcache

wheel

C implementation of Python 3 lru_cache for Python.

278

fasteners

pip

ASL 2.0

A python package that provides useful locks.

279

fastparquet

wheel

Apache License 2.0

fastparquet is a python implementation of the parquet format, aiming to integrate into python-based big data workflows.

280

fastrlock

wheel

Fast, re-entrant optimistic lock implemented in Cython (cupy)

281

DATA/ML

fasttext

wheel

BSD 3-Clause License

fastText is a library for efficient learning of word representations and sentence classification.

282

fbpca

pip

Functions for principal component analysis (PCA) and accuracy checks.

283

DATA/ML

fbprophet

pip

BSD

Prophet is a procedure for forecasting time series data. It is based on an additive model where non-linear trends are fit with yearly and weekly seasonality, plus holidays. It works best with daily periodicity data with at least one year of historical data. Prophet is robust to missing data, shifts in the trend, and large outliers.

284

ML

fcn

pip

MIT

Fully Convolutional Networks

285

feedparser

pip

~BSD - 2 clauses

parse RSS streams

286

WEB

filebrowser_safe

pip

~MIT

A snapshot of the filebrowser_3 branch of django-filebrowser, packaged as a dependency for the Mezzanine CMS for Django.

287

filelock

pip

Public Domain <http://unlicense.org>

A platform independent file lock.

288

GEO

Fiona

wheel

BSD

Fiona is OGR’s neat, nimble, no-nonsense API for Python programmers.

289

STYLE

flake8

pip

MIT

verify pep8 on a script

290

NETWORK

flask

pip

BSD

Flask is a microframework for Python based on Werkzeug, Jinja 2 and good intentions. And before you ask: It’s BSD licensed!

291

Flask-Cors

pip

A Flask extension for handling Cross Origin Resource Sharing (CORS), making cross-origin AJAX possible.

292

NETWORK

Flask-Login

pip

MIT

293

NETWORK

flask-sqlalchemy

pip

294

GUI

flexx

pip

Flexx is a pure Python toolkit for creating graphical user interfaces (GUIs) that uses web technology for its rendering. Apps are written purely in Python; Flexx’s transpiler generates the necessary JavaScript on the fly.

295

VIZ

folium

pip

MIT

Python library to build interactive Leaflet.js maps.

296

ML

foolbox

pip

Foolbox is a Python toolbox to create adversarial examples that fool neural networks.

297

ML

forestci

pip

MIT

Python module for calculating variance and adding confidence intervals to scikit-learn random forest regression or classification objects. The core functions calculate an in-bag and error bars for random forest objects

298

SPHINX

foundation-sphinx-theme

pip

MIT

sphinx theme

299

NLP

fr_core_news_sm

wheel2

French resources for spaCy

300

ftfy

pip

MIT

Fixes some problems with Unicode text after the fact

301

funcparserlib

pip

MIT

Recursive descent parsing library based on functional combinators

302

funcsigs

pip

Python function signatures from PEP362

303

future

pip

MIT

Clean single-source support for Python 3 and 2

304

fuzzywuzzy

pip

Fuzzy string matching in python

305

gast

pip

BSD 3-Clause

A generic AST to represent Python2 and Python3’s Abstract Syntax Tree (AST).

306

gatspy

pip

BSD

General tools for Astronomical Time Series in Python

307

GDAL

wheel

MIT

GDAL is a translator library for raster and vector geospatial data formats that is released under an X/MIT style Open Source license by the Open Source Geospatial Foundation.

308

ML

gdbn

pip

This package contains python code for pre-trained deep neural networks

309

gdown

pip

MIT

Google Drive direct download of big files.

310

gensim

wheel

LGPLv2.1

topic modelling, document indexing and similarity retrieval

311

geographiclib

pip

MIT

This implements algorithms for geodesics (Karney, 2013) for solving the direct and inverse problems for an ellipsoid of revolution.

312

geojson

pip

Functions for encoding and decoding GeoJSON formatted data

313

geopandas

pip

BSD

GeoPandas is an open source project to make working with geospatial data in python easier.

314

VIZ

geoplotlib

pip

MIT

geoplotlib is a python toolbox for visualizing geographical data and making maps

315

VIZ

geopy

pip

MIT

Python Geocoding Toolbox

316

VIZ

geoviews

pip

Composable, declarative data structures for building complex visualizations easily.

317

gevent

pip

MIT

gevent is a coroutine-based Python networking library

318

ggplot

pip

BSD

ggplot graphics style

319

JUPYTER

ghost.py

pip

ghost.py is a webkit web client written in python

320

git-pandas

pip

BSD

321

gitdb2

pip

BSD License

GitDB is a pure-Python git object database

322

GitPython

pip

BSD

Python Git Library

323

gizeh

pip

see LICENSE.txt

Simple Vector Graphics for Python

324

glue-core

pip

BSD

Multidimensional data visualization across files

325

glue-vispy-viewers

pip

As Is

Vispy-based viewers for Glue

326

DATA/ML

glueviz

pip

BSD

plotting, multidimensional data visualization across files

327

gmpy2

wheel

LGPLv3+

big real numbers (issue on Linux and Anaconda)

328

gnumpy

pip

Gnumpy: an easy way to use GPU boards in Python

329

google-api-python-client

pip

Apache 2.0

The Google API Client for Python is a client library for accessing the Plus, Moderator, and many other Google APIs.

330

google-auth

pip

Apache 2.0

This library simplifies using Google’s various server-to-server authentication mechanisms to access Google APIs.

331

google-auth-httplib2

pip

Apache 2.0

This library provides an httplib2 transport for google-auth.

332

googlemaps

pip

Apache

Python client library for Google Maps API Web Services

333

ML

gplearn

pip

gplearn implements Genetic Programming in Python, with a scikit-learn inspired and compatible API.

334

PROFILING

gprof2dot

pip

LGPL

This is a Python script to convert the output from many profilers into a dot graph.

335

grab

pip

MIT

Grab is a python web scraping framework. Grab provides tons of helpful methods to scrape web sites

336

grako

wheel

Grako (for grammar compiler) is a tool that takes grammars in a variation of EBNF as input, and outputs memoizing (Packrat) PEG parsers in Python.

337

VIZ

graphviz

pip

MIT

wrapper for graphviz (most famous tool to draw graphs)

338

WEB

grappelli_safe

pip

~MIT

A snapshot of the grappelli_2 branch of django-grappelli, packaged as a dependency for the Mezzanine CMS for Django.

339

greenlet

wheel

MIT

Greenlet allows lightweight in-process concurrent programming.

340

grequests

pip

BSD

GRequests allows you to use Requests with Gevent to make asynchronous HTTP Requests easily.

341

CLI

grin

pip

BSD

A grep program configured the way I like it.

342

grpcio

wheel

Apache License 2.0

Package for gRPC Python.

343

guidata

pip

CeCILL v2

Automatic graphical user interfaces generation for easy dataset editing and display

344

guiqwt

wheel

CECILL

Efficient 2D plotting Python library based on PythonQwt (Spyder)

345

gunicorn

pip

MIT

localshop dependency, WSGI HTTP Server for UNIX

346

SPHINX

guzzle_sphinx_theme

pip

BSD~MIT

sphinx theme

347

gvar

wheel

GPLv3+

Utilities for manipulating correlated Gaussian random variables.

348

h11

pip

MIT

This is a little HTTP/1.1 library written from scratch in Python, heavily inspired by hyper-h2.

349

h2

pip

MIT License

HTTP/2 State-Machine based protocol implementation

350

ML

h2o

pip

H2O, Fast Scalable Machine Learning, for python

351

h5netcdf

pip

netCDF4 via h5py

352

DATA/ML

h5py

wheel

BSD

The h5py package is a Pythonic interface to the HDF5 binary data format. Trillion-Particle Simulation.

353

SPHINX

hachibee-sphinx-theme

pip

MIT

sphinx theme

354

ML

HDDM

wheel

BSD

HDDM is a python module that implements Hierarchical Bayesian estimation of Drift Diffusion Models.

355

heapdict

pip

BSD

a heap with decrease-key and increase-key operations

356

VIZ

heatmap

wheel

MIT

draw heatmap

357

ML

hmmlearn

wheel

MIT + Copyright

Hidden Markov Models in Python, with scikit-learn like API

358

holidays

pip

for fbprophet

359

HoloPy

wheel

Hologram processing and light scattering in python

360

VIZ

holoviews

pip

BSD

Composable, declarative data structures for building complex visualizations easily.

361

hpack

pip

MIT License

Pure-Python HPACK header compression

362

html2text

pip

GNU GPL 3

Turn HTML into equivalent Markdown-structured text.

363

html5lib

pip

MIT License

pure-python library for parsing HTML

364

httpie

pip

BSD

HTTPie - a CLI, cURL-like tool for humans

365

httplib2

pip

MIT

A comprehensive HTTP client library.

366

httpretty

pip

MIT

HTTP client mock for Python

367

httptools

pip

MIT

httptools is a Python binding for nodejs HTTP parser. It’s still in a very early development stage, expect APIs to break.

368

hyper

pip

MIT

HTTP/2 Client for Python

369

hypercorn

pip

MIT

Hypercorn is an ASGI web server based on the sans-io hyper, h11, h2, and wsproto libraries and inspired by Gunicorn.

370

hyperframe

pip

MIT License

HTTP/2 framing layer for Python

371

hyperlink

pip

MIT

A featureful, correct URL for Python.

372

ML

hyperopt

github

Hyperopt is a Python library for serial and parallel optimization over awkward search spaces, which may include real-valued, discrete, and conditional dimensions.

373

hyperspy

wheel

GNU General Public License v3 (GPLv3)

HyperSpy is an open source Python library which provides tools to facilitate the interactive data analysis of multi-dimensional datasets that can be described as multi-dimensional arrays of a given signal (e.g. a 2D array of spectra a.k.a spectrum image).

374

hypothesis

pip

Hypothesis is an advanced testing library for Python. It lets you write tests which are parametrized by a source of examples, and then generates simple and comprehensible examples that make your tests fail. This lets you find more bugs in your code with less work.

375

WEB

idna

pip

BSD-like

Internationalized Domain Names in Applications (IDNA)

376

ijson

pip

BSD

Iterative JSON parser with a standard Python iterator interface

377

VIDEO

imageio

pip

(new) BSD

Library for reading and writing a wide range of image, video, scientific, and volumetric data formats (for moviepy)

378

VIDEO

imageio-ffmpeg

pip

(new) BSD

ffmpeg

379

images2gif

pip

Unlicensed

Create a GIF from a list of images.

380

SPHINX

imagesize

pip

MIT

Getting image size from png/jpeg/jpeg2000/gif file

381

ML

imbalanced-learn

pip

MIT

imbalanced-learn is a python package offering a number of re-sampling techniques commonly used in datasets showing strong between-class imbalance. It is compatible with scikit-learn and is part of scikit-learn-contrib projects.

382

imbox

pip

MIT

Python library for reading IMAP mailboxes and converting email content to machine readable data

383

JUPYTER

importlib-metadata

pip

Apache Software License

library to access the metadata for a Python package. It is intended to be ported to Python 3.8.

384

incremental

pip

Incremental is a small library that versions your Python projects.

385

indexed_gzip

wheel

Fast random access of gzip files.

386

inflection

pip

MIT

A port of Ruby on Rails inflector to Python

387

intervaltree

pip

Editable interval tree data structure for Python 2 and 3

388

invoke

pip

BSD

Invoke is a Python task execution tool & library, drawing inspiration from various sources to arrive at a powerful & clean feature set.

389

ipaddress

pip

Python Software Foundation License

IPv4/IPv6 manipulation library

390

JUPYTER

ipydatawidgets

pip

BSD

A set of widgets to help facilitate reuse of large datasets across widgets

391

JUPYTER

ipykernel

pip

BSD

IPython, Jupyter, kernels

392

JUPYTER

ipympl

pip

Matplotlib Jupyter Extension

393

JUPYTER

ipyparallel

pip

BSD

Interactive Parallel Computing with IPython

394

JUPYTER

ipyscales

pip

A widget library for scales

395

JUPYTER

ipython

pip

BSD

IPython, Jupyter

396

JUPYTER

ipython_genutils

pip

BSD

IPython utils (nbformat)

397

JUPYTER

ipywidgets

pip

BSD

IPython, Jupyter, widgets

398

iso8601

pip

Simple module to parse ISO 8601 dates

399

isodate

pip

BSD

An ISO 8601 date/time/duration parser and formatter

400

isort

pip

MIT

A Python utility / library to sort Python imports.

401

SPHINX

itcase_sphinx_theme

pip

no license

sphinx theme

402

itsdangerous

pip

BSD

Various helpers to pass trusted data to untrusted environments and back.

403

jaraco.functools

pip

Additional functools in the spirit of stdlib’s functools.

404

javascripthon

pip

GPLv3+

a Python 3 to ES6 JavaScript translator

405

jdcal

pip

BSD

Julian dates from proleptic Gregorian and Julian calendars.

406

jedi

pip

MIT

An autocompletion tool for Python that can be used for text editors.

407

jeepney

pip

This is a low-level, pure Python DBus protocol client. It has an I/O-free core, and integration modules for different event loops.

408

jieba

pip

MIT

Chinese Words Segmentation Utilities

409

jinja2

pip

BSD-3-Clause

templating

410

jmespath

pip

MIT

JSON Matching Expressions

411

joblib

pip

BSD

distribute jobs, parallelization

412

JPype1

wheel

Apache Software License

A Python to Java bridge.

413

Js2Py

pip

MIT

JavaScript to Python Translator & JavaScript interpreter written in 100% pure Python.

414

jsonpickle

pip

BSD

Python library for serializing any arbitrary object graph into JSON.

415

jsonschema

pip

An implementation of JSON Schema validation for Python

416

DATA/ML

julia

pip

MIT

Julia/Python bridge with IPython support

417

TEACH

jupytalk

pip

MIT

materials for presentations

418

JUPYTER

jupyter

pip

BSD

Jupyter

419

JUPYTER

jupyter-console

pip

BSD

Jupyter console, new in Jupyter 4.0

420

jupyter-echarts-pypkg

pip

MIT

The project packages jupyter-echarts and distributes it via pypi.

421

JUPYTER

jupyter-pip

pip

Allows Jupyter notebook extension writers to make their extension pip installable!

422

JUPYTER

jupyter_client

pip

BSD

Jupyter client

423

JUPYTER

jupyter_contrib_nbextensions

github

This repository contains a collection of extensions that add functionality to the Jupyter notebook.

424

JUPYTER

jupyter_core

pip

BSD

Jupyter Core

425

JUPYTER

jupyter_sphinx

pip

BSD

Jupyter Sphinx Extensions

426

JUPYTER

jupyterhub

pip

JupyterHub: A multi-user server for Jupyter notebooks

427

JUPYTER

jupyterlab

pip

BSD

Jupyter Lab

428

JUPYTER

jupyterlab-launcher

pip

Jupyter Lab Launcher

429

JUPYTER

jupyterlab-server

pip

BSD

Jupyter Lab Server

430

JUPYTER

jupyterlab_pygments

pip

Jupyterlab

431

TEACH

jyquickhelper

pip

MIT

Helpers for Jupyter notebooks.

432

ML

kabuki

pip

As Is

kabuki is a python toolbox that allows easy creation of hierarchical bayesian models for the cognitive sciences.

433

DATA/ML

keras

pip

MIT

deep learning

434

DATA/ML

Keras-Applications

pip

MIT

Keras Applications is the applications module of the Keras deep learning library. It provides model definitions and pre-trained weights for a number of popular architectures, such as VGG16, ResNet50, Xception, MobileNet, and more.

435

DATA/ML

Keras-Preprocessing

pip

MIT

Keras Preprocessing is the data preprocessing and data augmentation module of the Keras deep learning library. It provides utilities for working with image data, text data, and sequence data.

436

DATA/ML

keras-vis

pip

keras-vis is a high-level toolkit for visualizing and debugging your trained keras neural net models.

437

keyring

pip

MIT, Python

Store and access your passwords safely.

438

GUI

Kivy

wheel

MIT

GUI, interface for games, mobile (use sudo apt-get install python3-kivy on Linux)

439

GUI

kivy-garden

pip

MIT

Garden tool for kivy flowers.

440

kiwisolver

wheel

BSD

Kiwi is an efficient C++ implementation of the Cassowary constraint solving algorithm.

441

ML

knnimpute

pip

k-Nearest Neighbor imputation

442

kombu

pip

BSD

Messaging library for Python (for celery)

443

lazy-object-proxy

pip

BSD-2-Clause

A fast and thorough lazy object proxy

444

WEB

ldap3

pip

LGPLv3

ldap3 is a strictly RFC 4510 conforming LDAP V3 pure Python client library.

445

VIZ

leather

pip

Leather is the Python charting library for those who need charts now and don’t care if they’re perfect.

446

libLAS

wheel

BSD

libLAS is a C/C++ library for reading and writing the very common LAS LiDAR format.

447

liblinear

wheel

A Library for Large Linear Classification

448

libsvm

wheel

SVM library

449

libtiff

wheel

PyLibTiff is a package that provides a wrapper to the libtiff library for Python using ctypes, and a pure Python module for reading and writing TIFF and LSM files. The images are read as numpy.memmap objects so that it is possible to open images that otherwise would not fit in the computer’s RAM. Both TIFF strips and tiles are supported for low-level data storage.

450

OPTIM

lifelines

pip

MIT

survival analysis

451

DATA/ML

lightgbm

pip

The MIT License (Microsoft)

Gradient boosting framework based on decision trees (only available on Python 3 and x64)

452

TEACH

lightmlboard

pip

lightmlboard implements a light machine learning leaderboard based on tornado.

453

TEACH

lightmlrestapi

pip

lightmlrestapi implements a light machine learning REST API based on falcon.

454

VIZ

lightning-python

pip

MIT

Python client for the lightning API

455

linecache2

pip

A backport of linecache to older supported Pythons.

456

llvmlite

wheel

BSD

lightweight wrapper around basic LLVM functionality, check issue https://github.com/cmderdev/cmder/issues/490 for missing api-ms-win-crt-runtime-l1-1-0.dll

457

OPTIM

lmfit

pip

BSD

Least-Squares Minimization with Bounds and Constraints

458

lml

pip

New BSD

lml seamlessly finds the lml based plugins from your current python environment but loads your plugins on demand. It is designed to support plugins that have external dependencies, especially bulky and/or memory hungry ones. lml provides the plugin management system only; defining the plugin interface is up to you.

459

locket

pip

BSD

File-based locks for Python for Linux and Windows

460

lockfile

pip

MIT

Platform-independent file locking module

461

Logbook

wheel

A logging replacement for Python

462

logutils

pip

Logging utilities

463

lru_dict

pip

A dict-like LRU container.

464

OPTIM

lsqfit

wheel

GPLv3+

Utilities for nonlinear least-squares fits

465

WORKFLOW

luigi

pip

Apache 2.0

workflows, data workflows

466

lunardate

pip

for fbprophet

467

lxml

wheel

BSD

xml parsers (C++)

468

lz4

wheel

BSD

LZ4 Bindings for Python (for dpark)

469

m2r

pip

M2R converts a markdown file including reStructuredText (rst) markups to a valid rst format.

470

macropy3

pip

MIT

MacroPy is an implementation of Syntactic Macros in the Python Programming Language. MacroPy provides a mechanism for user-defined functions (macros) to perform transformations on the abstract syntax tree (AST) of a Python program at import time. This is an easy way to enhance the semantics of a Python program in ways which are otherwise impossible, for example providing an extremely concise way of declaring classes.

471

VIZ

mahotas

wheel

Mahotas: Computer Vision Library

472

Mako

pip

MIT

templating

473

mammoth

pip

BSD

Convert Word documents from docx to simple and clean HTML and Markdown

474

TEACH

manydataapi

pip

MIT

Helpers to access APIs

475

marisa_trie

wheel

MIT

Static memory-efficient & fast Trie-like structures for Python (based on marisa-trie C++ library)

476

markdown

pip

BSD License

markdown parser (for bokeh)

477

markdown2

pip

MIT

markdown parser

478

markupsafe

pip

BSD-3-Clause

escapes characters so that text is safe to use in HTML and XML

479

VIZ

matplotlib

wheel

PSF

most used plotting library

480

mbstrdecoder

pip

MIT

multi-byte character string decoder

481

STYLE

mccabe

pip

Expat license

This module provides a plugin for flake8, the Python code checker.

482

SPHINX

mdn-sphinx-theme

pip

Mozilla Public License 2.0 (MPL 2.0)

A Sphinx theme for the Mozilla Developer Network

483

ML

MDP

pip

Python data processing framework.

484

PROFILING

memory-profiler

pip

BSD

A module for monitoring memory usage of a python program

485

JUPYTER

metakernel

pip

BSD

more magic commands for Jupyter

486

WEB

mezzanine

pip

BSD

Mezzanine is a powerful, consistent, and flexible content management platform.

487

JUPYTER/LINUX

micropython-fcntl

pip

dependency for ptyprocess, Functions to compute fcntl.ioctl’s opt argument

488

JUPYTER/LINUX

micropython-ffilib

pip

dependency for ptyprocess, MicroPython FFI helper module

489

JUPYTER/LINUX

micropython-libc

pip

dependency for ptyprocess, MicroPython FFI helper module

490

DATA/ML

milk

wheel

machine learning toolkit

491

minepy

wheel

GPLv3

Maximal Information-based Nonparametric Exploration (MINE) statistics

492

VIZ

missingno

pip

As Is

Messy datasets? Missing values? missingno provides a small toolset of flexible and easy-to-use missing data visualizations and utilities that allows you to get a quick visual summary of the completeness (or lack thereof) of your dataset.

493

mistune

pip

BSD

The fastest markdown parser in pure Python with renderer features, inspired by marked.

494

mizani

pip

BSD (3-clause)

Mizani is a scales package for graphics. It is written in Python and is based on Hadley Wickham’s Scales.

495

DATA/ML

mkl_service

wheel

This package exposes a few functions which are declared in mkl_service.h. The main purpose of the package is to allow the user to change the number of CPU’s MKL is using at runtime.

496

TEACH

mlinsights

wheel

MIT

mlinsights implements functions to get insights on machine learned models.

497

TEACH

mlprodict

wheel

MIT

mlprodict implements a couple of ways to productionize machine learning predictions.

498

mlpy

wheel

mlpy is a Python module for Machine Learning built on top of NumPy/SciPy, has wavelets

499

TEACH

mlstatpy

pip

MIT

materials for machine learning

500

ML

mlxtend

pip

A library consisting of useful tools and extensions for the day-to-day data science tasks.

501

mnist

pip

BSD

Python utilities to download and parse the MNIST dataset

502

mock

pip

OSI Approved :: BSD License

mock is a library for testing in Python. It allows you to replace parts of your system under test with mock objects and make assertions about how they have been used.
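
Since Python 3.3 the same API is available in the standard library as `unittest.mock`, so the idea can be tried without installing anything. The sketch below is illustrative only: `fetch_title` and the shape of the fake client are hypothetical.

```python
from unittest import mock


def fetch_title(client, url):
    """Hypothetical function under test: asks a client for a page, returns its title."""
    page = client.get(url)
    return page["title"].upper()


# Replace the real client with a mock object and script its answer.
fake_client = mock.Mock()
fake_client.get.return_value = {"title": "hello"}

result = fetch_title(fake_client, "http://example.com")

# Assert on how the mock was used by the code under test.
fake_client.get.assert_called_once_with("http://example.com")
```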

503

mockextras

pip

Extensions to the mock library

504

monotonic

pip

Apache

This module provides a monotonic() function which returns the value (in fractional seconds) of a clock which never goes backwards. It is compatible with Python 2 and Python 3.
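
On Python 3.3 and later the standard library offers the same kind of clock as `time.monotonic()`. A minimal sketch of timing a piece of code with it:

```python
import time

start = time.monotonic()
time.sleep(0.01)                      # stand-in for the work being timed
elapsed = time.monotonic() - start    # never negative, even if the system clock is adjusted
```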

505

monty

pip

MIT

Monty is the missing complement to Python.

506

more-itertools

pip

MIT

More routines for operating on iterables, beyond itertools
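
As an illustration of the kind of routine meant here, the sketch below splits an iterable into fixed-size batches using only `itertools`. The function is written from scratch for this example and is only a stand-in for what such a helper does; it is not the package's own code.

```python
from itertools import islice


def chunked(iterable, size):
    """Yield successive lists of at most `size` items from `iterable`."""
    iterator = iter(iterable)
    while True:
        chunk = list(islice(iterator, size))
        if not chunk:
            return
        yield chunk


batches = list(chunked(range(7), 3))
```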

507

VIDEO

moviepy

pip

MIT License

Video editing with Python

508

mpi4py

wheel

MPI for Python

509

mpl-scatter-density

pip

BSD

Matplotlib helpers to make density scatter plots

510

VIZ

mpl_finance

github

This module consists of code extracted from the deprecated matplotlib.finance module along with a few examples of usage.

511

mpmath

pip

BSD

mpmath is a free (BSD licensed) Python library for real and complex floating-point arithmetic with arbitrary precision.

512

msgfy

pip

msgfy is a Python library for converting an Exception instance to a human-readable error message.

513

msgpack

wheel

Apache 2.0

MessagePack (de)serializer.

514

msgpack-numpy

pip

BSD

Numpy data serialization using msgpack

515

msrest

pip

MIT License

AutoRest swagger generator Python client runtime.

516

msrestazure

pip

MIT License

AutoRest swagger generator Python client runtime. Azure-specific module.

517

mss

pip

An ultra fast cross-platform multiple screenshots module in pure python using ctypes

518

multi-key-dict

pip

License :: OSI Approved :: MIT License (http://opensource.org/licenses/MIT)

Multi key dictionary implementation

519

multimethods

pip

MIT

A multimethod implementation, loosely based on Guido’s initial ‘Five-minute Multimethods in Python’.

520

multipledispatch

pip

BSD

A relatively sane approach to multiple dispatch in Python.

521

multiprocess

wheel

better multiprocessing and multithreading in python

522

munch

pip

MIT

A dot-accessible dictionary (a la JavaScript objects).
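
A dot-accessible dictionary can be sketched in a few lines by subclassing `dict`. The class `DotDict` below is a hypothetical, simplified illustration of the idea, not munch's implementation (it does not convert nested dictionaries, for instance).

```python
class DotDict(dict):
    """Hypothetical minimal dictionary whose keys are also readable as attributes."""

    def __getattr__(self, name):
        # Called only when normal attribute lookup fails.
        try:
            return self[name]
        except KeyError:
            raise AttributeError(name) from None

    def __setattr__(self, name, value):
        self[name] = value


point = DotDict(x=1)
point.y = 2    # stored as point["y"]
```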

523

murmurhash

pip

MIT

Cython .pxd files for some of the MurmurHash 2 and 3 hash functions, with a slightly more Pythonic API. The only access to these functions is via Cython — I don’t see why they should be useful from pure Python.

524

mutagen

pip

GPL-2.0-or-later

read and write audio tags for many formats in Python 3

525

mutagenx

pip

read and write audio tags for many formats in Python 3

526

mypy

pip

Mypy is an experimental optional static type checker for Python that aims to combine the benefits of dynamic (or “duck”) typing and static typing.

527

mypy_extensions

pip

for mypy

528

SQL

mysqlclient

wheel

Python interface to MySQL (a fork of MySQLdb), a C extension that wraps the MySQL C client library and implements the DB API v2.0 specification (PEP-249).

529

natgrid

wheel

Python interface to NCAR natgrid library (for matplotlib)

530

natsort

pip

MIT

Sort lists naturally
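
"Natural" order means that `file2` comes before `file10`, unlike plain string order. The sketch below shows one common way to get that behaviour with a sort key; `natural_key` is a hypothetical helper written for this example, not natsort's API.

```python
import re


def natural_key(text):
    """Split text into alternating non-digit and digit chunks; digits compare as integers."""
    parts = re.split(r"(\d+)", text)
    # With a capturing group, odd positions always hold the digit runs.
    return [int(part) if index % 2 else part.lower()
            for index, part in enumerate(parts)]


names = ["file10.txt", "file2.txt", "file1.txt"]
ordered = sorted(names, key=natural_key)
```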

531

JUPYTER

nbbrowserpdf

pip

LaTeX-free PDF generation from Jupyter Notebooks

532

JUPYTER

nbconvert

pip

BSD

IPython, notebooks conversion, new in Jupyter 4.0

533

nbdime

pip

BSD

Comparison of notebooks

534

JUPYTER

nbformat

pip

BSD

IPython, notebook file format, new in Jupyter 4.0

535

JUPYTER

nbpresent

pip

BSD-3-Clause

Next generation slides from Jupyter Notebooks

536

JUPYTER

nbsphinx

pip

MIT

nbsphinx is a Sphinx extension that provides a source parser for *.ipynb files.

537

JUPYTER

nbupload

pip

widget to upload a file in a notebook

538

ndg-httpsclient

pip

BSD - See ndg/httpsclient/LICENCE file for details

Provides enhanced HTTPS support for httplib and urllib2 using PyOpenSSL

539

netaddr

pip

Pythonic manipulation of IPv4, IPv6, CIDR, EUI and MAC network addresses

540

netCDF4

wheel

xarray uses this module to save and read data (netCDF=Unidata network Common Data Form)

541

VIZ

networkx

pip

BSD

graph libraries, basic drawing

542

DATA/ML

neural-python

pip

MIT

NeuralPy is the Artificial Neural Network library implemented in Python.

543

nibabel

pip

Access a multitude of neuroimaging data formats.

544

DATA/ML

nitime

wheel

Nitime is a library for time-series analysis of data from neuroscience experiments.

545

DATA/ML

NLopt

wheel

LGPL

nonlinear optimization (local and global, with or without constraints)

546

NLP

nltk

pip

Apache License, Version 2.0

NLP, natural language processing

547

nodeenv

pip

BSD

Node.js virtual environment builder

548

nose

pip

GNU LGPL

run unit tests

549

JUPYTER

notebook

pip

BSD

Jupyter notebooks, new in Jupyter 4.0

550

JUPYTER

notedown

pip

Convert markdown to IPython notebook.

551

C++

nuitka

pip

C++ compilation, code optimization

552

C++

numba

wheel

BSD

Numba is an Open Source NumPy-aware optimizing compiler for Python sponsored by Continuum Analytics, Inc.

553

numcodecs

wheel

MIT

Numcodecs is a Python package providing buffer compression and transformation codecs for use in data storage and communication applications.

554

OPTIM

numdifftools

pip

Solves automatic numerical differentiation problems in one or more variables.

555

numexpr

wheel

MIT

Fast numerical array expression evaluator for Python, NumPy, PyTables, pandas, bcolz and more.

556

DATA/ML

numpy

wheel

BSD

matrix computation

557

numpydoc

pip

Sphinx extension to support docstrings in Numpy format

558

oauth2client

pip

The oauth2client is a client library for OAuth 2.0.

559

oauthlib

pip

BSD

A generic, spec-compliant, thorough implementation of the OAuth request-signing logic

560

DATA/ML

oct2py

pip

Python to GNU Octave bridge: run m-files from Python.

561

octave_kernel

pip

BSD

A Jupyter kernel for Octave.

562

odfpy

pip

GNU General Public License v.2 + Apache License v.2

Python API and tools to manipulate OpenDocument files

563

olefile

pip

Python package to parse, read and write Microsoft OLE2 files (Structured Storage or Compound Document, Microsoft Office). Improved version of the OleFileIO module from PIL, the Python Imaging Library.

564

ML

onnx

wheel2

Open Neural Network Exchange

565

ML

onnxmltools

pip

Converts Machine Learning models to ONNX

566

opencv_python

wheel

BSD

OpenCV wrapper

567

OpenImageIO

wheel

A library for reading and writing images with emphasis on animation and visual effects.

568

openpyxl

pip

MIT

reads/writes Excel files; the version is pinned to 1.8.6 because pandas does not work with more recent versions yet

569

ML

openTSNE

wheel

Fast t-SNE

570

optlang

pip

Formulate optimization problems using sympy expressions and solve them using interfaces to third-party optimization software (e.g. GLPK).

571

ORANGE

orange3

pip

Orange is a component-based data mining software. It includes a range of data visualization, exploration, preprocessing and modeling techniques. It can be used through a nice and intuitive user interface or, for more advanced users, as a module for the Python programming language.

572

ORANGE

orange3-associate

pip

This module implements FP-growth [1] frequent pattern mining algorithm with bucketing optimization [2] for conditional databases of few items.

573

ORANGE

Orange3-ImageAnalytics

pip

GPLv3+

Orange3 add-on for image data mining.

574

ORANGE

Orange3-Network

pip

GPLv3+

Orange Network is an add-on for Orange data mining software package. It provides network visualization and analysis tools.

575

ORANGE

orange3-text

pip

Orange3 Text extends Orange3, a data mining software package, with common functionality for text mining. It provides access to publicly available data, like NY Times, Twitter and PubMed. Further, it provides tools for preprocessing, constructing vector spaces (like bag-of-words, topic modeling and word2vec) and visualizations like word cloud and geo map. All features can be combined with powerful data mining techniques from the Orange data mining framework.

576

VIZ

osmapi

pip

GPLv3

Python wrapper for the OSM API

577

osqp

wheel

The OSQP (Operator Splitting Quadratic Program) solver is a numerical optimization package for solving quadratic problems

578

packaging

pip

BSD or Apache License, Version 2.0

Core utilities for Python packages

579

palettable

pip

MIT

Color palettes for Python

580

pamela

pip

An interface to the Pluggable Authentication Modules (PAM) library on linux, written in pure python (using ctypes)

581

DATA/ML

pandas

wheel

BSD

manipulate tables in memory, SQL-style

582

DATA/ML

pandas-datareader

pip

BSD License

Up to date remote data access for pandas, works for multiple versions of pandas.

583

VIZ

pandas-highcharts

pip

MIT

plotting in javascript and pandas

584

TEACH

pandas_streaming

pip

pandas_streaming aims at processing big files with pandas: files too big to hold in memory but too small to be parallelized with a significant gain. The module replicates a subset of the pandas API and implements other functionalities for machine learning.

585

pandoc-attributes

pip

An Attribute class to be used with pandocfilters

586

pandocfilters

pip

BSD

Utilities for writing pandoc filters in python

587

param

pip

BSD

Declarative Python programming using Parameters.

588

NETWORK

paramiko

pip

LGPL

SSH2 protocol library

589

parsel

pip

Parsel is a library to extract data from HTML and XML using XPath and CSS selectors.

590

parsimonious

pip

(Soon to be) the fastest pure-Python PEG parser I could muster

591

parso

pip

MIT

Parso is a Python parser that supports error recovery and round-trip parsing for different Python versions (in multiple Python versions). Parso is also able to list multiple syntax errors in your python file.

592

partd

pip

BSD

Appendable key-value storage

593

passlib

pip

BSD

comprehensive password hashing framework supporting over 30 schemes
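
To show what a password hashing scheme involves (a random salt, a deliberately slow key derivation, a constant-time comparison), here is a sketch using PBKDF2 from the standard library. The helper names and the iteration count are assumptions chosen for illustration; passlib's own API and defaults are different.

```python
import hashlib
import hmac
import os


def hash_password(password, salt=None, iterations=100_000):
    """Return (salt, digest) for a password, using PBKDF2-HMAC-SHA256."""
    if salt is None:
        salt = os.urandom(16)    # a fresh random salt per password
    digest = hashlib.pbkdf2_hmac("sha256", password.encode("utf-8"), salt, iterations)
    return salt, digest


def verify_password(password, salt, digest, iterations=100_000):
    """Recompute the digest with the stored salt and compare in constant time."""
    candidate = hashlib.pbkdf2_hmac("sha256", password.encode("utf-8"), salt, iterations)
    return hmac.compare_digest(candidate, digest)


salt, digest = hash_password("s3cret")
```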

594

JUPYTER

path.py

pip

MIT

IPython, dependency

595

pathlib2

pip

for ggplot, Object-oriented filesystem paths

596

pathvalidate

pip

MIT

A python library to validate/sanitize a string such as filenames/variable-names/excel-sheet-names.

597

DATA/ML

patsy

pip

2-clause BSD

A Python package for describing statistical models and for building design matrices.

598

pbr

pip

Apache Software License

PBR is a library that injects some useful and sensible default behaviors into your setuptools run.

599

pdfminer3k

pip

MIT

PDF parser and analyzer

600

PDF

pdfrw

pip

MIT

PDF file reader/writer library

601

pep8

pip

official guidelines on Python style

602

persistent

wheel

Translucent persistent objects

603

JUPYTER

pexpect

pip

ISC license

needed by ipykernel on Linux, Pexpect makes Python a better tool for controlling other applications (needed by metakernel).

604

SQL

pg8000

pip

A pure-Python PostgreSQL driver

605

picklable-itertools

pip

MIT

A reimplementation of the Python standard library’s itertools, in Python, using picklable iterator objects. (fuel)

606

JUPYTER

pickleshare

pip

MIT

IPython, dependency

607

Pillow

wheel

HPND

read/create images

608

PIMS

pip

As Is

Python Image Sequence (for trackpy)

609

pint

pip

Physical quantities module

610

pipdeptree

pip

MIT

displays module dependencies as a tree

611

pivottablejs

pip

MIT

PivotTable.js integration for Jupyter/IPython Notebook.

612

pkginfo

pip

MIT

Query metadata from sdists / bdists / installed packages.

613

plac

pip

BSD License

The smartest command line arguments parser in the world

614

planar

wheel

BSD

2D planar geometry library for Python.

615

VIZ

plotly

pip

MIT

Plotly’s Python graphing library makes interactive, publication-quality graphs online. Examples of how to make line plots, scatter plots, area charts, bar charts, error bars, box plots, histograms, heatmaps, subplots, multiple-axes, polar charts and bubble charts.

616

plotnine

pip

GPL-2

A grammar of graphics for Python

617

pluggy

pip

MIT license

plugin and hook calling mechanisms for python

618

ply

pip

Python Lex & Yacc (for thrifty)

619

Pmw

pip

Pmw is a toolkit for building high-level compound widgets in Python using the Tkinter module.

620

pocket

pip

BSD

Access to pocket API.

621

Polygon3

wheel

Handles polygonal shapes in 2D. This library is free for non-commercial use only.

622

portend

pip

TCP port monitoring utilities

623

preshed

wheel

MIT

Cython hash table that trusts the keys are pre-hashed

624

prince

pip

Prince is a factor analysis library for datasets that fit in memory.

625

priority

pip

MIT License

Priority is a pure-Python implementation of the priority logic for HTTP/2.

626

proglog

pip

MIT - copyright Edinburgh Genome Foundry

Progress bar.

627

VIZ

progressbar2

pip

BSD

Text progress bar library for Python. (fuel)

628

JUPYTER

prometheus_client

pip

The official Python 2 and 3 client for Prometheus.

629

JUPYTER

prompt_toolkit

pip

Library for building powerful interactive command lines in Python

630

properties

pip

An organizational aid and wrapper for validation and tab completion of class properties

631

protobuf

pip

3-Clause BSD License

Protocol Buffers are Google’s data interchange format

632

pscript

pip

for flexx

633

psutil

wheel

BSD

cross-platform library for retrieving information on running processes and system utilization (CPU, memory, disks, network) in Python.

634

psycopg2

wheel

Python-PostgreSQL Database Adapter

635

PTable

pip

BSD (3 clause)

A simple Python library for easily displaying tabular data in a visually appealing ASCII table format. (for streamparse)

636

JUPYTER/LINUX

ptyprocess

pip

ISC

dependency for the terminado, Run a subprocess in a pseudo terminal

637

DATA/ML

PuLP

wheel

linear and quadratic optimization with constraints

638

py

pip

MIT license

library with cross-python path, ini-parsing, io, code, log facilities

639

py-cpuinfo

pip

MIT

Py-cpuinfo gets CPU info with pure Python. Py-cpuinfo should work without any extra programs or libraries, beyond what your OS provides. It does not require any compilation (C/C++, assembly, et cetera) to use. It works with Python 2 and 3.

640

py4j

pip

BSD License

Enables Python programs to dynamically access arbitrary Java objects

641

OPTIM

pyamg

wheel

PyAMG is a library of Algebraic Multigrid (AMG) solvers with a convenient Python interface.

642

pyasn1

pip

BSD

ASN.1 types and codecs (for pysnmp)

643

pyasn1-modules

pip

BSD-2-Clause

A collection of ASN.1-based protocols modules.

644

PyAudio

wheel

MIT

PyAudio provides Python bindings for PortAudio v19, the cross-platform audio I/O library. With PyAudio, you can easily use Python to play and record audio streams on a variety of platforms (e.g., GNU/Linux, Microsoft Windows, and Mac OS X).

645

pybars3

pip

GNU Library or Lesser General Public License (LGPL)

Handlebars.js templating

646

C++

pybind11

pip

BSD

pybind11 is a lightweight header-only library that exposes C++ types in Python and vice versa, mainly to create Python bindings of existing C++ code.

647

DATA/ML

pyclustering

pip

GNU General Public License v3 (GPLv3)

many kinds of clustering (Optics, DBScan, x-means, …)

648

STYLE

pycodestyle

pip

Expat license

Python style guide checker

649

pycontracts

pip

PyContracts is a Python package for declaring constraints on function parameters and return values. The setup for version 1.7.7 is broken.

650

pycosat

wheel

PicoSAT is a popular SAT solver written by Armin Biere in pure C.

651

pycparser

pip

BSD

pycparser is a complete parser of the C language, written in pure Python using the PLY parsing library. It parses C code into an AST and can serve as a front-end for C compilers or analysis tools.

652

pycrypto

wheel2

Public Domain + patent for some algorithm

Cryptographic modules for Python (not available on x64 and Python 3)

653

pycryptodomex

pip

BSD, Public Domain, Apache

Cryptographic library for Python, a standalone fork of pycrypto installed under the Cryptodome namespace

654

pyct

pip

BSD 3-Clause License

A utility package

655

GPU

pycuda

wheel

MIT

PyCUDA lets you access Nvidia’s CUDA parallel computation API from Python.

656

pycurl

wheel

LGPL/MIT

PycURL, an interface to the libcurl library. (for grab)

657

PyDispatcher

pip

Multi-producer-multi-consumer signal dispatching mechanism

658

MUSIC

pydub

pip

MIT

Pydub lets you do stuff to audio in a way that isn’t stupid.

659

pydy

pip

LICENSE.txt

Multibody Dynamics with Python

660

pyecharts

pip

MIT

pyecharts is a library to generate charts using Echarts. It simply provides the interface of 28+ kinds of charts between Echarts and Python.

661

pyecharts-javascripthon

pip

MIT

pyecharts-javascripthon helps translate Python functions into javascript ones. It uses javascripthon and dukpy to blend Python codes into javascript runtime. It supports python 2.7, 3.4, 3.5 and 3.6. It works on Linux, MacOS and Windows platforms.

662

pyecharts-snapshot

pip

pyecharts-snapshot renders the output of pyecharts as a png, jpeg, gif, svg image or a pdf file at command line or in your code.

663

pyeda

wheel

A library for electronic design automation.

664

pyemd

wheel

MIT

A Python wrapper for Ofir Pele and Michael Werman’s implementation of the Earth Mover’s Distance.

665

TEACH

pyenbc

pip

Helpers for remote work, clusters, hadoop

666

TEACH

pyensae

pip

MIT

helpers, Hadoop, SQL, financial times series, …

667

NETWORK

pyexecjs

pip

MIT

Run JavaScript code from Python (for pyreact)

668

STYLE

pyflakes

pip

MIT

checks a script for errors such as unused imports and undefined names (it does not check PEP 8 style)

669

DATA/ML

pyflux

wheel

As Is

An open source time series library for the Python Programming Language

670

ML

pyfm

wheel

Factorization Machine

671

VIZ

pygal

pip

GNU LGPL v3+

plotting (javascript)

672

VIZ

pygal_maps_ch

pip

GNU LGPL v3+

Swiss canton map for pygal

673

VIZ

pygal_maps_fr

pip

GNU LGPL v3+

French maps for pygal

674

VIZ

pygal_maps_world

pip

LGPLv3+

extension to pygal (maps)

675

SPHINX

pygal_sphinx_directives

pip

GNU LGPL v3+

Pygal sphinx integration

676

GUI

pygame

wheel

LGPL

GUI, interface for games (needs to be installed from www.pygame.org on Linux)

677

pygit2

wheel

Pygit2 is a set of Python bindings to the libgit2 shared library, libgit2 implements the core of Git.

678

GUI

pyglet

pip

a cross-platform windowing and multimedia library for Python

679

pygments

pip

BSD License

syntax highlighting package written in Python

680

pyhamcrest

pip

for twisted

681

PROFILING

pyinstrument

pip

BSD

A Python profiler that records the call stack of the executing code, instead of just the final function in it.

682

PROFILING

pyinstrument_cext

pip

Pyinstrument’s C extensions - reducing the overhead of statistical profilers

683

pyjsparser

pip

MIT

Fast javascript parser (based on esprima.js)

684

PyJWT

pip

MIT

JSON Web Token library for Python 3.

685

pylint

pip

GPL

statistics on Python script style

686

ML

pyltr

pip

BSD-new

pyltr is a Python learning-to-rank toolkit with ranking models, evaluation metrics, data wrangling helpers, and more.

687

pylzma

wheel2

LGPL

Python bindings for the LZMA library by Igor Pavlov.

688

pymatbridge

pip

pymatbridge is a set of python and matlab functions to allow these two systems to talk to each other

689

PHYS

pymatgen

wheel

MIT

Pymatgen (Python Materials Genomics) is a robust, open-source Python library for materials analysis.

690

pymatsolver

pip

A (sparse) matrix solver for python.

691

DATA/ML

pymc

wheel

Monte Carlo computation

692

PyMeta3

pip

Pattern-matching language based on OMeta for Python 3 and 2

693

pymkl

pip

Python wrapper of Intel MKL routines

694

TEACH

pymmails

pip

MIT

read/send emails

695

NoSQL

pymongo

wheel

Apache License, Version 2.0

Python wrapper for MongoDB

696

pympler

pip

Apache License, Version 2.0

Pympler is a development tool to measure, monitor and analyze the memory behavior of Python objects in a running Python application.

697

SQL

pymssql

wheel

A simple database interface for Python that builds on top of FreeTDS to provide a Python DB-API (PEP-249) interface to Microsoft SQL Server.

698

TEACH

pymyinstall

pip

MIT

easy installation of modules including Windows

699

SQL

PyMySQL

pip

MIT

Pure-Python MySQL Driver

700

NETWORK

PyNaCl

pip

Apache License 2.0

Python binding to the Networking and Cryptography (NaCl) library

701

SQL

pyodbc

wheel

access to the ODBC protocol (SQL databases)

702

GPU

pyopencl

wheel

MIT

PyOpenCL lets you access the OpenCL parallel computation API from Python.

703

PyOpenGL

wheel

BSD

use OpenGL in Python

704

PyOpenGL_accelerate

wheel

BSD

Acceleration code for PyOpenGL

705

pyOpenSSL

pip

Apache License, Version 2.0

Python wrapper module around the OpenSSL library

706

pyotp

pip

PyOTP is a Python library for generating and verifying one-time passwords.
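
The counter-based one-time password algorithm (HOTP, RFC 4226) that such libraries build on fits in a few lines of standard-library Python. The function below is a sketch written for this example and takes the secret as raw bytes; it is not PyOTP's API, which works with base32-encoded secrets.

```python
import hashlib
import hmac
import struct


def hotp(secret: bytes, counter: int, digits: int = 6) -> str:
    """Compute an HOTP value as described in RFC 4226."""
    # HMAC-SHA1 over the counter encoded as an 8-byte big-endian integer.
    digest = hmac.new(secret, struct.pack(">Q", counter), hashlib.sha1).digest()
    # Dynamic truncation: the low 4 bits of the last byte select a 4-byte window.
    offset = digest[-1] & 0x0F
    code = struct.unpack(">I", digest[offset:offset + 4])[0] & 0x7FFFFFFF
    return str(code % 10 ** digits).zfill(digits)


# Secret used by the test vectors in RFC 4226, Appendix D.
first_code = hotp(b"12345678901234567890", 0)
```

A time-based password (TOTP) is obtained from the same function by using the number of elapsed 30-second intervals as the counter.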

707

pypandoc

pip

MIT

Pypandoc provides a thin wrapper for pandoc, a universal document converter.

708

pyparsing

pip

MIT License

alternative approach to creating and executing simple grammars

709

PDF

pyPdf

github

BSD

read PDF

710

PDF

PyPDF2

pip

BSD

PDF toolkit

711

pypiserver

pip

BSD

run a local pypi server

712

pypmc

wheel

GPLv2

pypmc is a python package focusing on adaptive importance sampling.

713

pypng

pip

MIT

Pure Python PNG image encoder/decoder

714

VIZ

pyproj

wheel

OSI Approved

python interface to PROJ4 library for cartographic transformations https://jswhit.github.io/pyproj, needed by cartopy

715

GUI

PyQt5

pip

GPL v3

716

GUI

PyQt5-sip

pip

SIP

717

GUI

pyqtgraph

pip

Scientific Graphics and GUI Library for Python, depends on PySide

718

TEACH

pyquickhelper

pip

MIT

helpers to generate documentation

719

NETWORK

pyreact

pip

Python bridge to JSX & the React JavaScript library. (for pyxley)

720

pyreadline

pip

python implementation of GNU readline functionality

721

pyrsistent

pip

MIT

Pyrsistent is a number of persistent collections (by some referred to as functional data structures). Persistent in the sense that they are immutable.

722

TEACH

pyrsslocal

pip

MIT

RSS readers

723

MATHS

PySAL

pip

PySAL is an open source library of spatial analysis functions written in Python intended to support the development of high level applications.

724

pyshp

pip

MIT

Pure Python read/write support for ESRI Shapefile format

725

GUI

PySide2

pip

The Qt for Python project aims to provide a complete port of the PySide module to Qt 5.

726

pysmi

pip

BSD

SNMP SMI/MIB Parser (for pysnmp)

727

NETWORK

pysnmp

pip

A pure-Python SNMPv1/v2c/v3 library

728

PySocks

pip

A semi-actively maintained SocksiPy fork. Contains many improvements to the original.

729

pystache

pip

MIT

Mustache for Python (for bokeh)

730

DATA/ML

pystan

pip

GNU General Public License v3 (GPLv3)

PyStan provides an interface to Stan, a package for Bayesian inference using the No-U-Turn sampler, a variant of Hamiltonian Monte Carlo.

731

pystorm

pip

Battle-tested Apache Storm Multi-Lang implementation for Python.

732

ML

pystruct

wheel

Learning Structured Prediction in Python

733

pytablereader

pip

MIT

A python library to load structured table data from files/URL with various data format: CSV/Excel/HTML/JSON/LTSV/Markdown/TSV.

734

pytablewriter

pip

convert a dataframe into many formats

735

pytagcloud

pip

Create beautiful tag clouds as images or HTML

736

pytest

pip

MIT license

pytest: simple powerful testing with Python

737

pytest_runner

pip

Setup scripts can use pytest-runner to add setup.py test support for pytest runner.

738

python-daemon

pip

Library to implement a well-behaved Unix daemon process (for luigi)

739

python-dateutil

pip

Dual License

helpers to manipulate dates

740

python-docx

pip

read/write Word document

741

python-editor

pip

Programmatically open an editor, capture the result.

742

python-gmaps

pip

BSD

Google Maps API client http://python-gmaps.readthedocs.org

743

python-jenkins

pip

BSD

interact with Jenkins

744

ML

python-louvain

pip

Louvain algorithm

745

python-mimeparse

pip

MIT

A module provides basic functions for parsing mime-type names and matching them against a list of media-ranges. (falcon)

746

python-pptx

pip

The MIT License (MIT)

read/write PowerPoint presentation

747

python-utils

pip

Python Utils is a module with some convenient utilities not included with the standard Python install

748

python3-linkedin

pip

MIT

Python Interface to the LinkedIn API

749

python_Levenshtein

wheel

C implementation of Levenshtein distance.
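
For reference, the Levenshtein distance itself can be computed with a short dynamic program. The pure-Python sketch below only illustrates the algorithm; it is far slower than a C implementation and is not the module's API.

```python
def levenshtein(a, b):
    """Minimum number of insertions, deletions and substitutions turning a into b."""
    previous = list(range(len(b) + 1))    # distances from the empty prefix of a
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,                      # delete char_a
                current[j - 1] + 1,                   # insert char_b
                previous[j - 1] + (char_a != char_b), # substitute, free if equal
            ))
        previous = current
    return previous[-1]


distance = levenshtein("kitten", "sitting")
```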

750

python_oauth2

pip

for jupyterhub

751

pythonnet

wheel

MIT

Python binding for C#

752

pythonqwt

pip

MIT

Qt plotting widgets (Spyder)

753

VIZ

pythreejs

pip

BSD

A Python / ThreeJS bridge utilizing the Jupyter widget infrastructure.

754

pytils

pip

Russian-specific string utils (for selection, weblib)

755

pytool

pip

A collection of tools for Python

756

pytools

pip

MIT

A collection of tools for Python

757

pytube

pip

MIT

play with youtube videos

758

pytz

pip

MIT

World timezone definitions, modern and historical

759

VIZ

pyviz_comms

pip

Bidirectional communication for PyViz.

760

DATA/ML

PyWavelets

wheel

MIT

wavelets computation

761

pywinpty

wheel

Python bindings for the winpty pseudo-terminal library. It makes it possible to create and communicate with Windows processes that print output and receive input via console input and output pipes.

762

VIZ

PyX

wheel

GNU General Public License (GPL)

plotting

763

NETWORK

pyxley

pip

Python helpers for building dashboards using Flask and React

764

PyYAML

wheel

MIT

YAML parser and emitter for Python

765

NETWORK

pyzmq

wheel

LGPL+BSD

Python bindings for ZeroMQ (0MQ) (PyPI distributes the binaries)

766

VIZ

qgrid

pip

A Pandas DataFrame viewer for IPython Notebook.

767

DATA/ML

QInfer

pip

QInfer is a library using Bayesian sequential Monte Carlo for quantum parameter estimation.

768

QtAwesome

pip

QtAwesome enables iconic fonts such as Font Awesome and Elusive Icons in PyQt and PySide applications.

769

JUPYTER

qtconsole

pip

BSD

IPython, notebooks, qtconsole

770

GUI

qtpy

pip

single interface for PyQt4, PyQt5 and PySide

771

Quandl

pip

MIT

access Quandl API

772

queuelib

pip

BSD

Collection of persistent (disk-based) queues

773

OPTIM

qutip

wheel

BSD

QuTiP is open-source software for simulating the dynamics of open quantum systems.

774

recommonmark

pip

MIT

CommonMark (Markdown) parser for docutils and Sphinx (for pymc3?)

775

recordclass

wheel

A mutable variant of collections.namedtuple, which supports assignments.

776

redis

pip

Python client for Redis key-value store

777

regex

wheel

Python Software Foundation License

Alternative regular expression module, to replace re.

778

reportlab

wheel

BSD license (see license.txt for details), Copyright (c) 2000-2018, ReportLab Inc.

This is the ReportLab PDF Toolkit. It allows rapid creation of rich PDF documents, and also creation of charts in a variety of bitmap and vector formats.

779

requests

pip

Apache 2.0

human interface for http

780

requests-cache

pip

Persistent cache for requests library

781

requests-file

pip

Apache 2.0

File transport adapter for Requests

782

requests-ftp

pip

Apache 2.0

FTP Transport Adapter for Requests

783

requests-oauthlib

pip

ISC

OAuthlib authentication support for Requests.

784

retrying

pip

Apache 2.0

Retrying is an Apache 2.0 licensed general-purpose retrying library, written in Python, to simplify the task of adding retry behavior to just about anything.
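
The core of retry behavior is a decorator that calls the function again when it raises. The sketch below uses only the standard library; the decorator name `retry` and its parameters are assumptions made for this example and do not reproduce the retrying package's API.

```python
import functools
import time


def retry(max_attempts=3, wait_seconds=0.0, exceptions=(Exception,)):
    """Hypothetical decorator: call the function again if it raises one of `exceptions`."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            for attempt in range(1, max_attempts + 1):
                try:
                    return func(*args, **kwargs)
                except exceptions:
                    if attempt == max_attempts:
                        raise              # out of attempts: propagate the last error
                    time.sleep(wait_seconds)
        return wrapper
    return decorator


calls = []


@retry(max_attempts=3)
def flaky():
    """Fails twice, then succeeds."""
    calls.append(1)
    if len(calls) < 3:
        raise ValueError("transient failure")
    return "ok"


outcome = flaky()
```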

785

rope_py3k

pip

GPL=OpenBSD

refactoring library

786

DATA/ML

rpy2

wheel

interact with R (R_HOME needs to be set up on Linux)

787

rsa

pip

ASL 2

Pure-Python RSA implementation

788

MATHS

Rtree

wheel

GNU Library or Lesser General Public License (LGPL)

R-Tree spatial index for Python GIS

789

ruamel.yaml

pip

ruamel.yaml is a YAML parser/emitter that supports roundtrip preservation of comments, seq/map flow style, and map key order

790

rx

pip

Reactive Extensions (Rx) for Python (LINQ)

791

AWS

s3fs

pip

Convenient Filesystem interface over S3

792

AWS

s3transfer

pip

Apache License 2.0

An Amazon S3 Transfer Manager

793

ML

sacred

pip

MIT

Facilitates automated and reproducible experimental research

794

sas7bdat

pip

read/write SAS format

795

scikit-fusion

pip

GPLv3

A Python module for data fusion built on top of factorized models.

796

scikit-image

wheel

Modified BSD

scikit-image is a collection of algorithms for image processing.

797

DATA/ML

scikit-learn

wheel

new BSD

machine learning

798

VIZ

scikit-plot

pip

MIT License

nicer graphs than matplotlib for data scientists

799

DATA/ML

scilab2py

pip

Python to Scilab bridge

800

DATA/ML

scipy

wheel

BSD

scientific computation, eigenvalues, linear algebra

801

DATA/ML

scoop

pip

GNU Library or Lesser General Public License (LGPL)

SCOOP (Scalable COncurrent Operations in Python) is a distributed task module allowing concurrent parallel programming on various environments, from heterogeneous grids to supercomputers

802

ML

scrapy

pip

A high-level Web Crawling and Web Scraping framework

803

OPTIM

scs

wheel

Solves convex cone programs via operator splitting.

804

VIZ

seaborn

pip

BSD (3-clause)

nicer graphs than matplotlib for statistical purposes

805

ML

seasonal

pip

MIT

timeseries

806

secretstorage

pip

BSD 3-Clause License

This module provides a way for securely storing passwords and other secrets.

807

selection

pip

API to extract data from HTML and XML documents. (for grab)

808

NETWORK

selenium

pip

Apache 2.0

Python wrapper for Selenium

809

semantic_version

pip

BSD

A library implementing the “SemVer” scheme.

810

semver

pip

Python helper for Semantic Versioning (http://semver.org/)

811

Send2Trash

pip

BSD License

Send file to trash natively under Mac OS X, Windows and Linux.

812

ML

seqlearn

wheel

sequence classification toolkit for Python

813

serverfiles

pip

A utility that accesses files on an HTTP server and stores them locally for reuse.

814

service_identity

pip

MIT

Service identity verification for pyOpenSSL.

815

setproctitle

wheel

BSD

A Python module to customize the process title

816

setuptools-git

pip

BSD

This is a plugin for setuptools that enables git integration. Once installed, Setuptools can be told to include in a package distribution all the files tracked by git. This is an alternative to explicit inclusion specifications with MANIFEST.in.

817

OPTIM

sfepy

wheel

Simple Finite Elements in Python

818

Shapely

wheel

BSD

Manipulation and analysis of geometric objects in the Cartesian plane.

819

shiboken2

pip

for PySide

820

simhash

pip

MIT

A Python implementation of Simhash Algorithm

821

JUPYTER

simplegeneric

pip

ZPL 2.1

IPython, dependency

822

simplejson

wheel

MIT License

Simple, fast, extensible JSON encoder/decoder for Python

823

SimpleSQLite

pip

MIT

SimpleSQLite is a Python library to simplify the table creation and data insertion into SQLite database.

824

singledispatch

pip

MIT

for nltk

825

GUI

SIP

pip

GPL v2 or GPL v3 or BSD

For PyQt5

826

six

pip

MIT

helpers for python 2/3 conversion

827

ML

sklearn_contrib_lightning

wheel

large-scale linear classification, regression and ranking

828

ML

sklearn_contrib_py_earth

wheel

A Python implementation of Jerome Friedman’s Multivariate Adaptive Regression Splines algorithm, in the style of scikit-learn. The py-earth package implements Multivariate Adaptive Regression Splines using Cython and provides an interface that is compatible with scikit-learn’s Estimator, Predictor, Transformer, and Model interfaces.

829

ML

sklearn_pandas

pip

~MIT

This module provides a bridge between Scikit-Learn’s machine learning methods and pandas-style Data Frames.

830

ML

skll

pip

BSD

SciKit-Learn Laboratory makes it easier to run machine learning experiments with scikit-learn.

831

slicerator

pip

BSD

A lazy-loading, fancy-sliceable iterable.

832

smart_open

pip

Public Domain

Utils for streaming large files (S3, HDFS, gzip, bz2…), provides the same API for many formats

833

smmap2

pip

BSD

A pure python implementation of a sliding window memory map manager

834

PROFILING

snakeviz

pip

BSD

SnakeViz is a browser based graphical viewer for the output of Python’s cProfile module.

835

snowballstemmer

pip

BSD-3-Clause

This package provides 16 stemmer algorithms (15 + Porter English stemmer) generated from Snowball algorithms, needed by sphinx-rtd-theme.

836

NETWORK

sockjs-tornado

pip

SockJS-tornado is a Python server side counterpart of SockJS-client browser library running on top of Tornado framework.

837

SPHINX

solar_theme

pip

BSD~MIT

sphinx theme

838

sortedcontainers

pip

Apache 2.0

Python Sorted Container Types: SortedList, SortedDict, and SortedSet

839

sounddevice

wheel

MIT

This Python module provides bindings for the PortAudio library and a few convenience functions to play and record NumPy arrays containing audio signals.

840

soupsieve

pip

MIT License

Soup Sieve is a CSS selector library designed to be used with Beautiful Soup 4.

841

NLP

spacy

wheel

MIT

Industrial-strength NLP

842

sparse

pip

This implements sparse arrays of arbitrary dimension on top of numpy and scipy.sparse

843

spectrum

wheel

Spectrum contains tools to estimate Power Spectral Densities using methods based on Fourier transform, Parametric methods or eigenvalues analysis

844

spglib

wheel

BSD

Python bindings for C library for finding and handling crystal symmetries

845

SPHINX

sphinx

pip

BSD

documentation generation based on RST

846

SPHINX

sphinx-better-theme

pip

sphinx theme

847

SPHINX

sphinx-bulma-theme

pip

sphinx theme

848

SPHINX

sphinx-corlab-theme

pip

LGPLv3+

sphinx theme

849

SPHINX

sphinx-gallery

pip

3-clause BSD

gallery for examples

850

SPHINX

sphinx-ioam-theme

pip

sphinx theme

851

SPHINX

sphinx-materialdesign-theme

pip

sphinx theme

852

SPHINX

sphinx-modern-theme

pip

MIT

sphinx theme

853

SPHINX

sphinx-redactor-theme

pip

MIT

sphinx theme

854

SPHINX

sphinx-rtd-theme

pip

MIT

sphinx theme

855

SPHINX

sphinx_bootstrap_theme

pip

MIT

sphinx theme

856

SPHINX

sphinx_modern_theme_modified

pip

sphinx theme

857

SPHINX

sphinx_py3doc_enhanced_theme

pip

BSD

sphinx theme

858

SPHINX

sphinx_readable_theme

pip

MIT

sphinx theme

859

SPHINX

sphinx_theme_pd

pip

MIT

sphinx theme

860

SPHINX

sphinxcontrib-applehelp

pip

BSD

extension for Sphinx

861

sphinxcontrib-blockdiag

pip

BSD

Sphinx “blockdiag” extension

862

SPHINX

sphinxcontrib-devhelp

pip

BSD

extension for Sphinx

863

SPHINX

sphinxcontrib-htmlhelp

pip

BSD

extension for Sphinx

864

SPHINX

sphinxcontrib-imagesvg

pip

BSD

support svg in Sphinx

865

SPHINX

sphinxcontrib-jsdemo

pip

BSD

to demo HTML or Javascript in Sphinx documentation

866

SPHINX

sphinxcontrib-jsmath

pip

BSD

extension for Sphinx

867

SPHINX

sphinxcontrib-lunrsearch

pip

simple autocompletion for the sphinx documentation

868

SPHINX

sphinxcontrib-qthelp

pip

BSD

extension for Sphinx

869

SPHINX

sphinxcontrib-serializinghtml

pip

BSD

extension for Sphinx

870

SPHINX

sphinxcontrib-websupport

pip

BSD

extension for Sphinx

871

SPHINX

sphinxjp.themecore

pip

A sphinx theme plugin extension.

872

SPHINX

sphinxjp.themes.basicstrap

pip

MIT

sphinx theme

873

SPHINX

sphinxjp.themes.gopher

pip

MIT

sphinx theme

874

SPHINX

sphinxjp.themes.htmlslide

pip

MIT

sphinx theme

875

SPHINX

sphinxjp.themes.sphinxjp

pip

MIT

sphinx theme

876

SPHINX

sphinxtrap

pip

sphinx theme

877

NETWORK

splinter

pip

As Is

browser abstraction for web acceptance testing

878

sputnik

pip

Data package manager library

879

spyder

pip

scientific IDE

880

spyder-kernels

pip

Jupyter Kernels for the Spyder console

881

SQL

SQLAlchemy

wheel

MIT

model SQL queries as objects

882

sqlite_bro

pip

MIT

GUI for SQLite

883

sqliteschema

pip

sqliteschema is a Python library to dump table schema of a SQLite database file.

884

SQL

sqlparse

pip

Non-validating SQL parser

885

starlette

pip

BSD

Starlette is a lightweight ASGI framework/toolkit, which is ideal for building high performance asyncio services.

886

DATA/ML

statsmodels

wheel

BSD License

statistical modelling, depends on scipy

887

stopit

pip

GPLv3

Timeout control decorator and context managers, raise any exception in another thread.

888

streamparse

pip

Streamparse lets you run Python code against real-time streams of data via Apache Storm.

889

structures

pip

User-friendly library for creating data structures.

890

suds-jurko

pip

Lightweight SOAP client (Jurko’s fork)

891

supersmoother

pip

BSD 3-clause

This is an efficient implementation of Friedman’s SuperSmoother [1] algorithm in pure Python. It makes use of numpy for fast numerical computation.

892

swiglpk

wheel

swiglpk - Simple swig bindings for the GNU Linear Programming Kit

893

sympy

pip

BSD

SymPy is a Python library for symbolic mathematics.

894

TA_Lib

wheel

This is a Python wrapper for TA-LIB based on Cython instead of SWIG.

895

tabledata

pip

A Python library to represent tabular data for pytablewriter/pytablereader/SimpleSQLite.

896

tables

wheel

BSD 2-Clause

PyTables is a package for managing hierarchical datasets and designed to efficiently and easily cope with extremely large amounts of data.

897

tabulate

pip

MIT

Pretty-print tabular data

898

tblib

pip

BSD-2-Clause

Traceback fiddling library. For now allows you to pickle tracebacks and raise exceptions with pickled tracebacks in different processes. This allows better error handling when running code over multiple processes (imagine multiprocessing, billiard, futures, celery etc)

899

TEACH

teachpyx

pip

MIT

teaching materials

900

tempora

pip

MIT

Objects and routines pertaining to date and time (tempora)

901

DATA/ML

tensorboard

pip

Apache 2.0

TensorBoard lets you watch Tensors Flow

902

DEEP LEARNING

tensorflow

wheel

Apache 2.0

Deep Learning from Google

903

DEEP LEARNING

tensorflow-estimator

pip

Apache 2.0

Deep Learning from Google

904

termcolor

pip

MIT

ANSI color formatting for output in terminal.

905

JUPYTER/LINUX

terminado

pip

BSD

dependency for the notebooks, Terminals served to term.js using Tornado websockets

906

testpath

pip

MIT

Test utilities for code working with files and commands

907

text-unidecode

pip

The most basic Text::Unidecode port

908

textblob

pip

MIT

TextBlob is a Python (2 and 3) library for processing textual data. It provides a simple API for diving into common natural language processing (NLP) tasks such as part-of-speech tagging, noun phrase extraction, sentiment analysis, classification, translation, and more.

909

texttable

pip

module for creating simple ASCII tables

910

OPTIM

thinc

wheel

MIT

Practical Machine Learning for NLP. Thinc is the machine learning library powering spaCy.

911

thrift

pip

Python bindings for the Apache Thrift RPC system

912

thriftpy

pip

Pure Python implementation of Apache Thrift.

913

tinycss2

pip

BSD

tinycss2 is a complete yet simple CSS parser for Python. It supports the full syntax and error handling for CSS 2.1 as well as some CSS 3 modules

914

noSQL

tinydb

pip

TinyDB is a tiny, document oriented database optimized for your happiness :) It’s written in pure Python and has no external requirements.

915

TEACH

tkinterquickhelper

pip

MIT

windows built on top of tkinter

916

toml

pip

MIT

Python Library for Tom’s Obvious, Minimal Language

917

DATA/ML

toolz

pip

BSD

Toolz provides a set of utility functions for iterators, functions, and dictionaries.

918

DEEP LEARNING

torch

wheel

BSD-3

PyTorch is a deep learning framework that puts Python first.

919

DEEP LEARNING

torchvision

pip

BSD

image and video datasets and models for torch deep learning

920

NETWORK

tornado

wheel

http://www.apache.org/licenses/LICENSE-2.0

python server, IPython relies on it

921

VIZ

toyplot

pip

BSD

The kid-sized plotting toolkit for Python with grownup-sized goals.

922

ML

TPOT

pip

GNU/LGPLv3

Consider TPOT your Data Science Assistant. TPOT is a Python tool that automatically creates and optimizes machine learning pipelines using genetic programming.

923

JUPYTER

tqdm

pip

MPLv2.0, MIT Licences

A Simple Python Progress Meter

924

traceback2

pip

traceback2 is a backport of the newer features of the standard traceback module to older Python versions.

925

DATA/ML

trackpy

pip

BSD - 3 clauses

trackpy is a Python package for particle tracking in 2D, 3D, and higher dimensions.

926

JUPYTER

traitlets

pip

BSD

IPython, dependency

927

traits

pip

BSD

explicitly typed attributes for Python

928

traittypes

pip

BSD

Custom trait types for scientific computing.

929

transaction

pip

Transaction management for Python. This package contains a generic transaction implementation for Python. It is mainly used by the ZODB.

930

translationstring

pip

Utility library for i18n relied on by various Repoze and Pyramid packages

931

ML

treeinterpreter

pip

BSD

Package for interpreting scikit-learn’s decision tree and random forest predictions. Allows decomposing each prediction into bias and feature contribution components

932

triangle

wheel

GNU LGPL

Python Triangle is a Python wrapper around Jonathan Richard Shewchuk’s two-dimensional quality mesh generator and Delaunay triangulator library.

933

trollius

pip

Port of the Tulip project (asyncio module, PEP 3156)

934

TEACH

tutormagic

pip

brings PythonTutor into a notebook

935

tweepy

pip

Python wrapper for the Twitter API

936

Twisted

wheel

An asynchronous networking framework written in Python

937

typecheck-decorator

pip

decorators that verify argument types at run time

938

typed_ast

pip

typed_ast is a Python 3 package that provides a Python 2.7 and Python 3 parser similar to the standard ast library.

939

typepy

pip

MIT

A python library for variable type checker/validator/converter at run time.

940

ML

typing_extensions

pip

Typing Extensions - Backported and Experimental Type Hints for Python

941

tzlocal

pip

MIT

tzinfo object for the local timezone

942

ufal.udpipe

pip

for orange3-text

943

ujson

wheel

BSD License

Ultra fast JSON encoder and decoder for Python

944

uncertainties

pip

uncertainties allows calculations such as (2 +/- 0.1)*2 = 4 +/- 0.2 to be performed transparently.

945

unidecode

pip

GPLv2+

ASCII transliterations of Unicode text

946

STYLE

unify

pip

Expat License

Modifies strings to all use the same (single/double) quote where possible.

947

unittest2

pip

unittest2 is a backport of the new features added to the unittest testing framework in Python 2.7 and onwards.

948

untangle

pip

Converts XML to Python objects

949

STYLE

untokenize

pip

Expat License

Transforms tokens into original source code (while preserving whitespace).

950

update_checker

pip

Simplified BSD License

A python module that will check for package updates.

951

uritemplate

pip

BSD 3-Clause License or Apache License, Version 2.0

URI templates

952

urllib3

pip

MIT

urllib2 extension

953

urwid

pip

for pyfm

954

user-agent

pip

MIT

A library to identify devices (phones, tablets) and their capabilities by parsing (browser/HTTP) user agent strings

955

uvicorn

pip

BSD

Uvicorn is a lightning-fast ASGI server implementation, using uvloop and httptools.

956

uvloop

pip

MIT

Fast implementation of asyncio event loop on top of libuv

957

validate_email

pip

LGPL

Validate_email verifies whether an email address is valid and really exists.

958

vectormath

pip

Vector math utilities for python (used by discretize).

959

VIZ

vega

pip

BSD 3-clause

Python/Jupyter notebook module for Vega, and Vega-Lite, Polestar, and Voyager. Notebooks with embedded visualizations can be viewed on github and nbviewer.

960

Versio

pip

localshop dependency, manages versions

961

version-information

pip

BSD

Version information

962

VIZ

vincent

pip

plotting

963

vine

pip

Promises, promises, promises

964

virtualenv

pip

MIT

create virtual environments

965

viscm

pip

MIT

tool for analyzing colormaps and creating new colormaps.

966

vispy

pip

(new) BSD

Vispy is a high-performance interactive 2D/3D data visualization library.

967

PROFILING

vprof

pip

BSD

vprof is a Python package providing rich and interactive visualizations for various Python program characteristics such as running time and memory usage.

968

w3lib

pip

BSD

Library of web-related functions

969

NETWORK

waitress

pip

ZPL 2.1

Waitress WSGI server

970

JUPYTER

wcwidth

pip

MIT

Measures number of Terminal column cells of wide-character codes

971

webcolors

pip

BSD 3-Clause

A library for working with color names and color value formats defined by the HTML and CSS specifications for use in documents on the Web.

972

webencodings

pip

BSD

Character encoding aliases for legacy web content

973

weblib

pip

Set of tools for web scraping projects (for grab)

974

webruntime

pip

for flexx

975

websockets

pip

BSD

websockets is a library for building WebSocket servers and clients in Python with a focus on correctness and simplicity.

976

werkzeug

pip

BSD-3-Clause

The Swiss Army knife of Python web development

977

wget

pip

none

pure python download utility

978

wheel

pip

MIT

handle wheels

979

whoosh

pip

search engine in Python

980

JUPYTER

widgetsnbextension

pip

BSD

Interactive HTML widgets for Jupyter notebooks.

981

wikipedia

pip

MIT

Wikipedia API for Python

982

SPHINX

wild_sphinx_theme

pip

BSD

sphinx theme

983

win_unicode_console

pip

MIT

984

winrandom

wheel

This module gives direct access to the Windows Cryptographic API CryptGenRandom() function, which is a cryptographically strong pseudo-random number generator (PRNG) on Windows.

985

VIZ

wordcloud

wheel

MIT

A little word cloud generator in Python.

986

wptools

pip

MIT

Wikipedia tools (for Humans)

987

TEACH

wrapclib

wheel

Wraps existing C libraries.

988

wrapt

wheel

BSD

A Python module for decorators, wrappers and monkey patching.

989

wsproto

pip

MIT

Pure Python, pure state-machine WebSocket implementation.

990

wurlitzer

pip

for spyder

991

x86cpu

wheel

BSD license

Uses cpuid instruction to get information about CPU. Queries OS as well as cpuid to see if the OS / CPU supports AVX instructions.

992

DATA/ML

xarray

pip

Apache

pandas like library for cubes (N-dimensional data)

993

DATA/ML

xgboost

wheel

Apache-2.0

Parallelized gradient boosting (only available on Python 3 and x64)

994

xlrd

pip

BSD

reads Excel files

995

XlsxWriter

pip

BSD

writes Excel files

996

xlwt

pip

BSD

writes Excel files

997

xmltodict

pip

MIT

Makes working with XML feel like you are working with JSON

998

xxhash

wheel

BSD

xxHash is an Extremely fast Hash algorithm, running at RAM speed limits.

999

yahoo-historical

pip

MIT License

Python module to get stock data from Yahoo! Finance

1000

yapf

pip

Code formatter

1001

SPHINX

yummy-sphinx-theme

pip

sphinx theme

1002

zarr

pip

MIT

A minimal implementation of chunked, compressed, N-dimensional arrays for Python.

1003

zc.lockfile

pip

Basic inter-process locks

1004

ZConfig

pip

Structured Configuration Library

1005

zdaemon

pip

Daemon process control library and tools for Unix-based systems

1006

ZEO

pip

ZEO provides a client-server storage implementation for ZODB.

1007

SPHINX

zerovm-sphinx-theme

pip

Apache License 2.0

sphinx theme

1008

zict

pip

BSD

The dictionary / mutable mapping interface is powerful and multi-faceted.

1009

zipline

wheel

A backtester for financial algorithms.

1010

zipp

pip

A pathlib-compatible Zipfile object wrapper.

1011

ZODB

pip

The Zope Object Database provides an object-oriented database for Python that provides a high-degree of transparency.

1012

ZODB3

pip

ZODB3 - Meta release for ZODB, persistent, BTrees and ZEO

1013

zodbpickle

wheel

This package presents a uniform pickling interface for ZODB.

1014

zope.exceptions

pip

Zope Public License

Zope exception

1015

zope.interface

wheel

interfaces for python

1016

zs

wheel

BSD

ZS is a compressed, read-only file format for efficiently distributing, querying, and archiving arbitrarily large record-oriented datasets.
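
With a list this long, it can be handy to check which of the modules are already present in the current environment before installing anything. The sketch below is only an illustration and is not part of the original instructions: it assumes Python 3.8 or later (for the standard `importlib.metadata` module), the helper name `installed_versions` is hypothetical, and the sample names are simply a handful picked from the table.

```python
from importlib import metadata


def installed_versions(names):
    """Map each distribution name to its installed version, or None if absent.

    Hypothetical helper for illustration; it only reads metadata of what is
    already installed and never installs or imports anything.
    """
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


# A few names taken from the table; extend the list as needed.
sample = ["scikit-learn", "scipy", "seaborn", "statsmodels", "xgboost"]

for name, version in installed_versions(sample).items():
    print(f"{name}: {version or 'not installed'}")
```

The names passed in are distribution names as they appear in the table (for example `scikit-learn`), which may differ from the name used in an `import` statement (for example `sklearn`).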

Footnotes