Getting started

Lorsqu’on fait des statistiques, le language Python est loin d’être aussi complet que R qui a été pensé dans ce but. Il faut lui ajouter plusieurs dizaines de modules. C’est à ce moment qu’on découvrir les subtilités dans les différents systèmes d’exploitations, les compilateurs, les dépendances, la ligne de commande. Ce paragraphe décrit un moyen d’installation Python sur les trois systèmes principaux Windows, OS X, Linux avec les modules nécessaires présentés dans ce cours. Le premier paragraphe explique comment installer rapidement Python, il faudra lire les suivants si vos besoins vont au delà.

Notes

La grande majorité des exemples et des notebooks proposés sur ce site sont testés une fois par semaine. La distribution utilisée varie mais s’appuie sur les dernières versions des modules. Une part moins importante est également testée à chaque modification sur github/sdpython sur Windows, Linux (Ubuntu) et Linux (Debian).

2018-2019

Pour l’année 2017, les tests sont dorénavant effectués sur Linux Debian 9 et la distribution standard Python 3.7.0. L’installation de cette machine est assez fastidieuse. Les traces de l’installation sont conservées sur cet article de blog : Install Python 3.7 and many packages on Linux Debian 9. Il reste quelques problèmes comme TensorFlow qui n’est pas encore compatible avec Python 3.7 (août 2018) mais cette pull request devrait résoudre le problème.

2017-2018

Toutes les modules étaient testées sur Windows 10 et une distribtion standard Python 3.6.4 puis Python 3.6.5 64 bits.

En résumé : Anaconda

En résumé, le conseil le plus fréquent qu’on donne à ceux qui souhaitent installer Python est d’utiliser la distribution Anaconda. C’est l’équivalent de R. Sans autre étape supplémentaire, elle permet de faire du calcul matriciel numpy, de tracer des graphiques avec matplotlib, de manipuler les données pandas et de faire du machine du machine learning scikit-learn. La plupart des exercices proposés sur ce site n’utilisent pas plus que ce qui est proposé dans cette distribution standard. Pour les autres, les instructions mentionnées ci-dessous fonctionnent sous Windows, Linux et Mac.

  • Installation Anaconda (python 64 bit)

  • Mise à jour de la distribution avec conda update --all (en ligne de commande).

Pour installer le module implémenté pour ce cours :

  • pip install ensae_teaching_cs

Windows

Certains modules nécessitent une compilation C++. Anaconda fournit la plupart d’entre eux. Pour les autres, il faut allez à Unofficial Windows Binaries for Python Extension Packages. Certains modules n’existent pas sous forme précompilée à moins de le faire soi-même. Et c’est ce que j’ai fait pour certains modules comme dlib.

pip install pymyinstall

Linux en ligne de commande / MacOS

Voir Install Miniconda through SSH connection. Cela fonctionne avec les distributions Ubuntu 14.04 et Ubuntu 16.04. Vous pouvez par exemple suivre les instructions utilisées pour tester et compiler ce site disponibles dans le fichier config.yml. Les sorties disponibles sur le site CircleCI peuvent vous aider à comprendre les erreurs si vous en avez.

Les mêmes instructions devraient fonctionner sous MacOS depuis une fenêtre terminal car ce système s’appuie sur une distribution Linux.

IDE

Un IDE est un environnement de développement. Et comme on ne fait pas tout depuis un notebook, il faut en choisir un un.

Editeur de texte

Si vous êtes dans le train et que vous n’avez pas beaucoup de batterie, il faut revenir à l’essentiel : un éditeur de texte.

Installer un module

pip, python et ligne de commande

Le language python s’est doté d’un système de distribution de modules (ou packages) qui est aisément accessible depuis la ligne de commande. Sous Windows, on peut lancer la ligne de commande par la commande cmd. Sous Linux ou OS/X, c’est une fenêtre terminal (Linux, OS/X). Il suffit alors de se déplacer dans le répertoire d’installation de Python

cd c:\Python370_x64\Scripts

Ou encore :

cd c:\Anaconda3\Scripts

Puis d’écrire :

pip install <module>

Sous Linux ou OS/X (Apple), la ligne de commande s’appelle le terminal. Comme Python est déjà installé en version 2.7, je recommande l’installation de la distribution Anaconda en version 3.6 qui facilite la coexistence de plusieurs versions de Python. On procède de la même manière

cd /home/<alias>/anaconda3/bin

Puis

pip install <module>

Pour vous assurer que cela correspond bien à la version de Python souhaitée, il suffit de demander la version installée

pip --version

Sous Windows, pour l’ajout d’un module ponctuel, si l’instruction pip install <module> ne fonctionne pas, c’est vraisemblablement parce que ce module contient une partie en C++. Dans ce cas, il faut aller voir sur ce site Unofficial Windows Binaries for Python Extension Packages s’il est disponible. S’il ne l’est pas, l’installation du module est réservée aux experts.

conda ou pip

Anaconda maintient des versions de librairies Python. Pour tous les modules de cette liste, Anaconda Package List, il faut utiliser conda install <module>. Pour les autres, pip install <module>. Cela ne fonctionne qu’avec la distribution Anaconda.

L’instruction pip install ne fonctionne pas sous Windows lorsque le module est implémenté en Python et C++. C’est pourquoi il est préférable d’installer une version précompilée.

Dépendances

Par défaut, l’installation d’un module implique celle de ses dépendances ce qu’il est possible d’éviter :

pip install <module> --no-deps

Installer un module avec pymy_install

Le module pymyinstall recherche la meilleure façon d’installer un module quelque soit votre installation. Pour l’installer ou le mettre à jour :

pip install pymyinstall --upgrade

L’installation du module crée deux scripts, pymy_install3 pour installer un module, pymy_update3 pour mettre à jour. Le module permet aussi d’installer un ensemble de modules. La ligne suivante installe un module développé pour ce cours et ses dépendances :

pymy_install3 --set=pyensae

Désinstallation des modules implémentés pour ce cours

Il est possible de désinstaller simplement les modules installés pour ces enseignements :

    pip uninstall -y _benchmarks
    pip uninstall -y actuariat_python
    pip uninstall -y botadi
    pip uninstall -y code_beatrix
    pip uninstall -y cpyquickhelper
    pip uninstall -y csharpy
    pip uninstall -y csharpyml
    pip uninstall -y ensae_projects
    pip uninstall -y ensae_teaching_cs
    pip uninstall -y ensae_teaching_dl
    pip uninstall -y jupytalk
    pip uninstall -y jyquickhelper
    pip uninstall -y lecture_citation
    pip uninstall -y lightmlboard
    pip uninstall -y lightmlrestapi
    pip uninstall -y machinelearningext
    pip uninstall -y manydataapi
    pip uninstall -y mathenjeu
    pip uninstall -y mlinsights
    pip uninstall -y mlprodict
    pip uninstall -y mlstatpy
    pip uninstall -y myblog
    pip uninstall -y pandas_streaming
    pip uninstall -y papierstat
    pip uninstall -y pyenbc
    pip uninstall -y pyensae
    pip uninstall -y pymlbenchmark
    pip uninstall -y pymmails
    pip uninstall -y pymyinstall
    pip uninstall -y pyquickhelper
    pip uninstall -y pyrsslocal
    pip uninstall -y pysqllike
    pip uninstall -y python3_module_template
    pip uninstall -y sparkouille
    pip uninstall -y teachpyx
    pip uninstall -y tkinterquickhelper
    pip uninstall -y wrapclib

Configuration pour ces cours

Les notebooks utilisent le module pyensae développé pour ces enseignements. Pour installer ses dépendances, il faut utiliser le module pymyinstall. Les dépendances s’installent comme suit :

pymy_install3 --set=ensae_teaching_cs
pip install ensae_teaching_cs

Certains notebooks requièrent des outils supplémentaires :

Distributions

  • Anaconda (Windows, Linux, Mac). Sous Linux ou OS/X, la distribution n’interfère pas avec la distribution existante souvent différente. C’est un point très appréciable. Les modules de la distribution ne sont pas tous à jour. Il faut penser à mettre à jour avec la commande conda install <module> depuis le répertoire Anaconda3/Scripts (conda install cvxopt par exemple). Il existe une version différente : miniconda. La liste des packages manquant sera probablement différente. Il suffit d’écrire sur la ligne de commande conda update --all pour mettre à jour tous les modules.

  • WinPython (Windows). Sous Windows, elle inclut parfois R ou Julia (ces version ne sont pas aussi à jour que la version principale). On passe alors facilement de python à R ou Julia depuis le même notebooks. Uniquement disponible sous Windows, cette installation a l’avantage de ne pas nécessiter les droits administrateur pour être installée. Elle ne modifie pas les registres et on peut la recopier telle quelle sur une clé USB pour la recopier sur un autre ordinateur. On peut également préparer sa propre version How To Make WinPython.

  • Distribution officielle de Python, il faut ensuite installer de nombreux modules (voir Liste exhaustive de modules Python) pour obtenir une distribution équivalente aux deux précédentes.

  • miniconda est une version light de Anaconda sans tous ces packages. Elle peut être installée depuis une ligne de commande, via une connexion SSH. Voir Install Anaconda through SSH connection.

La liste des packages de WinPython ou Anaconda sont d’excellents moyens de découvrir de nouveaux modules intéressants.

Modules incournables pour un data scientist

Les modules indispensables sont intégrés à la distribution Anaconda, WinPython.

Les indispensables

  • dask : dataframe distribué et capables de gérer des gros volumes de données (> 5Go)

  • Jupyter : gestion des notebooks (des pages blanches mélangeant code, équations, graphiques)

  • matplotlib : graphes scientifiques

  • numpy : calcul matriciel

  • pandas : gestion de DataFrame

  • Scipy : calcul scientifique

  • scikit-learn : machine learning, statistique descriptive

  • statsmodels : séries temporelles

Visualisation

Voir 10 plotting libraries at PyData 06/14/2016 in Paris.

Jeux

  • pygame + thorpy

  • kivy : pour faire des jeux ou des applications pour tablettes, téléphones

Pour les TD et projets à l’ENSAE

Pour faire du machine learning sans programmer

Outils, ressources pour développer

Développer un programme informatique prend du temps et il est important d’être à l’aise. Une grande difficulté lorsqu’on programme c’est de travailler à plusieurs sur le même projet. Il faut se sychroniser. Fort heureusement, le problème est connu depuis longtemps et il existe beaucoup d’outils open source dont on aurait tort de se passer ou des services gratuits sous certains conditions qui facilitent l’archivage.

En vrac

Suivi de sources distant

Git

git est un logiciel de suivi de source. Il a supplanté tous les autres et il est indispensable aujourd’hui de le connaître. On ne retient pas toujours les commandes mais un moteur de recherche fournit rapidement la réponse. Voir aussi Cheat Sheet.

Archivage distant

  • hubiC (25 Go gratuit - août 2015)

  • OneDrive (15 Go gratuit - août 2015)

Ce ne sont pas les seuls, vous trouverez d’autres options ici : cloud-gratuit. Toutefois, il est recommandé de faire attention avec les données personnelles sensibles. Il n’est pas toujours possible de choisir le lieu de stockage et chaque pays a une législation différente. Même si vos données sont protégées par un mot de passe et ne sont pas publiques, il arrive que certains mots de passe soient interceptés.

Comparaison de fichiers

Partager des notes, des idées

Editeur de texte

  • SciTE : le plus simple, pas d’explorateur de fichier, pas d’installeur, autocomplétion perturbante

  • TextWrangler (seulement sur iOS - Apple)

  • SublimeText : configuration nécessaire avant d’exécuter un script python

  • Notepad++ : configuration nécessaire avant d’exécuter un script python

IDE

Python et Domotique

Navigateur

Les navigateur sont importants pour l’utilisation des notebooks. Je recommande soit Firefox, soit Chrome. Ces deux navigateurs sont indispensables si vous insérez du javascript dans nos notebooks. Le débuggeur de Chrome est le plus pratique à utiliser quand il s’agit d’aller fouiller dans les feuilles de styles ou de voir l’exécution du javascript.

Documentation

La documentation et les tests unitaires les modules classés dans les catégories SPHINX, TEACH (voir table ci-dessous). Certaines séances pratiques utilisent des données depuis ce site. Elles sont facilement téléchargeables avec ces deux modules :

  • pyquickhelper : ce module compile ce cours

  • pyensae : outils variés pour les élèves de l’ENSAE

  • pymyinstall : installer facilement des modules sous Windows

Pour être compilée, la documentation requiert également :

Continuous build

Compression

  • 7zip : pour compresser, décompresser tous les formats

Ressources

Setup pour déveloper ce cours (Windows)

Pour chaque version de Python, il faut installer pymyinstall puis écrire pymy_install puis supprimer les modules qu’on souhaite compiler et tester (voir Désinstallation des modules implémentés pour ce cours).

Quelques modules particuliers : plus trop maintenus mais parfois utiles et parfois modifiés

pip install https://github.com/sdpython/pyPdf/archive/trunk.zip

En plus :

Pour Jenkins, quelques extensions :

Pour Jupyter :

pip install widgetsnbextension
jupyter nbextension enable --py --sys-prefix widgetsnbextension

Un serveur en local doit être démarré, la ligne de commande ressemble à ceci :

c:\Python370_x64\Scripts\pypi-server.exe -u -p 8067 --disable-fallback ..\..\local_pypi\local_pypi_server

Si le serveur Jenkins utilise des mots-clés via keyring, ce qui est le cas pour plusieurs modules utilisés pour ces enseignements, il est nécessaire de créer un serveur un serveur Jenkins authentifié. Sous Windows , il faut chercher services.msc et renseigner les identifiants. Pour certains projets (comme la compilation de pywin32), il faut installer Windows SDK.

Maintenir sa distribution Python à jour

Manipuler les données est différent de savoir programmer. Si le second est nécessaire au premier, il est impensable aujourd’hui de ne pas tenir compte ce que d’autres programmeurs ont mis à disposition de tous en libre accès. Tous les modules proposés dans la suite sont utilisées par beaucoup, et sont très adaptés à la manipulation des données. Ils bénéficient de ce fait d’un développement rapide et d’une robustesse qu’il faut environ un an à un bon programmeur pour obtenir avec un de ses outils sur le même éventail de fonctionnalités (en y consacrant 10 à 20% de son temps).

J’ai cherché à regrouper les outils qui permettent à un ingénieur, statisticiens, data scientist de manipuler aisément des données, qui peuvent aller de quelques kilo-octets à quelques giga octets. En tant que data scientist, je pioche très régulièrement des éléments des sept premiers chapitres. Les sept suivants ne sont utiles que de temps en temps, surtout si les données sont de taille supérieure à 250 Mo.

L’essentiel n’est pas de tout faire en Python, l’essentiel est d’être agile, de passer le moins de temps sur l’implémentation et le plus de temps possible sur les données.

Listes de modules

Quelques articles

Liens, blogs à suivre

Articles Livres, Vidéos

Liste exhaustive de modules Python

Les modules suivant font partie du setup proposé aux étudiants (voir plus bas).

  • usage : classification, la plus importante DATA/ML regroupe les modules les plus importantes pour faire du machine learning

  • name : nom du module

  • kind : façon d’installer le module sous Windows, si c’est wheel, cela signifie que le module inclut une partie C++ qu’il est préférable de récupérer déjà compilée via le site Unofficial Windows Binaries for Python Extension Packages.

  • version : la version à installer car d’autres peuvent provoquer des conflits

  • license : license du module, toutes ne permettent pas un usage commercial, voir choose a license, licences commentées

  • purpose : description plus détaillée

<<<

from ensae_teaching_cs.automation import rst_table_modules
print(rst_table_modules())

>>>

index

usage

name

kind

version

license

purpose

0

abcpmc

pip

GPLv3+

Monte Carlo and ABC methods

1

absl-py

pip

Apache 2.0

Collection of Python library code for building Python applications.

2

TEACH

actuariat_python

pip

MIT

teachings, insurance examples

3

ML

ad3

wheel

GPLv3

AD3 (approximate MAP decoder with Alternating Direction Dual Decomposition)

4

adal

pip

MIT

The ADAL for Python library makes it easy for python application to authenticate to Azure Active Directory (AAD) in order to access AAD protected web resources.

5

aenum

pip

Advanced Enumerations (compatible with Python’s stdlib Enum), NamedTuples, and NamedConstants

6

SPHINX

alabaster

pip

BSD

A configurable sidebar-enabled Sphinx theme

7

alembic

pip

A database migration tool for SQLAlchemy.

8

OPTIM

algopy

pip

ALGOPY: Taylor Arithmetic Computation and Algorithmic Differentiation

9

amqp

pip

LGPL

Low-level AMQP client for Python (fork of amqplib) (for celery)

10

ansi2html

pip

GPLv3+

Convert text with ANSI color codes to HTML

11

ansiconv

pip

MIT

A Python module for converting ANSI coded text and converts it to either plain text or HTML.

12

antlr4-python3-runtime

pip

BSD

antlr4 runtime, grammar parser

13

anyjson

pip

BSD

Wraps the best available JSON implementation available in a common interface (for celery)

14

AnyQt

pip

GPLv3

PyQt4/PyQt5 compatibility layer.

15

apache-libcloud

pip

A standard Python library that abstracts away differences among multiple cloud provider APIs.

16

appdirs

pip

MIT

A small Python module for determining appropriate platform-specific dirs

17

apscheduler

pip

MIT

to schedule the execution of jobs, tasks

18

arrow

pip

Better dates and times for Python

19

asciitree

pip

MIT

Draws ASCII trees.

20

asn1crypto

pip

MIT

Fast ASN.1 parser and serializer with definitions for private keys, public keys, certificates, CRL, OCSP, CMS, PKCS#3, PKCS#7, PKCS#8, PKCS#12, PKCS#5, X.509 and TSP

21

asteval

pip

Safe, minimalistic evaluator of python expression using ast module

22

astor

pip

BSD-3-Clause

astor is designed to allow easy manipulation of Python source via the AST.

23

astroid

pip

LGPL

A abstract syntax tree for Python with inference support.

24

astropy

wheel

BSD

Community-developed python astronomy tools

25

async_generator

pip

for jupyterhup

26

atomicwrites

pip

MIT

Atomic file writes.

27

attrs

pip

MIT

Attributes without boilerplate.

28

autograd

pip

MIT

Efficiently computes derivatives of numpy code.

29

Automat

pip

MIT

Self-service finite-state machines for the programmer on the go.

30

STYLE

autopep8

pip

Expat License

apply pep8 on a script

31

autopy3

wheel

MIT

A simple, cross-platform GUI automation toolkit for Python 3 (issue on Linux and Anaconda)

32

AZURE

azure

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

33

AZURE

azure-batch

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

34

AZURE

azure-common

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

35

AZURE

azure-datalake-store

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

36

AZURE

azure-graphrbac

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

37

AZURE

azure-keyvault

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

38

AZURE

azure-mgmt

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

39

AZURE

azure-mgmt-authorization

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

40

AZURE

azure-mgmt-batch

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

41

AZURE

azure-mgmt-cdn

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

42

AZURE

azure-mgmt-cognitiveservices

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

43

AZURE

azure-mgmt-commerce

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

44

AZURE

azure-mgmt-common

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

45

AZURE

azure-mgmt-compute

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

46

AZURE

azure-mgmt-containerregistry

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

47

AZURE

azure-mgmt-datalake-analytics

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

48

AZURE

azure-mgmt-datalake-nspkg

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

49

AZURE

azure-mgmt-datalake-store

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

50

AZURE

azure-mgmt-devtestlabs

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

51

AZURE

azure-mgmt-dns

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

52

AZURE

azure-mgmt-documentdb

pip

MIT

Python wrapper for Azure API (HDInsight, Blog Storage)

53

AZURE

azure-mgmt-iothub

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

54

AZURE

azure-mgmt-keyvault

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

55

AZURE

azure-mgmt-logic

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

56

AZURE

azure-mgmt-monitor

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

57

AZURE

azure-mgmt-network

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

58

AZURE

azure-mgmt-notificationhubs

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

59

AZURE

azure-mgmt-nspkg

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

60

AZURE

azure-mgmt-powerbiembedded

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

61

AZURE

azure-mgmt-rdbms

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

62

AZURE

azure-mgmt-redis

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

63

AZURE

azure-mgmt-resource

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

64

AZURE

azure-mgmt-scheduler

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

65

AZURE

azure-mgmt-sql

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

66

AZURE

azure-mgmt-storage

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

67

AZURE

azure-mgmt-trafficmanager

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

68

AZURE

azure-mgmt-web

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

69

AZURE

azure-nspkg

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

70

AZURE

azure-servicebus

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

71

AZURE

azure-servicefabric

pip

MIT License

Python wrapper for Azure API (HDInsight, Blog Storage)

72

AZURE

azure-servicemanagement-legacy

pip

Apache License 2.0

Python wrapper for Azure API (HDInsight, Blog Storage)

73

AZURE

azure-storage

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

74

AZURE

azure-storage-nspkg

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

75

AZURE

azure_applicationinsights

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

76

AZURE

azure_cosmosdb_nspkg

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

77

AZURE

azure_cosmosdb_table

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

78

AZURE

azure_eventgrid

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

79

AZURE

azure_loganalytics

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

80

AZURE

azure_mgmt_advisor

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

81

AZURE

azure_mgmt_applicationinsights

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

82

AZURE

azure_mgmt_batchai

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

83

AZURE

azure_mgmt_billing

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

84

AZURE

azure_mgmt_consumption

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

85

AZURE

azure_mgmt_containerinstance

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

86

AZURE

azure_mgmt_containerservice

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

87

AZURE

azure_mgmt_cosmosdb

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

88

AZURE

azure_mgmt_datafactory

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

89

AZURE

azure_mgmt_datamigration

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

90

AZURE

azure_mgmt_devspaces

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

91

AZURE

azure_mgmt_eventgrid

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

92

AZURE

azure_mgmt_eventhub

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

93

AZURE

azure_mgmt_hanaonazure

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

94

AZURE

azure_mgmt_iotcentral

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

95

AZURE

azure_mgmt_iothubprovisioningservices

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

96

AZURE

azure_mgmt_loganalytics

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

97

AZURE

azure_mgmt_machinelearningcompute

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

98

AZURE

azure_mgmt_managementgroups

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

99

AZURE

azure_mgmt_managementpartner

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

100

AZURE

azure_mgmt_maps

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

101

AZURE

azure_mgmt_marketplaceordering

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

102

AZURE

azure_mgmt_media

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

103

AZURE

azure_mgmt_msi

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

104

AZURE

azure_mgmt_policyinsights

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

105

AZURE

azure_mgmt_recoveryservices

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

106

AZURE

azure_mgmt_recoveryservicesbackup

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

107

AZURE

azure_mgmt_relay

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

108

AZURE

azure_mgmt_reservations

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

109

AZURE

azure_mgmt_search

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

110

AZURE

azure_mgmt_servermanager

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

111

AZURE

azure_mgmt_servicebus

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

112

AZURE

azure_mgmt_servicefabric

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

113

AZURE

azure_mgmt_signalr

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

114

AZURE

azure_mgmt_subscription

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

115

AZURE

azure_storage_blob

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

116

AZURE

azure_storage_common

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

117

AZURE

azure_storage_file

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

118

AZURE

azure_storage_queue

pip

Python wrapper for Azure API (HDInsight, Blog Storage)

119

azureml

pip

MIT

Microsoft Azure Machine Learning Python client library

120

SPHINX

babel

pip

BSD

Internationalization utilities, version 2.0 has bugs

121

JUPYTER

backcall

pip

BSD

Specifications for callback functions passed in to an API (IPython, Jupyter)

122

backports.functools_lru_cache

pip

MIT

backports.functools_lru_cache

123

backports.lzma

wheel

Backport of Python 3.3’s “lzma” module for XZ/LZMA compressed files.

124

JUPYTER

backports.shutil-get-terminal-size

pip

needed for Jupyter

125

JUPYTER

backports_abc

pip

Python Software Foundation License

A backport of recent additions to the “collections.abc” module

126

DATA/ML

bayespy

pip

bayesian modelling and computation

127

DATA/ML

bcolz

wheel

BSD

compressed dataframe, in memory or on disk

128

bcrypt

pip

Apache License, Version 2.0

Modern password hashing for your software and your servers

129

beautifulsoup4

pip

MIT

Beautiful Soup sits atop an HTML or XML parser, providing Pythonic idioms for iterating, searching, and modifying the parse tree.

130

bigfloat

wheel

LGPLv3

big float (issue with Linux and Anaconda)

131

billiard

pip

BSD

Python multiprocessing fork with improvements and bugfixes (for celery)

132

BIO

biopython

wheel

~MIT

The Biopython Project is an international association of developers of freely available Python tools for computational molecular biology.

133

VIZ

bkcharts

pip

High level chart types built on top of Bokeh

134

WEB

bleach

pip

Apache Software License

An easy whitelist-based HTML-sanitizing tool.

135

blist

wheel

BSD

a list-like type with better asymptotic performance and similar performance on small lists

136

blockdiag

pip

Apache License 2.0

blockdiag generates block-diagram image from text

137

blosc

wheel

https://opensource.org/licenses/BSD-3-Clause

Blosc (http://blosc.org) is a high performance compressor optimized for binary data.

138

blz

wheel

BSD

blz: a compressed data container

139

VIZ

bokeh

pip

New BSD

interactive graphs, zoomable, javascript

140

boto

pip

MIT

Amazon Web Services Library

141

AWS

boto3

pip

Apache License 2.0

A Python interface to Amazon Web Services

142

AWS

botocore

pip

Apache License 2.0

A low-level interface to a growing number of Amazon Web Services. The botocore package is the foundation for the AWS CLI as well as boto3.

143

Bottleneck

wheel

Simplified BSD

Fast NumPy array functions written in Cython, needed by la

144

VIZ

bqplot

pip

Apache

interactive graphs, zoomable, d3.js for notebooks

145

VIZ

branca

pip

Copyright (C) 2013, Martin Journois

146

brewer2mpl

pip

MIT

Connect colorbrewer2.org color maps to Python and matplotlib

147

Brotli

wheel

Brotli is a generic-purpose lossless compression algorithm that compresses data using a combination of a modern variant of the LZ77 algorithm, Huffman coding and 2nd order context modeling, with a compression ratio comparable to the best currently available general-purpose compression methods. It is similar in speed with deflate but offers more dense compression.

148

brythonmagic

pip

MIT

Magics to use brython in Jupyter notebook.

149

ALGO

BTrees

wheel

This package contains a set of persistent object containers built around a modified BTree data structure.

150

bz2file

pip

Apache License, Version 2.0

process bz2 files

151

CacheControl

pip

Apache Software License

httplib2 caching for requests

152

cached_property

pip

BSD

A decorator for caching properties in classes (for luigi)

153

cachetools

pip

MIT

Extensible memoizing collections and decorators

154

cairocffi

wheel2

BSD

cairocffi is a CFFI-based drop-in replacement for Pycairo, a set of Python bindings and object-oriented API for cairo

155

cairosvg

pip

LGPLv3+

Convert your SVG files to PDF and PNG.

156

VIZ

Cartopy

wheel

LGPLv3

Cartopy is a Python package designed to make drawing maps for data analysis and visualisation as easy as possible (issue on Linux and Anaconda)

157

DATA/ML

catboost

pip

Apache License, Version 2.0

CatBoost is a machine learning method based on gradient boosting over decision trees.

158

ML

category_encoders

pip

BSD

A set of scikit-learn-style transformers for encoding categorical variables into numeric by means of different techniques.

159

celery

pip

BSD

Celery is an asynchronous task queue/job queue based on distributed message passing.

160

certifi

pip

MPL-2.0

Certifi is a carefully curated collection of Root Certificates for validating the trustworthiness of SSL certificates while verifying the identity of TLS hosts.

161

C++

cffi

wheel

MIT

Foreign Function Interface for Python calling C code.

162

cftime

wheel

Time-handling functionality from netcdf4-python.

163

DATA/DML

chainer

pip

MIT License

A flexible framework of neural networks (GPU)

164

DATA/DML

chainercv

pip

MIT

ChainerCV is a deep learning based computer vision library built on top of Chainer.

165

chalmers

pip

MIT

Chalmers is an application that allows its users to monitor and control a number of processes on any operating system (Posix and Win32 included)

166

WEB

chardet

pip

LGPL

Universal encoding detector.

167

cheroot

pip

Highly-optimized, pure-python HTTP server

168

CherryPy

pip

BSD

create web application, needed by Spyre

169

chest

pip

BSD

Simple on-disk dictionary

170

citeproc-py

pip

2-clause BSD License

citeproc-py is a CSL processor for Python. It aims to implement the CSL 1.0.1 specification. citeproc-py can output styled citations and bibliographies in a number of different output formats. Currently supported are plain text, reStructuredText and HTML. Other formats can be added easily.

171

click

pip

BSD

A simple wrapper around optparse for powerful command line utilities.

172

click-plugins

pip

New BSD

An extension module for click to enable registering CLI commands via setuptools entry-points.

173

cligj

pip

BSD

Click params for commmand line interfaces to GeoJSON

174

climate-toolbox

pip

Command-line utilities (turn function into command line)

175

SPHINX

cloud_sptheme

pip

BSD

sphinx theme

176

cloudpickle

pip

BSD 3-Clause License

Extended pickling support for Python objects

177

clyent

pip

Command line client Library for windows and posix

178

DATA/ML

cntk

wheel

Deep Learning from Microsoft see https://docs.microsoft.com/en-us/cognitive-toolkit/setup-windows-python

179

cobble

pip

BSD

Cobble is a Python library that allows easy creation of data objects, including implementations of common methods such as __eq__ and __repr__.

180

cobra

wheel

Constraint-based reconstruction and analysis in python.

181

TEACH

code_beatrix

pip

MIT

teaching programming to kids, lesenfantscodaient.fr

182

codecov

pip

http://www.apache.org/licenses/LICENSE-2.0

submit coverage report to codecov

183

colander

pip

A simple schema-based serialization and deserialization library

184

SPHINX

colorama

pip

BSD

Cross-platform colored terminal text.

185

colorcet

pip

CC-BY License

colorcet is a collection of perceptually uniform colormaps for use with Python plotting programs like bokeh, matplotlib, holoviews, and datashader.

186

colorlog

pip

MIT License

Log formatting with colors!

187

VIZ

colorlover

pip

MIT

Color scales for IPython notebook

188

VIZ

colormap

pip

LGPL

Utilities to ease manipulation of matplotlib colormaps and color codecs (e.g., hex2rgb)

189

colormath

pip

Color math and conversion library.

190

colorspacious

pip

MIT

A powerful, accurate, and easy-to-use Python library for doing colorspace conversions (for viscm)

191

CommonMark

pip

BSD-3-Clause

Python parser for the CommonMark Markdown spec

192

constantly

pip

Symbolic constants in Python

193

contextlib2

pip

Python

Backports and enhancements for the contextlib module

194

convertdate

pip

for fbprophet

195

coverage

pip

Apache 2.0

measure the coverage of unit tests

196

TEACH

cpyquickhelper

wheel

MIT

Experiments with C++ and Python

197

cryptography

pip

BSD or Apache License, Version 2.0

cryptography is a package which provides cryptographic recipes and primitives to Python developers.

198

TEACH

csharpy

wheel

Python + C# + C

199

cssselect

pip

BSD

cssselect parses CSS3 Selectors and translates them to XPath 1.0

200

cssselect2

pip

BSD

CSS selectors for Python ElementTree

201

cubes

pip

MIT

Cubes is a light-weight Python framework and set of tools for development of reporting and analytical applications, Online Analytical Processing (OLAP), multidimensional analysis and browsing of aggregated data. It is part of Data Brewery.

202

cuda4py

pip

BSD

Python cffi CUDA bindings and helper classes

203

VIZ

cufflinks

pip

MIT

Productivity Tools for Plotly + Pandas

204

DATA/DML

cupy

wheel

CuPy : NumPy-like API accelerated with CUDA

205

custom_inherit

pip

A Python package that provides tools for inheriting docstrings in customizable ways.

206

OPTIM

CVXcanon

wheel

?

A low-level library to perform the matrix building step in cvxpy, a convex optimization modeling software.

207

OPTIM

cvxopt

wheel

GNU GPL version 3

linear, quadratique optimization

208

OPTIM

cvxpy

pip

GNU

linear, quadratic optimization, depends on cvxopt

209

VIZ

cycler

pip

BSD

dependency for matplotlib

210

cymem

pip

MIT

Manage calls to calloc/free through Cython

211

cyordereddict

wheel

MIT

Cython implementation of Python’s collections.OrderedDict

212

Cython

wheel

Apache

pseudo C++ in python

213

DATA/ML

cytoolz

wheel

BSD

Cython implementation of Toolz: High performance functional utilities

214

DATA/ML

dask

pip

BSD

parallization of operations with dataframe

215

DataProperty

pip

Python library for extract property from data.

216

VIZ

datashader

pip

New BSD

Datashader is a graphics pipeline system for creating meaningful representations of large amounts of data.

217

datashape

pip

BSD

A data description language.

218

VIZ

dataspyre

pip

MIT

create simple web application to visualize data

219

ALGO

datrie

wheel

LGPL v2.1

Fast, efficiently stored Trie for Python.

220

db.py

pip

BSD

db.py is an easier way to interact with your databases. It makes it easier to explore tables, columns, views, etc. It puts the emphasis on user interaction, information display, and providing easy to use helper functions.

221

dbf

pip

access DBase format

222

dbfread

pip

MIT

access DBase format

223

deap

pip

LGPL

deep learning

224

decorator

pip

new BSD License

Better living through Python with decorators

225

defusedxml

pip

PSFL

XML bomb protection for Python stdlib modules

226

depinfo

pip

A utility Python package intended for other library packages. Provides a function that when called with your package name, will print platform and dependency information.

227

deprecation

pip

A library to handle automated deprecations

228

descartes

pip

BSD

Use Shapely or GeoJSON-like geometric objects as matplotlib paths and patches

229

dialite

pip

for flexx

230

dill

pip

3-clause BSD

serialize all of python (almost), Dill extends python’s pickle module for serializing and de-serializing python objects to the majority of the built-in python types.

231

discretize

wheel

A python package for finite volume discretization.

232

django

pip

Django

233

django-audiotracks

pip

MIT

read audio with django

234

django-celery

pip

BSD

localshop dependency, Old django celery integration project.

235

django-configurations

pip

BSD

localshop dependency, A helper for organizing Django settings.

236

WEB

django-contrib-comments

pip

BSD

Django used to include a comments framework; since Django 1.6 it’s been separated to a separate project. This is that project.

237

django-environ

pip

MIT

localshop dependency, Django-environ allows you to utilize 12factor inspired environment variables to configure your Django application.

238

django-guardian

pip

BSD

localshop dependency, Implementation of per object permissions for Django 1.2 or later.

239

django-model-utils

pip

BSD

localshop dependency, Django model mixins and utilities.

240

django-storages

pip

BSD

localshop dependency, django-storages is a collection of custom storage backends for Django.

241

django-userena

pip

BSD

localshop dependency, Accounts for Django made beautifully simple

242

django-uuidfield

pip

~BSD

localshop dependency, UUIDField in Django

243

ML

dlib

pip

boost

A toolkit for making real world machine learning and data analysis applications

244

WEB

dnspython

pip

~MIT

dnspython is a DNS toolkit for Python. It supports almost all record types. It can be used for queries, zone transfers, and dynamic updates. It supports TSIG authenticated messages and EDNS0.

245

STYLE

docformatter

pip

Expat License

Formats docstrings to follow PEP 257.

246

docopt

pip

Pythonic argument parser, that will make you smile

247

SPHINX

docrepr

pip

Modified BSD

docrepr renders Python docstrings in HTML

248

SPHINX

docutils

pip

public domain, Python, 2-Clause BSD, GPL 3 (see COPYING.txt)

interpret RST format

249

docx2txt

pip

MIT

A pure python-based utility to extract text and images from docx files.

250

dominate

pip

GNU Lesser General Public License (LGPL)

Dominate is a Python library for creating and manipulating HTML documents using an elegant DOM API.

251

VIZ

drawtree

pip

Apache License 2.0

Draw binary tree in plain text

252

duecredit

pip

2-clause BSD License

Publications (and donations) tracer

253

dukpy

pip

MIT

DukPy is a simple javascript interpreter for Python built on top of duktape engine without any external dependency. It comes with a bunch of common transpilers built-in for convenience.

254

dynd

wheel

BSD

DyND-Python, a component of the Blaze project, is the Python exposure of the DyND dynamic multi-dimensional array library.

255

easy-thumbnails

pip

BSD

Easy thumbnails for Django

256

easydev

pip

new BSD

Common utilities to ease the development of Python packages

257

ecdsa

pip

ECDSA cryptographic signature library (pure python)

258

OPTIM

ecos

wheel

ECOS is a numerical software for solving convex second-order cone programs (SOCPs)

259

DATA/ML

edward

pip

MIT

Edward is a Python library for probabilistic modeling, inference, and criticism

260

elasticsearch

pip

Apache License 2.0

Python client for Elasticsearch

261

emcee

pip

Kick ass affine-invariant ensemble MCMC sampling

262

empyrical

pip

empyrical is a Python library with performance and risk statistics commonly used in quantitative finance

263

TEACH

ensae_projects

pip

MIT

single use code

264

TEACH

ensae_teaching_cs

pip

MIT

teachings, introduction to programming, machine learning, map/reduce

265

JUPYTER

entrypoints

pip

MIT

Discover and load entry points from installed packages.

266

envoy

pip

~MIT

Simple API for running external processes.

267

ephem

pip

for fbprophet

268

et_xmlfile

pip

MIT

et_xmlfile is a low memory library for creating large XML files (for openpyxl).

269

VIZ

ete3

pip

GPLv3

tree visualisation, ete3 does not work with PyQt4 (2017-11).

270

expressions

pip

Arithmetic expression parser library. Embed customized expression evaluation into your application or library.

271

fabric

pip

BSD

Fabric is a Python library and command-line tool for streamlining the use of SSH for application deployment or systems administration tasks.

272

Fabric3

pip

BSD

Fabric is a simple, Pythonic tool for remote execution and deployment (py2.7/py3.4+ compatible fork)

273

ML

fairtest

wheel

FairTest enables developers or auditing entities to discover and test for unwarranted associations between an algorithm’s outputs and certain user subpopulations identified by protected features.

274

NETWORK

falcon

pip

Apache 2.0

Falcon is a very fast, very minimal Python web framework for building microservices, app backends, and higher-level frameworks.

275

NETWORK

falcon-auth

pip

A falcon middleware + authentication backends that adds authentication layer to you app/api service.

276

fast-histogram

wheel2

BSD

Mizani is a scales package for graphics. It is written in Python and is based on Hadley Wickham’s Scales.

277

fastcache

wheel

C implementation of Python 3 lru_cache for Python.

278

fasteners

pip

ASL 2.0

A python package that provides useful locks.

279

fastparquet

wheel

Apache License 2.0

fastparquet is a python implementation of the parquet format, aiming integrate into python-based big data work-flows.

280

fastrlock

wheel

Fast, re-entrant optimistic lock implemented in Cython (cupy)

281

DATA/ML

fasttext

wheel

BSD 3-Clause License

fastText is a library for efficient learning of word representations and sentence classification.

282

fbpca

pip

Functions for principal component analysis (PCA) and accuracy checks.

283

DATA/ML

fbprophet

pip

BSD

Prophet is a procedure for forecasting time series data. It is based on an additive model where non-linear trends are fit with yearly and weekly seasonality, plus holidays. It works best with daily periodicity data with at least one year of historical data. Prophet is robust to missing data, shifts in the trend, and large outliers.

284

ML

fcn

pip

MIT

Fully Convolutional Networks

285

feedparser

pip

~BSD - 2 clauses

parse RSS streams

286

WEB

filebrowser_safe

pip

~MIT

A snapshot of the filebrowser_3 branch of django-filebrowser, packaged as a dependency for the Mezzanine CMS for Django.

287

filelock

pip

Public Domain <http://unlicense.org>

A platform independent file lock.

288

GEO

Fiona

wheel

BSD

Fiona is OGR’s neat, nimble, no-nonsense API for Python programmers.

289

STYLE

flake8

pip

MIT

verify pep8 on a script

290

NETWORK

flask

pip

BSD-3-Clause

Flask is a microframework for Python based on Werkzeug, Jinja 2 and good intentions. And before you ask: It’s BSD licensed!

291

Flask-Cors

pip

A Flask extension for handling Cross Origin Resource Sharing (CORS), making cross-origin AJAX possible.

292

NETWORK

Flask-Login

pip

MIT

293

NETWORK

flask-sqlalchemy

pip

294

GUI

flexx

pip

Flexx is a pure Python toolkit for creating graphical user interfaces (GUI’s), that uses web technology for its rendering. Apps are written purely in Python; Flexx” transpiler generates the necessary JavaScript on the fly.

295

VIZ

folium

pip

MIT

This library is a spinoff from folium, that would host the non-map-specific features. It may become a HTML+JS generation library in the future.

296

ML

foolbox

pip

Foolbox is a Python toolbox to create adversarial examples that fool neural networks.

297

ML

forestci

pip

MIT

Python module for calculating variance and adding confidence intervals to scikit-learn random forest regression or classification objects. The core functions calculate an in-bag and error bars for random forest objects

298

SPHINX

foundation-sphinx-theme

pip

MIT

sphinx theme

299

NLP

fr_core_news_sm

wheel2

French ressources for spacy

300

ftfy

pip

MIT

Fixes some problems with Unicode text after the fact

301

funcparserlib

pip

MIT

Recursive descent parsing library based on functional combinators

302

funcsigs

pip

Python function signatures from PEP362

303

future

pip

MIT

Clean single-source support for Python 3 and 2

304

fuzzywuzzy

pip

Fuzzy string matching in python

305

gast

pip

BSD 3-Clause

A generic AST to represent Python2 and Python3’s Abstract Syntax Tree(AST).

306

gatspy

pip

BSD

General tools for Astronomical Time Series in Python

307

GDAL

wheel

MIT

GDAL is a translator library for raster and vector geospatial data formats that is released under an X/MIT style Open Source license by the Open Source Geospatial Foundation.

308

ML

gdbn

pip

This package contains python code for pre-trained deep neural networks

309

gdown

pip

MIT

Google Drive direct download of big files.

310

gensim

wheel

LGPLv2.1

genetic algorithm

311

geographiclib

pip

MIT

This implements algorithms for geodesics (Karney, 2013) for solving the direct and inverse problems for an ellipsoid of revolution.

312

geojson

pip

Functions for encoding and decoding GeoJSON formatted data

313

geopandas

pip

BSD

GeoPandas is an open source project to make working with geospatial data in python easier.

314

VIZ

geoplotlib

pip

MIT

geoplotlib is a python toolbox for visualizing geographical data and making maps

315

VIZ

geopy

pip

MIT

Python Geocoding Toolbox

316

VIZ

geoviews

pip

Composable, declarative data structures for building complex visualizations easily.

317

gevent

pip

MIT

gevent is a coroutine-based Python networking library

318

ggplot

pip

BSD

ggplot graphics style

319

JUPYTER

ghost.py

pip

ghost.py is a webkit web client written in python

320

git-pandas

pip

BSD

321

gitdb2

pip

BSD License

GitDB is a pure-Python git object database

322

GitPython

pip

BSD License

Python Git Library

323

gizeh

pip

see LICENSE.txt

Simple Vector Graphics for Python

324

glue-core

pip

BSD

Multidimensional data visualzation across files

325

glue-vispy-viewers

pip

As Is

Vispy-based viewers for Glue

326

DATA/ML

glueviz

pip

BSD

ploting, Multidimensional data visualzation across files

327

gmpy2

wheel

LGPLv3+

big real numbers (issue on Linux and Anaconda)

328

gnumpy

pip

Gnumpy: an easy way to use GPU boards in Python

329

google-api-python-client

pip

Apache 2.0

The Google API Client for Python is a client library for accessing the Plus, Moderator, and many other Google APIs.

330

google-auth

pip

Apache 2.0

This library simplifies using Google’s various server-to-server authentication mechanisms to access Google APIs.

331

google-auth-httplib2

pip

Apache 2.0

This library provides an httplib2 transport for google-auth.

332

googlemaps

pip

Apache

Python client library for Google Maps API Web Services

333

ML

gplearn

pip

gplearn implements Genetic Programming in Python, with a scikit-learn inspired and compatible API.

334

PROFILING

gprof2dot

pip

LGPL

This is a Python script to convert the output from many profilers into a dot graph.

335

grab

pip

MIT

Grab is a python web scraping framework. Grab provides tons of helpful methods to scrape web sites

336

grako

wheel

Grako (for grammar compiler) is a tool that takes grammars in a variation of EBNF as input, and outputs memoizing (Packrat) PEG parsers in Python.

337

VIZ

graphviz

pip

MIT

wrapper for graphviz (most famous tool to draw graphs

338

WEB

grappelli_safe

pip

~MIT

A snapshot of the grappelli_2 branch of django-grappelli, packaged as a dependency for the Mezzanine CMS for Django.

339

greenlet

wheel

MIT

Greenlet allows lightweight in-process concurrent programming.

340

grequests

pip

BSD

GRequests allows you to use Requests with Gevent to make asynchronous HTTP Requests easily.

341

CLI

grin

pip

BSD

A grep program configured the way I like it.

342

grpcio

wheel

Apache License 2.0

Package for gRPC Python.

343

guidata

pip

CeCILL v2

Automatic graphical user interfaces generation for easy dataset editing and display

344

guiqwt

wheel

CECILL

Efficient 2D plotting Python library based on PythonQwt (Spyder)

345

gunicorn

pip

MIT

localshop dependency, WSGI HTTP Server for UNIX

346

SPHINX

guzzle_sphinx_theme

pip

BSD~MIT

sphinx theme

347

gvar

wheel

GPLv3+

Utilities for manipulating correlated Gaussian random variables.

348

h2

pip

MIT License

HTTP/2 State-Machine based protocol implementation

349

ML

h2o

pip

H2O, Fast Scalable Machine Learning, for python

350

h5netcdf

pip

netCDF4 via h5py

351

DATA/ML

h5py

wheel

BSD

The h5py package is a Pythonic interface to the HDF5 binary data format. Trillion-Particle Simulation.

352

SPHINX

hachibee-sphinx-theme

pip

MIT

sphinx theme

353

ML

HDDM

wheel

BSD

HDDM is a python module that implements Hierarchical Bayesian estimation of Drift Diffusion Models.

354

heapdict

pip

BSD

a heap with decrease-key and increase-key operations

355

VIZ

heatmap

wheel

MIT

draw heatmap

356

ML

hmmlearn

wheel

MIT + Copyright

Hidden Markov Models in Python, with scikit-learn like API

357

holidays

pip

for fbprophet

358

HoloPy

wheel

Hologram processing and light scattering in python

359

VIZ

holoviews

pip

BSD

Composable, declarative data structures for building complex visualizations easily.

360

hpack

pip

MIT License

Pure-Python HPACK header compression

361

html2text

pip

GNU GPL 3

Turn HTML into equivalent Markdown-structured text.

362

html5lib

pip

MIT

pure-python library for parsing HTML

363

httpie

pip

BSD

HTTPie - a CLI, cURL-like tool for humans

364

httplib2

pip

MIT

A comprehensive HTTP client library.

365

httpretty

pip

MIT

HTTP client mock for Python

366

hyper

pip

MIT

HTTP/2 Client for Python

367

hyperframe

pip

MIT License

HTTP/2 framing layer for Python

368

hyperlink

pip

MIT

A featureful, correct URL for Python.

369

ML

hyperopt

github

Hyperopt is a Python library for serial and parallel optimization over awkwardsearch spaces, which may include real-valued, discrete, and conditional dimensions.

370

hyperspy

wheel

GNU General Public License v3 (GPLv3)

HyperSpy is an open source Python library which provides tools to facilitate the interactive data analysis of multi-dimensional datasets that can be described as multi-dimensional arrays of a given signal (e.g. a 2D array of spectra a.k.a spectrum image).

371

hypothesis

pip

Hypothesis is an advanced testing library for Python. It lets you write tests which are parametrized by a source of examples, and then generates simple and comprehensible examples that make your tests fail. This lets you find more bugs in your code with less work.

372

WEB

idna

pip

BSD-like

Internationalized Domain Names in Applications (IDNA)

373

ijson

pip

BSD

Iterative JSON parser with a standard Python iterator interface

374

VIDEO

imageio

pip

(new) BSD

Library for reading and writing a wide range of image, video, scientific, and volumetric data formats (for moviepy)

375

VIDEO

imageio-ffmpeg

pip

(new) BSD

ffmpeg

376

images2gif

pip

Unlicensed

Create a GIF from a list of images.

377

SPHINX

imagesize

pip

MIT

Getting image size from png/jpeg/jpeg2000/gif file

378

ML

imbalanced-learn

pip

MIT

imbalanced-learn is a python package offering a number of re-sampling techniques commonly used in datasets showing strong between-class imbalance. It is compatible with scikit-learn and is part of scikit-learn-contrib projects.

379

imbox

pip

MIT

Python library for reading IMAP mailboxes and converting email content to machine readable data

380

JUPYTER

importlib-metadata

pip

Apache Software License

library to access the metadata for a Python package. It is intended to be ported to Python 3.8.

381

incremental

pip

Incremental is a small library that versions your Python projects.

382

indexed_gzip

wheel

Fast random access of gzip files.

383

inflection

pip

MIT

A port of Ruby on Rails inflector to Python

384

intervaltree

pip

Editable interval tree data structure for Python 2 and 3

385

invoke

pip

BSD

Invoke is a Python task execution tool & library, drawing inspiration from various sources to arrive at a powerful & clean feature set.

386

ipaddress

pip

Python Software Foundation License

IPv4/IPv6 manipulation library

387

JUPYTER

ipydatawidgets

pip

BSD

A set of widgets to help facilitate reuse of large datasets across widgets

388

JUPYTER

ipykernel

pip

BSD

IPython, Jupyter, kernels

389

JUPYTER

ipympl

pip

Matplotlib Jupyter Extension

390

JUPYTER

ipyparallel

pip

BSD

Interactive Parallel Computing with IPython

391

JUPYTER

ipyscales

pip

A widget library for scales

392

JUPYTER

ipython

pip

BSD

IPython, Jupyter

393

JUPYTER

ipython_genutils

pip

BSD

IPython utils (nbformat)

394

JUPYTER

ipywidgets

pip

BSD

IPython, Jupyter, widgets

395

iso8601

pip

Simple module to parse ISO 8601 dates

396

isodate

pip

BSD

An ISO 8601 date/time/duration parser and formatter

397

isort

pip

MIT

A Python utility / library to sort Python imports.

398

SPHINX

itcase_sphinx_theme

pip

no license

sphinx theme

399

itsdangerous

pip

BSD

Various helpers to pass trusted data to untrusted environments and back.

400

jaraco.functools

pip

Additional functools in the spirit of stdlib’s functools.

401

javascripthon

pip

GPLv3+

a Python 3 to ES6 JavaScript translator

402

jdcal

pip

BSD

Julian dates from proleptic Gregorian and Julian calendars.

403

jedi

pip

MIT

An autocompletion tool for Python that can be used for text editors.

404

jeepney

pip

This is a low-level, pure Python DBus protocol client. It has an I/O-free core, and integration modules for different event loops.

405

jieba

pip

MIT

Chinese Words Segementation Utilities

406

jinja2

pip

BSD-3-Clause

templating

407

jmespath

pip

MIT

JSON Matching Expressions

408

joblib

pip

BSD

distribute jobs, parallelization

409

JPype1

wheel

Apache Software License

A Python to Java bridge.

410

Js2Py

pip

MIT

JavaScript to Python Translator & JavaScript interpreter written in 100% pure Python.

411

jsonpickle

pip

BSD

Python library for serializing any arbitrary object graph into JSON.

412

jsonschema

pip

An implementation of JSON Schema validation for Python

413

DATA/ML

julia

pip

MIT

Julia/Python bridge with IPython support

414

TEACH

jupytalk

pip

MIT

materials for presentations

415

JUPYTER

jupyter

pip

BSD

Jupyter

416

JUPYTER

jupyter-console

pip

BSD

Jupyter console, new in Jupyter 4.0

417

jupyter-echarts-pypkg

pip

MIT

The project packages jupyter-echarts and distributes it via pypi.

418

JUPYTER

jupyter-pip

pip

Allows Jupyter notebook extension writers to make their extension pip installable!

419

JUPYTER

jupyter_client

pip

BSD

Jupyter client

420

JUPYTER

jupyter_contrib_nbextensions

github

This repository contains a collection of extensions that add functionality to the Jupyter notebook.

421

JUPYTER

jupyter_core

pip

BSD

Jupyter Core

422

JUPYTER

jupyter_sphinx

pip

BSD

Jupyter Sphinx Extensions

423

JUPYTER

jupyterhub

pip

JupyterHub: A multi-user server for Jupyter notebooks

424

JUPYTER

jupyterlab

pip

BSD

Jupyter Lab

425

JUPYTER

jupyterlab-launcher

pip

Jupyter Lab Launcher

426

JUPYTER

jupyterlab-server

pip

BSD

Jupyter Lab Server

427

TEACH

jyquickhelper

pip

MIT

Helpers for Jupyter notebooks.

428

ML

kabuki

pip

As Is

kabuki is a python toolbox that allows easy creation of hierarchical bayesian models for the cognitive sciences.

429

DATA/ML

keras

pip

MIT

deep learning

430

DATA/ML

Keras-Applications

pip

MIT

Keras Applications is the applications module of the Keras deep learning library. It provides model definitions and pre-trained weights for a number of popular archictures, such as VGG16, ResNet50, Xception, MobileNet, and more.

431

DATA/ML

Keras-Preprocessing

pip

MIT

Keras Preprocessing is the data preprocessing and data augmentation module of the Keras deep learning library. It provides utilities for working with image data, text data, and sequence data.

432

DATA/ML

keras-vis

pip

keras-vis is a high-level toolkit for visualizing and debugging your trained keras neural net models.

433

keyring

pip

MIT, Python

Store and access your passwords safely.

434

GUI

Kivy

wheel

MIT

GUI, interface for games, mobile (use sudo apt-get install python3-kivy on Linux)

435

GUI

kivy-garden

pip

MIT

Garden tool for kivy flowers.

436

kiwisolver

wheel

BSD

Kiwi is an efficient C++ implementation of the Cassowary constraint solving algorithm.

437

ML

knnimpute

pip

k-Nearest Neighbor imputation

438

kombu

pip

BSD

Messaging library for Python (for celery)

439

lazy-object-proxy

pip

BSD 2-Clause License

A fast and thorough lazy object proxy

440

WEB

ldap3

pip

LGPLv3

ldap3 is a strictly RFC 4510 conforming LDAP V3 pure Python client library.

441

VIZ

leather

pip

Leather is the Python charting library for those who need charts now and don’t care if they’re perfect.

442

libLAS

wheel

BSD

libLAS is a C/C++ library for reading and writing the very common LAS LiDAR format.

443

liblinear

wheel

A Library for Large Linear Classification

444

libsvm

wheel

SVM library

445

libtiff

wheel

PyLibTiff is a package that provides: a wrapper to the libtiff library to Python using ctypes, a pure Python module for reading and writing TIFF and LSM files. The images are read as numpy.memmap objects so that it is possible to open images that otherwise would not fit to computers RAM. Both TIFF strips and tiles are supported for low-level data storage.

446

OPTIM

lifelines

pip

MIT

survival analysis

447

DATA/ML

lightgbm

pip

The MIT License (Microsoft)

Parallelized Stochastic Gradient Descent (only available on Python 3 and x64)

448

TEACH

lightmlboard

pip

lightmlboard implements a light machine learning leaderboard based on tornado.

449

TEACH

lightmlrestapi

pip

lightmlrestapi implements a light machine learning REST API based on falcon.

450

VIZ

lightning-python

pip

MIT

Python client for the lightning API

451

PROFILING

line-profiler

wheel

BSD

line_profiler is a module for doing line-by-line profiling of functions. kernprof is a convenient script for running either line_profiler or the Python standard library’s cProfile or profile modules, depending on what is available.

452

linecache2

pip

A backport of linecache to older supported Pythons.

453

llvmlite

wheel

BSD

lightweight wrapper around basic LLVM functionality, check issue https://github.com/cmderdev/cmder/issues/490 for missing api-ms-win-crt-runtime-l1-1-0.dll

454

OPTIM

lmfit

pip

BSD

Least-Squares Minimization with Bounds and Constraints

455

lml

pip

New BSD

lml seamlessly finds the lml based plugins from your current python environment but loads your plugins on demand. It is designed to support plugins that have external dependencies, especially bulky and/or memory hungry ones. lml provides the plugin management system only and the plugin interface is on your shoulder.

456

locket

pip

BSD

File-based locks for Python for Linux and Windows

457

lockfile

pip

MIT

Platform-independent file locking module

458

Logbook

wheel

A logging replacement for Python

459

logutils

pip

Logging utilities

460

lru_dict

pip

An Dict like LRU container.

461

OPTIM

lsqfit

wheel

GPLv3+

Utilities for nonlinear least-squares fits

462

WORKFLOW

luigi

pip

Apache 2.0

workflows, data workflows

463

lunardate

pip

for fbprophet

464

lxml

wheel

BSD

xml parsers (C++)

465

lz4

wheel

BSD

LZ4 Bindings for Python (for dpark)

466

m2r

pip

M2R converts a markdown file including reStructuredText (rst) markups to a valid rst format.

467

macropy3

pip

MIT

MacroPy is an implementation of Syntactic Macros in the Python Programming Language. MacroPy provides a mechanism for user-defined functions (macros) to perform transformations on the abstract syntax tree (AST) of a Python program at import time. This is an easy way to enhance the semantics of a Python program in ways which are otherwise impossible, for example providing an extremely concise way of declaring classes.

468

VIZ

mahotas

wheel

Mahotas: Computer Vision Library

469

Mako

pip

MIT

templating

470

mammoth

pip

BSD

Convert Word documents from docx to simple and clean HTML and Markdown

471

TEACH

manydataapi

pip

MIT

Helpers to access APIs

472

marisa_trie

wheel

MIT

Static memory-efficient & fast Trie-like structures for Python (based on marisa-trie C++ library)

473

markdown

pip

BSD License

markdown parser (for bokeh)

474

markdown2

pip

markdown parser

475

markupsafe

pip

BSD-3-Clause

parses mardown

476

VIZ

matplotlib

wheel

PSF

most used plotting library

477

mbstrdecoder

pip

MIT

multi-byte character string decoder

478

STYLE

mccabe

pip

Expat license

This module provides a plugin for flake8, the Python code checker.

479

SPHINX

mdn-sphinx-theme

pip

Mozilla Public License 2.0 (MPL 2.0)

A Sphinx theme for the Mozilla Developer Network

480

ML

MDP

pip

Python data processing framework.

481

PROFILING

memory-profiler

pip

BSD

A module for monitoring memory usage of a python program

482

JUPYTER

metakernel

pip

BSD

more magic commands for Jupyter

483

WEB

mezzanine

pip

BSD

Mezzanine is a powerful, consistent, and flexible content management platform.

484

JUPYTER/LINUX

micropython-fcntl

pip

dependency for ptyprocess, Functions to compute fnctl.ioctl’s opt argument

485

JUPYTER/LINUX

micropython-ffilib

pip

dependency for ptyprocess, MicroPython FFI helper module

486

JUPYTER/LINUX

micropython-libc

pip

dependency for ptyprocess, MicroPython FFI helper module

487

DATA/ML

milk

wheel

machine learning toolkit

488

minepy

wheel

GPLv3

interface to MineCraft

489

VIZ

missingno

pip

As Is

Messy datasets? Missing values? missingno provides a small toolset of flexible and easy-to-use missing data visualizations and utilities that allows you to get a quick visual summary of the completeness (or lack thereof) of your dataset.

490

mistune

pip

BSD

The fastest markdown parser in pure Python with renderer features, inspired by marked.

491

mizani

pip

BSD (3-clause)

Mizani is a scales package for graphics. It is written in Python and is based on Hadley Wickham’s Scales.

492

DATA/ML

mkl_service

wheel

This package exposes a few functions which are declared in mkl_service.h. The main purpose of the package is to allow the user to change the number of CPU’s MKL is using at runtime.

493

TEACH

mlinsights

wheel

MIT

mlinsights implements functions to get insights on machine learned models.

494

TEACH

mlprodict

wheel

MIT

mlprodict implements a couple of ways to productionize machine learning predictions.

495

mlpy

wheel

mlpy is a Python module for Machine Learning built on top of NumPy/SciPy, has wavelets

496

TEACH

mlstatpy

pip

MIT

materials for machine learning

497

ML

mlxtend

pip

A library consisting of useful tools and extensions for the day-to-day data science tasks.

498

mnist

pip

BSD

Python utilities to download and parse the MNIST dataset

499

mock

pip

OSI Approved :: BSD License

mock is a library for testing in Python. It allows you to replace parts of your system under test with mock objects and make assertions about how they have been used.

500

mockextras

pip

Extensions to the mock library

501

monotonic

pip

Apache

This module provides a monotonic() function which returns the value (in fractional seconds) of a clock which never goes backwards. It is compatible with Python 2 and Python 3.

502

monty

pip

MIT

Monty is the missing complement to Python.

503

more-itertools

pip

MIT

More routines for operating on iterables, beyond itertools

504

VIDEO

moviepy

pip

MIT License

Video editing with Python

505

mpi4py

wheel

MPI for Python

506

mpl-scatter-density

pip

BSD

Matplotlib helpers to make density scatter plots

507

VIZ

mpl_finance

github

This module consists of code extracted from the deprecated matplotlib.finance module along with a few examples of usage.

508

mpmath

pip

BSD

mpmath is a free (BSD licensed) Python library for real and complex floating-point arithmetic with arbitrary precision.

509

msgfy

pip

msgfy is a Python library for convert Exception instance to a human-readable error message.

510

msgpack

wheel

Apache 2.0

MessagePack (de)serializer.

511

msgpack-numpy

pip

BSD

Numpy data serialization using msgpack

512

msrest

pip

MIT License

AutoRest swagger generator Python client runtime.

513

msrestazure

pip

MIT License

AutoRest swagger generator Python client runtime. Azure-specific module.

514

mss

pip

An ultra fast cross-platform multiple screenshots module in pure python using ctypes

515

multi-key-dict

pip

License :: OSI Approved :: MIT License (http://opensource.org/licenses/MIT)

Multi key dictionary implementation

516

multimethods

pip

MIT

A multimethod implementation, loosely based on Guido’s initial ‘Five-minute Multimethods in Python.

517

multipledispatch

pip

BSD

A relatively sane approach to multiple dispatch in Python.

518

multiprocess

wheel

better multiprocessing and multithreading in python

519

munch

pip

MIT

A dot-accessible dictionary (a la JavaScript objects).

520

murmurhash

pip

MIT

Cython .pxd files for some of the MurmurHash 2 and 3 hash functions, with a slightly more Pythonic API. The only access to these functions is via Cython — I don’t see why they should be useful from pure Python.

521

mutagen

pip

GPL-2.0-or-later

read and write audio tags for many formats in Python 3

522

mutagenx

pip

read and write audio tags for many formats in Python 3

523

mypy

pip

Mypy is an experimental optional static type checker for Python that aims to combine the benefits of dynamic (or “duck”) typing and static typing.

524

mypy_extensions

pip

for mypy

525

SQL

mysqlclient

wheel

MySQL driver written in Python which does not depend on MySQL C client libraries and implements the DB API v2.0 specification (PEP-249).

526

natgrid

wheel

Python interface to NCAR natgrid library (for matplotlib)

527

natsort

pip

MIT

Sort lists naturally

528

JUPYTER

nbbrowserpdf

pip

LaTeX-free PDF generation from Jupyter Notebooks

529

JUPYTER

nbconvert

pip

BSD

IPython, notebooks conversion, new in Jupyter 4.0

530

nbdime

pip

BSD

Comparison of notebooks

531

JUPYTER

nbformat

pip

BSD

IPython, notebooks conversion, new in Jupyter 4.0

532

JUPYTER

nbpresent

pip

BSD-3-Clause

Next generation slides from Jupyter Notebooks

533

JUPYTER

nbsphinx

pip

MIT

nbsphinx is a Sphinx extension that provides a source parser for *.ipynb files.

534

JUPYTER

nbupload

pip

widget to upload a file in a notebook

535

ndg-httpsclient

pip

BSD - See ndg/httpsclient/LICENCE file for details

Provides enhanced HTTPS support for httplib and urllib2 using PyOpenSSL

536

netaddr

pip

Pythonic manipulation of IPv4, IPv6, CIDR, EUI and MAC network addresses

537

netCDF4

wheel

xarray uses this module to save and read data (netCDF=Unidata network Common Data Form)

538

VIZ

networkx

pip

BSD

graph libraries, basic drawing

539

DATA/ML

neural-python

pip

MIT

NeuralPy is the Artificial Neural Network library implemented in Python.

540

nibabel

pip

Access a multitude of neuroimaging data formats.

541

DATA/ML

nitime

wheel

Nitime is a library for time-series analysis of data from neuroscience experiments.

542

DATA/ML

NLopt

wheel

LGPL

linear, quadratic optimization

543

NLP

nltk

pip

Apache License, Version 2.0

NLP, natural language processing

544

nodeenv

pip

BSD

Node.js virtual environment builder

545

nose

pip

GNU LGPL

run unit tests

546

JUPYTER

notebook

pip

BSD

Jupyter notebooks, new in Jupyter 4.0

547

JUPYTER

notedown

pip

Convert markdown to IPython notebook.

548

C++

nuitka

pip

C++ compilation, code optimization

549

C++

numba

wheel

BSD

Numba is an Open Source NumPy-aware optimizing compiler for Python sponsored by Continuum Analytics, Inc.

550

numcodecs

wheel

MIT

Numcodecs is a Python package providing buffer compression and transformation codecs for use in data storage and communication applications.

551

OPTIM

numdifftools

pip

Solves automatic numerical differentiation problems in one or more variables.

552

numexpr

wheel

MIT

Fast numerical array expression evaluator for Python, NumPy, PyTables, pandas, bcolz and more.

553

DATA/ML

numpy

wheel

BSD

matrix computation

554

numpydoc

pip

Sphinx extension to support docstrings in Numpy format

555

oauth2client

pip

The oauth2client is a client library for OAuth 2.0.

556

oauthlib

pip

BSD

A generic, spec-compliant, thorough implementation of the OAuth request-signing logic

557

DATA/ML

oct2py

pip

Python to GNU Octave bridge –> run m-files from python.

558

octave_kernel

pip

BSD

A Jupyter kernel for Octave.

559

odfpy

pip

GNU General Public License v.2 + Apache License v.2

Python API and tools to manipulate OpenDocument files

560

olefile

pip

Python package to parse, read and write Microsoft OLE2 files (Structured Storage or Compound Document, Microsoft Office) - Improved version of the OleFileIO module from PIL, the Python Image Library.

561

ML

onnx

wheel2

Open Neural Network Exchange

562

ML

onnxmltools

pip

Converts Machine Learning models to ONNX

563

opencv_python

wheel

BSD

OpenVC wrapper

564

OpenImageIO

wheel

A library for reading and writing images with emphasis on animation and visual effects.

565

openpyxl

pip

MIT/Expat

reads/writes Excel files, version is 1.8.6 due to pandas which does not work with more recent verrsions yet

566

ML

openTSNE

wheel

Fast t-SNE

567

optlang

pip

Formulate optimization problems using sympy expressions and solve them using interfaces to third-party optimization software (e.g. GLPK).

568

ORANGE

orange3

pip

Orange is a component-based data mining software. It includes a range of data visualization, exploration, preprocessing and modeling techniques. It can be used through a nice and intuitive user interface or, for more advanced users, as a module for the Python programming language.

569

ORANGE

orange3-associate

pip

This module implements FP-growth [1] frequent pattern mining algorithm with bucketing optimization [2] for conditional databases of few items.

570

ORANGE

Orange3-ImageAnalytics

pip

GPLv3+

Orange3 add-on for image data mining.

571

ORANGE

Orange3-Network

pip

GPLv3+

Orange Network is an add-on for Orange data mining software package. It provides network visualization and analysis tools.

572

ORANGE

orange3-text

pip

Orange3 Text extends Orange3, a data mining software package, with common functionality for text mining. It provides access to publicly available data, like NY Times, Twitter and PubMed. Further, it provides tools for preprocessing, constructing vector spaces (like bag-of-words, topic modeling and word2vec) and visualizations like word cloud end geo map. All features can be combined with powerful data mining techniques from the Orange data mining framework.

573

VIZ

osmapi

pip

GPLv3

Python wrapper for the OSM API

574

osqp

wheel

The OSQP (Operator Splitting Quadratic Program) solver is a numerical optimization package for solving quadratic problems

575

packaging

pip

BSD or Apache License, Version 2.0

Core utilities for Python packages

576

palettable

pip

MIT

Color palettes for Python

577

pamela

pip

An interface to the Pluggable Authentication Modules (PAM) library on linux, written in pure python (using ctypes)

578

DATA/ML

pandas

wheel

BSD

manipulate table as SQL in memory

579

DATA/ML

pandas-datareader

pip

BSD License

Up to date remote data access for pandas, works for multiple versions of pandas.

580

VIZ

pandas-highcharts

pip

MIT

plotting in javascript and pandas

581

TEACH

pandas_streaming

pip

pandas_streaming aims at processing big files with pandas, too big to hold in memory,too small to be parallelized with a significant gain. The module replicates a subset of pandas API and implements other functionalities for machine learning.

582

pandoc-attributes

pip

An Attribute class to be used with pandocfilters

583

pandocfilters

pip

BSD

Utilities for writing pandoc filters in python

584

param

pip

BSD

Declarative Python programming using Parameters.

585

NETWORK

paramiko

pip

LGPL

SSH2 protocol library

586

parsel

pip

Parsel is a library to extract data from HTML and XML using XPath and CSS selectors.

587

parsimonious

pip

(Soon to be) the fastest pure-Python PEG parser I could muster

588

parso

pip

MIT

Parso is a Python parser that supports error recovery and round-trip parsing for different Python versions (in multiple Python versions). Parso is also able to list multiple syntax errors in your python file.

589

partd

pip

BSD

Appendable key-value storage

590

passlib

pip

BSD

comprehensive password hashing framework supporting over 30 schemes

591

JUPYTER

path.py

pip

MIT

IPython, dependency

592

pathlib2

pip

for ggplot, Object-oriented filesystem paths

593

pathvalidate

pip

MIT

A python library to validate/sanitize a string such as filenames/variable-names/excel-sheet-names.

594

DATA/ML

patsy

pip

2-clause BSD

A Python package for describing statistical models and for building design matrices.

595

pbr

pip

Apache Software License

PBR is a library that injects some useful and sensible default behaviors into your setuptools run.

596

pdfminer3k

pip

MIT

PDF parser and analyzer

597

PDF

pdfrw

pip

MIT

PDF file reader/writer library

598

pep8

pip

official guidelines on Python style

599

persistent

wheel

Objets persistants translucides

600

JUPYTER

pexpect

pip

ISC license

needed by ipykernel on Linux, Pexpect makes Python a better tool for controlling other applications (needed by metakernel).

601

SQL

pg8000

pip

A Pure-Python PostgreSQL

602

picklable-itertools

pip

MIT

A reimplementation of the Python standard library’s itertools, in Python, using picklable iterator objects. (fuel)

603

JUPYTER

pickleshare

pip

MIT

IPython, dependency

604

Pillow

wheel

HPND

read/create images

605

PIMS

pip

As Is

Python Image Sequence (for trackpy)

606

pint

pip

Physical quantities module

607

pipdeptree

pip

MIT

displays module dependencies as a tree

608

pivottablejs

pip

MIT

PivotTable.js integration for Jupyter/IPython Notebook.

609

pkginfo

pip

MIT

Query metadatdata from sdists / bdists / installed packages.

610

plac

pip

BSD License

The smartest command line arguments parser in the world

611

planar

wheel

BSD

2D planar geometry library for Python.

612

VIZ

plotly

pip

MIT

Plotly’s Python graphing library makes interactive, publication-quality graphs online. Examples of how to make line plots, scatter plots, area charts, bar charts, error bars, box plots, histograms, heatmaps, subplots, multiple-axes, polar charts and bubble charts.

613

plotnine

pip

GPL-2

A grammar of graphics for Python

614

pluggy

pip

MIT license

plugin and hook calling mechanisms for python

615

ply

pip

Python Lex & Yacc (for thrifty)

616

Pmw

pip

Pmw is a toolkit for building high-level compound widgets in Python using the Tkinter module.

617

pocket

pip

BSD

Access to pocket API.

618

Polygon3

wheel

Handles polygonal shapes in 2D. This library is free for non-commercial use only.

619

portend

pip

TCP port monitoring utilities

620

preshed

wheel

MIT

Cython hash table that trusts the keys are pre-hashed

621

prince

pip

Prince is a factor analysis library for datasets that fit in memory.

622

proglog

pip

MIT - copyright Edinburgh Genome Foundry

Progress bar.

623

VIZ

progressbar2

pip

BSD

Text progress bar library for Python. (fuel)

624

JUPYTER

prometheus_client

pip

The official Python 2 and 3 client for Prometheus.

625

JUPYTER

prompt_toolkit

pip

Library for building powerful interactive command lines in Python

626

properties

pip

An organizational aid and wrapper for validation and tab completion of class properties

627

protobuf

pip

3-Clause BSD License

Protocol Buffers are Google’s data interchange format

628

pscript

pip

for flexx

629

psutil

wheel

BSD

cross-platform library for retrieving information onrunning processes and system utilization (CPU, memory, disks, network)in Python.

630

psycopg2

wheel

LGPL with exceptions or ZPL

Python-PostgreSQL Database Adapter

631

PTable

pip

BSD (3 clause)

A simple Python library for easily displaying tabular data in a visually appealing ASCII table format. (for streamparse)

632

JUPYTER/LINUX

ptyprocess

pip

ISC

dependency for the terminado, Run a subprocess in a pseudo terminal

633

DATA/ML

PuLP

wheel

linear, quadratique optimization with constraints

634

py

pip

MIT license

library with cross-python path, ini-parsing, io, code, log facilities

635

py-cpuinfo

pip

MIT

Py-cpuinfo gets CPU info with pure Python. Py-cpuinfo should work without any extra programs or libraries, beyond what your OS provides. It does not require any compilation(C/C++, assembly, et cetera) to use. It works with Python 2 and 3.

636

py4j

pip

BSD License

Enables Python programs to dynamically access arbitrary Java objects

637

OPTIM

pyamg

wheel

PyAMG is a library of Algebraic Multigrid (AMG) solvers with a convenient Python interface.

638

pyasn1

pip

BSD

ASN.1 types and codecs (for pysnmp)

639

pyasn1-modules

pip

BSD-2-Clause

A collection of ASN.1-based protocols modules.

640

PyAudio

wheel

MIT

PyAudio provides Python bindings for PortAudio v19, the cross-platform audio I/O library. With PyAudio, you can easily use Python to play and record audio streams on a variety of platforms (e.g., GNU/Linux, Microsoft Windows, and Mac OS X).

641

pybars3

pip

GNU Library or Lesser General Public License (LGPL)

Handlebars.js templating

642

C++

pybind11

pip

BSD

pybind11 is a lightweight header-only library that exposes C++ types in Python and vice versa, mainly to create Python bindings of existing C++ code.

643

DATA/ML

pyclustering

pip

GNU General Public License v3 (GPLv3)

many kinds of clustering (Optics, DBScan, x-means, …)

644

STYLE

pycodestyle

pip

Expat license

Python style guide checker

645

pycontracts

pip

PyContracts is a Python package that allows to declare constraints on function parameters and return values, setup for version 1.7.7 is bugged

646

pycosat

wheel

PicoSAT is a popular SAT solver written by Armin Biere in pure C.

647

pycparser

pip

BSD

pycparser is a complete parser of the C language, written in pure Python using the PLY parsing library. It parses C code into an AST and can serve as a front-end for C compilers or analysis tools.

648

pycrypto

wheel2

Public Domain + patent for some algorithm

Cryptographic modules for Python (not available on x64 and Python 3)

649

pycryptodomex

pip

BSD, Public Domain, Apache

Cryptographic modules for Python (not available on x64 and Python 3)

650

pyct

pip

BSD 3-Clause License

A utility package

651

GPU

pycuda

wheel

MIT

PyCUDA lets you access Nvidia’s CUDA parallel computation API from Python.

652

pycurl

wheel

LGPL/MIT

PycURL, a interface to the libcurl library. (for grab)

653

PyDispatcher

pip

Multi-producer-multi-consumer signal dispatching mechanism

654

MUSIC

pydub

pip

MIT

Pydub lets you do stuff to audio in a way that isn’t stupid.

655

pydy

pip

LICENSE.txt

Multibody Dynamics with Python

656

pyecharts

pip

MIT

pyecharts is a library to generate charts using Echarts. It simply provides the interface of 28+ kinds of charts between Echarts and Python.

657

pyecharts-javascripthon

pip

MIT

pyecharts-javascripthon helps translate Python functions into javascript ones. It uses javascripthon and dukpy to blend Python codes into javascript runtime. It supports python 2.7, 3.4, 3.5 and 3.6. It works on Linux, MacOS and Windows platforms.

658

pyecharts-snapshot

pip

pyecharts-snapshot renders the output of pyecharts as a png, jpeg, gif, svg image or a pdf file at command line or in your code.

659

pyeda

wheel

A library for electronic design automation.

660

pyemd

wheel

MIT

A Python wrapper for Ofir Pele and Michael Werman’s implementation of the Earth Mover’s Distance.

661

TEACH

pyenbc

pip

Helpers for remote work, clusters, hadoop

662

TEACH

pyensae

pip

MIT

helpers, Hadoop, SQL, financial times series, …

663

NETWORK

pyexecjs

pip

MIT

Run JavaScript code from Python (for pyreact)

664

STYLE

pyflakes

pip

MIT

verify pep8 on a script

665

DATA/ML

pyflux

wheel

As Is

An open source time series library for the Python Programming Language

666

ML

pyfm

wheel

Factorization Machine

667

VIZ

pygal

pip

GNU LGPL v3+

plotting (javascript)

668

VIZ

pygal_maps_ch

pip

GNU LGPL v3+

Swiss canton map for pygal

669

VIZ

pygal_maps_fr

pip

GNU LGPL v3+

French maps for pygal

670

VIZ

pygal_maps_world

pip

LGPLv3+

extension to pygal (maps)

671

SPHINX

pygal_sphinx_directives

pip

GNU LGPL v3+

Pygal sphinx integration

672

GUI

pygame

wheel

LGPL

GUI, interface for games (needs to be installed from www.pygame.org on Linux)

673

pygit2

wheel

Pygit2 is a set of Python bindings to the libgit2 shared library, libgit2 implements the core of Git.

674

GUI

pyglet

pip

a cross-platform windowing and multimedia library for Python

675

pygments

pip

BSD License

syntax highlighting package written in Python

676

pyhamcrest

pip

for twisted

677

PROFILING

pyinstrument

pip

BSD

A Python profiler that records the call stack of the executing code, instead of just the final function in it.

678

PROFILING

pyinstrument_cext

pip

Pyinstrument’s C extensions - reducing the overhead of statistical profilers

679

pyjsparser

pip

MIT

Fast javascript parser (based on esprima.js)

680

PyJWT

pip

MIT

JSON Web Token library for Python 3.

681

pylint

pip

GPL

statistics on Python script style

682

ML

pyltr

pip

BSD-new

pyltr is a Python learning-to-rank toolkit with ranking models, evaluation metrics, data wrangling helpers, and more.

683

pylzma

wheel2

LGPL

Python bindings for the LZMA library by Igor Pavlov.

684

pymatbridge

pip

pymatbridge is a set of python and matlab functions to allow these two systems to talk to each other

685

PHYS

pymatgen

wheel

MIT

Pymatgen (Python Materials Genomics) is a robust, open-source Python library for materials analysis.

686

pymatsolver

pip

A (sparse) matrix solver for python.

687

DATA/ML

pymc

wheel

Monte Carlo computation

688

PyMeta3

pip

Pattern-matching language based on OMeta for Python 3 and 2

689

pymkl

pip

Python wrapper of Intel MKL routines

690

TEACH

pymmails

pip

MIT

read/send emails

691

NoSQL

pymongo

wheel

Apache License, Version 2.0

Python wrapper for MongoDB

692

pympler

pip

Apache License, Version 2.0

Pympler is a development tool to measure, monitor and analyze the memory behavior of Python objects in a running Python application.

693

SQL

pymssql

wheel

A simple database interface for Python that builds on top of FreeTDS to provide a Python DB-API (PEP-249) interface to Microsoft SQL Server.

694

TEACH

pymyinstall

pip

MIT

easy installation of modules including Windows

695

SQL

PyMySQL

pip

MIT

Pure-Python MySQL Driver

696

NETWORK

PyNaCl

pip

Apache License 2.0

Python binding to the Networking and Cryptography (NaCl) library

697

SQL

pyodbc

wheel

access to protocal ODBC (SQL databases)

698

GPU

pyopencl

wheel

MIT

PyOpenCL lets you access the OpenCL parallel computation API from Python.

699

PyOpenGL

wheel

BSD

use OpenGL in Python

700

PyOpenGL_accelerate

wheel

BSD

Acceleration code for PyOpenGL

701

pyOpenSSL

pip

Apache License, Version 2.0

Python wrapper module around the OpenSSL library

702

pyotp

pip

PyOTP is a Python library for generating and verifying one-time passwords.

703

pypandoc

pip

MIT

Pypandoc provides a thin wrapper for pandoc, a universal document converter.

704

pyparsing

pip

MIT License

alternative approach to creating and executing simple grammars

705

PDF

pyPdf

github

BSD

read PDF

706

PDF

PyPDF2

pip

BSD

PDF toolkit

707

pypiserver

pip

BSD

run a local pypi server

708

pypmc

wheel

GPLv2

pypmc is a python package focusing on adaptive importance sampling.

709

pypng

pip

MIT

Pure Python PNG image encoder/decoder

710

VIZ

pyproj

wheel

OSI Approved

python interface to PROJ4 library for cartographic transformations https://jswhit.github.io/pyproj, needed by cartopy

711

GUI

PyQt5

pip

GPL v3

712

GUI

PyQt5-sip

pip

SIP

713

GUI

pyqtgraph

pip

Scientific Graphics and GUI Library for Python, depends on PySide

714

TEACH

pyquickhelper

pip

MIT

helpers to generation documentation

715

NETWORK

pyreact

pip

Python bridge to JSX & the React JavaScript library. (for pyxley)

716

pyreadline

pip

python implementation of GNU readline functionality

717

pyrsistent

pip

MIT

Pyrsistent is a number of persistent collections (by some referred to as functional data structures). Persistent in the sense that they are immutable.

718

TEACH

pyrsslocal

pip

MIT

RSS readers

719

MATHS

PySAL

pip

PySAL is an open source library of spatial analysis functions written in Python intended to support the development of high level applications.

720

pyshp

pip

MIT

Pure Python read/write support for ESRI Shapefile format

721

GUI

PySide2

pip

The Qt for Python project aims to provide a complete port of the PySide module to Qt 5.

722

pysmi

pip

BSD

SNMP SMI/MIB Parser (for pysnmp)

723

NETWORK

pysnmp

pip

A pure-Python SNMPv1/v2c/v3 library

724

PySocks

pip

A semi-actively maintained SocksiPy fork. Contains many improvements to the original.

725

pystache

pip

MIT

Mustache for Python (for bokeh)

726

DATA/ML

pystan

pip

GNU General Public License v3 (GPLv3)

PyStan provides an interface to Stan, a package for Bayesian inference using the No-U-Turn sampler, a variant of Hamiltonian Monte Carlo.

727

pystorm

pip

Battle-tested Apache Storm Multi-Lang implementation for Python.

728

ML

pystruct

wheel

Learning Structured Prediction in Python

729

pytablereader

pip

MIT

A python library to load structured table data from files/URL with various data format: CSV/Excel/HTML/JSON/LTSV/Markdown/TSV.

730

pytablewriter

pip

convert a dataframe into many formats

731

pytagcloud

pip

Create beautiful tag clouds as images or HTML

732

pytest

pip

MIT license

pytest: simple powerful testing with Python

733

pytest_runner

pip

Setup scripts can use pytest-runner to add setup.py test support for pytest runner.

734

python-daemon

pip

Library to implement a well-behaved Unix daemon process (for luigi)

735

python-dateutil

pip

Dual License

helpers to manipulate dates

736

python-docx

pip

read/write Word document

737

python-editor

pip

Programmatically open an editor, capture the result.

738

python-gmaps

pip

BSD

Google Maps API client http://python-gmaps.readthedocs.org

739

python-jenkins

pip

BSD

interact with Jenkins

740

ML

python-louvain

pip

Louvain algorithm

741

python-mimeparse

pip

MIT

A module provides basic functions for parsing mime-type names and matching them against a list of media-ranges. (falcon)

742

python-pptx

pip

The MIT License (MIT)

read/write PowerPoint presentation

743

python-utils

pip

Python Utils is a module with some convenient utilities not included with the standard Python install

744

python3-linkedin

pip

MIT

Python Interface to the LinkedIn API

745

python_Levenshtein

wheel

C implementation of Levenshtein distance.

746

python_oauth2

pip

for jupyterhup

747

pythonnet

wheel

MIT

Python binding for C#

748

pythonqwt

pip

MIT

Qt plotting widgets (Spyder)

749

VIZ

pythreejs

pip

BSD

A Python / ThreeJS bridge utilizing the Jupyter widget infrastructure.

750

pytils

pip

Russian-specific string utils (for selection, weblib)

751

pytool

pip

A collection of tools for Python

752

pytools

pip

MIT

A collection of tools for Python

753

pytube

pip

MIT

play with youtube videos

754

pytz

pip

MIT

World timezone definitions, modern and historical

755

VIZ

pyviz_comms

pip

Bidirectional communication for PyViz.

756

DATA/ML

PyWavelets

wheel

MIT

wavelets computation

757

pywinpty

wheel

Python bindings for the winpty pseudo terminal library. It allows to create and communicate with Windows processes that print outputs and recieve inputs via console input and output pipes.

758

VIZ

PyX

wheel

GNU General Public License (GPL)

plotting

759

NETWORK

pyxley

pip

A pure-Python SNMPv1/v2c/v3 library

760

PyYAML

wheel

MIT

YAML parser and emitter for Python

761

NETWORK

pyzmq

wheel

LGPL+BSD

python librairies for Omz (pipy distributes the binaries)

762

VIZ

qgrid

pip

A Pandas DataFrame viewer for IPython Notebook.

763

DATA/ML

QInfer

pip

QInfer is a library using Bayesian sequential Monte Carlo for quantum parameter estimation.

764

QtAwesome

pip

QtAwesome enables iconic fonts such as Font Awesome and Elusive Icons in PyQt and PySide applications.

765

JUPYTER

qtconsole

pip

BSD

IPython, notebooks, qtconsole

766

GUI

qtpy

pip

single interface for QtPy4, 5, PySide

767

Quandl

pip

MIT

access Quandl API

768

queuelib

pip

BSD

Collection of persistent (disk-based) queues

769

OPTIM

qutip

wheel

BSD

QuTiP is open-source software for simulating the dynamics of open quantum systems.

770

recommonmark

pip

MIT

For pymc3???

771

recordclass

wheel

A mutable variant of collections.namedtuple, which supports assignments.

772

redis

pip

Python client for Redis key-value store

773

regex

wheel

Python Software Foundation License

Alternative regular expression module, to replace re.

774

reportlab

wheel

BSD license (see license.txt for details), Copyright (c) 2000-2018, ReportLab Inc.

This is the ReportLab PDF Toolkit. It allows rapid creation of rich PDF documents, and also creation of charts in a variety of bitmap and vector formats.

775

requests

pip

Apache 2.0

human interface for http

776

requests-cache

pip

Persistent cache for requests library

777

requests-file

pip

Apache 2.0

File transport adapter for Requests

778

requests-ftp

pip

Apache 2.0

FTP Transport Adapter for Requests

779

requests-oauthlib

pip

ISC

OAuthlib authentication support for Requests.

780

retrying

pip

Apache 2.0

Retrying is an Apache 2.0 licensed general-purpose retrying library, written in Python, to simplify the task of adding retry behavior to just about anything.

781

rope_py3k

pip

GPL=OpenBSD

refactoring library

782

DATA/ML

rpy2

wheel

interact with R (R_HOME needs to be set up on Linux)

783

rsa

pip

ASL 2

Pure-Python RSA implementation

784

MATHS

Rtree

wheel

GNU Library or Lesser General Public License (LGPL)

R-Tree spatial index for Python GIS

785

ruamel.yaml

pip

ruamel.yaml is a YAML parser/emitter that supports roundtrip preservation of comments, seq/map flow style, and map key order

786

rx

pip

Reactive Extensions (Rx) for Python (LINQ)

787

AWS

s3fs

pip

Convenient Filesystem interface over S3

788

AWS

s3transfer

pip

Apache License 2.0

An Amazon S3 Transfer Manager

789

ML

sacred

pip

MIT

Facilitates automated and reproducible experimental research

790

sas7bdat

pip

read/write SAS format

791

scikit-fusion

pip

GPLv3

A Python module for data fusion built on top of factorized models.

792

scikit-image

wheel

Modified BSD

scikit-image is a collection of algorithms for image processing.

793

DATA/ML

scikit-learn

wheel

new BSD

machine learning

794

VIZ

scikit-plot

pip

MIT License

nicer graphs than matplotlib for datascientist

795

DATA/ML

scilab2py

pip

Python to Scilab bridge

796

DATA/ML

scipy

wheel

BSD

scientific computation, eigen values, linear algebra

797

DATA/ML

scoop

pip

GNU Library or Lesser General Public License (LGPL)

SCOOP (Scalable COncurrent Operations in Python) is a distributed task module allowing concurrent parallel programming on various environments, from heterogeneous grids to supercomputers

798

ML

scrapy

pip

A high-level Web Crawling and Web Scraping framework

799

OPTIM

scs

wheel

Solves convex cone programs via operator splitting.

800

VIZ

seaborn

pip

BSD (3-clause)

nicer graphs than matplotlib for statistical purposes

801

ML

seasonal

pip

MIT

timeseries

802

secretstorage

pip

BSD 3-Clause License

This module provides a way for securely storing passwords and other secrets.

803

selection

pip

API to extract data from HTML and XML documents. (for grab)

804

NETWORK

selenium

pip

Apache 2.0

Python wrapper for Selenium

805

semantic_version

pip

BSD

A library implementing the “SemVer” scheme.

806

semver

pip

Python helper for Semantic Versioning (http://semver.org/)

807

Send2Trash

pip

BSD License

Send file to trash natively under Mac OS X, Windows and Linux.

808

ML

seqlearn

wheel

sequence classification toolkit for Python

809

serverfiles

pip

An utility that accesses files on a HTTP server and stores them locally for reuse.

810

service_identity

pip

MIT

Service identity verification for pyOpenSSL.

811

setproctitle

wheel

BSD

A Python module to customize the process title

812

setuptools-git

pip

BSD

This is a plugin for setuptools that enables git integration. Once installed, Setuptools can be told to include in a package distribution all the files tracked by git. This is an alternative to explicit inclusion specifications with MANIFEST.in.

813

OPTIM

sfepy

wheel

Simple Finite Elements in Python

814

Shapely

wheel

BSD

Manipulation and analysis of geometric objects in the Cartesian plane.

815

shiboken2

pip

for PySide

816

simhash

pip

MIT

A Python implementation of Simhash Algorithm

817

JUPYTER

simplegeneric

pip

ZPL 2.1

IPython, dependency

818

simplejson

wheel

MIT License

Simple, fast, extensible JSON encoder/decoder for Python

819

SimpleSQLite

pip

MIT

SimpleSQLite is a Python library to simplify the table creation and data insertion into SQLite database.

820

singledispatch

pip

MIT

for nltk

821

GUI

SIP

pip

GPL v2 or GPL v3 or BSD

For PyQt5

822

six

pip

MIT

helpers for python 2/3 conversion

823

ML

sklearn_contrib_lightning

wheel

large-scale linear classification, regression and ranking

824

ML

sklearn_contrib_py_earth

wheel

A Python implementation of Jerome Friedman’s Multivariate Adaptive Regression Splines algorithm, in the style of scikit-learn. The py-earth package implements Multivariate Adaptive Regression Splines using Cython and provides an interface that is compatible with scikit-learn’s Estimator, Predictor, Transformer, and Model interfaces. For more information about Multivariate Adaptive Regression Splines, see the references below.

825

ML

sklearn_pandas

pip

~MIT

This module provides a bridge between Scikit-Learn’s machine learning methods and pandas-style Data Frames.

826

ML

skll

pip

BSD

SciKit-Learn Laboratory makes it easier to run machinelearning experiments with scikit-learn.

827

slicerator

pip

BSD

A lazy-loading, fancy-sliceable iterable.

828

smart_open

pip

Public Domain

Utils for streaming large files (S3, HDFS, gzip, bz2…), provides the same API for many format

829

smmap2

pip

BSD

A pure python implementation of a sliding window memory map manager

830

PROFILING

snakeviz

pip

BSD

SnakeViz is a browser based graphical viewer for the output of Python’s cProfile module.

831

snowballstemmer

pip

BSD-3-Clause

This package provides 16 stemmer algorithms (15 + Porter English stemmer) generated from Snowball algorithms, needed by sphinx-rtd-theme.

832

NETWORK

sockjs-tornado

pip

SockJS-tornado is a Python server side counterpart of SockJS-client browser library running on top of Tornado framework.

833

SPHINX

solar_theme

pip

BSD~MIT

sphinx theme

834

sortedcontainers

pip

Apache 2.0

Python Sorted Container Types: SortedList, SortedDict, and SortedSet

835

sounddevice

wheel

MIT

This Python module provides bindings for the PortAudio library and a few convenience functions to play and record NumPy arrays containing audio signals.

836

soupsieve

pip

MIT License

Soup Sieve is a CSS selector library designed to be used with Beautiful Soup 4.

837

NLP

spacy

wheel

MIT

Industrial-strength NLP

838

sparse

pip

This implements sparse arrays of arbitrary dimension on top of numpy and scipy.sparse

839

spectrum

wheel

Spectrum contains tools to estimate Power Spectral Densities using methods based on Fourier transform, Parametric methods or eigenvalues analysis

840

spglib

wheel

BSD

Python bindings for C library for finding and handling crystal symmetries

841

SPHINX

sphinx

pip

BSD

documentation generation based on RST

842

SPHINX

sphinx-better-theme

pip

sphinx theme

843

SPHINX

sphinx-bulma-theme

pip

sphinx theme

844

SPHINX

sphinx-corlab-theme

pip

LGPLv3+

sphinx theme

845

SPHINX

sphinx-gallery

pip

3-clause BSD

gallery for examples

846

SPHINX

sphinx-ioam-theme

pip

sphinx theme

847

SPHINX

sphinx-materialdesign-theme

pip

sphinx theme

848

SPHINX

sphinx-modern-theme

pip

MIT

sphinx theme

849

SPHINX

sphinx-redactor-theme

pip

MIT

sphinx theme

850

SPHINX

sphinx-rtd-theme

pip

MIT

sphinx theme

851

SPHINX

sphinx_bootstrap_theme

pip

MIT

sphinx theme

852

SPHINX

sphinx_modern_theme_modified

pip

sphinx theme

853

SPHINX

sphinx_py3doc_enhanced_theme

pip

BSD

sphinx theme

854

SPHINX

sphinx_readable_theme

pip

MIT

sphinx theme

855

SPHINX

sphinx_theme_pd

pip

MIT

sphinx theme

856

SPHINX

sphinxcontrib-applehelp

pip

BSD

extension for Sphinx

857

sphinxcontrib-blockdiag

pip

BSD

Sphinx “blockdiag” extension

858

SPHINX

sphinxcontrib-devhelp

pip

BSD

extension for Sphinx

859

SPHINX

sphinxcontrib-htmlhelp

pip

BSD

extension for Sphinx

860

SPHINX

sphinxcontrib-imagesvg

pip

BSD

support svg in Sphinx

861

SPHINX

sphinxcontrib-jsdemo

pip

BSD

to demo HTML or Javascript in Sphinx documentation

862

SPHINX

sphinxcontrib-jsmath

pip

BSD

extension for Sphinx

863

SPHINX

sphinxcontrib-lunrsearch

pip

simple autocompletion for the sphinx documentation

864

SPHINX

sphinxcontrib-qthelp

pip

BSD

extension for Sphinx

865

SPHINX

sphinxcontrib-serializinghtml

pip

BSD

extension for Sphinx

866

SPHINX

sphinxcontrib-websupport

pip

BSD

extension for Sphinx

867

SPHINX

sphinxjp.themecore

pip

A sphinx theme plugin extension.

868

SPHINX

sphinxjp.themes.basicstrap

pip

MIT

sphinx theme

869

SPHINX

sphinxjp.themes.gopher

pip

MIT

sphinx theme

870

SPHINX

sphinxjp.themes.htmlslide

pip

MIT

sphinx theme

871

SPHINX

sphinxjp.themes.sphinxjp

pip

MIT

sphinx theme

872

SPHINX

sphinxtrap

pip

sphinx theme

873

NETWORK

splinter

pip

As Is

browser abstraction for web acceptance testing

874

sputnik

pip

Data package manager library

875

spyder

pip

scientific IDE

876

spyder-kernels

pip

Jupyter Kernels for the Spyder console

877

SQL

SQLAlchemy

wheel

MIT

model SQL queries as objects

878

sqlite_bro

pip

MIT

GUI for SQLite

879

sqliteschema

pip

sqliteschema is a Python library to dump table schema of a SQLite database file.

880

SQL

sqlparse

pip

Non-validating SQL parser

881

DATA/ML

statsmodels

wheel

BSD License

statistical modelling, depends on scipy

882

stopit

pip

GPLv3

Timeout control decorator and context managers, raise any exception in another thread.

883

streamparse

pip

Streamparse lets you run Python code against real-time streams of data via Apache Storm.

884

structures

pip

User-friendly library for creating data structures.

885

suds-jurko

pip

Lightweight SOAP client (Jurko’s fork)

886

supersmoother

pip

BSD 3-clause

This is an efficient implementation of Friedman’s SuperSmoother [1] algorithm in pure Python. It makes use of numpy for fast numerical computation.

887

swiglpk

wheel

swiglpk - Simple swig bindings for the GNU Linear Programming Kit

888

sympy

pip

BSD

SymPy is a Python library for symbolic mathematics.

889

TA_Lib

wheel

This is a Python wrapper for TA-LIB based on Cython instead of SWIG.

890

tabledata

pip

A Python library to represent tabular data for pytablewriter/pytablereader/SimpleSQLite.

891

tables

wheel

BSD 2-Clause

PyTables is a package for managing hierarchical datasets and designed to efficiently and easily cope with extremely large amounts of data.

892

tabulate

pip

MIT

Pretty-print tabular data

893

tblib

pip

BSD 2-Clause License

Traceback fiddling library. For now allows you to pickle tracebacks and raise exceptions with pickled tracebacks in different processes. This allows better error handling when running code over multiple processes (imagine multiprocessing, billiard, futures, celery etc)

894

TEACH

teachpyx

pip

MIT

materials for teachings

895

tempora

pip

MIT

Objects and routines pertaining to date and time (tempora)

896

DATA/ML

tensorboard

pip

Apache 2.0

TensorBoard lets you watch Tensors Flow

897

DEEP LEARNING

tensorflow

wheel

Apache 2.0

Deep Learning from Google

898

DEEP LEARNING

tensorflow-estimator

pip

Apache 2.0

Deep Learning from Google

899

termcolor

pip

MIT

ANSII Color formatting for output in terminal.

900

JUPYTER/LINUX

terminado

pip

BSD

dependency for the notebooks, Terminals served to term.js using Tornado websockets

901

testpath

pip

MIT

Test utilities for code working with files and commands

902

text-unidecode

pip

The most basic Text::Unidecode port

903

textblob

pip

MIT

TextBlob is a Python (2 and 3) library for processing textual data. It provides a simple API for diving into common natural language processing (NLP) tasks such as part-of-speech tagging, noun phrase extraction, sentiment analysis, classification, translation, and more.

904

texttable

pip

module for creating simple ASCII tables

905

OPTIM

thinc

wheel

MIT

Practical Machine Learning for NLP. Thinc is the machine learning library powering spaCy.

906

thrift

pip

Python bindings for the Apache Thrift RPC system

907

thriftpy

pip

pure python implemention of Apache Thrift.

908

tinycss2

pip

BSD

tinycss2 is a complete yet simple CSS parser for Python. It supports the full syntax and error handling for CSS 2.1 as well as some CSS 3 modules

909

noSQL

tinydb

pip

TinyDB is a tiny, document oriented database optimized for your happiness :) It’s written in pure Python and has no external requirements.

910

TEACH

tkinterquickhelper

pip

MIT

windows on the top of tkinter

911

toml

pip

MIT

Python Library for Tom’s Obvious, Minimal Language

912

DATA/ML

toolz

pip

BSD

Toolz provides a set of utility functions for iterators, functions, and dictionaries.

913

DEEP LEARNING

torch

wheel

BSD-3

PyTorch is a deep learning framework that puts Python first.

914

DEEP LEARNING

torchvision

pip

BSD

image and video datasets and models for torch deep learning

915

NETWORK

tornado

wheel

http://www.apache.org/licenses/LICENSE-2.0

python server, IPython relies on it

916

VIZ

toyplot

pip

BSD

The kid-sized plotting toolkit for Python with grownup-sized goals.

917

ML

TPOT

pip

GNU/LGPLv3

Consider TPOT your Data Science Assistant. TPOT is a Python tool that automatically creates and optimizes machine learning pipelines using genetic programming.

918

JUPYTER

tqdm

pip

MPLv2.0, MIT Licences

A Simple Python Progress Meter

919

traceback2

pip

traceback2 is a backport of the new features added to the traceback testing framework in Python 2.7 and onwards.

920

DATA/ML

trackpy

pip

BSD - 3 clauses

trackpy is a Python package for particle tracking in 2D, 3D, and higher dimensions.

921

JUPYTER

traitlets

pip

BSD

IPython, dependency

922

traits

pip

BSD

explicitly typed attributes for Python

923

traittypes

pip

BSD

Custom trait types for scientific computing.

924

transaction

pip

Transaction management for Python. This package contains a generic transaction implementation for Python. It is mainly used by the ZODB.

925

translationstring

pip

Utility library for i18n relied on by various Repoze and Pyramid packages

926

ML

treeinterpreter

pip

BSD

Package for interpreting scikit-learn’s decision tree and random forest predictions. Allows decomposing each prediction into bias and feature contribution components

927

triangle

wheel

GNU LGPL

Python Triangle is a python wrapper around Jonathan Richard Shewchuk’s two-dimensional quality mesh generator and delaunay triangulator library.

928

trollius

pip

Port of the Tulip project (asyncio module, PEP 3156)

929

TEACH

tutormagic

pip

brings PythonTutor in a notebok

930

tweepy

pip

Python wrapper for the twitter API

931

Twisted

wheel

An asynchronous networking framework written in Python

932

typecheck-decorator

pip

verifies decorators at running time

933

typed_ast

pip

typed_ast is a Python 3 package that provides a Python 2.7 and Python 3 parser similar to the standard ast library.

934

typepy

pip

MIT

A python library for variable type checker/validator/converter at run time.

935

ML

typing_extensions

pip

Typing Extensions - Backported and Experimental Type Hints for Python

936

tzlocal

pip

MIT

tzinfo object for the local timezone

937

ufal.udpipe

pip

for orange3-text

938

ujson

wheel

BSD License

Ultra fast JSON encoder and decoder for Python

939

uncertainties

pip

uncertainties allows calculations such as (2 +/- 0.1)*2 = 4 +/- 0.2 to be performed transparently.

940

unidecode

pip

GPLv2+

ASCII transliterations of Unicode text

941

STYLE

unify

pip

Expat License

Modifies strings to all use the same (single/double) quote where possible.

942

unittest2

pip

unittest2 is a backport of the new features added to the unittest testing framework in Python 2.7 and onwards.

943

untangle

pip

Converts XML to Python objects

944

STYLE

untokenize

pip

Expat License

Transforms tokens into original source code (while preserving whitespace).

945

update_checker

pip

Simplified BSD License

A python module that will check for package updates.

946

uritemplate

pip

BSD 3-Clause License or Apache License, Version 2.0

URI templates

947

urllib3

pip

MIT

urllib2 extension

948

urwid

pip

for pyfm

949

user-agent

pip

MIT

A library to identify devices (phones, tablets) and their capabilities by parsing (browser/HTTP) user agent strings

950

uvloop

pip

MIT

Fast implementation of asyncio event loop on top of libuv

951

validate_email

pip

LGPL

Validate_email verify if an email address is valid and really exists.

952

vectormath

pip

Vector math utilities for python (used by discretize).

953

VIZ

vega

pip

BSD 3-clause

Python/Jupyter notebook module for Vega, and Vega-Lite, Polestar, and Voyager. Notebooks with embedded visualizations can be viewed on github and nbviewer.

954

Versio

pip

localshop dependency, manages versions

955

version-information

pip

BSD

Version information

956

VIZ

vincent

pip

plotting

957

vine

pip

Promises, promises, promises

958

virtualenv

pip

MIT

creatre virtual environments

959

viscm

pip

MIT

tool for analyzing colormaps and creating new colormaps.

960

vispy

pip

(new) BSD

Vispy is a high-performance interactive 2D/3D data visualization library.

961

PROFILING

vprof

pip

BSD

vprof is a Python package providing rich and interactive visualizations for various Python program characteristics such as running time and memory usage.

962

w3lib

pip

BSD

Library of web-related functions

963

NETWORK

waitress

pip

ZPL 2.1

Waitress WSGI server

964

JUPYTER

wcwidth

pip

MIT

Measures number of Terminal column cells of wide-character codes

965

webcolors

pip

BSD 3-Clause

A library for working with color names and color value formats defined by the HTML and CSS specifications for use in documents on the Web.

966

webencodings

pip

BSD

Character encoding aliases for legacy web content

967

weblib

pip

Set of tools for web scraping projects (for grab)

968

webruntime

pip

for flexx

969

werkzeug

pip

BSD-3-Clause

The Swiss Army knife of Python web development

970

wget

pip

none

pure python download utility

971

wheel

pip

MIT

handle wheels

972

whoosh

pip

search engine in Python

973

JUPYTER

widgetsnbextension

pip

BSD

Interactive HTML widgets for Jupyter notebooks.

974

wikipedia

pip

MIT

Wikipedia API for Python

975

SPHINX

wild_sphinx_theme

pip

BSD

sphinx theme

976

win_unicode_console

pip

MIT

977

winrandom

wheel

This module gives direct access to Windows Cryptographic API CryptGetRandom() function, which is cryptographically strong pseudo-random number generator (PRNG) on Windows:

978

VIZ

wordcloud

wheel

MIT

A little word cloud generator in Python.

979

wptools

pip

MIT

Wikipedia tools (for Humans)

980

TEACH

wrapclib

wheel

Wraps existing C libraries.

981

wrapt

wheel

BSD

A Python module for decorators, wrappers and monkey patching.

982

wurlitzer

pip

for spyder

983

x86cpu

wheel

BSD license

Uses cpuid instruction to get information about CPU. Queries OS as well as cpuid to see if the OS / CPU supports AVX instructions.

984

DATA/ML

xarray

pip

Apache

pandas like library for cubes (N-dimensional data)

985

DATA/ML

xgboost

wheel

Apache-2.0

Parallelized Stochastic Gradient Descent (only available on Python 3 and x64)

986

xlrd

pip

BSD

reads Excel files

987

XlsxWriter

pip

BSD

writes Excel files

988

xlwt

pip

BSD

writes Excel files

989

xmltodict

pip

MIT

Makes working with XML feel like you are working with JSON

990

xxhash

wheel

BSD

xxHash is an Extremely fast Hash algorithm, running at RAM speed limits.

991

yahoo-historical

pip

MIT License

Python module to get stock data from Yahoo! Finance

992

yapf

pip

Code formatter

993

SPHINX

yummy-sphinx-theme

pip

sphinx theme

994

zarr

pip

MIT

A minimal implementation of chunked, compressed, N-dimensional arrays for Python.

995

zc.lockfile

pip

Basic inter-process locks

996

ZConfig

pip

Structured Configuration Library

997

zdaemon

pip

Daemon process control library and tools for Unix-based systems

998

ZEO

pip

ZEO provides a client-server storage implementation for ZODB.

999

SPHINX

zerovm-sphinx-theme

pip

Apache License 2.0

sphinx theme

1000

zict

pip

BSD

The dictionary / mutable mapping interface is powerful and multi-faceted.

1001

zipline

wheel

A backtester for financial algorithms.

1002

zipp

pip

A pathlib-compatible Zipfile object wrapper.

1003

ZODB

pip

The Zope Object Database provides an object-oriented database for Python that provides a high-degree of transparency.

1004

ZODB3

pip

ZODB3 - Meta release for ZODB, persistent, BTrees and ZEO

1005

zodbpickle

wheel

This package presents a uniform pickling interface for ZODB.

1006

zope.exceptions

pip

Zope Public License

Zope exception

1007

zope.interface

wheel

interfaces for python

1008

zs

wheel

BSD

S is a compressed, read-only file format for efficiently distributing, querying, and archiving arbitrarily large record-oriented datasets.

Footnotes