Best Python modules for machine learning, data mining, natural language processing, network analysis, and web scraping
Basics:
- NumPy, numerical library, numpy.scipy.org/
- SciPy, advanced math, signal processing, optimization, statistics, www.scipy.org/
- matplotlib, Python plotting, matplotlib.org
Machine Learning and Data Mining:
- MDP, a collection of supervised and unsupervised learning algorithms, pypi.python.org/pypi/MDP/2.4
- mlpy, Machine Learning Python, mlpy.sourceforge.net
- NetworkX, for graph analysis, networkx.lanl.gov/
- Orange, Data Mining Fruitful & Fun, biolab.si
- pandas, Python Data Analysis Library, pandas.pydata.org
- PyBrain, pybrain.org
- scikit-learn, classic machine learning algorithms; provides simple and efficient solutions to learning problems, scikit-learn.org/stable/
Natural Language:
- NLTK, Natural Language Toolkit, nltk.org
For web scraping:
- Scrapy, an open source web scraping framework for Python, scrapy.org
- urllib/urllib2
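As a rough illustration of the standard-library route, here is a minimal sketch of pulling links out of a page. It assumes Python 3, where the functionality of urllib2 lives in urllib.request. The names LinkCollector, extract_links and fetch_links are made up for this example and are not part of any library.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen


class LinkCollector(HTMLParser):
    """Collect the href of every <a> tag, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the tag being opened
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Return absolute URLs for all links found in an HTML string."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    return parser.links


def fetch_links(url, timeout=10):
    """Download a page and return its links (hypothetical helper, needs network)."""
    with urlopen(url, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        html = response.read().decode(charset, errors="replace")
    return extract_links(html, url)


if __name__ == "__main__":
    # Offline demo on an inline HTML snippet, so no request is made
    sample = '<p><a href="/docs">Docs</a> <a href="http://scrapy.org">Scrapy</a></p>'
    print(extract_links(sample, "http://example.com/"))
```

For anything beyond a handful of pages (crawling, throttling, retries, item pipelines), a framework such as Scrapy is probably the better fit. The sketch above is only meant to show what the plain standard library can do.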
Mind Map
[Image: Mind map of Python packages]
Sources:
- http://www.kdnuggets.com/2012/11/best-python-modules-for-data-mining.html
- http://www.datacommunitydc.org/blog/2013/05/stepping-up-to-big-data-with-r-and-python
- Image: http://www.xmind.net/m/WvfC/
PyCon tutorials:
- http://www.datacommunitydc.org/blog/2013/07/python-for-data-analysis-the-landscape-of-tutorials