Commit dd89fbd

Merge pull request #2 from pauldevos/add-analytics
added new data analysis libraries
2 parents ab8d99b + e48e1c7 commit dd89fbd

1 file changed

+36
-0
lines changed

Python-Data-Analysis-Libraries.md

Lines changed: 36 additions & 0 deletions
@@ -0,0 +1,36 @@
# Python-Data-Analysis-Libraries

- NumPy
- Pandas
- Vaex
- [Modin](https://modin.readthedocs.io/en/stable/)
- Ray
- Fugue
- Spark
- [Datatable](https://github.com/h2oai/datatable)
- [Polars](https://github.com/pola-rs/polars)
- Dask

| Library | Multiprocessing | Distributed | Workload Capacity | Documentation |
|---|---|---|---|---|
| Pandas | No | No | < 50 GB | [Docs](https://github.com/pandas-dev/pandas) |
| NumPy | No | No | < 200 GB | [Docs](https://github.com/numpy/numpy) |
| Modin | Yes | Yes | < 5000 GB | [Docs](https://modin.readthedocs.io/en/stable/) |
| Dask | Yes | Yes | < 50000 GB | [Docs](https://github.com/dask/dask) |
| Ray | Yes | Yes | < 2000 GB | [Docs](https://github.com/ray-project/ray) |
| PySpark | Yes | Yes | < 500000 GB | [Docs](https://github.com/apache/spark) |
| Vaex | Yes | No | < 2000 GB | [Docs](https://github.com/vaexio/vaex) |
| Datatable | Yes | No | < 200 GB | [Docs](https://github.com/h2oai/datatable) |
| Polars | Yes | No | < 200 GB | [Docs](https://github.com/pola-rs/polars) |
| cuDF | Yes | Yes | < 200 GB | [Docs](https://github.com/rapidsai/cudf) |
| Fugue | Yes | Yes | < 200 GB | [Docs](https://github.com/fugue-project/fugue) |

| Col1 | col2 | col3 | col4 | col5 |
|---|---|---|---|---|
| a | d | g | j | |
| b | e | h | k | |
| c | f | i | l | |
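
As a reading aid for the comparison table added in this commit, here is a minimal sketch of parsing it and listing the libraries it marks as distributed, ordered by the capacity stated in the table. It uses only the Python standard library. The helper names `parse_markdown_table` and `capacity_gb` are hypothetical and not part of the repository. The capacity figures are taken from the table as written, not measured.

```python
# Rows transcribed from the comparison table in this commit
# (the Documentation column is omitted for brevity).
TABLE = """\
| Library | Multiprocessing | Distributed | Workload Capacity |
|---|---|---|---|
| Pandas | No | No | < 50 GB |
| NumPy | No | No | < 200 GB |
| Modin | Yes | Yes | < 5000 GB |
| Dask | Yes | Yes | < 50000 GB |
| Ray | Yes | Yes | < 2000 GB |
| PySpark | Yes | Yes | < 500000 GB |
| Vaex | Yes | No | < 2000 GB |
| Datatable | Yes | No | < 200 GB |
| Polars | Yes | No | < 200 GB |
| cuDF | Yes | Yes | < 200 GB |
| Fugue | Yes | Yes | < 200 GB |
"""


def parse_markdown_table(text):
    """Parse a simple pipe-delimited Markdown table into a list of dicts.

    Hypothetical helper: assumes one header row, one separator row,
    and no escaped pipes inside cells.
    """
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    # Drop the outer pipes, then split each line into trimmed cells.
    rows = [[cell.strip() for cell in line.strip("|").split("|")] for line in lines]
    header, body = rows[0], rows[2:]  # rows[1] is the |---| separator row
    return [dict(zip(header, row)) for row in body]


def capacity_gb(row):
    """Extract the numeric upper bound from a cell such as '< 200 GB'."""
    return int(row["Workload Capacity"].lstrip("< ").split()[0])


rows = parse_markdown_table(TABLE)

# Libraries the table marks as distributed, largest stated capacity first.
distributed = sorted(
    (r for r in rows if r["Distributed"] == "Yes"),
    key=capacity_gb,
    reverse=True,
)
distributed_names = [r["Library"] for r in distributed]
print(distributed_names)
```

Keeping the rows as plain dictionaries makes it easy to filter on the other columns in the same way, for example `r["Multiprocessing"] == "Yes"`.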

0 commit comments
