# ***Atlantis***
***Atlantis*** is a Python library for simplifying programming with Python for data science.
# Installation
You can just use pip to install Atlantis:
`pip install atlantis`
# Modules
- [***collections***](#collections) helps with working with collections.
- [***colour***](about_colour.md) simplifies using colours.
- [***ds (datascience)***](#ds-data-science) provides tools for:
- data wrangling,
- validation,
- tuning,
- sampling,
- evaluation,
- clustering, and
- parallel processing of machine learning models.
- [***functions***](about_functions.md) manages higher order functions.
- [***hash***](about_hash.md) simplifies and standardizes hashing.
- [***text***](about_text.md) makes working with texts and strings easy.
- [***time***](about_time.md)
- provides methods for interacting with time and date as well as
- progress bars
## *collections*
This module of the package [***atlantis***](README.md) helps with working with collections.
### *`flatten`*
```python
from atlantis.collections import flatten
flatten([1, 2, [3, 4, [5, 6], 7], 8])
```
returns: `[1, 2, 3, 4, 5, 6, 7, 8]`
### *`List`*
This class inherits from Python's list class but implements a few
additional functionalities.
```python
from atlantis.collections import List
l = List(1, 2, 3, 4, 2, [1, 2], [1, 2])
```
Flattening:
```python
l.flatten()
>>> List: [1, 2, 3, 4, 2, 1, 2, 1, 2]
```
Finding duplicates:
```python
l.get_duplicates()
>>> List: [2, List: [1, 2]]
```
**Note:** the ***list*** elements of a ***List*** automatically get converted to ***Lists***, recursively.
## *ds* (Data Science)
This module provides data science tools for:
- data wrangling,
- validation,
- tuning,
- sampling,
- evaluation,
- clustering, and
- parallel processing of machine learning models.
### *KMeans* Clustering
I have used the `KMeans` class from both *sklearn* and that of *pyspark* and was frustrated
by two problems: (a) even though the two classes do exactly the same thing their interfaces
are vastly different and (b) some of the simplest operations are very hard to do with
both classes. I solved this problem by creating my own `KMeans` class that is a wrapper
aroung both of those classes and uses the appropriate one automatically without
complicating it for the data scientist programmer.
### Clustering Optimization
没有合适的资源?快使用搜索试试~ 我知道了~
资源推荐
资源详情
资源评论
收起资源包目录
PyPI 官网下载 | atlantis-2021.7.21.tar.gz (204个子文件)
setup.cfg 38B
README.md 2KB
not-zip-safe 1B
colour_schemes.pickle 20KB
x11_colours.pickle 11KB
svg_colours.pickle 3KB
PKG-INFO 3KB
PKG-INFO 3KB
Colour.py 17KB
Estimator.py 16KB
ProgressBar.py 14KB
_LearningProject.py 14KB
_Processor.py 13KB
KMeans.py 10KB
_Scoreboard.py 8KB
InfluenceSimulator.py 7KB
_get_cross_validation.py 7KB
ProcessController.py 7KB
_ValidationContainer.py 7KB
NearestFinder.py 6KB
_LearningTask.py 6KB
Scheme.py 6KB
_Project.py 6KB
ClusteringOptimizer.py 5KB
_TrainingTestContainer.py 5KB
DataImputer.py 5KB
Imputer.py 5KB
OneHotEncoder.py 5KB
_CrossValidation.py 5KB
_CrossValidationProject.py 4KB
BaseController.py 4KB
MeasurementSet.py 4KB
_Evaluation.py 4KB
_get_model_influence.py 4KB
_drop_bad_columns.py 4KB
_EstimatorRepository.py 4KB
FlexibleNumber.py 4KB
_ColumnImputer.py 4KB
_read_excel.py 4KB
_apply_function.py 3KB
_AutoML.py 3KB
_Task.py 3KB
NumberPart.py 3KB
Imputer.py 3KB
Normalizer.py 3KB
_evaluate_classification.py 3KB
_sample_by_id.py 3KB
_supervise.py 3KB
YearMonth.py 3KB
SingleColumnImputer.py 3KB
_JupyterContainer.py 3KB
_evaluate_regression.py 3KB
Ensemble.py 3KB
_Task.py 3KB
get_date_part.py 2KB
_validate.py 2KB
Series.py 2KB
_train_and_test.py 2KB
UpSampler.py 2KB
add_time.py 2KB
DataSlice.py 2KB
_do_task.py 2KB
_fill_na_and_flag.py 2KB
_join_wisely.py 2KB
List.py 2KB
YearQuarter.py 2KB
_standardize_columns.py 2KB
get_elapsed.py 2KB
_find_good_columns.py 2KB
_map.py 2KB
_sample_per_group.py 2KB
OrderedSet.py 2KB
Timer.py 2KB
_Result.py 2KB
Polynomial.py 2KB
Measurement.py 2KB
_EstimatorRace.py 2KB
_imputate_with_regression.py 2KB
Iterable.py 2KB
Counter.py 1KB
JobController.py 1KB
__init__.py 1KB
_get_training_test.py 1KB
parse_date.py 1KB
_WorkerReport.py 1KB
_colour_schemes.py 1KB
_worker.py 1KB
__init__.py 1KB
ValidationController.py 1KB
_example_1.py 1KB
_bring_to_front.py 1KB
_split_data.py 1KB
Controller.py 1KB
_colourize.py 1023B
PrincipalComponentModelFactory.py 1019B
_get_entropy.py 1003B
_DataSlice.py 990B
_balance_sample.py 981B
convert.py 978B
_create_grid.py 931B
共 204 条
- 1
- 2
- 3
资源评论
挣扎的蓝藻
- 粉丝: 12w+
- 资源: 15万+
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功