Metadata-Version: 2.1
Name: dask-quik
Version: 0.0.4
Summary: function to make working in dask_cudf and dask quik-er
Home-page: https://github.com/donchesworth/dask-quik
Author: Don Chesworth
Author-email: donald.chesworth@gmail.com
License: BSD
Description: ![example workflow](https://github.com/donchesworth/dask-quik/actions/workflows/github-ci.yml/badge.svg)
        [![](https://img.shields.io/pypi/v/dask-quik.svg)](https://pypi.org/pypi/name/)
        [![License](https://img.shields.io/badge/License-BSD%203--Clause-blue.svg)](https://opensource.org/licenses/BSD-3-Clause)
        [![Docker Repository on Quay](https://quay.io/repository/donchesworth/rapids-dask-pytorch/status "Docker Repository on Quay")](https://quay.io/repository/donchesworth/rapids-dask-pytorch)
        [![code style](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)
        [![codecov](https://codecov.io/gh/donchesworth/dask-quik/branch/main/graph/badge.svg?token=U92M8C8AFM)](https://codecov.io/gh/donchesworth/dask-quik)
        
        # dask-quik  <img src="dask-quik.png" align="right" width="120" />
        Utilities for transforming data using dask and dask_cudf. Most can take either object,
        and will still process.
        
        This utility currently has:
        - cartesian: The ability to create a sparse matrix of the entire universe of data (creating a cartesian product of your data), and "indexize" your data for future
        encoding uses
        - combine: a shortcut for a left merge, and the ability to prune rows based on what
        should be grouped by, max, min, and/or avg
        - dummy: The ability to run just dask functions, using a dask_cudf dummy class
        - split: Splitting your train, validation, and testing datasets with test being the
        most recent value (for recommendation systems)
        - transform: The ability to transform back and forth from pandas, dask, and dask_cudf, and also a workaround to sort by index in dask_cudf
        - utils: random utils to setup the dask_cudf cluster, shrink data types when defaulted to 
        64-bit, and a quick check if there are available GPUs (using nvidia-smi).
Platform: UNKNOWN
Classifier: License :: OSI Approved :: BSD License
Classifier: Programming Language :: Python :: 3.8
Description-Content-Type: text/markdown
