Metadata-Version: 2.1
Name: ua-datasets
Version: 0.0.5
Summary: A collection of ukrainian language datasets
Home-page: https://github.com/fido-ai/ua-datasets
Author: FIdo AI
License: MIT
Description: 
        <p align="center">
          <img src="https://github.com/fido-ai/ua-datasets/blob/main/assets/NaUKMA.png" width="350" title="hover text" alt="NaUKMA FIdo Logo">
        </p>
        
        <h1 align="center">
            ua_datasets
        </h1>
        
        <p align="center">
            <a href="https://img.shields.io/badge/Version-0.0.2-orange.svg"><img src="https://img.shields.io/badge/Version-0.0.1-green.svg" alt="Version"/></a>
            <a href="https://img.shields.io/badge/License-Apache%202.0-blue.svg"><img src="https://img.shields.io/badge/License-MIT-blue.svg" alt="License"/></a>
        </p>
        <p align="center">
            <a href="https://www.repostatus.org/badges/0.1.0/active.svg"><img src="https://www.repostatus.org/badges/0.1.0/active.svg" alt="Repo Status"/></a>
        </p>
        
        __ua_datasets__ is a collection of Ukrainian language datasets. Our aim is to build a benchmark for research related to 
        natural language processing in Ukrainian.
        
        This library is provided by FIdo.ai (machine learning research division of the non-profit student's organization
        [FIdo](https://www.facebook.com/fido.naukma/), National University of Kyiv-Mohyla Academy) for research purposes.
        
        ## Installation
        The library can be installed from PyPi in your virtual environment (e.g. venv, conda env)
        ```bash
        pip install ua_datasets
        ```
        
        ## Available Datasets
        
        #### <a href="https://github.com/fido-ai/ua-datasets/tree/main/ua_datasets/src/text_classification">Text classification </a>
        
        #### <a href="https://github.com/fido-ai/ua-datasets/tree/main/ua_datasets/src/token_classification">Token classification </a>
        
        #### <a href="https://github.com/fido-ai/ua-datasets/tree/main/ua_datasets/src/question_answering">Question Answering </a>
        
        
        ## Contribution
        In case you are willing to contribute (update any part of the library, add your dataset) do not hesitate to connect through [GitHub Issue](https://github.com/fido-ai/ua-datasets/issues/new/choose). Thanks in advance for your contribution! 
        Let's make the Ukrainian language even greater!
        
        ## Citation
        ```bib
        @article{
          title={ua_datasets: a collection of Ukrainian language datasets},
          author={Bogdan Ivanyuk-Skulskiy, Anton Zaliznyi, Oleksand Reshetar, Oleksiy Protsyk, Bohdan Romanchuk, Vladyslav Shpihanovych},
          year={2021}
        }
        ```
        
Platform: UNKNOWN
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Education
Classifier: Intended Audience :: Science/Research
Classifier: Natural Language :: Ukrainian
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.7
Requires-Python: >=3.7.0
Description-Content-Type: text/markdown
