We are very excited to join forces with MLCommons and OctoML.ai! Contact Grigori Fursin for more details!


module:dataset (v3.0.0)
License: BSD 3-clause (code) and CC BY-SA 4.0 (data)
Creation date: 2015-01-28
Source: GitHub
cID: 032630d041b4fd8a:8a7141c59cd335f5

Don't hesitate to get in touch if you encounter any issues or would like to discuss this community project!
Please report if this CK component works: 1  or fails: 0 
Sign up to be notified when artifacts are shared or updated!


This is our attempt to share automation actions from scientific research projects as unified Python modules with a common CLI and Python API to help researchers and practitioners reuse best practices. Our on-going project is to make the onboarding process as simple as possible via this platform. Please check this CK white paper and don't hesitate to contact us if you have suggestions or feedback!
  • Automation framework: CK
  • Development repository: ck-ml
  • Source: GitHub
  • How to get the stable version from this portal via the CK client:
    pip install cbench
    cb download module:dataset --version=3.0.0 --all
    ck help dataset
  • How to get the development version:
    pip install ck
    ck pull repo:ck-ml
    ck help dataset
  • CK automation actions (CLI with the Python CK API and JSON IO):
    • ck add dataset - add program with templates (API)
    • ck add_file_to dataset - add file to a given dataset (API)
    • ck check_size dataset - check size of all data sets and if less than threshold, add tag "small" (API)
    • ck generate dataset - TBD: generate new data sets to cover unseen behavior (API)
    • ck import_all_files dataset - (API)
    • ck prune dataset - prune data sets to find minimal representative data set covering behavior (API)


Style 1   Style 2  




Please log in to add your comments!
If you notice any inapropriate content that should not be here, please report us as soon as possible and we will try to remove it within 48 hours!