Check the preview of 2nd version of this platform being developed by the open MLCommons taskforce on automation and reproducibility as a free, open-source and technology-agnostic on-prem platform.
program:tool-prepare-dataset (v3.0.0)
Copyright: See copyright in the source repository
License: See license in the source repository
Creation date: 2018-03-07
Source: GitHub
cID: b0ac08fe1d3c2615:44431bf0eafd9ceb

Don't hesitate to get in touch if you encounter any issues or would like to discuss this community project!
Please report if this CK component works: 1  or fails: 0 
Sign up to be notified when artifacts are shared or updated!

Description  

This portable workflow is our attempt to provide a common CLI with Python JSON API and a JSON meta description to automatically detect or install required components (models, data sets, libraries, frameworks, tools), and then build, run, validate, benchmark and auto-tune the associated method (program) across diverse models, datasets, compilers, platforms and environments. Our on-going project is to make the onboarding process as simple as possible via this platform. Please check this CK white paper and don't hesitate to contact us if you have suggestions or feedback!
  • Automation framework: CK
  • Development repository: ck-ml
  • Source: GitHub
  • Available command lines:
    • ck run program:tool-prepare-dataset --cmd_key=default (META)
  • Support for host OS: any
  • Support for target OS: any
  • How to get the stable version via the client:
    pip install cbench
    cb download program:tool-prepare-dataset --version=3.0.0 --all
    ck run program:tool-prepare-dataset
  • How to get the development version:
    pip install ck
    ck pull repo:ck-ml
    ck run program:tool-prepare-dataset

  • CLI and Python API: module:program
  • Dependencies    

    ReadMe  

    Tensor shape datasets preparation

    This is an utility program that initializes or updates ck-nntest tensor shape datasets:

    ck search dataset --tags=nntest
    

    A dataset can contain a table file data.csv and if it does, the tool will read the file line by line and produce pair of shape files shape-* and shape-*.json. Set of columns in the data.csv file and exact names of resulting files are depend on a test program the dataset is intended for.

    The program searches for datasets by tag nntest and you have to choose one that is needed to be updated. Then it removes all the existed shape-* files, so be warned if you have added some shapes manually, they will be erased.

    Note: The tool does not update a remote repo, so after execution you have to stage changes and commit them by yourself.

    Run

    ck run program:tool-prepare-dataset
    

    or one can explicitly specify which dataset should be processed:

    ck run program:tool-prepare-dataset --env.CK_DATASET=tensor-fullyconnected-0001
    

    Versions  

    Files  

    Comments  

    Please log in to add your comments!
    If you notice any inapropriate content that should not be here, please report us as soon as possible and we will try to remove it within 48 hours!