Description
We are always looking for datasets and pipelines to use in test cases, so that we can identify bugs and optimize bottlenecks for real-world use cases.
PLEASE NOTE THAT PROVIDED DATASETS AND PIPELINES WILL BE OPEN SOURCE, AS THEY WILL BE PUBLICLY AVAILABLE IN OUR REPOSITORY.
Steps for Submitting:
- Create a branch on your fork of the repository named data/data_submission
- Add a new file named SUBMISSION.md at the root level and add the following:
# Data Submission
Name: [your-name-here]
DataSet: [link to where we can find the Data] <- leave blank if not applicable
Pipeline: [link to where we can find the Pipeline] <- leave blank if not applicable
Information:
Write a short paragraph about what it is, what it's for, how it should be used, etc.
- Create a pull request from your branch to our repository (see GitHub's documentation on creating a pull request from a fork)
- In the description of the PR, add information about where the dataset/pipeline came from, its applications, and an acknowledgement that the data will be made public, such as:
I hereby acknowledge that the information is mine or I have received permission from the owner and I provide it with the understanding it will be made public.
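
For anyone who prefers to generate the file rather than type it by hand, here is a minimal sketch of how SUBMISSION.md could be produced from the template above. The helper names `render_submission` and `write_submission` are hypothetical and not part of this repository; the field layout simply mirrors the template in the steps.

```python
from pathlib import Path

# Template copied from the submission steps; field order matches the issue text.
TEMPLATE = """# Data Submission
Name: {name}
DataSet: {dataset}
Pipeline: {pipeline}
Information:
{information}
"""


def render_submission(name, information, dataset="", pipeline=""):
    """Return the SUBMISSION.md text (hypothetical helper).

    `dataset` and `pipeline` may be left blank if not applicable.
    """
    if not name.strip():
        raise ValueError("name is required")
    if not information.strip():
        raise ValueError("information is required")
    return TEMPLATE.format(
        name=name.strip(),
        dataset=dataset.strip(),
        pipeline=pipeline.strip(),
        information=information.strip(),
    )


def write_submission(repo_root, **fields):
    """Write SUBMISSION.md at the root of the repository checkout."""
    path = Path(repo_root) / "SUBMISSION.md"
    path.write_text(render_submission(**fields), encoding="utf-8")
    return path
```

Calling `write_submission(".", name="Jane Doe", information="...", dataset="https://example.com/data.csv")` from the root of your fork, on the data/data_submission branch, would create the file ready to commit. The name and URL here are placeholders.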