DATA: Request for Real World Datasets and Pipelines To Test Our Filters #716

We are always looking for datasets and pipelines to use in test cases, so that we can identify bugs and optimize bottlenecks for real-world use cases.

### PLEASE NOTE THAT PROVIDED DATASETS AND PIPELINES WILL BE OPEN SOURCE, AS THEY WILL BE PUBLICLY AVAILABLE IN OUR REPOSITORY ###

Steps for Submitting:

1. [Create a branch](https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/proposing-changes-to-your-work-with-pull-requests/creating-and-deleting-branches-within-your-repository) on your [fork](https://docs.github.com/en/get-started/quickstart/fork-a-repo) of the repository named `data/data_submission`
2. [Add a new file](https://docs.github.com/en/repositories/working-with-files/managing-files/creating-new-files) named `SUBMISSION.md` at the root level and add the following:

```markdown
# Data Submission

Name: [your-name-here]
DataSet: [link to where we can find the Data] <- leave blank if not applicable
Pipeline: [link to where we can find the Pipeline] <- leave blank if not applicable

Information:
Write a short paragraph about what it is, what it's for, how it should be used, etc.

```

3. [Create a pull request](https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/proposing-changes-to-your-work-with-pull-requests/creating-a-pull-request-from-a-fork) from your branch on your fork to our repository
4. In the description of the PR, add information about where the dataset/pipeline came from and its applications, along with an acknowledgement that the data will be made public, such as:

```text
I hereby acknowledge that the information is mine or I have received permission from the owner and I provide it with the understanding it will be made public.
```
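
For anyone who prefers to generate the file rather than write it by hand, step 2 could be scripted roughly as follows. This is only a sketch: the field labels are copied from the template above, but `render_submission`, `write_submission`, and their parameters are hypothetical names and not part of the repository. The check that at least one link is provided is also an assumption, since the template only says to leave a field blank when it does not apply.

```python
from pathlib import Path

# Field labels copied from the SUBMISSION.md template in step 2.
# Everything else here (function and parameter names) is hypothetical.
TEMPLATE = """# Data Submission

Name: {name}
DataSet: {dataset}
Pipeline: {pipeline}

Information:
{information}
"""


def render_submission(name: str, information: str,
                      dataset: str = "", pipeline: str = "") -> str:
    """Fill in the template; an empty link means 'not applicable'."""
    # Assumption: a submission with neither link is not useful.
    if not dataset.strip() and not pipeline.strip():
        raise ValueError("provide a dataset link, a pipeline link, or both")
    return TEMPLATE.format(
        name=name.strip(),
        dataset=dataset.strip(),
        pipeline=pipeline.strip(),
        information=information.strip(),
    )


def write_submission(repo_root: str, text: str) -> Path:
    """Write SUBMISSION.md at the root level of the local clone."""
    path = Path(repo_root) / "SUBMISSION.md"
    path.write_text(text, encoding="utf-8")
    return path
```

Calling `write_submission(".", render_submission(...))` from the root of your fork's working copy, on the `data/data_submission` branch, would produce the file to commit before opening the pull request.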
