Skip to content

streamline process for adding a dataset to the table? #30

@cleong110

Description

@cleong110

brainstorming based on #27...

  1. some sort of interactive form that asks for fields like the ones in the other datasets already listed? (here's the schema generated previously with genson: datasets_schema.json ). Maybe with Google Forms export feature?
  2. perhaps in conjunction with that, a way to parse a bibtex citation to autofill would be nice.
  3. automatically add to references.bib if missing.
  4. find and show (or even autofix?) inconsistencies of index.md with references.bib, maybe see if e.g. (a) there's citation keys that are present in references.bib and also in index.md but without a preceding @ in index.md, or (b) if there's a @whatever in index.md which is listed as @dataset:whatever in references.bib.

Of course too much feature creep could turn the relatively simple website into a clone of something like the HuggingFace dataset website huggingface.co/datasets, which is beyond the scope currently. In that case it'd be better to rework the whole thing as a full-on database with proper infra to support it.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions