How do I control the number of QA pairs it generates? #28

Open
CristiZZzz27 opened this issue May 16, 2025 · 1 comment
@CristiZZzz27

After running the code, I’ve got a few questions:

How do I control the number of QA pairs it generates?
No matter how much content I put in the jsonl file, it always ends up making just 5 QA pairs.

If the knowledge content changes next time, do I need to delete the cache and run it again?
It seems to still be using old data from graph.graphml.

Are there any key parameters I should pay attention to or adjust each time I run it?

My API supports the logprobs feature, but all the loss values still come out the same. Do I need to enable hard case mining manually?

Thanks!!orz

@ChenZiHong-Gavin
Collaborator

replied in #27
