-
Notifications
You must be signed in to change notification settings - Fork 1
Question about using local database #6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hello! Yes, I believe the issue is that upstream sequences can only be retrieved for proteins whose IDs are found in the NCBI database. This is because the process relies on identifying assemblies from the identical protein database before retrieving the upstream sequences. Yep, this should be stated more clearly in the documentation, I will update. Best |
Hello! Thanks for the explaination. I wonder will this package be updated with a "full local" mode? In case sometimes we may want to process on unpublished sequences or some genomes don't have annotation on NCBI. Here is a possible way of doing it. Sorry I'm more a biologist than a programmer, can't help much on coding.
Best Y.Z. Zhou |
Hello again! ![]() Best |
Hello again!
I have some question about how to work with the -lbdb option.
I suppose this option takes a protein database as input, so I download the desire genomic protein sequences from ensembl. (I'm working on plant species, also wondering if this source is proper for this package?)
The sequences are concatenated into a single fasta using cat, and piped into makeblastdb. Everything works fine.
And then, I call uorf4u, but the following message pops up.
So here are my issues.
BTW. I think it will be nice to add more about this option in the "Example-driven guide" section in the documentation.
Thanks
Y.Z. Zhou
The text was updated successfully, but these errors were encountered: