-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PrepExternalSchema command stuck at 98% completeness #212
Comments
Dear Community, Update on the issue: It is now showing the following error:
thank you! |
Ah now I see it happend because the external SSD has become disconnected or unmounted during the process. I do not understand why. |
Hello Community, What do you think why it was happen? Is it okay just to remove this locus from the analysis? Are there any steps I need to take to resolve this? Thank you so much for your help!
|
Hello @AnyaKovalenko, Thank you for reporting this issue and all the details. We tried to adapt the N. gonorrhoeae cgMLST schema from PubMLST and found the same issue. Two loci are challenging to adapt: NEIS1452 and NEIS2608. These loci are small, and the sequences have a lot of variation or repeats, which forces the PrepExternalSchema module to iterate a lot to find a set of representative alleles that adequately capture the diversity of each locus. A few differences or a variable number of repeats in small sequences can cause a significant variation in the value used to compare sequences and select representative alleles. The NEIS1452 locus is the most problematic due to the sequence size and content variation, forcing the PrepExternalSchema to compare a lot of potential representative alleles since the alignment score for most alleles is below the threshold. Best regards, Rafael |
Hello Rafael, Thank you so much for your quick feedback and support. It’s truly very helpful. kind regards, |
Dear Community,
I’m encountering an issue with the PrepExternalSchema command. It initially worked very quickly, but after reaching 98% completeness, it got stuck and hasn’t progressed for over 24 hours.
Here’s the command I used:
chewBBACA.py PrepExternalSchema -g /mnt/d/Bacteria/cgMLST/schema-v1/ -o validated_schema/ --ptf reference_training.trn --cpu 16
I downloaded the N. gonorrhoeae cgMLST v1.0 schema (fasta file) from PubMLST as part of this process. Currently, in the output I have 1641 fasta file per locus and three folders : <dummy_dir>, <NEIS1452_temp>, and short. The short folder also contains 1641 fasta per locus. I am wondering if the process might be stuck while analysing the remaining loci (eight), as the total is 1649 loci?
Could there be something I did wrong? I would greatly appreciate any help in resolving this issue. Very looking forward to receiving your feedback. Thank you !
this is how it looks now:
The text was updated successfully, but these errors were encountered: