| NrRecipes |
Recipes related to the Non-redundant (NR) database of sequences
1. Rebuilding the NR, e.g., after adding new organisms.
NOTE: At some point the format of the NR changed, introducing IDs of the form "xxx*". The code assumes this format, so if you have an older NR and peg.syn file, you must first convert them to the new format (see recipe #2).
NOTE: nr and peg.synonyms are taken as inputs to preserve IDs from run to run and to determine the next free ID. If you are *only* running build_nr to compare against genomes and never intend to install the results, you can supply /dev/null for these inputs to speed up processing.
2. Converting old NR to new NR format
3. Finding the list of sequences that are new (new NR vs. old NR):
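The original command for this recipe is not preserved here. As a stand-in, the following is a minimal Python sketch of one way to do the comparison. It assumes both NR files are plain FASTA and that a sequence counts as "new" when its ID (the first whitespace-delimited token after ">") appears in the new NR but not in the old one. The function names `fasta_ids` and `new_ids` are hypothetical and are not part of the SEED tools.

```python
def fasta_ids(handle):
    """Yield the ID of each FASTA record in an open text handle.

    Assumption: the ID is the first whitespace-delimited token
    following '>' on the header line.
    """
    for line in handle:
        if line.startswith(">"):
            tokens = line[1:].split()
            if tokens:
                yield tokens[0]


def new_ids(old_handle, new_handle):
    """Return IDs present in the new NR but absent from the old NR.

    IDs are returned in the order they first appear in the new NR.
    """
    old = set(fasta_ids(old_handle))
    seen = set()
    result = []
    for seq_id in fasta_ids(new_handle):
        if seq_id not in old and seq_id not in seen:
            seen.add(seq_id)
            result.append(seq_id)
    return result
```

To use it, open the old and new NR files with `open(...)` and pass the two handles to `new_ids`. Only the set of old IDs is held in memory, so the new NR is streamed rather than loaded whole.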
4. Getting FASTA info for all the new sequences (see #3)
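The original command for this recipe is also missing. The sketch below shows one possible approach in Python, assuming the NR is a plain FASTA file and that you already have the list of new IDs from recipe #3. It streams the file and yields the full header and sequence for each wanted ID. The names `read_fasta` and `extract_records` are hypothetical helpers, not SEED utilities.

```python
def read_fasta(handle):
    """Yield (header, sequence) pairs from an open FASTA text handle.

    The header is returned without the leading '>'; sequence lines
    are joined into a single string.
    """
    header, chunks = None, []
    for line in handle:
        line = line.rstrip("\r\n")
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line.strip())
    if header is not None:
        yield header, "".join(chunks)


def extract_records(handle, wanted_ids):
    """Yield (header, sequence) for records whose ID is in wanted_ids.

    Assumption: the ID is the first whitespace-delimited token of
    the header line.
    """
    wanted = set(wanted_ids)
    for header, seq in read_fasta(handle):
        tokens = header.split()
        if tokens and tokens[0] in wanted:
            yield header, seq
```

Each yielded pair can be written back out as `">" + header` followed by the sequence, which produces a FASTA file containing only the new entries.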