>gi|17230989|ref|NP_487537.1| hypothetical protein alr3497 [Nostoc sp. PCC 7120] MKRYGNLYQEIINFENILIASRQAQKSKRFRDNVLDFNYHLETELIKLQKHLTDKTYQPGAYRTFRLTNP KSRLISAAPYRDRVVHHALCNIIVPIFERAFVADSYANRIGFGTHRALKKFTHSARNSPYVLQCDIRKYF PSIDHIILKELIRRKIKCPDTLWLIDTIIDNSNEQETVIDYFAGDDLLSPITRRKGLPIGNLTSQFFANI YLNGFDKIIKEELKISKYVRYVDDFALFSDDRELLADARLAIEAYLAELRLKIHPIKSQLFETKIGATFL GFRIFSHKIRVRNSNLHQARRRLKRLKTDYAQGKLELNQVTQSMRSWIAHLEHGDTWRLRKQIFTSLHFR RK >gi|23335577|ref|ZP_00120811.1| COG3344: Retron-type reverse transcriptase [Bifidobacterium longum DJO10A] MFVRRAIDHYLKGKRSRRDVTRFLDTHPDLDRLAERIGDEIREGRYRDTRITYFNRIEPISGKHRVIGRE SVRHQIYDHVAVMALQPLFDAKVGRWQTASIPNRGTIDARRAIKRWTRERSSKWFVKLDVRKYYPSIDRT TLKAMLTRDVGDPILLRLVFHLIDRYQGGNGLNIGSYLSQWLANYYLSHAYHWIESPAMTIERISRRTGE ITRRRLITHQLWYMDDLLLIGTSKRDLKIAARRIVHYLKDRLGLDVHEEWNCKRLDLEPIDMVGYTFRPH GRVNIRSGVFLRARRTFSRAARRPMNEYMARRCCSYYGYLRNSDSIQYRRRHRIDSTMRRATRYLSATPI THRKAPPCSRRYPAPNPSKRSATTRAATASRTSASAATSPPSCTRTAMPRGRNTPPTKPIPCAT >gi|27311204|ref|NP_758929.1| ORF37 [Vibrio phage VHML] MRSLGCSFEKIFDFENLLSAAYSCRKGKTKANATLVFFNNLEENIIDTKRADVGRVQNVPLSPFYYSSRK RRLISAPHFKDRVVHRAIYNVIEPLFDKTYIYDSYACRRGERAPTKALTGLQYFIKKVESKHGKAYALKA DISRYFSSIDHQVLKSILEAKIQCQRTLDLLFYIIDNSPCESMGVGIPLGNLTSQIFANVYLHELDRYAK HALGAKHYIRYMDDFAIIHHDKAVLHQWRKDIEEFLHLYLRLKTNSKTQVFPYQRVMAGAWISRVSNYSS HRLLRNAALSELRQNLKSIDLSC >gi|41179367|ref|NP_958675.1| reverse transcriptase [Bordetella phage BPP-1] MGKRHRNLIDQITTWENLLDAYRKTSHGKRRTWGYLEFKEYDLANLLALQAELKAGNYERGPYREFLVYE PKPRLISALEFKDRLVQHALCNIVAPIFEAGLLPYTYACRPDKGTHAGVCHVQAELRRTRATHFLKSDFS KFFPSIDRAALYAMIDKKIHCAATRRLLRVVLPDEGVGIPIGSLTSQLFANVYGGAVDRLLHDELKQRHW ARYMDDIVVLGDDPEELRAVFYRLRDFASERLGLKISHWQVAPVSRGINFLGYRIWPTHKLLRKSSVKRA KRKVANFIKHGEDESLQRFLASWSGHAQWADTHNLFTWMEEQYGIACH >gi|42527768|ref|NP_972866.1| reverse transcriptase [Treponema denticola ATCC 35405] MKRKGNLYHKITEWNNLIAAFYNASRGKRLKPDVLLYEKNLYTNLKTLQNYLINQTVLLGSYRFFKIYDP KERIICAAPFNERVLHHAIINITESVFEKFQIYDSYACRKNKGTQAALLRALYFSRRFKYFLKLDMKKYF DSIPHSKLSLLLTCKFKDKALLHLFNKLIASYSVTEGWGVPIGNLTSQYFANFYLSFFDHYAKEKMNVRG YIRYMDDVLLFSDNLKDIKLIQKKAKNFLSCELDLTLKEEIIGMVKNGIPFLGFLVKPQGIYLSQKKKKR LKKKIKDYVHKFKIAYWTEEEFALHITPVFAHIAISRCRAYCNKYLLT >gi|71065017|ref|YP_263744.1| RNA-directed DNA polymerase [Psychrobacter arcticus 273-4] MFFLRIVMKRIGNLYESVVSGESLWEGYLGAKKSKGGRRGCFQFEKSLGRELNELQEELANNTYKPRPYF KFIVYEPKKREIYAPAFRDCVVQYAIYLRVMPIFDKTFIDQSFACRTGLGTHKAAEYAQDALRRAGPNTY TLQLDIKKFFYSIDRPTLRKLLERKIKDKRLVDLMMLFADYPEPKGIPIGNLLSQMFALIYMNPVDHYAT RVLKPAAGYCRYVDDFLLFGLTRAQALTYRKLLTDFVEQKLKLTLSRSTIANTKRGANFCGYRTWRSGRF IRKHSLYKTRKAVRANKLESVISHLAHASKTHSLQHLLNYAEQQNHGLYCQLPKIYHTRHHQAVERSGRI NGVMRNRCSNVC >gi|78189651|ref|YP_379989.1| reverse transcriptase [Chlorobium chlorochromatii CaD3] MKRKGKLVEQIADLHNLYEAFYKAQKGKQAKRYVCAYRKQLQENLQLLRHQILSGAIQTGKYHAFTIYDP KERVICATPFSQRVLHHAIMNVCHPFFEKHQIAGSFASRKGKGTYAALDKAREYNCCYRWFLKLDVRKYF DSINHTVLQKQLTRLFKDKTLLLIFEQIIDSYSTADHKGVPIGNLTSQYFANHYLSVADHYAKEGLRVPA YVRYMDDMVLWHNEKEELLAMGYMFQTFIAKELLLELKPFCLNATHKGLPFLGYLLFENQARLAPRSKKR FLAKYQRYENNLQSGVWTQQEFAKHALPLFAFTEYAQAREFRKKSLHSFCSLEGVFVRSSKGID >gi|83309559|ref|YP_419823.1| retron-type reverse transcriptase [Magnetospirillum magneticum AMB-1] MLEGQGLPACAHHLVGVAAASMQLVVAPGAIVDTAGAPPRVPAQLGRRRCLRQSPARPVHPRQIRLGVGP ELQQRQPEQQQGLERPGPGRPQMIIQQPSEITVAQLFEAYYECRRHKRTTRSALAFEVALEANLMQLLTE LRAGTWWPAPATVFAITRPKPREVWAAQFRDRIVHHLVYRAINPLFEPAFIADSCACIKGRGTLYAADRL ERHLRSVTEGWSKPAYYLKADIANFFGSIRHDVLFAMLARRIADPTMLELCRRLVFQDVRQGAIVQDAAG TLARVPSHKSLFQTPAGIGLPIGNLSSQFFANVYLDPVDQMVKRRLKLRYVRYVDDMVIVHQDPKVLLAA ADAIRAHLSGLGLHLAESKTFVAPVEKGVDFVGHVIRPHRRSARPKTHRVALARISTMPKAEVPDSATSY LGLFRHSGSRAQIAAVARVAVRRGFRVDRELTKVVVKRKVSKK >gi|83310593|ref|YP_420857.1| retron-type reverse transcriptase [Magnetospirillum magneticum AMB-1] MPEGQGLRARAHHLVGVAAPPDAYGVAPGADGIRRVLPRASRRSSAGVDEGTQSPARPVHPRQDLVRLRL AAELQQRQPEQQQQEQQKPGPSRPHMITQQPSDITVSQLFEAYYDCRRHKRNTASALDFEMVLENNLMDL LAELQAGTWMPGPATVFAITRPRPREVWAAQFRDRIVHHLVYRAINPLFEPAFIADSCACIKGRGTLYGA ERLHRHLRSATENWSKPAFYLKADIANFFGSIRHADLFAMLARRIKDPTMLELCRKLVFQDVRRDAIVKD GAGTLALVPPHKSLFQALPGIGLPIGNLSSQFFANVYLDGLDQMIKRRLGMRHYVRYVDDMVLIHPESKA LLSAAEAIRDHLSGIGLKLAEHKTFVAPVTKGVDFVGHVIRPHRRQGRPRTHRAALRRLMEIPAEDFMPS CNSYLGLFRHGGSRNQIIALARIALRRRHWVAADLTKVGARRIPTRRTEP >gi|90580666|ref|ZP_01236470.1| Retron-type reverse transcriptase [Photobacterium angustum S14] MKSASKSEQKKTCLLEAGRICHTLASEILTCQYQPEPYHHFAITEPKLREIYAPAFKDRIVQMWIVSQLE KAISHLMIDDTYATQLNKGTFAAINKAQKLMRKPHYRVAMQLDIYSYFNHINKARLKDRLVALIETPPQS IGQFHPVKKHILLYLIEQILQQDAARENNLQTNNHRLLAQIPPHKRLSFSQQGVGLPIGSVTSQMFGNFY LNDLDHFCKHTLKIKGYIRFMDDLIVLSDNIAQLKIWKNHIDAFLKQQLLLTLHPHKTKIKVIEHGVDYL GYTVYPHYKTIRPSTLHTFKARLRYFNALLEPDLFPNTAPPDRGVWQRWHKQGLTPALSPTLLKAMQSTI NSYFGLLSHGSHCRLRKSLYHDHFHYLKRYMMPKDHRYSAIKIKKNLPF >gi|113474819|ref|YP_720880.1| hypothetical protein Tery_1035 [Trichodesmium erythraeum IMS101] MKIIERKPRIISAAPYRDRIIHHALCNIIIPIFEKTFIYDTYANRINFGTHKALSRFTKFSCSSCYVLQC DIVKYFPSIDHQILKEIIRRQIKCQDTLWLIEKIIDGSNQQIPVLTKFPGDDLLSSINRRKGLPLGNLTS QFFANLYLNNFDHFIKEELKVKKYLRYVDDFALFSDDKKFLVIARQKIETYLANLCLKIHPIKSQLFQTK KGANFVGFLVLPNRIRIRSENLRAARKKIRQLHAHYRQGNIESSHIQKSLQSWFAHLDHGNTWRLKQKIL TFLNTMGE >gi|119356297|ref|YP_910941.1| hypothetical protein Cpha266_0459 [Chlorobium phaeobacteroides DSM 266] MKTEKNLFESVTAFENVLSAARNAAKGKRETRSVLLFYAQLEDNLWQIIAELKSKTWQPGSFKSFSIYKP KPRLISAAPFRDRVVHHALINVVGPLLEQTFIHDTYANRMGKGTHKAIRRYQHFLCRFDYALKCDIKKYF PSVDHEILKTSLRRRVACNDTLWLIDTIIDNSNSQDDHLHYFQGDDLFTPVERRKGLPIGNLTSQFFANY YLNFLDHFVKERLHCKGYVRYVDDFVLFSDSRAELWRWKEEIERYLEEFRLVLNAQRTELFPTTEGRCFL GQKVFQSYRLLPAENVRRAKKRVQASDVANPAAMKKMHSGWIGHARQANTGNLLHAMGLDKHR >gi|119509703|ref|ZP_01628848.1| hypothetical protein N9414_00100 [Nodularia spumigena CCY9414] MKRYGNLYPEIIKFENILLASRQAQKGKRFRDNVLEFNYNLETELIRLQKELTDKTYQPGAYRTFHLIDP KSRLISAAPYRDRVVHHALCNIIVPIFEKTFIGDSYANRLGFGTHRALRKFTHFARNSRYVLQCDIRKYF PSIDHIVLKELIRRKIKCPDTLWLIDTIIDNSNEQETVIDYFPGDDLLSPVIQRKGLPIGNLTSQFFSNI YLNGFDHFVKEQLKISKYVRYVDDFALFSDERQLLADARLAIEEYLTTLRLKIHPIKSQLFETQIGATFL GFRVFADRIRVRNSNLHHARRRLKGLQTDYTQGRIEQEKVNQSIQSWFAHLEHGDTWQLRQQIFTSLAWS RK >gi|119511289|ref|ZP_01630404.1| hypothetical protein N9414_06729 [Nodularia spumigena CCY9414] MKRYNNLYPQITDFSNLILAARQAQKGKRFRENILKFNYNLEAELAKIKTQLESKTYQPGRYKTFEICEP KHRLISAAPYRDRVVHHALCNIIVPIFEPTFITDSYANRLGFGTHRALRRFTTFARSHRYVLQCDIKKYF PSINHEILKSLLHRKLKCQDTLWLAETIINSSNPQESVIDYFPGDDLLSPLQGRKGLPIGNLTSQFFANV YLNNLDHFVKEQIKAQNYVRYVDDFALFSDDYGFLAAAKLAIEEHLINLRLKLHPVKSQLFETRHGASFL GFRILPDTIRVRTENLRRGRQRLKQLQSDYSQGKIEFKDVHSSIESWVAHITYGDTWRLRQQIFASLVFT RE >gi|120599269|ref|YP_963843.1| RNA-directed DNA polymerase [Shewanella sp. W3-18-1] MNTLEPIRGGNWNNGVNAGLGALNLNNSRSNSNNNIGFRPALDYARSKHLTGCCQRKNEKDALAPAIAEI DLATPAGAPAGCLFEQIYQFENLLNAAYQCRKGKTKSNSTLVFFNNLEENIIQIQNELIWGMYLSSPYHH FYVFEPKRRLISAPSFRDRVVHRAIYNVIEPILDRQYIYDSYACRRGKGTHRGADRAQLFIRRVEKTHSK AYALKADISRYFSSIDHHILKSLVSAKIQCERTKCLLFYIIDSSPSDAHGVGIPLGNLTSQVFANLYLNE LDRFAKHTLKAKNYVRYMDDFVIIHHDKQQLHQWRVMIERFINCQLRLKTNSKTQVFPVAASAGRSLDFL GYRIYANKKLLRKSSVKRIKAKLKIFRKKYSAGEIDIKDINQTIQSWIGHASHADTFSLRVSLLSQPFKR GEHV >gi|126090247|ref|YP_001041702.1| hypothetical protein Sbal_4396 [Shewanella baltica OS155] MTKLFIQNSNLKYHHQLAAELLQCVKASGSSQQKSANALKFPKLCHELSTEILTNTYQPYSYHHFAITEP KLREIYAPAFRDRIAQMWIALQMAPIMENQFIDDTYANRKGKGTLAAIAKVQKLMRQPRHTWGLQLDIYS YFNSINKKELLTQLYNLIYNSNLSALRQYCLANLIEKIIEQDATKLQNERTGDQYLLNQIPLHKKLQFNN TQQVGLPIGSVTSQLFGNFYLNNLDHEIKHTLKVKGYVRYMDDLFILSDSPEKLQQWKEHIEWYLTSHLQ LKLHPTKVHLAPIEEGFDYLGFRVFPHYKHIRKTTISSLKERLRYFNSILDSGATSVFLKSRPSRGRWSK NGIHYAPFYTQLKAMQSTINSYYGLLSQANHCQLRKQIYHKNFGELKRYLLPNNGYYNHFTIKKGLHFNK MAPDTLCWPETQDA >gi|126659397|ref|ZP_01730532.1| hypothetical protein CY0110_15130 [Cyanothece sp. CCY0110] MKRYGNLYPKIVEFENLLKAAKKAQLGKRFRDSVLEFNDNLEGNLLKLQKELKSHTYQPGEYSTFRIYDP KPRLISAAPYRDRVVHHALCNVIVPLIEKSFIPDSYANREGYGSHRALKRFIGFDRSSKYILQCDIKKYF PSIDHEILKQQIRHYLKCSKTLWLIDIIIDNSNEQEPVNDVFPGDDLLTLTERRKGLAIGNLTSQFFCNL YLNKFDHFVKEELKAKKYVRYVDDFALFSDNQDFLITSKQLIEDYLTTLRLKLHPVKTQLFETKYGANFL GFRVLPHQVRVRNDNLRRSRKRLRQLQYDYRYGEISLKEVVQRLSSWEAHLKHGDTELLRRQIFDHWVFQ RENWS >gi|126660098|ref|ZP_01731218.1| hypothetical protein CY0110_30840 [Cyanothece sp. CCY0110] MKRYGNLWQKIIPFENLYSAAKKAQKGKRYRDNVLDFNYNLATELFKIQQELTNKTYQPGQYRTFHLRDP KSRLISAAPYRDRVVHHALCNIIVPIFEKTFICDSYANREGYGTHRALRRFTDFLRTHTCILQCDIKKYF PSIDHQILKQLIRRKIKCQDTLWLIDKIIDNSNPQEPTIQYFPNDTLLTPIERKQGLPIGNLTSQFFANI YLNPLDHFIKEQLKCKAYVRYVDDFALFSSDPDYLKDCRQQIENFLISLRLKIHPVKSQLFSTNIGATFL GFRIFHHKIRVRNQNIHRGRRRLKQLQKDFAQGKINVSDVITSLNSWNAHLEHGDTWQLRQQIFTSLLFK RD >gi|134299090|ref|YP_001112586.1| RNA-directed DNA polymerase [Desulfotomaculum reducens MI-1] MPKTYKNLFEQIYQFDNLYNAYLKARKGKRYIGEVLEFTANLEENLISIQNDLINQTYQTGRYREFYVYD PKLRLVAALPFRDRVMHHAVCNIIEPIFEKVFIYDSYACRVNKGTHAGANRVTSYLRKAQRHWPRVYCLK GDVKQYFPSINHGILKRILHRKISCPKTRWLLNEIIDSSASSDDLNPSGIPIGNLTSQLFANIYLNELDH FIKEDLCARYYVRYMDDFIILGDNKRQLWAVLDEIKGFLDFKLNLQLNGKTGVFPVNQGIDFLGYRIWAT HRLIRKSSKKRMKRKLRSFKKRYAAGKIDKERINSSIQSWLGHCKHANTHNLRRRLLDNFIP >gi|146277368|ref|YP_001167527.1| RNA-directed DNA polymerase [Rhodobacter sphaeroides ATCC 17025] MAKTYNHLWPQIIAFETLVEAWARTSKGRHRQRDVIAFEADLEPNLFAIQESLIQKTYRTGPYHRFFVYE PKKREIASLPLKDRVVQHALVSVIEPIFEARFIDQSFACRVGKGAHKGADTVQRYMREVLREQGQVFALK ADISKYFPSVCHDALRRIIRRRIACPDTLWLIDSILESSAEPGALTPRGIPIGNLTSQMFANIYLHELDH FVKHTLRERRYVRYMDDFAVIHHDKAHLHEVRRACEDFLWAELGLRTNAKTQVFPIGEPGRALDFLGYRI WPTHRALRKDSVNRMKRKMRRMASLYHRGEITWDDIDPVIMSWIGHARHADTYNLRTKVLGGVAFVPPPL HVAAERRANSGHLRPV >gi|148359926|ref|YP_001251133.1| hypothetical protein LPC_1855 [Legionella pneumophila str. Corby] MAEKEHKHVEQGANLESLFQAYFDCRKNKRNTMNALRFETDYESNLIALRDELNSSTWHPGRSIAFVIDK PVKREIFAADFRDRVVHHWLINQLNPLFEKTFIYDSYASRKGRGAHLGIARAAQFIRKCSLNYQRDCYVL KLDIMSFFICINRRILWEGLRCFIERHYNQSDKKLILEVARKIVENEPTSNCFIKGKRRDWQDFPKDKSL FYAKPHCGLPIGNLTSQVFANFYLNPFDHYIKHNLGVRFYGRYVDDFILVHEDKMFLKSLIPQMEQFLQE ELELEIHPRKRYLQHYRKGIPFLGVILKPHCIYAGRRIKGNFYDAITKHNAVIKDHKPTIDEQMAFLCSI NSYLGILSHYQTYRLRKGMLKKHLSIWWWNLMFFSGGCAKLVAKQRTAR >gi|150390313|ref|YP_001320362.1| RNA-directed DNA polymerase [Alkaliphilus metalliredigens QYMF] MKRYGYLYEQIYDFENLYFAYLEARKDKRFRDEILKFSANLEENLIQIQNELIWKAYKVGRYREFYVHEP KKRLIMALPFKDRVVQWAIYRVLNPLFEKTYTEHSYACRIGRGTHQAAKKLQYWLRQIDRKPQKYYYLKM DISKYFYRVDHSIALKILRKKIKDKDVLWLMEEIIQSEDMAFGLPLGMEPGDCPKYMRLHDKGMPIGNLT SQLLANIYLNELDQFCKHKLQIKYFIRYMDDFIVLHHDKKYLHRLKVEIENFLNSELELHLNRKTCIRPT PVGIEFVGFRIWPTHMKLKKKTAKKMKRRLKYLQKAYGRNDVTFEDVNTCVQSYLGILQHCDSYYLKCSV LKNLTLIRGS >gi|153810696|ref|ZP_01963364.1| hypothetical protein RUMOBE_01080 [Ruminococcus obeum ATCC 29174] MKIKHVFDLIFSDDNLYEAIQDASKGRRYNKDVLRVQHDIWNVIEQIQQDVRSGKYTIDKYYIFYVYEPK KRMIMSITFYHRIVQWAIYRVINPLLVKGYIKDTYGCIPGRGSLAAMQRLRYWIKSVEHKPGTWYYLKLD ISKYFYRISHEVLKEILARKIKDQQLLQVLYNIIDCQYTPFGLPPGKGPGEVPLEERLYDVGMPVGNLLS QVFANIYLDALDQFCKRTLCIHFYVRYMDDIIILSDSKEQLHMWKDEIQKFVETTLRLSLNQKTCIRPIS QGIEFVGYRIWPHYVTIRKSTTLEMKRHLRRKVEEYNAGLIEMEVVTATLKSYLGMLDHCDCKEFRKELI DSVVLDRNKVVGEIAYEVENYSEAMVCGL >gi|153815102|ref|ZP_01967770.1| hypothetical protein RUMTOR_01327 [Ruminococcus torques ATCC 27756] MNYEEIVCDANNLYRAYKASVKTSKWKETTQKFMMNFLRYIFSIQDDLINRTLQNGPTQEFTLFERGRVR PITSIQIRDRIIRHVLCDEILLPEVKKHIIYDNCASIKGRGISHQRDRFEVHLRKYYRLYGNEGWILFGD FSKFYDNIIHEIAKRELLRLFDDDEFIDWLLTQIFDGFKIDVSYMTDEEYATCMTDTFNKLEYRNIPESK LTGEKWMEKSVNIGDQLSQVIGIYYPYRIDNYVKYVRSQKFYGRYMDDWYIMNPSKEELLDLLDHIHQIA EEYGIHINRKKTRIVKISSTYKFLQIKYSLTDSGKVIKRINPKRVTAMRRKLKRLAVKVKNEEISYENVE NMFRSWMGGFYKLLSKEQRKNLIGLYEDLFEKSIEIVNKKMIIVDKTR >gi|158333546|ref|YP_001514718.1| hypothetical protein AM1_0345 [Acaryochloris marina MBIC11017] MRLISAAPYRDRVVHHALCNVIIPVFEPTLISDSYANRWGYGTHRAVQRFTQFARSSRYVLQCDIRKYFP SIDHGLLKAGLRRKIKCPKTLWLIDQIIDHSNPQATVLEYFPGDDLLTPLAHRRGLPIGNLTSQFFANVY LNGFDHFVKEQLKTKKYVRYVDDFALFDDSREQLVQARQAIEAYLNQLRLKIHPIKSQIVATHHGVAFLG FRIFPAHIRVQNANIRRSRKRLRKLQKAYAQGEISATMIDQSIQSWVAHLSHADTWQLRQRIFADLVFAR Q >gi|160885942|ref|ZP_02066945.1| hypothetical protein BACOVA_03947 [Bacteroides ovatus ATCC 8483] MSEQLELFIGHPPGDEPGKTKIADATASSSWNVNFNNGNVNTNNRQNANRVRPLAATGNIIYDILLSSIF EASEDCARQKRTSTDCVEFYNDYQSALVRLWYSIIYGEYVPDFSKVFIRTYPVYREVFAAAFIDRVVHHW IALRIEPILEERFREQGNVSKNCRKGEGCLSAVHYLNNMIVEVSENYTADAYIFKDDLFSFFMSISKSLV WEMLNIFVRDNYKGDDIECLLYLLAVTIFHCPQNKCIRRSPVSMWDKLPSNKSLFHNDPDRGVAIGNLPS QLIANFLASVYDYFVMEILGFMYYVRFVDDFCIVVKSPEEILSKVHLLDGFLKEQLLLRLHPRKLYLQHY KKGVLFVGAFILPGRIYVSDRVVGNTYNAVRKFNRIAENGFAEAYVEKFVSTMNSYYGLMKHFATYNIRR RIAAMLLPEWWEYVYIEGHFEKFVLKNKYNHRKQLIKHIKRHGSKKYLTAWDC >gi|160888221|ref|ZP_02069224.1| hypothetical protein BACUNI_00629 [Bacteroides uniformis ATCC 8492] MNFNNGNVNNDNKNNDNNNRVRSSLAFTLKPMKQKKMETNSLFAPEETGGISLEEVFEAYFACRRKKRGT HNALAFEADYERKCIRLWREINAGTYKPSRSVAFIVFKPVQREVFAADFRDRVVHHLIARKIEPLLEAEF IDDSYSTRKEKGTLYGIRRVEEFVRECSENYTKDCYVMKLDVRSFFMDIHKKLLYERMEKFLRETYRAND LEMLLYLLRETIFNRPEKNCIRRCPASHWKGLPRDKSLFHSDGMHGLPIGNLTSQMSGNFFLSPLDHLIT ELWGIPFYGRYVDDMALVHTSKEYLLERRKRIREWLAANGLTLHPRKMYLQHYSKGIPYIGGVVKPGRTY ISNRTVGYFRDTLAHYNHLAQEPGYVETHGEQFVASVNSYLGQMKHFATYNIRKKLLLEGGIAPQWWKVM YVSGHLEKTVLKKRYTARARLRREMRREIVSYQLIRKEAV >gi|160932588|ref|ZP_02079978.1| hypothetical protein CLOLEP_01427 [Clostridium leptum DSM 753] MKSYNHLYEKTISETNRRYALSQAKHSKRFRKIMKHRHMSDDAAVEQSLDWIVNYENAEHVPVYIYDGIT RKERTIIVPTMEELLVQHCIVNAMKPMFCKGMYEHSYASLPGRGAHKGKQVIEKWIRTDSKNCKYVLKMD IRHFFDSIPHDRLKAKLKKTVHDEKMLELLFRIIDVTEVGIPLGFYTSQWLSNWYLQGLDHFIKEQLCAV HYMRYMDDMVVFGSNKRVLHRMRQAISDYLEMELGLELKANWQVFRFSYGNNQGRDLDFMGFRFYRNRTI LRKSIMYKATRKARKISKKEKATILDARQMLSYLGWIDCTDTYLMYWKWIKPCVSFQQLKRKVSRYDKYD EKRVYQKLVSLYTAKGGKSHGVKLQICREHSPTDCT >gi|160938970|ref|ZP_02086321.1| hypothetical protein CLOBOL_03864 [Clostridium bolteae ATCC BAA-613] MKTVKGLHEKMGTFENANTSFHQAAKCKRYTDEVLAFSMVKEEELLRATEEIQNLTYRQGEYKIFKVFEP KERLIMALPFYDRVVQHMICNAIQPVFENGFYYHSYACRTGKGMHAASDTLYQWMYETEVKQGLRMYAFK GDISKYFASIPHDKLKDENRRYIGDKKALMLMDDIIDHNGILPDGVGIPVGNLTSQLFANVYGNKLDKFC KHVLHIPYFVRYMDDFIILSDDLEQLKEWVKRIEEFLENEMLLHINPKSTILYAGNGIDFCGYIHYADHK KVRKSSIRKLKQDVKAYELGELPPEEFNRKYESRKGHLGHADTYHIAKAVEYELLFYEWERLEAAA >gi|161789197|ref|YP_001595703.1| RNA-directed DNA polymerase [Vibrio sp. 0908] MNARNRFPLRGGNWNNGSNAGLGALNLNNHRSNSNSNIGFRPALENARSTPLKGGCQCKIEKDADASAIA ETVPRPVDASAGCTYEKIYSFENILSAAYHCRKGKTKAQQTLDFFNNLEENVIQIQNELVWGMYSVSPYR HFYVFEPKRRLISAPSFKDRVVHRAIYNIIEPMFDVTYIYDSYACRRDKGTHRGADRAQAFIRKVERQSG KAFALKADISKYFSSIDHFILKRILDRKLKCQKTKELLFYIIDSSPSDALGVGIPLGNLTSQLFANIYLN ELDRFVKHTLRAKRYVRYMDDFVIIHGDKRKLHEWRKDIETFLAGSLRLKTNSKTQVFPVSPAKGRSLDF LGYRIYSTHRLLRKGSVKRIKSKLKKFHRQYEAGEISLKDINPSIQSWLGHAKHANSDGLKRAIFSTPFV RN >gi|167754333|ref|ZP_02426460.1| hypothetical protein ALIPUT_02626 [Alistipes putredinis DSM 17216] MRISGLANDFQKGKNINFESNDPASRQKISPQKGVSRRFRASRTAERPEYEKQSVEMKRIGNLYEKIISL DNLRLADEKARRGKLRSYGVLLHDKNREANILALHETLKNHTFKNSEYSTFTIYEPKERIIFRLPYYPDR ILHHAIMNILEPIWVSVFTKDTYSCIKGRGIHGAMRNVKRAIKDRENARYCLKIDIRKFYPSIDHDVLKT IIRRKIKCKDTLALLDTIIDSTDGVPIGNYLSQYFANLMLAYFDHWIKEEKRVRNYFRYADDMVFLASTK EELHILLADIKKYLAALKLTLKGNEQIFPIAENRADKHGRGLDFVGFVFYHNQTLMRKSIKQNFCRMAAR LNKKLNISARDYKQKLCSWYGWAKVSNSKHLLKTIIKSQFYDTFVLRCKAV >gi|167758195|ref|ZP_02430322.1| hypothetical protein CLOSCI_00533 [Clostridium scindens ATCC 35704] MEFEANREDNLFRAIEVLKDGTYQPGEYRVFKVWEPKERIIMALPFFDRVIQHMIVNFIEPIFEKRFLFH SYACRKEKGVHEASKTLSKWLYELEVVQGKKIYAIKGDIHHYFQSVSHDILKAEIRRYISDKALLKILDR IIDHNGIFPPGVGIPVGNLTSQLFANVYLNKLDQFVKHELKVKYYVRYMDDFIILSEDPAELRRLLAIIE EFLRRELRLELNPKTTILAAKNGINFVGYIHYKDHKKIRKDARRRLTKLLKAFETGEVELEYFDRSIESR FGHMEHADTYNYIRETKKTIEELKERKAV >gi|167759158|ref|ZP_02431285.1| hypothetical protein CLOSCI_01505 [Clostridium scindens ATCC 35704] MGKKSVNNLYKPMLEHSNVEQKFHKAAKGKTERPDVAVILEPTNIQRHVKNVVEQLENTAPEGYNVSHPE KAWKPSRHGKVRINEGTSRKVRMIEKPRYNYEQVIHHIVVSACYDIFMKGMYEFSCGSVPNRGAHYGKKY IERWIQRDKKNCKYVLKMDIRHFFESVDHDVLKAWLKKKIRDERMLYILELIIDGSEVGLPLGFYTSQWL SNFMLQPLDHFIKEQLKAVHYIRYMDDMVVFGKNKKELHRMQQEIERFLREKFNLQMKGNWQVFRFDYTE KKTGKRKGRPLDFMGFQFYHDKTILRESIMLSCTRKVNRVAKKEKITWYDATAILSYMGYLSNTDTYDMY LQRVKPYVNVKKLKKIVSKHSKRKERENHERMERSVRNGGRTAGGVRHSSITDNGVSETQYQESDERGCR RKENHRMAARGA >gi|167763773|ref|ZP_02435900.1| hypothetical protein BACSTE_02153 [Bacteroides stercoris ATCC 43183] MGVVKTEYGLCYTADTCFYQYSDFEDCGLYVGDTGKIFISQAKKIKNVYHLIYECSNLIRAQYKAQQGKG ERTEISKFNENILENLDSLYWDLRNETYTPGEYRIKVIYEPKERVIMIAPFFPDRIVHHCIINVLGRYWT NFFIANTYACIKGRDIHKCMEDVHTALIIDRKGTRFCLKIDIKKFYDNIDHAALKRIIRYTIVDEQLLRL LDKIIDSNGKDKGLPIGNFTSQYLANLYLAYFDHWVKEELAKIVMKRFGVKIYYYRYMDDMVILCADKEA LHFVLDMMGLYLGGELKVEIKSNWQIFPVDARSIDYVGFKQNHYGILLRSGILKRFYKKFHRTINKYEIK DETDIKHFFPSEYGWIIRCSEEHSKFIFNNCLNDGSKCFDYRAAG >gi|167841733|ref|ZP_02468417.1| Retron-type reverse transcriptase [Burkholderia thailandensis MSMB43] MRLERNLRRLYDELADGSYTPGRSKCFVITRPKPREVWAAAFRDRIVHHLLYNRIGPRFERSFIADSCAC IKGRGTLYAAERLESKVRSITQNWSIRAFYLKCDLANFFVSIDKCILLDLLLAKISEPFWRALTERVLMH DPRDDFEYHGDPVMMQLVPPHKRLMEQAENLGLPIGNLSSQFFANVYLDVLDQRAKHILGARHYIRYVDD FVFLHDSPARLNEILADVTTFLPERLGVRINQRKTILQPVDRGVDFVGQVIKPWRRETRKRTRNEAYRRV AATPSEDLMPMANSYFGLMRQASASHYDRAQLANVVRSRGKAVDGALTKTYRSSNA >gi|171058937|ref|YP_001791286.1| RNA-directed DNA polymerase [Leptothrix cholodnii SP-6] MKRIGNLYAQATCLDALHQGYVDARKGKRARFACHAFERRLGAQLSDLAQSLEAGTYAPRPYNTFMVHEP KPREISAPAFRDRVVQHAVYNVIQPIFDRTFIDQSFACRPGAGTHAAADYVQHGMQISRPDSYTLHLDVR RFYYSIDRGILRALVERKLKDRRLVDLMMAFAEMPGPVGLPIGNLLSQLHALIYLNPLDHYIKRELGVRL YCRYVDDLLLLDLSRDEAIAHRDAIEHYLADHLRLQLSKATMAPTRRGVNFVGYRTWASRRFVRKHALST FRQAARRGDLQSVVSSLGHALKTASHHHMVNHLQEHHHALHHQLPKSHRRLHDPCAAAA >gi|186684985|ref|YP_001868181.1| RNA-directed DNA polymerase [Nostoc punctiforme PCC 73102] MKRYGNLYPQIIDFENILLASRQAQKGKRFRDNVLDFNYHLETELIRLQEHLKDKTYQPGAYRTFHLINP KSRLISAAPYRDRVVHHALCNVIVPIFERTFIADSYANRIGFGTHRALKKFTHFVRNSRYILQCDIRKYF PTIDHITLKELIRRKIKCLDTLWLIDAIIDNSNEQETVIDYFPGDDLLTPVTRRRGLPIGNLTSQFFANI YLNGFDHFIKEQLNISKYVRYVDDFALFSNDREFLADARFAIEEYLAQLRLKIHPVKSQLFETKIGATFL GFRVFSDRIRVRNSNLHQARRRLKRLKTDYAQGKIELKEVTQSIKSWVAHLEHGDTWQLRKQIFTSLCFR RK >gi|187929429|ref|YP_001899916.1| Retron-type reverse transcriptase [Ralstonia pickettii 12J] MATRTTRTRTTRTGFASSEYQCDAEGLLSFEALVQAYFDCRRTKRNSRNALEFEQDLERNLGRLYGELRD GSYRPGRSICFVVTRPKPREVWAADFRDRVVHHFLYNHIGARFEDAFIAGSCACIKGRGTLYAAEFLESG IRSITRNWSRQAYYLKCDLSNFFVAIDKTILLDLLLAKVREPFWAWLTELVLMHDPRTDFEFRGDPKLLE KVPSHKRLMEQPSHRGLPIGNLSSQFFANVYLDVLDQRAKHQLKAKHYVRYVDDFLFLHESPARLNEILA DVTAFLPARLGVQINPRKTILQQIDRGIDFVGHVIKPWHRYTRKRTVNEGLRRVAAAPAADVHVLANSYF GLLRQAPASHHDRAQLANVVRSRGRAVDKQFTKTYRASQQEKS >gi|189425132|ref|YP_001952309.1| RNA-directed DNA polymerase [Geobacter lovleyi SZ] MKAVYSPNLPVSVRFSACHAAWLQARRGKKPSANQLLFEARWLDNLHQLYSSLRAGCWQPAPTVCFTVTH PKTREIHAPAFADRIVHHLLVDRLQRLYEPVFVYDSYANRTAKGSHAAVDRLQQMIRRRNGQGWYLQLDI HNYFNSIHRPTLYALLCRRLDLALQKGKLADSQRLALRSLCHKLLARKSREIERPGAAPSSVPPHKRLRN ARPQCGLPVGNLTSQFFANVYLNELDQFIKHQLKVRNYLRYVDDFVLLADSKEQLRTWQAEIAAFLETRL QLRLKDAVVLAPLHHGVDFLGYRVYCGHRLVRPRVVKHCCKKLSGWWQQYGQAAQMVSTESFSKLQALLG SYWGHFCHANSVRLRHALFKRFNWLHNFFQLHADAGLVVKQPARHKLARRRNAMKIVYT >gi|189460481|ref|ZP_03009266.1| hypothetical protein BACCOP_01122 [Bacteroides coprocola DSM 17136] MKRIGNLYKTIISVENLREADRKARKGKTHTYGVRVHDKNREANILALHEALLTKTFKTSPYDVFTIFEP KERLIFRLPYYPDRIVHHAIMNVLEPIWVRTFTHNTFSCVKGRGIEGCARHIDKIIEKYRGKPMYCLKID ITKYYPSIDHETLKKIVRRKIKDKDLLWLLDEIIDSAQGLPIGNYLSQYLANLFLCYFMHRVNEVLKLDA AEYADDITFFATSKEQLREAFKEIKRILEEELRLKIKGNYQIFPIAKNRYDRNGRALDYVGYMFFREQKL IRKNIKKNFCHATARLNRRKPPLDAKAYKQAVAPWLGWAKHSDSKHLLKTIIKPCYYDSIL >gi|189463015|ref|ZP_03011800.1| hypothetical protein BACCOP_03717 [Bacteroides coprocola DSM 17136] MRREGYIIEEIIEYSNMSEAFDSVLRGTDRKRSRQGRFLLAHREKIITELTASIADGSFRLGGYHEREIE EYGKKRILQILSMKDRIAVFAIMNVVDRHLQKRYIRTTGASIKRRGTHDLMNCIRTDLQKNPEGTLYAYK FDIRRFYDNARQDFVMWCFRRVFKDKRLLVLLERFVKLLPEGISFGLRSSQGAGNLLLSVFLDHYLKDKY GVRYYYRYCDDGLVLGKTKAELWKIRDAVHGQMGKIDLEIKPNERVFPVEEGIDFLGYVIRPDYVRLRKR IKQKFARKMHEVKSRKRRRELIASFYGMTKHADCNKLFKKLTGKEMRSFKDLNVAYKPEDGKKRFPGVVV SIRELVNLPIVVKDFETGIKTEQGEDRCIVAIEVNGEAKKFFTNSEEMKNILAQVKEMPDGFPFETTIKT ETFGKGRTKYVFT >gi|194335165|ref|YP_002019731.1| RNA-directed DNA polymerase (Reverse transcriptase) [Prosthecochloris aestuarii DSM 271] MKRVGLLFERVVAFENLLHATRQAARGKKSQLRVAHFLFHQEKECLRLQTELKQGIWQPSGFRVFEIREP KPRRISAADFQDRVVQHALCNILGPLCERRLIFDTWACRRGKGSHLAMKRAQAFSRRFPYFLKCDIRRYF DSVDHTILKRLLWRLIKDKPVLNLLDRIIDHPLPGALPGKGLPIGNLTSQHFANLYLGELDHQLKDRMGV KAYLRYMDDMLIFADDKSRLHELVTGIEDFVKQHLQLSLRPSATLVAPVSEGVPFLGFRIFPGLVRVNGQ ALRRFRHRLRLHEKAYQTGKMDVESLTASVQSMIAHLQHADTHRLRQSLLSSSCALG >gi|194337212|ref|YP_002019006.1| RNA-directed DNA polymerase [Pelodictyon phaeoclathratiforme BU-1] MKTSKNLFQSIVTFENVLSAAQKAAKGKRENQSVLHFFTFLEENLWQILSELRTKTWQPGSYKTFSIYKP KPRMISAAPFKDRVVHHALITIVGPLLERSFIFDTYANRTAKGTHKAIERYQHYLKKYAYVLKCDIRKYF PSIDHEILKSLLRRKIACADTLWLIDTIIDNSNIQAEHFHYFPGDTLFTPHERRKGLPIGNLTSQFFANY YLSFLDHYVKEVLRCKGYVRYVDDYVLFSDSKDELWEWKKAIEEFLQQFRLTLNSGRTELYPATEGKCFL GQKVFQSYRLLPSANVRRAKKRIQCTLLAKPETLQKSLAGWVGHARQADTRNLLRSLGLVEKS >gi|194337359|ref|YP_002019153.1| reverse transcriptase family protein [Pelodictyon phaeoclathratiforme BU-1] MKRQGQLLEEIADLKNLYEAFYKAQKGKVSKHYVGAYKKQLPQNLQRLQQQLFSGEVETGGYHTFTIYDP KKRLICATPFSQRVLHHALMNVCHASFEKQQIVTSFASRPGKGTYAALDKAREYHRHFRWFLKLDVRKYF ESIDHSILKQQLYRMFKDKNVLLMFDNIIDSYATEAGKSVPIGNLTSQYFANHYLLVADYYVKQRLCIPA YVRYMDDMVLWHHDKEALLEAGYRLQDYLARELRLQLKPFCLNESRKGLPFLGYLLFPGSVRLARRSKKR FIQKSWLYEGYLQTGRWSQKEFANHAMPMVACTEYANAREFRKNVYSAMVAIEENGHQSWVRTV >gi|218295563|ref|ZP_03496376.1| RNA-directed DNA polymerase (Reverse transcriptase) [Thermus aquaticus Y51MC23] MDLEDNLFAIVEALNTRTWRTGPYRYLFVRDPKPRQVVAVPYSNRVVHHALVSVLEEIYEPRFIYDSYAC RKGKGVHAGVARALKFIRAVSRKGPVYALKADVAQYFASIDRDRLLALLSRYVADPDVLWLCEEIACSYP GPGLPLGNLTSQLWGNVYLHELDLFVKQTLRERYYIRYMDDFVILSNDKAHLHRLRREIETFLRDRLGLR LHPKTQVFPVRERYGRPLDFLGYRIYPDRVLLRKRTILRIRRALRALSRKAKDHPDLAARLRRTANSYLG LVKHSANHNLYAWVAQFLTDSVIELGEI >gi|218961492|ref|YP_001741267.1| RNA-directed DNA polymerase (Reverse transcriptase) [Candidatus Cloacamonas acidaminovorans str. Evry] MPKRVGYLWEKLTSWQNLYLAYKNACKHKKSKYETAEWMFYCEKNLWELQKELINGNYRPQPYRYFTIKE PKERLISVAVFRDRLVHHSLINVIEPYFESIFIKDSYATRKGKGLHLAVLAVQKYSRQYPWFLKLDIEKF FNNIDHNILLKLISSKIKDPMIINLCSIILKNQNLSMNHNEEIGLPVGNLTSQFFANIYLNQLDHYIKQN LGYKGYVRYMDDFILFSENKDKLKSDLLLIKYFLSNILKLKIKDKSIQMNKVNQGIPFLGYRVFPKLIRV SNINLKRCLQNMQKREKEYIRGKIEIEKLYQSTRSRLGFISFANTYYLQKLIWGGVHKAEPTV >gi|220934786|ref|YP_002513685.1| RNA-directed DNA polymerase [Thioalkalivibrio sulfidophilus HL-EbGr7] MRAEYDRPGGVSSESESPARSHFKHPNGGSVGKRHKRLIEAIVDWDNLQEAHRLARRGKRDRHEVATFEA NLWEELGALQMEMLWGSYQPGRYRSFLVYEPKRREILAAPYRDRVAQHAICTLCGPIWDAAMIDDSYACR PGKGTHVGATRVEQWLRGMTAAGGAVWVVKMDVSKYFASIRHDLAKAVVRDKISCPATLQLIDAIIDSTA DPADPDPVGIPVGNLLSQWIANLVGNRIDQWAKRELRLKRYARYMDDMVVLVRTKQEALTIRDQFDDKLA SMGMRFSKASVLPASRGVNFLGYRIWAHKRLLRRDSVRRIKRNLKAMRWQYARGGIGLEEVRQRVASWVA HADHADTETLKRRVLSQAVFKRRSRSD >gi|225163615|ref|ZP_03725922.1| RNA-directed DNA polymerase (reverse transcriptase) [Diplosphaera colitermitum TAV2] MPRKHRHLFEKVITLENLFAAAENASRGRSGKVPVARGFAELEKTVVTLRDELLAGTWQPGRYYYFTITD PKEREVAAAPFRDRVVHHALVRVLEPIFEPRFIADSFACRPGKGTHAALARAREFTRRHRYCLKCDIKKY FPNIDHALLLREVGRAVDDARVLELIGRILASHADGAAQEWRAGAGLFDVEQRPRGLPIGNLTSQFLANV HLHPLDLFVKQTLRVKGYVRYVDDFLLFGDDRAALKAHGQRVREFVRTLRLRVHPDKFRLSRTEQGVDFV GFVAFPDGRIRVRDSNVRRFTRRLRRQAWCVRTGRMDFDDLCQRACSWAAHAGHAQSRGLLQDIFSGIFH LPNKGSWA >gi|225419931|ref|ZP_03762234.1| hypothetical protein CLOSTASPAR_06272 [Clostridium asparagiforme DSM 15981] MKRYGNLYEQIYSMDNLRKAHQNARKGKGWYEEVKAVDADVEGYLKRLQEMLINHTYQTSPYEKFIKHDS GKDREIFKLPYFPDRICQWAILQVIEPYLMRHMTKNTYSAIPKRGIHAALHDVQDAMWKDVPNCQYCLKL DVRHYYPSINHDILKAKFRRVFKDGELLWLLDEIIDSICTANIEDIRDLWFFDEDVDEETGIPIGNYLSQ YCGNFYLSDFDHWIKEEKRVKHYFRYMDDIVIFGSSKEELHALKREIDVYFMQELRLTVKGNWQVFPSYV RGVDFVGYRTFLNYTLLRKSSCTNFKKKMVAIRKKTAGGQMMNYSEWCSVNSYKGWLKHCDSYRLRKKYI APIQDDADRYYRDVVKTKKYKRKAA >gi|227485876|ref|ZP_03916192.1| RNA-directed DNA polymerase (reverse transcriptase) [Anaerococcus lactolyticus ATCC 51172] MKYYRDLTEFESLRKAFNKAKLGNREKESVARFENNQIEAILYLQYLLRTGKYKTSEYYEFYVYEPKKRL VKTNSFKDKVAQQALCEEVVRPILKNVLIKDNYASQPKKGTHFGLERLQGFLRNYYFSRKAKMEKERRSE GKRPSEEDIKSYSTGYVLKCDIKKYFYNIQHKELKRMVRRYFHEQNVRWLINHIIDSDVDPGIPIGNQLS QWLALLFLNDMDHLIKERLGIKYYGRYMDDFYLIHEDKDYLKYCLGEIEEYLKGIGLELNQKTQIFPLRH GIDFLGFHTYLTESGKVVRKLRRGSKNRMKKKIRKYASMLEKGEITPDEVEKSYKSWKAHASHGDCYYLI KKMDQYYNQKIVKEKLNGAKIK >gi|237710620|ref|ZP_04541101.1| reverse transcriptase [Bacteroides sp. 9_1_42FAA] MNIGRNDIDWRNLSHDEIDRIIAERIEADDRRIEASGGKKPKRVGYILERIAEINNLREADREAQDGKVK KNRFIRRHNLHPEEDLRALQLMILTLDFPAPDYSVMKVRSDAGKVRDIVKQKYFPWRILHHAIMRVIGED IYKSLIYDTSACIKGKGLHFGVRRMKSFLRRYPEYKWFVKTDFKKFYQSILHELIVAALRRKFKDERFIK LIEIAVLSYDSGTELIDVLENEVERKKRCSNWSIYKSTYRQFCGKSDRSYNEGEISCQMPA >gi|238027140|ref|YP_002911371.1| retron-type reverse transcriptase [Burkholderia glumae BGR1] MTERSINLVVPLAHKATAMRNEDTDRQRAERSSAVSRLSNRPGDVDSTILPAGRGARTSTTATRTTTTRT TSSEPEPSADRGPISFAELVEAYLDCRRTKRNSNAALAFEIRLERNLRRLYDELVSGNYTPGRSKCFVIT RPKPREVWAAAFRDRIVHHLLYNRTGLRFERSFIADSCACIKGRGTLYAARRLESKVRSITQNWSRPAFY LKCDLANFFVSIDKPILLKLLLAKIPEPFWRTLTERVLMHDPRTDFEFHGDPQMLALVPPHKRLLEQAGH LGLPIGNLSSQFFANVYLDVLDQHAKQALGARYYIRYVDDFLFLHESPARLNEILVDVTAFLPACLGVRI NPRKTILQPIDRGVDFVGQMIKPWRRETRKRTRNEALRRAAAVPIDDFLAVSNSYFGLMRQATASHQDRA ELANIARARGRAVDSRLTKTFRGSR >gi|238909111|ref|YP_002939578.1| hypothetical protein EUBELI_10007 [Eubacterium eligens ATCC 27750] MIRWVPKHMKDTGDLFSKICDMDNLRKAHKNAKRGKGWYAEVKRIEKDLDHYLRRLQENLIEHRYHTSDY ETFVRKEGSKEREIYKLPYYPDRICQWAILQVIEPYLLNSMTKDTYSAIPNRGIQPIINQLRGYKKKIKK DGKVVSEKWIPSILVSDPEATKYCLKLDVRKYYPSIVHDVLKAKYRELFKDEELIWLMDEIIDSISTCPA TEENIEILQRLGVAVNIIIDDNGREFVDGVGIPIGNYVSQYDGNFNLSVVDHWLKEVKGVKYYFRYMDDM VIFGSSKEELHKLKRELDEFMAVNLKQVLKHNWQVFPTKVRGVDFVGYRFFGEYTLLRKSTCKTFKRRML SISSKRENNVSPTYSEWCSFNSYVGWLQHCDSFRLYQKYVEPNVEYTHNYYLKEVKGNAEICKRKNYSGE RKAS >gi|253574968|ref|ZP_04852307.1| RNA-directed DNA polymerase [Paenibacillus sp. oral taxon 786 str. D14] MTKKHYDLFAKVADYQNIKTSYKNVLKGSRKFKKEAVLFDMCREKQLIGIWKDLRNKRYRVGEYIRFKVY EPKERMISAPRIRDKVVQFAVHNVLYEVYKPVFIKTSFACQKGKGTHAAVDQVQRNMRLCKWKHGTGWIL KMDIKKFFYSIDRDILKRILRKKIADPDMLKLLDDIIDSSPEGEVGIPLGNVTSQDFANIYLNELDQYCV RYLGVKWYVRYMDDIIMILPTKEQAQECLKKATRFLNERLNLETNSKTKVFPLEQGVNAYGFKIWTTHRL VRDHSKRAMKRRIKAMDRKLKAGMIGMKEVQQAVNSWLGHARHSNSYNLAKKIFRKYPYIKVEGEMRFGG RILGNR >gi|253578879|ref|ZP_04856150.1| reverse transcriptase [Ruminococcus sp. 5_1_39B_FAA] MKKCCKNVNILADDFIEDSIYEALDEKWKRSDVAKYLHGRTSSMSLQAMKRLLRDTDERDLMVSGLVHTV AESLHYEIQNRCLKVEPIQYSWRQDGVNGKIREIGVESVKQLILDEIASEGLDELWRRKLGYHQYASIKG KGQLGGKKAIEHQIRKKYSMSRYAWKGDVKKCYPSVDTRKLKRMLEHDVKNEVLLYLVYFLIGTYKQGLN IGSGLSQFLCNYYLAKAYVYVLGLHKVRKHRDGATESKRLVYFCIMYMDDILLIGAREADVKRAARALEK YLLKEYGLTIKPDADLFPIDYRIKTGKKYDSYREKDKAERRGKPIDMMGYVIYREHTEIRSKIFLRARKA YSVAWYCMKNKMEIPLQTAYKCTSYYGWFKHTDSKYAKGKYNIDAVCTAAKRRISKHAKSEIYGTSARSA LAACQ >gi|254523226|ref|ZP_05135281.1| retron-type reverse transcriptase [Stenotrophomonas sp. SKA14] MTKPRYPHPGCAAWSQVYGEAAAWSSASAWNVNFNNGNVNNNHRNNNGFALAVRRAGEFQGEVGLQELYQ AWRRARRQKVPSFNQLRFDHRWADGLLQLQQELVACHWQPRPSTCFVATRPKAREIHAPDFADRVVHHWL VPQLEALWEPTFIHDSYANRKGRGSHAAVRRAQQFVRQVHSGQGGGWYLQLDVANFFNSIHRPTLWRMLR TRLRLRGAPLVVQQATHALLRRSPLHAGVQYRATAAEQAQVPPHKRLANAPAGRGLPIGNLSSQFFANVY LDALDQFAKHVLKAKRYLRYVDDFVLFHHDREQLAAWRDQIEAFLQDQLGLRLKAEQKLCRLTDGLDFLG YVIYPTHTLARRRVVGHLHTALAEWEGKHVHGESLRATPADFRELSNRIASFAGHLLHASSHRLMHRVHI RFPWLRSAARPRRFSYKAERRIHSIRWIKEVPAHG >gi|257093900|ref|YP_003167541.1| RNA-directed DNA polymerase [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] MLVLAGIQPAPLVGKPQVEPEPRQGKVRLAQSGHACFEFALADRLLELKRELETGQYRPGGYLNFFIHEP KRRKISAAPFRDRVVHHPLCNVIEPRFERLFIADSYANRRGKGTHRAIDRLQHFAQRHRYVLRADIVKHF PSIDHQVLHAILARVVPEADLMALIDRIIASGAGVLDEEYATVYFPGDDLLAACRPRGLPIGNLTSQFWS NCYLHPFDQFVTRELRWAAYLRYVDDFALFSDSKRELWAWKRAIVERLARLRLTIHEGPAQVVPVENGIP WLGFVVFPGYRRVKARKVRGATSRLSGRLDDYLAGHISFAELDASVKAWVNHVRQADTWGLRRHVFAGLR FTMVPEKSTGAARP >gi|257094650|ref|YP_003168291.1| reverse transcriptase family protein [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] MKRLAIRLEEVAERSNLMLATWKAARGKRQRPAVARFLADLDGQLDHLAACILNGQAPQGQYTSFTIHDP KRRLIHAACFADRVLQHAILNLAEPRFEAMLVDSTYACRPGKGVHAAARQVQRNLQRFAWRVQVDVDSYF PSIDHACLKALLATRFKGAGFLALLGRIIDTAGDAGRGLPIGSLTSQHFANAYLDTADRRLLEDRRVRAH VRYMDDILWWCDSRADALATLAELNDFLRRERGLQLKPKVSIAPSRTVVAWCGFRIDQAVILPSRRKLAR YRRHARQIECAWASGQVSEAEVQRASANSLATLAHSRSLGFRRRFWQQHPSLDDLAAGQPDHP >gi|257439360|ref|ZP_05615115.1| reverse transcriptase family protein [Faecalibacterium prausnitzii A2-165] MEATGDTEQTAQAGRCLLCGQKGDPDPAAWRPQRKDLTEECHNAGHNYDNTRRNEDMDPRNQPSLLERIY SWENLLDAYHEAASEKWYRNDVTAFAANLEENLISIQNDLIWHTYKVGRYRQFYVHEPKKRLVMALGFRD RVVQWAIYLQTNQYLDNGMIYHSYGCRVGKGTTRAADRLQYWCTLVDRKPGKWYYLKLDVSKYFYRVDHR VLLDILRRKFPNEDGYLWLMETIINCDHTPFGLPPGKSADEIPPSERLFEVGMPIGNLTSQLLANVCLNE LDQYIKHELKAHFYDRYMDDMALLYPDAATLNRWRTAIEKYLNEVLHLELNSKTTIGLVERGITFVGCRI YPGYRKPTAQSVKKMKARMRYIAKEYEAGLIDFDAVDATMQSYFGLMGHCSTHGLQKWIEKNIIFKRKEM ADIELPQEVTQWELNQF >gi|258517326|ref|YP_003193548.1| RNA-directed DNA polymerase [Desulfotomaculum acetoxidans DSM 771] MKHYSNLYSSICSFEGLYQSYLKARKRKRYRNEVLKYTANLGENLIQAEEELISKSYRVSPYRKSFVYEP KKRLVMALPFGDRIVQWSVYRTLNPLLNKRYISHSYACRTGYGSHRAVKQLQYWLRYLERRHGRIYVLKA DMTKYFYRVDHDIIMNILERIIGDYDLIWLLEEIVRCEHTWFGLPLDAEGFECELTGEVGIPIGNLTSQM IANLYLNELDQYAKHNLQIKYYMRYMDDVLILHNDKKYLWHIKEEIEEFLDRNLRLKLNNKTCVRTNTQG IDWIGYRVWPTHVKLRKSTAQRMKARLKYLQGLYAVGEADFKEVNATVQSYLGILKHCDSYNLREKLFGD LTWARDSTQQSQLQGEILL >gi|261880961|ref|ZP_06007388.1| conserved hypothetical protein [Prevotella bergensis DSM 17361] MRREGYIIEEVVEYSNMSDSFDQVLRGTRRKESRQGQWLLAHREEVIRELSDRIKAGTYTVRDYREREIN ENGKIRRIQILTMKDRIAVHAIMAVVDRHLKKHFIRTTSASIKERGMHDLLAYIHRDMLEQPDTTRYCYK FDISKFYESIDQDTVMDCVRRVFKDRRLITLLDGFVRMMPRGLSIGLRSSQGLGNLLLSVHLDHVLKDEC GVRHFYRYCDDGVVLAASKRELWEVREVVHRQMEGIGLKVKTNERIFPITEGIDFLGYVIRPDYIRLRKR IKKKAASKLNEVKSRKRRHEIIASLYGMAKHADCNNMFHKLTGKQMKSFKDLKIAYKPEDGKKRFPGAVV SIRELVNLPIVVKDYETGIHTEQGEDRCIVSIEQNGEPKKFFTNSEEMKNILAQISELPDGFPFETTIRT ETFGKGRTKYVFS >gi|265750836|ref|ZP_06086899.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] MRRVGYIIEEIVEPSNMEASFRQVLRGSKRKRSRQGCYLLAHKPEVLEELVAQIASGTFRVKDYREREII EGGKLRRIQVIPMKDRIAVHAIMAVVDRHLRKRFIRTTSASIKRRGMHDLLAYVRRDMAEDPDGTRYCYK FDITKFYESVKQDFVMYCVSRVFKDAKLVTMLESFIRLMPEGLSIGLRSSQGLGNLLLSVYLDHYLKDRY AVRHFYRYCDDGVVLGKTKAELWKIRDAVHGRMECAGLLVKGNERVFPPGEGIDFLGYVTFGADHVRLRK RIKQKFARKMHEVKSRRRRRELIASFYGMAKHADCHTLFKKLTGKDMRSFKDLNVSYKPEDGKKRFPGVV VSIRELVNLPIVVKDFETGIKTEQGEDRCIVAIEMNGEPKKFFTNSEEMKNILLQVKDMPDGFPFETTIK TETFGKGRTKYIFT >gi|266622220|ref|ZP_06115155.1| conserved hypothetical protein [Clostridium hathewayi DSM 13479] MNYEEIVCDANNLYRAYKVSVKSSKWKESTQKFMMNFLRYIFEIQDDLINRTLQNGPTQEFELHERGRIR PITSIQIRDRIVRHSLCDEVLLPEVRKHIIYDNCASIKGRGISQQRKRFEIHLHKYYQLYGNDGYILFGD FSKFYDNIIHEIAKRELLKLFNDDEFIDWLLTLIFKGFQIDVSYMSDEEYETCMIDTFNKLEYRNIPKEK LTGEKWMEKSVNIGDQLSQVIGIYYPYPIDNYVKYVRQQKFYGRYMDDWYIMNPSKEELENLLENVCKIA AELGIHINRKKTRIVKISSKYKFLQIKYTLTDTGKVIKRINPDRVTAMRRKLKKLAVKVENEEADYDNVE NMFRGWMGGHYKLLSREQRKNLIQLYEDLFSKEITIVNKKLIVSDRSA >gi|283796807|ref|ZP_06345960.1| hypothetical protein CLOM621_06761 [Clostridium sp. M62/1] MRKTHTEIHLRNEPKHGHKEYKYLYQQMLKDDVIRKAYKKLRKGKTKRKEIQYIDAHLDDEVQKIYDMIL NTKPEGVDVPHPELAYKPKKRTPKIIFEHGKRRKIYMPEIHEQWLHHIIVLVLEPIITATAYPYSCGSFP KRGAHYGKRQIERWLLHDPKGTRCFAKMDIRHFYDSIRLKILMRELAIRIKDDWFLYIIGLCLQGFNKGI PLGFYISQWLANYLLEPLDRLITEVLGLPKLQRYMDDIVIFASSKKVLQRAIVEIRKMLGQRFRLKLKHN YQVCKFYYEKGKRKIGRALDFMGFIFYRNKTLIRKNIMLSATRLAKKMERSKEANRGYFHRHIEAMLSYM GWFTCTDTYDCYQSRIKPYIHVGRLKKIISKIKRRQNHEGMDQGKMLRGAAGAAACG >gi|291513637|emb|CBK62847.1| Retron-type reverse transcriptase [Alistipes shahii WAL 8301] MKRIGNLYEKVCSIENLQLADEKARKGKLRTYGVIEHDKKREVNLLKLRETLLNGTFHTSKYDVFTIYEP KEREIYRLPYFPDRILHHAIMNVLEPIWVSTFTADTYSCIKNRGIHAAAKKVKQALREDPEGTTFCLKLD IRKFYPSINHDVLKSILRRKLKDKRLLRLLDEIIDSADGVPIGNYLSQYFANLYLTYFDHWIKEQKRVKH YFRYADDIVILASDKSYLHSLMGEIRAYLGDLKLEVKGNWQVFPVAARGIDFVGYVFFHTHTRMRKGIKK TFCRRLAKLNKRKRPLSEKDFKQAICPWWGWAKSCDSKHLIKKLSKTSKYEIKFKR >gi|291561556|emb|CBL40355.1| Reverse transcriptase (RNA-dependent DNA polymerase) [butyrate-producing bacterium SS3/4] MRAALKKWGEEATVIKLDIKKFFYSIDRTILKQILVKRFKKLKKKYPEKYPDFLKFFRLLCKVIDSSPEG ERGIPLGNVSSQDFANIYLNELD >gi|291612675|ref|YP_003522832.1| reverse transcriptase [Sideroxydans lithotrophicus ES-1] MTTVRKVDLVVPLAKQATAMRIADTAGLDTQGRSGAVSLLTDRDSRRGDVDSTINRSVRQVPPGIRTSTM ETRTGTTPTTSAVLWPSAAQPSRHHADFSFEDLVQAYLDCRRTKRNSASALAFETNLERNLCRLNDELRN GTYQPGKSICFVITRPKPREVWAAEFRDRIVHHLLYNRISPRFYAGFIADSCACIPGRGTLYGAQRLEAK IRSITQNWSKPAHYLKLDLANFFVSIDKHVVRGLLAKRIDGWWMELAELVLFHDPRQDFELRGDPYLLRR VPPHKRLTSQPGHIGLPIGNLSSQFFANVLLDALDQHIKHDLRCRHYVRYVDDMVLLHESPMWLCAARSD IETWLPLHLGLRLNPVKTILQPVDRGVDFVGQVIKPWHRTTRRRTYNEALSRVKQVPANELFETANSYFG LLRQASHSHHDRALLGSVLMYRGHSINKAFTKTFRHSTSHGGNYGNQ >gi|291614412|ref|YP_003524569.1| RNA-directed DNA polymerase (Reverse transcriptase) [Sideroxydans lithotrophicus ES-1] MHARCNKGGRLRIQRFGEDPLRHLITIQKQLRERRYQFGPYKTFTVREKKFRDVVDAPMKDRIVHWMLYQ YLLPIWQPRFIHDTFGNLPGRGTHAALRRLAQFARSERAEWVLQLDISKYFYSVNHALLKERVLRHIGDH ELRALIINLIDSFRTDGSYDHLFAESTLYRQTPAKGMPIGNLSSQLFANIFLNDFDHWVKETLRVKRYVR YVDDMAILGESREELQEVCEQITSNLASEGLTIHPHKIRIAPTRAGVPFLGSIVWPNHISAGRYIRSRYL HRIRQHESGARDRTEALRSYRAMFKHTGVTR >gi|291621968|emb|CAX65001.1| gp20 protein [Vibrio phage VP58.5] MNTRNRFPLRGGNWNNGSNAGLGALNLNNARSNSNSNIGFRPALDVARNNHPKGCCQCNHEKDAASSAIA GTDIKPIDASLGCSFEKIFDFENLLSAAYSCRKGKTKANATLVFFNNLEENIIEIQNELMWGMYKMSPYH HFYVFEPKRRLISAPHFKDRVVHRAIYNVIEPLFDKAYIYDSYACRRGKGTHKGADRAQYFIKKVESKHG KAYALKADISRYFSSIDHQVLKSILAAKIQCQRTLELLFYIIDNSPCESMGVGIPLGNLTSQIFANVYLH ELDRYAKHALGAKHYIRYMDDFAIIHHDKAVLHQWRKDIEEFLHLYLRLKTNSKTQVFPISTSNGRSLDF LGYRIYSSHRLLRKCSVKRIKTKLKKYRSQFAKGEISLSDINQNIQSWLGHAGHASTYNLKKALFAEPFR RKTDV >gi|294674266|ref|YP_003574882.1| hypothetical protein PRU_1581 [Prevotella ruminicola 23] MKCGVSYEEVYEAYWLCLRHKGNTIDAIRFEINAEEECLKLWQELQNCTYKPSRSIVFVSDKPVKREIFG AAFRDRVVDTVFAQRVTPFLEQRFIDDNYSTRVGKGTLYGIQRVEQMVREQSANYTVDCWVMKLDMQSYF MSLPKELAWRKFASLLRQIYRGPDLEWMIWLLRVVIMDRPELNCVRNSPLKAWNGLPPNKSLFHSDGLHG MPIGKVISQMTALVFMDDLDHEITDSLWSMSVSYGHYMDDMIFVSRDKELLLWMRDEIVDKWAARNGVRR HPKKMYLQHYSKGVLFTGGMVMPGRTYISRRTIATCIDKIERYNRLAQSDEHYAREHVNEFASTMNSYLG MMRHFAAYNQVRKLLDRISPEWYQVMYVSMNRKRYKVCVRRVYREKMRQLARLEVELEILFEK >gi|294778518|ref|ZP_06743941.1| hypothetical protein CUU_0540 [Bacteroides vulgatus PC510] MRREGYIIEEIADYSNMSESFNQVLRGTSRKRSRQGRYLLAHREEVIKELAERISDGSFRVSGYRERTIW EYGKARNLQILTMYDRIGVHAIMTVVDKHLRRRFIRTTSASIQGRGTHDLMKFICRDMETDPEGTGYGYK FDIHHFYDNVRQDFAMWCFARVFKDKKLLGILNSFIMMLDSGISFGLRSSQATGNLLLSIFLDHYLKDKY SVRHFYRYCDDGLVLGKTKAELWMIRDAVHSQMERIGLQIKSDERVFPVEEGIDFLGYVIYGPEHVRIRK RIKQKFARKMHEVKSRRRRRELVASFYGMAKHADCHTLFKKLTGKDMRSFKDLNVSYKPEDGKKRFPGVV VSIRELVNLPIVVKDFETGIKTEQGEDRCIVAIEMNGEPKKFFTNSEEMKNILLQVKDMPDGFPFETTIK TETFGKGRTKYIFT >gi|297374652|emb|CBL42939.1| Retron-type reverse transcriptase [Candidatus Magnetobacterium bavaricum] MFSVLSIGQYGLVGIEICESIDNNYVWPVRSGEWLPPLFSFEDLYRCYLKCRRNKRKTMNALRFEVNAEE KLFDLSEELTGKTYCPSRTVCFILERPKMREIIAADFRDRIVHHVLVERLELIYEPMFIYDSYACRKDKG VHKAVAKVRSFISKGSDNARRKLYYLHMDIKNFFMTIDKNILYSMLQKKVKDNDLLYLAYTIIHHKPTSN CVIKGDVNLLRHLPHHKSLFHAPENRGLPVGNLTSQFFANVYLNELDQFAKHTLKCRYYVRYCDDFLILH TSRQWLEEIRDKVRAFVETRLSLVVNERYGVVHPVTNGIDFLGYIIRQGYTLVRRRVVNNLMARLDEYER RLVIEDEAMRQFIYAYDVLEALRCVLASYFGHLKWADTYDLRNAILERYSFLYEFFSFEYDCILPAYRFA SQFKTSKMQYQYYAGVFRHSMVFFQVGYFYEFYEELPEVRNVLCLKRMKDNQRGTKYGFPMSYESVYLQK LMKNGVMSIVIVKETEGYIGRIKNRLPVRRIETKCLN >gi|298385378|ref|ZP_06994936.1| hypothetical protein HMPREF9007_02043 [Bacteroides sp. 1_1_14] MESRNITLTICGEWLDFSPRDRKAVCEIDDLFKLTGNIPLVSYPLYNLIPEIISDENLERSFKRVMANLR NADARNGNRSMPKTIIDGIECSPRMIRYVANKGKIFETLKNQIGNGTFRIKNLKSFLTEDGPKVRTVQAP SVIERIGSNAIMEPLENRLSSLLIETTAASIQGRGPHGLFHQIQAAMAENPNLKYYYQSDYKGYYDSINH ETLISIIKRYVGDPLLLPILENFVKALYPDGECGISKGLRSSQFLGNLYHNDIDHRMIDVHGARYYFRFC DDIFILGESKRELWRLRDCLHIEADKMGLTIKSSERVAPISAGMDALGYVNYGSHTLLRKRIKVNAARKL SKLKSRKRRQQIIGSFKGMACHADCKHLFYILTKKNMKKFSEMGVTYTPADGKKRFPGKVTRLSDIVNIP IEIHDFETGIDTKEGENRYLVSFRNPAKQEWGKFFTASAEMKGILDQVSDIEDGFPFETIIKGEVFDGGK RKYNFT >gi|298387225|ref|ZP_06996778.1| retron-type reverse transcriptase [Bacteroides sp. 1_1_14] MNNGNVNTNNKTNAGRVRPVSATDKPIYDIPLSSIIEAYDDCCRQKRNTDDCIEFSFNYDTGLVAVWESI RYGHYEPGFSDCFMRKKPVLREVFAAAYIDRVVHHWIDLRLDPILEERFQAQGNVSKNCRIGEGCLSAVM RMDEMIKEVSKNYTQDAYIFKGDLKSFFMSMSKSLLWEMIDIFVRDNYKKDDIECLLYLLRIVIFHQPQK KCRRKSPLHLWDKLPKDKSLFYSSPERGVAIGNLPSQKFANFIGSVFDYYVTVICGIKHYVRFVDDFGFV MRFKEDILKNVPLLNSYLKEQLLLELHPKKIYIQHYSKGVLFVGAFILPGRIYISNRVVGNIYDAVSKYN KIAKEGFCEAHIETFVATMNSYYGLMRHFNTYNLRRKIGKIIAPEWWQYIYVEGHWEVFVIKSEYNFKKQ VKKQIRKGNAKKYLTPEIS >gi|299067086|emb|CBJ38282.1| putative retron-type reverse transcriptase [Ralstonia solanacearum CMR15] MPSADWNGSTPFSFAELVEAYLDCRRTKRNSASAMGFEINLEGNLRRLCDDLVSDSYKPGRSKCFVITRP KYREVWAAEFRDRIVHHLLYNRIGPRFERSFIADSCACIKGRGTLYAAQRLEAKVRSITQNWARPAHYLK CDLANFFVSIDKRVLLDLLLAKIPEPFWRELTELVLMHDPRGDFAYLGDPWMMDKVPPHKRLMEQPSHLG LPIGNLSSQFFANVYLNELDQFVKHELRCRHYIRYVDDFVLLHESPQWLNDAHDAIESFLPGRLGARLNP RKTILQSIDRGVDFVGQVIKPWHRATRRRTLNVGLQRLREMPAADVHHAANSYFGLLRQATASHQDRAQL ANVVRSRGHAVNRQLTKTYRAARAAKGA >gi|308273696|emb|CBX30298.1| hypothetical protein N47_D31070 [uncultured Desulfobacterium sp.] MRAVRGGKCSLLSFESVYGAYLDCRKRKRGTINALKFEIGSLDKIFDLAMSLQQGTYRPARSVCFITTSP KLREIFAADFTDRIVHHLVVRELEKVWEPYFIFDSYASRKEKGTHGAVHRLQNFMLKATNNGKRQAWMIQ LDIRSFFMSIDKDVLFGIFERKLSTKGDTSSWVLLYLLHRIIYHSCERDFVFKGNPVMLDKVPPHKSLLK VGEGKGLPIGNLTSQFFANVYLNEMDHFIKHGLKCRYYVRYVDDFVLLADSPTRLEQWVLQIADYLEKKL LLTLKDGWRVKTVSEGADFLGYIVRPKYILVRNRVVNAMKEKLAEHHKKMVECRKDVNGRMILKMKMTPD ATDDLMQTLASYLGHFKHADSYRLTRTVFLKHPWLNEYFTFNGEKLTRKLVYKGSFRSLRSQVGFFRAKL KDTVLLIEVGLFLEIYDGDAVSAGKALGLILQENKRGIGVSAGFPLRFEPRNIKKLLEMGRDVAVIREAG AGKYVKKRYVDDLYRIL >gi|312115534|ref|YP_004013130.1| RNA-directed DNA polymerase [Rhodomicrobium vannielii ATCC 17100] MPKRHEGLFERIASFKALRAAARTAINGKRKKPGAAAFMANLEREILRLERELRDGSYRPGRYVEILVKD PKERLISAAPFRDRVVHHALCAVVCPLFEAGFTDHTFANRTGKGTHKAIRLYERYRDNHSYVLRADIFRY FPAIDHEILKAEFRRKIACERTLWLMDLIVDCSNSQEPVELHFPGDDLFTPYTRRRGLPIGNLTSQFFAN LYLNRFDHWVIEKLGAPYVRYVDDFALFHDDPGILATWREKIERCLEGRRLKLHPRKTLILPVAEPSPFL GFELHPGPRRTAKGGRGRRKLLDGNVARFRNRLRGLRDRWRAGTVTQGEVEARVKSWIAHASHADSFRLR QALFEGGWFEVIPGL >gi|312962032|ref|ZP_07776529.1| Retron-type reverse transcriptase [Pseudomonas fluorescens WH6] MEINLLELHDDLIAGTYRPGRSICFVVTRPKAREVWAAAFRDRVVHHLMYNHVAPRFYASFIADSCACIP GRGTLYAATRLESKIRSASENWSKPVFYLKCDLANFFVAIDKAVLRKQLEARITEPWWLALATQILMHDP REDYETRSPAHLFNRVPQHKRLAAQPARLGLPIGNLSSQFFANIYLDALDQFAKHQLRAKHYIRYVDDFV FLHESPQQLNQWLAEVESFLPRLGAKLNPTKTILQPADRGVDFVGHVIKPWRRTTRKRSLAQALKRTAAA PAEDLRETANSYFGLLSQASHSEKDRAALARVVLKRGNSVNAALTKTFQKK >gi|313113756|ref|ZP_07799330.1| conserved hypothetical protein [Faecalibacterium cf. prausnitzii KLE1255] MTYEELCSFEVLYKAYLEARKGKRSKSKTIEYEAQALACTEKLSRKLAVCNVRQPDGSIRQQIRYVPSKF EVFAVYEPKRRMVHAPAFVDKVVLHALVDNILYDALTKSFIRDSHASQTGKGTDDGLMRLKTHMVDYYRR EGHGADGWVLKGDVRHFFASIDHRKLKRKLKAVLDKRGVDPRVYELLCIYIDVMEDGLPLGYQTSQLFAL MFLDEFDHIIKEKYRIKYYGRYMDDFYIICSDKKKLQCILRDVRALMDSYGLELNQKTAIFPLRNGIDFL GFHSYLTDTGAVIQKLRRDSSKRMKNKIRYWETAYPAGEVTKQEILRSFDAWDAHAAHGDTYSLRRKYAD RLEKLLDCKIPIHRKINSNKLARDRRRARQCRCIYKKQHKALSLSVSQNTRPAEIMPWA >gi|313147618|ref|ZP_07809811.1| RNA-directed DNA polymerase [Bacteroides fragilis 3_1_12] MKRLGNLYDQICSIENLRLADEKARRRKLRSYGVQRHDKNRDENLLRLQEMLLTQTYKTSQYDVFTIYEP KERQIFRLPYFPDRITHHAIMNVLEPIWVSVFTNDTYSCIKNRGIHAAAKRVKYDLKTDPEGTIYCLKID VRKFYPSIDHDILKQVIRRKIKDKRLLWLLDEIIDSADGVPIGNYLSQYFANLYLAYFDHWIKEVKQVRY FYRYADDIVILSSSKESLHALLREMRVYLRDNLKLKIKHNFQVFPVDSRGIDFLGYRFFHTHTLLRKSIK QRFCRRVAELNKKTDIKFESFKQQICSWWGWCKYCDSINLVNKLLKNSAYEISFRR >gi|319764254|ref|YP_004128191.1| retron-type reverse transcriptase [Alicycliphilus denitrificans BC] MRSPSADSCPDGDAATFPALLQAYINCRRGKRNSASALEFEMRLERNLCDLYDELVSGAYQPGRSICFPI TRPKPREVWAASFRDRIVHWLLYSHIAPRFHAAFVADSCACIPGRGTLYGAQRLERHVRSCTRNWARPAH YLKCDLANFFVSIDKHVLRERIAARVHEPWWMALADTILFHDPRQDVEVRGCASDLRRVPPHKSLFNAPD DTGLPIGNLSSQFFANVLLDALDQRVKHRLRAPYYIRYVDDFVLLHPSRTWLTAAHHDIETWLPEQLRLQ LNPRKTIRQPVDRGLDFVGQVISPWRRTTRRRTLASALQRIEQMPADALLAAGNSYLGLVRQATHSHTER AALCRALLKRGHAVEGMHLSKVFRKAAA >gi|322688997|ref|YP_004208731.1| hypothetical protein BLIF_0810 [Bifidobacterium longum subsp. infantis 157F] MKTYCKHSRITEPAFVRDCIERFLKGKRSRRDVNDFLSRHPDLDSLSRQIADEIGRGEYRFAPIRYFRRV EPISGKIRIIGRESIRHQIYDYVCGTALMPLFRAKVGSWQTASIPGRGIADARRAIKRWVREPSSKAFVK LDVRKCYPSISREVLKRLLSRDVGDRRLLDLTFHLIDQYAGDDGLNIGSYLSQWLANYYLSYAYHYCERH LSKERVNRRTGETTTRRLVTHMLFYMDDILLVGRSKRDLTIAVKRIRAYLHDTLRLEIHPTWNVKHVGVE PIDMVGFTFYPDHTGVRAGIFLRARRSFRRYARNPTSLRLAYRCASYYGWLKNSDSIQYRRRNNVDQIVR RARNTVAASRKKGQQDDSERLFRNPVGKSGLPSPR >gi|331088842|ref|ZP_08337752.1| hypothetical protein HMPREF1025_01335 [Lachnospiraceae bacterium 3_1_46FAA] MKATGNLYSKIYDMENLKRAHKNAKRGKGWYAEVKEVEHDLDGYLRRLQESLINHTYKTSPYKMFHKIEG NKEREIYKLPYYPDRICQWAILQVIEEYLLKTMTKDTYSAIPKRGAQPIVNQLRGYWKEVKKDGKVVSKK WIPSVLVSDPEGTAYCLKMDVRKYYPTMVHEVLKEKFRKLFKDKELIWLLDEIIDSISTCPASEENIEIL TEFGMCVNVMVDDTGREFIDGVGIPIGNYLSQYCGNFNLSPLAHWLKEEKKVKYQYFYMDDLVILGSSKE ELHKLKREIDEFLAVNMKQTIKHNWQVFPSKVRGIDFVGYRFFGEYTLLRKSTCKNFKRKMLEISAKREN NVSPTYGEWCSFNSYKGWLENCDSYRLYQKYAKPNEEYMHNYYLKEVKGNAEICKRKEYRRNGTGAGDRR LPCVC >gi|331089069|ref|ZP_08337973.1| hypothetical protein HMPREF1025_01556 [Lachnospiraceae bacterium 3_1_46FAA] MNDMESVYDANSLLDAFNKSKKGTAWKESVQRYEMNLLRNINQTQKEMKDGTYEQKDFYEFKLHERGKTR HIKSMHISDRVVQRSVCDNVLVPELSKYLTYDNGASMEGKGIHFARKRLSTHLHKFYRKHKSNEGYVLLI DFSKFFDNIVHDGLIKEMRKKIGDKETMSFIEKLIDTFRVDVSYMTDEEYANCMKTLYNALEHAQIDKAK LTGEKYMRKSVGIGSQISQISGVYYPTRIDNYCKIVKGMKYYGRYMDDIYIIHEDKEYLKGLLNDIQGIC DELGLFINPKKTQIVKLSHGFTFLKIKYNLTETGKVQERISKDSVTRMRRKLKKFRKLMDAGEMSFDDVR CAYASWKGGVSHYDSYNVVKSMDKLFDELFIHPFIGGGHRDEQNNNEQK >gi|332533803|ref|ZP_08409659.1| RNA-directed DNA polymerase [Pseudoalteromonas haloplanktis ANT/505] MNARNRFPLRGGNWNNGSNAGLGALNLNNARSNANSNIGFRPALDYARSSTLKGGCQCNFEKDAAAPAIA EKAIKPVDASTGCTYEQIYQFENILNAAYQCRKGKTTSASALAFFNNLEENIVQIQNELMWEMYSVLPYR HFYVFEPKRRLISAPHFKDRVIHRAIYNVIEPLFDKRYMHDSYACRTGKGAHAGADRAQLFIKQVESESG KAYALKADISKYFSSIDHQVLKNILGAKLKCQKTKSLLFYIIDNSPSDAVGVGIPLGNLTSQIFANIYLH ELDHFAKHTLKAKRYIRYMDDFVMIHKSKSQLNEWRNLIEEFLYKNLRLKTNSKTQVFPIAKTNGRSLDF LGYRIYANHRLLRRSSVNKISTKLKRFRKEFSQGKVSLAEINQCVQSWLGHASHADTHSIKLKLFNTPFK R >gi|332534155|ref|ZP_08410003.1| RNA-directed DNA polymerase [Pseudoalteromonas haloplanktis ANT/505] MITRERMPYRGGNWNNTSNAGLGALNFNNSRTNANSNIGFRPALDNARNTDLTGLCQCNFEKDAASSAIA ETLNKPVDASTGCIFSEIVSYDNILNSAYQCRKGKANSPATLNFFNNLEENVINLYNELNWGTYELSNYH HFYVFEPKRRLISAPNFRDRVVHRAIFNVIEPLFDKTFIHHSYACRNNKGAHRGADVAQRQIQKIERKHG AAYALKADISKYFSSVDHAILKRLLSNKIKCEMTLTLLHYIIDASPSDSPGVGMPLGNLTSQIFANVYLH ELDWFAKHILKIKNYSRYMDDFVVIHHCKRYLHTQRIAIQHFLQSALRLKTNSKTQVFPIAKNGGRSLDF LGYRIYSTHRLLRKSSVKRIKRKLKIFHIKYELGKVNLPEINQSIQSWIGHASHANSYNLRKKLLSQSFK RGDYV >gi|332653877|ref|ZP_08419621.1| putative reverse transcriptase family protein [Ruminococcaceae bacterium D16] MENIVNSTIALYKAYRKTRCGKRDNPTAMRYRMEAIERTVALSERLQRRDYSFGPYYPFKVYEPKERLVL AIDFEGKVVQHSLCDNVLEPAFSRRFIRDNYAGQIGKGTHDGLDRLAAAMRHYFFSRKAADEAARKAAGL PPRPMNEWDYADGWVLKGDFSKFFYTLLHSYCYETARRALKWLKDPELIDFAEWLLWLIIDSTPDPGIPI GNQSSQLLALLYLDAFDHWLRDDRGLVYGRYMDDFYIIHSDKLLLRQILKEIEAYIKPLGLRLNGKTQIL PLKNGIDFLGFHTYLTQTGKVVRKVRAKSIDNMKRKIRKFRGLVDSGKMTLDSVVQSYASWTGHISHGNT YHLRQNMDAYFFSYFPELKPSPKGDNTHGPKTEQPRKQVEGQVRQPVRQPDRLDRGR >gi|332655364|ref|ZP_08421103.1| hypothetical protein HMPREF0866_03096 [Ruminococcaceae bacterium D16] MFFYGRRCCNGVRWKQSVQNFENHLFSGTARRRREILSGTWKPGKCVHFTLCERGKVWPIDAPHITDRQV HKTLCSEVLIPLYNPSMIFDNGASQRNKGLHWHFQRLKEHLHWHYRRYGRTGAMGLVDLKAFFPGAPRQA LYQRHQLLIPDPALRRVADTVVDYAPSTAPGRGMPLGVEPSQQEMVALPSAVDNWLKCQVGVHCAGHYMD DYYIIMPDVEQLKAVIREMVRRFETMGIRVNKRKCKIIPLTKSFRWCKARFTLTETGKVKVNGSRDGIKR ARRKLKLFHQEFMAGKRPFSEVEQYMECQSAYYRNFNDHGRLLRLRRLYHAIFFGGAKCIKS >gi|332666264|ref|YP_004449052.1| RNA-directed DNA polymerase [Haliscomenobacter hydrossis DSM 1100] MKRTGNLFAAFTAFPNLLNAYYKARKGTRRNQETGFFFLNLEAELFQLQEELLQLTYQPQPYRYFQIYDP KERTISVAAFRDRVVHHALVNVLEPVYERIFIYDSYATRKGKGNHLALLRAQQMIRIHPLFLKSDVDKYF DSISQERLMEIIAHKIKDAQLLDITARIIRNGGHRGLGLPIGNLTSQFFANVYLNELDYFVKHQLKGKYY IRYMDDFVLFEPDRRTLKSHLTAIQYFLADSLQLQLKPSATFINSSANGLTFLGKRIFPQAIRIARPNLL RMTRKMKNREEEYKEGTISEEDFLASMNSYWACLAFGDTYGLRKKLASS >gi|332707562|ref|ZP_08427598.1| retron-type reverse transcriptase [Moorea producens 3L] MKRYGNLWHQVTEFSNLLAAARQAQKGKRFRPDVLEFNYNLEQQLSQLQAELISQTYHPGAYKTFEIKEP KPRFISAAPYRDRVVHHALCNIIVPIFERSFIGDSYANRVGFGTHRALRRFTKFARSSNYILQCDIRKYF PSIDHTILKSLLRRQLKCRETLWLLDTIIDNSNKQELIIDYFPGDDLLSPLNRRRGLPIGNLTSQLFANV YLNGFDHFLKEQVKATKYIRYVDDFALFADNRTFLASAKLAIEDYLANLRLKLHPIKTQLFATKQGANFL GFHILPDCLRVRTENLRRARRRLRRMLVEYKQGKITRQEVSQSLQSWFAHLEHGDTWQLRQQIFSSLPWV RS >gi|336413450|ref|ZP_08593802.1| hypothetical protein HMPREF1017_00910 [Bacteroides ovatus 3_8_47FAA] MVGTFFEEARPGELKANLIIMWREDNIIEEIVEDSNIEDAIKTVLRKRRRKCSFAGRRILADVPKAVERI RQRIRSGRFKLGGYREMTVDDGPKVRIVQSVSLEDRIVLNAVMNVVDRHLKVRFIRTTSASIKNRGTHDL LQYIVKDIKDDPEGTLFGYQFDITKFYESVDQDVLLDAVKKMFKDKILIGILEECIRMMPKGVSIGLRSS QGLCNLLLSIYLDHRLKDQEAVAHYYRYCDDGLVLSGSKKYLWKVRDIIHEQARKARLEIKSNDTVFPIT EGIDFLGYVTRPDHVRLRKRNKQKFARKMHKVKSKKRRQELTASFYGLTKHADCKNLFYKLTGKKMKKLK DLGYKYKPKDGRKRFTGARIKSPELMNKDVIVLDYEKDVPTKNGNRTVIKLELDGKERKYFTSLEETLFI CESAAKDGELPFEAHCEGEVSEKGLIIIHFT >gi|336433792|ref|ZP_08613604.1| hypothetical protein HMPREF0991_02723 [Lachnospiraceae bacterium 2_1_58FAA] MIKDSTVFDKIIDFENLYKAYRDSKSGKGFTKSRIKFELSALDGIYQIKKLLESKQYEVDRYNRFKVYEP KERIIEAGSFKDKIVQHSLCDNVLLPILSNEFIYTNYAGQIGKGTLFGLDCLKYQMYLAYQKYGYDCWII KGDIKKFFYNIDHNILKDIVSYFISNPDTYWLCEKFIDSTSGNGLPLGNQVSQVFALLYLSGFDHFITGE LGVKYYGRYMDDFYLIVESKQYAKYCLCAIEDFVNTLNLELNGKTQIIPFKNGIKFCGFHTYVTKDGKVI RKLTNEKKRKAKKKYRKMAKMVKENKLSKGKFLESYESWKNHISHGNCVKFTYEMDKMIDKILSS >gi|338762709|gb|EGP13976.1| recombinase for Bh.Int [Lactobacillus johnsonii pf01] MELIDQILSQSNLKEAIKRVKANKGAAGVDKRTIYEIDDYFKKHQVEIKQSIRAMKYKPQAVRRVYIPKA NGKKRPLGIPTVVDRVIQQAISQVLMKIYDPEFSAYSYGFRPKRSSHDAMEQVLEYLDEGYQWVIDLDIE KYFDTVNHDKLISTLREQINRQNHPPLNPVILKSWDNGGWFS >gi|338999650|ref|ZP_08638292.1| RNA-directed DNA polymerase [Halomonas sp. TD01] MTRTGHLFERYANFDALHTGYLSARKGCRDSHACMRFELRLEENLIELLNHLHWGSYSTGPYRHFYVHEP KTRRITALTQFRDRVLQHAMYAVLEPIWEKRFISDSYACRVGKGTHRAADKAQAMLGECLRSHGKVYVLK ADIAKYFASIDHVIAKSLLNRAIKCPRTILLLEAMIDTYHEPGKTGKGMPIGNLISQLLANVYLDALDQH VKCRLSERWYCRYMDDWLIIGPDKQHLHKRRIELDWWLAEHLALETNHKTSVFPVSPANGRGLDFVGYHM WPHKRRLRKGSMKRFKRRVNRLRQQYSAGDVDVRDVQMQISSWLAHASHANAEGFVRSVIYDQPWERQNA AICD >gi|340788559|ref|YP_004754024.1| retron-type reverse transcriptase [Collimonas fungivorans Ter331] MSERNFLIWSCRWVANCCPPLCASQIPPGHAGRSGAVSLLIGSRLRQIDVDSMKNRRTPPTPTMHGCRTS TMATRTTTTSRMSTELVLSADQNGTTPSHADFSITELAQAYFDCRQSKRNTPNALAFEQDLERNLTRLYA ELVDGSYKPGQSICFVVTRPKPREVWAADFRDRVVHHLLYNRISPRFYAAFIKDTCACIPGRGTMYAAQR LEAKIRSATENWSKPVWYLKCDLANFFVSIDKNVLHKQIAVRVTEPWWMRLAETILFHDPRQNYQLRGAS ALIELVPPHKRLTNQPAHLGLPIGNLSSQFFANIYLDALDQHVKHQVRARHYIRYVDDFILLHESPQWLN AALADINAFLPDVLHTNLNPTKTILQPVDRGVDFVGHVIKPWFSRTRPRTVRQAVSRIGSMDSADVFTSA NSYFGLLRQAGSSHMDRTKIAKAVMRRGHSVDKQFTKTFRKSI >gi|341642641|gb|EGS66984.1| reverse transcriptase family protein [Vibrio cholerae HE-09] MGKKHKRLISQIADKQNLRIAAYKARKGNPNSIGGIVFMDYLESNVHLLHKSISDGTYSVGSPRIFTIYE PKKRTISALPFIDRVVQHAINNVIEPIFERTFYRQSYGCRTGKGTHKGAIDCQAIARRLGKKQESVWVLK TDFSGYFYNIDRAILHSRIRAKISCKETLSLIEKFIEPTGTGIPIGNLTSQLFANVYGTIADEWLLHHAK RSNFLRYMDDIVIFGSSQQELLSLQREFEAFCKESMKLNLSHWNVQNISRGVNFLGYRIWPTHKLLRKQS VTTAKKKIRRYIKQGRREDLRKFLASWSGHAKWADSKNLTKSVEKMLCAAMQK >gi|344341810|ref|ZP_08772725.1| hypothetical protein ThimaDRAFT_4464 [Thiocapsa marina 5811] MKRLGGVWPRLVSFENLHAAVDRYQGWARRYAYVLKLDISRYFPSIDHRLLKEALRRHLKDARTLSLLDA MIDGSPPAAEPPAYFADDDLLTPLERPRGIPIGNLTSQFFANLYLDRFDHGLLDTRKVPAYMRYVDDLYL LGDDLAALWELRDACAADLAGERLRLHPRKVRVHRTSEAVDVLGYRVSRTRRWLRNDNGYRFRRRFRRLL GLYRAGRLGWADLMPGIQSWIGHARHAETAGLRESIFGPVAFDRDGWYIR >gi|344343882|ref|ZP_08774748.1| RNA-directed DNA polymerase (Reverse transcriptase) [Marichromatium purpuratum 984] MKRADGLFARIVAFDNLLAAERLAARGKRDRGSVARFEFHLERELIQLQEELMEGRYRPGTFYSFEVRDP KPRAICAAPFRDRVVHHAVCDVLEPVFERYAIFDSYACRIGKGTHAAIARAQAFARRDAFFLKCDVRRFF ASVDHEVLKAQLARHFCEPRLLELLGQIIVHGPPDAPPGKGLPIGNLTSQHFANRYLGELDHFVKERLRV KAYLRYMDDLLLFAPDKPSLHLLLAEIRQFLAERLHLELKDAATLVAPVSEGIPFLGFRIYPRTIRLNQR TRRRFRRQVRTLEAAASQGRIDESELANRAACLFAHVSQADAYRLRRRVSDASITRG >gi|345871758|ref|ZP_08823701.1| reverse transcriptase family protein [Thiorhodococcus drewsii AZ1] MARIKPLALAAIGDWHNVAQALHRAARGKRQTPEVVSALARPEATIARVCIALRAGRLPVGTFSAFVIRD PKRRVIHAADFLDRVAHHALVRFMEPVFERVLLPSVYACRPGKGAQAAVLAAQRDARRFGWVMHLDIAHY FPAIDHEILRGQLRRRFRSDGLRLVDAVIDAHQGNQGLGLPIGALTSQHFANHYLNDADRWCLAQPGIGA HVRYMDDYLLFAAEKTPLLALRSAFAAYLSERLALTIKPPLIQRCERGFLFCGVRIHPHWLRPSQRRRLR YRAALHVWEQRWRAGEIDALHLQRAYDAVRAILLPADDPPWRRRCLTRSEVIDV >gi|345883525|ref|ZP_08834966.1| hypothetical protein HMPREF0666_01142 [Prevotella sp. C561] MKRDGYIIEEIIERANLESSFDTVVHGTKRKELKEGKWLLAHRESFLDDVAKEIASGHVNVSNYHEKHIH EGNKWRDIQVFNMRTRIKINAVMSVVDKHLHRRYIRTTAASIKQRGMHDLKTYIEKDIQLYPKEMKYIYK FDIKKFYPTIQQDFVMYCIRRVFKDKRLISILEVFVRLLPNGLSMGLRSSQGLANLLLSLYLDHYLKDRY GIKHFYRYCDDGVIAAGSKRYLWECRKIVHERMEAIGQTVKFNDSIFPITKGLDFLGYVIYPTHVRLRKR VKQHLARKLHKVKSRKRRQQIVGSLYGLCKHCNSKNLLNTLLTTREMRKFSEMGVTYTPEDGKKRFQGKT VRLAEIVNSPIEVHDYEKDVVTKHGDHRYLISFRDKATREFSKFFTNSEELKSILDQVAKMKDGFPFETI IRSEAFDGNKFKYKFT >gi|345893474|ref|ZP_08844272.1| hypothetical protein HMPREF1022_02932 [Desulfovibrio sp. 6_1_46AFAA] MAKTLKNIWQRITSFENLVAAWEEAKRGKRYHMPVLKFGGSVEENLFGIQGELLHRTWKPGPWREFFVNE PKMRLIQAPPFADRVVHHALVRVINPAFEERFIHDSYACRKGRGTLAAGKRLTHFLRCAAMSATGKRRSV YVLKADISKYFPNINHDILMSVLARTVGDEGALWLMERIVRENGFEDCGLPIGALTSQLFANAYLDVLDH YIKDELGVRWYVRYMDDFVIVSTDKRQLQELRTKIEVVLWERLRLRLNPKTAIFPASHGTDFAGYRHWTT FRLPRKRNIRRARRKFRLLRRLYAEGKVDVPFVRARVMSFLGYTRHCKARRTVDSALSELVLKKE >gi|347538767|ref|YP_004846191.1| Retron-type reverse transcriptase [Pseudogulbenkiania sp. NH8B] MPPDTLSIETLVTAYYDCRRSKRNSHNALAFEQNLERNLCQLYDELASGTYSPGRSICFVVTRPKPREVW AADFRDRIVHHLLYNHIAPRFHASFITDSCACIPGRGTLYAAERLEAKVRRITQNWSRPAFYLKCDLANF FVSIDKRVLRDQLAAKIDEAWWLALAEQILFHDPRTDYELHSSPALVDLVPRHKRLAEQPAHLGLPIGNL SSQFFANVYLNALDQYAKHQLRARHYIRYVDDFILLHESAQWLSATHDQIEAWLPARLHVRLNPAKTILQ PVSRGIDFVGQVIKPWHRATRRKTYRQALRRAASIHPTDLFETANSYFGLLTQATHSHHDRARLANVLRL RGMTVNGGLTKTYRKHRK >gi|348026361|ref|YP_004766166.1| RNA-directed DNA polymerase (Reverse transcriptase) [Megasphaera elsdenii DSM 20460] MQFAQSLEENLIEIQNELIWREYRVGKYHEFYVRDPKRRLIMALPSRDRVVQWAIYRQLNPILDRRYLST SYGCRIGGGAHRAVAKLKEYLRLQTGTAYILKMDVSKYFYRIDHDVLMGILERIVKDRGLLWLLHEIIYS DHDFGIATDDYDFTGERLSSVGMPIGNLSSQMFANLYLNEADQYAKRILKCKYYIRYMDDVIVVSNDRAR LWEVWRAMDDFMRERLRLKLNAKTCIRSEVQGVDFCGYRIWRDHIRLRKKSALKMKHRLRWLKRAYARGE VDIQTVAASLTSYFGLLSHCDSYELRKSILNNLVLVRKRKEGNK >gi|350551816|ref|ZP_08921027.1| hypothetical protein ThisiDRAFT_0420 [Thiorhodospira sibirica ATCC 700588] MSTITTRTIQTGCVSFARESETAGYLSLQAIDQAYRRCRQRKRKTYQACLYEQQLLDHLVQTRDALAHQS WFPRPPVVFTVTKPKNREVYAAHYQDRVVHHWLVRELEALIDRDFIHDAAANRTQRGTHFAVARLQRFMR CYHTKAWFLQLDISNFFNSIHHPTLLALLAHKLAKAQRRHGLSSAKAGVLFEVAQRIIEQPCALQAIHIS AKEAYQRVPMHKRLECAPPHTGLPIGNLTSQFFANLYLNELDQFIKHQLRCRHYVRYVDDFIILHAHARQ LQHWQDAIASFLHDTLHLRLKTPVTLAPIQNGANFLGYIVRPSYLLVRHRVVANLYNTLHIQTKQCMLPV AQGYRLELTPKWRDALRSTLASYGGHFRHAQSYRLHAQILHDFPWLALLFCDVLCQRPRWQPECVTSMST QWRWYQTHYPGSILLVECGRELLCSALPGHIKAKVYQGSAHLAAWSVPMQRLADVQSELEAQQRYYIFCS EQGYLKGGLKRRCVRTLYLPQSQYFDILKSLPTF >gi|350574651|ref|ZP_08942908.1| RNA-directed DNA polymerase (Reverse transcriptase) [Thiorhodovibrio sp. 970] MPRRHRDLFGGIANFAALYAATERAARGKRRNPGVAAFLARLEPELLNLERELRGGSYRPGRYVAFEVRE PKRRMISAARFRDRVVHQALCAVVEPLFERGFIHDSYANRLGKGTHRAVQRYEHYRNRRAQVLRADIYRY FPAIDHAILKADLRRRIACEQTLWLVDTVIDASNAQEPVDLLFPGDDLLTPLERRRGLPIGNLTSQFFAN VYLDRLDHFAKEVLRAPGYLRYVDDFALFHDDPSVLAEWQVRIAEFLVGRRLLLHPRKTFIVPTAQPARF LGYELHAGGRRRLPEENVRRFRNRLRGLRDRWRAGRVSCQEVQQRVQAWIAHAEFADTLGLRHAIFRGGV FDPVRGPDRSPARGACCGAAPGTTIQGICARATATGTTPATGTTTSVSAWPVRSQAGADGFTDPSGERGS VQDRS >gi|355363092|gb|EHG10842.1| RNA-directed DNA polymerase (Reverse transcriptase) [Desulfobacter postgatei 2ac9] MKRQRVDMADLLAWKNLSLAVYKAARGKRSRPDVAGFLSDVDGNITNLRDRIQSGSFSLDQYRRFQIMDP KPRNITALSFEMRVLHHAIMNLIGENLIRSQIHSSFACMPGRGVHAAARYVQRGLRRSGWYVKIDIEKYF ERMPHDRLKQKLHNRFKGKAFLDLLDAIIDSFSERPGQGLPIGALTSQYFANFFLETADRFIQGPPYIKS YCRYMDDMIWFTADKTAAKTSLQIAVNFLSGQALKVKDTWQIQPSRHGVTYCGYRILPFSILLSRRKKRN YRRRLRQWEDQWRNGNISDLALQRGYDAVHGMTAQTRSVGFRKSVIAAGPLLDV >gi|355386195|gb|EHG33235.1| hypothetical protein HMPREF9467_00846 [Clostridium clostridioforme 2_1_49FAA] MIESKTDFEKITDFGNLYQAYIKSKSGKGFSKSRQRFQITALDGIHQIKRRLETKTYEVGKYNEFTVYEP KERIIKSGSFVDKIVQHSLCDNVLLPCLKTEFVPNNFAGQVGKGTLFGLDWLRAQMYLAYHKYGYDCWIV KADISKFFYSIDHDILKDMVRYFLKDDDVYWLCEKFIDSTKGFGLPLGNQLSQVFALLYLSGLDHFITGE LGVKYYGRYMDDFYLIVESKEYAKWCLSTIYEFAHSLGLELNGKTQIIPFKNGIKFCGFHTYVTIDGKVI RKLKNENKRVAKKRFKRMAYLVKKGKLNREKFDESYNAWKNHISHGNCVKLGHEMDKYIVVVLKE >gi|355625064|ref|ZP_09048006.1| hypothetical protein HMPREF1020_02085 [Clostridium sp. 7_3_54FAA] MRILKYDIEKVTGIPWKKKKSYRYLYRLACQEDVIKKAFKRMRKGKTKRKDFQMAEENLDAWVKKIQEII LNTKPDGWQTDPQKRFKPVKHNPVIIKEFGKTRVVYVPTMVELWIQHVIVMILEPIIAGSSYPMSFSSFP GRGSLKGQRAIRRWIESGKGIRNFAQADIRHFYSHIQYKIVRKKLERRVKDNFFLHLIDVCMTYFPKEMP LGFYLSQWLANFMLQELDYDIKCKLKIAHHVRYMDNYTLADDNKKKLHQALLYIRQVLGKMRLRMKSDWQ VFRFEYTKKNGKKTGRCVSAMGWLFYRSKVLIRKRILLHVERIARKLHKKEENGQRFPLGLCRGFVSLLG WITHSETYDWYLIHIKELVNVRKIKRIISKMTREVNRHAGMEKRTLQRAA >gi|139439157|ref|ZP_01772609.1| Hypothetical protein COLAER_01619 [Collinsella aerofaciens ATCC 25986] MWKSSTQRYMKDYLRNAVKSRNDLLEGRDICRGFIRFDLWERGKLRHISAVHFPERVIQKSLSQNALVPA IVPTLITANSANIKGRGTDYALKLLKRHLADHWRRHGREGYILLGDFSDYFARIAHQPVKDQVASALLDP RVVALEHRLIDAQGDVGLGLGSEPNQICAVAHPNRIDHYVTEMLRPESYGRYMDDFYLIHESKEYLQVCL LLIGRKCAELGIELNPRKTRVVKLSRGFTWLKKRIFYTETGRIVVKPCRDSITRERRKLKKMARMVADGV MTPEQVERSYQSWRGGMKRLDAHRSVLAMDALYHSLFENLAREGGAQCRPTRGTIQAEASPRNSGEPATQ SSGLSEAAAK >gi|150007547|ref|YP_001302290.1| reverse transcriptase [Parabacteroides distasonis ATCC 8503] MRRKGDFSGDIARKENYYKAFDHASKNKHGKKAIIKFEADLEKNLSDLLYSFENGTFVTSPYRFMTVHEP KKRLIGMLPFPDHVQHWAMLNEVEDYFTRSFSAYTYGGVKGRGPHAYMRMIRKVLRKYPERTTDYLLCDI HHFYPTVNHPVLKSQLRTRIKDNHLLRRLDEIIDSVEGDTGMFPGTKLAQFFSLVYLYLFDHDLKRCFHV GECPALVEYYTKRYIEESIATAKTEHDYEELSKGIQYLSDRFKGYLNRLDFCYRLADDVLILHEDTVFLH LVIEWIGLYYANELRIGLNPRWKIGHVTDGVDTGGYVHFPDHVRVRKRNKVALCRQIARLRKKGLPDEEI RKRASSRIGFIQHADTSNLLNKLGMETPRKRLGQVIRNKKSPWEDLPADRKMRFEDILYDTRIPEDKRGL EDDKLIELIDYKIEDSKIEKNEDGTPKKCLAIRFRWKGEERYAFTGSAVLIDQALTDFSHEDLPVDTVIK VLTNKFGKKFFRFT >gi|154502436|ref|ZP_02039496.1| hypothetical protein RUMGNA_00249 [Ruminococcus gnavus ATCC 29149] MHLQKLKNIVWTKRIGHLFERVTDIENIKRAIKRAAKRKTHRPSVQRILNDIDSYARKIQEMLVNETFVP AKYTIREIYDGIKKKKRVIAVPRFYPDQCIHHAFVQVFREIVEHGADKFSCGCVPGKGTDGARKMIKHWI KSDPIGTSKVLKLDVHHCYPTMNHEALRQKLEKKIKDRKLLNLAFKLIASYQQPMADHTRMLPEVDAVGI PVGLYTSPWFCNFFFQDIDHMIAEKTGAKHHTRYVDDIVLFDSNKRRLHKALRMIADELRKVKMQVKANW QVFPLKDRPLDFLGYKFHAGAWTTLRKSIMFRISHKAKKISKISYISPTNASGMISYMGFIYNSDSWNFW KERVKPFINLKLLKGVVSNENRKQHQAACAA >gi|160944040|ref|ZP_02091270.1| hypothetical protein FAEPRAM212_01541 [Faecalibacterium prausnitzii M21/2] MKFFLEREFRQHGPDGYALLIDVHGYYASIRHETTNQRFERKLPPSHYKRVRDVLDHQYSGETGYNPGSQ MVQLAGISVPDPIDHYIKERLRADKYLRFMDDSIIVHHSKEQLEEWCEAIRQQYAAIGLELHPRKTRIVR LQDGFRYMGFIYRLTPQGKVIMTVDPKNVKSERKRLFRLAQLVKAGEKPKSALYEQYRSWKAHAAKGNSD TLLARMDEYVKTLLEGIP >gi|210634695|ref|ZP_03298223.1| hypothetical protein COLSTE_02147 [Collinsella stercoris DSM 13279] MIPKTCQTAQTAALSAVHLRKEGATIGRVPVNLHPETVAVRFLHGAGFGTCPRFMVIRQAACGCRMKPFG VPPMNSDERRVARRARRDSKRAANRSARIEGCTLESIADLDNLYQSAIDASRGVSWKSSVQRYMLRVVPN IMRARRDLLSGKDFKRGFIEFDIFERGKLRHICSVHFSERVIQKSISRHALAPAIWPTLTEGCAANIKGR GTEYAIKRLKRQLVNHHKRHGERGYILQIDFANYFGNIDHEACKRLIDRALDDDLVKAVVFDQIDAHGTR GLGLGSEPNQVLAVALPSPIDHLMLSTPGILASGRYMDDSYCIALEKSVLYDALKRIESLCDELGIVINR KKTRIVKLSRGFVFLKKRFYFGDGGKVVVRPCRSSITRQRRKLKKQAALVGCGMMTKEQVNQSYQSWRGG MKRLDAHDTVLRMDALYKELFG >gi|229815313|ref|ZP_04445648.1| hypothetical protein COLINT_02359 [Collinsella intestinalis DSM 13280] MPDLNSDERREARRARRAEKRRRNRDSRIEGLDIEAVADLNALYRAAMQAGRGVSWKASIQRYQKDVLKN IVRTRRDILDGNDLHRGFINFDIVERGKRRHISSVHIAERVPQKALAQEVLIPATVPTVIGANSANIKGR GTDYAVRLMKRHLADHYRKHGAEGYILQMDFRDYFARIAHEPLKRQLSSRLDDGRVLSLAESFIDVQGDV GLGLGSEPNQICAVAFPNAIDHFVTEMCGVEAYGRYMDDSYCIHTSKEHLVMVKSAVEILCGDYGIELHP RKTQIVKLSHGFTFLKKKFFYSESGRVIVRPCRDTITRERRKLKALKRMLDSGDITMEQIEQQYQSWRGG LVHLDAHDTLLSMDALYRELFGGSVNGNPPPATVVRIAA >gi|257792377|ref|YP_003182983.1| hypothetical protein Elen_2643 [Eggerthella lenta DSM 2243] MNSDERRAARRARREAERARRKAERNAGCDLEAVADLNALYKAAKQAARGVAWKASVQRYQADVLRNVMK ARRDLLEGRDVCRGFIRFDLWERGKLRHISAVRFSERVIQKSLTQNALVPAIAPTLTYDNSANLKGKGTD FAIARMKKQLARFYRKHGADGYILLVDFSDYFARISHGPAKAIVAGALEDRRLVALEHRFIDAQGDIGLG LGSEPNQILAVAFPSYIDHFAAEMCGLEATGRYMDDSYYIHESKAYLEVVLMLIEQKCDQCGISINRKKT RIVKLSRGFTFLKKKISFGENGRIVVRPSRESITRERRKLKKQRKLVDLGMMTPEQVERSYQSWRGGMKK LDAHRTVLSMDALYKDLFSNPENASRGGVSLK >gi|291522399|emb|CBK80692.1| Reverse transcriptase (RNA-dependent DNA polymerase) [Coprococcus catus GD/7] MKTYCKPATVNVEDWKFNEVAVIECFRNKRGRNDFQRLLCKTGKITKRQIAEDRLNKDFKRTLEAESEVA KMLTQRIINRDLQLKPIRQFQRIDGLTQKLRDICQESPEQQVFEYIGVYALKPLFRAKILPIQYGSIPNK GGVAGKRKIERLLRKKFHGKVVALKGDVTKAYPSVTIPVVMEMLRRDIGKNKVLLWFLGALMSNYPGNHL CIGGYLPAWLFNYVMSYVLRYIYEQAQIRRGKRNRLVYAVVCYADDFTIYGDLSKLKKAMKKATSWAHDK FGLKIKDIWQFYQVASFDEERENLEERKKGSKKRTPGVDMMGYVVRRRYTIIRGRVFRRIRRQVLRAWMD FREKGFIPWWRACRIAAYKGWVKHSNSLKFRVLYCFDELFKMCSYSASKHGKEVENEKRILLIAAISD >gi|291546485|emb|CBL19593.1| Retron-type reverse transcriptase [Ruminococcus sp. SR1/5] MVADTHVSKGKRENTERLRFYDNREGNLEEISTLLRAGKVPKVEYHSFYVYVPKVRKVIFIDYWSKVVQR AIYDVLNPKICRTFIEHTYACVKGRGQLAAMEQLYTWMRETRTSGTEWYYYKFDVAKFFYRIDHEILMDI CRKKIDDPRTVDLLGYYINNDAVPFGMPLDANQLTITEEQMLYDLGIPIGGGLSHMLGNMYLDPLDQFCK RVLGIKRYIRYMDDIIILDNDKERLKGYGRRMTQFLEERLHLNFNNKTALRPVRVGCEFVGFVIYNDHVI LRKSTTLRMKRTLRKTRQDYHGNLITFKEANATMQSYLAMLSHVDCKKFKEKLLDEFVLTHADDNGEEQI INVSETGGIGALDYEAMYC >gi|291561321|emb|CBL40120.1| Reverse transcriptase (RNA-dependent DNA polymerase) [butyrate-producing bacterium SS3/4] MMVNTKHDTTSYENCSCQREIYDGNALYDAYLRAKSGSDWKPQVQRYEMAYLLDLSKMQRELKEHTYEFQ PCSSFPLNERGKTRFITGEQIRDRIAKHSLCDEVLTPAIKDHLIYDNGASQKGKGIDFTRRRLEAHLHKF FRENQSNDGYILLMDFSKYYDNIRHDKLMELFEKYVDDDTALWFLEKIVDNEKVDVSYMNDEEYESAMDD VFNSLEHEKVDKNLLTGKKFLRKHLNIGDQVAQDAGIAYPIPIDNYIKIVKSVKFYGRYMDDSYVIHKDK EFLKGLLIEIVEIAHGLGITVNLRKTRICKLSEMWRFLQIQYSLTDTGRVIHKIHPKRLTGMRRKAKKLA LILSEKDFDDWFRSWFNGHCHYMSKLQRSNMLDLCKKLKEEHYYGKTDFS >gi|293369862|ref|ZP_06616435.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f] MGEQSNLSIGHSPGDEPGKTKTVNAESSNAWYVNMNNGNVNTNNKTNAGRVRPVSATDKPIYDIPLSSIV HAFDVCCKNKRNTDDCIEFSFEYDTDLVAVWDAIRYGRYEPDYSKCFIRKKPVLREIFASAYVDRVIQHW ADLRLDPILEERFQAQGNVSKNCRIGEGALSAVTYLNGMIMEVSENYTKDAYIFKGDFKSFFMSMSKSLL WEMIDLFIRDNYKGDDIECLLYILRTVIFHQPQHKCYRKSPLHLWDELPHDKSLFHADPDHGFAPGSLHA QKFANFIGSCFDYYVSEILGIKYYVRFVDDFAFVMRNKEDILNTVPLLDSYLQEQLLLKLHPKKIYIQHY SKGVLFVGAFILPGRIYISNRVVGNLYDVINKYNKIAEEGFAEAHADKFVATLNSYYGLMRHFNTYNLRR KVAKRINPAWWKYFYVQGHWEVFVLKNEYNFKKQLKKQIRKGNAKKYLTPEIG >gi|317498222|ref|ZP_07956521.1| hypothetical protein HMPREF0996_01502 [Lachnospiraceae bacterium 5_1_63FAA] MPRFFPDQCIHHAFVQIFKEVVTHSAYEYSCGCVPGKGTDGARKVITHWIKTDPAGTSKVAQLDVKHCYP TLSHAKLQGKLKKRIKDKRFLRLAYKIIASYQQPMATGERLVPEIEAVGISVGLYTSPWFLNFFFQDLDH KISEKCGLHHYVRYVDDIVLFDTSKRKLHKAVKFIAEELEAVNMRLKHTWQVFRLKFRPLDFLGYRFHTD HVSLRKSIMYRMSRKARTIAKKDYISLTNASGMVSYMGYIKATDSYRFWEKYIKSYVNIKQLKGVISHEN RKQLKTAAAV >gi|336435571|ref|ZP_08615286.1| hypothetical protein HMPREF0988_00871 [Lachnospiraceae bacterium 1_4_56FAA] MDKDIITNFENLYRSYRKAKLGKSHNGSCARFQNMSLEGIHLLKEQLENQTYQIGKYSQFKIYEPKERVI MSCSFKDKVVQHCLCDNILHPRLQNVFIETNSAGQIRKGTLYGMDKLKEQMLSFYNEHGLDGWILKCDIT KFFYSIDHEILKDIVDYYFDDEYTMWLNHLFINSTNVVGLPLGNQVAQVYALLMLDCVDHMITGELGIEH YGRYMDDFYLIHHDKEYLKECLLHIEEMVSSLGLSLNGKTQICPVKNGLRFLGFHHYVTKDGKYIRRLTS KNKRRAKKKVRNMVRLLKAGRITEKEFEQRYKAWKNHASHGNTVKLLHSMDLYIKSELEKGYG >lcl|JQ680356.1_cdsid_AFB75680.1 MKRITGLMKNICTMKNALIAYQKARRCKRYRPEVLEFEANREEYLSKAIRELESLTYTPGKYKVFKVWEP KERIIMALPFYDRVIQHMIVNYIEPIFEHQFIYHSYACRKGKGAHRASKQLTRWLYNLEVVQGKSVYVLK ADIHHYFQSIDHKVLKREIRTYIKDKDLLVILDRIIDHNGIFPDGVGIPVGNLTSQLFANVYLHRLDMFV KHTLHAEHYMRYMDDFVIISEDLEQLKRWEKQIEIFLADVLKLQLNPKTTIVYAKNGVDFVGYRHWNSTK KIRKDAMRRLKRLMKNFKDGTITEEFFDKSFTSRIGSIKHADTYNLVQKITCEAKELKESHA >lcl|JQ680366.1_cdsid_AFB75882.1 MQSIKNIYEKIYDFENLHKAWEEARKGKRYRDDVLIFNRNYEEQLINIQNHLIYETYEVGKYHTFYVYEP KKRLIMSLSFKDRIVQWAIYRQLFPLYEKTFIFDSYACRKGKGTHKAADRLQYWLRQTERKPERYYYLKM DISKYFYRVDHDILLKILARRIKDQRLLNLLEKIINCESMNFGLPPGKEPDEVAVSDRLSNKGMPIGNLT SQMFANIYLNEVDQYAKHELGLHYYIRYMDDIIILHHDKKYLAEVKELLRAFLSDELRLDLNNKTTIRPC SMGVDFVGFRIWSTHRRLKKKTAVKIKRNLKNQIAKVKAGEERKDRLDRSVASYRGILPHFNSYGLRQSL NTLFKENGMVEKKEEVSRCTGNCNNCPNYTESYFCGFATPYCKLQGKEITHNVY >lcl|JQ680360.1_cdsid_AFB75737.1 MKTPEYGAGGGENNSYGRGGNVSHGGNGEKLNMTTEISPEKIPGMVTLDDIYDEICSYEGLYQSHLEARK GKRYRDDVLVFTDRLEENLIELQNELIWQTYKVGKYRQFYVREPKLRLVMALQYRDRIVQWAIYRQLYPF YDKMFIEDSYACRRGKGSHKAADKLQYWLRQVSRKPGKWYYLKLYISKYFYRVDHLVLLEILSRRIKDPR LFQLLREIINSEDTHFGLPAGVSPDECPEEDWLSDVGMPIGNLTSQLFANIYLNELDQLCKHELHLHYYI RYMDDVIILLPDKKELARVKAIIEAFLNDYLHLDLNNKTAIRPCSLGIDFVGYHIWATHRKLKKQTARKI IHAVDWMCEQGEKGNMSKEEFERRVASYRGILLHCDSYGLRKKLNSIYFDHVVTEEPEKQEAKQKPECAE RKCYTCQNFRREFFCGYGACRCDIYGSLDVDQKERHPDRTAATCPDYTPNE >lcl|JQ680358.1_cdsid_AFB75726.1 MYTFDNANISFHKAAENKRFHEEVLAFSMSKEDELLRACEEVETLTYSQGPYTVFKVWEPKERLIMALPF YDRVVQHMIVNAIGPVFEERFYCHSYACREGKGMHAASNQLYKWLYELMVVQGLRIYAFKGDISKYFASI PHDGLKDENRRYIGDKKALYLMDNIIDRNGILPDGVGIPVGNLTSQLFANVYGNCLDKFIKHTLHIKYYI RYMDDFIILSQDLNQLKEWEKRIEEFLEEEMKLHINPKSTILYAGNGVDFCGYIHHPTYRKVRKGSVRRL KKDVKHLKAGELDRETFDRKYQSRLGHMGHADTYHVTKAIEYDLLFWEFEQTQSGLLVPA >lcl|JQ680372.1_cdsid_AFB75999.1 MSAPLYFAVRKRLDGAKDLATWQKMTNAQKDAGRPVTVRTLPSKAKQTLRHSEPQKQAMKRYGNLFERVV EYGNLEQAFHNAARHKTRRSEVIEYGSHLEANLLQLQRELITGTYRTSEYKTFIIYEPKERKIFKLPFRD RVVHWAIMQVIEPIWLSNFTRDTYSCIRGRGIHPLLYKLRRDLQADPDGTRYCLKIDVRKFYPSIDHEIM KRVIRRKLKDARLLALLDGIVDSAENGVPIGNYLSQFFANLYLSELDHVIKEEMGIRYYYRFADDIVLLD GDKGKLHGTLVFINHYLNNERALSIKPNYQVFPVESRGVNYVGYVTFHDYCLARKQNKKNLCREVAKLRK RGMSDEKIRIKASSRLGFMQHCNSIYLLKTLNMKTFSEVTNSSGNLTGDKYHIDDILNREIHLKGFEVKE SKYKGECLIIQYDIYEQVKDKAGALLTNEDGTPKMDWVEHITFTGSEALIKQLKDVVLDEPCSAKIIKQP IGDRGKCFYKITDPD >lcl|JQ680374.1_cdsid_AFB76048.1 MAGAIHGRDIFMENIVNSFTSLYKAYRKTRCGKRDNPTAMRYCMEAIERTDDLAGRLQRREYTFGPYYPF KVYEPKERLVLAIDFEGKVVQHSLCDNVLEPVFSRRFIRDNYAGQIGKGTHDGLDRLAKAMRHYFFSRKA ADEEARRAAGLPYRPMEEWDYADGWVLKGDFSKFFYTLLHAVCFDKARKALASLSDEELIDFVEWLLWLI IDSTPDPGVPIGNQSSQLLALLYLDGFDHWLRDDLGLVYGRYMDDFCIISSDKLLLREILKQIRAYIEPL GLRLNGKTQIFPLKNGIDFLGFHTYLTSTGKVVRKVRAKSIDNMKRKIRKFRGLVDRKKMTLESVAQSYA SWTGHISHGNTYHLRQSMDAYFFAYFPELKPSPKGEKPYASETRQPPKQGQDQVRQPLRRADHLDQG >lcl|JQ680364.1_cdsid_AFB75824.1 MPLGKTSKYMGCISRPSPDGLKVRFSAYRREQTSVKRYGNLYEKICSMDNLYLAFQHAKKGKGWYKEVQQ IEKRPYYYLAGLQWMLQNHLYKTSEYATFTKKDGKKEREIYKLPFFPDRIAQWAVLQVIEPQLLAYFTDD TYSAIPNKGIHAAYKKLRLAVDTVPEEMTYCLKIDCKKFYPSIDHETLKQKFRRKYKDPELLELIDEVID SISTCPATDENIEFYRSCGNEIKIVKVNGKDFIEGVGIPIGNYFSQYDGNFFLSGFDHWIKEVKRVKHYY RYMDDICIFARTKEELHQLLAEINEYFIQNLKLRIKGNYQIFPSFIRGIDFVGYRIFLKDTLLRKSTCQE FERKMTAIRKKIESGQEMNYSEWCAINSYKGWLKYCDSSRLSEKYIEPIQPYADRYYKDHIKKGGKKHEK VRKSTQYKAA >lcl|JQ680365.1_cdsid_AFB75875.1 MTTPSPPSCMGRQGQVYNVGINPRHRCQAVPKGKDATMTKFPLFIKTAQNIKRPQIPPIMPLAPYEEAVG YERIKGSYKQALRGRRKYTREAVKYDLFREKNNVDLWRELKSSKYTPGPYHFNVITEPKRRDLSIPQLRD KVVQLVIHEELQNMFRPVFINGSFACQYGRGPIRAAFKVQHDMRVARMKWGDNAAIIKIDARKFFYSIDR ALLKKILAKRFKKLKKKRPETYEDLLRFYRLLCKVIDSSPEGETGIPLGNVSSQDFANIYLNELDQFCVR FLGAKLYTRYMDDIVIVAPDKETAREWLAKIKVFLHERLHLDTNKKTKVFYMRQGVNAYGFKIKATHMML RTESKRREKRRVKAMVRKMREGKLTRSAVVQAVNSWLGFARWACAYNLAKKIFAPYRFIKTEGAIPYGAI SRNRQARRLLQQRRDLEAPYKAVA >lcl|JQ680355.1_cdsid_AFB75656.1 MPRMTKFPIMLYNTKNTKKALVPPIPPPSNYEDAVGWSAIEAGYKTALRGSRKFTREAVLYDLYSEVNNV RLWRDLKKIEKTRQAGVSEYTPGKYRHRIIVEPKERSLHIPPLRDKVVQLVIHQELQTLFRPVFVNRSFA CMYGKGPIRAAFNVQHDMRVARMKWGDEATVIKIDVRKFFYSIDRSVLKQIIAKRFKKLKKKYPEKYEDF LRFYRLLCKVIDSSPEGERGIPLGNVSSQDFANIYLNELDQFCIRFLGATLYTRYMDDVVVIAPNKEIAR EWLAKIKVFLQERLHLETNQKTKIFYVRQGVNAYGFKIKATHLLLRTESKRREKRRIKRMMEKLQEGTIT KAAIVQSVNSWLGFARWACAYNLAKKIFAPYRFIKTEGEIPYGAISRNRQARRILQQRRNSQAAHKAVAA >lcl|JQ680373.1_cdsid_AFB76017.1 MTSEERHEARYRRRKAERQRRRDARSEACGSFEQVFSYEHLYRAGRECCKGVGWKCSTQRYLGNFTANIA RTHRELMDGTWKTKGFFAFDLMERGKLRHIRSVHIAERVVQRCLCDNALVPLFSAAFVYDNAASLKGKGI DFAMDRLTCHLQRYYRKHGTDGWALVFDFSDYFNSAPHAPIYAESERRIRDERVQKLACGLMEDFGERGF GLGSQVSQIDALMLPNRLDHFIKEQLHIEGYGRYMDDGYLIHESRDYLQECLKQIRAVCADLGIRMNEKK TRIVKLQELHFLKTRFYLTETGKVRRKMCRKSARRMRRKLKTFRRWMEEGRMTEEDIRTAYESWRGHMRR GNSYRVLRRMDRFYKRLMEKGA >lcl|JQ680354.1_cdsid_AFB75628.1 MSRRKGRYERRKTRREENRLRRAATVGGLHDVFGYDDMYKAGKKCCNGVRWKNSTQRFEMHLFSGTARRR RLLLERKWIPGAYVHFTISERGKTRPIDAPRIQDRQVHKVYTKKVLLPLYRPEMIYNNGASLEGKGFEFS KRMLKEDLRWHFRRYGRDGNVILIDFKQFFPSVSHEEIFKRHEKLLLNPDIRKIGDDVVNTVSGGVGLPL GVEPSQAEMIAFPSALDNFIKCQLSIKCAGHYMDDYYVIVPPDRDAKEIMALIVAKAESLKLTVSKSKSR IVPLTKPFRYCKAKFILTETGRVVMNGNRDGVKRARRKIKAFRTKIQNGEMSYDDLWTSVNGMLAYFESY DDHNRVLRLRRLFYSVFGFSPERIENFRERGKKDEICCA >lcl|JQ680367.1_cdsid_AFB75923.1 MRRITQHLHEHYRKYGNEGYILLFDFSKFFDNVSHEVVKAILHKEFTDERLLALTEHFIDAFGDKGMGLG SQISQVLALASANRLDHYVKEVLQVRGYGRYMDDGYLIHTSKAYLQNCVAHIRAICAELGITLNEKKTQI VKLSHGFSWLKVRFFITKTGKVVRKIYKRSVTKMRQKMKKLHRKYLRGKMTFADIYATWQSWRSYAARFN AWHTVQNMGALYTNLFINSKEDCYGLLQNPVC >lcl|JQ680352.1_cdsid_AFB75579.1 MKYTLSGNGNFRCSQPDAACMVSDCTVAHFMAGTTMQLEPVPHNNTVRREPLFMTSEERREIRYQRRKAK RDEARLKRSMVCGDFDEVFSFRHLYLSAKKCCKGVYWKSSTQRYISNLIPNISETLLSLRNGTFIRRGFH EFYIMERGKKRHIRSVHISERTVQKCLCDYCIVPIYSASFIYDNSASLKHRGMDFALRRMVYHLERHFRK HGLAGGILIYDFKSFFDDAPHEPLLAEAERRLHDDRIRALHNSFIADFGPVGLGLGSQISQTNALLLPSP LDHYFKEVLGIEGYARYMDDGYAIHEDLDYLKGECMLGLEEVTRHLGLRLNWKKTRVVPLVDFYRWLKTK FIVTPQGKVILKMNPASTKIIRRKLRSFYQKWQMDEMTLTDIRNSVDSYNGHMMRGNSYKVRERTNQYFK SMFGFYPNKKGWERNVSNVQRRSCTGNSYEPCLGQKAG >SRS011134.41192-T1-C MYYERIYGFDNLHKAFKLARRGKRWKPATARFEVNLLENLLRLSRELQDKTYELSEYHTF KVYEPKERDVMSNSFRDKVVQHSLCDNVLEILLRKNFLYDNYASQVGKGTDFGLNRLDGF MHKFYRQHGLEGWVLKCDIRKYFYSIPHEYLKRILEPYVPEEDVRWLLWYIIDSTADPGI PIGNQSSQLLAVLCLSPLDHFIKEKLGIKYYGRYMDDFYLIHEDKEYLKQCLKDIGMFLA PMGMQLNQKTQIFPLKNGIDFLGFHIYLTETGKTVWKIRRRSKGNMSRKLKKFRKLLDRG LITMESIHQSYQSWKGHALRGNCHHLVREMDELYNSLFKEDNKDVSITVESADRGESQVR >SRS011134.332117-T1-C MKLYDQICDINKIEKIYYNIKLKTKHKKKNLNFEMYYMSNVMYLYHVLKERKYKHNYYHI FLVKEPKYRIIMSESIFDKIVNHLISEEILLPLIEPKLLDINVATRREKGTKSGLNYVKK YINAMRKQYKNNFYILKCDISKYFYNIDHDILLSKLEKFITDKEIINIIKDILKTATDPA TSQKIRKIINDELERLAKCTFDTTYKVNTLKNLPIYDKPKKGLPIGNMTSQIFAIFYLND LDHYIKEQLHIKYYIRYMDDFILFHKSKKYLEYCYRKINEYVKKEKLSLNSKTKIFPMKE GLNFLGYRFIVKNDHLIILINPKTKKRIVRKINRIKNNKNYNQVLASYYGFFLGSNVKSL VNKYYKEHTF >SRS011239.9504-T1-C LAKNTSQRGRFSKSLKDRKKIKGYSMKRVGYLYEKMCDVDFIKKAIKNAAKGKTDRLYVK AILSDIDGYAQKLKAMLETETVKLSPNEHIEIYDRSCSKTRKITIPKFYPDQIVHWLIIT AAQPVITRGMYRYCCGSIPNRGGIDAKAYIETAIRDEKMRYVAKLDVSKFFDSVRPSILL DMLKRKIKDDKFLRLVGQVLENGGDHLPIGYYTSQWFSNFYLEGLDHYIKEVLHVKYYVR YVDDMVLIDSNKRKLHKAVKAIDDYLHGIGLKIKDNWQVWKINSRPIDFVGYRFYKNKTI LRKKIFFRLCRRVRKVKKTQHITPRQAMSLLSLIGWLSHINACGFYKEKIYPYAPKNKLK KIVSNYSKQNGGNLKNGKHDKKSIQQRKVARNGERG >SRS011239.23828-T1-C MKNTEQKPPSDFEVMADFNRLYSAYLEARNGKRWKYAVVRFEVNLLENLMALHFLLTSRK YCPSPYNYFLVHEPKERLIMYNGFRDKIIQHSLCDNVLEPRLAKTFILDNYASQKGKGTH FGLDRLKAFMQRYYRQFGADGWVLKCDIRKYFYSINHDVLKSQLRRIIDDPGVLWLLDLI IDSTEGPGIPIGNHTSQWFAVLYLSGLDHMIKERLGIKFYGRYMDDFFLIHPDKDYLIYC LEEIKKFLVPLGLELNHKTAVFPLTQGIDFLGFRTYMTDSGKVVRKIRRDSKNRIRRKLK KFRHLLDEGRIDFETVVQSYSSWTGHAEHGNSYHLIRQTNELFYDLFKKEMEEYHVEKIV DVARRRGGEIGQHEV >SRS011239.73586-T1-C MRDSAAKNMGKQFDEAIQFGALYKALKKCCRGVRWKPSTAGYEHYALANTYRLRQELLHG SYKLSSYQRFTIREPKVRDIVATRLRDRQFQRALCDAVLYPSITRSFIYDNGACQRGKGV DFALDRMTAHLQQYYREQKQAAEAATGHRLGRFCPGGWVLACDVRHFFDSTPHAVAKAAV AKRVYDSETVRHNARIIDSFGGERGIGLGSQVSQLNQLAVLDTLDHRIKETHRIRHYLRY MDDLALIHSDREKLEQVLADIRTQMAALGLELNSKTRIYPLRQGVMWLQWRFVLTDRGKA VRKLNDKKVGQERRKLRKMRKRVEDGRMTMAQVRDHYRCWKANAQRGNTRNLLKQMDRTY TDIMKEEPP >SRS011239.73933-T1-C MQKTGEYNSNNAFIYNGNTGNVNNNNKYNTNAVRPVSEFQGNVDPFASFYKSMRAAYRLC LKNKAHTANAIRFWLDEESELVALAREVFNCEYVPRQSIAFIVTKPCLREVVAADFRDRI VQHYIVMRLEALFEECGTLDDNMFSCRVGKGNLAAIQALQQQIFHQSKGYTADCYVAKFD LQSFFMSIDKRRLYDELVALVAKRYEGWDKDTLLYLIRVVTLHNPQDNAVRKTPLCDWAD LPRSKSLYNVDWFLGLAIGNLTSQSDANFYNAPAMRWMRSVGLAPVNYVDDFAFVVRDKA SFLTAMPYIRNYFAAERGLTMHPRKFYLQHYSKGIKFLGAVIKYNRVYTNNQTVARCFGK IHYYNEACRHSSRRKARHVEKLATILNSYLGLMRHFDTFNIRKRIAAEVGTVWCDYIRFD DDITTATVVKHFRQREICKYNVRKQRRRDLFTLKNLLNDGNTAAN >SRS011271.46681-T1-C MNLFEDANYLYDAGTKAMNGSKWKYSTQLFEINHLLETAVLQKKLTEKDYHPGQGQKFKI CERGKPRYITSSDMVDKTVYHTLSDDVLGPALKPYIIQENTASQKGKGVAMFRRQLENDL RRYYRVHGTNKGYILLTDFSGYYPNMNHDICKKQLSEFLDKSKLDAETITTAKFIIDGLF KTFETDVSRFSDDEIEKMYYTKIDPMMNCGVDPQLLTGEKMLRKGVDIGTQPSQDIGIIH PYKIDNCAKIVFSIEGYGRYTDDIRAISESKERLENLLEAIKKLADEIGLILNMRKTRIA RIDKPFRILQIQYWLTDTGRVVKKINPKSVTRERKKLKAYKRQLDLGKIDFATVENSFKS WIASNYKIMSRLQIDNMFKLYYSLFGRRITWKKKHSRLRWLMEQALKDLPKTETTS >SRS011271.56696-T1-C LKRVGYITNKDGRRITLLEAMGDYGNVQKAYNKARKCKRHRKDVLIFTKDKEENLDKVRE DIINLAYEPSKYHYFKVYEPKERQIMALPFYDRVVQHAINNVLEPIFDKRFISQSYACRK GKGMHAASDTLKEWLYEWNKYHPDQPLYAIKADIHHYFQSIDHAVLKTEIRKVIKDAGVL ALLDRIIDHNGNMPDGVGIPVGNLTSQLFANIYLDALDQFIKHELGVEAYIRYMDDFVIL SPDKEQLRSWLARIEQFLREELKLEFNPKTTILAAKNGIDFVGYKHRATHRKVRKDSIKR IKRTIKKCESGKITKEQLQKSIQSWTGHAGHADSYNLRKKIETLAEAAIEKAA >SRS011302.2684-T1-C MPKTARAIRAYDEELTACIVRRDLNLLPIRCFQRVDGLTQKLRDICQESPKQQVMEYIAV EALHPLFRAKFLPVQYGSVPGRGQALGKRKIERILRKKLTGKTDVAKGDVKKAYPSVTVE CVMRLLRKDVGKNKVLLWFVAALMENYPGGHLCIGSYLSTWLFNYVMSYVLRYILSLEQC RRGKSDRYVKALVCYADDTSIYGRFSQLVKVIKKATRWAKATLGLNLKPAWQVYHIASFE EEKAMKERRKAGCCQRTDGVDMMGFVVRRTYTIIRGRVFQRIRRQTLRAWDDVQRLGFLP WWRAARIAAYKGWVKYSDSVHFSVKYSFSKLLQLARVSVSQHNRKEIIKHEQRILREAAC CC >SRS011302.11334-T1-C MKRLNNIFETIGSMDNIISAAEKAKKGKRNHRGVRDYEKHKDEYHQNVYQMLKDKSYHVS KYEVIEKVTDAGKVREIHKLPFYPDRIIQHSLLIPMMDRWTKSLTLDSYNCLPKRGITSK VKKHSLVRKMKRTLLEMDKNGKIYVLKMDIKKFYPSVRHSVYKKAYSKDLKDRDALWLMN TLNYSNKGLAIGNPDAQIGSHLVLRSLDHVIKEQFKVKHYFRFADDMVILSHDKKQLHEW LWRIRNYLWYEKKLEMKKNYRIFPVSEGIDFGGFVFTPGHTKIRKRIKKNFASKRNNPKS ITSYMGMLMHCDSKNLINKVLVNNNSHMTKISDLNIRVSRKFDGKDVKIDKLVDEHIDIL DFDVRPSTKKDNSTWVRMQIMFKGEKCFMKGGYEALGAFLSQVDKSLLPLEDVVIKFNRG YYFDGTLDI >SRS011302.40176-T1-C MKQRYKELSHNLCVRAVLECFDKKWHRQDFVAVAEKYGGVSNAEIKREEAQSAVNKKLEA ADGIALELEQRILDLEDGDPEALDLDPVTERPRIDGISMKCRNVANCCVFHQCFGHLAFL GLEPLLRARILPYQHASIPHRGQSGCRRQVQRFLRRKALGIKYARKLDIRHAYENTKAGV IMGILKKEIPTAKWLLLLVEALLNMSPRGCLIIGGYLDAWLFNLVMSYILRYMLSLEKVR RGTRQRLVVALVAYADDVAIMGRRLADLRSAARTAAKWTLKTFGLTFKPGGDEVAFLSIE EEHRRRRLTRPAARGCPGLDIVGFVIRRTYTTVRRAIFRRARRQYLRAGREVDKSGTVPL FRAYKLASYYGYFTQTNSRKCSTTLRTEKIKPLACQVIGWATRQKERIHNEKCKHYAGPP AACRCL >SRS011302.41908-T1-C MENIVNSFNSLYKAYRKTRCGKRDNPTAMRYRMEAIERTADLSDRLQRREYTFGPYYPFK VYEPKERLVLAIDFEGKVVQHSLCDNILEPAFSRRFIRDNYAGQIGKGTHDGLDRLAGAM RHYFFSRKAADEEARRAAGLPYRPMEEWDYAEGWVLKGDFSKFFYTLLHAVCFEKARKAL AFLSDEELIDFVEWLLWIVIDSTPDPGIPIGNQSSQLLALLYLDDFDHWLRDDLGLVYGR YMDDFYIISSDKLLLREILKRIEEYIKPLGLRLNGKTQIFPLKNGIDFLGFHTYLTSTGK VVRKVRAKSIDNMKRKIRKFRGLVDKGKMTLESVSQSYASWTGHISHGNTYHLRQNMDAY FFAYFPELKPTERRQDSCLKPSEALPTRQKSSSAASTARRSSGSRQTKTTPATLLTAPPS >SRS011306.70708-T1-C VFSFKNIFRAYTDCRRNKRTSKAALKFELNALQELCNLQGELEGRTYAPSGAELFYSCSP KLREIFAPSFRDRVVHHLFINELKDSYKGVINGATYSNVKGRGTHKAMLKAREYMSRSKF YLQLDIKNFFYSIDKDRLFEIFKSDLRGMDTADKDELLWLAKTIIYANPTNGCKVMGGGR SMSLAAHKSLFTLPKNKGLPIGNLTSQFFANVYMGKFDEFVQNELEVKEYIRYVDDFVLF HDSKSYLQKYLLEIKEYLARNLALELRRDVKLRANRLGLDLLGYIVRDRYVLVRNRAVKN YKFKKTSYLNKYEKKKGKM >SRS011405.42254-T1-C MTDGAIVQFISGAMSKYTDANYIHEAGTKAMKASMFKYKTQLYEINHLLETAHIQKAMED GTYKPEPGLKFGIKERGHARYITSAATADKAVNHITCDEYLTPLLQKYMQYDNSSSQVGK GVAFHRHRFEIQLRKYYEREGTNEGYIGFSDFSGYYDNIVHEVALAQFSQYLAREIKDPE ELADVMDKLRLAFRTFELDVSRFADEEIEKMYHEKVRSTLNVGVPSSALTGEKMLRKGAD IGDQVSQNTGIFLPVPIDNYVKIVCAVKGYGRYSDDFYIIDKSKEHLREVMAGVRQWAEK LGIIINEKKTHICKLSGQYRHLQVLYSLQEDGRIIRKISPKAITRERRKLKAYKRKMDAG VMIYAEIENSYKSWICANYKYMSRKQIHNMTNLFKDLFGKEPTWKKHGHGRLRWLMAH >SRS011529.6736-T1-C MKRYSGLHDKLCTIENIEVADDNARKNKNKKYGINKHDKNRQYENEDLVDKLFNLKYKTS KYSLYKIYEPKERIIYRLPYYPDRIAHHAIMNVVKDIWTKSFIHNTYSCIEGRGIHLCAS NLKRDLRKYPNKTTYCLKLDIKKFYPSIPHDGLKECIRKKIKDKDFLIILDEIIDSTDSV RNTSSKLTGKNGIGVPIGNYLSQYFANLYLSELDHLCKEELKCKFYYRYADDIVILSNDK DFLHKVLIYIKLYVNTIGLKVKDNYQIYPVDSRGINFVGYVFYHTHTLIRKSIKYKIIRL VNIYLNREIDKKEFKVRMCAYYGWLKHANAKNLLYKIQSLTGVRYSNWNGKRTNIAKYYG KYVRIIQVINYAKYFRINFIRNGKAYYADSRDKTLFYSIHRLNHFPINFKITKYDWRIYT KNRKENIKPQT >SRS011529.234240-T1-C MVNTKCDTSYYDSRGYQRKIFDGNVLYESKAKAMKGSDWKPQVQRFNMTYLLELSKMQRD LEYMEYEFLPTTNFTLHERGKLRRITGEQVQDRIVKHALCDEVLNPLIEPHLIYDNGASV VGKGIAFTRKRLLTHLRKYYAQHGSNEGYILLIDFSKYYDNIRHDVLLKLFEQYVDDEHA LWLLRKTVERSRIDVSYMSDEEYEHCLDRLFDSLLYQYMNPKLFTGEKFMGKHLNIGDQV AQTAGISYRIRIDNYVKIVRGVKFYAGYMDDSYAIHESKEFLQELLEDIIEIANELGITV NTRKTRICKLSEHWRFLQVQYSLTDTGRVIQKINPKRLTAMRRKMKKLAPKLTEKEFTDF YKSWFKNHYKIMSKKQRSNMDTLFNQLKEVTKCTLSPLPMAKS >SRS011586.17386-T1-C MKRFGFLYERIVSVENCRLAILNASQKKKKQKMVKEVLDNLEYYANDLSERMSRMDFLSP YKTRIIKDGLSGKERELQVPAFYPDQCAHHAIMQILKPIIEKSSYRWSCANIPNRGIDLA CKGVERATVRDRKHAKYCVKMDISKFYPSIPHGKLKARLREKIKDEKALQIIFKVIDSHN PGLPIGNYTSPWLAELYLQPLDYLIKQKHRIRHYIRYADDLVLIDSNKRKLRKALHDIFL FVEELGMTVKHDYQLFRIQPYCKERTGRRGRKIDFVGRCFGIGFTTIRKRRALALMRQSR FIQKLQRQNRPVSYRMASGFLSRCACFKHTNSYAMKKKYYETVNIKKLKEVVSNESKRKC LAQLA >SRS011586.24081-T1-C MKRIGHIKDRDGKAVTLIEAMADSGNIQKAYNKARKCKRYRKDVLIFTKDKEENLERVRN DILGLTYEPGEYRYFKVYEPKERQIMALPFYDRVIQHAINNVLEPIFNKRFICHSYACRK DKGMHAASDTLQRWLYDWDKYHKDQPLYAIKADIHHYFQSITHEILKAEIRNIIKDKQAL VLIERIIDHNGQMPDGVGIPVGNLTSQLFANIYLNKLDQYAKHMLGVGMYVRYMDDFIIL SPDKEKLRYWLAEIERFLRDELRLELNPKTTILAAKNGIDFVGYKHRATHRKVRPDSIKR IKKTIKKYERGKITKEQLQKSIQSWTGHAGHADSYNLRKKIIMLA >SRS011586.214693-T1-C MTKTFQEICTFEVLYQAYLDARRGKRGKASAAQYEANALMLTERLANILNTKRYIPSKFE TFYVYEPKKRLVQAPAFVDKVVQHAIVDNYLYETITRSFLLDNYASQIDKGMHFGLDRLK FFMVDYWRKNKTTDGWVLKCDVRHFFASINHDTLKEKLRKKVMDDDIFELMCVYIDASAD GLPLGYQTSQLLALLFLDEFDHFVKERLRIKYYGRYMDDFFLICSDKAYLQYCLKEIRQF LDGLGLELNEKTHIFPLKNGIDFLGFHTYLTESGQVVRKLRHSSVKKMNAKMRKWEKDYP KGEVTKEKILDSWTAWDAHAAHGNTYTLRVKVAARVSKIVGIPLKCHAPIRLSKIQKAQL VYKQRLKAANIAAQSAESEHREDNVPW >SRS012273.12949-T1-C MKRLSNLYEQIISLDNLRLADEKARKGKLRSYGVKRHDRNREANILALHESLKNKTFVNS KYEVFIIRDPKERLIYRLPYYPDRILHHAIMNILEPIWVSLFTEDTYSCIKDRGIHKAAD KVKKALKEDPEHTTYCLKMDIVKFYPSIDHDILKTILRKKIKDKDLLWLLDVIIDSADGV PIGNYLSQYFANIYLAYFDHWIKEVKKVRYYFRYADDIVILGDDPKQLHKLRIEIEEYLH DNLKLSLRKVDPKTGKKKWKFQVFKIDSHRGIDFVGYVFYHTHTLIRKGIKKNLCRKAAK LNKKKHISDMEYKQVICSWFGWAKYSNSKHLLKTIIKKQVYDTLRF >SRS012273.20517-T1-C MNDFEKVWNFESLYRAYMKARRGKRWKGAAAKFEVNLLEALNLLSEQLKSKTYRLSPYNT FKVYEPKERVVMSNSYKDKVVQHALCDNVLQPRVAPSFIKDNYASQVGKGTHFGLDRLEE FMRRFYRKHGVDGWVLKCDIRKYFYSIQHDTLKGLIRKYIDDPECLWLLDMIIDSTEGNV GIPIGNQSSQIFALLYLSPLDHFIKEKLGVKFYGRYMDDFYLIHEDREYLRHCWKEIEEH VNKIGLALNEKTNIYPLKNGIDFLGFHTYISETGAVIRKVRRKSKNNVRRKLKKMRGLLE AGRITPETVEQSYKSWRGHASKGNCHHLIRNMDQYYNQLFGQAAAVAQKRGGGSASQKEE RR >SRS012273.83671-T1-C MKIKNVFDLIFSMENLYGALEDASDQRRYNKDVMLFNFNAWDNLKEIRDSVYDGTYTIDK YYIFYVYEPKKRMIMSIKFKHRVVQWAIYRVINPMLIKGYIKDSYGCIPERGPLTAMFRL KYWLEQVNRKDEQWYYLKLDISKYFYRISHRILKKILAKKIKDQRLLKLLESIIDCKHTP FGLPPGRSPGEVPLEERLFDVGMPIGNLLSQVFANVYLDALDQFCKRELQIHCYVRYMDD VIILSSSKAQLQEWKVRIASFLETELELQLNNKTCIRPINQGIEFVGYRVWPDKVVLRKK TTLHIKRVLKAKKEAYRVKEISFKQATDTLQSYLGMMKYCDCDALKEKILDDFVLTHADM KQIYEEGGDRDENYYRRGDGWSVEDYPVTDETHRPAGTCIAAAWNY >SRS012273.92946-T1-C MCSFDTLWAAFLRTRKNKRSKEGTAAFEYRAVEELLILSKSLSRGNHTPDPLDAFLIYEP KKRLIQAPSFRDKVVQRAENDFVIYPELSPSLTRNTYAAQRGKGTHQGVEHLAENMRTYF LRRKGADEAARKAAGLPYRPMEEWDYADGAVIKGDVRHFFQSIDHERLKRALAERFPDKR MQRLMWAYIDQVEGLALGHQSSHIFAVYFTHSIMHFINEKLGLALSGMYNDDWYVICPDM ETAREALRLTRERFSGLGLELNQKTNIFPLRNGIDFCGFHVYLTQTGKVIKKLRHSSSKR MKRRIRKWEEQYAAGEITREKIEESYVAWEAHAKHGDTGALRKQMRARLSAAMARAEHRR AALRLPAPEPGKIERRTKRYGTVNFQSGRRQPGEAGGADQAHQVHQAGE >SRS012273.325125-T1-C MQQVEYSALLDLDRLREVFFSIKNNSRHEDKIFAYEMFLTANLIGIKTILQDRAYVHGKY NIFLIHRPKSRVIMSESMTDKIINHLVSRYCLFPLIEPKLIDMNVATREGKGSKATLFYV KKYLHHLKENHKHIYVLKCDIHKYFYSIDHEILLSKLSHLIMDEEVFRLVKTIIDSTNLD YVNQELANVIKRRKRYIYHSNINEREKIEHLARLAKIPFYEKGKGLPIGNMTSQILAIFY LNDLDHYIKEKLGVQCYIRYMDDLILIHYDSNYLKKCLHEIEKQVHDLRLSLNEKTQIYE VTNQGFPFLGYRFVLRKKRLHVLLLSNTKRRILRRLKKVADNQKYDFLHQNYFGYLLPAD SRGFLYEMKKKRLR >SRS013476.16684-T1-C VVELEGLFEAYYDCRKDKRDSINSLSFEVSYEEGLVDLCNEINSRTYKPSRSIAFIVDHP VYREVFAADFRDRIIHHYIALRIEPLLEAQFTNRTFNCRKGKGTLYGVKRLHEDIRMCSD GYTKDCYILKMDIKSFFMSISKKLLNKRMDEFIRENYNGPDKEDLRYLSSITILHNPEEN CIRKSPETKWRFLPPGKSLFLQDKDLGLAIGNLTSQLYANFYLDPFDHYLEDTLGFVFHG RYVDDFYIVDENKDKLIAAISPIRKYLEAECRLTLHPNKVYLQHYSKGVKFTGAVVKRDR IYISNRTVANFRNLTYRLNHTEINDFEKVRKCTDGMNSYLGLMKHTLSYAIRRKNLSNID KKYFKYFYISGHFEKITIKRKYTDKTNAINRVKKGGLNGYD >SRS013521.1308-T1-C MKKKAAALREYGDFETVFSFERLYESYRASVRGVGWKASTQRYKAASLANVTKTHEELIA GRYRSKGFYEFDIVERGKPRHIRSVHISERVVQRCLCDYCLVPMLSRSFIYDNGASLRGK GYDFAVSRVTHFLAEHYRKHGREGYVLVFDFSKYFDTAQHEPVFREFERSGIDDRLVALS KYFIQNFGDVGLGLGSQVSQIAALALPNRIDHYIKDVLGMKYYARYMDDGCIISESKAEL EICLRELRRLCAEHGIRLNPKKTQIIKLTRGFTFVKVRFRYGANGKVVRRATYKGIRHMR AKLRIFRRWVDSGRMTAADVETSLVSWRGHMKRFHSYHMEQSVERLYRELFKGG >SRS013521.48931-T1-C MKRYCKNVNILDFEFLEFCASECLRKKWKRRDVLEFFSDITGRSKDEILHAIQNGEKPAL IVIAARYMQKQLAKKELYFVPIWYKEKIDASNHKLRRIGIQHVSQQLYDYVAVYAMRDLM KRIGEYQCASIPGRGPYYGLKHVRKWLRAHDAKYVAQLDVQKCFPSIPQDKLLAFVDKYV ANADIRWLVRELVSTFDVGLSIGSYLSQYLCNLYMSQLYHEISERMCYTRRGKPIRLVKH VLFYMDDILLIGCNAKAMHKAVDAIIAYAKDKMGLTIKPTWNVRKISETDFVDTMGFRVY GDHVTIRRRVFLRVRRAYKKPTKKRYRKMTEKQAQKCVSYFGCIKGTNLLRFAKKFHIFT TIKQAKGVVAYESKLRCRAARRSVCV >SRS013542.671-T1-C MAIALAYHDCFMLYSQAAMEVALHPDSVSRWALGAASDTNSTLLYKGLHMSIKRVEARHK RRQAKRQRNRAEQTRTATFENTSSLQSLTDAAYEACKTIKWKNSVQQYMNSAMLNTLRAH KLMVSGGKITGRRKCFTIMERGKVRHIQSSAFWEKVIQKSIARNVLIPCYTRSYTHGNSA NQRGRGEMYAIKLLRKQLARHYRKHGSQGWILLCDYSNYFASIPREKVLQQASERIQDAR IMPWLEQLMNAECDSGLGLGAETNQQLAVGYVSRIDHWIEECSGCEATGRYMDDLYVIDS DLLKLRSTLDEIKEMSAELGLTLNSTKTYITSLHHGFTLLKKKWYYTQTGRIITRPIPKT IKRMRRHLRALTRLADRGEISWKQITAMYHSWRGTLKHYNAWRTIQSMDAYYKQLKAKNK TL >SRS013687.54482-T1-C MRRIRERGEAENMHNVRCAYGNYSKSKHKRRAVRDYDKRLESNLQRVLDELCDESWQPSP YRPKTIFERKRRDLARAPIHDHVIEAAAILPYETSFYDYIAWQCPAVRPNMGQHALLRAL RNELYKYEQSEVAYTLSMDAHHFFPLMDHAILKRQILRKVKPGKLLNVLFKVVDSYLQGA PLGIKVSQIFGMLYLADFDRLAMRFFDIDKDPEKMAYWTSKYIEWRVITAKTPSDFDDLS RGSIYLADKFKAYVAEGLPHYCRFVDNIIFRHADKAVLGIVRQIAVAILARDYHVDINKD YNIRPTHMGIRICGYVFYHDHVLLSKNNKQELARHVVKLRKRGFTEEQIRLRQASRFGYA KHVDCINLFVKLGMQKSLGKIIKSHRIKSPFEEMTGNQKVNFSNICKTLSNAAGEAVRKP >SRS013687.79028-T1-C MHTKKEVQIMKRYGNLYEKICDLDNLKLAHKNAQKGKGWYKEVKEINANPDYYLKKLQNM LINKIYKTSEYETFMKNDTGKERKIYKLPYYPDRIAQWAILQVIEPILVKNFITDTYSAI PGRGIHKALHRIEKAIQTDVKGTQYCLKIDAKKYYPSINHEILKNKYRKLFKDKDLIWLL DEIIDSTPGDTGIPIGNYISQYSGNFYFSSFDHWIKEEKHIKYYFRYMDDIVILAESKEE LHQLRKEIDVYFKTKLKLKIKENWQVFPTFVRGIDYVGYRTFLNYKLLRKSTCKNFKSKM NKIRKKIKSGKLINYSEWCSINSYHGWLIHCDSYRLNEKYIKPLEFHANQYYLLNVKGKG EKK >SRS013687.79189-T1-C MESRNTTLTTCGECINSPQDRKAVNQQEDLLGQVEAPTSICFPLYNLIPEIISDENMERS FKRVMSNLHNADTRSGIKWREKVVIDGVECTPRMVRYMKRKKEIIAELKEQIGNGTFRVE RLSSFEVDDGPKRRMVQAPPVVKRIGCNAIMEIVEKHLSPLLIENTAASIEGRGPHGLFH KMQEVRAENPDLIYYYQSDYKGYYDHILHDKMIDIIKQYIADPILLPILIDFVKVLHPYG NEGISKGLRSSQFFGNLYHNDIDHAMIEECGKDNYNRFCDDIYILGDDKKEL >SRS013687.92260-T1-C MANLRSADTRSGNRQREIAVIDGIECSPRMARYVKNKHKILDALKEQIGNGTFRIKNLKS FTVDDGPKVRIVQAPSVIERIGSNAIMEPLEKHLSPLLIETTAASIQGRGPHGLFHQVQD TLAENPNIHYYYQSDYKGYYDSIDHDILISTIRRYVGDPVLLPILENFVKALYPNGKHGI SKGLRSSQFFGNLYHNDIDHRMIDEYGAKHYFRFCDDIFILGESKRDLWKLRDKLHYEAA QIGLTIKPSEKVAPISSGMDALGFVNYGDYTLLRKRTKVNAARKLSKIKSRKRRQQIIGS FKGMACHADCKHLFYILTKNNMKKFSEMGVTYTPADGKKRFPGKVMRLSDIVNIPIEIHD FETGIDTKEGEDRYLVSFRNPRTQEWGKFFTASVEMKGILDQISDIEDGFPFETVLKCEM FDGGKRKYNFT >SRS013687.123975-T1-C MKRLGNLYDKIISLDNLRLADERARKGKLRSYGVKLHDRNKEANLLSLHEALKAGTYRTS EYSTFTIYEPKEREIFRLPYFPDRIVHHAVMNILEPIWVSIFTADTYSCIKGRGIQAAAN KLRRVIDRDKAGCAYCLKIDIRKFYPSIDHTVLKSIVRRKIKDTRLLNLLDEIIDSAEGL PIGNYLSQYLANLVLTYFDHWVKEVKRVRYYFRYADDIVVLHSDKKTLHALLAEFESYLA ANVKLEIKQNKQVFPVAHDHRDSFGRGIDFLGYVFYLNETRLRKRIKQNLCRKIAKLRKR KKPLSEDEFKQTLAAWWGWAKYSDSEYLINKLNKITPYEIKFRR >SRS013800.28829-T1-C MKRIGNLYKTIISVENLREADRKARKGKTHTYGVRVHDKNREANILALHEALLTKTFKTS PYDVFTIFEPKERLIFRLPYYPDRIVHHAIMNVLEPIWVKTFTHNTYSCVKKRGIEGCAH QVDKIIKEFEGKPLYCLKIDIKKYYPSISHNVMKRLIRRKIKDADLLWLLDEIIDSAQGL PIGNYLSQYLANLYLCYFMHWVNECLPELVRKALNLKEKPYIKAIEYADDIPFLAESKDV LHQVFKFIKEYIEEELELSIKGNYQIFPIAKNRYDKHGRALDYVGYLFFRKQKLIRKSIK KNFCHTVSRLNRRKPPLDAKAYKQAVAPWLGWAKHSDSKHLLKTIIKPCYYDSIL >SRS013800.181616-T1-C VLVGKPKTPYDKKQINDNKMKRIGNLFDKIANMDNLILADMKARRGKKDSYGIRLFDKDK EGNLSRLLKSLLDGTFKTSKYRTDTIYEPKERIIFKLPYYPDRILHHAIMNVMEPIWVSV FTADTTSCIKGRGITEAYKRTRRALSDRESVYCLKVDIRKFYPSIDHEVLKGIARKKIKD DRLLMLLDEIIDSAPGVPIGNYLSQYLANLYLAYLDHEIKEIIDIRHYIRYADDMTFFHH DKCFLRNVLLPWLIDRLAVLKLELKGNYQIFKIAERRSDKSGRGVDFVGFVFYKEHIRIR KRTKQNLCRAAARLNKVPNISLTEYKAGLAGWLGWIYDSDSKHLAKKILKPEFYEAIMER HNAA >SRS013951.8662-T1-C MAKTIRHEFDKYLTYDNLMKAHLLSRKGKNYKKEVILFNLKQEEYIRWLYEQLKNGTYKH GGYRIFYIQYPKRRKIEASRYMDRIVHRWIVDSFLNRYFVNQFINTSYACIKNRGMHKAS MDVQNTMKHCKRIWQNYYILKMDIRKYFQNIDKDILMNILKRKVKEEKLVELLEKIVYSN SGKKGLPIGNYTSQIFANIYLNEIDQYIKHELKVKYYFRYMDDSILFVKTKKEAIELLEK IKNYLKIKLELELNDKTQIFKSDQGVNFCGYKINEYRLKLRDRGKKAIKQKVKYLKKEVQ KGNMSSKEASRFICGHLGYMKYANTRNLEEKLFYIN >SRS013951.13360-T1-C MKTVKGLHEKMHTFDNANTSFRKAAKCKRYSKEVLAFSMSKEEELLRACEEVKNLTYTQG AYTIFKVWEPKERLIMALPFYDRVVQHMIVNIIGPVFEQGFYYHSYACREGKGMHAASEQ LSQWMYELMIKEGLRLYGFKGDIHKYFASIPHNKLKEENRRYIGDKKALYLMDGIIDKNG ILPDGVGIPVGNLTSQLFANVYGNKPDKFVKHTLHAKYYIRYMDDFIILSADLEQLKEWR KRIEEFLEKEMELQINPKSTILYAGNGIDFCGYIHHPEYKKVRKASVRKLKKNVKQLEAG ELERVEFEKKYQSRLGHMGHADTYHLTKAIEYELLFWEWEQTESGIAIPA >SRS013951.80701-T1-C MNNYYDANALFEAGTKSIKGSRWKYSVQLFEINQLLETAKLQKTLMEGKYRPSVGSKFVI KERGKPRYISSATMQDKTVNHIVCDKILTPHLHKYLQYDNSASQKGKGVAFHRKRFETHL HQYFNETGSNEGYILLWDYSGYYANIPHEKCLDTINGFLDREPIDPQERQLTKEIMANTL QSFEMDVSRFSDKEVTEMYRQKVDPLLNAGVPAAQLTGEKWLRKGVDIGNQQSQDIGIVY PYRVDNFCKIVCGFRHFGRYTDDSYIIHRSKEKLLQAFEGIKKIAAEYGLIINERKTRIC KLSDTYRHLQIQYSLTASGRLIRKINPKAVTRERRKLKAYKRLLDAGRMVYREIEESFKS WIASAYKYMSLQQIQGLSRLFYDLFGKAPTWKKTHGSSHGRLRWMMALPSAA >SRS013951.80751-T1-C MVEIFATGRSLVDLYGSDGSEAKKEMNHPVMPKRVGYLYDKMLDKDLIRVVILDSARHKY RRREVRKVLTHLEKYVERTYKILSEGSYVPTRPKLKTIYDNSSQKQRELKIVPFWPDGIV HRLLVEAMKPVLMRGMHPYSCASIPGRGGARIRKYLAHAMSCDPKGTRYACEMDIRHFYP SVPIRRLIRALGRKIKDKRFLRLIWAILKSCGQGLAIGYYICQWLANFYLEKLDWMLARM PGVKYYTRYMDNITMLGPNKRMLHRARVAAEKFLRDELGLAVKENWQVYRTAVARKAKGR PRAVSAVGFRFWHGFTTLRRRNFLRMLRQARRIQKKQKMGIPVSVQQAAGFLSRAGQLNH CNSFRVKEKYIRSIKIKRLKEVIRNESKRQCKAGWALYGRTPPATA >SRS013951.103515-T1-C LKRIGYLHEQVYDIENIEIADDKARKNKSIRWGIVKHDRNRQNENERLSEQLRDLVYETS EYSTFKVYEPKERLIFRLPYYPDRITHHAIMNVMEPIWTKIFIKQTYSCIKNRGIHNVAH DLKTALIEHPEETTYCLKMDVRKFYPSVNHDILCDIIKKKIKDKYLLTLLIGIIYSADGV PIGNYLSQFFANLYLAYFDHWVKEELKCKFYFRYADDIVILSSDKNFLRNVLIAIKMYLK EVLNLRLKSNYQIFPVDDRGVDFVGYRFYHTHVLLRKSIKIRLFKLIRRYQSGKIDKQEL RRRMQSYFGWLKFCNSKNLLRKIQRDTGLRFSNWDGEEI >SRS013951.103745-T1-C MENIAAEDNIRLAILNVNAAHSKRGVCRWVERTLDERVADLRRMVQDPCWTANPPRKFAF YDKSAGKWRDEVCEPPIWPDQYIHHMIVQALEPVLMRGMDYWCCGSIPGRGISHGMRGIK RWLREDKKGTRYAAELDIKSFYKSIKPKYVIRWMATKIKDKRALRLIWAVIKDGIKIGYY ISQWMANAMLQPLDHLIRERTEVSHNLRYIDNITLFASSKRKLHRAVKAISDWLGGVELQ LKDNWQVYKVDTRMVTALGYRYDHEKTLLRKRNLLRLKRQLARAYKRLDRGRPIAVSMAA GLLSRIGQLKHCDAQNLRRRLVRPGFVKILKAVVRKHSRERSLKQWNRSSSTISA >SRS013951.109193-T1-C MGMNAGSSISESRNALKGDNMSIKNVYAQIVSFDNLLQAEKDARAGKRYENEQLAFWGNL EDNIHSIAEKLKCHNYPPDIYHHFYVYEPKLRKVIFSDYTTKVIQRAAYNVLNPIVCKGM ISDTYSCIEDRGQLKSMQRLAGWVDFVEKSGERWYYLKMDVEKFFYRMDHEVLMSIIRKK IGDKEAVRFLEHYVCHASRAFGLPLGVKSPLEISDKEMLWDVGIAIGGGLSHMYGNMYLN PMDQMAKRKEGIQYYIRYMDDVIILSTDKELLHRYKNMFSDFLGDVLKLRLNNKTAIRPV SHGMEFVGYTIRPFDVRLRKSTSLRMKRHLKTIQELYRDYEIDLDRARSTLMSYKALMDH CDCRALEKKIFEDFVLTHNPKEADTDNG >SRS013951.114321-T1-C MDKDVICSYENLYKAYRKAKTGKGFNGSCAKFQTMNLEGLHLLKEQLENQTYRLNPYNEF KVYEPKERVIKSCSFKDKIVQHCLCDEILHPALRDRFISTNYAGQQGKGTHFGMDCLKEQ MIDFYEEHRLDGWILKCDIKKFFYQIDHEILKRIVDYHFPDEYTKWLNHLFIESTEGLGL PLGNQVAQVYALLMLDGMDHFITETLGISRYGRYMDDFYLIYTDKEYLKFCLEQIRDFLG SLGLELNGKTQIVPFKCGIAFTGFHHYVTLDGKYIRKLSGTNKRKMRKKIRKWSELVRSG KMKEEKFFEKYNAWKNHALHGNCIKLCHSMDLYVKELLEE >SRS013951.126952-T1-C MLTEIRNCNIPKMKRYDHLFEKICDIENLRKAHKNAKKGKGWYKEVQEIDKDPDKYLEQI QEMLINHTYRTSEYEVFYKDDGRKKRKIYKLPYFPDRICQWAILQVIEPCIINNLTTDTY SAIPDRGIHKALHKMQDAMWNHPEECKYCLKLDARHYYQSINHDLLKEKYSRMFNDSELV WLLTEIIDSIQTADIEDLTAIYLLEEDVDPETGIPIGNYLSQYSGNFYFSPFDHWIKEQK HIKYYFRYMDDIVIFAKTKEELVELRKEIDVYFRDELKLNIKGNWQVFPTFVRGVDFLGY RTFCKYTLLRKTTCIDMTKKLTALRVKVESGNMMNYSEWCSLNSYKGWLISCDSFRLYQK YIEPLLPYADDYYKYNIKPKSKKGQKAA >SRS014287.83344-T1-C MKHCQQEMTVIQNAWPVVCNFGWLIEADRNARKGKRYRAEVLNFTARLEDNLFVIQQGMM NGSYVLGPYRKLWVYVPKKRLVMALDYPDRIAQWSLYLYLNPIYDRLFIEDSYACRKGKG SHKAAKRLQYWMCQVQRKPGPGWYCLKLDISKYFYRVSHEKLLAILERRVKDPAMMAFIR GVVNSRAEPFGLPRWRTPQDTPPEEWLYEVGMPIGNLTSQLFANIYLNELDQYCKHKLKI HYYIRYMDDVIILGQDKETLHRWKAAVETFLREELALDLNSKTSIRPVRQGVEFVGVRIW PTHMKLRKSTVRRIKREVRKISALYAAGDMTRQDFYRRVASIRGLLKHTESASLRWRLNE IYRAELEKAKQKQLREEAQHEPFADHSGVGNGDGNAGTGYQDHGNTACRAG >SRS014459.15880-T1-C MKRYGYLIEKIVEESNLLEAFSMVMRGKKRTRTVRLFKKNRDKILADLADEIKSDKYAPE GFREFEVVENGKVREIQSLPFKDRIALHAIMAVLYREVLGGMMIRDTYASLPKRGIHDGL NRLRKALKDRSNTEYCLKLDLKKFYHSIDQDVLIELLRRKIKDETLMQTLIRIIRSYGPG LAIGYHSSQLLGNFYLCLLDHYMKAELGVKYYFRYCDDIVVLGPNKAYLHDIFDKLRSLV ENKLHLTIKQNWQIFPVEARGIDFLGYVTRHDYVLVRKHIKQKVARRLHKVKSKKRKYVV LASFWGWAKHCNSKHLFFKLTNMKSFKDLGVTYKPADGKKRFEGNLTPLGNLQNCKVTIV DFETDIKTKQGEGRYVVQYELDGQKGKFITASEEMKNILDQIKEMGELPFETVIKRETFG GNKTKYVFS >SRS014459.27739-T1-C MGNRAVNKYLDDIRRTLLAWYRIGREPVFHACYAMRLGLPVKLLIQGCRTRYHFDNRLLP ESGGNLIMTSEERKEARFQRRKAKREAKRERVLEEHGDYYKVISRNALSKSAIEAAKGVS YKASVKRYMLRRLTNVAATNKKLTYCEDIHKGFICFGLNERGKHRDIMSVHFSERVPQKS LNHNALVPVLTRSLIHDNGASQKGKGTSFAMKRLVTHLQRHYRHHGTEGYVLLFDFKNYF GNIDHDIAKQIIRRAFDDDKIVWLTNRFIDSYYEHYLKMAIKKGENPDTVEHKGLGLGSE DNQTIAVSYPNRLDHYIKEVLQVHEYARYMDDGYLIHESKEYLEYCLQEIRRICAELKIE LNEKKTRIVKLSHGFTFLKTQIYLTYTGKILRKPCRKAVVRQRRKLVRQYRKFLAGELSF EDIRCSYASWRGCMEKKQARRTIHSMNRLFDRLFIENWQREEVPLYG >SRS014459.31154-T1-C MKEDTLLLDVFHAYYDARRHKRNTHSQLEFEFNLEENLVKLYEELRDHTYKVGRSVCFIT GTSVKREVFAAHFRDRVVHHLLYNYTAPIFECTFIADSYSCREGMGTLYGVRRFEHHLRS CSNNWTRTCYVLKLDIRGYFMHIVRQKLYNQVMETLRRYAGRKNSRGQYWADALDYSLLE YLMREIIFNDPTLDCEMRGTSKDWEGLPKDKSLFHVPEGRGLPIGNLTSQLFSNVYLNVL DQFVKRTLGEKYYGRYVDDFFIIGTDRKRLSQLIPQLRSFLEKELELTLHPNKVFLQQAH KGSAFLGIFVKPHRRYLLQKIKSRISNRMERMNRCLSAKKVNRDKLLYISCVANSYMGYM RHMACHRFKNLLVERNTSFHRIGNFKGEKLIFVPFIKGIVPLMK >SRS014459.31897-T1-C MEDTQKQDKLPPIKYTKRVGHLFEHVRDLDNLKEAIKDAARHKRKRKEVQKVLEDIDGHA LELQRMLDEETFIPAKYTMRRINDGIQKKTRDIAIPRFWPDQCVHHAFVRIFKQIVLHSA YPFSCGCVPGKGTHGAKTAIEKWIRKDPKHTKYVLKLDVRKCYPTMNHEELRKKLQRRIK DKKFLRLADRIIASFQQPMATHERLLPETDAVGIPVGLFTSPWFCNFFFQDIDHKVAEKT GAAHNVRYVDDMVLFDSSKRRLHKALEFIEAEVKATKQTVKDNWQVFILSKRPLDFLGFK FHPNKTTIRKSIMLRISRKARTIARAAYASIRNAHAMVSYIGYIVNSDSQRFYEKWVRPF VNIKHLKGVIADEDRKQHQACVAV >SRS014459.43572-T1-C MPQVKRLKAATHRLLKENRNMKIKNVFDEIFSNDNLYAALEDASQGRRYNKDALVYNLDA WAMVQEIRNEIFNGTYSIDRYYIFYVYEPKKRMIMSISFKHRIVQWAIYRVINPVLVKGY IEDSYGCIPGRGSLSAMQRLKYWVVMASRKEEQWFYLKLDISKYFYRISHRVLKKILAKK IKDKRLLKVLYSIIDCEHTPFGLPLGRSPGDVPLEERLFDVGMPIGNLLSQLFANVYLNE LDQYCKRELQIRFYIRYMDDVIILCNSKLQLRIWKDQIEQFLLRELELHLNKKTCIRPIG QGIEFVGYRIWANRVVIRKSTTLRIRRALRGMAAKYTDYKITMQDFSETLQSYLGMLEHC DSDALINKILDEIVLTHNKENQEEGDEPGGIFGSAEYDHRESEIYHF >SRS014459.69025-T1-C MNSQERREGRYQRRKEKRQKKREDRCAALGSLAEVFSYRKMFFYGRKCCNGVRWKQSAQN FEMHLFSGTARRRREILNGTWKPKKCVHFTLCERGKVRPIDAPHITDRQVHKTLCNEVLI PLYTPSMIHDNGASQIGKGLHWHFDRVKQHLTWHYRRYGREGAVLQIDLKSYFPNAPHNL IYQRHEQLIPSPDLRELADKIIYFSPCTTPGRGMPLGVEPSQQEMVALPSSIDNWMLCQA KAEYGAHFMDDYLLVFPTIEEAKRMGHEIVRRFESMGIRVNKKKCKVTPLTKPFRFCKAK FTLTESGKIRVNGSRDGVKRARRKLKLFYREFKEEKRDFKSIEQYMECQSAYYRSFDDHG RLLRLRRLYHAIFFGGATECSKSSKAGLNSA >SRS014459.126085-T1-C MDDKHITPRLAIPTWMLRLPGARRPVLGVMAGLELALYPDQRKQQQQRSEFMTYQEMCSF ETLYAAYLEARKRKRSKPGMAQYEQNVLACTEKLSTILHTKTYVPSRFEVFYVYEPKKRL VQAPAFVDKVVLHAVVDNILYEAITKSFIRDNFASQTGKGTNDGLMRLKQHMVDYYRREA RGTDGWILKGDVHHFFASIDHDKLKRKLKALLDKRGVDPQIYDLLCVYINTTDGLPLGYQ TSQLLALMFLDEFDHLMKEKYRLKYYGRYMDDFYVILSDKQRLKEILKDIRALMDGWGLE LNQKTGIFPLRNGIDFLGFHSYITESGGIIQKLRRDSIQRIRAKVKFWEEAYKRGEVTKD AILQSFGAWDAHAAYGDTHELRAKYAKKVEAIIGEPVEIHRKLNGNRAVRDKRRLRQCRN LYKKQHQNRETDQSGSFSYAQRPTDVPPWADS >SRS014459.148165-T1-C MITEPIKFTKRIGHLFERVVDLDNIKLAIRNAAKRKSDRPSVRRILLNVDKYAKKLQEIL ITESWVPHAYHIREINDGIKKKKRIIAVPRFFPDQCIHHAFVLVFKEVVEHGSYEHSCGC VPGKGTDGARKVIKRWVVNDPKGTSKVAVLDVKQCYPTLPHEQLRLKLEKRIKDRKFLRL AFKIIASYQQAMANKTQLLPETIAVGIPVGLYTSPWFLNFFFQDLDHLIAEKCGLSHLVR YVDDMVLFDNSKKRLHAAIRTVGDYLQHMQMRLKSTWQVYHLRIRPLDFLGFKFHANEKI TLRKSILYRISRKARTIARKGYASVTNASGMISYKGYMDHSDSTGFYEKWVQPFINFKVL KGVVSNENRKQCKAICAA >SRS014459.165344-T1-C LPLSRCELEQWRQRWRFQRQPEQPALEFQHQHRGPFRFSPTAHGADLHLIRGPPPQGDGR CARPKGARFYSWADSPGENLNCCGDGNATRSAGEYKTFMDYMTDGKAVEIRTEDGRVVVG PGALELISSWEWLQEANRNARRGKRTRPSIMQYQDDLEHNLIQTGEEMRAGTYRTGPYRR LWVYIPKRRLVMALDYRDRVVQWSVYQLLYPYFDRRMIEDSYACRRGKGSHKAVARLQYW LRQIDRKPNGKEWYYLKIDVSKYFYRVDHDVLLRILRRHIADPGLLDLLAGIINNPDEPF GLPPGMKPEDADFEAWLYDVGMPIGNLPSQLFGNVVLNELDQFVKHRLKARKYERYMDDG LFLSDSKETLNAWKQAVGDYLRQELHLDLNDKTAIRPVTMGIEFVGRRVWATHSRLRKST VRRLKNEVHGICRQRAAGTLSKAGFERRCASIRGMLDSCECASLRWRLNEIYLNILGGEQ QNDPALPFG >SRS014459.284627-T1-C MGKNKGIFIMDKEIVCNYENLYKAYKKAKVGKGFNGSSARFQAMSLEGLHMLKEQLENQT YRVNPYNEFKVYEPKERVIKSCSFKDKVVQHCLCDNVLHPQLSGEFIRTNYAGQTGKGTH FGMDCLKEQMLEFYNQHGLDGWILKCDITKFFYQINHDILKDIVDYYFNDEYTVWLNHLY IDSTDGLGLPLGNQVAQVYALLMLNGLDHFVTGELGVNLYGRYMDDFYLIAPSKDYLKWC LECIRRFVESLGLSLNGKTQIIPFKSGILFTGFHHYATKDGKCIRKLTSTNKRRIRKRLR KWCELVKTGRMTEKKFYERYNAWKNHALHGNCIKLCRSMDLYVKELLEREE >SRS014613.6844-T1-C MNKVKNQQLELFSPNDAESGNTNQLSPVPDAELFTGECNTDDCSFFLPAAGYRNSNGTMN NVGKNGNYWSRTPNGNNGYNLNFNNNGNININNNNRNNGQSVRCVKAFTEGAVPSLPAFT IDSQQLLVDLFKAYYDARKHKRNTRNQLRFEVNLEENLVNLRDELMERTYKVGRSTCFII EDHVKREIFAADFRDRVVHHLVYNYIMPIFERTFITDSYSCRKGKGTLYGVERLEHHIRS CSHNYTDLAYALKMDIQGYFMNINRKHLLETVKEDLMKYSFRESDTGQCWKDKLDYSLVF FLLEEIILTDPTNNCIIKGKKSDWKGLPDNKSLFKTPSDCGLPIGNLTSQLFSNIYLNRL DQMVKRQLREKHYGRYVDDSYIINRCYATLRMHKETIRRYLHEELGLTLHPKKSKIVRCC YGIDFLGVFVKPHRRYINNRTKKRIFRKSVPLFNCTDTEKLRAGINSYLGYMKHFKCGKI KERLFGGKPQLELLGEFIGHYSKFSLPQPELKRTGS >SRS014613.8832-T1-C MVNTKHDTHSYESTSNGERGYQREIFDGNVLYESFIRAKQGSDWKPKVQQFEMNFLFELA DSQTELASGDYKFLPNTEFTIHERGKERRITGEQIRDRVSKHALCDEILTPAVQKYLIYD NSASQVGKGIDFARKRLLTHLRKYYSQHKSNDGYILLIDFSKYYDNIQHDRLMEQFEKYI HDPNALNFLRKVIDRSKVDVSYMTDEEYAVCMDTLFNSLEYEKVDKTLRTGERYMYKHLN IGDQVAQVAGIIYPIPIDNYVKIVRGVKFYGRYMDDSYAIHESKEFLEDLLQGIIAIANE LGITVNTRKTRICKLSSMWRFLQVQYSLTDTGRVIQKINPKRLTAMRRKMKKLVYKLSEK EFNDWFNAWMCNHYKIMSKQQRENMNTLYAQLKKEVYHNVHNHPG >SRS014613.37984-T1-C MTKRRGFLIEQIADMDNLREADRDAQDGKVKKNRFIRRHNEHAKDDLEALREMILTLNFP DPNFSIMTVVSDAGKRRDIAKQSYFPWRILHHAIMRVIGEELYKSLILDTSACIKGKGLH FGVRRMKMFLRRYPEYKWFVKTDFKKFYQSIPHEVLLNALRRKFKDEKFIKLIEIALLSY DSGEELIEILENEELRKKRCTDWSIHKPTTRKFRCKPDRPQVQRTV >SRS014683.32645-T1-C MKQYKYLYHKMLDENVIRQAYKKLRKGKTKRKEIIAIDANLDEEVVTMRRMIENTKPPDV EVEHPELSYRPCKRTPKYIYEHGKQRRIYMPEIHEQWLHHIIVLILEPIITATSYPYSCG SFPKRGAHYGKKRLLKWIRDGTNIKYFGKIDIRHFYDNIRIDTLMRELAIRISDTWFLYV IRICMAGFSKGIPLGFYVSQWLANYLLEPLDEFIRKLGFEKYMRYMDDMVFFGTAKKKIH AAITQIRMFIGRRYRLKLKHNYQVCRFYYESKRRAVGRPLDFMGFLFYRNKTIMRKSIML EAVRTAKKLNAAKSAGRGFYIRHVKSMLSHMGWFSCTNTYDCYNIHIKPLVNITKLKKIV SKLDRRKNHYEAVERRTLQYAA >SRS014683.44938-T1-C MGGVLSILSCGIETAYEERLASYRKSDRRPTVCMNFSEICTFAVLYAAYLAARRGKRSRA ATAHYEVHLLENIVNLVYILQTKIYRPGVFRVFYVYEPKKRLVQAPAFVDKVVQHALVDN LLYERITRSFILDNYASQKNKGMHFGLDRLKYFLTDYYRKQHTAEGWILKADVRHFFASI DHDKLKEKLKRLELEPVVYDLLCIYIDCSDGLPLGYQTSQLFALLYLDEFDHFVKERLHI RYYGRYMDDFFLICPDKAYLQYCLKEIRAFMATLGLELNEKTQIFPLRNGIDFLGFHTYL TESGKVIRKLRHSSIKRMRAKLRYWEKAYPQGLVTREAILQSWQAWDAHAAHGNTWALRQ QIKARVQNILKEDF >SRS014683.82758-T1-C VQEDEIIGFDALYTSMGKCSKGVRRKAAVGRYCLFGMDEILRLHQELVTGTYRARPTSKV KITYPKPRVAVATSFRDRVYQRSLNDNAVYPAMSKGFIRHNAACQTGKGTDWARQQVKLM IEREYRQHGPDGWCLLVDIRHYYDTMPHEVANQRFERKLPYNVYARVRDVLDRQYTGEAG YSPGSQMVQLAGISVPDPIDHYIKERLRADKYVRFMDDSWICHHSREQLVEWREAIRARY ATEGMELHPTKTKIVRLRDGFRFLGFIYRLTPEGKVIMTVDPQNVKAERKRLYRLAQLIK AGEKPVTALREQYKSWKAHAAKGNSKQLLQRMDKYVKSLLEGIT >SRS014923.5295-T1-C MIPHIMRSMDRYCIASVPGRGNSYGVKALKKWMKNDVEGTKYCCECDIYHCFEELDPPYV IEALKRVFKDTETLWLCDAIMEYGVLIGAFFSAWFLHLTLQPLDLMIHQKQYGVSHYLRQ MDNFTIFGSNKRKLRRLLEDIKKWLAEIGMKIKGNWQIFRVGFTPKVERAHQALPKKKQR HRRPRLPSALGYRFGHGYTILRKHNLFRLKQSLHLYYYRRDRNRVISFKRASGLISRLGQ LRKCNHQQVLDRHYQPKTMFALKKVVRKECRRLQALYPPYQAA >SRS014923.6816-T1-C MTSEERKEARYQRRKASRQRRREERLKEYDDFDRVKDANNLITAFKKSKSGVDWKASVQR YEMNLLRNINNTVKALESGENVSQGFIVFWLCERGKLRLIKSVHIWERVIQRSLCTNALV PVLQTGLIYDNGASMEGKGIHFALNRLDAHLHRFYRRNGFSNDGYILVIDFSKYFDNILH EPVYQDLQKNFTDERIINLAAQLIRPFCADRKDGKEISLGIGSQISQILAVRFPNGIDHF IEQELDVHEHGRYMDDSYLIHESKEYLQYCLEALKEKFAEKGIIVNTKKTQIIKLSSWFT FLKFRYKLTETGRVVVKPCKDSVTRERRKLHKLKPKFDNGELNFGDLRTQYASWRGYIEY ADSYRTICNMDNIFNQLYVPAGEWR >SRS014923.46917-T1-C MVERGREDRKRIRMRTYRNLYAEFISDDNIKLAIQNFSKGKKRRNKVRKILADLDTYIPK IREYAINFTPFEHKPKEIYDGISRKKRKIVIPTVMESIVHHMIVNVLKPMFNKGMYEHSY GSVPKRGGAYGKKCICKWIKQGGKNIKYCYKLDVKQFYASIPQDKLIEKLKSKIKDFRFI RIVENVIHCVPNGLPLGFYTSVWFANWYLSELDHEIKSLGIELKYARYVDDMAIFCASKK KLRNVKAVIDNSLAELGLTVKGNWQIFRFHYLSQNPYVSKNGKTATYGRPLDFMGYKFYR NRTTLRKTILKKIRAKAVRIWRKTKVTIFDSKQMVSALAWIKNCDMYDYYREHIKPFIDF GKLKHKISTVDRKARCIEYDRIQARRKYAIGQTA >SRS014923.95762-T1-C MTSEERHELRYQRRCQRRQAKRLARSIACGSFEEAFSFSNLFQAGQTCCKNVNWKCSTQR YRMNIISNTAKTHAQLMAGTYKSRGFYEFDIYDRGKWRHIRSVHITERAVQRNLCDQVIT KVFQPAFIYDNAASIKGKGIDFAMDRLNCHLQRHFRKHGLKGGILVFDFKDYFGSAQHWT VKNELARRVHDPKTRKLANDFLENFGPVGYGLSSQISQNAALMLPNKLDHIIKEELQIKG YGRYMDDGYLIHEDIHYLEYCLERIKEVCAELGITLNLRKTKIRPITRGIVFLKTKFILT ETGRVLRKMSRASMRAMKRKLFKFRKWYETGEFSLEDIRTAYDSFKGHMRRGDSFKAVAR IDLFFKHLFGFHPNDKTKWRATNVPNRKRWDYSGADRATELCRAA >SRS014923.97431-T1-C MPKTLKNKYDKKLTYEKLMEAHIKSRKGKGYRKEIIEFNLKQEEYIMWLLGELQTQKYKH GGYTVFYVTEPKLRKIEKSRYIDRIVHRWIVDNFLEPIFVPQFINTSFACLKEKGMHRAC IYVKNTMKHCKKIWKNYYILKMDIAKYFDNIDKEILLKILERKIKDDKLMWLIKEILYAK KREKGLEIRKLYITNVCKYIFKRNRPICKT >SRS014923.126404-T1-C LETKSYQISPYNQFKIYEPKERIIKSCSFKDKIVQHSMCDNVLLPKLKSEFIQTNYAGQK NKGTLYGLDCLSAQMQLAYYKYGYNCWIVKGDIRKYFYSINHAILKDIVRFFIEDNDLYW LCEKFIDSTNEEVGLPLGNQISQVFALLYLSGLDRFVTGELGVKYYGRYMDDFYLIVESK QYAKQCLNCLYDFIDTLNLELNGKTQIIPFKNGIDFCGFHTYVTKDGKVIRKLRNENKRA AKRRYVKMAKLVVENKMKREDFDESYSSWRQHALHGNCKKFVNKMDMKIYQILEGENDGR IHVFKEEAV >SRS014979.2734-T1-C MKRFGNLYYRICDIDNLYLAYTKARKGKGNTYGVIQFEKELDDNINALHEELSEGKYVTS EYQTFIIHDPKKREIYRLPFRDRVVHHAIMNILEDIWTPIFISHTYSCIKGRGIHGVMKH LKKDLKDIQNTKYCLKMDIRKYYPSIDHLILKNIVRKKIKDKRLLELLDGIIDSAPGIPI GNYLSQFFANLYLSYFDHWLKEERRIKYYYRYADDMVILASNKEKLHSLLGDVKSYLHNN LHLDLKDNYQIFPVDNRGIDFVGYVFFHTHILMRKSIKKNFCRKVAKLNKKKTVPHDYKM AICSWIGWAKHCNSKNLIKTIIKNEKVL >SRS014979.20320-T1-C MKRIGNLFNRIISYENLVRAEKKARLGKTKRYGVKKFDRNPYENLVRLQKALIEDTYRTS EYCVYTIIADRGNKEREIYRLPYYSDRIVHHAIMNVIEPYLVSRFTADTFNCLKGRGIHY GVKRLKRDLKADKEGTKYCLKLDIKKFFPSIDQDVLYSQFEKVFKDKKLLRLLHHVVYST PKGLPIGNYISQFAANLNLTWFDRWIKQVLKIKYYYRYCDDIVILHPDKDYLRYCLQEIE KYLADNLKLKVKRNWQIFPVEVRGIDFIGYVFYHGHTLLRKDIKKKFIHKLSYKSNNKRL ASLAAYWGWCKYGSCHNLWYRFTRSYNFKDYRQKLLSDDGIKKSTG >SRS015133.21019-T1-C VANATEDTNQEVKLVKTYCKPADVNVEDLEFIRQQVHLCFIGKRSKGRFQKLLISTGKIT KAELKQEIQDQSCSKTLDAIDAVAEQAQADILARDVHFEPVRQFQLRENGKLRDICEESP KQQVFEYIAKGALDPLFRAKLLPIQYGSLPGKGQIKGKRQNERILRRALHHKTDAAKCDV RKAYPSTTVECVMTLLRRDIGKNKVLLWLVEAIMANYPDGVLLIGGYLPCWLFNYVMSYV LRYILSHRKVRRDKSLKMVLAITCYADDITVYGRISNLEKVMRDTTRWAKETLGLTIKSA WQIVHFASFVQERQQRNRRRKGSRQRTPGLDMMGYVVRRTYTIIRGRNFVRLRRAILRAQ RNLDTLGYVPWWRAQRILSQWGEIKHSDSRGFCTKYNVYKLIKAAKRSASWRGKQLHKLR LEAA >SRS015133.60320-T1-C MYDLPQMGKYKGVYEMDKDIITNFENLYRAYKKAKLGKSHNGSCARFQNMSLEGIHLLKE QLENQTYQIGKYSQFKIYEPKERVIMSCSFKDKVVQHCLCDNILHPRLQNVFIETNSAGQ VGKGTLFGMDKLKEQMLAFYREHRIDGWILKCDIAKFFYSINHEVLKDIVDYYFPNSYTT WLNHLFIDSTNGFGLPLGNQVAQVYALMMLDCIDHMITGELGIRYYGRYMDDFYLIHYDK SYLKYCLLYIEEMVSSLGLSLNGKTQICPFKNGIRYLGFHHYMTKDGKYIRRLNSENKRR AKKKVRNMLRLLKARKISEKEFQNKYGSWKNHASHGNTVKLVHSMDLHIKSEIEKG >SRS015133.81707-T1-C LENKFTDICTFEVLYKAYLAARRGKRSRAATAHYEVHLLENIVNLVYILTTKTYRPGVFR VFYVYEPKKRLVQAPAFVDKVVQHAVVDNILYERITNSFILDNYASQKNKGLHFGLDRLK GFLTDYWNKNHTADGWVLKCDVRHFFASIDHDRLKEKLKKLDLEPALYDLLCVYIDCSDG LPLGYQTSQLFALHYLDEFDHFVKEKLHIRYYGRYMDDFFLIHPDKEYLQYCLTEIQAFM ASLGLELNEKTQIFPLRHGMDFLGFHTYLTDSGKVIRKLRHSSVKKMRAKLRRWEKEYPA GLVTREEILQSWQAWDAHAAHGNTWTLRQQVRDRVQNILKEEI >SRS015190.55038-T1-C MKRHNNLFDKIVDKDNLLLAYKKAKKHKSWQQKVIRVEKDLDNLIEELRLSLINGTYKTA EYRTKKIYEPKERTIYVLPFYPDRIVHHAIMNILEPIWDNLFISDSYACRKNKGQHKGSQ KCMEFVRKNKYCLKCDISKFYPSINHEILKLIIRKKIKCKRTLNLLDTIIDSIDGETNIP IGNYLSQWFGNLYLNELDMFMKQDNKIKCYIRYCDDFLLFSNDKALLKEMAIKIKDYVEN ILKLRLSKCNLFPTSQGIDFLGYRHFSSGYILVRKTTAKRMKKRIRRLKWELAKKKITND RALSVIGSISGWLKWANTYNFQIFLQLNELKESIGGSVNEQI >SRS015190.76944-T1-C VANYCKAIRKRIRMISFNGVSEQLYIPEEQIKDIYNASKGKSKKEQAQIVKANVEHYRKE LDKRLKNNTFAPKRHKTKIIQENSCKKTRKIVKPQYMYEQMAHHSVMRVFVPIAMRGMYY HVYGSIPGKDVHRGKRTVERWIREDSRNCKYIYKLDIRHFFESVPHRRLKKALKRKIRDR ELLKKLFIIIDSHKPGLPLGYYPSQWFGNFYLQPLDHFIMEQLHVKHYIRYMDDMVIFGN NKKELHKARLQIEKFITEELGLQIKKNWQVFRFDYVDRKGKRRGRPLDFMGFKFYRDRTT IRKSILQGIRGKVNRVKRKEKITWVDAGSLLSRLGWIWHSDTYAYYERYIKPYVKVKVLK TLVSKHARKENIRNGMVRSRKHRRRKTERA >SRS015209.76130-T1-C MDDTVLLYDVFDAYYDARRNKRNTKSQLAFEMNLEHNLLQLYEELRTRTYKPSPCTCFIT FDPVQREIFASSFKDRVVHHLLFNNIAHLFEKTFIHDSYSCREERGTFMGIERFEHHLRS CTQNYKFNAYVLKLDIKGYFMSIPKLKLRELLRTTMDKNKKWEEWIDRGFIDYLIDSILM RDPTSNVCLIGDRKEWEGLPPSKSLFKSVAGTGLPIGDLTSQLFSNIYLNQLDQFAKREL GLKHYGRYVDDFYVIDTDRRKLLNLVPVFDHYLREKLSLTLHPKKVVLQHSSKGVLFLGA YIMPYSRYPRWRTIKKFRYKMKQIERFCEQRESLSTKQKVKIRSVINSYCGYLSHFATFR LRKLAVNRSAFYRHFFFYKDFKQSGIKGKSRHPAAYINKMAALRIEKYEAERIAHKGIII G >SRS015217.23574-T1-C MSIKNVYYEITSFENLLRADKNCASQHTDKWEIIEFRRNLEENLLNLRDRLRRLDIPPVR YRSFLVFDPKVRKVIYTDYTTKVIQRAIYDVLYEPIQRGFITDTYACVTDRGQHEAVRRL ASWFHEFNGRGQYAYYYKFDVRKFFYRIDHEVLMNIIKKKISDKYTVELMRYYMCSTQRP FGMPLDGNHLTITDDEMLWDKGIAIGGGLSHMIGNMYLDQLDQYAKRTLGIKKYIRFADD IIITDTDKGKLKEYGKLLTQFLNEKLLLEFNDRCALRPNRCGCEFVGCVIYPDHVLLRKS TTLRMKKNLRRVAEKYKKYEVSFDYCKQVAASYAGMLEHVDGNRFKDKLWEDFVLTHNME E >SRS015217.39220-T1-C LYGEILTTGGKRELHKIKNIFPIIYDFENLFNAYKAGIKCKRYRPDVMAYTDKLEENLIE LQNEFIWQTYTVGRYNIFYVYEPKKRMIMSLQFKDRVAQHAIYSQLNPHFEKQFINDSYA CRVGKGTHKAVNRLHNWLKQTDRKPQRFYYLKLDIAKYFYRIDHEVLMDILRKKIADEDL LHVLSVIINCEDTNFGLPLGADIGDVAFDELLGEVGLPIGNLTSQMFANLYLNELDQFCK HKLHLRYYIRYMDDIIILHPDKKYLEKIKNKIADFLGKELRLQLNKKTCIRPTSMGIEFV GFRIWSTHIKLRKKTAKKLKRRLKYMFAAYHAGEIDKDTLDRSVASYRGILQHFNSYGMR QSLNELYLQEMGKPYPEPEKKPASKCGLFCGYYGSADDYIKQPEEQEVTDSGSNTDA >SRS015217.244270-T1-C MKRIGNLYQKIISVENLRLADEKARRGKTRTYGVRVHDKNREANILALHEALLTKTFKTS PYDVFTIYEPKERLIFRLPYYPDRIVHHAIMNVLEPIWVKTFTYNTYSCVKGRGIEGCAR QVDKIIRDFNGKPLYCLKIDIKKYYPSISHRVTKRLVRRKIKDTDLLWLLDEIIDSAEGL PIGNYLSQYLANLYLCYFMHWVNEKLPELVRQALNLKERPHIACVEYADDIPFLAESKEV LHEVFKFIKDYLENDLELTIKGNYQIFPIAKNRQDRHGRALDYVGYQFFREQKLIRKSIK KNFCHAVSRLNRRQPPLDAKAYKQAVAPWLGWAKHSNSKHLLKTIIKPSYYGSIL >SRS015264.4826-T1-C LRTTTSAAASxxxxQYRCQLFGVTCDNWNFNTSNPCLYVGGNYNQNTNHGLFYVNYNSTS NANDNIGCRTLLCVLLTILYHGTGSRAPLGEDKQFRERVSTLRKERWKARKAKRRKYVPV KRAKNLFEPLISDENLSLAIDEVNRTHHWRTHHRPNRCTAWVEETKSERIEELRRIIIEG FEPKPPHVTQRWDVSARKWRTVSEPAQWRETLGHHALIQILQPVFMRGMDHYCCGSIRGR GPHHARAAIEGWMRKDPKGTRYELCGDIYHFYDSLEADVVMDRMRHLIKDHRVLDLIWRI VKDGILIGSYTSQWFANTVLQPLDMMIRESGLSDHNVRYMDNLTIFGSNKRKLKKLRVLI EKWLVAHKLRLKDDWQIFPVARTNPKMPLDPPRNGFKRQKSRLPDAVGYRYGRGFTIPRK HNLLRIKRAIAKYRKRKRKGKRILAGAAASLISRLGQLKHCNNFNLYRYLYKGEHIVREL KKIIRNQKRKEELTWNMYLERRKTLRSLKSKATPILT >SRS015264.151305-T1-C MRRDGYIIEEIVATDNMKASFRSVLRGTDRKRSRVGRYLLAHEDEVIAELQQRIADGSYR VGGYREMIVMEAGKRRTIQVIPLKDRIAVNAVMRVVDEHLHRRFIRTTAASIRNRGMHDL MEYIRRDIAQDPDGTRYCYTFDIRKFYESVDQQVAIAAVRRVFKDERLLTILDGFIHMMP HGLSMGLRSSQGLANLILSIYLDHELKDRLGVRYYYRYCDDGRVLAASKAELWAVRDAVH RCVEAIGLEVKPNDRITPVEEGIDFLGYVIYPDHVRLRKRNKQTFARKMSEVESRRRRRE LTASFYGMAKHADCRRLFSKLTGIDMKNFKDLGVTYTPADGKKRFKGATISIRELVNLPI VVHDFETGIKTEQGEDRCLVQIEMNGEMRKFFTNSEEMKNILQQIREMPDGFPFETTIKA EQFGKNKTKYVFT >SRS015578.59286-T1-C MTERIGGCNYMKRYGHLYKKIYDMENLKLAHQHAKKGKGWYAEVQMIDSDPDKYLKELQD MLINKTYHTSEYEVFYKNEHGKTRKIYKLPYFPDRVAQWAILQVIEPYLIKHLISDTFSA IPDRGIHKGLSRVKKAVQHDVPNCQYCLKIDARHYYQSVNHDILKQKYRKMFKDNDLLWI LDEIIDSINTAEDEDLVSIYLLDEDIDPNTGIPIGNYLSQYSGNYYFSDFDHWMKEVKHV KYYFRYMDDIVILARTKEELHQLLKEINEYFHNNMKLEIKKNYQVFPTYVRGIDYLGYRV FVSYVLLRKQTCKDMKKKMVKIRKKVESGNMMNYSEWCSINSYKGWTDYGNCFRLTQKYV EPLISYATKYYELNVKKGGKVA >SRS015578.113668-T1-C MKRYCKNIDITDRNLISKATYKCLKDKYTRNDTLELLSGISGLRKYQIYNIHYRYGRKAL KVFIEFLIDTIRSELISKSISFPPIWYKEKIDPSSHKIRNIGIQHVKQQIYDYIAIEGLK PLLCRIGVHQYASIKDRGCLKGSRIIQRWMRNKSLKYFSKLDIRKCYPSIPQDKLIQFLE KHIKNDMLMWLIKELVNSFEQGLSIGSFLSQYLCNLYLSQIYHFIGHLHKVRKHKDGTKS SICLVYHRLFYMDDILMIGTSAKDMHKAVKEVIKYCKSLGLKIKESWFVKQMPFANKKCD GAFIDMMGFRIYRTHITVRRRVFKRIRRIAMRLWKRIKTHHKIFESHARKIISYWGLLKN SNSTKVIQKYHIKDIMKICKKVVKEYDKISLYGKAAFC >SRS015578.240618-T1-C MEYTGLPYQFLMDMAQESRYMLDSIVETVIDGIRQEILEQKYIVKKIRYRIKKDSCTGKL RNIGIQDIKQQIYDYIAVYGLKELFEKKLGFYQCGAVKRKGNEFGAGAIKKWLVDKNIRW AWQADVRHYYENINKNVLKKMLKRDVKNDRLIHLVFFLIDTFEYGLSIGSYLSQFLANYY LSKAYIYASQQFKIRKRKDGTVKKVRLVSHVLFQMDDIIFFGRSKKDMKMLVKRFNEYID KELGLELKETAHFIDLQTEYVDILGRKISRKNLTIRSSTFLKARRTYKEAYSYVCHGKEI PLKLARSCVARYGAIKHTDSKRFQRRYHIETINKAAKKVIGRAAKGEQRENNEIFGETGE SKHL >SRS015663.25826-T1-C MTSEERREARYKRRRARRQERLQSRNAAIGTLEEAFSYRAMFYYGKKCCNGVRWKQSTQN FELHLFSGTAARRRQVLDGTWKPKKCSHFILKERGKIRPIDAPHITDRQIHKTECNNILI PLYTPHMIYDNGASRRGMGLHFAYHQLEQQLHWHFRRYGRAGAVLLLDLKKFFPNAPHAT IYQRHQRLILDPSLRGLADSIVASSPCPTPGRGMPLGVEPSQQEMVALPSAVDNFIKCQA GVHVAGHYMDDYYIVLPDVEELKKLGREIVKRFEAIGVPVNKRKCKIIPLTKPFRFCKAR FTLTESGAIKINGSRDGMKRARRKLKLFRKEVDAGTRTLAYVEQYKESQSAYYRNFDDHG RLLRLRRLEYALFGGLKCSELSKTAAASA >SRS015663.35599-T1-C MTSEERHEARYQRRKKIREEKRKQKSKECGSYEEVFSYEHLYNSGKKCIRGVSWKASIQN YARKRLSNTYKIYDKLQKSKFEFSKPFIFFINERGKTRKIQAQKIEERIVQRTLCDYVIV PTYDRTFIYDNAANRLGKGITHTLNRVNCHMQRHFRKYGLEGYIVTCDFSDFFNSASHDV VYKENEKRIFDQKVKNIANKCMEIYGEKSFGLGAHTSQIYTNITVSPLDHYMKDKLKIKE YGRYVDDFYFFTKNKEEAIRLLEKVREKVKELGLKLNEKKTKIQKFSSGFKFLKTKFFCT ETGKVIRKLNRKSVSRIKKKMKKFKEWIIQGKFSKDQAMNAYYSWRGYAKHCNCYKVIQK MDQYVKENFILYKDLMEMEK >SRS015663.61407-T1-C MSKTIRNEFDKQLTYEKLMIAHKLSRKGKGYRKDVILFNLKQEEYIMWLYEKLKTKTYKH GGYTVFYITEPKVRKIEKSRYLDRIVHRWVVDNFLVPYFVPTFISTTYACLKDRGMHKAC LDVQKAMLHCKRTWNEYYILKMDIKKYFENINKNILYRILKKKIKDQKLLWLINEIVFSN EGIKNLPIGNYTSQMFANIYLNELDQYIKHKLHCEYYYRYMDDGIILIKTKEQAKEILEN IKIFLKENLELELNKKTQIFKSKQGVNFCGYKINEYRLKIRDKGKRKLKKKVKELKYYIR EGKLTSKEAKKYLAGHLGYLKIANVNNLTKKIFYLEDE >SRS015663.74003-T1-C MYLVTTSNTIKVIFMNIFYDANKIYEAGTKAIQGAPFKYQSQLFEMNHLIETAQILKDLK EWKYKPVAGKKFTINERGKIRHITSNNMVDKTINHLLCDNVLSPAISPYLIYDNGASQKN RGVAFHRKRFETHLHQYYRKYKSNEGYILLIDFSGYYASIPHDLCLKKLQYFLRKVNPEE AKITMWILKNLFDVFNIDNKNGKGVDIGSQPSQNIGISYPSQIDNYIKIVRGCKYYGRYT DDSYIIHQDKEFLKQLLKEIKIISSKLGLIVNDRKTRIVKLSQQFKVLQINYSLTPTGRI IKKISTKTVTRERRKLKAYKRLLDKGKMKSNDIENSFKSWMAGNYKKMSMQQISNMSQLY YDLFKEVPKWKNHGKLRYLMEHNLKT >SRS015663.83650-T1-C MKTYCKPKDVDIEDTNFNMEAVHCAFGNGKLRRRDFRTVLTKTGKISEPELFYERKEHKC HKIIDAIDAVAEQETQKIRDECLNLKPVRQFKRIDGIKMKERNLCQESPEQQVHEYILVH ALQPLFRAKFLPAQFGSIPGKGQVAGTRLIERIIRKRILGKLDAVKGDVHHAYPSTTTLC VITLLKRDVGKNKKLIWYAGAVTENYPDGVLLIGGYFSTWAFNYIMSYVLRYLMSLVQVR RGSGSKLVREIVCYADDFVLIGHASQLMKAMKKATRWAKSALGLEIKRAWQEVRFASFEE EKQVKAARAAGSHYRTPALDMMGFAVRRTYTIVRKGVFRRIRRQLLRAARDLATLGFVPH WRASKLTAYNGWFTNSDSTNLEEKYDVEKIMRAARWSVARWSMIQNRRKAA >SRS015663.263840-T1-C MQSALERTGTAIYVMDNNSNYVDREEIIGFDALYDSMMKSKKGVTWKGSVAHYVLNSMEE TYKLSEELEKGTYKARPTTQFKITSPKPRDIISTCFRDRVYQRSLNDNALYPIMTKQLIR DNWACQKGKGTDDARDRMKIFLQRMYRKYGTDFYGLQCDIHGYYPNMRHDLTKELFRDKL DDWLYEQTATVLDGQYAGDVGYNPGSQMIQIAGITFLSEYDHMMREQTEAEDYGRYMDDS TLFHPSKEYLENLKLINEKYLASRGMEYNSKKTKVFSIKEGFTFLGFKYRLTDTGKVIMT VSSEKVKERRRKLRKLVRKAKRGEITKAKVDDCYQAWRSHASKGNSFHLIRRMDKFYKSL WNDQETEVTDNEN >SRS015782.63930-T1-C MKRHRDLWNKIIDKDNLLLAFKKAKRHKSWQQKVIKVEQNLEEMIEKLRESLINGTYKTS GYRQKKIYEPKERTIYILPFYPDRIVHHAIMNILEPIWDKLFISHSYACRKNKGQHKGSI KCMEYVRKNRFCLKCDISKFYPSINHEILKRIIRKKIKCKRTIKLLDEIIDSIDSPTNVP IGNYLSQWFGNLYLNELDMFLKHDCKIKYYLRYCDDFLLFSNDKQFLKDMSVKIEEFVTT KLKLKLSKCNILSTSQGVDFLGYRHFSAGYILVRKSTAKRMKKNIKALKYKLTVKKISKE SALSTVASVEGWLKWANAYNLKKSLQLEDLKKSIGGSVGE >SRS015782.96575-T1-C MPKTIKNIYDNSVSFENLLKAHKKARCGKREKKKIILFELKLEQELLELDKQLKNGRYKH GGYTKFKIYEPKERIIMASEYKDRVVHQWYVEKFIKPYFVPQFISTSYAGIEGRGMHKAS KDVQKAMRSAKSKWKNYYILKMDVTKYFQNIDKRILWEILKRKMKDKKLLWLTRKILLST EGMVGLPLGNYTSQMFANIYLNELDQYVKHKLKCKYYYRYMDDMVIMCENKEIAKDSLNN ITKFLKENLKLTLNSKTRIFKDIQGVNFCGYKINEKRLKIRHTSKCRMKRKLKRYTRQLK EGKITLPEIQRSIAGWLGYVKHADSYNLRKSMFYIEG >SRS015854.49145-T1-C MFESGEEIEDKMHKFKYDIEKETGIPWKKKKSYRYLYTLACQKGVILRAFKRMKRGKSDR KDIQMVEEDLDGWVEKIQKIIQNTKPAGWKVENPELEFKPPKHNPVIIKEAGKTRVIYVP TMVELWIQHVIVLILEPIIQGSSYHHSYSSFPKRGSHRGMKAMKRWIQSGKGIRNFAQCD IRHFYDHAKYKFIRPKLLKRIKDALFMYLIDVCLTWFPDKLPLGFYLSQWLANFLLQELD FLIKCKLKIAHFIRYMDNFTMADDNKKKLHMAIIFIKQWLGKIRLKMKGDWQVFRFEYIK KNGRRTGREVSAMGWLFYRNRVIIRKHTLIHIARIARKLNKKKMEKKKYPLRLCKGFISL MGWITHSDTYEWYLMYIKPLISVRAVKRIISKMDKEANKNARMENRELLLTA >SRS015854.74641-T1-C MQKMKKSYREYYILKMDVRKFFNSIDKKILYKILKKRIKDEKLLWLIRQILSAQERQKGI EIGNYTSQTFANIYLNELDQYATRKLKIKNYYRYMDDIVILARNKNEAKEYLNEIKEFLQ KKLELELNDKTNIFKAKQGVNFCGYKINEYRLKVRNKGKNKFKKKVKNLLKEIKEGNISS KDARQYLTGHLGYFEIANTYNLKQKNIFLQDELLRKNIIN >SRS015893.63378-T1-C MKRYGNMIPQIIDATNMGNAFDEVVGGLKPKRKEHYEKRRSSILKVLTERIRDGTFHVEN YEEFWVQDGPKKRLIQSPTVVDRIGCNAIMRVVEKYVYPTTINTSAASIKGRGMHKLFRK VRSDIGHDFEGTKYYYKCDIKKFYQSIDQKQMKKVVRRYIKDKQLLPILDSFINLMPSGL SIGLRSSQCFGNILLSRLDHRMKENEHVRYYYRYCDDIVLLSNSKRRLWKWCTIIHEEAA KLGLKIKPDEAVRPTRVGLDFLGYINYCTHSRLRKRTKQKAARKLSKVKSRKRRREIIGS FKGMAKWGDCCNLYKQLTGRYMKTFKELGLQYVAEDGKKRFGGKQVSLRTLTNIHIRVVD FEKDVTTENGPRTVVSFQYDDGEMGKYFTADKQQLWYLEKIQSMGELPFETIIKSETYDR GKVRYMFT >SRS015941.185842-T1-C MDLLRDIFSAYYDARRNKRNTESQMKFEMNLEHNLMHLYDELRTRTYRPSPCICFITFDP VQREIFASSFKDRVVHHLLFNYIAPLFETTFIYDSYSCRVGKGTLKGVERFEHHLRSCTQ NYTQSAYVLKLDIKGYFMSIPKRELRELLRKEMDRKPEWNKKFDRQLVDFLIDSILLRDP TSDVRIVGGKEDWEGLPPSKSLFRSAKGSGLPIGDLTSQLFSNIYLNQLDQFAKRQLGLK YYGRYVDDFYVIDTDHRKLARLIGLFRDYLHDELHLTLHPKKISLQHCSKGMTFLGAYIM PYRRYPRWRTICKFRNRMRLIEALCRCRTKMKEKQIFLLRSVINSYCGYLGHFSSYLLRK HLVDRNPFYRYFFFYGGFHVVGLKNIE >SRS015960.36122-T1-C MMFVEKKNLLADLFRAYYDARKNKRNTVNQLRFEMDLEHNLYISYMQILNRIYEPKHSIA FIVFSPVQREVFAADFCDRVVHHLFFNYVNPIFERTYIEDCYSCRKGKGTLYGVKRIFHH IRSCSDNYTRPCFILKLDLQGYFMSIDRRILYEKVRGTIEKYAYRKDRDGIRWKDKLDYG LVMYLAEVIIFNDPIKNYKIKGSKSDWDGLPLNKSLFNSKEGCGLPIGNLTSQLFSNVYL TSFDHYVKRELGYKHYGRYVDDFYLMHEDKENLKSVIPKLAAFLKENLKLTIHPKKVYLQ QYEKGVLFTGGFVKPYRIYIANRVKRRMTERLDSLRNNKKLDLHELRCSVNSYLGVMKHY KSFNIRKKIMIRHAWVFKYGYVCGCCSIFKLNDQGSATCC >SRS015960.175765-T1-C VFSCFSIIERGKPRDIRACHINDRLVQNALCEQVLLPELTPRFIYDNCATLRGKGIDFAL KRVKKHLQQAHREYGLGKDFYGLRIDIQKYFDSIDHKSLKKAAKRLIKDKRIYQLCCYLI DTFSFKLTKDSSPIPGKDYYISKRHKYTRISTTNFKPGRKYYEYDNKSLGLGSQTSQLFA LLALNEIDHFIKEELHIKYYGRYMDDLYLFHNDSQYLAECEKRIEVRLNKQGLKMNKKKT TITRISPIHADGKRHAPLKYLKWNFYLTDTNHIIQIPFKKKVAH >SRS016018.56315-T1-C MPKRVGYLYDKMVDRDFIRSVIQEAAKGRRNRRDIARVLADLDGYVEKTYELVATESFVP SAPKVREIYDESSEKFRKIKMVPFWPDGVIQWMLVTAMKPVLMRGMHPWSCASIPGRGGK RIHKKIRGALRNDPKGTKYAAELDVAQYYPSISGKRLIWALARKIKDKRFLRTVYSIIES CGGGLAIGYYICQWLANFYLEPLDQYIMTLPGVKYMTRYMDNITLLGPNKKQLHKARKLI AAFMQQRLGLSMKANWQIYPTAKRMVSAVGYRFSRTHVILRKRNFLRFTRQCRRVKKRLD AGKPIMFAQASGLLSRAGQLKHCNSHTIRVKYIDPIGVKHLKEVVRNESKRRQRAQQRIL AGGAA >SRS016018.61920-T1-C VYPVAAIHGGDEYRIADTGAYRLPAAGKGYKQRGFFIMTDFEKIHSFESLYNAYRKARQG KRWKGAAAKFEVNLLEALNLLSAQIRTKRYTMSPYNTFEVYEPKRRVVMSNSYKDKVVQH SLCDNVLEPILTRSFIRDNYASQVGKGTHYGLDRLQEFMRRFYRKNGIDGWILKGDISKY FYSIRHDVLKTLIREKITDPDVLWLVDLIIDSTEGNVGIPIGNQTSQLFALLYLDGLDHF VKEKLGIKYYGRYMDDFFLIHHDKAYLQECRKQIEAFVQARGLSLNAKTNIFPLKHGVDF LGFHTYLTESGAVIRKVRRRSKNNMKRKLKKLAALHAAGRIDAKTVEQSYQSWRGHAEKG NSYHLIRRTDHYYNSLMKPKEAAQCQKH >SRS016056.23625-T1-C LNSTERHEAKYQRRKAAREAKRRKHLDQYDNFNNVASVPALIRAHFDSRKGVMWKASVAR YNAHFIKYAVMQSRDLYTGKFKCSGFFIFWIIERGKRRHVHSLHYAERVIRRSCCINAIV PILSSGLIYDNGASLKGKGVSFSVMRCAVHLHQYFRETGSNDGYILVIDFKGYFNHILHK PLMENVIDRYILDPALNALVKQFISASDMETDSEKGKGLYIGPEDSQIYAISYPSSIDHH IKDKWGLRFYARYNDDSYIMLKSKEKLLEYKQQLFSLYDALGIIPNQKKTQIIKISRGFT YLKTKYFLTETGKVIMKPDHNAIVRERRKLKKLKKFYDSEILTLQQICQQYMSWRGAILK KDAFRSVKNMDKLFYSLYHVRPWRYKKDKKRRGEYEQRTA >SRS016086.30688-T1-C MAIREGYIIEEIVDYSNMSDAFDEVLRGTARKKCAEGRYLLAHREKVIQRLSDGILKGHL PKAKCYFITVREGGKDRRIAIYDMTLRIAENAVMRVVDRHIRNRYIRTTAASIKKRGMHD LSLYMRRDIAENPEGTRYAYQFDLKSFYDMIEGDFIMSCVRHLFKDKRLIDILSSCVYSH EGLVKGRRSSQSLANLLLSVHLDHYLKDELGVRYFYRYCDDGLVLSHSKEYLWQIRNLIH QRMNTIHLTVKPNERVYPITEGIDMLGYVVYPDHVRIRKRIKQNCARKLHEIKSRKRRKE VIASFYGMAKHSDCKHLFYTLTGKNMEESFKDFKNKQAPFVKDGKKRFRGPCVSLRTLLN KTVTILDFERDIKTENGMRWLVSVDDGGVLKKYFTDDKEMEYDLECLEKANKLPSMPITI GWDDSNGFGHYIFT >SRS016095.16849-T1-C MAMTGNENKKTRKKSKRMSDDDMQFPFLPFSADGCPPFLPAAGNRWNENVNNSGSNGNYW SASLNENNSNNAYNLNFNSGNRNLNNNRNNGFSVRPVTELTDTTNTSSPVSFHISPDELL LDLFQAYKAARRHKRYKPCQMRFEQNLEEELIRLRNEIVDRTYVPRPSSCFVISEPKKRE IFAADFRDRIVHHLFYNYTYELFERSFIHDSYSCRIGKGTHYGIERLKHHIRSESRNHSR SCFVLKLDIEGYFMHIEREKLLSVCRHTIFSMLSHASNQKGKTWREVLDLSLIDFLMEKI IMNDPVDGCIVKGRKSDWKGLPANKSLFGSPAGFGLPIGNLTSQLFSNVYLNVLDQFVKR QLHCRYYGRYVDDFYFVSADRKRLRQYKVAVT >SRS016095.36110-T1-C MSIKNVFNEITSFENIIGADRDCAPYDSDNYEVLEFRKNLEENALDLRDRLRRMDIPLAK YRSFFVFDPKVRKVIYTDYVTKVIQRSMYNVLYGPVQKGFIPDSYACVTDKGQHKAVAKL ASWFSEYNSAGIKAYYYKFDVRRFFYRIDHDVLMNDIIKKKISDKYTVELIRYYMCSTQR PFGMPLYGDSLTITDDEMLWDKGIAIGGGLSHMIGNMYLDPLDQYAKRTLGIKHYIRFAD DIIITDPDKEKLKEYGRLLTQFLNEKLLLEFNDRCALRPNSCGCEFVGCMIYPDHVLLRK STTLRMKRRLRSVAEDYRDYRVTYDYCKQVAASYSGMLDHVDGDSFKEKLWEDFVLTHNT EE >SRS016095.38107-T1-C VRYQHHGNELNNEEDIIGFEALFESMQKCKKGVMWKGSAAHYVLNGLEETLKLEKQLKTG TYKARQTTKFRVTYPKPRDIVSICFRDRVYQRSLNDNAIYPAMTKSFIQHNCACQKDKGT DYARAVLDEFLHRHYRKYGRAGGVLQVDVHGYYPNMKHQVAKDKFKKHLEPDIYKRAEAV LEDQYEGDVGYNPGSQMIQIAGISVLDELDHFIKEQLGIKRYLRYMDDFLLMHEDLEYLE YCKDKVIEKLAEYGFEPNPKKTKVIPITEEILFLGFYYRLTETGKIIMRLNPANVKQERK KLYRLVAKAKKGESSKAKVDECFNGWKDHAAKGTSYQLLRRMEAYYKELWRNQDGISTDQ NKRL >SRS016095.92391-T1-C MKRERDIIKEIIEPHNLYNSISVVMRGKKRKRTRIGRWIMKNEDKVVDILSRKIKEGTFC ITGYKERLVTDGPKDRRVQAIPIIERIGVNAVMSVVERKVFRRYIRTTSASIKNRGTHDL LNYIRRDLQNDYEGMRYAYKFDITKFYESISQDFMMYCLRRMFKEKVLLTILERFIRMMP DGLSIGLRSSQGFGNMLLSMFLDHYLKDQKGLKHYYRYCDDGDSHANTKKECWKVRNYVH QQASLMGLCIKPNERVFPVAEGLDFLGYVIYPTHTRLRKRNKKNFARKMHRIKSLKRRRE LAASFYGLCKHADCRNLFYRLTGVRMKDFKDLGIKPRYADGKKRFKGNQVSIRDLVNTSI VVLDFETGVIPKFEKEEYEVKVAKAKAEYERLKTKFHGDIPDEIDFINPDDIPQPEGRYV VRILHEKEEKKFFTASKDIWSVLDQIKEQGELPFRTVIKAERYGNGGTKYIFT >SRS016203.22008-T1-C MESRNTTLTTCGECASISPQDRKAVNTLEELLGQVEEKTSICFPLLDLIPEIIEDENMER SFKRVMSNLHNADTRNGLRWRENIVIDGVECTPRMVRYMKRKADIIAMLKAQIANGTFRI KHLKSFETADGPKIRTVQAPSVIERVGSNAIMEIVEKHLAPILIENTAASIEGRGPHGLY HKMQEARRNNPKLIYYYQSDYKGYYDHILHDRLIEIIKRYIADPVLLPILIDFVKALHPN DNVGISKGLRSSQFFGNLYHNDIDHAMIEECGKDNYNRFCDDIYILGDNKKELWKHRDTL HRLCKPYNLIIKPSEKVAPISAGMDALGFVDYGDYSLLRKRTKVNAARKLAKIKSRKRRQ QIIGSFKGMACHADCKHLYYTLTGKHMKKFSEMGVTYTPADGKKRFPGKVTRLGDIVNIP IEIHDFETGIDTKEGEDRYLVSFRNPANSEWGKFFTASLEMKGILDQISDIEDGFPFETI IKCEVFDGSKRKYNFT >SRS016203.116042-T1-C MNDDINENIEENIIGFDALYRSMEKCRKGVIWKDSVAHYYLNAIEETLKLEKQLKTGTYK ERKPVQFTIMAPKKREIVSIAFRDRVYQRSLNDNAIYPQMTKGFINTNVACQKGKGTDLA REILKEYLRAMYRKCGKNFSVLQCDIHGYYPNMQHETARQTFRKGLEPHIYKRANRVLTN QYAGEVGYNPGSQMIQIAGISVLSPIDHFIKERLHIKYYIRYMDDFILLHESEQYLEYCK QEIDKRLHVMGFEFNPKKTHVYSVGKCIPFLGFNFKLDDKGKVVMLLKSDNVKRERKKLR RLVNKCKRGEITKEKVNVCYAGWRVHAEKGDSRRLLDRMDKYYKDLWKGAK >SRS016335.48515-T1-C MKRKGYFSKHIATKKNFEDAFNGYADQKHSRKDIQEFEKDLEDNLLSLLRAYDAGTWQTS PYEYKFIDNPKPRKISKLKPADHVIQWAASLQYEPYMISSLYWKSCSCVPGKGTHYFVKI EKKDLYYAPQKETYYFVQLDMHHYFLHIYHPLMKEKLERKIKDPKLLSFLFEIIDSFLQG LPLGIKISQLNANMYLSAFDWKAIKCFGISEDTDKMNYWRDRYVTDKLLTCRTGEQAAEL SRGVQFLNRQFDRFAKQGLKHYARFADNIIIMHQDKAFLRIISELAIMYLTRDCLLTVNG WNVRPVHAGGIDVCGYVFFHDHLRLRKRNKQSLCRQVAKLKKKGYSKRDIQLKCASRIGF AMHADTRNLLKKLDIDMEKRLGKVITNRRKKAPFEGMAVEQKKLIEEIVCLKDEDEEKKL IRLLDYKIDDSIIEKNEDGSPKQRIAIRYNVIDHVEHQEGEEDPIYIWGSEYYSFSGSKV MIDQAETDFSKKDLPSPTVIREFTNKQKKKFYKFT >SRS016335.63713-T1-C MKRVGNLYNDLYNIDNIIKMTDKVLSKVKNKQRREKFLLYYSEHIISIKNRLESKDINLG KYNIFLITDPKCRIIMSQSIEDKIINHLIAEYLLVRVFENKYIDSMCATRVNKGSGYAIK LMKKYLNDIKLKYNKFYILKIDIKKYFYNIDHDILKRILNDNIKDKDALNILFKVIDSTN EEYINKRIIKLKENRINHLNSDKLIYETKNIPLYYYNKGCGIGDQTSQAFGLIYLNEICH YIKEELHISYFINYMDDFIIIHHDKEYLSFCLELIRNKLLNDYKLELNDKTRIYNICEGV EFLGYRFIIKNNKVIMKLRRNTKKDFKKNVKMLKLLEKYKYINYDKYTLLLSGYKGVLSK GSCNNLYYRSVYD >SRS017191.18037-T1-C MRREGNIIEEIITPENMEESFWTVLRGRKRKRSRSGRALIAHKKEVIAELTERIRNGSFK ACKFFEKEVEEGGKMRHIQIFSLKERVGVHAIMKVVDEHLRGRFIRTTAASIKGRGTHDL LCYVRDSIANDAQGTEFCYTFDIRKFYENVDHDFMKYCVTRVFKDNTLIQLLSGFVDVMQ KGISIGLRSSQGLGNLLLSIFIDHVLKDREAVKHYFRYCDDGRILSGCKKELWKLRDIVC KQAAKINLVIKKIERVFPIKQGIDFLGYVIYPDHTRVRKRNKQNFARKMHKVKSRKRRKE LIASFYGLVKHADCRNLFRKLTGKSMKKFSEMGIVYTPADGKKRFPGQTVSLKTLINLEV EIHDYESDITTKEGEGRYLVSLKVKKTGEWKKFFTNLEEMKAILNQISDVEDGFPFETII ESETFDGNKVKYKFT >SRS017191.21767-T1-C MKTFDVKHSDIVNFENILEADKNASQCKHYRDENLKFSAHREEGIIDLLNRLTYYPVFDE EGNQIPGKTESSYKVGRYRKKQIYEPKPRIIMALEYPDRVVQWAYYQILNPLFDKQFITH SYGCRQGKGTTKARDQLQRWLRKVNRSGKTWYVLKLDISKYFYRVDHAVLMDILSRKIKD EEILRDLYKLINCENTAFGLPAGVQPELCNEEDWLFDRGMPIGNLTSQMFANIYLNELDQ YCKHELGIHLFIRYVDDIIILWPDKEELKGILDNIKNFLEERLHLELNNKTSIRPAWLPV TFVGAQISPKFIRMRKSTRKRMFRRIKFIKKLFEACEIGFQKLNNTMQSYFGLIQHFTAG NLLRKIIDEFSFRIRNNS >SRS017191.159513-T1-C MKTYKNLMEQIASKENIKAAILNASKRKRNRADVKAVLDNMDFHIEKIQKILLNGTFKPH VDSPCIVNEGTHNKIRRIRKPHFVYDQIIHHCIIQILQPIFCVPMYQYSCGSIPKRGAHY GQKRIKKWLRRDIKNTKYIFKMDIKHFYESVDKEVLKAMLKRKIKDWQALELIYTVIDGC EKGLPLGNYTSQWFANFMLTELDHYIKEQLHAKYYMRYMDDIVIFGGNKKELHKMHRAIE QFLRDKLHLRIKENWQVFRLEYKGRGRPLDFMGWQFYREKTILRKSIYVRIMRKARRVGK HTTIKGAQGMISYMGYIQHTDTYGSYRENIRPYVNIGKLKKFVAKQERRKRNAV >SRS017247.24844-T1-C MKNNSYEILLDINVLYKAFQECKKNVDWKCSMQKYESNILQNLNALQKQLKNRSYIPDNY VEFNVSERGKTRHIKSPSIRDKILQRAVCDNILEPILYPKLIYNNGASIKDKGVEFTRQQ LVKHLRRYYVEHGNDGYILTCDFRKFFESIPHDKLISALQKYIPNEDTINLLKIIIYSYN DDGVGLGIGSQASQIFGVFYPTPIDTYCTVVEGNKYYARHMDDFYVIHHDKEYLKQLLPK IRKIADELGLTLNEKKTQICKLSKGFVFLKQFIYMTDTGKIVRRPVKKNVVRERRKLKTF HKKLQNGEMTMKYIREHYKSWRGTTKKYAKPYTIKNMDALYVSLFGTGE >SRS017307.66966-T1-C MTSQERHEARYQRRRAARRARQEARCAALGSLGEVFSYHTMFKYGRKCCNGVRWKQSTQN FERHLFSHTAKQRRLILAKRWRPKKYVHFTVCERGKIRGIDAPHITDRQIHKVISKEVLE PLYDPSMIYDNGASRIGKGLHWQIKRIKQQLARHYRKYGRAGGVLLLDLKKFFPYAPHSI IYQRHQRYILNPDFRRIADTIIDTAPGEFPGRGMPLGVEPSQQEMAAMPSAVDNWIKCQM STHSAGHYMDDYCIILPDIEDLKKLGRAIVRQFEIRGIPVNKKKCKIIPLTKPFRWCKAR FTLTETGKIKVNGSRDGVIRARRKLKLFHREWLAGKRTLQEVAQYMNCQEAYYKNFDDHG RLLRLRRLCYAIFGGRVPCSTKSSKPVMAPSLP >SRS017433.51430-T1-C MKTSSVRHIRSCVKEKTKRKEIIAIDANIDVEVAAMQKMIENTKPPDVPVENPELAYKPC KRTPKIIFEHGKKRKIFMPEIHEQWLHHIIVLILEPIITATSYRYSCGSFPKRGAHYGKK YILKILGRGKGIRNFGKIDIRHFYDNIRINIIMKELAIRIKDNWFLYIIRLCLKGFKKGI PLGFYISQWLANYILEPLDKFITEKLGLKDFVRYMDDMIFFDNAKKKLQAAIVSIQIFIG RRYRLRLKGNYQVCRFYYEGKRKKTGRPLDFMGFQFYRNKVILRKSIMIAATRMAKKLAR AKAAGRSYYETHVKAMLSYVGWFDCTNTYDCYTEYIKPLVNVGKLKKIVSKLNRRKNHYE AVERRTLQYAA >SRS017521.4210-T1-C MEQNFEVVYDFANLYAAYRATRKGKRWKDSVAKVELNTLEAITVLQAELRDGLYKPGNYH EFYVFEPKRRLIQTNSVKDKIVQHAFCDNILYPVLSRPFILDNYGSQVGKGTHFGLDRLR DFMREYYRKHGSADGWVLKADVRHYFASIRHDILKRDVNKLLTDPRSRALSDAIIDSTPG NVGIPIGNQSSQVYALLYLNELDHYVKEVLRMRYYGRYMDDFYIICESKEALREAWRKVE QFLTPRGLELNQKTQIFPLRNGLDFLGFHTYLTDTGKVIRKVRRSSKDRMRRKLRKYAVM YENGAMTRKQIEESYQSWRSHASHGQCRELITKYDAVCASIFERSVKSNHAAENQRPTRK GESAGRKD >SRS017521.9816-T1-C MTDYEKIYNFENLYRAYRKARQGKRWKGAAAKFEVNLLEALNLLRYQLQTKKYTLSPYNT FEVYEPKRRVVMSNAYKDKVVQHSLCDNVLEPILTRSFITDNYASQVGKGTHYGLDRLQE FLRRFYRKNGIDGWILKGDISKYFYSIRHDVLKTLIRRKITDPDVLWLVEMIIDSTEGNV GIPIGNQSSQLFALLYLNNLDHFIKEKLGIKYYGRYMDDFFLIHEDKAYLQYCRAEIEKH VAAIGLSLNNKTNIYPLRNGVDFLGFHTYLTETGAVIRKVRRRSKNNMKRKLKKMRGLVE RGKITTATVEQSYQSWRGHAAKGNCYHLIRRTDHYYNRLFNSKEAEKCQKH >SRS017521.20242-T1-C MSTTNNEAYADQLLEDVFSAYFKARKHKRNTASQLEFEMDLESNLIGLYEELYQRKYKPG PSYCFIVDKPVKREVFASQFRDRVVHHLLFDYINPVFEKRFIFDSYSCRLGKGTLAGIER LEHHIRSCTQNYTTHAWILKLDLQGYFMSIDKKRLYEIVSETLRKHWNQTPPVASTPSTD PDFIDYLLRGIIFKDPKENVIIRTPMDKWEGLPLSKSLLYSCTGNGLPIGDLTSQLFSNI YMGVFDEYVKRILHIRHYGRYVDDFYVVHACKGYLTDLIQIFHRFLEEKLSLILHPHKIY LQPCHKGVPYLGAFIMPYRRYPVRRSVSQFRKAMKLACRLLRLDDLSDDVLCGIRGTLNS YLGYLGNFRSRRIIREILDNAPIFRYFVYSSTENKLLLKKDLPPSSPIYSYIFPSSENVI PSV >SRS017521.56955-T1-C MKSKAMKRIGNLYERIIAVENLQLADQKARRGKLRSYGVRRHDKNREANTLALHEALKNK TFKTSKYETFIIKDPKEREIYRLPYFPDRIVHHAIMNILEPIWVSVFTTDTFSCIKNRGI NGCMLKVDKALKDVENTRYCLKIDVKKFYPSIDHDVLKQIVRRKIKCPDTLALLDQIIDS AAGVPIGNYLSQYFANLFLAYFDHWIKEEVGVKYYFRYADDMVFLHKDKAFLHDLLTQID AYLRDNLHLTIKANYQVFPIAKNRSDKHGRGLDFVGFVFYHEHKLIRKSIKKNFCRAVAR LNKQPNLSAKDYKQGVCSWLGWAKHSNSKHLLKTIIKPSFYGNL >SRS017701.3448-T1-C VKAFDEDLESNVNRLLHLFLTSSYHPSSEDYTYKTILEKKGRKERFLSMLQYWHHVYHWG VLTRTEDIVNRSLDEHCFACIPGRGQHMMVKLISHDLKTHPELKAFANLDVSKMYPHILH DIPKAYLRRKIKDPVLLNSLDAVIDSSIGTPMGNDDPEHPAGIAIGLKISTIYANISLGM FDHDVRRLFGLQSNPDLIHKLAAMYITDKKASAKTESDFAEISKGEAYLEALFCQYVKEG LRFFYRFMDNVLVLHEDKTFLHLVVDWIALYWGRELKFTMNPKWQVGATKGGFTIVGYRI FADGHVRANRDVVLDIKRKIRKGLKMGLTYDQIRVAISSQLGTVMHANSINFLKKYHMEK KERLGAKINRRKSQCPFEIAHGQQRRFENFLFDPEKQQSEDDFLMELRDYAVIDSIKEKN DDGTPKKCLAIRFEWQGKEFSYTDDRGKSVLVKPGEEYFSYTGSKVLIEQCETEFSKEDL PAPTVIKIEINKRNKKFYKFT >SRS017701.39397-T1-C MNSKERHEIRYQRRVAARQAKRTAYSESFGRYEDVFSYEHLYQAGKNCCKGVMWKNSTQS YMSRITTNTASTHDALLRREFRSRGFHDFDLIERGKLRHIRSVHISERVVQRCLCDNILV PVFSHSFVFDNAASLKGKGVDFAMDRLDRHLHRFYRKFGVEGVESGGVLTGDFSDFFNSA PHSIIYREAERRIHDDDVRRIACQFMEDFGDVGFGLGSQVSQIDALMVASPLDHFIKEQL HIKYYGRYMDDFYLIHENREYLKYCMEEIRKKCKEYGFVLNEKKTKIAPLRKGVKFLKTK FFLNETGAVIRKMNRKSPVKMRKKLRIFRRWIDEGRFTITDVETAYQSWRGHMIRGNSTL VLRKMDVFYNSLFKNKEDSGHGKVSEERQLARAC >SRS017701.152621-T1-C MKRKGYLFEQIRSMENLLQAFHNASNGKRKRDEVKRFEADLDANLRQLQAELTTRTYTTS SYEVFVKYEPKRREIYKLPFRDRVVQWAIMQVLEPVWTPQFTSNTHACIRGRGIHSLLRQ LRTDLRRDPDGTRYCLKIDVRKFYPSIDHGILKQVIRRKLKDPDVLWLLDGIIDSASGVP IGNYISQYFANLYLSELDHLLKEDVGVRYYYRYADDIVLLSDSKEYLSGVLVYINHYLNE SRLLTLKSNFQIYPVESRGIDFVGYVTYHTHCLARKRNKQGLCRELAALRKKGLPDEEIR LRVASRMGFMKHCDSNHLLKILGMKKFSDIKPKQGKLTGGKYHIDTILNREIHITAFDVS QSKYDGEMLTLQYEIYEQMEDEQGKVIDDEGNPVMAWIKHITFTGSKALIRQLDGVELTE PVAAKIIKQPIGTDGKRCFYSIVDPDQ >SRS017821.14600-T1-C MKRYGYLFDRICSIDNLRAAAHNAAHGKRKRDEVQKFFADLENNLQGIYTELRAHTYRTS PYEVFIKYEPKRREIYKLPFKDRVVQWAIMQVLEPVWTPQFTADTHACIRGRGIHSLHKR LHEDLAADPEGTRYCLKLDVKKFYPSISHEILKSVLRRKIKDPDVLWLLDGIIDSAPGVP IGNYISQYFANLYLSELDHRIKEVAEVRYYYRYADDIVVLAGEKPVLHGVLIFINDYLQT ERSLSIKSNYQIFPVESRGIDFVGYVSYHTHSLARKRNKKGLCREVAKLRKKGVPEADIM LRTASRVGFMYHCNSKHLLKILGMKKFSELVPAKSGNLTGTKYHIDAILNREIHLTGYTV APSKHNSEPCLTLQYEIEEALTEIMQDGTSRPVIDDEGNTVKGWVQHITFTGSQALIRQL EGVEITEPLRAKIIKQPIERNRCFYKIVDPDD >SRS017821.216176-T1-C VTSEERREQRYQRRKAARLKKRQETIGKYDDFERVASLNSLYEAAREASKGVDWKASVQR YNSLLLFNISKTRAELLAGKDIRRGFICFDICERGKLRHIKSVHFSERVVQKSFCTNILY PTFTRSLIYDNGASQKGKGTQFALDRLTTHLRRHFRKYGREGGILLIDFSDYFGNVAHEP LFKIYRQIFTDPRVIALGMRFISAFGDKGLGLGSETSQINAVMLPNRADHYAKEVLRIRG YGRYMDDSCLLHHSIAYLEECLERLEVIYSEYGIVINKKKTKIVDLKHGFTFLKTHFYIT ETGRIIKKPCRDSITRERRKLKKQAALVASGVLTFDEVRRSYASWRGSMSHRDAYRTVQS MDRLFNRLFIDQWKGGPQP >SRS018351.1434-T1-C MQKTKVNFDTVYEFETLYNAYRASRRGKRWKNTVAKVEMNALEAIAVLQEELSTGTYRPG GYREFYVFEPKKRLIQTNSFKDKIVQHAFCDTILYDVLTRPFILDNYGSQIGKGTHFGLN RLRDFMREYYRRHGSADGWVLKADVHHYFASIRHDILKQDVRELLDDERSLALTDLIIDS TPGNVGIPIGNQSSQVFALLYLNKLDHLMKEGFRFRYYGRYMDDFYIIHESKDTLRAAWK AISEHLAERGLELNDKTQIFPLRNGLDFLGFHSYLTDTGKVVRKLRRASRERMKRKLRKY KVMYESGAITKEKITESYQSWRSHASHGDCHALIQKYDQLYNEIFERSETADVSINQRPA R >SRS018351.18529-T1-C MTSEERKEGRYQRRRAGREAKRRARSEACGSFEQVFSYENLYKAGLACCKGVRWKCSTQR YLASLSENTARTRKALMDGTWKTMGFHEFDIMERGKLRHIRSVHISERVVQRCLCDNALV PLFSSAFIYDNAASLKGKGIDFAMDRMNRHLQRHYRKHGMQGGILVFDFTDYFNSAPQEP IHRENRRRLYDERVRERAESFMADFGERGFGLGSQVSQIDALMLGNGLDHFIKEQLHIRG YGRYMDDGYLICEDVRYLEECAARIRQYCAGIGLQLSEKKTRILPIRQGVRFLKTKFKLT QTGGVIRKVQRKSTRKMRQKLRKFRRWVDDGRMTEEDVRTSYESWKGHMRRGNSWKVLRK TDKLYRKLFGEKGDHQCTKFGRTGA >SRS018656.110309-T1-C MTYEEILCDANTLYAGYKASIKGSKWKESTQRFILNFLRYIFEVQDDLINRTLTNGPVDE FELHERGKIRPITSIPVKDRIVRHVLCDELLIPKIRRKIIYDNCASLKGRGISMQRKRFE VHLRKYYKLYGNEGYILFGDFSKFYDNIIHEIAKKELLELFEDDEFIDWLLNVIFDGFKV DVSYMSDDEYASCLDEVFNKLEYRKIPKNKLTGEKFMEKSVNMGDQLSQAIGIYYPHRID NYVKYVRGIKFYGRYSDDWYIMSPSKEELLDLLENIKRIAHEYGIHINMKKTRIVKISGT YKFLQIKYTLTESGKIMKRINPDRVYTMRRKLKKLAVKVENGEIPYENVEGMFKGWMGDF YKLMPRTQRKDLLELYEGLFNKTITIVKKKMIIKDRSQQESTQEVA >SRS018739.77535-T1-C MEITIEEIFNAYYECRKTKRYAKSALEFEVNYEEHLIKLYEELKARTWQPGNSQCFIVTR PVRREIFAAPFRDRIVHHILIGRLNTAFEKYFIRDSYACRVGKGTHAAIRKVEHNIKSES NNGHKETYILKLDIKGFFMSIDRNILWQKLESFIDSQYKTDSDNSADFEKYLAKAIIFNN PTQHCIFKSKKAEWEPLPRNKSMFTAQPDCALPIGNLTSQVFANFYLSAFDHYIKHTLKF KRYVRYVDDCVFVSRNIDELKTIIHLSKKFLKDELHLTLHPKKIYLQKAGNGVQFLGTFI KPWYTVSDRRIKNNFVQCLKKYTALAEAHLPNVEEKRQCRASVNSYLGIIAHYKTYTFRK AQMMRYFGGRLKTHFAVPQSVKKILLKRAV >SRS019030.50608-T1-C MRSYNNLYEPMLQDDYIKQRFINASKKKKNRNDVREVLENLDEHTELLKKMLTEELFIPD YHKPSIINESSSKKTRRILKPHYKYEQVIHHCAIGQFKPIVMNGLYEFSCGSIPDRGVHY GKKYMRKWLDSYDGKKFFVLKMDVHHFFESINRRILKRKLKAVIRDKRFYRLLCILIEHD KIALVAKILTDAGVEIDAEQTKTLVGCIAFDDISGALEVLREIGIAGAMFEELKIIIEEM RKGVPLGYFTSQWFGNFYLKALDHYIKEELHAEHYMRYMDDMVILGKSKKKLHKMHRAIE TYLNDNLDLEIKGDWQVFRFEYPVMKDGKPVLDENRKQVTKGRMLDFMGFQFHHDRTTIR KSNIEAARRKANHISKQDKISWYNASVMLSYMGLFKHTDTYNYYIEYIKPKINIKKLKRI VSKHSRKENEQHDRLEKGDRNTAGTSGGNRQDIVSVNGLSA >SRS019030.82895-T1-C MKSYKHLFDICISEANRRRAVKSAKKSKRIREMIRRRNLSDDELIGQSYDWIINYENAEH EPKIIQDGIRHKERKIIVPTLEELIVQHCVVQALQEMFWKGMYQHSYASIQRRGAHKAKK VIEKWINTDPKNVKYVLKMDIHHFFDSIPHDILKRMLTRKIHDERMLELLFKILDVTEVG LPLGFYTSQWLSNWFLQGLDHFIKEQLHAVHYARYMDDMVIFGSNKKVLHQIRLAISEYM ASELGLSLKGDWQVFRFSYTVNGEDKGRPLDFMGFQFYRNRTVLRKSIMLKATRKAHKIH KKPYQGRKPTVHDYRQMMSYLGWIDCTDTYGMYLKHIKPMINFQKMKRYVSHCDMRNDRR IYEQLVRLYLPRGGKRSGTYLHPR >SRS019161.73020-T1-C MKRYGYLIEQVIEESNLIDAFDAVMRGKKRTRTVRYLIKNRDSLLSELAEEIKAGTYKLA GYREFTVVEHGKVREIQSLPFKDRIALHAIMNILGKIFGGMLIRDTYASLPKRGIHDGLM RIRKALKDKAGTKYCLKIDLKKFYHSVDQDVLIELLGRKIKDSRMMDILIGIIRSYDTGL PIGYHSSQLLGNFYLCLLDYYVKMDLGVKYYFRYCDDIVILSSSKQELHAILEKMRTVIE GRLHLTVKSNYQVFPVEARGIDFLGYVIRHDYVLVRKHIKVRVARRLHKIKSKKRKYIVI ASFWGWIKHCNGTHLFFKLTNMKSFKELGVTYKPADGKKRFEGNLTPLGQLQNCKINVLD FETDIKTKEGEGRYVVQYELEGQKGKFITASDEMKNILDQIKELGELPFETTIRRETFGG NKTKYVFS >SRS019161.115760-T1-C MFLGPVNIFTIYEPKERRIVSQNVQDKIVNHLVARYILYPALLPCLLSINVASRKDMGTS KGLELASNFHRICKIKYKRYYILKADVSKFFASIDHDILKEKLKRKINDVEALKIVFDII DSEENGLGIGAMTSQVLAIFYLNDLDHYIKEVLKIKYYVRYQDDFLLFHHSKEYLKYCLE QIKEFLSKEKLFLNKKTRIYRDTNNFIFLGRDSNGKYARYRTVKRKLRKNLHLYKNNKIN LSCIVSSIICYDSLRNNKKKSCTLSSQK >SRS019161.137737-T1-C LKDRKKIKGYSMKRVGYLYEKMCDVDFIKTAIRNAAKSKTDRLYVKAILSDIDGYARKIK AMLEIETIKLSPSEHIEIYDNSCCKTRQITVPKFYPDQIVHWLIITALNPVITRGMYRYC CGSIPNRGGIDAKAYVETAIRDVKMRYCAKLDVSKFFDSVRPPILLEMLKRKIKDEKVLR LIGQVLENGGDHLPIGYYTSQWFSNFYLEGLDHYIKEVLHVKYYVRYVDDMVLIDSNKRK LHKAVAAIDNYLHGIGLKIKGNWQVWKLNSRPIDFVGYRFYKNKTILRKRIFFRLCRRVR KVSKTGYTTPRQAMSLLSLIGWLSHINGRNFYKKNIYPYAPKNKLKKIVSNYSKQNGGNL KNGKRKTKSIQQRQMARNGKRGQGGGDTHSTRN >SRS019161.137781-T1-C MKRAKNLYAELISDENLKMAILEVNRTHRWRPHHRPNKVVVWVDSDIPQRIIDLRNIIEQ GFVPAPAALKRRWDKSAGKWRDIHEPRLWPDQYIHHALVQVLQPVMMRGMDRWCCGSIRG RGIHYGMDAVKKWMRNDPKGTKYCAELDIHHFYDSLKPEVVFARMKQLVKDNRVLDLVWR VIKDGILIGAYFSQWFANVVLQPLDHLIREGGFRVSHYMRCMDNFTIFSPNKRSLKKLIA AISNWLAGLGLKLKDTWQHFATAVRLSTALGYRYGHGYTLLRKRNLFRLKRQLLCYYRKL KRGAVIPYTMALGLLSRLGQLKHCNSVRLYARLVRGRLQRDLKNIVRAYARKERAKWNTS SAMCSATA >SRS019582.8010-T1-C MQRESMSSLSYAEQLRNDLFRAYEDARKNKRNTIAQLEFEKDAEHNLIELYHELLDGSYV PGKSICFLTHFPVLREVFASQFRDRVVHHLLFNYIAPIFEKTFIYDSYSCRKEKGTLFGV ERLDHHIRSCTNNYTHTAYILKLDIQGYFMSINKNILYGIIYNKLMKQWEEKGKNCNIPG KNPEFILFLVESIIFKDHTLDCEVMGNKKEWMLLPASKSLFCQPKGIGLPIGDLTSQLFS NIYLNEFDNMIKRRFKIRHYGRYVDDFYVVHPSKAYLKSLIPEIRTFLKERLGLTLHPKK IYLQPYTHGVAFLGAFVKPYRKYAIPRSVNNFRSKAKKIISFSTRDELTIGQLEAIQASL NSYLGYLGHYKSYKIIHKSLTGSAVFKHLYFASGYKKSIPYKRYTNKRRERCEIAHDLNR ILTA >SRS019582.11075-T1-C MDLSISFMDMEEVIGFEALYDSMHKCKKGVIWKESVAHYVLNSSEETYKLNEQLENGTYK ARQIAKFTITRPKKREIISVCFRDRVYQRSLNDNALYPIMTNSFIRDNWACQRGKGTDDA RDRMKLFLQRMYRKYGTEFYGLQIDVHGYYPNMRHDLTNAMLERKLEPEIAKRAIDVLDG QYAGDVGYNPGSQMVQIVGISALDDHDHKIKEELDVDEFGRYMDDSLAFHPSREYLEYCR KVIGEILAEKGLEFNPKKTKVFCITDGFTFLGFKYRLTDTGKVIMIIDPKNVKERRRILR RLVRKAKRGELTKAKVDECYYAWRNHASKGNSFKLLQRMDKYYKSLWR >SRS019582.24615-T1-C MGKDYEKICEWGNLYDAYLKARRGKRWKNSVAKVEASALEAVALIQRELQTKTYRPGGYR AFYVYEPKRRLIQTNSFKDKIVQHAFCDQVLYDALTKPFILDNYGSQVGKGTHFGLNRLR DFMREYYRKNGFSADGWVLKADVRHYFQSIRPDVLKKDVAKYLHDPDCLTLAYQIIDSAP DPLGIPIGNQSSQIFALLYLNQLDHLCKEQLRFRYYGRYMDDFYIICESKERLQEALVVI RQHLAERGLELNKKTNIFPLRNGLDFLGFHTYIDDAGRVIRKVRKSSRDRMKRKLRKYAA LYQRGEIDREKIAESYTSWRAHALHGDCRQLVAKYDQQFLSIFERRPEHVQEDQHSGRGC >SRS019582.24907-T1-C MTSEERHEARYQRRVKKRQERRLALSKSCGDFEDVFSYENLYQAGHICCRGVSWKSSTQT YRFNLVTNTAATRRALLDGTYKSRGFIEFDLYDRGKMRHIRSIHISERVVQRTLCDKVIN PTLKPSFIYDNGASTENKGIDFALNRLSCHLQRHYRKYGREGYVLLFDFSNYFANAQHWP VSRELAKRVHDVRIRALANECLDNFGPVGYGLGSQISQTAALMLPNKLDHFIKEKLGIKG YARYMDDGYLIHPSKEYLKECLTRMKEVCDSLGIILNTKKTKIKKLSEGFKFLQIRFKLT ETGKVLRKMSFESIKKIRRKLKKFKRWNIEGRVVKIAGKFVRRVFPLSDICSAYESWRGH MKRGNSFHAVERMDLYFKKLFGFHPNNKIEWRKALCT >SRS019582.29191-T1-C MASTWIRKMKRKGNVMSRLTPELVHQAVINASRKHMCKNEVIEFLQDKENEYSVYRQLLK GESIDVEYRYKEVVSVNGKKRIVAISSFRSRVIMHTLMLLIKKEYAARLSDDCYNCIKGR GINASRKRYDPVRQIKRIIGRYRPWGYLQLDIRKCYESTRPEVLFACHEAIWKDKRILRY LQRVSFCDIGLPIGTPSSPMNQHIMMMAFDRFIRQDLKIRYYARYADDIILFGDKDKLHE AKWRIANYLWYNLGYELKKDAHPTPMRNGTDILGYVFRCGYTRVRKSIKERMKKSWRNPR SRSSYLGILKGADAKNLKRKLNMKLSFLITNETKVRRRMDSPLVDIAELTGKVFDILDFE VREPDKKKGKAWMRMQVRYEDMGDDGKPVVKTRLVKGFHVAICEFLKNMTQYINKTSAIS GMSYEETFKKTLPFEDCEVENRNGWCIKGTLEIEE >SRS019582.44351-T1-C MFTGVQRPVLKFSEICTFSVLYKAYLAARRGKRSRAATANYEVHLLANIVNLVYILQTKI YRPGLFRVFYVYEPKKRLVQAPAFVDKVVQHALVDNLIYERITNSFILDNYASQKGKGLH FGLDRLRGFFTEYWNKYRTAEGWVLKADVRHFFASIDHDKLKEKLKKLDLEPIVFDLLCT YIDSTDGLPLGYQTSQLFALLFLDEFDHFVKERLHIRWYGRYMDDFFLIHPDKDYLQFCL KEIRAFMASLGLELNEKTQIFPIRNGIDFLGFHSYLTEEGKVIRKLRHSSIKRMRSKLRR WEQDYPAGLVTREKILQSWQAWDAHAAHGNTWSLRQQVRDRVQNILKEEI >SRS019582.135512-T1-C MGRIESVRGAELFAPEETGGVSLEDVFTAYMDCRRKKRGTYNALAFEVDWERRCADLLGR INAGEYRPLRSLVFIIRKPVMREVFAPAFESRVVDHLIARKIEPLLEERFIDDSYSTRKG KGTLFGIERMERHIRACSHDYTRDCYVMKLDIRSFFMDLPKRELYDRMAAFLRERYDAPD LPTLLCLLRATYSDCPQQHCVRRSPLVAWDKLPPRKSLFNGDAEHGLAIGRLTSQLGALF YLDPLDHLVTEAWGVPHYGRYVDDMVFVHESREHLIGVAEKVRTWLRENRLQLHPKKFYL QHYTQGVAFIGGVLKPGRRYLSNRTVGFCCNVLHYYNQAAAEDARFVYGHAEEFVASINS YLGLMRHFASYRLRERIVARIDKRWWKVVHAAAGLEKLIVKKPYKKLTRKRREIRRAVAA YRKSYATDTAI >SRS019582.255537-T1-C MRREGNIIQEIVEYNNVAAAFDHVVRGERQEYEEGKELLAKREEVIAQLQHEIATGTFTV EEYNEVEIKEYGKVRRLQIVKMIKRIGCFAIMQIVDIHLHRRYIRTTNASIKGRGMHDMM HQIRKAIRENPKLKYAYQFDIRHFYDSIQHQLAKNSFEHVFKDATLLKILGSLTDMLDEG ISFGLRSSQATGNLVLSIHLDHPLKDELGAKYFFRYCDDGLALAETKAELWLIRDAVHEC LESVGFKIKPNE >SRS019787.800-T1-C MKFSEICTFAVIYAAYLAARRGKRSRAATAHYEVRLLENIVNLVYILKTKIYRPGVFRVF YVYEPKKRLVQAPAFVDKVVQHAIVDNLLYDRITRSFILDNYASQKNKGLHFGLDRLKGF FTDYWNKHHTAEGWVLKCDVRHFFASIDHDKLKEKLKKLDLEPVVYDLLCIYIDCSDGLP LGYQTSQLFALLFLDDFDHFVKEQLHIQYYGRYMDDFFLIHPDKEYLQFCLREIRAYMDS LGLELNEKTQIFPIRNGIDFLGFHTYLTESGKVIRKLRHGSIKRMRAKLRHWEKEYPAGL VTREQILQSWQAWDAHAAHGNTWTLRQQVRDRVQNILKEEI >SRS019787.53330-T1-C MKIKNVYDIIFSMDNLYDAFLDASESRRYNRDVLRFGYDSWTNLEELRERVLRGEYEIDR YFIFFVYEPKKRMIMSIAFEHRVVQWAIYRVVNPMLIKGYIKDSYGCIPGRGALGAMTRL RGWLEYVSKKDGDWYYLKLDISKYFYRISHRVLKNILRKKIKDERLLEVLFGIIDCKHTP FGLPPGKSPGDVPLEERLFDVGMPIGNLLSQMFANIYLNELDQFCKRVLGIKYYVRYMDD IIILSNSKAQLHEWRWTIDAFLEKELELSLNQKTCIRPINQGIEFVGYRLWYNKVVLRKS TTLGMKRSLRGVANKYHDYEMTLEQVAQTFNSYTGMLEHTDSEELLASLYTDMILTHGER RENEERFIQMLPQEEEMLYGI >SRS019787.60340-T1-C MTGTKDLFNSICSMDNLYRAYQNAKSGKGWYKEVKQIEKRPFYYLAGLQYMLKNHLFKTS EYEIFILNEGKKKRDVYKLPFFPDRIAQWAILQVIEPFLVANMTADTYSAIPGKGIQPIV NDLRGYYKTKRVDGKKKSVWVPSILLTDEENTRYCYKIDLHHYYQSINHEVLKQKFRKVF KDPELLWLLDEIADSINTATEEDLIELSLSGEIEVDPNTGIPIGNYMSQYSGNFYLSSFD HWVKEELHVKHYYRYMDDVVIFASSKEELHEIHRKVTAYTRDYLYLNIKGNYQIFPTKVR GVDFVGYRFFGEYTLLRKSTAINFKRKMRACRKKMENNIPPTYSEWCSFNSYKGWLGNCD SYRLFKKYMEPLIEYMQNYYEREVKDHGEVYKCYVQCG >SRS019910.5302-T1-C LRGELKKSAQEEEIIIGFDALYNSEGKCAKGVCRKASVGRFHLFRMDEILKLQKELATGT YKARPTIKVRITYPKPRTAVANGFRDRVYQRSLNDNAVYPAMTRSFIRQNAACQTGKGTD WARKQVKLMMEREYRQHGADGYVLLVDIRHYYDTMPHDVANRCFERHLPPSVHNRVREVL DRQYTGEAGYNPGSQMVQLAGISVPDPIDHYIKERLRAKKYVRFMDxxxxxGGRRSAPGT LPMAWSCTRPRPRSSG >SRS020233.320631-T1-C MAKKVKNCFVPSITYEKMYKAYLFARKGKRYRKDVILFSLRVEENILSICNCLLLGNYKF GNYKEFYVYEPKERRILAASFKDRILHTWYVKEFIEKIFVPQFISTSYACIKGKGMHKCV VDIKKAMYKLYKRWEKGYIIKMDVSKFFFSINRKILYEIISKKVNDKNFLKFTYTLLEST KEYDDIVEKGIPIGNYTSQMYGNIYLNEVDRYIKSELKCKYYYRYMDDSCIICESKEKAK ELLCKINKFYIEKLDLKLNSKTNIFPLKQGIKFCGYTIKIGNVRINKRGKKSIVKKLGKI RYMFKNGEINVDEAKQMLVGHIGYIKIANVDSFVKKYFYLEQ >SRS020233.335787-T1-C MDDKSIICNFENLYNAYKRAKAGKRRNESCARFQTMSLDGVHILLEQLKNKTYKMNPYNE FKVYEPKERLIRSCSFKDKVVQHCLSDTILHPRLENQFIKTNYAGQKNKGTLFGMDCLKK QMLEFYQKHKLDGWILRCDVTKFFYSIDHEILKDIVDYYFPDNYTMWLNHLLIDSTDGIG VPLGNQVAQIYALLMLDGLDHMVTGELGINLYGRYMDDFYLIHHDKEYLKWCLDFINQFV ESLGLTLNGKTQIVPFKCGIPFLGFHHYITKDGKYIRRLKGENKRKIRKKIRKWVKLVKS ERMTETKFYEKYNAWKNHASHGNCVKLCHSMDLYVEKLFKSNIDSR >SRS020328.38554-T1-C MKRKSHLYEATYDFNNIVNAYNEVCRNTKNKRKVEFFREYKCVYISRIYNILKNEEYEVG PYNIFTIYEPKERRIVSQNMQDKVINHLISRHILYPALMPCLLNFNVASRKDMGMKRGLE LYIYYRNYFKQKYGTYYILKCDISKFFASINHDILKRKLERKIKDKKALDVVFKVIDSDS EGLSIR >SRS020328.71561-T1-C MNSTERHEARYQRRKAARIEKKAKALKEFGDFGAVFSFDHLYASYRASIKGVGWKASTQR YKSSALAHIAKTQEELLTGKYRSRGFYEFDLVERGKPRHIRSVHISERVVQRCLCDYCLV PALSKSFIYDNGASLPGKGYDFAVSRVTRFLADHYRRYGNEGYALIFDFSKYFDTAHHEP IFEQFRRSGIDGNLVRLSEYFISNFGDVGLGLGSQVSQIAALALPNKIDHFIKDVLRMKQ YVRYMDDGCIIDRSKKQLENCLHHLRRLCAAHGIRLNEKKTQIIKLTRGFSFVKVRFRYG RTGKIVRKATYQGVRHMMQKLKIFRRWVDRGRMAAADVAASVTSWLGHMRRFHSYFAVQK VLRQYNQLFPGGTYGLHHLQTV >SRS020328.93934-T1-C MKRKGNLYNSICDYNNILNSYNEVCKNTRNERRVANLKEYKSIYISRIYDILKNKQYTVG PYNKFIIYEPKERLIVSQNVQDKIVNHLVARYILYPALLPCLLDINVASRKNMGTSKGLE LARNFHQKCKIKYQNYYVLKCDISKFFANIDHDILKEKLLRRIKDKDALKIVFDIIDSNE EGLFIGSMTSQILAIFYLNDMDHFIKENLKIKHYVRYQDDFLLFHESKDYLKFCLEEIRK FLDTQKLKLNIKTRIYKSTNNFLFLGRDTNNNYSRYRTVKRKLKSKYFLYKNHKLSLGSF ISSLRCYEQLCNRNNLFKLKK >SRS020869.10015-T1-C MVQSDFFEASEEALTDEIALEDIFQAYYDCRKNKRRTINALAFELNFEQELIRLWREINS GKYRIGRSIAFVVQKPVQREVFAADFRDRIVHHLIINKLNPLFEEYFSDGSYSCRAGRGT LYGVRNIAAAVKECSAGYTRDCYILKLDIRSFFMSIDKNILYQMLHAFITEKYRQPDQRI ILSLVRRIVYYNPEDACIIKGRRSDWNGLPCHKSLFWSSRHCGLPIGNLTSQVFANFYLD RLDKFVTEELGFKYYGRYVDDFILIHPNKKALLEARQKIDVFLQMKLKLKLHPQKFYLQH YSKGVKFIGAVIKPNRIYIGNRTKGNLYHKIYTMMPELAKGIQPLLDGLAKFSACVNSYI GFMRHYNTYNLRVKILKLLNGTFLGEILELRPRAEKLAIDKRFLPQKQKQRQLRRQRRYR HIQHKKQFKEKKDGSI >SRS021484.198268-T1-C MMESEVRDEVCDFDNLYRAMQHCKNNVMWKDSVAGYVKNGLVNVHKLKESVENGTYKLDD YTQFKVYEPKERDIVSTRIKDRVFQRSLCDNYFYDTMTKSFIYDNCACQDGRGTEFARKR LICHLQKYYRKQGTEGWVLKADLKNFFGSTSHELAYSAVTKRVNDEWVNGEIKRIIDSFN QGDDPEVGMGLGSETTQLIQLAVLDDFDHFIKEQLHIKHYVRYNDDFIIIHEDKAYLQEC LIKIDAWISSRGLKLSPKKTQLFKVTQGIKFLGFRFRLTKTGKVVMTLLPEKLSHERRKL RKLVERAKQGYMTKEEVDRCYESWKAHVGNESSKKRKSPGRRARRNCHNLIICMDQYYKN LWRESKCLDLSVQENSL >SRS021484.210114-T1-C MSLEGLHILKEQLENQTYSMNPYNKFKIYEPKEREIKSCAFKDKVVQNCLCYTVLRPRLQ SQFIRTNYAGQIDKGTHFGMDCLKEQMLSFYEEYGTNGWILKCDIRKFFYTIEHDPVKDI VDYYFYDEYTVWLNHLFIDSVESPGLPLGNPVALMYALLMLDGLDHFVTGELGIDKYGRY SDDFYLICSSRSYAKWCKEAVEAFVSTLGLSLNGKTQIVPFRKGISFLGFHHYVTEDGKY IRKIKGENKRKIKKKLNNWVKAVRAGEMMLTEFYTNYNAWKNHALHGNCKKLCHSMDLYV EELLKGVSQ >SRS021484.211873-T1-C MKRAKDLYPKLISDENLRLAILTVNATHKWHPHHRPNKTVLRVEADIDGYVEKLREIIVN GYDAAPPRIARRWDKSAGKWRDISEPRLWPDQYVHHAVIQVLEPVLMRGMDKFCCGSIKG RGIHYGVKAIKKWMRTDPKGTKYAEELDIHHFYDSLTIETVMARLRRLVKDRRMLDVCER LMKYGVLIGAFFSQWFANTVLQPLDQMIRNSGLCDHYMRYMDNLTLFGRNKRKLRKLRGM IEDWLAGRRLKLNNKWQLYPTAKRTVAALGYRFGHKFSLLRKRNMVRLKKSLSECYRAMR KHRKIRPKLAQGLLSRLGQMKHCNHVHFFEKYVETGLQRKLKLVVREHTRKEQARWNMCT EPLKSTA >SRS021484.224982-T1-C MVKYSNEDANTQKASSAWYWSSTENSQNNAWNVNFSSGNTNNNNKYNSNRVRAVAAYGKD FECFLETVIEAYKDCLRGKMSSKQAVEYMQIAEEDIVCLAIEMWTGVYKPATSTCFLVRY PKLREVFAANFRDRIVHHWICLRLEPLFEERFVSQGNVSHNCRKGFGTQSAVESAEQGMK KVSDGYRRPAWVFKGDLVSFFMSIVRMLLLERLLRFTEKKYHGEYKEILLRLVRVIVLHS PEKDCLFNSNPELWQKLPANKSLLRNGEGKGGPIGNLTTQLFANFLMSYFDTHVRWIMRG ANYHYVRFVDDFLLICDDLKALQEVIPEIESFLATHLKLKLHKDKRYLQPVSHGVLFVGV YIKPGRSYLSNRTLGRFKEKVIGFNRLAETTELTSSDCIRIQSVLNSYLGFCKGLRTYRK RKEILSLLSSEFYKYFYISGHYEKICIRKKHKFLNKDINYVLPNSINNGKKKVRRETIGS GD >SRS021484.227633-T1-C MDALNFLLSNGRSTEIPELDKGIEVLVRTADCASPSSTENSQNNAWNVNFSNGNTNNNNK YNSNAVRAVAALGEEIKEGWVAAFHDCCRNKKSSHNCNEYRAADWELDLWLLVYEIYFLH NYTPKTSVCFIVTRPTLREIFAAYFRDRIVQHWICMRLEPLFDKRFKSQGDVSHNCRKGY GTRSAVDALERDIKEVSQTYTREAWVAKIDIQSFFMSIDTRILWQNLEQFIKEEYKGDDI DILLYLTEVTVKHRPQNDCIRQSPGELWEMLAPHKSLFNRPEGIGIAIGNITSQQEANFH MSFYVEEIKPIAEEAGAKIEQFVDDVPCVAPTKEACLKFRKESERILREKLNLKMHPNKF YLQPVKKGVKFVGQVIMPGRRYISNRTLGNFVNQLRRTERLCQRILDGKITTDTLDQLRH DVCSINSYLGFMVHTASYRMRAKIFKRECRAFWKICYTRQFAVCKVKVKYDIKQFLINQE IQNYGMDLHRDRAAAGRKKDAPRHEAAHRELQHRAPRRKRPKRLQVPIQSSDA >SRS022609.28002-T1-C MDINNLLSLNGCGTGAVFDAAFVTTKINHMAARSASENSRNNAWNVNFSDGNTNNNNKYN SNRVRAVVALDEEIKEGWVRAKDECCANKYSTSQCEEWRMIEHWELWNLMYEIYYGNYQP TTSTCFIVSFPSYREIFAAAFRDRVVQHWICARLNPLFEMRFHLQGNVSFNCRKDFGTLK AAQALRVDMEQMSNNWQDHNIWIGRFDIKAFFMHIDRNILWALLETFIRKYYTEKDLDVL LRLTKLQVFHCPQDDCIRKSDIRLWDLLPPHKSMFKRPRNFGMAIDNILSQLCANFYLSF FDDVMNRLCRRYNCAYKRFVDDFTIVGEKQAILKIRSVAERWLSLYLHLTLHRDKFYLQP VHHGCKFVGTEIMPHRTYLANRTYGGMHDQMFALATLCRSIAHRGATLHKLKRLQNEVSS MNSYIGFSVHHRSFQVRRKLYAPFVEDIRKVCVLNSDMGFVRVQKKYDYQRNLILKEERD YEYPNWIYSA >SRS022609.40702-T1-C MDYSFTLADLFKAYFDCRRNKRCKITALKFELNLEENLLKLYKEIKSGQYKIGKSICFVV TRPKPREVWAANFRDRIVHHLIYNFISERFYKKFIYDTYSCIPKRGTLAAARRLLHFARS ATRNYKRKMYYLKADLSNFFVSINKDILYKLLQKEVHEEWILELLKQIIYNDPVKKVYKK SSPKKYSLIPPYKSLFNANYNHGLPIGNLTSQFFSNVYLDVLDKYVKQTLQCKYYLRYVD DFVIIDDSPEFLNFYFNSIKEYLYSTLEITINPRKKLVNKLDIGIDFVGFVVKPNRIFLR QKTLQNIFTAVNNWRFSYNPYHWKYLVKFYRSINSYLGMLVNLKTFNIRQAISFSVNSLF IYMNKDSEFSKSVI >SRS022609.88775-T1-C VWFALSFCLCFFALVGGRFDNGSNAGLWYWNYNNDSSLANTTVGARPLILGFGVFCTGVF SVRQALVGGNWGNTTQAGLWYWNFNWSGTDYNTNVGARLLIFINYLHIIFLAPWQKLVAL GWLSRLILERSAGKYKNMEPEMKRKGNIYNEILELNNIESAIFKASVGKTHRKSVEKILD APTYYAMQVQKMLSDRSYIPSPYIEMKIRDGASKKERIVFKPRFYPDQIIHWSLMNKVEP LFKRGMYEFCCASIKGRGIQRGMNYVKRILVNDRKHTKYCLKLDIKKFYPSIDKEILKKK FRKIIKDKDTLYLMDLIVDSSTEGVPIGNFTSQWFANYYLQDLDHYIKEELKVKYYIRYM DDMVLFSNNKKELRKCKYAIDEFLAREHLRIKENWQLFKTDSRPIDFLGYRFYRGYTTLR RSNFLRIKRRIKKISKKEELNFKDACAIMSYNGWIIHSDSYNYTQKYLKPYVDFKKVKEV IKNENRKHNKTGNKI >SRS022609.170649-T1-C MEIQYPIDNLIPEIVSDSNMYSGYDYVISHLESKEQRAKYAPDKDPKTEYDFNTPEEYAA YLEQQIYAAKVRDAIIGTLKRQIADGSFRITMKDVKTLRVKDGPKERECQAPKVPKRVGC HCIMVVIEKYTYPTLIHNTGASIKGRGMHWMHHIIEDDIKAVPELFTHYYKNDISHYYDS ISQERMKAVIRQYICDPILLPILDSFITLLPEGLSKGLRSSQTFGNLFISPVHHKMLTMA ERYFLSYPDGSVEVRYLYYNYCDDTTIGGNDKKRLWQLRNVYVEEMAKLGLKVKDNEAVR PLTDGLDMVGYVHYPTHSLLRKRTKQNAARKLAKVKSRKRRQKIIGSFKGMACHADCKHL YFKLTHHQMQKFSEMGITYTPADGKKRFPGKVWRLSALQNKTLEIHNYETDLTTSHGDDR YLVSFRDVQTGEWGKFFTSSDEMKNILDQISDREDGFPFETVIQSEVFDGNKVKYKFT >SRS023526.14525-T1-C MIDFNILLEAYFDCRRHKRKTVGATEFEMNYMSNLVQLLDEINSRQYKIGKSICFVVKYP RYREVFAGQFRDRIIHHYIALRLEPLFESQFSDRTYNCRKGKGQLAGIRQLQQDIREVSE NYTKDAYVMGIDLKGFFMSISKPLLAKMVDDFIVENYHGDDKEDLRWLCNMVVMHHPERD CEKKSADYLWEFLPKEKSLFTVGEDRGVAIGNLFAQLFVNFLLSKLDWKINYYCKHHVRY VDDMVLVARRKEALLRLMPMIRETLASLGLRLNEKKFYFQHYSKGVRFVGAIIKRDRVYS VNNTVNNYRKSVRNLNDAAKAGNIEAINKAIQSVNSYLGIFSHYNEYGMKRKIIKEELDK ESWQYFTVKGHFQSIHLRKKFNIDIKYKNMANEILNRKIEERKEVPSESEISRMLDEGKE LEIYKTPEGKIHIDITPAE >SRS023526.86892-T1-C MKRVGNLYSKICDMENLKLAHQNARKGKGWYQEVRMVDEDPEKYLGQLQEMLLNKTYNTS EYVTFIKHDSGKDREIFKLPYFPDRICQWAILQVIEPYLVKNFIKNTYSAIPGRGIHLAL HDIDQAVQHDVPGTQYCLKIDARKYYPSINHDILKKKYRRLFKDDDLLWLLDEIIDSTPG DTGIPIGNYLSQYSGNFYLSSFDHWMKEVKHVKYYYRYMDDIVILGSDKKELHKLLLEIK EYFRKELKLTVKDNWQVFPTFVRGIDFVGYRTFLNYKLLRKSTCKQMKRKMNRLHKKVID NNQLMNYSEWCAINSYKGWLIHCDSFRLSKKYIEPLEPYAKAYYEYQIKKGGKAA >SRS023526.256443-T1-C MELQQSLFTDEELGGIPLEDVFEAYFECRKKKRNTCNALAFESDYERRCVELWREINAGT YRPSRSIAFIINKPVKREIFAADFRDRVVHHLIARRLVPLLEEKFINDSYSTRKGKGTLY GIEHVAEHIRLCSENYTKECYILKIDIRSFFMKISKRRLYDLTEELLHERYGGNDLAILL YLLRETIFNRPEKNCIRKTPPQSWRGLPKDKSLFHSDGSCGLPIGNLTSQLLALNFLDGL DHLISEEWGVKHYGRYVDDMVLVHPSKEHLIEVKAKIAGWLSEHGLSLHPRKIYLQHYTK GVLFIGGMILPGRKYLSNRTVGFCYDAIDRLNRLAAGSPDYVKTHGEEFVSTINSYLGMM RHFSSYNLRCKVMSRIGKEWWQVIYFAGHLEKVVLKKRYRQIECKKEEICNEIKELRRTV >SRS023835.180281-T1-C MRVRGISINGRLTDMGKSHVSYDDAKPFLSENSQHNSWILNFGSGNLNNNNKYNSNYVRP CTASIDFRIFQDSMFEAYEDCLIGKRSSPQALEYIPSASVDVCRLAWEVYNFSYEPATST CFMVTFPKLREVFAANFRDRIIHHWICLRLNPLFEKRNEDLGNVSHACRKGYGTISAIKQ VEAGIEKVSHNMQKEAWIYKGDIVGFFMNINKQKMFEVLKELIEKKYFASDKSILLFLVK VTVFHSPEKNCFIKSPFELWKKIKPDKSLFYNGEGIGEPIGNLTTQLFAGYYMGFLDEFV ERLFERKNYSYTRSVDDFVIVCDDRTFLKRAIKLIMNFTHNELKVECHRDNVYFQPASHG VKFLGQIIKYKRRYTINRTIGRMINRVKQCLLECENGSMTLLRAEHWAMVLNSYFGFLVQ SNSWRIRKKVMAMLTPSFYQYYYIVNSRTVRIKNKYKHESNRLCY >SRS023914.9404-T1-C MKRYGYLFERICSLDNLRAAACNAAQGKRRRDEVQKFFADLESNLQEIRTELLCRTYRTS PYEVFIKYEPKRREIYKLPFKDRVVQWAIMLVLEPIWTPQFTADTHACIKGRGIHSLLKQ LRNDLAEDPEGTRYCFKLDVRKFYPSIDHDILKMVIRRKIKDPAVLLLLDGIIDSAPGVP IGNYISQYFANLYLSELDHRIKEVAGVRYYYRYADDIVVLAGDKATLHGVRVFINDYLNT ERNLSMKSNYQIFPVESRGIDFVGYVTYHTHSLARKRNKQELCREVAKLRKKGVPEPDIM LRTASRVGFMVHCNSKHLLKILGMKKFSELVPAKTGNLTGTKYHIDAILNREIHLTGYTV APSKHNAEPCLTLQYEIEEPLMEVMADGTSRHVIDDQGNQVKGWVQHITFTGSQALIRQL DGVEITEPLRAKIIKQPIERNRCFYKIVDPDD >SRS023914.11721-T1-C MTKFPILHYNTKNTKSKLVPPPMPPSKYEDVVGWPMIEASYKQALRGHRKFSREAVCYDL LSEVNNVELWSDLKKIESRPQPGRREYSPGPYRHRTITEPKTRSLHIPHLRDKVVQLVIH EELQNLFRPVFVDRSFACMYGKGPIRAAFNVQHDMRVARMKWGDEVAVIKIDVKKFFYSI DRQVLKKIIAKRFKKLKKKYPDKYEDFLRFYRLLCKVIDSSPEGETGIPLGNVSSQDFAN IYLNELDQYCIRFLGAKLYTRYMDDIVIIAPNKEIAREWLAKIKVFLQERLHLETNQKTK IFYVRQGVNAYGFKIKATHLLLRTESKRKEKRRIKAMIKKLKEGKITKKEIVQAVNSWLG FARWACAYNLAKKIFAPYRFIKTEGELPYGAISRNRQARRVLQQRLRSGTSYKAVA >SRS023914.13282-T1-C MNSEERREARYQRRKAKREEKKAIRNADHTFQKVTEFGSLRKSFYKARRGTNWKASVQKY GCNVLRNSYVISRKMRKHQKISKGFIEFDLYERGKIRHISSVHITERVPQKAVCDYGIVP IMEKSLIYENGASQEGKGTGFSSDQLIKDLRRHYRKSGFTNSGYIVMGDGHDFFASLRHD VIYKNMCKMITDQELIRQTMDFISPFGKGLGLGSQVCQINAVAYANAIDHYIKDVLRKKY YCRHMDDWYVIVDTKEEADEILRDISQMYAEIGIEMNPKKTQKIKLSHGFTWLQDRYFLT EKGKVIRKASHKRIAQYRKKLKKMAVKVENGEMDYNAVRSFYASISGYLAHKDGYLTKRS LANLYNQLFIEKFMQGGMRDEQIFAT >SRS023914.16530-T1-C MRRFGHISPQVETLDNFRRAFYDYARQKMHRQSVQQFEANLDHNLDRMLGAYQSESWHTS PYVAKDIDYPKHRQINKLPVIDHVMQHAALAPVEADLRRTIHGHSPAGTKGKGTHYFYQL VKRDIFSSPQAETFYCLPMDIHHYFQSIDHNLLKAEYRRKIKDRKLLAFIDEVVDSFNPG IVLGVKLAQLLGQLFLARFDYLAIRCFDILQDADRFRYWQARYVSEMLVTCRTPEQARLL SGGVKFLNDRFERFCQQGLSHYYRFMDNIYILHEDKVFLRLMAELSVMHLARDWHLSINK SWGVHRTCDGIDFCGQIIYADHALLRKRFKHDLCAQVAKLRKQGYSERQIQLKAASRLGL GIHANTKNLYKKIGMERFGKLVKARRARVPFEGMEKSQQQSIEDIICSEGQDENKFLIQV IDYKVDDSVIEKETVQVEETAADGSTHLVTKEVPKKRLTLRYRIIDHIEGTTEVWQTTDH YLYTGSKILIDQALNDFCRDELPFSTVVKELHNKFKKKFYKFT >SRS023914.52381-T1-C MITTEGLLEAYYDCRKRKRKTASAIMYEINYESKLIELRDRVNNRTYQPGKSICFIVTRP RYREVFAAAFEDRIIHHWIAIRLEPLFERIFSPRTFNCRKEKGQLYGVNMLVNDIKTCSR NYTRDCYVMKLDLQGFFMSINKAMLAKMVDDFVVETYTDETDKETLRYLCRTVVLHEPEY NCERHSPLGYWDYIPANKSLFTNGRGYGVAIGNLFSQLFANYLLNVLDWFLLEDLGFTYV GRYVDDFYVVDTDKAKMLDAVPKIRALLAGYGLTLHPDKFYMQHYTKGVAFTGSVVKKDR VYTANRTIKNAVMAVRRLNRAETLPEVMHAVDSINSYLGCLRHCREYNQRAKLLRRIEPR CYKWIYIKGRYEIVAIKKKFRKRTRTLQRINDGNY >SRS024132.21355-T1-C MIDFSILLEAYFDCRRHKRKTVGATEFEMNYMSNLVQLLDEINSRQYKIGKSICFVVRYP RYREVFAGQFRDRIIHHYIALRLEPLFESQFSDRTYNCRKGKGQLAGIRQLQEDIMEVSE NYTEDAYVMGIDLKGFFMSIYKPLLAKMVDDFIVRNYHGDDKEDLRWLCNMVVMHHPEKD CEKKSADYLWEFLPKEKSLFTNGEDRGVAIGNLFAQLFANFLLSKLDWKIDYYCKHHVRY VDDMVLVARRKETLLRLMPMIRETLASLGLRLNEKKFYFQHYSKGVRFVGAIIKRDRIYS VNNTINNFRKSVRKLNDVARNGDIEAINHAIQSVNSYLGIFGHYNEYGMKRQIIKEELDE EAWNFFVIKGHYRSIQLRKIYNIDMKYKNMANEILNHKTEERKDIPTENEISKMLDEGYE LEMYIIDGRIHVECYSRDS >SRS024132.73111-T1-C MRLEEIFTFENLYDAYKQCRKSKQHKGEVVRFEANLSYNINKLVDEFATKKYKLGKYKEF LIYEPKERLIEALPFRDRVVIRCFCDVVLRDKIEVKLIYDNTACRTEKGTLFAIKRLEKF LRHEYLKEHNNDIYFLKCDIQKYFPSIDHQILLDLLRKINFSEDEMWMIEKLVKEQPNNA DTGLPLGNQSSQWFALFYLNVVDRYVKEKLRVKGYVRYMDDMILVHRDKEYLRYCLNEIE RICNEKLNLSLNHKTQIGKVKNGIDFLGYRHILNENGSITRKLRSSAKKRLKKHLKTLKK LRDRDIVDDEYVYIRKNAFYNHIKDTKESQKLKTETLPKKIKIN >SRS024388.116126-T1-C MSFGNICKAGKSCCDGARWKTSTINFETNLLGEAQATYDTLHYGKRVFKGFHSFATVEHG KVRNIDALPIQERAIQKCLCKNLLTEVYSRSFIYDNSASLKDRGMDFQLRRLRKHLQDHY RRYGTEGGIYQFDFKNYFGSLPHEEIKRRARKKIMDDRLYTLFCDFVDDFRLMKTADKEA HRGVGLGSEVSQIIALDYASPIDHYVKDVRGIHGYGRYMDDGYVISNSLEELEDIKRNLY RLAEALGIAMSDKKNIITPFRHHSFTFLKMRVTLTETGKVVMKLSRKSIRAMRRKMDIFR RWMDEGRMGPEDVFQSYQSWRAHAKRCNSYDTLRAMDERFTRMFAEELAGRRKPFPCTMK ATRTGCGWIYRRHGAVIEEEMCA >SRS024435.283230-T1-C MKKFNVTYEQIASYSNIYAAYLDARKGKSERNEIMRFSLELDAHLNDLYNDLQEERYKVS GYRIIYIYVPKKRLIMALQFRDRVLQWAVYRLLNPLYEKTYIKDSYACIKERGREKAASR LQYWLRQTERKPKQYYYLKLDISKFFYRVDHEVLLNILKRRIKDERLIKLFDKIINSEKR AFGLPLGVDPCEIDPREMLFDKGMPIGNLTSQMFANIYMNELDQYLKHELHLKYVIRYAD DCIILHDSKEELWQVLEKVETFLLKNLKLNLNNKTVIRPTTCNIDFVGYIINKDEIRLRS ATVKRMRSRIKYIVKAYERGEMTLQEVNATMQSYFGLIKHCTNEGLRENIINGFVLHCTD AARAKARESR >SRS042284.7927-T1-C MITGTSLLKNGRVTGPLPSVKDLVKNKIVHAITASRYWSSSENSQNNSWNVNFSNGSFNN NNNKYNSNMVRAVAALNDKYVEGWFDALDDCCAKKKTSSQCVMYRLIWHEDLLDLAREVY ERTYRPTTSTCFIVTRPKLREVFAANFRDRIVQHWLCLRLEPLFEARFVEHGNVSFNCRK GFGTFACIDQLTKNTIEVSDNYSHEAWYAQFDIKGFFMSIDCERLLEYLLPFIKEKWNYW KGTIYEQDLDLVLWLTEVIVRHRPQDDCIRQGNLKLWRILPKNKSLFYIEWMRGEPIGNL TSQLFANFYMSFFDEWAIKAAEERGAKYVRFVDDFGFVCKTKEDAIYFKRASRNQLRYIL NIQMHPNKIYIQEVKKGIKMVGGVIKPGRAYLSNRTVGNFINTVNYLEEACKNCNKEAIY ANVRSINSYLGFLIHYQSYGIRRKAFSELHYFWKACYIQGKFQVVKVKNTVQCL >SRS043001.7138-T1-C MTSAERKEIRYQRRKAKREEARRKRSMACGDFEEVFSFRHLYLSGKKCCKGVYWKSSTQR YIGNIIPNIALTARSLNEGNFYHRGFHEFTIMERGKKRYIRSVHITERAVQKCLCDYCIV PIYSSSFIYDNSASLKHRGMDFALRRMICHLQKHYRKHGLAGGILIFDFKSYFDEAPHGP LAAEAKRRLHDDHVRSLHDSFIADFGSVGLGLGSQISQTNALLLPSPIDHYFKEILRIKG YARYMDDGYAIHEDIDFLRTEGMFGLEEMTRKLGLRLNWKKTRVIPLADFYRWLKTKFIL TSSGKVILKMNPDSTKIIRRKLRTFHGKWERGEMTVADIRSSVESYHGHMKRGNSFKVRE NTNQYFKFMFGFYPNKKGWERNVSNSQRWNTSGNGDNPCVGQNAEQRVLRPMHRGTGTRH CDRGVCVPH >SRS043411.53472-T1-C MDNLRKAHQNAKKGKGWYEEVKEVEADVDGYLTRLQEMLINHTYQTSKYEKFIKRENGKE REIFKLPYFPDRICQWAILQVIEPYLLRYMTKNTYSAIPDRGIHAALQDVRDAMQKDVPG CQYCLKLDVRKFYPSINHDILKQKFRRLFKDAELLWLLDEIIDSICTAKIEDMRDIWLLD EDIDSETGIPIGNYLSQYCGNFYLSGFDHWIKEEKRVKHYFRYMDDIVIFGADKAELHRL LKEIDAYFRQNLKLTIKGNWQVFPSYVRGVDFVGYRTFLNYTLLRKSSCKKFKARMIAIR KKTETGQMMNYSEWCSVNSYKGWLKHCDSY >SRS043701.41599-T1-C MKRKGYLFEQICSMPNLLQAAHNAGKGKRQRDEVIAFEADLDSNLRQLQEELTTRTYKTS DYDVFVKYEPKRREIYKLPFRDRVVQWAIMQVLEPVWTPQFTADTYACIRGRGIHALLKR LRADLRNDPDGTRYCLKMDVRKFYPSIDHDTLKKVVRRKLKDPALLWLLDGIIDSASGVP IGNYISQYFANLYLSELDHLLKEEIGVRYYYRYADDIVLLAGSKEFLSGALVYINHYLHE NRYLSLKSNFQIYPVESRGIDFVGYVTYHTHSLARKRNKQGLCREVARLRKKGLSDEEIR LRVAARMGFMVHCNSNHLLKTLGMKKFSDIKPNQGKLTGGKYHIDAILNREIHITGFDVS QSKYEGDMLTLQYEIYEQMEDGQGKVIDDDGQPVMAWVKHITFTGSQALIRQLDGVELTE PVAAKIIKQPIGADGKRCFYSIVDPDQ >SRS045004.12817-T1-C MTSEERREARYQRRKAKRDEARLRRSKECGDFDEVFSFRHLYLSGKKCCKGVYWKNSTQR YIGNIIPIIAKTHRELQNGTFKHRGFHAFTIMERGKKRYIRSVHITERAVQKCLCDYCLV PTYSACFIYDNSASLKHRGMDFALRRMTCYLQRHYRKYGLEGGVLLYDFHSFFDSAPHEP LFREADRRLHDPKIRELANSFVTDFGSVGLGLGSQVSQTNALMLPNMIDHYFKEVCRIKA YERYMDDGAAISPDIDDLYLCMDGLKIICEKCGLELNLKKTRVVPLRDYYRWLKTRFIIT PTGKVVRKMNKDSTKIVRHKLRAFRGKLDRGEMTLADIRCSVDSYNGHMKRGHSFKVRQR TNQYFKSLYGFYPDEKGWKSHV >SRS045004.44759-T1-C MKRKGYLFEQICSMPNLLLAAHNAGQGKRQRDEVVEFEKNLAQNLQVLQTELKTRTYTTS AYEIFIKYEPKKREIYKLPYRDRVVQWAILQILEPLWTPTFTNDTYACLRGRGIHSLLHR LRADLRKDPDGTRYCLKIDVRKFYPSINHDTLKMVVRKKIKDPSVLWLLDDIIESADGVP IGNYISQYFANLYLSELDHLVKEFVGVRYYYRYADDIVVLSDSKQFLSSVLVYFNDYLNN DRQLTLKGNYQIFPVESRGIDFIGYVTYHDHCLARKRNKKALCREVARLRKRGMANDQIR LKVASRLGFMVHCDSKHLIKTLGMKKFSDIKPKQGKLTGGKYHIDTILNREIHLTGYDIS KSKYDGEMLTIQYEIYEQLEDNSGKIVDDDGNPVMGWVKHITFTGSKALMRQFDGVELTE PVAAKIIKQPIGSDGKRCFYSIVDPDQ >SRS045645.201001-T1-C MAECYKRQDLLTDLFQAYYEARKKKRNTASQLAFEIDLEHNLMELYEQIRDRNYTPSPGV CFVVNRPVKREIFASDFRDRVVHHLLYNYISPMFEGRMIFDSYSCRRGKGTSQGIARLEH HIRSCTQNYRYSAYILKLDLRGYFMSIRKDRLYDLICKGLERYRRSRKQKECPDMELIDF LLRQIIFRNPQRDCRIRGSAEDWEGLPPTKSLFNSPEGVGLPIGDLTSQLFSNIYLGELD GYVKRVLRCKHYGRYVDDFFIVHPSKRYLKELIPWIREFLKEELELELHPDKIELQHYSR GVAFLGAYVRPYRKYPSKRTAGFFREAVHRLEKESSKGKPAPERLRKMLAVLNSYQGHLK HFRSYALLDRFLRNSPLKEYFYFTGGYRKAGIRSRFLIPIKDTYGRGKEPETFPETGGRI MDNSVHREDSGGRNSK >SRS047044.65321-T1-C VFYTNLNNGRTNTSNNIGFRSALLLCRDKVKILRGITQRTTPKGHISDPKQELLGKRFNC RENAARTARQECHARRKGLGGKIMEQQAESPPINYLTDAYERIYDYEELYNSYLEARKNK RYRDDVLKYTDNLESNLIELQNELIWQTYKVGKYRPFFVYEPKKRLVMALNFKDRVAQWS VYRQLNPYYDRLFIGDSYACRKGKGSHAAADRLQYWLRQTERKPEKWYYLKLDISKYFYR VDHSVLLNILSRRIKDERLMGLLRTIINSEEQAFGLPAGVAPDDCPEDEWLYDVGMPIGN LTSQLFANIYLNELDQHCKHDLHLHYYIRYMDDIIILDNNKSELQTIKDDIEAFLAEELH LGLNNKTAIRPTSLGIDFVGYRIYATHRKLKKQTARRIIRSVTAMSKSLAADTLSREDFD RTAASYKGIFQHCNSFGLRRKLNAIFKEYALCGKGGTANAV >SRS048164.52624-T1-C MKTYKNIFVQVVAFNNLMLAHYHASKGKKHRDEVLIFEQRKAEYCIILGNRLVKQTYKVG SYRIFWIRRPVLRMAMALHYPDRVVQWGIYQVVFPIFDKGFISDSYACRKGKGAHAALDK LQYWMRQADRGGPAYTLKLDVSKYFYRIDHEILLKIIGQKITDPRMIWLFRAILHSDQTK FGLPEGMSADEVPPECRLEDTGVPIGNLTSQMFANIYLDVLDQYVKHTLHIHWYIRYMDD IIIIGRDKQELAHIRDEIAAFLRRELNLALNHKTSIQPLKQGVEFVGMRVWPTHRRLRHA TIRGIKLRLSQVLAQYEAGEITAESVERTIGSYRGVLGHCECMALKHKLNQTYGKFYIIK KERSEQNNGNQSIFLCEGREQGPEQELPRLGV >SRS048164.61614-T1-C LKRTGYIYEKLCDKALIREAIIKASRKKRRRKSVRRILNDIDHYVDEIYTMMLNEEFTPS PYRRFRIKDGATQKEREICCPKFYPDQIIHWMLIMAIQPILQRGMYEFNCGSAPERGAHY GKRYLERWYKRDRKNTKYCAKLDVRKFYPSAKSPVIMRELRRVIKCKRTLRLCEVILNSA DGLPIGNYTSQWFANFLLQRLDHYIKEVLRIPHYVRYMDDMCLFSASKRALHKAVKAIRE FLAGLSLQLKSNWQVFPTASRAVDFLGFRFFREKTTLRKNLSLRMRRRVKKIHCYTAKHG GKARPRDAAAVMSYAGWLNGTATNGFYAKYIRPYINFKKLKEAIRHETRVRARTAYCIGG SAQPCGV >SRS048164.212631-T1-C MKRIGNLYERVISLENLHLADEKARKGKLRSYGVMIHDKNRDANLLALHESLKNGTFKTS KYHIFTIYEPKERLIYRLPYYPDRILHHAIMNVLEPIWVSLFNKNTYSCIKNRGIHKCAK DLRHALKQDPDGTRYCLKIDVRKFYPSIDHELLKQVVRRKIKDNRLLALLDEIIDSVEGV PIGNYLSQYFANLFLAYFDHWLKEEKRVKYYWRYADDIVILAHNKDILHGLLHEIRAYLR GLKLKVKRNYQVFPVDSRGIDFLGYVFYHSHTVLRKSIKQKLCRRVAKLNKRKIAPTKEV YKQQICSWWGWCKYCNSINLMKKLSKTFPYEIKFNRSKCAL >SRS048164.215924-T1-C MNKENAQDNVDNIVYDLNSLYSAFLAAKKDSDWKPQVQKYEIDFLPNIVSTKDLLKNREY HSKPSSEFIINERGKVRPITGLQMSDRVIRHSLCDNVLSPSLIDYLIYDNGASLKGKGID FSRKRFEEHLHKYYRMYGNDGYILLMDYSKFYDNIPHDIAYREIAKHVNDEFSLWLLQQI FDNFKIDVSYMNDEEFSNCMNVPFNSTEYRLKIPKSAHTGEKYMRKSVNIGDQCSQIIGI FYPTPVDNFVKIVKGQKFYGRYMDDSYVIHRDREFLVQLEIEIAEKAKELGMTLNRKKTR IVKLSDYYKFLQITYSLTDTGRVIRKINPKRIVDMRRKLKKLSLKVIRGERTQEEVENMF RAWMGGHAHLMSKIQRDNLNLLYFDLFGGFIDGKYYLQYYPRRQNRDSRHNEWKQLYNRR CG >SRS048870.13195-T1-C MNYEEIVCDANNLYRAYKTSVKSSKWKETTQKFMINFLRYIFEIQDDIINRTLKNGLTQE FTLHERGRVRPITSIQIRDRIVRHVLCDDILLPEVKKHIIYDNCASIKGRGISQQRKRFE IHLHKYYKLHENDGWILFGDFSKFYDNIIHEIAKQELLKLFDDDEFIDWLLTLIFDGFKV DVSYMSDEEYENCYSDLFNKLEYRHIPSEKLTGEKWMAKSVNIGDQLSQVIGIYYPHRID TYVKYVRQQKFYGRYMDDWYIMNPSKEELEDLLSCIIEIAKEYGIHINRKKTHIVKISST YKFLQIKYTLTKDGKVIKRINPKRVTTMRRKLKKLSLKVINGEIEYESIENMFRGWMGAH YKLLSKQQRKNLIQLYEELFNKKISVISRKLIVSDASSLAA >SRS048870.45220-T1-C MIIEIDLYEVQYLRSKEILMKRKNNLYQYACEMGNIMQAFDEVCRNTKNKRKVRKYKEYK CIYISRIHDIMVNKTYEVGPYTVFKIYEPKERRIVSQNLQDKIINHLVSRQILYPALNPC LIPENVASIKDRGTKEGLRLEKSFIRKCNIKYKDYYILKCDISKFFASIDHIRLKEKLIK RIKEKDALNIIFKIIDSEEQGLGIGNMTSQVLAVFYLNDLDHYIKENLKIKYYVRYQDDF LLFHPSKEYLKECFEKIKIFLNKEKLVLNRKSRIYKNTNNFIFLGRNKKGGYAKYRSMKR KIKYKKYLYQNKKIDLINLTNSMICYKNLMxxxxx >SRS048870.68593-T1-C MTSEQRREARYRRRQTRRQAKRKARSDALGPIEEVFSYRAMFFYGRKCCNGVRWKASTQR FEMHLFSGTAKRRRKILNGTWKPGKTAHFTLKERGKVRPIDAPHIEDRQVYKVLTKKVLV PLYVPSMIYDNKASQKGGGLHFHYRRLAKHLRDHYRKHGLEGALFLMDFHHFFPDAPHAL LYERHRGMILNPDLRQLADLVVAAVPGGVGMPLGVEPSQQEMVALPSSLDNRIKAQLSIH GAAHYMDDYYTILPSKQAAEVTAADVIGHAEAMGLQVNAGKSKVVPFSRPFRFCKAKFQV TDTGAVKIHGCRDGMKRARRKLRLFQARVASGEMTVEQVAQWLQTPISYYENFNDHGRVL KLRRLFYAIFKTEV >SRS049959.3572-T1-C MVTTEWLLDAYFDCRHSKRRTASAVVYEMDYESRLIALRDRINSRTYQPGKSICFVVTRP RYREVFAASFEDRIVHHYIALRLTPLFENIFSERTFNCRKGKGQLYGINTLKEDIRQCSN NYTEDCHIMKLDLKGFFMSIDKKLLAEMVDRFIVKYYKGEDIDDLRYLCRVVILHSPEKN CERHSPLSYWEKLDKNKSLFTNGEGKSVAIGNLFAQIFANFLLNTLDWFIENEGIEHHGR YVDDFYCIHKDKEKLLALMPKIRELLAKLGLRLNEKKFYLQHYSKGVEFTGSIVKPGRVY TCNRTITNFVAAVRRLNKANNERQVLHAVCSINSYLGLLRHTNEYAMRRKVLNMIEPHVF KEYVYIKGHYEVLAIKNKHKLRYQTMQRIRNGDY >SRS049959.59097-T1-C MGYRRLKIRKQRNMKRIDNLYDKIISLKNLRLADENARRGKTNTYGVKVHDKNREQNLLA LHEALLTKTFKTSPYDVFTIYEPKERIIYRLPYYPDRIVHHAVMNVLEPIWVRLFTYNTY SCIKGRGIEGCARRVDKIIKSFEGKPLFCLKIDIKKCYPSMRHRVLKRLIRRKIKDKDLL WLLDEIIDSASMDDAGRPLTEADKAQSDPEDAHGLPIGNYLSQYLANLCFCYFMHWVNEQ LAELVKKALRLTVKPHIECTEYADDITFYAESKAVLHEVLKLIRVQLEDGLSLKIKGNYQ IFPVAKNRYDRHGRALDYVGYKFFREQKLMRKSIKQNFCREAARLNKREKPLSQKAYKQA VCPWLGWAKHSNSRHLLKTIIKQKYYGIL >SRS049995.27904-T1-C MTSQERHEARFQRRKAKRLERKQARCDSLGPTNKIFSYRKMFFYGKKCCNGVRWKQSVQN FEGHLFSGTATRRRTVLEQTWKPKSCSHFTLRERGKIRPIDAPHITDRQIHKTLCNEVLI PLYSPSMIYDNGASQKGKGLHWQFKRIKQQLGWHYRRYGREGAVLLLDLKGFFPNASHAL LYQRHRELILNPELQNLADTVIQYSPCPTPGRGMPLGVEPSQQEMVALPSKIDQWIKCQA RVHCAGHYMDDYYAFFPTVDEAKLMGHEIVRRFEAAGIRVNKRKCKVIPLTKPFRFCKAR FTLTETGKIKVNGSRDGVKRARRKLKLFHREFKEGKRSFFDIEQYMECQSAYYRNFNDHG RLLRLRRLYHAIFFGGGQCLESSNPGPVSA >SRS049995.105341-T1-C MPKRKGFLYEWMCDKEHIREAIVFGAKDKHDRRDVRRVLADVDGFTDRVYDLLQTQTFTP AQPKKRKIFDNSSRKWREIEYVPFFPDGIVHTLMVLAAAPVFLRGMNYWCCASVPGRGGK HALRRCKRVIHHGKKGSRYVCKMDVHHFYHSVDRRKLIWMLAHKIKDKKYLKLTWEILQT CEQGLAIGFFICQWLANFYLEPLDRYITTLDGVKYSVRYMDDIVLFGPNKKKLHRARKAI AEYLQKRLRLQMKGNWQVFPLKVRPLDYVGYRFYRDYTTMRRKNFLRFTRQCRKVRKKIE RHQRIAYRTASGLLSRIGQLKHCNSAAARKKYVDPIGVRILKEVVRNESKRRQCAGKCVF AGGAA >SRS050299.5207-T1-C MSIVRTEYGLCYGSDTRLLDYEDFEDCGYFIEESKRLKNIYQYVYDSANLVLSQYKAQKG KGVRTEITKFNNRISENLTDLWEMLYYETYTSGEYRIKTIYEPKERIIMIAPFYPDRIVH HCIINILGEYWTNVFISHTYACIKGRGTHKCMEDIHHDLVTDRKGTQYCLKIDIKKFYDN VDHEALKRIIRYMIADEQMLRLLDKVIDSNGKEKGLPIGNFTSQYLANLYLAYFDHWMKE DLGIKYYYRYMDDIVILHHSKETLHYILDMMGLYLGAELKLEIKHNWQIFPVDARCIDYV GFKQNHYGILLRRGILQRFYTKFDRVRKQYNISDEIAIKHLFPSEYGWIIRCSEEHSKFI LNNCINNGKAKCSEYRAVG >SRS050299.32424-T1-C MKRVSNIFEGIINYNNILYADEKARKGKLHSYGVKHHDRNRERNLHKLQDALTNLTYKTS EYSLFTIHEPKERLIYRLPYFPDRIAHHTLMNYLEPIWEKVFIAHTYACRKNKGIHKAAQ DIKKVLRKDTVNTQYCLKLDIKKFYPSIDHDILKSIVRRKIKDTRALKLLDEIIDSAPGV PIGNYLSQYFANLYLAYFDHWIKECKKIKYYFRYADDIVIFSKDKESLHTLIKDVIEYLN INLKLTVKSNYQVFPLDSRGLDFVGYKFYHTHTLLRKSIKHRMCKCLSRLYNKTYSNSYA RRKVCSYFGWLKFCNSINFCKKLVLRVCKKYNLTTPEIFAPKKDIISNVLDKKLKFIGYK IYNKYFKLYVITNKLISVTSKSKLLLSLMRNIISTYIIIIFHKRYNAYEILL >SRS050299.40079-T1-C MGLTFEKLFEAYLDCRKHKRNTINALMFEYNLEQNLFRLYEDLVSGEYKIGQSICFIVLY PKPREVWAADFRDRIVHHLIYNEIKDKFYKRFIKDTYSCIPDRGTTNAVKSVRAHAESVT HNYTETAYFLKADLKNFFVSIDKNILYDEIQKFVNEEWVLSLIKQVIFHNPKTSVCIKSP AYKFDFLPKYKSLWHTQSDKGLPIGNLTSQFFSNVYLNVLDQYVKHHLKCKYYCRYVDDF VIMHKSPQYLNDVHKDLTVFLKERLNLELHQNKKLINKVDKGIDFVGFVVKPYRINLRQK TIKRIFKIIREQKLNEHWFYEGELETFCSTINSYLGMLRNTNGYRLRKEICLQCINLFVK CDSEFTKLTVVLRSHIEIERNHHLH >SRS050299.40255-T1-C MKTHNHLIEIAVSDECMDISWKTCVKGKTDRPKVKEIIKYYDLAKRKLREKIWEGGFKPL IHKAHIVNDGFKQKIRKIIQPYLSIRRPEQWIQHIVIYVLKPIFMRGMYKYSCGSVPERG VHYGKKYIEKFIRKNPKKIKYVLKLDIHHFYENINIDLLKQRFRKTITDLKFLELVDFVL DSNVGILPDGTVLNKGLPIGYYTSQWFANWFLQPLDHYIKEELRAVCYVRYMDDMVIFGS SKRELHKMFIKIREYLTTLDLEIKDNWQVFLFDYVDKNGVRRGRPIDFMGFKFYRDKTTI RKSIFLRAIRLAKRISKKLKISWRDACQLLSYMGWFLPTDTHKAYEKYIKPFVNPIVCRK IVSKHNKKRKEKQTEENNAINLQKGSI >SRS050422.65682-T1-C MKTFDITHEEIIDPSNLLEADRNASQGKHNRDENLKFSAHREEEIIDLTNRLTYYPKDGI PGNPLESSYRVGKYRMKQIYEPKPRIIMALQYRDRVVQWAYYQKLNPLFDRQYITHSYGC RKGKGTVKARKQLQIWLRKVNRSPKHWYVLKLDIAKYFYRVDHEILMKILKKHIKDELIL RDLDKLINCEHTAFGLPSGVQPELCKQEDWLYNRGMPIGNLTSQMFTNIYLNELDQFCKH VLHIEKYIRFVDDVIALFDSKEKAFAARDEIERFLNEELHLELNNKTSIRPAYLPVTFVG ALITPLTIRVRKSTRQRMYRRIRFIQKMFEMGQLAWEKVNNTMQSYFGLISHFTAGNLLR KIIDEFSFRVPDNNS >SRS050422.182606-T1-C MKRVGNLYEKIISLDNLRLADEKARRGKLNSYGVKIHDKDREANILALHETLQNEAFQTS EYSTFIIHEPKERLIFRLPYYPDRIMHHAIMNILEPIWVSVFTKDTYSCIKGRGIHGAMR AVKRAIKDRENSRYCLKIDIRKFYPSINHNVLKSVIRRKIKCKATLHLLDSIIDSTDGVP IGNYLSQYFANLMLAYFDHWIKEVKRIKYYFRYADDMVFFASTKKKLHALLAEIKEYLGG LKLTLKGNEQVFPVAENRKDKHGRGLDFVGFVFYHAQTLMRKSIKQNFCRAAARLNRKLH ISAKDYKQRLCSWFGWAKVSDSKHLLKTIIKPQFYETCILRRKAA >SRS050752.31225-T1-C MPLFHAKIGRWQTASIPGRGINDARKAIRKWVRERDSRVFVKLDVVKCYPSIDRAVLKAM LAHDVGDRRLLRLMFALVDQYRGDRGLNIGSYLSQWLANYYLSAAWHYAEGSLFAVRRSR RRQGEEIRRRLVTHVLFYADDILLIGRSKRDLTIAVKRLRRFLRDRLHIEIHPTWNVKHI GAEPIDMVGYTFRPGRTGVRPAIFLRAERAYSRAAKRPMTMAMARKCISYYGWLYHSDSV AFRRRHDIDRIFHQARHVVSAAKTGRTTQ >SRS050752.39734-T1-C MFEHAKKRMDDPTTVPALLHADECVAQVQHWIVCGDWVPSKPIHTRHYEPSNGKLRDIDY VPFWPDGVMHWILIDSIYDRVVPKLDPYCVAGIRGRGPHSTKKHVEYWIKTDRAGTKYGA ELDIHHNFPETDHDFVMYGYRQLIKDKYWLRLADAVVQSFANGLPIGYVTSHWFQNLAMT AFDRYVRSLDGVRHYYRYVDNIHMYGPNKRKLHRALQAAMDWLCAAGYTINSSWQVYRTD YIDADGEHRGRALDGLGFVIYCDHTIYRKRTSKRLIRLCLDISKRPHGNPTPHQARQAAC RIGQLKHADMHHFRVKHVDGVISYRKIRKAIQNA >SRS050925.34161-T1-C MESRNISLTTCGERLDHSPRDRKAVSDIYDLFQPAVAATAVCYPLYNLIPEIISDENLER SFKRVMANLRSADTRSGNRQREIAVIDGIECSPRMARYVKNKHKILDALKEQIGNGTFRI KNLKSFTVDDGPKVRIVQAPSVIERIGSNAIMEPLEKHLSPLLIETTAASIQGRGPHGLF HQVQDTLAENPNIHYYYQSDYKGYYDSIDHDILISTIRRYVGDPVLLPILENFVKALYPN GKHGISKGLRSSQFFGNLYHNDIDHRMIDEYGAKHYFRFCDDIFILGESKRDLWKLRDKL HYEAAQIGLTIKPSEKVAPISSGMDALGFVNYGDYTLLRKRTKVNAARKLSKIKSRKRRQ QIIGSFKGMACHADCKHLFYILTKNNMKKFSEMGVTYTPADGKNAFPARLCV >SRS050925.118986-T1-C MTSEDRREARYRRRQARRRRNRQARSDSLGGLAGVFSYRNMFKYGKKCCNGVRWKGSTHN FELHLFSGTAKRRRKILDGTWRQGKTIRFPLRERGKFRIIDAPHITDRQIHKVFTREVLA PLYCPSMIYDNGASQKGKGLHFHYQRLKEQLRWHYRRHGRQGAIKLADFHHFFPDAPHAL LYERHRRLILDPDLRELADLMVAAVPGEVGMYLGVEPSQQEMVALPSYLDNWMKCQLSLH GMGHYMDDYNAILESAERAEEVLEDMIRRAEEKGLTINRNKCHVISLDKPFRFCKAKFQI LPSGRIITHGCRDGMKRARRKMRYFRQQVDAGEKTVEQVAEWLKGPIAYYEQFNDHGRVL KLRRLYYALFIKGRKTEEEKACIGL >SRS050925.123189-T1-C MTYQEMCEFQTLYEAYLEARKGKRSKPGTAQYEANALICTDKLSYVLNQKTYKPSGFEVF YVYEPKKRLVQAPAFVDKVVLHALTDNVLYDTICTSFIRDNHASQRGKGTLDAIVRLKDH MVDYYRKNGSADGWVLKCDVHHFFASIDHDILKAKLRALMQKRGVDMAFYDLMCIYIDKT DGLPLGYQTSQLLALMFLDEFDHYIKEDRGCRYYGRYMDDFYVIARTKRELQFLLKDIER WMSDLHLELNSKTAIFPLKNGLDFLGFHSYxxxxxxxxxxxxxxSAARRFSASRPASNTG KKPTRQGK >SRS050925.138218-T1-C MLSHDSIERSILKTSRGKRNREDVKEVLDNLEPEIEIVYNMVSSGNFKPRKHQTVVINEK NYLKVRRIIKPDFRYEQIVHHLIVQSISDCIMSGMYQYVLGSVPNRGAHMGARSIEKWIK KDPANTKYVLKMDIRHFYESVNHRVLKRWLKKKFRDKFLLELFDVVIAAADMGLPLGYYT SQWFANFLLQPLDHYIKEKLHIKYMTRYVDDIVCLGRNKKELHKVRLAISEYLENELGLT MKGNWQVFRFEYTAEELAITCKSLKELERLGNDLEAKRIRYKSKMHKGKRKIFIKLSGVK NKRELLSSILDKYHATSEVLLMVHGRPLDYMGFEFHRNRTVMRESIMLRATKKAVQVSKQ EKINPKDAASLLSSMGWIKHTDTYDMFLERIKPIVNIKTLRKVVSKNQRRLNNAVKLENS RGHTARAA >SRS051031.14886-T1-C MKREKDIYIKIIDKNNIYKAILNASKGKKDRENVKKILDNPYHYVDVIYNKLVTKIYNPS PYVEMKIHDGVRKKERIIFKPRFYPDQIVHWALMQQIESIIMKGMYEFCCGSVKGRGIMH GMKYLKKILVRDRKNTKYCLKLDIKKFYPSINQTILKRKFMRVIKDRDTLDLIDKIIDSA DEGVPIGNYTSQWFANFFLQDLDHYIKEELKVPYYLRYMDDMVLFHRNKKELHKIKDKIE AYLKNENLKLKENWQLFKVDSRPIDFLGYRFYRGYTTLRKSNFLRIKRRYKKISKKQKIT YTDASASLSYYGWLKHCDSYNFNQKYVKPYVDLNKCKGVIRNESRKQFTTKKQV >SRS051031.143236-T1-C MEALMLSDIGKEICTFRNLYDAMRLCKRGVIWKDSVAGYVVNGLVRIHRLKKSLENDKYD ISPYTEFKVYEPKERDILSTKIKDRVFQRSFVDNYFYDEMTRSFIYDNGACQKGRGTERT RKRLICHMQRFYRKHGLTGYTLKGDLSNFFGSTKHEVAYAAVNKRVSDEWARRESKRIID SFNNGPDPEIGMGLGSQCTQIIQLAVLDKFDHYVKEVLRIKHYVRYNDDFILIHEDKGYL RYCLSGIDNWMTSRGLKLNPKKTQIANFSQGVKFLGFRFRPTETGKVVMTLLPEKVSHER RKLKRMRDGVRQDRLIKSDVDRSYESFKANLTNNGKHKSEHPGRRARRSCHGLELAMDAF YKDLWRDDECSDLRRSSIS >SRS052027.8711-T1-C MSIRNVFSDICSFETLLQAERNVRKGRRYGKAELSFWANLEDNIYRISESISNLRFPPDR YYSFYVYEPKLRRIISADYTTKIIQRAAYDVLNPILCKGFISDTYACIKGRGQVNAMKRL ASWVDQAADSGENWYYLKMDVEKFFYRIDHDILMNIIEKKIGDKKTVRLLEHYICEASSP FGLPLGVKNPMEVAEEDLLWDVGITIGGGLSHMYGNMYLNQLDQLAKRSFGIKKYIRLMD DIVILYPDKSVLHKYKKEFSEFLSDVLHLRLNNKTAIRPINQGMEFVGYRIWPHKVRLRK STSLRMKRRLKKVQEDYRNYEMSFEKANDTVMSYMSLMKHCDCEALKTKIFSDFVLTHNP KEAN >SRS052027.52468-T1-C MATCLWLCGGSGHRWGLVVNPNVLEAERKRNMKTYCKRLVVSDADRIYDVITDYMHDKYR KNSSVKFFACYTGESREYVKQYLKPQLTNLEPVDRKEFAALSVGNLPENDFWRAAHYKLA LEMAYHIRTRTVREHLLANTYGQPLIRYTKINDPGSGKERMLGLETILFRLYEQVAAKAA EPLFKAKLGTYQCASIKGRGQNYGKKAVLRWLSNDVEGTRYSAKADVKKCYPSIRHGKLL ELLKRDLHKSDELLYLFTVFIELYEEWPSPEALAPDRGILIGSPVSKDLCNYFLSYAYHY ASERLVKETCRRGQTKIKRLISHIIFYMDDITMYAANKTHLKQAVKMMIAYMLNFLDLRI KPDWIMQKTMYEDQVGKTKGALLDFMGFRFHGGDVETKSYFGRQKKHKKVWVTIRRNIFL TARRKMNKFLKLVKHHVTVKIKFVRSVISAYGWFKNTNMVKYRIRNKVDELMRIARKIAS DYDKGNGYVEKKYFNMWRRYYAQGSKSRKNGKGSVQGKAEWRSRCVAQEQPA >SRS052027.149772-T1-C MKRSNIGIKDIATFENAEDAYRKARKCKRYREEVLRFTDNLEEELYDLVADLEAGTYRQG EARRFVVYEPKKRDIYALPFRDRVAQHMINNKIEPIVERRFYYHSYACRTDKGMHKAADY AQECIRNLSFEGKQVYILKADIHKYFNSVDHEVLKQILSGIFKDKDLLNLLYYIIDSYGE NGRGLPVGNLLSQLFANLVLNELDNFVKHELKEDKYIRYMDDFAIVSNSREHLVEVLQKI DAFLGERLKLTLNPKTQIINAKNGFDFCGYRIYKDYRKIRKRSPKHIRATIKAYRSGKIT KEKLLMKYASWEGHAKHADTYRLRMKIKGQIEAEIKKKELIGNGSITPDN >SRS053603.76346-T1-C MMTGRKLQAEELLEDLYCAYFDARKNKRGTASQLRFELNLERNLSVLAREISTGHYEVGR SICFVVDKPVKREIFAADFRDRVVHHLLFNWLSPMLERSFVADSYSCRKGKGTLYGIERT RKHIRQCTRNFTQTAYVLKVDISGYFMSINRQRLLDTILTEMEKYRHRRCGRGKMTWEER IDYTLATDLLRKVILNDPTQNCIFRGNRSAWNGLPANKSLFNSEEGCGLPIGNLTSQLFS NVYLSAFDNRMKREEGLRYYGRYVDDIIIVHPDSHYLAQLLGRMERILKTDYGLTMHPRK RLLQRADQGFDFLGMREHGSVLLPGRRLKRGMRLMLMLMKWKPSA >SRS054590.3812-T1-C MACGDFDEVFSFRHLYLSAKKCCKGVYWKSSTQRYIGDLIPNVALTLLSLKNGTFIHRGF HEFYIMERGKKRHIRSVHISERTVQKCLCDYCIVPIYSASFIYDNSASLKHRGMDFALRR MVYHLERHFRKHGLSGGILIYDFKSFFDDAPHAPLLREAERRLHDDRVRELHNSFIADFG PVGLGLGSQISQTNALLLPSPVDHYFKEVLGIEGYARYMDDGYAIHEDLDYLKGECMLGL EEVTRHLGLRLNWKKTRVIPLADFYRCLKTKFIITPQGKVILKMNPHSTKLIRRKLRSFH GKVERGEMALSDIRNSIDSYHGHMKRGNSFKVRERTNQYFKSMFGFYPNKKGWESNVSNV QRRRYSGYGHEPGLGQKAG >SRS054956.25561-T1-C MTSEERHAQRELRRKAEREKHRREKLSEYDDFARVADYNSLYQAYREARKGVTWKASVQR YGSELGKNLCRTHNALINGEDVRKGFIPFDIMERGKLRHIQSVHFAERVPQKSLSKNALM PVLSNSLIYDNGASRQGMGISHAINRVSQFLQEYYEKYGTEGYVLQIDLKNYFASIPHEP LKEMIRKKFTDEKIIKLTESFIDAFADEKMMEVQKQKEEAMLYGDEYARGLGLGSEICQG CAVAYPDPVDHYVKEVLRIRPYERYMDDSLIIHQDKEYLKMVWDVLREKYAELGIQLNEK KTKIVKLSRGFTFLKTRFILTDTGKVVKKLCRESITRERAKLKKFAKLTEEGSMTYAQVR EAYQSWRGYAGQKDAYKTIRRMDILFNRLFIKEWVYVPQIPEYSEERRIAA >SRS054956.84356-T1-C MKRYGNLYQKICSMDNLKEAHRNARKGKGWYAEVKEVDTHLEEYLTKLQEMLINHTYHTS PYEKFIRKENGKEREIFKLPYFPDRICQWAILQVIEPYLLRNMTSTTYSAIPGKGIHAAL RDVQEAMRKDVPNCQFCFKLDVRHFYPSINHAILKAKFRKLFKDAELLWLLDEIIDSIST ASIEDMRNIWLLDEDIDPETGIPIGNYLSQYCGNFYLSSFDHWLKEKMHVKHAFRYMDDI VIFGSSKEALHKLQKEVKRYFKTELHLTVKGNWQIFPTYVRGLDFVGYRSFLNFTLLRKS SCKSFKQKMNSIRKKTENGQMMRYSEWCSINSYKGWLKHCDSYRLQAKYIAPVQADADRY YQEVIQRKAA >SRS054956.105762-T1-C MKDHITSYDSLYESMLKCKKGVTWKPPVKSFVLNGEENILRMKHQHQDGTWKNGKPKTVL ITYPKRREALSIPFKDRVYQRSINDNSLYPQMTKGFTYSNCACQTGKGTDFARTLVKKYL WNYYCRYGTKGWIVQVDIHGYYLNMRHSDVERQIRNLTDKDTTEMSCGVLRDQYAGETGY NPGSQMVQIAGISLLNPLDHYIKEQLHVKYNIRYMDDFWILVKTRKQAERVFGEIMKQLQ IYGLEANGKKSHITPLEKGFTFLGFNYRLTETGRIIMTLNSDSVKHERKTLVRMVHKSQR GELEPEKVDEHHNSWENNADKGNSYKVKQRTQKYLKQLRKGEEHGSKKNDSDTCGSGRGR EPQSNRRKAEKNH >SRS055982.60922-T1-C MSDKYKPENTGVICSLLERDEFQPSPTYNVTINDGKERLLTIVPEFPDKIIHRALLLTLR PIWDKVFISDSYCGIRGRGQLPAAFKMRRYIQEARKGGPVYCLKLDIRKFYPTIKHDVLK GIVRRSIKDKRILRVLDAIIDSEKGVMLGSPISPYLSNLYITPLCHYLKEKKGVKWLINY ADDFGILSNDKEFLHRLLADIEDYTNVKLRIEVKRNKQIFPVALDSSDKHGRGIDFLGFV FYLNETRIRKGIKRSLCRKIAKLRKAKHPISREDFLQAIAPWWGWLKYSDSQYLINKLNK ISPYEIKFRR >SRS058723.13538-T1-C MHNATLDLQLGMRKMKRAYRDYYILKMDIRKYFNSIDKKILYKILKRRIKDEKLLWLIRQ VLSAQKRQKGIEIGNYTSQTFANIYLNELDQYATRKLKVPFYYRYMDDIVILFKTKKDAK EALDKIKKFLKEHLELELNDKTNIFRGKQGVNFCGYKINENRMKVRDKGKKKFKKKVKKL LREIKDENLSSKDARIYLTGHLGYFNIANTYTLKKKNILLQDELLREKIIY >SRS063985.12935-T1-C LSEKSVDEIRKPGGKVKRVGNIYEKITKKENIRKAIINASKGKKDRNGVVKILDNINFYV DEIYNMLINKTYTPSPYIKMLIHDGVRKKERIIYKPKFYPDQIIHWALMQQIQPLIMKGM YEYCCASVPNRGIHYGGKYIKKILVNDRKNTKYALKLDVKKFYPSINKEIMKRKFMRVIK DRDTLDLIDKIIDSSESGLPIRKLYFTMVCKFLFARCRPFYKGRNEGKILFKIYGRYVTF SQKQEGIEENKK >SRS063985.29752-T1-C LNRKSFNLDSIFTYENVEWAYRNVCKNCRSISKKTKFSYFSNSNIFDIYNKLNNFNYTFS KYKIFIIKDPKYRVIMSDTIYDKIVNYIIAYKILLPALSLSLIDQNVATRKGYGAKKAYY YFEKYANTLKSNDNVYVLKIDIKKYFYNINHLILMNLLKRYIKDKLVLKLILSIINQTNC DYINKDINCLKQNVINDLYYKNISIKEKNELIKKINEIPLYKDDKGLSIGNVCSQILAVF YLNEFDHFVKEKLKCKYYLRYMDDIIILSNDKLFLKNIYDLIEIELSKYDLSVNPKSNIY KLNNSFTFLGRTYYIKNGNVLYRCRSITYNGIIKKLNYLRNKDFKKYYPSKISYRGYLKK DIYSLNEEYIFLSSKYNNVIVKDFVNNYGYVIYSNISNDIINASLYLKGICCINYYNYKK LLNYLDINNIKYIYLEKTKIIFKYNC >SRS063985.38136-T1-C LKRAGFLYEKLLDRELIRDAIIKASRKKRRRRSVRRILNNIDHYVDELYTMIANESFTPS PYRRFQIKDGATQKVREICCPKFYPDQIVHWMMILVLEPVFMRGMCETNCGSVPGRGAHY GKKHIEKWYKLDRKNTKYCAKLDIRKFYPSSKAPAVMQELRRVIKCKRMLRLCETVLNSS DGLPIGNYTSQWFANFLLQRLDHFIKEVLHIRYFVRYMDDMCLWASSKKLLHRAVKAIEK FLAGLGLVLKANWQIFPTAARAVDFLGFRFFREKTTLRKNLALRLRRRVKKTYKHTQKTG RVRARDAAAVMSYCGWLKHAHCHGFFVKYVKPYVNFKKLKEAIRHEARIRARTAYCVGNA AQPRTV >SRS063985.40641-T1-C MAALGTIEDIARFDRMYQCGKDCCRGVRWKTSIQAFEAELFLRTAQSCRLVEGGAWHPQR RPVHFTVMERGKVRPIDAPHVDDRQVQKVHSRFVLAPCYGPAMIYDNGASQQGKGLEFAY RRLKRALMRHYRRYGREGSVILCDLKEFFPSAPRRALLDRHRRYMPEGPIRALADEMVLT APETEPGRGMPLGMEQSQQEMVALPSAVDNWLRCQMHMEAQHYMDDYVILVPPGVDAAAV LEAFIARCEALGLRVNRRKCRYAPLRRPFRYCKTKFRLTETGRVVTHRTGDAQRRCRRKM RLLAARGDWEGVRAQIVSAKGYYKRHNDNGRLTALRTLHSALRKERAA >SRS063985.50626-T1-C LSEDQTLNLSDFAKVIDFNSLYQSYTEARKGKRWKYAVCKYEVNVLENLMFVHFMLSAHK YRLSPYNCFIVKEPKERLIMYNSFRDKIVQHSLCDNVLEPYLSKTFIYDNYASQKGKGTH FGLDRLKYFMSRYYRQNGADGWVLKCDIRKYFYSINHDVLKEQLRRLIKDRDVLWLLDMI IDSTEGPGIPIGNHTSQWFAVLYLSGMDHMIKERLGIKMYGRYMDDFYLIHPDKDYLRYC LEEIKKYLVPLGLELNQKTAIFPLTQGIDFLGFRTYLTDTGKVVRKVRRESKNRIRRKLK KYRHLLDEGRIDFETILQSYSSWTGHAEHGNSYHLIRKTDDLFFNLFKNELEGLTYGKIT IRFARWKRRQVDEHEIQRESDQVDRGNPGHGQRPNGPGSRENDHSEMLRREGAFEHKQRP QELRKQPLFPV >SRS063985.62313-T1-C VNIDSAAFNLPATVKAFKGKLKRNDFQRELIHTGLIKKSEIAFERLDKHNDRPKTNAAIA AYNDYLTSCINERNLGLKPIRCFQRVDGLTQKLRDICQESPKQQVLEYIAVEALMPLFRA KLLPVQYGSIPGRGQVLGKRKIERILRTKLKCKIAVAKGDVKKAYPSVTVECSMTLLNRD IGKNKPLLWFVGALMENYPGDHLCIGSYLSTWLFNYVMSYVLRYAMSLAQCRRGVRTAYV KAIVCYADDFSLFGFYSQLVKVIRKSTKWAKDHLGINIKPAWQIYRPDTFDKEKEVHRER AAGSHRRTEGVDMMGFVIRRTYTIIRGRIFLRIRKQVLRAWNDIKRLGYLPWWRACRITA YKGWVKFSDSVKFAVAYRFYPLLKLARQSVSTHGRKEYVKNEQRILLVAASGC >SRS063985.120102-T1-C MAKTIRNQFDKKLTYESLMKAHYESRKNKSCKKDIILFNLKQEEYIKYLYDELKTQKYKH GKYQIFYVHEPKLRKIQKSRYIDRIVHRWLVDNFLYDAFVKQFIYSSYACIKGKGMHNSA IDIQKSMKHCQRIWNEYYILKMDVRKYFQNINKKILYSIIMRKIKDEKLLWLIREVIFST DGENEEVGIAIGNYTSQVFANIYLNEVDQYIKHDLHIKYYFRYMDDSVILVHTKEEAKIA LEKIKKFLKEKLGLELNSKTQIFKNKQGVNFCGYKINEYRMKIRNRGKKKLKKKVKALKI KIKEGKMTSKEAQKYLFGHLGYIKYANNYNLINKLFIRDKSKAQLLRGGSFNNAYASNPA CYREYNYAPNTNTNNGFRPVL >SRS064276.77955-T1-C MKRANNLFPKLVSEENLRLAIFAVNVTHRFHPHHRPNRTVARVEADVDRYVKELREIITG GYEANEPRLARRWDKSAGKWRDISEPRLWPDQYVHHAVIQVLEPIMMRGMDNFCCGSIRN RGIHYGVRAIKKWMRTDPKGTKYAEELDIHHFYDSLTAETVMKRLRRLVKDRRMLEVCER LMKHGILIGAYFSQWFANTVLQPLDRLIRESGLCDHYLRYMDNFTLFGRNKRKLRRLREL IEKWLAAHGLRLNGKWQLYPTAKRTVAALGYRFGRGYTLLRKRNMVRLKHSLSVCRRTMR RHHAIKPALAQGLLSRLGQMKHCNHVHFFQNYVETGLQRKLKCVVREHARKERARWNTST EQALSAA >SRS064276.157306-T1-C METKATYKAITDPNVLMDAAKQYMKGVMWKYSTQAYYLDRIERIRITKERLENRDRMSDG FVQFTVCERGKRREIRSIHINERVVHRALNDVVLVPTLRPKLIYDNAASLKDRGTLFALK RLKVHLWKYYREHGTNDGYILTGDLHSYFDSIDHDVIFREYGKIFAYDPDIVNLIMDFVD AFGDKSLGLGSQVSQMTAVYYPNRIDHYIKEQLKVKGYGRYMDDFYLIHESREYLQECLK QIRQMYADIGIELNEKKTRIVRLSDEFKFLKAKTHLTDTGRVVMRPDRGTITRERRKLKA LRRKLDAGEITFADVRQAYNSWQGHIKHFDSYRTRRNMDNLFQELFSEEIRKERSGNNGK KHKNYRGEFKRYDFPERIDPAERRHRV >SRS064645.16980-T1-C MKRFGNLYHRIYDIDNLYLAYSKAKKGKGKTYGVIQFEKDLDNNILSLHKELSERSYITS QYTTFIIHDPKEREIYRLPFRDRVVHHAIMNILEDIWTPIFISHTYSCIKGKGIHGVVKH LKKDLKDADGTKYCLKMDIRKYYPSIDHSILKRIIRKKIKDIKVLALLDGIIDSAPGVPI GNYLSQFFANLYLSYFDHWIKEEKRMPYYYRYADDMVILSSSKKELHSILLEINSYLNEK LHLQLKGNYQFFPVDSRGIDFVGYVFFHTHTLMRKSIKKNFCRKVSVLNKKNITPHDYKM AICSWLGWAKHCNSKHLIKKIIKNEKIQ >SRS065504.50272-T1-C MKRERDIIREIVEPHNLYNSISVVLHGKKRKRTRIGRWIMTNRELVVDILSRQIQDGTFR ISGYKERIVTDGPKDRKVQAIPIIERIGVNAIMSVVEQRVFRKYIRTTSASIKNRGTHDL LNYIRRDIHQYPEEMRYVYKFDITKFYESIWQDFMMYCLRRMFKDKILLTILERFVRMMP EGLSIGLRSSQGFGNMLLSMFLDHYLKDEKGLKHFYRYCDDGTSHAKTKKECWMIRNYVH QQVELMHLKVKSNERVFPISEGIDFLGYVIYPTHTRLRKRNKKNFARKIHHVKSIKRKRE LIASFYGLCKHADCRNLFYRLTGIRMKDFKDLGIKPKYADGKKRFKGNQVSIRDLVNTPI VVLDFETGVIPKFEKEEYDSKITKARNEYERLKNKYHDNIPEDIDFINPDEIPQPEGRYV VRILHEKEEKKFFTASKDIWSVLDQIKEQGELPFRTVIKAERYGNGGTKYIFT >SRS065504.56291-T1-C MVGYLVKRINNLYDNMISYSNILFVFNKIKNKCHNKNKLMKFIRYKNCYLIDILEKLKSN TYNFSNYHIFLIHEKKYRIIMNEDISDKIVNQLVSYFILIPSLKCLIDSNIATRKGKGSS YGYKLINSYINTIGTNKDIYVLRIDIKKYFYNINHDILIGMLERRIKDVKAINILRDIIT LTDYDYINNNIKKLIYKEINRVNSLNISDREKEILITELNSIPLYRKGYGLSIGCLTNQL MAIFYLNDIDHYIKEILKCKYYIRYMDDLYILSDNKYHLNKIYCDIKNKLKEIDLDINTK SGIYRLKEGVSFLGYTYNIKNDRLLIRYDNNTIRRIDKKLKYLYDNDFNSYYTSILSYKG FFIRCNTKLYFDKYSKLCINNIYDKYIIIKNKYIEYVIFIKVKNRYYTYDKEDKGKYIIL KGNIMEVKCEK >SRS065504.273324-T1-C MQKTFEEICTFEVLYRAYLAARKGKRKKVSAAQYEANALMLTERLAYILNTNTYQPGKFE TFFVYEPKKRLVQAPAFVDKVVQHALVDNTLYEAITHSFIPANCASQIDKGMHYGLDLLK GFMTDYWHKNKTTDGWVLKCDVRHFFASINHDLLKQKLRKKVVDDRIFRLMCIYIDTSAD GLPLGYQTSQLLALLFLDEFDHFVKERLRIKYYVRYMDDFLLIHPDKEYLRYCQKEIETF LAGLHLELNEKTNIFPLRHGVDFLGFHTYITESGQIIRKLRHSSVKRMKAKIRGWKKDYP AGKVTKKEILDSWTAWDAHAAHGNTYTLRREIAAQVSEIIGTQLRCHAPIRLSKTQKAML EYKKRRKASQKAALEAAPPHHDGDPPW >SRS077730.2141-T1-C MQFVTTDCYTRQIFWKEMTDTNTHFEDGCHVVYNLNSLYHAYLAAKKDSDWKPQVQKYEM NFLPHIVKSKAALKDRAYQSKASTEFIISERGKTRPITGLQMSDRVIRHSLCDNILAPSL LSYLIYDNGASLKGKGITFSRQRFEEHLHKYYRLYGNDGYILLMDYSKFYDNIPHDIAYQ EIAKHVHDEFSLWLLDLIFDNFKIDVSYMTDEEYEHCMETPFNSADYRLNVPKSAHTGEK FMRKSVNIGDQCSQIIGIYYPTQIDNFVKIVEGQKFYGRYMDDSYVISQSKEYLIELEAK IAAKATELGMTLNRKKTRIVKLSDYYRFLQISYSLTDTGRVIRKINPQRIVDMRRKLKAL SRKVQRGERTQEEIENMFRSWIGGHAYLMSKIQRDNLNLLYYDLFGGFIDGKYYLQYYPR KRNRDSRYTEWQQLHHR >SRS077730.157923-T1-C MDREIVTDYGNLYYAYRKAKSGKKFNSSTARFSNVALDGINILKEQLENQTYTVAPYNRF EIYEPKQRVIESCSFKDKVVQHILCDNILHPKLKNVFIKYNSAGQIGKGTLYALDGLRDH MESFYQRHGVDGWILKCDIRHFFYEIDHEILKDIVDYFFPDPYTTWLNHTLIDSSKNPGL PLGNQAGQVYALLMVHAVDCMATGELGITEYGRYMDDFYLIHQDKEYLKWCLECIKEMLK TLGLELNGKTQIIPFRKGMRYLGFHHYMTADGKYIRKLTGENKRKNKKKFRKLVKDVKAW KLTEEKFYEKYNSWKNHALHGNCIKLVHSMDLYIEELMKEVT >SRS078176.76507-T1-C MTSPSSPEENFAGSATVKSSRCRFQLCLEEIVTLSALIEAYANASRGKHERLGVYRYADD LGENLTILRDRILSGEYKPQPCHEFDIYCAAGQKVRRIAAPAFEDTIVQHLLYEALYDSF DRGFIFDSYGCRRGKGTHRAADRVQEFMRRADTGAYTLQIDIRKYYYRINHAVLRESIER TVADPRIVDLVMLFAGDGEVGLNVGSLLSQLFGMIYLDRFDHYVKRILKIKSYVRYVDDM VFVVRDKAEANRILAEVQAFLADRLCLELSKWRIQPLSKGVNFAGFRTWTDYRLIRKRSL HNFGRKLAKKDIESVSAILAHAMRSSSYRHLLNRVLDAWEPEEIRQLCDRQRADLRKILF PQDEAPVFP