# Balhimycin baseline
nrpspksdomains_bpsA_AMP-binding.1	FASHVDRTPDAVAVAGGDGVLTYRQLDERAGRLAGRLASRGIRRGDRVAVVMDRSADLVVALLAVWKAGAAYVPVDAGYPAPRVAFMVADSAAKLVVCSAASRGAVPAGVESLEPAAAAEEGASDAPAATVRPGDPAYVMYTSGSTGTPKGVTISQGCVAELTMDAGWAMEPGEAVLMHSPHAFDASLFELWMPLASGVRVVLAEPGSVDARRLREAAAAGVTRVYLTAGSLRAVAEEAPESFAEFREVLTGGDVVPAHAVERVRTAAPRARFRNMYGPTEATMCATWHLLQPGDVVGPVVPIGRPLTGRRVQVLDASLRPVGPGVVGDLYLSGALAEGYFNRAALTAERFVADPSAPGQRMYWTGDLAQWTADGELVFAGRADDQVKIR	LDASFDASLFEMYLLTGGDRNMYGPTEATMCATW	DAFYLGMMCK
nrpspksdomains_bpsA_AMP-binding.2	RMAARPPDNVAVVDGDGALSYADLWERSLKFAAVLRAHGVRSEDRVGLVVGRSAWWTVGMLGVLLAGGTFVPVDPAYPAERKEWIFRSANPMLVVCAGATRGAVPAEFADRLVVIDEVDPAAGSAGDLPRVDPRSAAYVIYTSGSTGTPKGVVVTHAGLGNLALAHIDRFGVSPSSRVLQFAALGFDTIVSEVMMALLSGATLVVPPERDLPPRASFTDALERWDITHVKAPPSVLGTADVLPSTVETVVAAGELCPPGLVDRLSADRRMINAYGPTETTICATMSMPLSPGQHPIPFGKPVPGVRGYLLDSFLRPLPPGVTGELYLAGIGVARGYLGRSALTAERFVADPFVPGERMYRTGDLAYWTEQGELVSAGRADDQVKIR	LAHAFDTIVSEMKAVAAGEINAYGPTETTICATM	DTSKVAAICK
nrpspksdomains_bpsA_AMP-binding.3	FAAQAAATPEATAVVFEDRSLTYAELDARANQLARWLIDQGAGPEGLVAVLLPRSLELVVALLAVTKTGGAWLPIDPGYPADRIAFMLDDAGPALVITTAVLSASPIGDVLAARSRTVVLDEPAAAGQLAGRDRAPVTDTDRARALDPRHPAYLIYTSGSTGRPKAVVVTHRNLTNYLLHCGRMYPGLRGRSVLHSSIAFDLTVTATFTPLIVGGEIHVGALEDLIGVVEAAPSIFLKATPSHLLTLDTASRGSAGSGDLLLGGEQLPADTVVQWRRKYPNIVVVNEYGPTEATVGCVEYRLEPGQECPPGGVVPIGTPLANMRAFVLDSWLRLVPPGAVGELYVAGAGLARGYLGRAGLTATRFVADPFGSGERMYRTGDLVQWNPDGQLVFAGRVDDQVKVR	YHCSFDLTVTATKALLGGEVNEYGPTEATVGCVE	DLTKLGEVGK
nrpspksdomains_bpsB_AMP-binding.1	FGHQVAHRRDEPAVVDGDRTVSYGELAERAERLAGYLNGRGVRRGDRVAVVLDRSPDLIATLLAVWKAGAAYVPVDPAYPVERRKFMLADSGPAAVVCAEAYRAAVPDTCPEPIVLDDPRTRQAVAESPRLSAGTSADDLAYVMYTSGSTGTPKGVAVSHGNVAALAGEPGWRVGPGDAVLLHASHAFDISLFEMWVPLLSGARVVLAGPGAVDGAALAAYVAGGVTAAHLTAGAFRVLADESPEAVAGLREVLTGGDAVPLAAVERVRGRVRNVRVRHLYGPTEATLCATWWLLEPGDETGSVLPIGRPLAGRRVHVLDAFLRPVPPGVAGELYVAGAGVAQGYSSRPALTAERFVADPSGSGARMYRTGDLAYWTEQGALAFAGRADDQVKIR	LEPAFDISLFEVHLLTGGDRHLYGPTEATLCATW	DIFHLGLLCK
nrpspksdomains_bpsB_AMP-binding.2	FRRQAGASPDAVAVVAGERTLSYADLDRESDRLAGHLAGIGVGRGDRVGVVMTRGADLFVALLGVWKAGAAQVPVNVDYPAERIERMLADVGASVAVCVEATRKAVPDGVEPVVVDLPVIGGVRPEAPPVTVGAHDVAYVMYTSGSTGVPKAVAVPHGSVAALASDPGWSQGPGDCVLLHASHAFDASLVEIWVPLVSGARVLVAEPGTVDAERLREAVSRGVTTVHLTAGAFRAVAEESPDSFIGLREILTGGDAVPLASVVRMRQACPDVRVRQLYGPTEITLCATWLVLEPGAATGDVLPIGRPLAGRQAYVLDAFLQPVAPNVTGELYLAGAGLAHGYLGNTAATSERFVANPFSGGGRMYRTGDLARWTDQGELVFAGRADSQVKIR	LDPAFDASLVEVHLLTGGDRQLYGPTEITLCATW	DAVHLGLLCK
nrpspksdomains_bpsB_AMP-binding.3	FRRQAERSPDAVAVEDGARSLTFGGLLGEVEALARLLVGAGVRREHRVGVLVERSAELAVTMMAVSFAGGVFVPVDPDYPRERVEFMLANSAPGVMVCTKTTRAAVPAEFAGTVLVLDELPAADPDVELPPVAPEDAAYVIYTSGSTGVPKGVLVTHSGLANLGYAHIERMAVTSSSRVLQLSATGFDAIVSELYMALLAGATLVLPDAASMPPRVTLGEAIRRAGITHLTVSPSVLASEDDLPDTLRTVLTGGEALPPALVDRWSPGRRVIQAYGPTETTICSTMSAPLSPGHDQVPLGGPIHNVRHYVLDAFLQPVPPGVVGELYITGVGLARGYLGRPGLTAERFVASPFAPGERMYRSGDLFRWTREGQLLFAGRVDAQVKVR	LAHSFDAIVSEMTVLTGGEIQAYGPTETTICSTM	DASTLGAICK
nrpspksdomains_bpsC_AMP-binding.1	FARRVAAAPDAVAITGAAGANLTYAEVDQASNRLAGYLAVRGVGRGDRVGVAMERSPDLLIAFLAIWKAGAAYVPVDVEYPAERISFVFDDSGVSTVLCTLATSAVAPGNAIVLDAPETRVAVRDCAAPEIRPHADDLAYVMYTSGSTGLPKGVAIPHGAVAGLAGDAGWQIGPGDGVLMHATHVFDPSLYAMWVPLVSGARVLLTEPGVLDAAGVRQAVHRGATFVHLTAGTFRALAETAPECFEGLVEIGTGGDVVPLQSVENLRRAQPGLRVRNTYGPTETTLCATWLPIEPGEVLGRELPIGHPMTNRRIYLLDAFLRPVPPGVAGELYIAGTGLAHGYLKSPGLTAGRFVACPFAAGERMYRTGDRARWTRDGEVVFLGRADDQVKIR	LDAAFDPSLYAVHLGTGGDRNTYGPTETTLCATW	DPYHGGTLCK
nrpspksdomains_bpsD_AMP-binding.1	FEAAAAAVPDRPAVAMGTTTLTYAELNTQANRLARRLVAHGVGPERLVALAMPRSIEFAVAMLAVHKAGGAYVPIDPDYPAERRQHMLAGAAAQCLLCLPGQDVAGAPVVLSVALAEPGRPEPDLDDSDRLAPLLPSHPAYVIFTSGSTGQPKGVVVTHRGIPNLAADYVHRQNLLPDSRLLAFASPSFDAAVAEFWPIWLAGACLVLAPAPDLIPGEPLARLVRDRHITHVTLPPSALAPLEEAGGLPPGLTLLVAGEAGPAPVAKRWAAGRVMINAYGPTEATVAVTASDPLTGEDTPPIGRPITGVHTYVLDDRLVPVPDGTVGELYMTGPGLARGYLHRPAATAERFLPDPFGGPGQRMYRTGDRVRARPDGQLVFVGRADDQLKVR	LDYAFDAAVAEPTLLVAGEINAYGPTEATVAVTA	DAATLAAVAK
nrpspksdomains_bpsD_AMP-binding.1	FEAAAAAVPDRPAVAMGTTTLTYAELNTQANRLARRLVAHGVGPERLVALAMPRSIEFAVAMLAVHKAGGAYVPIDPDYPAERRQHMLAGAAAQCLLCLPGQDVAGAPVVLSVALAEPGRPEPDLDDSDRLAPLLPSHPAYVIFTSGSTGQPKGVVVTHRGIPNLAADYVHRQNLLPDSRLLAFASPSFDAAVAEFWPIWLAGACLVLAPAPDLIPGEPLARLVRDRHITHVTLPPSALAPLEEAGGLPPGLTLLVAGEAGPAPVAKRWAAGRVMINAYGPTEATVAVTASDPLTGEDTPPIGRPITGVHTYVLDDRLVPVPDGTVGELYMTGPGLARGYLHRPAATAERFLPDPFGGPGQRMYRTGDRVRARPDGQLVFVGRADDQLKVR	LDYAFDAAVAEPTLLVAGEINAYGPTEATVAVTA	DAATLAAVAK

# some bad domains from NZ_CM000745.1
nrpspksdomains_BPMYX0001_RS25360_AMP-binding.1	YGGRLVLVPSFVARDPEAFAELLLKEKVTILNQTPTAFYQLSHQMKKMKSAEMPIRKVIFGGEKLSPLQLKDWKVAYPSTQLINMYGITETTVHVTYKEITEKEIETNISNIGKPIPTLQVYVLDGQKNLLPVGIGGEMYVAGEGVARGYWNRPELTEERFVENPFQPGERMYKTGDLARWLEDGNLEYLGRMDDQVKIR	------------NQIFGGEINMYGITETTVHVTY	---NIGMVHK
nrpspksdomains_BPMYX0001_RS27050_AMP-binding.1	NKVKKAPDKVALQYKEGSMSYSELHRKSNQVARHLQKQGVGREQIVGLLVDRSPEMIISILGILKAGAAYLPIDPDYPEERIRYMLEDSGAEKLIVQYAEDVPAHYSGQVLGLWEREWEKEDHADLEAEAGPDDLAYIIYTSGSTGQPKGVMVEHRNVVRLLFNDQNLFDFSEKDVWTLFHSYCFDFSVWEMYGALLYGGRLVLVPSFVARDPEVFAELLLKEKVTILNQTPTAFYHLSHQM	LNDHFDFSVWEGNQ--------------------	DFWN-----K
#nrpspksdomains_BPMYX0001_RS27585_AMP-binding.1	FEEQVQRTPELVAVVFGDEQLTYRELNSRANQLARLLRERGTVREELVGIMVDRSLEMIVGILGVLKAGGAYVPLDPSYPEERIRYILEDSGVQRLLTQSSVKIMEGFTGETILIDQPHVYNGADENLEHINQAEDLAYVIYTSGSTGKPKGVLIEHRGVSNLQLMAETYGIREGSRVLQFA	RQLDLEMIVG-GLTYNGADI-------YTSGSTG	EMGLYG-SGK

# bad domain from NZ_JNXE01000007.1
nrpspksdomains_OO69_RS12380_AMP-binding.3	FAESAARDPRAVAVRDRDGHLTRRSLDRAAGWLAARLDRLGAGAGGVVVVAARPGTGWVVAVLGVLRSGAACLIADPDDPAGPLAAAAALGCECPAAFVRSGDTAVPGAGTVPVLRVEDAIRAGNSRLRRTAAVREPSPSQPAVVLPVEAPDGTLVGEMLDHRSLLGRVIDRQAASPAAGPADGAVRVDAGAPVAELLVGVLAALRGDGTVDVVPTAERPGRETARAVVPLPRTARAYVLDAWLRPVPAGVAGDLYVAGPSLARAYAGAPARTAGSFVPDPFGPGGDLMLRTGRRAAWTSDGLV	AWLP-TGWVVAGAGAAVRE--------TLVGEML	TGVAAV-VGK

# bad domains from GCF_002210435.1
nrpspksdomains_ctg3_38_AMP-binding.1	FDVDVDGIPAWTPLSLATLTERVAARADWFRRRGVGWRDPVAVYVTSAPDVFLNFFALNRLGAIPALMNGNMPIELAAEFIRRLRGVGVVVDADHAALREHDLGVPILGDAAETGTGDPDQAPPHYRHHPEDPVAITHSSGTTRVPAAIVHSHHGLFAAIRAVRLTESRPYGEVRELSVLPAAHAAGIITVNQALCNSYQLLYLSAQGGPFASSAETILDAIERWRPTGVFGFAVTWSELARHDLTTRDLGSVRNWFNTGDCAHESHVRRLVAVGSHPAYTRQGMVDVPGSKFVDMLGSTEMGHGAFRISHRLGSDNYDRCVGKPYPFAQIALLDVTTGEEVPEGQVGHVGLKSPTLALGYWNDSVNTFRTRLGGYYLTGDLMYRDPDGNYHHVDRASDAVD	AAVLHAAGIITQFGFNTGD-----------AYTR	AAIFFT--AK
nrpspksdomains_ctg9_85_AMP-binding.1	FRKRVAATPDQRAFAHPAADDSGPVWLTWAQVGRRADAVAAGLHGLGVGLEDPVAILANTRLDWIIADFGIMCAGGATTTVYPTTEPQDATYIIADSGSRVLFAENPAQAAKVAGAQLPALTHVVLFDGTPDPTAAVPQLTLAELEEQGARALAADPGLVDRLIAPVGPEHLATLIYTSGTTGRPKGVELLHGGWCWEGVVQAETGLLRDDDVQYLWLPLSHSFGKTLLCGSTHVGLPTYVDGRVDKLVDLLGVIRPTLMCGAPRVYEKVYNKAVTTAQDAGGAKAKIFAWGVEVGKQKVALEQAGRPVPGLLKAKYAVAEKLVFSKLQARLGGRIRVLVSGAAPLSAEIATFFAAANLPISEGYGLTETSAGNFVNPPDGLRIGTVGRAMGDLECRIDTDGEILVRGRPVMRGYHNLPEETAAAFTEDGFFRTGDIGSLDDDGYLRITDRKKDLVKT	GGLALDWIIADGFA------------------TA	DWAF-----K
nrpspksdomains_ctg23_96_AMP-binding.1	FAARVADDPDAPALVVGDRTVGYAELDRAADRLAGTLRGHGVTGENPVAVLLPRGEPLIRALLGVLKAGGAYLPVDPTTPPARIAGLLAASGTRLLLTTPATAAGLPDLGPVRVLDVDTPAGDPVPAGPPPRIHPLQLACVLHTSGSTGAPKRVGITHRGVVRLVTGPDFATLGPGERLLQFAPTAFDASTWEIWGALLTGAAAVVAPPDPVDLTGLTALIRDGGVTVAFLTAGLFRQLAGTDVTALAGVRQLLTGGDVADPGTVRAVLAARGGLPLVNAYGPTEN	LGPAFDASTWEGFLLTGGDVNAYGPTEN------	DAWFLGA--K
