Tan0011111 (gene) Snake gourd v1

Overview
Name: Tan0011111
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Chlorophyllase
Location: LG07: 74308151 .. 74311854 (-)
RNA-Seq Expression: Tan0011111
Synteny: Tan0011111
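
As a quick sanity check on the Location field, the span of the gene can be derived from its coordinates. The sketch below assumes the coordinates are 1-based and inclusive (a common convention for GFF-style annotation, though this record does not state it); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

chrom, start, end, strand = parse_location("LG07: 74308151 .. 74311854 (-)")
print(chrom, strand, span_length(start, end))  # LG07 - 3704
```

Under that assumption the gene spans 3,704 bp on the minus strand of LG07.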
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGTGAATTAGAGTTAGAGAAGGTAAAAAGAGAAGTAGGAGATTTTCTTCATGCAAACGTAACGAGAGGGAATTGGAAGGCGGTGGCGGAGGAATATGAAAAAGCCTCAAACGTAGCTCAGTCACTGAAGCTGCTTCGAGAGGAAAACACAGCGCTGCATTTGGCTGTGATTGATAATCAAGAAGAAATAGTTGAAAAGCTTGTCAAATTCATTTGCAGATCCAAAGATGATGATGATGATTACAAGAAACTTCTTGAGACTACAAATAATATGTTAAACAACCCTCTCCACCTTGCCGCCATAATGGGAAGCGTAAGAATGTGCCGAGCCATTGCTTCAGCACATGACGAGTTGGTGAATAAGAGAAACAAACTCGATCAAACGCCTCTCTTCTTGGCGGCTTTGCATGGAAACAAGGACGCCTTTTATTGCCTTTACTACTTTTCCAGAAACTTTTCCTCTCATCAAATTTCTTCCAACTGCAGAGCCAACGGAGACACCGTCCTACATTGCGCCCTCAGAAACGAGCAATTTGGTCTGTTTTTTTTTTTTTTTTTTTCTTTTAACATTTATATATAAAACTTGTTATATTTTAACCAATCTTTTAAAGTGATTATAAGAAGGTGGGGTTTGAGATGACAGATTTGGCATTTCAATTTATTCACCTAAAGGAGGAGGCTATGAACTGGGTGAATGAACAAGGCTCAACCGCTCTTCATGTTTTAGCAAGTAAGCCAACTTCCTTCAAAAGTGGAAGCAACATCAACGGATGGCAGAACATCATCTATTACTGTGAGTATCCATTTTAGTTAATTAAATAATTATTTGTCTTTTCTGTCTTCCCACCAAAACTTTTAATTTCAAATTCCAACCTTTTTTCCCCCTTAAACTTGGTTTGTTGGAAGTGATATTTGTGGATCAACTAAAGCCTCGATCAATAAAAAGCCTTAGCAAAGCGTGCAAGAAACCAAGCACAACTGCTTCGACCTATTTTCCAGTTAAGTACGGGACATGCATCGACTTCTTTACGAGGTTGTGGGATCTGTTTTTAAAAGGTTCGTACAAAATAAATTTCTTTCACTATTTTTTCTCTCTCTCTCTGTATATGTATAATTTATAATAACCCTCTAAATAAGCCTAAATCAGCACATCCGAGCATAGTTCATGAATAAGGTATATGAATTATCATATAAGGATGGTCTAATTTCTTTTTCTTTCTTTATGCAAATATTACAGTCAGCAACTTGAAACGATTAGTCGAGAAGAAGAAAAATGATGTAGTGGAGAATGATACAGATTTCGAGGAAGTTGTTAAAGTGGAAATTGAAAATGAGCTAAGTACGATGTGTTTATATATTTTCAATTGAAATTAAATTAAACCTCAAATTATAATATGATATACTCATTCTTGTTTCATGACAGCAAACGAGGCTTCTTCGATTACAAGTTTTCCGAAAAACTATATTATTAACTTCAACAAGTTTTATCGAATTGTTTTCTCGCCCATCTTGATCATTCTTGGGTTCGGTACGTGAAGCTACAGTAATATAATAGATATATATATATATATATATATATATATATATATATATATATAAAAATATGTATTAATTGTTTTTTGAATAACTTAACTTACAATTACTACAAAGTGGGTTTTGTCCAAAATTAAGGATCTGCTGAAATCAGAAAGATACGAGAGAAGAAAGAGAAACACACTTGGTCAGTTCAAGTGATGGAGAAACTTCTTGAAGCTGTTAAACCCCATAAATATGGTGACGACGGAAGAACTCCCATGAATCCAACATTTAAAACAGACAAAGAAGAAACACAACCTTACTCCGTTGTAGACGATAAAGTCAATTTCAGTCCTAACTACGATCATGAACTACTAGAGAACTCAAAGAATGCTGAAGATGTCGTCAAAGGTTATTTATTTATTTTTTTTAAGGGTAAGCATGGACATAATATATATTTAATTATAATTTGTTGAAATGAAC
AGAGTCTGCAGAGTCAGAGTCAGAGTCAGCGATGTTATTAGCAACAAAGAATGGCGTGATTGAGATTGTGAGGGGAATGTACAAACGTTTCCCTCTGGCAATTCGTGATAGTAGAAAAGATAAGAAGAATGTGGTTCTTTTGGCTGCGGAGTACAGGCAGCCAGACGTGTACAGGTTTTTACTGAAGAAAAAAAAAGAGATAAAAAGCCAGTTTCGAGCGGTGGATGATGAGGGAAACAGCGGCTTGCATCTCGCAGCAACCGCCATAAATCCTGAGCTTTGGCGCATCAGTGGAGTTGCATTACAAATGCAATGGGAAGTTAAGTGGTACAAGGTAAATTAATTAATTTCCTGATTTTGAATTTTGAATTTTGAATTTTGAATTTTGAATTTTTGAATTTTGAATTTTGTGCAGTACGTGAAGAAATCTATGCCACTCCATTTCTTTGCCCACTATAACAATTCAGGAAAAACTGCAACCACAATTTTCCATGAAACCCACAAGGATTTGATGAAAGAAAGCGGAGAGTGGCTTACTAAAACCGCAAAGTCATGCTCTGTGGTGGGTACCTTGATTGTAACAGTGGCTTTTACTTCTGCTGTCAGCATTCCAGGTGGGTTTGACAACACAGACGGCGCACCATTACTTGAAAAACAGCCAGCTTTTTTAATCTTCGCCGTCTTTTCCCTCATTGCCCTCTGCCTTTCTTCAACCTCAGTCATCATGTTTCTTTCCATCTTGACCTACAGGTTTGATGCCCATGACTTCAAATCAAACTTGCCTTGGAAACTCTTCCTCGGCTTTTCCTCACTTTTCTTTTCCATCATCTCCATGCTGGTTTCATTTTGTTCCGGCCACTACTTTCTCATCGATCACCGCCTTCAAAACGTCGCCGTTCTTCTCTATACACTTGTTTTTCTCCCCGTCGTCCTTTTCTTCTTTCTATCCAAGCTTCCTCTCTACATAGATGTCTTACAGACTATTTTTAAAATAATGCCTAGAAGGAGTTCCCATGTCCTCCTACCAGCTGATCCCCTCCCTGCCCAAAATCCTTCTAAACTTTTCAAAAAAGGAAAATTTGAGGTCACTTCCATCCCTCTTAACCTAACTTCAATTTCAAACCCATTGTTCATCTTCACACCCACCACCCCAGATTCATATCCTCTCATCTTTTTTCTTCCTGGCTGCATCCAATCCGACTATGCCCATTTCCTCCACCTCATAGCTTCACACGGCTTCCTTATACTTGCCCCACAGCTGGTAACTTCTTTATAACAAATATATATATATATAATTGAACATTTTTTTTCTTTTTTAAATTAACTAACTCATCTCCATGTTGAATAATTATATGATTATAGTTCGACGTGAAGTCAACAACATGTAAAATGGACGAAACAGATCAGTTAACAACACAAGTTAAATCTGACCGAGAAGGGGTGGAAGACAAGCTATCAAAACTGTCTGAGGTGAAAGGAGGGAAACCAAAAGTGTCGTTATCTTTAGGCCACCACAACAACCCTTCGAATCCATTTTCAGCAGTGTTTGGCTTCCACCCAGCGCCTGGAACCAAATTCAGCATCCCAGAGTCTCAAATTCAGACCTACCTCCACCCTCAATCCTCCAACATATCCTCACCAATTGTTGAAAGTCAATTTATAATTTCCAAGTTATGCGCAACTGTAAAATTGTCTGTTTCTTGA

mRNA sequence

ATGTGTGAATTAGAGTTAGAGAAGGTAAAAAGAGAAGTAGGAGATTTTCTTCATGCAAACGTAACGAGAGGGAATTGGAAGGCGGTGGCGGAGGAATATGAAAAAGCCTCAAACGTAGCTCAGTCACTGAAGCTGCTTCGAGAGGAAAACACAGCGCTGCATTTGGCTGTGATTGATAATCAAGAAGAAATAGTTGAAAAGCTTGTCAAATTCATTTGCAGATCCAAAGATGATGATGATGATTACAAGAAACTTCTTGAGACTACAAATAATATGTTAAACAACCCTCTCCACCTTGCCGCCATAATGGGAAGCGTAAGAATGTGCCGAGCCATTGCTTCAGCACATGACGAGTTGGTGAATAAGAGAAACAAACTCGATCAAACGCCTCTCTTCTTGGCGGCTTTGCATGGAAACAAGGACGCCTTTTATTGCCTTTACTACTTTTCCAGAAACTTTTCCTCTCATCAAATTTCTTCCAACTGCAGAGCCAACGGAGACACCGTCCTACATTGCGCCCTCAGAAACGAGCAATTTGATTTGGCATTTCAATTTATTCACCTAAAGGAGGAGGCTATGAACTGGGTGAATGAACAAGGCTCAACCGCTCTTCATGTTTTAGCAAGTAAGCCAACTTCCTTCAAAAGTGGAAGCAACATCAACGGATGGCAGAACATCATCTATTACTTGATATTTGTGGATCAACTAAAGCCTCGATCAATAAAAAGCCTTAGCAAAGCGTGCAAGAAACCAAGCACAACTGCTTCGACCTATTTTCCAGTTAAGTACGGGACATGCATCGACTTCTTTACGAGGTTGTGGGATCTGTTTTTAAAAGGATCTGCTGAAATCAGAAAGATACGAGAGAAGAAAGAGAAACACACTTGGTCAGTTCAAGTGATGGAGAAACTTCTTGAAGCTGTTAAACCCCATAAATATGGTGACGACGGAAGAACTCCCATGAATCCAACATTTAAAACAGACAAAGAAGAAACACAACCTTACTCCGTTGTAGACGATAAAGTCAATTTCAGTCCTAACTACGATCATGAACTACTAGAGAACTCAAAGAATGCTGAAGATGTCTCAGAGTCAGCGATGTTATTAGCAACAAAGAATGGCGTGATTGAGATTGTGAGGGGAATGTACAAACGTTTCCCTCTGGCAATTCGTGATAGTAGAAAAGATAAGAAGAATGTGGTTCTTTTGGCTGCGGAGTACAGGCAGCCAGACGTGTACAGGTTTTTACTGAAGAAAAAAAAAGAGATAAAAAGCCAGTTTCGAGCGGTGGATGATGAGGGAAACAGCGGCTTGCATCTCGCAGCAACCGCCATAAATCCTGAGCTTTGGCGCATCAGTGGAGTTGCATTACAAATGCAATGGGAAGTTAAGTGGTACAAGTACGTGAAGAAATCTATGCCACTCCATTTCTTTGCCCACTATAACAATTCAGGAAAAACTGCAACCACAATTTTCCATGAAACCCACAAGGATTTGATGAAAGAAAGCGGAGAGTGGCTTACTAAAACCGCAAAGTCATGCTCTGTGGTGGGTACCTTGATTGTAACAGTGGCTTTTACTTCTGCTGTCAGCATTCCAGGTGGGTTTGACAACACAGACGGCGCACCATTACTTGAAAAACAGCCAGCTTTTTTAATCTTCGCCGTCTTTTCCCTCATTGCCCTCTGCCTTTCTTCAACCTCAGTCATCATGTTTCTTTCCATCTTGACCTACAGGTTTGATGCCCATGACTTCAAATCAAACTTGCCTTGGAAACTCTTCCTCGGCTTTTCCTCACTTTTCTTTTCCATCATCTCCATGCTGGTTTCATTTTGTTCCGGCCACTACTTTCTCATCGATCACCGCCTTCAAAACGTCGCCGTTCTTCTCTATACACTTGTTTTTCTCCCCGTCGTCCTTTTCTTCTTTCTATCCAAGCTTCCTCTCTACATAGATGTCTTACAGACTATTTTTAAAATAATGCCTAGAAGGAGTTCCCA
TGTCCTCCTACCAGCTGATCCCCTCCCTGCCCAAAATCCTTCTAAACTTTTCAAAAAAGGAAAATTTGAGGTCACTTCCATCCCTCTTAACCTAACTTCAATTTCAAACCCATTGTTCATCTTCACACCCACCACCCCAGATTCATATCCTCTCATCTTTTTTCTTCCTGGCTGCATCCAATCCGACTATGCCCATTTCCTCCACCTCATAGCTTCACACGGCTTCCTTATACTTGCCCCACAGCTGTTCGACGTGAAGTCAACAACATGTAAAATGGACGAAACAGATCAGTTAACAACACAAGTTAAATCTGACCGAGAAGGGGTGGAAGACAAGCTATCAAAACTGTCTGAGGTGAAAGGAGGGAAACCAAAAGTGTCGTTATCTTTAGGCCACCACAACAACCCTTCGAATCCATTTTCAGCAGTGTTTGGCTTCCACCCAGCGCCTGGAACCAAATTCAGCATCCCAGAGTCTCAAATTCAGACCTACCTCCACCCTCAATCCTCCAACATATCCTCACCAATTGTTGAAAGTCAATTTATAATTTCCAAGTTATGCGCAACTGTAAAATTGTCTGTTTCTTGA

Coding sequence (CDS)

ATGTGTGAATTAGAGTTAGAGAAGGTAAAAAGAGAAGTAGGAGATTTTCTTCATGCAAACGTAACGAGAGGGAATTGGAAGGCGGTGGCGGAGGAATATGAAAAAGCCTCAAACGTAGCTCAGTCACTGAAGCTGCTTCGAGAGGAAAACACAGCGCTGCATTTGGCTGTGATTGATAATCAAGAAGAAATAGTTGAAAAGCTTGTCAAATTCATTTGCAGATCCAAAGATGATGATGATGATTACAAGAAACTTCTTGAGACTACAAATAATATGTTAAACAACCCTCTCCACCTTGCCGCCATAATGGGAAGCGTAAGAATGTGCCGAGCCATTGCTTCAGCACATGACGAGTTGGTGAATAAGAGAAACAAACTCGATCAAACGCCTCTCTTCTTGGCGGCTTTGCATGGAAACAAGGACGCCTTTTATTGCCTTTACTACTTTTCCAGAAACTTTTCCTCTCATCAAATTTCTTCCAACTGCAGAGCCAACGGAGACACCGTCCTACATTGCGCCCTCAGAAACGAGCAATTTGATTTGGCATTTCAATTTATTCACCTAAAGGAGGAGGCTATGAACTGGGTGAATGAACAAGGCTCAACCGCTCTTCATGTTTTAGCAAGTAAGCCAACTTCCTTCAAAAGTGGAAGCAACATCAACGGATGGCAGAACATCATCTATTACTTGATATTTGTGGATCAACTAAAGCCTCGATCAATAAAAAGCCTTAGCAAAGCGTGCAAGAAACCAAGCACAACTGCTTCGACCTATTTTCCAGTTAAGTACGGGACATGCATCGACTTCTTTACGAGGTTGTGGGATCTGTTTTTAAAAGGATCTGCTGAAATCAGAAAGATACGAGAGAAGAAAGAGAAACACACTTGGTCAGTTCAAGTGATGGAGAAACTTCTTGAAGCTGTTAAACCCCATAAATATGGTGACGACGGAAGAACTCCCATGAATCCAACATTTAAAACAGACAAAGAAGAAACACAACCTTACTCCGTTGTAGACGATAAAGTCAATTTCAGTCCTAACTACGATCATGAACTACTAGAGAACTCAAAGAATGCTGAAGATGTCTCAGAGTCAGCGATGTTATTAGCAACAAAGAATGGCGTGATTGAGATTGTGAGGGGAATGTACAAACGTTTCCCTCTGGCAATTCGTGATAGTAGAAAAGATAAGAAGAATGTGGTTCTTTTGGCTGCGGAGTACAGGCAGCCAGACGTGTACAGGTTTTTACTGAAGAAAAAAAAAGAGATAAAAAGCCAGTTTCGAGCGGTGGATGATGAGGGAAACAGCGGCTTGCATCTCGCAGCAACCGCCATAAATCCTGAGCTTTGGCGCATCAGTGGAGTTGCATTACAAATGCAATGGGAAGTTAAGTGGTACAAGTACGTGAAGAAATCTATGCCACTCCATTTCTTTGCCCACTATAACAATTCAGGAAAAACTGCAACCACAATTTTCCATGAAACCCACAAGGATTTGATGAAAGAAAGCGGAGAGTGGCTTACTAAAACCGCAAAGTCATGCTCTGTGGTGGGTACCTTGATTGTAACAGTGGCTTTTACTTCTGCTGTCAGCATTCCAGGTGGGTTTGACAACACAGACGGCGCACCATTACTTGAAAAACAGCCAGCTTTTTTAATCTTCGCCGTCTTTTCCCTCATTGCCCTCTGCCTTTCTTCAACCTCAGTCATCATGTTTCTTTCCATCTTGACCTACAGGTTTGATGCCCATGACTTCAAATCAAACTTGCCTTGGAAACTCTTCCTCGGCTTTTCCTCACTTTTCTTTTCCATCATCTCCATGCTGGTTTCATTTTGTTCCGGCCACTACTTTCTCATCGATCACCGCCTTCAAAACGTCGCCGTTCTTCTCTATACACTTGTTTTTCTCCCCGTCGTCCTTTTCTTCTTTCTATCCAAGCTTCCTCTCTACATAGATGTCTTACAGACTATTTTTAAAATAATGCCTAGAAGGAGTTCCCA
TGTCCTCCTACCAGCTGATCCCCTCCCTGCCCAAAATCCTTCTAAACTTTTCAAAAAAGGAAAATTTGAGGTCACTTCCATCCCTCTTAACCTAACTTCAATTTCAAACCCATTGTTCATCTTCACACCCACCACCCCAGATTCATATCCTCTCATCTTTTTTCTTCCTGGCTGCATCCAATCCGACTATGCCCATTTCCTCCACCTCATAGCTTCACACGGCTTCCTTATACTTGCCCCACAGCTGTTCGACGTGAAGTCAACAACATGTAAAATGGACGAAACAGATCAGTTAACAACACAAGTTAAATCTGACCGAGAAGGGGTGGAAGACAAGCTATCAAAACTGTCTGAGGTGAAAGGAGGGAAACCAAAAGTGTCGTTATCTTTAGGCCACCACAACAACCCTTCGAATCCATTTTCAGCAGTGTTTGGCTTCCACCCAGCGCCTGGAACCAAATTCAGCATCCCAGAGTCTCAAATTCAGACCTACCTCCACCCTCAATCCTCCAACATATCCTCACCAATTGTTGAAAGTCAATTTATAATTTCCAAGTTATGCGCAACTGTAAAATTGTCTGTTTCTTGA

Protein sequence

MCELELEKVKREVGDFLHANVTRGNWKAVAEEYEKASNVAQSLKLLREENTALHLAVIDNQEEIVEKLVKFICRSKDDDDDYKKLLETTNNMLNNPLHLAAIMGSVRMCRAIASAHDELVNKRNKLDQTPLFLAALHGNKDAFYCLYYFSRNFSSHQISSNCRANGDTVLHCALRNEQFDLAFQFIHLKEEAMNWVNEQGSTALHVLASKPTSFKSGSNINGWQNIIYYLIFVDQLKPRSIKSLSKACKKPSTTASTYFPVKYGTCIDFFTRLWDLFLKGSAEIRKIREKKEKHTWSVQVMEKLLEAVKPHKYGDDGRTPMNPTFKTDKEETQPYSVVDDKVNFSPNYDHELLENSKNAEDVSESAMLLATKNGVIEIVRGMYKRFPLAIRDSRKDKKNVVLLAAEYRQPDVYRFLLKKKKEIKSQFRAVDDEGNSGLHLAATAINPELWRISGVALQMQWEVKWYKYVKKSMPLHFFAHYNNSGKTATTIFHETHKDLMKESGEWLTKTAKSCSVVGTLIVTVAFTSAVSIPGGFDNTDGAPLLEKQPAFLIFAVFSLIALCLSSTSVIMFLSILTYRFDAHDFKSNLPWKLFLGFSSLFFSIISMLVSFCSGHYFLIDHRLQNVAVLLYTLVFLPVVLFFFLSKLPLYIDVLQTIFKIMPRRSSHVLLPADPLPAQNPSKLFKKGKFEVTSIPLNLTSISNPLFIFTPTTPDSYPLIFFLPGCIQSDYAHFLHLIASHGFLILAPQLFDVKSTTCKMDETDQLTTQVKSDREGVEDKLSKLSEVKGGKPKVSLSLGHHNNPSNPFSAVFGFHPAPGTKFSIPESQIQTYLHPQSSNISSPIVESQFIISKLCATVKLSVS
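
The protein sequence above appears to be the conceptual translation of the CDS. A minimal sketch of that step, assuming the standard nuclear genetic code (the record does not name the translation table) and using a hypothetical `translate` helper, applied here only to the first and last few codons of the CDS:

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip(("".join(c) for c in product(BASES, repeat=3)), AMINO_ACIDS))

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTGTGAATTAGAGTTAGAGAAG"))  # first 24 nt of the CDS -> MCELELEK
print(translate("AAATTGTCTGTTTCTTGA"))        # last 18 nt of the CDS  -> KLSVS
```

Both fragments agree with the start (MCELELEK...) and end (...KLSVS) of the protein sequence listed above.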
Homology
BLAST of Tan0011111 vs. ExPASy Swiss-Prot
Match: Q9LE89 (Chlorophyllase type 0 OS=Chenopodium album OX=3559 GN=CACLH PE=1 SV=1)

HSP 1 Score: 70.1 bits (170), Expect = 1.4e-10
Identity = 64/187 (34.22%), Positives = 93/187 (49.73%), Query Frame = 0

Query: 683 LFKKGKFEVTSIPLNLT----SISNPLFIFTPTTPDSYPLIFFLPGCIQS--DYAHFLHL 742
           +F KG F+VT+ P+ +     S   PL I +P     YP++ F+ G + S  DY+ F + 
Sbjct: 37  VFHKGNFQVTNNPIRVKRYEFSAPEPLIIISPKEAGVYPVLLFIHGTMLSNEDYSLFFNY 96

Query: 743 IASHGFLILAPQLFDV--KSTTCKMDETDQ---------LTTQVKSDR--EGVEDKLSKL 802
           IASHGF+++AP+LF +       + DE D          L  QV   R   GVE  L KL
Sbjct: 97  IASHGFIVVAPKLFRLFPPKLPSQQDEIDMAASVANWMPLYLQVVLQRYVTGVEGDLEKL 156

Query: 803 S---EVKGGKPKVSLSLGHHNNPSN-PFSAVFGFHPAPGTKF---SIPESQIQTYLHPQS 844
           +     +GGK   +L+LG  N   +  FSA+ G  P  G      ++P   + TY  P S
Sbjct: 157 AISGHSRGGKSAFALALGFSNIKLDVTFSALIGVDPVAGRSVDDRTLP--HVLTY-KPNS 216
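
The percentages in an HSP header are the match counts divided by the alignment length. The sketch below recomputes them for the header of the hit above; `check_hsp_header` is a hypothetical helper written for illustration, and the regular expression only assumes the `label = count/length (pct%)` layout seen in this record.

```python
import re

# Matches fields such as "Identity = 64/187 (34.22%)".
HSP_FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def check_hsp_header(line):
    """Recompute each 'count/length (pct%)' field and compare with the reported value."""
    results = {}
    for label, count, length, reported in HSP_FIELD.findall(line):
        recomputed = round(100.0 * int(count) / int(length), 2)
        results[label] = (recomputed, abs(recomputed - float(reported)) < 0.005)
    return results

header = "Identity = 64/187 (34.22%), Positives = 93/187 (49.73%), Query Frame = 0"
print(check_hsp_header(header))
# {'Identity': (34.22, True), 'Positives': (49.73, True)}
```

Roughly a third of the aligned residues are identical, which indicates only modest similarity between the C-terminal region of the query and this Swiss-Prot chlorophyllase.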

BLAST of Tan0011111 vs. ExPASy Swiss-Prot
Match: O22527 (Chlorophyllase-1 OS=Arabidopsis thaliana OX=3702 GN=CLH1 PE=1 SV=1)

HSP 1 Score: 55.8 bits (133), Expect = 2.7e-06
Identity = 52/181 (28.73%), Positives = 86/181 (47.51%), Query Frame = 0

Query: 684 FKKGKFEVTSIPL-----NLTSISNPLFIFTPTTPDSYPLIFFLPGCIQSD--YAHFLHL 743
           F+ G    T IP+     + T+   P+ I  PT   +YP++ F  G    +  Y+  L+ 
Sbjct: 19  FEIGSLPTTEIPVDPVENDSTAPPKPVRITCPTVAGTYPVVLFFHGFYLRNYFYSDVLNH 78

Query: 744 IASHGFLILAPQLFDVKSTTCKMDETDQLTTQVKSDREGVEDKL-----------SKLSE 803
           IASHG++++APQL  +     ++ E D   + +    E ++  L           S +  
Sbjct: 79  IASHGYILVAPQLCKLLPPGGQV-EVDDAGSVINWASENLKAHLPTSVNANGKYTSLVGH 138

Query: 804 VKGGKPKVSLSLGHHN--NPSNPFSAVFGFHPAPGT-KFSIPESQIQTYLHPQSSNISSP 844
            +GGK   +++LGH    +PS  FSA+ G  P  GT K+   +  I TY  P+S  +  P
Sbjct: 139 SRGGKTAFAVALGHAATLDPSITFSALIGIDPVAGTNKYIRTDPHILTY-KPESFELDIP 197

BLAST of Tan0011111 vs. ExPASy Swiss-Prot
Match: Q94LX1 (Chlorophyllase-1, chloroplastic OS=Citrus unshiu OX=55188 PE=2 SV=1)

HSP 1 Score: 49.3 bits (116), Expect = 2.6e-04
Identity = 46/164 (28.05%), Positives = 72/164 (43.90%), Query Frame = 0

Query: 674 PLPAQNPSKLFKKGKFEVTSIPLNLTSISN-----PLFIFTPTTPDSYPLIFFLPGCIQS 733
           PL A     +F +G +    I L  +S S+     PL I TP    ++ +I FL G   S
Sbjct: 17  PLLATATLPVFTRGIYSTKRITLETSSPSSPPPPKPLIIVTPAEKGTFNVILFLHGTSLS 76

Query: 734 D--YAHFLHLIASHGFLILAPQLFDVKSTTCKMDETDQLTTQVKSDREGVEDKLSKLSEV 793
           +  Y+     IASHGF+++APQL+         +E +      +   +G++  L + +E 
Sbjct: 77  NKSYSKIFDHIASHGFIVVAPQLYTSIPPPSATNELNSAAEVAEWLPQGLQQNLPENTEA 136

Query: 794 -----------KGGKPKVSLSLGHHNNPSNPFSAVFGFHPAPGT 820
                      +GG+   +LSL +       F AV G  P  GT
Sbjct: 137 NVSLVAVMGHSRGGQTAFALSLRY------GFGAVIGLDPVAGT 174

BLAST of Tan0011111 vs. ExPASy Swiss-Prot
Match: Q9MV14 (Chlorophyllase-1, chloroplastic OS=Citrus sinensis OX=2711 GN=CHLASE1 PE=1 SV=1)

HSP 1 Score: 48.9 bits (115), Expect = 3.3e-04
Identity = 46/164 (28.05%), Positives = 72/164 (43.90%), Query Frame = 0

Query: 674 PLPAQNPSKLFKKGKFEVTSIPLNLTSISN-----PLFIFTPTTPDSYPLIFFLPGCIQS 733
           PL A     +F +G +    I L  +S S+     PL I TP    ++ +I FL G   S
Sbjct: 17  PLLATATLPVFTRGIYSTKRITLETSSPSSPPPPKPLIIVTPAGKGTFNVILFLHGTSLS 76

Query: 734 D--YAHFLHLIASHGFLILAPQLFDVKSTTCKMDETDQLTTQVKSDREGVEDKLSKLSEV 793
           +  Y+     IASHGF+++APQL+         +E +      +   +G++  L + +E 
Sbjct: 77  NKSYSKIFDHIASHGFIVVAPQLYTSIPPPSATNELNSAAEVAEWLPQGLQQNLPENTEA 136

Query: 794 -----------KGGKPKVSLSLGHHNNPSNPFSAVFGFHPAPGT 820
                      +GG+   +LSL +       F AV G  P  GT
Sbjct: 137 NVSLVAVMGHSRGGQTAFALSLRY------GFGAVIGLDPVAGT 174

BLAST of Tan0011111 vs. NCBI nr
Match: XP_022995621.1 (uncharacterized protein LOC111491104 isoform X2 [Cucurbita maxima])

HSP 1 Score: 937.2 bits (2421), Expect = 1.0e-268
Identity = 534/954 (55.97%), Positives = 651/954 (68.24%), Query Frame = 0

Query: 7   EKVKREVGDFLHANVTRGNWKAVAEEYEKASNVAQSLKLLREENTALHLAVIDNQEEIVE 66
           +K K  + DFL+ N  RG W+ V ++YE+    AQ LKL R  +TALHLAV+DN+EE+V+
Sbjct: 28  QKQKEILRDFLYTNTKRGKWEEVIKKYEEYPE-AQELKLTRNGDTALHLAVLDNREEVVQ 87

Query: 67  KLVKFICRSKDDDDDYKKLLETTNNMLNNPLHLAAIMGSVRMCRAIASAHDELVNKRNKL 126
           KLV  I  +      Y +LL+TTN+    PLHLAA MGS  MC AIASAHDELV+ RNK+
Sbjct: 88  KLVNRIKHT----SKYDELLKTTNDREETPLHLAAQMGSATMCNAIASAHDELVDLRNKV 147

Query: 127 DQTPLFLAALHGNKDAFYCLYYFSRNFSSHQISSNCR--ANGDTVLHCALRNEQFDLAFQ 186
           D+TPL+LAA  GN+DAF+CLY+F RN +S +I++NCR  +NGDTVLH ALRN+ FDLAFQ
Sbjct: 148 DETPLYLAAASGNRDAFFCLYHFCRNNAS-RITANCRLTSNGDTVLHSALRNDHFDLAFQ 207

Query: 187 FIHLKEEAMNWVNEQGSTALHVLASKPTSFKSGSNINGWQNIIYYLIFVDQLKPRSIKSL 246
            +HL  EAM+WV E G T LHVLASKPT+FKSGS I GW+NI YY   VDQLKP+ I SL
Sbjct: 208 ILHLNNEAMHWVTETGITPLHVLASKPTAFKSGSQIRGWRNIAYYCTHVDQLKPQPIDSL 267

Query: 247 SK----ACKKPSTTASTYFPVKYGTCIDFFTRLWDLFLK--------------------- 306
            +        P+ T++  FP  YGTCIDFFT +WD FLK                     
Sbjct: 268 IRDWIDRMSNPN-TSTPCFPANYGTCIDFFTWVWDGFLKGSGLKRICHDNENDESKKDTQ 327

Query: 307 ------------------------------------------------------------ 366
                                                                       
Sbjct: 328 VVRNIMVEGGESLETDESHQRLDTELLKAHGLAKEASITNVPRNYNTCIHFFQIVFSAIM 387

Query: 367 -----GSAEIRKIREKKEKHTWSVQVMEKLLEAVKPHKYGDDGRTPMNPTFKT--DKEET 426
                GSAE +KIR KKEKHTWSVQVMEKLLE   P +Y  +G TPM+ T +T    E T
Sbjct: 388 ISLGLGSAEFKKIRRKKEKHTWSVQVMEKLLEYAPPDEYECNGGTPMDSTSQTPDQAEVT 447

Query: 427 QPYSVVDDKVNFSPNYDHELLENSKNAEDVSESAMLLATKNGVIEIVRGMYKRFPLAIRD 486
            PYS+V  +V  S + + +  E  K  ++V E+AMLLA KNGVIEIV+GM+ RFPL+I D
Sbjct: 448 LPYSLVAGEVRLSNSIESKPKEAEK-PKNVQETAMLLAAKNGVIEIVKGMFHRFPLSICD 507

Query: 487 SRKDKKNVVLLAAEYRQPDVYRFLLKKKKEIKSQFRAVDDEGNSGLHLAATAINPELWRI 546
           +RKDKKNVVLLAAEY QPDVYRFLL  K   ++ FRAVDD GNS LHLAA A    +WRI
Sbjct: 508 ARKDKKNVVLLAAEYGQPDVYRFLLSPKVYKENLFRAVDDNGNSALHLAAAASKSMIWRI 567

Query: 547 SGVALQMQWEVKWYKYVKKSMPLHFFAHYNNSGKTATTIFHETHKDLMKESGEWLTKTAK 606
           +G ALQMQWE+KWYK+V++S+PL+FFAHYN  GK AT IFHETH DL+K+SGEWLTKT+K
Sbjct: 568 TGAALQMQWEIKWYKFVEESVPLYFFAHYNKEGKNATAIFHETHMDLVKKSGEWLTKTSK 627

Query: 607 SCSVVGTLIVTVAFTSAVSIPGGFDNTDGAPLLEKQPAFLIFAVFSLIALCLSSTSVIMF 666
           SCSVVGTLIVTVAFTS  SIPGGF+  DG+P L+ + AF  FA+FSLIALCLSSTSV MF
Sbjct: 628 SCSVVGTLIVTVAFTSVASIPGGFNPHDGSPFLKDRKAFFTFALFSLIALCLSSTSVTMF 687

Query: 667 LSILTYRFDAHDFKSNLPWKLFLGFSSLFFSIISMLVSFCSGHYFLIDHRLQNVAVLLYT 726
           L+ILTYRFDA+DF++NLPWKLF+GFSSLF SIISMLVSFC+GHYFL+   + + A LLYT
Sbjct: 688 LAILTYRFDANDFRTNLPWKLFIGFSSLFGSIISMLVSFCAGHYFLMHQHIPHHAALLYT 747

Query: 727 LVFLPVVLFFFLSKLPLYIDVLQTIFKIMPRRSSHVLLPADPLPAQNPS-KLFKKGKFEV 786
           +V +PV L F +SKLPLYIDV+Q IFKI+P RS+HV+L +DPLP   PS K F+KGKFEV
Sbjct: 748 IVLVPVALIFIISKLPLYIDVVQAIFKIVPNRSAHVVL-SDPLPPHTPSIKPFQKGKFEV 807

Query: 787 TSIPL---NLTSISNPLFIFTPTTPDSYPLIFFLPGCIQSDYAHFLHLIASHGFLILAPQ 846
           TSIP+   +  S SNPL I TPT   SYPL+FFLPGC + DY+HFL LIAS G +I+ P 
Sbjct: 808 TSIPVEDSDAFSPSNPLLILTPTAQGSYPLLFFLPGCAEYDYSHFLQLIASQGLVIVWPL 867

Query: 847 LFDVKSTTCKMDETDQLTTQVKSDREGVEDKLSKLSE-VKGGKPKV-SLSLGHHNNPSNP 860
               ++T  +M++T Q  T   SDRE VE++LS + +  +GG+PK  SL+LG ++ P NP
Sbjct: 868 QMSAEATRSEMNKTRQFKTWDASDREMVEERLSGVVDPSEGGEPKSWSLALG-YDRPWNP 927

BLAST of Tan0011111 vs. NCBI nr
Match: XP_022995620.1 (uncharacterized protein LOC111491104 isoform X1 [Cucurbita maxima])

HSP 1 Score: 933.7 bits (2412), Expect = 1.1e-267
Identity = 533/955 (55.81%), Positives = 648/955 (67.85%), Query Frame = 0

Query: 7   EKVKREVGDFLHANVTRGNWKAVAEEYEKASNVAQSLKLLREENTALHLAVIDNQEEIVE 66
           +K K  + DFL+ N  RG W+ V ++YE+    AQ LKL R  +TALHLAV+DN+EE+V+
Sbjct: 28  QKQKEILRDFLYTNTKRGKWEEVIKKYEEYPE-AQELKLTRNGDTALHLAVLDNREEVVQ 87

Query: 67  KLVKFICRSKDDDDDYKKLLETTNNMLNNPLHLAAIMGSVRMCRAIASAHDELVNKRNKL 126
           KLV  I  +      Y +LL+TTN+    PLHLAA MGS  MC AIASAHDELV+ RNK+
Sbjct: 88  KLVNRIKHT----SKYDELLKTTNDREETPLHLAAQMGSATMCNAIASAHDELVDLRNKV 147

Query: 127 DQTPLFLAALHGNKDAFYCLYYFSRNFSSHQISSNCR--ANGDTVLHCALRNEQFDLAFQ 186
           D+TPL+LAA  GN+DAF+CLY+F RN +S +I++NCR  +NGDTVLH ALRN+ FDLAFQ
Sbjct: 148 DETPLYLAAASGNRDAFFCLYHFCRNNAS-RITANCRLTSNGDTVLHSALRNDHFDLAFQ 207

Query: 187 FIHLKEEAMNWVNEQGSTALHVLASKPTSFKSGSNINGWQNIIYYLIFVDQLKPRSIKSL 246
            +HL  EAM+WV E G T LHVLASKPT+FKSGS I GW+NI YY   VDQLKP+ I SL
Sbjct: 208 ILHLNNEAMHWVTETGITPLHVLASKPTAFKSGSQIRGWRNIAYYCTHVDQLKPQPIDSL 267

Query: 247 SK----ACKKPSTTASTYFPVKYGTCIDFFTRLWDLFLK--------------------- 306
            +        P+ T++  FP  YGTCIDFFT +WD FLK                     
Sbjct: 268 IRDWIDRMSNPN-TSTPCFPANYGTCIDFFTWVWDGFLKGSGLKRICHDNENDESKKDTQ 327

Query: 307 ------------------------------------------------------------ 366
                                                                       
Sbjct: 328 VVRNIMVEGGESLETDESHQRLDTELLKAHGLAKEASITNVPRNYNTCIHFFQIVFSAIM 387

Query: 367 -----GSAEIRKIREKKEKHTWSVQVMEKLLEAVKPHKYGDDGRTPMNPTFKT--DKEET 426
                GSAE +KIR KKEKHTWSVQVMEKLLE   P +Y  +G TPM+ T +T    E T
Sbjct: 388 ISLGLGSAEFKKIRRKKEKHTWSVQVMEKLLEYAPPDEYECNGGTPMDSTSQTPDQAEVT 447

Query: 427 QPYSVVDDKVNFSPNYDHELLENSKNAE-DVSESAMLLATKNGVIEIVRGMYKRFPLAIR 486
            PYS+V  +V  S + + +  E  K       E+AMLLA KNGVIEIV+GM+ RFPL+I 
Sbjct: 448 LPYSLVAGEVRLSNSIESKPKEAEKPKNVQAPETAMLLAAKNGVIEIVKGMFHRFPLSIC 507

Query: 487 DSRKDKKNVVLLAAEYRQPDVYRFLLKKKKEIKSQFRAVDDEGNSGLHLAATAINPELWR 546
           D+RKDKKNVVLLAAEY QPDVYRFLL  K   ++ FRAVDD GNS LHLAA A    +WR
Sbjct: 508 DARKDKKNVVLLAAEYGQPDVYRFLLSPKVYKENLFRAVDDNGNSALHLAAAASKSMIWR 567

Query: 547 ISGVALQMQWEVKWYKYVKKSMPLHFFAHYNNSGKTATTIFHETHKDLMKESGEWLTKTA 606
           I+G ALQMQWE+KWYK+V++S+PL+FFAHYN  GK AT IFHETH DL+K+SGEWLTKT+
Sbjct: 568 ITGAALQMQWEIKWYKFVEESVPLYFFAHYNKEGKNATAIFHETHMDLVKKSGEWLTKTS 627

Query: 607 KSCSVVGTLIVTVAFTSAVSIPGGFDNTDGAPLLEKQPAFLIFAVFSLIALCLSSTSVIM 666
           KSCSVVGTLIVTVAFTS  SIPGGF+  DG+P L+ + AF  FA+FSLIALCLSSTSV M
Sbjct: 628 KSCSVVGTLIVTVAFTSVASIPGGFNPHDGSPFLKDRKAFFTFALFSLIALCLSSTSVTM 687

Query: 667 FLSILTYRFDAHDFKSNLPWKLFLGFSSLFFSIISMLVSFCSGHYFLIDHRLQNVAVLLY 726
           FL+ILTYRFDA+DF++NLPWKLF+GFSSLF SIISMLVSFC+GHYFL+   + + A LLY
Sbjct: 688 FLAILTYRFDANDFRTNLPWKLFIGFSSLFGSIISMLVSFCAGHYFLMHQHIPHHAALLY 747

Query: 727 TLVFLPVVLFFFLSKLPLYIDVLQTIFKIMPRRSSHVLLPADPLPAQNPS-KLFKKGKFE 786
           T+V +PV L F +SKLPLYIDV+Q IFKI+P RS+HV+L +DPLP   PS K F+KGKFE
Sbjct: 748 TIVLVPVALIFIISKLPLYIDVVQAIFKIVPNRSAHVVL-SDPLPPHTPSIKPFQKGKFE 807

Query: 787 VTSIPL---NLTSISNPLFIFTPTTPDSYPLIFFLPGCIQSDYAHFLHLIASHGFLILAP 846
           VTSIP+   +  S SNPL I TPT   SYPL+FFLPGC + DY+HFL LIAS G +I+ P
Sbjct: 808 VTSIPVEDSDAFSPSNPLLILTPTAQGSYPLLFFLPGCAEYDYSHFLQLIASQGLVIVWP 867

Query: 847 QLFDVKSTTCKMDETDQLTTQVKSDREGVEDKLSKLSE-VKGGKPKV-SLSLGHHNNPSN 860
                ++T  +M++T Q  T   SDRE VE++LS + +  +GG+PK  SL+LG ++ P N
Sbjct: 868 LQMSAEATRSEMNKTRQFKTWDASDREMVEERLSGVVDPSEGGEPKSWSLALG-YDRPWN 927

BLAST of Tan0011111 vs. NCBI nr
Match: XP_022995622.1 (uncharacterized protein LOC111491104 isoform X3 [Cucurbita maxima])

HSP 1 Score: 929.9 bits (2402), Expect = 1.6e-266
Identity = 533/955 (55.81%), Positives = 649/955 (67.96%), Query Frame = 0

Query: 7   EKVKREVGDFLHANVTRGNWKAVAEEYEKASNVAQSLKLLREENTALHLAVIDNQEEIVE 66
           +K K  + DFL+ N  RG W+ V ++YE+    AQ LKL R  +TALHLAV+DN+EE+V+
Sbjct: 28  QKQKEILRDFLYTNTKRGKWEEVIKKYEEYPE-AQELKLTRNGDTALHLAVLDNREEVVQ 87

Query: 67  KLVKFICRSKDDDDDYKKLLETTNNMLNNPLHLAAIMGSVRMCRAIASAHDELVNKRNKL 126
           KLV  I  +      Y +LL+TTN+    PLHLAA MGS  MC AIASAHDELV+ RNK+
Sbjct: 88  KLVNRIKHT----SKYDELLKTTNDREETPLHLAAQMGSATMCNAIASAHDELVDLRNKV 147

Query: 127 DQTPLFLAALHGNKDAFYCLYYFSRNFSSHQISSNCR--ANGDTVLHCALRNEQFDLAFQ 186
           D+TPL+LAA  GN+DAF+CLY+F RN +S +I++NCR  +NGDTVLH ALRN+ FDLAFQ
Sbjct: 148 DETPLYLAAASGNRDAFFCLYHFCRNNAS-RITANCRLTSNGDTVLHSALRNDHFDLAFQ 207

Query: 187 FIHLKEEAMNWVNEQGSTALHVLASKPTSFKSGSNINGWQNIIYYLIFVDQLKPRSIKSL 246
            +HL  EAM+WV E G T LHVLASKPT+FKSGS I GW+NI YY   VDQLKP+ I SL
Sbjct: 208 ILHLNNEAMHWVTETGITPLHVLASKPTAFKSGSQIRGWRNIAYYCTHVDQLKPQPIDSL 267

Query: 247 SK----ACKKPSTTASTYFPVKYGTCIDFFTRLWDLFLK--------------------- 306
            +        P+ T++  FP  YGTCIDFFT +WD FLK                     
Sbjct: 268 IRDWIDRMSNPN-TSTPCFPANYGTCIDFFTWVWDGFLKGSGLKRICHDNENDESKKDTQ 327

Query: 307 ------------------------------------------------------------ 366
                                                                       
Sbjct: 328 VVRNIMVEGGESLETDESHQRLDTELLKAHGLAKEASITNVPRNYNTCIHFFQIVFSAIM 387

Query: 367 -----GSAEIRKIREKKEKHTWSVQVMEKLLEAVKPHKYGDDGRTPMNPTFKT--DKEET 426
                GSAE +KIR KKEKHTWSVQVMEKLLE   P +Y  +G TPM+ T +T    E T
Sbjct: 388 ISLGLGSAEFKKIRRKKEKHTWSVQVMEKLLEYAPPDEYECNGGTPMDSTSQTPDQAEVT 447

Query: 427 QPYSVVDDKVNFSPNYDHELLENSKNAE-DVSESAMLLATKNGVIEIVRGMYKRFPLAIR 486
            PYS+V  +V  S + + +  E  K       E+AMLLA KNGVIEIV+GM+ RFPL+I 
Sbjct: 448 LPYSLVAGEVRLSNSIESKPKEAEKPKNVQAPETAMLLAAKNGVIEIVKGMFHRFPLSIC 507

Query: 487 DSRKDKKNVVLLAAEYRQPDVYRFLLKKKKEIKSQFRAVDDEGNSGLHLAATAINPELWR 546
           D+RKDKKNVVLLAAEY QPDVYRFLL  K   ++ FRAVDD GNS LHLAA A    +WR
Sbjct: 508 DARKDKKNVVLLAAEYGQPDVYRFLLSPKVYKENLFRAVDDNGNSALHLAAAASKSMIWR 567

Query: 547 ISGVALQMQWEVKWYKYVKKSMPLHFFAHYNNSGKTATTIFHETHKDLMKESGEWLTKTA 606
           I+G ALQMQWE+KWYK+V++S+PL+FFAHYN  GK AT IFHETH DL+K+SGEWLTKT+
Sbjct: 568 ITGAALQMQWEIKWYKFVEESVPLYFFAHYNKEGKNATAIFHETHMDLVKKSGEWLTKTS 627

Query: 607 KSCSVVGTLIVTVAFTSAVSIPGGFDNTDGAPLLEKQPAFLIFAVFSLIALCLSSTSVIM 666
           KSCSVVGTLIVTVAFTS  SIPGGF+  DG+P L+ + AF  FA+FSLIALCLSSTSV M
Sbjct: 628 KSCSVVGTLIVTVAFTSVASIPGGFNPHDGSPFLKDRKAFFTFALFSLIALCLSSTSVTM 687

Query: 667 FLSILTYRFDAHDFKSNLPWKLFLGFSSLFFSIISMLVSFCSGHYFLIDHRLQNVAVLLY 726
           FL+ILTYRFDA+DF++NLPWKLF+GFSSLF SIISMLVSFC+GHYFL+   + + A LLY
Sbjct: 688 FLAILTYRFDANDFRTNLPWKLFIGFSSLFGSIISMLVSFCAGHYFLMHQHIPHHAALLY 747

Query: 727 TLVFLPVVLFFFLSKLPLYIDVLQTIFKIMPRRSSHVLLPADPLPAQNPS-KLFKKGKFE 786
           T+V +PV L F +SKLPLYIDV+Q IFKI+P RS+HV+L +DPLP   PS K F+KGKFE
Sbjct: 748 TIVLVPVALIFIISKLPLYIDVVQAIFKIVPNRSAHVVL-SDPLPPHTPSIKPFQKGKFE 807

Query: 787 VTSIPL---NLTSISNPLFIFTPTTPDSYPLIFFLPGCIQSDYAHFLHLIASHGFLILAP 846
           VTSIP+   +  S SNPL I TPT   SYPL+FFLPGC + DY+HFL LIAS G +I+ P
Sbjct: 808 VTSIPVEDSDAFSPSNPLLILTPTAQGSYPLLFFLPGCAEYDYSHFLQLIASQGLVIVWP 867

Query: 847 QLFDVKSTTCKMDETDQLTTQVKSDREGVEDKLSKLSE-VKGGKPKV-SLSLGHHNNPSN 860
               +++T  +M++T Q  T   SDRE VE++LS + +  +GG+PK  SL+LG ++ P N
Sbjct: 868 ----LQATRSEMNKTRQFKTWDASDREMVEERLSGVVDPSEGGEPKSWSLALG-YDRPWN 927

BLAST of Tan0011111 vs. NCBI nr
Match: KAG6606413.1 (Chlorophyllase type 0, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 902.9 bits (2332), Expect = 2.1e-258
Identity = 519/958 (54.18%), Positives = 637/958 (66.49%), Query Frame = 0

Query: 7   EKVKREVGDFLHANVTRGNWKAVAEEYEKASNVAQSLKLLREENTALHLAVIDNQEEIVE 66
           EK +  + DFL+ N+ RGNWK V  +YEK    AQ LKL R  +TALHLAV+DN+EE+V+
Sbjct: 13  EKHEATLRDFLYINMKRGNWKEVINKYEKHPE-AQGLKLTRNGDTALHLAVLDNREEMVQ 72

Query: 67  KLVKFICRSKDDDDDYKKLLETTNNMLNNPLHLAAIMGSVRMCRAIASAHDELVNKRNKL 126
           KLV  I  SK D     KLLETTN+   NPLHLAA MGS  MC AIASAH +LV +RNK+
Sbjct: 73  KLVNRIKDSKCD-----KLLETTNDREENPLHLAAQMGSATMCYAIASAHHKLVEERNKM 132

Query: 127 DQTPLFLAALHGNKDAFYCLYYFSRNFSSHQISSNCR--ANGDTVLHCALRNEQFDLAFQ 186
           D+TPL+LAA  GN+DAF+CLY+F R+    +I++NCR  +NGDTVLH ALRN+ FDLAF 
Sbjct: 133 DETPLYLAAASGNRDAFFCLYHFCRDLGP-EITANCRLSSNGDTVLHSALRNDHFDLAFH 192

Query: 187 FIHLKEEAMNWVNEQGSTALHVLASKPTSFKSGSNINGWQNIIYYLIFVDQLKPRSIKSL 246
            +HL  EAM+WV + G T LHVLASKPT+FKSGS I GW+NI YY   V+QLKP+ I SL
Sbjct: 193 ILHLNNEAMHWVTKDGVTPLHVLASKPTAFKSGSQIRGWRNIAYYCTHVEQLKPQPIDSL 252

Query: 247 SKAC---KKPSTTASTYFPVKYGTCIDFFTRLWDLFLK---------------------- 306
            +        + T++  FP  YG CIDFFT +WD FLK                      
Sbjct: 253 IRDWMDRMSNTNTSTPCFPANYGICIDFFTWVWDGFLKGSGLKRICYDFKNDESKKDTDD 312

Query: 307 ------------------------------------------------------------ 366
                                                                       
Sbjct: 313 AGRNIMVEGGERSEADEPHQRLDTQLLKAHGLAKEASITNVPRNYNTCIHFFQIVFSAIL 372

Query: 367 -----GSAEIRKIREKKEKHTWSVQVMEKLLEAVKPHKYGDDGRTPMNPTFKT--DKEET 426
                GSAE +KIR +KEKHTWSVQVMEKLLE   P +Y  +G  PM+ T +T    + T
Sbjct: 373 ISLGWGSAEFKKIRRQKEKHTWSVQVMEKLLEYAAPDEYDCNGGIPMDSTSQTPDQADVT 432

Query: 427 QPYSVVDDKVNFSPNYDHELLENSKNAEDVSESAMLLATKNGVIEIVRGMYKRFPLAIRD 486
            PYS  DD V FS + + +  E  K  +D  E+ MLLA KNGVIEIV+GM++RFPL+I D
Sbjct: 433 LPYSFEDDDVLFSVHIESKPTEAEK-PKDFQETPMLLAAKNGVIEIVKGMFRRFPLSIYD 492

Query: 487 SRKDKKNVVLLAAEYRQPDVYRFLLKKKKEIKSQFRAVDDEGNSGLHLAATAINPELWRI 546
           + KDKKNVVLLAAEY QPDVYRFLL      ++ FRAVD  GNS LHLAA A    +WRI
Sbjct: 493 AGKDKKNVVLLAAEYGQPDVYRFLLSPNVYKENLFRAVDANGNSALHLAAAASKSMIWRI 552

Query: 547 SGVALQMQWEVKWYKYVKKSMPLHFFAHYNNSGKTATTIFHETHKDLMKESGEWLTKTAK 606
           +G ALQMQWE+KWYK+V++S+PL+FFAHYN  GK AT IFHETH DL+++SG+WLTKT+K
Sbjct: 553 TGAALQMQWEIKWYKFVEESVPLYFFAHYNKEGKNATAIFHETHMDLVQKSGDWLTKTSK 612

Query: 607 SCSVVGTLIVTVAFTSAVSIPGGFDNTDGAPLLEKQPAFLIFAVFSLIALCLSSTSVIMF 666
           SCSVVG LIVTVAFTS  SIPGGF+  DG+P L+ + AF  FA+FSLIALCLSSTSV +F
Sbjct: 613 SCSVVGALIVTVAFTSVASIPGGFNPRDGSPFLQDREAFFTFALFSLIALCLSSTSVTIF 672

Query: 667 LSILTYRFDAHDFKSNLPWKLFLGFSSLFFSIISMLVSFCSGHYFLIDHRLQNVAVLLYT 726
           L+ILT+RFDA+DF++NLPWKLF+GFSSLF SIISML+SFC+GHYFL+   + + A LLYT
Sbjct: 673 LAILTHRFDANDFRTNLPWKLFIGFSSLFGSIISMLISFCAGHYFLMHRHIPHHAALLYT 732

Query: 727 LVFLPVVLFFFLSKLPLYIDVLQTIFKIMPRRSSHVLLPADPLPAQNPS-KLFKKGKFEV 786
           +V +PV L F +SKLPLYIDV+Q IFKI+P+RS+HV+L +DPLP   PS K F+KGKFEV
Sbjct: 733 IVLVPVALIFIISKLPLYIDVVQAIFKIVPKRSAHVVL-SDPLPLHTPSIKPFRKGKFEV 792

Query: 787 TSI---PLNLTSISNPLFIFTPTTPDSYPLIFFLPGC-IQSDYAHFLHLIASHGFLILAP 846
           TS      +  S S PL I  PT   SYPL+FFLPGC  + DY+H L  IAS G +I+ P
Sbjct: 793 TSTAREDSDAFSPSTPLSILAPTAQGSYPLLFFLPGCAAEYDYSHVLQHIASQGLVIVCP 852

Query: 847 QLFDVKSTTCKMDETDQLTTQVKSDREGVEDKLS-KLSEVKGGKPKV-SLSLGHHNNPSN 863
                K+T  + +ET Q  T   +DRE VE++LS  ++E +GGK K  SL+LG ++ P N
Sbjct: 853 LQMRAKATRSEANETSQFKTWDAADREMVEERLSGVVAEFEGGKTKSWSLALG-YDRPRN 912

BLAST of Tan0011111 vs. NCBI nr
Match: XP_022931013.1 (uncharacterized protein LOC111437338 isoform X2 [Cucurbita moschata])

HSP 1 Score: 902.1 bits (2330), Expect = 3.6e-258
Identity = 521/960 (54.27%), Positives = 638/960 (66.46%), Query Frame = 0

Query: 7   EKVKREVGDFLHANVTRGNWKAVAEEYEKASNVAQSLKLLREENTALHLAVIDNQEEIVE 66
           EK +  + DFL+ N  R  W+ V ++YEK    AQ LKL R  +TALHLAV+DN+EE+V+
Sbjct: 28  EKHEATLRDFLYINTKRTEWEKVIKKYEKHPE-AQGLKLTRNGDTALHLAVLDNREEMVQ 87

Query: 67  KLVKFICRSKDDDDDYKKLLETTNNMLNNPLHLAAIMGSVRMCRAIASAHDELVNKRNKL 126
           KLV  I  SK D+     LLETTN+   NPLHLAA MGS  MC AIASAH +LV KRNK+
Sbjct: 88  KLVNRIKDSKRDE-----LLETTNDRKENPLHLAAQMGSATMCYAIASAHHKLVEKRNKI 147

Query: 127 DQTPLFLAALHGNKDAFYCLYYFSRNFSSHQISSNCR--ANGDTVLHCALRNEQFDLAFQ 186
           D+TPL+LAA  GN+DAF+CLY+F R+  S  I++NCR  +NGDTVLH ALRN+ FDLAF 
Sbjct: 148 DETPLYLAAASGNRDAFFCLYHFCRDLDS-GITANCRLSSNGDTVLHSALRNDHFDLAFH 207

Query: 187 FIHLKEEAMNWVNEQGSTALHVLASKPTSFKSGSNINGWQNIIYYLIFVDQLKPRSIKSL 246
            +HL  EAM+WV + G T LHVLASKPT+FKSGS I GW+NI YY   VDQL P+ I SL
Sbjct: 208 ILHLHNEAMHWVTKDGVTPLHVLASKPTAFKSGSQIRGWRNIAYYCTHVDQLNPQPIDSL 267

Query: 247 SK----ACKKPSTTASTYFPVKYGTCIDFFTRLWDLFLK--------------------- 306
            +        P+ T++  FP  Y TCIDFFT +WD FLK                     
Sbjct: 268 IRDWIDRMSNPN-TSTPCFPANYETCIDFFTWVWDGFLKGSGLKRICHDFKNDESKKDTD 327

Query: 307 ------------------------------------------------------------ 366
                                                                       
Sbjct: 328 DAGRNIMVEGGESSEAAEPHQRLDTQLLKAHGLAKEASITNVPRNYNTCIHFFQIVFSAI 387

Query: 367 ------GSAEIRKIREKKEKHTWSVQVMEKLLEAVKPHKYGDDGRTPMNPTFKTDKEE-- 426
                 GSAE +KIR +KEKHTWSVQVMEKLLE   P +Y  +G  PM+ T +T  +   
Sbjct: 388 LISLGWGSAEFKKIRRQKEKHTWSVQVMEKLLEYAAPDEYDCNGGIPMDSTSQTPDQAGV 447

Query: 427 TQPYSVVDDKVNFSPNYDHELLENSK-NAEDVSESAMLLATKNGVIEIVRGMYKRFPLAI 486
           T PYS  DD V FS + + +  E  K   +D  E+ MLLA KNGVIEIV+GM+ RFPL+I
Sbjct: 448 TLPYSFQDDDVLFSVHIESKPTEAEKPKPKDFQETPMLLAAKNGVIEIVKGMFCRFPLSI 507

Query: 487 RDSRKDKKNVVLLAAEYRQPDVYRFLLKKKKEIKSQFRAVDDEGNSGLHLAATAINPELW 546
            D+ KDKKNVVLLAAEY QPDVYRFLL  K   ++ FRAVDD GNS LHLAA A    +W
Sbjct: 508 YDAGKDKKNVVLLAAEYGQPDVYRFLLSPKVYKENLFRAVDDNGNSALHLAAAASKSMIW 567

Query: 547 RISGVALQMQWEVKWYKYVKKSMPLHFFAHYNNSGKTATTIFHETHKDLMKESGEWLTKT 606
           RI+G ALQMQWE+KWYK+V++S+PL+FFAHYN  GK AT IFHETH DL+++SG+WL KT
Sbjct: 568 RITGAALQMQWEIKWYKFVEESVPLYFFAHYNKEGKNATAIFHETHMDLVQKSGDWLIKT 627

Query: 607 AKSCSVVGTLIVTVAFTSAVSIPGGFDNTDGAPLLEKQPAFLIFAVFSLIALCLSSTSVI 666
           +KSCSVVG LIVTVAFTS  SIPGGF+  DG+P L+ + AF  FA+FSLIALCLSSTSV 
Sbjct: 628 SKSCSVVGALIVTVAFTSVASIPGGFNPRDGSPFLQDREAFFTFALFSLIALCLSSTSVT 687

Query: 667 MFLSILTYRFDAHDFKSNLPWKLFLGFSSLFFSIISMLVSFCSGHYFLIDHRLQNVAVLL 726
           +FL+ILT+RFDA+DF++NLPWKLF+GFSSLF SIISML+SFC+GHYFL+   + + A LL
Sbjct: 688 IFLAILTHRFDANDFRTNLPWKLFIGFSSLFGSIISMLISFCAGHYFLMHRHIPHHAALL 747

Query: 727 YTLVFLPVVLFFFLSKLPLYIDVLQTIFKIMPRRSSHVLLPADPLPAQNPS-KLFKKGKF 786
           YT+V +PV L F +SKLPLYIDV+Q IFKI+P+RS+HV+L +DPLP   PS K F+KGKF
Sbjct: 748 YTIVLVPVALIFIISKLPLYIDVVQAIFKIVPKRSAHVVL-SDPLPLHTPSVKPFRKGKF 807

Query: 787 EVTSIPL---NLTSISNPLFIFTPTTPDSYPLIFFLPGC-IQSDYAHFLHLIASHGFLIL 846
           EVTS  +   +  S S PL I  PT   SYPL+FFLPGC  + DY+HFL  IAS G +I+
Sbjct: 808 EVTSTAMEDSDAFSPSTPLSILAPTAQGSYPLLFFLPGCAAEYDYSHFLQRIASQGLVIV 867

Query: 847 APQLFDVKSTTCKMDETDQLTTQVKSDREGVEDKLS-KLSEVKGGKPKV-SLSLGHHNNP 863
            P     K+T  + +ET Q  T   +DRE VE++LS  ++E+KGGK K  SL+LG ++ P
Sbjct: 868 CPLQMRAKATRSEANETSQFKTWDAADREMVEERLSGVVAELKGGKTKSWSLALG-YDRP 927

BLAST of Tan0011111 vs. ExPASy TrEMBL
Match: A0A6J1K2F1 (uncharacterized protein LOC111491104 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111491104 PE=3 SV=1)

HSP 1 Score: 937.2 bits (2421), Expect = 4.9e-269
Identity = 534/954 (55.97%), Positives = 651/954 (68.24%), Query Frame = 0

Query: 7   EKVKREVGDFLHANVTRGNWKAVAEEYEKASNVAQSLKLLREENTALHLAVIDNQEEIVE 66
           +K K  + DFL+ N  RG W+ V ++YE+    AQ LKL R  +TALHLAV+DN+EE+V+
Sbjct: 28  QKQKEILRDFLYTNTKRGKWEEVIKKYEEYPE-AQELKLTRNGDTALHLAVLDNREEVVQ 87

Query: 67  KLVKFICRSKDDDDDYKKLLETTNNMLNNPLHLAAIMGSVRMCRAIASAHDELVNKRNKL 126
           KLV  I  +      Y +LL+TTN+    PLHLAA MGS  MC AIASAHDELV+ RNK+
Sbjct: 88  KLVNRIKHT----SKYDELLKTTNDREETPLHLAAQMGSATMCNAIASAHDELVDLRNKV 147

Query: 127 DQTPLFLAALHGNKDAFYCLYYFSRNFSSHQISSNCR--ANGDTVLHCALRNEQFDLAFQ 186
           D+TPL+LAA  GN+DAF+CLY+F RN +S +I++NCR  +NGDTVLH ALRN+ FDLAFQ
Sbjct: 148 DETPLYLAAASGNRDAFFCLYHFCRNNAS-RITANCRLTSNGDTVLHSALRNDHFDLAFQ 207

Query: 187 FIHLKEEAMNWVNEQGSTALHVLASKPTSFKSGSNINGWQNIIYYLIFVDQLKPRSIKSL 246
            +HL  EAM+WV E G T LHVLASKPT+FKSGS I GW+NI YY   VDQLKP+ I SL
Sbjct: 208 ILHLNNEAMHWVTETGITPLHVLASKPTAFKSGSQIRGWRNIAYYCTHVDQLKPQPIDSL 267

Query: 247 SK----ACKKPSTTASTYFPVKYGTCIDFFTRLWDLFLK--------------------- 306
            +        P+ T++  FP  YGTCIDFFT +WD FLK                     
Sbjct: 268 IRDWIDRMSNPN-TSTPCFPANYGTCIDFFTWVWDGFLKGSGLKRICHDNENDESKKDTQ 327

Query: 307 ------------------------------------------------------------ 366
                                                                       
Sbjct: 328 VVRNIMVEGGESLETDESHQRLDTELLKAHGLAKEASITNVPRNYNTCIHFFQIVFSAIM 387

Query: 367 -----GSAEIRKIREKKEKHTWSVQVMEKLLEAVKPHKYGDDGRTPMNPTFKT--DKEET 426
                GSAE +KIR KKEKHTWSVQVMEKLLE   P +Y  +G TPM+ T +T    E T
Sbjct: 388 ISLGLGSAEFKKIRRKKEKHTWSVQVMEKLLEYAPPDEYECNGGTPMDSTSQTPDQAEVT 447

Query: 427 QPYSVVDDKVNFSPNYDHELLENSKNAEDVSESAMLLATKNGVIEIVRGMYKRFPLAIRD 486
            PYS+V  +V  S + + +  E  K  ++V E+AMLLA KNGVIEIV+GM+ RFPL+I D
Sbjct: 448 LPYSLVAGEVRLSNSIESKPKEAEK-PKNVQETAMLLAAKNGVIEIVKGMFHRFPLSICD 507

Query: 487 SRKDKKNVVLLAAEYRQPDVYRFLLKKKKEIKSQFRAVDDEGNSGLHLAATAINPELWRI 546
           +RKDKKNVVLLAAEY QPDVYRFLL  K   ++ FRAVDD GNS LHLAA A    +WRI
Sbjct: 508 ARKDKKNVVLLAAEYGQPDVYRFLLSPKVYKENLFRAVDDNGNSALHLAAAASKSMIWRI 567

Query: 547 SGVALQMQWEVKWYKYVKKSMPLHFFAHYNNSGKTATTIFHETHKDLMKESGEWLTKTAK 606
           +G ALQMQWE+KWYK+V++S+PL+FFAHYN  GK AT IFHETH DL+K+SGEWLTKT+K
Sbjct: 568 TGAALQMQWEIKWYKFVEESVPLYFFAHYNKEGKNATAIFHETHMDLVKKSGEWLTKTSK 627

Query: 607 SCSVVGTLIVTVAFTSAVSIPGGFDNTDGAPLLEKQPAFLIFAVFSLIALCLSSTSVIMF 666
           SCSVVGTLIVTVAFTS  SIPGGF+  DG+P L+ + AF  FA+FSLIALCLSSTSV MF
Sbjct: 628 SCSVVGTLIVTVAFTSVASIPGGFNPHDGSPFLKDRKAFFTFALFSLIALCLSSTSVTMF 687

Query: 667 LSILTYRFDAHDFKSNLPWKLFLGFSSLFFSIISMLVSFCSGHYFLIDHRLQNVAVLLYT 726
           L+ILTYRFDA+DF++NLPWKLF+GFSSLF SIISMLVSFC+GHYFL+   + + A LLYT
Sbjct: 688 LAILTYRFDANDFRTNLPWKLFIGFSSLFGSIISMLVSFCAGHYFLMHQHIPHHAALLYT 747

Query: 727 LVFLPVVLFFFLSKLPLYIDVLQTIFKIMPRRSSHVLLPADPLPAQNPS-KLFKKGKFEV 786
           +V +PV L F +SKLPLYIDV+Q IFKI+P RS+HV+L +DPLP   PS K F+KGKFEV
Sbjct: 748 IVLVPVALIFIISKLPLYIDVVQAIFKIVPNRSAHVVL-SDPLPPHTPSIKPFQKGKFEV 807

Query: 787 TSIPL---NLTSISNPLFIFTPTTPDSYPLIFFLPGCIQSDYAHFLHLIASHGFLILAPQ 846
           TSIP+   +  S SNPL I TPT   SYPL+FFLPGC + DY+HFL LIAS G +I+ P 
Sbjct: 808 TSIPVEDSDAFSPSNPLLILTPTAQGSYPLLFFLPGCAEYDYSHFLQLIASQGLVIVWPL 867

Query: 847 LFDVKSTTCKMDETDQLTTQVKSDREGVEDKLSKLSE-VKGGKPKV-SLSLGHHNNPSNP 860
               ++T  +M++T Q  T   SDRE VE++LS + +  +GG+PK  SL+LG ++ P NP
Sbjct: 868 QMSAEATRSEMNKTRQFKTWDASDREMVEERLSGVVDPSEGGEPKSWSLALG-YDRPWNP 927
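
The percentages in an HSP statistics line such as "Identity = 534/954 (55.97%)" are the matched-residue count divided by the alignment length. The following is a minimal sketch, not part of the original record, that parses such a line and recomputes both percentages; the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, chosen only for illustration.

```python
import re

# Matches the statistics line of a BLAST HSP header. "Posi?tives" also
# accepts the "Postives" spelling that some report renderings use.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus recomputed and reported percentages for one HSP."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("no HSP statistics found in: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Recomputed from the raw counts, rounded like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 534/954 (55.97%), Positives = 651/954 (68.24%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```

Comparing the recomputed values with the reported ones is a quick consistency check when alignment headers are copied between tools.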

BLAST of Tan0011111 vs. ExPASy TrEMBL
Match: A0A6J1K4G3 (uncharacterized protein LOC111491104 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111491104 PE=3 SV=1)

HSP 1 Score: 933.7 bits (2412), Expect = 5.5e-268
Identity = 533/955 (55.81%), Positives = 648/955 (67.85%), Query Frame = 0

Query: 7   EKVKREVGDFLHANVTRGNWKAVAEEYEKASNVAQSLKLLREENTALHLAVIDNQEEIVE 66
           +K K  + DFL+ N  RG W+ V ++YE+    AQ LKL R  +TALHLAV+DN+EE+V+
Sbjct: 28  QKQKEILRDFLYTNTKRGKWEEVIKKYEEYPE-AQELKLTRNGDTALHLAVLDNREEVVQ 87

Query: 67  KLVKFICRSKDDDDDYKKLLETTNNMLNNPLHLAAIMGSVRMCRAIASAHDELVNKRNKL 126
           KLV  I  +      Y +LL+TTN+    PLHLAA MGS  MC AIASAHDELV+ RNK+
Sbjct: 88  KLVNRIKHT----SKYDELLKTTNDREETPLHLAAQMGSATMCNAIASAHDELVDLRNKV 147

Query: 127 DQTPLFLAALHGNKDAFYCLYYFSRNFSSHQISSNCR--ANGDTVLHCALRNEQFDLAFQ 186
           D+TPL+LAA  GN+DAF+CLY+F RN +S +I++NCR  +NGDTVLH ALRN+ FDLAFQ
Sbjct: 148 DETPLYLAAASGNRDAFFCLYHFCRNNAS-RITANCRLTSNGDTVLHSALRNDHFDLAFQ 207

Query: 187 FIHLKEEAMNWVNEQGSTALHVLASKPTSFKSGSNINGWQNIIYYLIFVDQLKPRSIKSL 246
            +HL  EAM+WV E G T LHVLASKPT+FKSGS I GW+NI YY   VDQLKP+ I SL
Sbjct: 208 ILHLNNEAMHWVTETGITPLHVLASKPTAFKSGSQIRGWRNIAYYCTHVDQLKPQPIDSL 267

Query: 247 SK----ACKKPSTTASTYFPVKYGTCIDFFTRLWDLFLK--------------------- 306
            +        P+ T++  FP  YGTCIDFFT +WD FLK                     
Sbjct: 268 IRDWIDRMSNPN-TSTPCFPANYGTCIDFFTWVWDGFLKGSGLKRICHDNENDESKKDTQ 327

Query: 307 ------------------------------------------------------------ 366
                                                                       
Sbjct: 328 VVRNIMVEGGESLETDESHQRLDTELLKAHGLAKEASITNVPRNYNTCIHFFQIVFSAIM 387

Query: 367 -----GSAEIRKIREKKEKHTWSVQVMEKLLEAVKPHKYGDDGRTPMNPTFKT--DKEET 426
                GSAE +KIR KKEKHTWSVQVMEKLLE   P +Y  +G TPM+ T +T    E T
Sbjct: 388 ISLGLGSAEFKKIRRKKEKHTWSVQVMEKLLEYAPPDEYECNGGTPMDSTSQTPDQAEVT 447

Query: 427 QPYSVVDDKVNFSPNYDHELLENSKNAE-DVSESAMLLATKNGVIEIVRGMYKRFPLAIR 486
            PYS+V  +V  S + + +  E  K       E+AMLLA KNGVIEIV+GM+ RFPL+I 
Sbjct: 448 LPYSLVAGEVRLSNSIESKPKEAEKPKNVQAPETAMLLAAKNGVIEIVKGMFHRFPLSIC 507

Query: 487 DSRKDKKNVVLLAAEYRQPDVYRFLLKKKKEIKSQFRAVDDEGNSGLHLAATAINPELWR 546
           D+RKDKKNVVLLAAEY QPDVYRFLL  K   ++ FRAVDD GNS LHLAA A    +WR
Sbjct: 508 DARKDKKNVVLLAAEYGQPDVYRFLLSPKVYKENLFRAVDDNGNSALHLAAAASKSMIWR 567

Query: 547 ISGVALQMQWEVKWYKYVKKSMPLHFFAHYNNSGKTATTIFHETHKDLMKESGEWLTKTA 606
           I+G ALQMQWE+KWYK+V++S+PL+FFAHYN  GK AT IFHETH DL+K+SGEWLTKT+
Sbjct: 568 ITGAALQMQWEIKWYKFVEESVPLYFFAHYNKEGKNATAIFHETHMDLVKKSGEWLTKTS 627

Query: 607 KSCSVVGTLIVTVAFTSAVSIPGGFDNTDGAPLLEKQPAFLIFAVFSLIALCLSSTSVIM 666
           KSCSVVGTLIVTVAFTS  SIPGGF+  DG+P L+ + AF  FA+FSLIALCLSSTSV M
Sbjct: 628 KSCSVVGTLIVTVAFTSVASIPGGFNPHDGSPFLKDRKAFFTFALFSLIALCLSSTSVTM 687

Query: 667 FLSILTYRFDAHDFKSNLPWKLFLGFSSLFFSIISMLVSFCSGHYFLIDHRLQNVAVLLY 726
           FL+ILTYRFDA+DF++NLPWKLF+GFSSLF SIISMLVSFC+GHYFL+   + + A LLY
Sbjct: 688 FLAILTYRFDANDFRTNLPWKLFIGFSSLFGSIISMLVSFCAGHYFLMHQHIPHHAALLY 747

Query: 727 TLVFLPVVLFFFLSKLPLYIDVLQTIFKIMPRRSSHVLLPADPLPAQNPS-KLFKKGKFE 786
           T+V +PV L F +SKLPLYIDV+Q IFKI+P RS+HV+L +DPLP   PS K F+KGKFE
Sbjct: 748 TIVLVPVALIFIISKLPLYIDVVQAIFKIVPNRSAHVVL-SDPLPPHTPSIKPFQKGKFE 807

Query: 787 VTSIPL---NLTSISNPLFIFTPTTPDSYPLIFFLPGCIQSDYAHFLHLIASHGFLILAP 846
           VTSIP+   +  S SNPL I TPT   SYPL+FFLPGC + DY+HFL LIAS G +I+ P
Sbjct: 808 VTSIPVEDSDAFSPSNPLLILTPTAQGSYPLLFFLPGCAEYDYSHFLQLIASQGLVIVWP 867

Query: 847 QLFDVKSTTCKMDETDQLTTQVKSDREGVEDKLSKLSE-VKGGKPKV-SLSLGHHNNPSN 860
                ++T  +M++T Q  T   SDRE VE++LS + +  +GG+PK  SL+LG ++ P N
Sbjct: 868 LQMSAEATRSEMNKTRQFKTWDASDREMVEERLSGVVDPSEGGEPKSWSLALG-YDRPWN 927

BLAST of Tan0011111 vs. ExPASy TrEMBL
Match: A0A6J1JZF8 (uncharacterized protein LOC111491104 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111491104 PE=3 SV=1)

HSP 1 Score: 929.9 bits (2402), Expect = 7.9e-267
Identity = 533/955 (55.81%), Positives = 649/955 (67.96%), Query Frame = 0

Query: 7   EKVKREVGDFLHANVTRGNWKAVAEEYEKASNVAQSLKLLREENTALHLAVIDNQEEIVE 66
           +K K  + DFL+ N  RG W+ V ++YE+    AQ LKL R  +TALHLAV+DN+EE+V+
Sbjct: 28  QKQKEILRDFLYTNTKRGKWEEVIKKYEEYPE-AQELKLTRNGDTALHLAVLDNREEVVQ 87

Query: 67  KLVKFICRSKDDDDDYKKLLETTNNMLNNPLHLAAIMGSVRMCRAIASAHDELVNKRNKL 126
           KLV  I  +      Y +LL+TTN+    PLHLAA MGS  MC AIASAHDELV+ RNK+
Sbjct: 88  KLVNRIKHT----SKYDELLKTTNDREETPLHLAAQMGSATMCNAIASAHDELVDLRNKV 147

Query: 127 DQTPLFLAALHGNKDAFYCLYYFSRNFSSHQISSNCR--ANGDTVLHCALRNEQFDLAFQ 186
           D+TPL+LAA  GN+DAF+CLY+F RN +S +I++NCR  +NGDTVLH ALRN+ FDLAFQ
Sbjct: 148 DETPLYLAAASGNRDAFFCLYHFCRNNAS-RITANCRLTSNGDTVLHSALRNDHFDLAFQ 207

Query: 187 FIHLKEEAMNWVNEQGSTALHVLASKPTSFKSGSNINGWQNIIYYLIFVDQLKPRSIKSL 246
            +HL  EAM+WV E G T LHVLASKPT+FKSGS I GW+NI YY   VDQLKP+ I SL
Sbjct: 208 ILHLNNEAMHWVTETGITPLHVLASKPTAFKSGSQIRGWRNIAYYCTHVDQLKPQPIDSL 267

Query: 247 SK----ACKKPSTTASTYFPVKYGTCIDFFTRLWDLFLK--------------------- 306
            +        P+ T++  FP  YGTCIDFFT +WD FLK                     
Sbjct: 268 IRDWIDRMSNPN-TSTPCFPANYGTCIDFFTWVWDGFLKGSGLKRICHDNENDESKKDTQ 327

Query: 307 ------------------------------------------------------------ 366
                                                                       
Sbjct: 328 VVRNIMVEGGESLETDESHQRLDTELLKAHGLAKEASITNVPRNYNTCIHFFQIVFSAIM 387

Query: 367 -----GSAEIRKIREKKEKHTWSVQVMEKLLEAVKPHKYGDDGRTPMNPTFKT--DKEET 426
                GSAE +KIR KKEKHTWSVQVMEKLLE   P +Y  +G TPM+ T +T    E T
Sbjct: 388 ISLGLGSAEFKKIRRKKEKHTWSVQVMEKLLEYAPPDEYECNGGTPMDSTSQTPDQAEVT 447

Query: 427 QPYSVVDDKVNFSPNYDHELLENSKNAE-DVSESAMLLATKNGVIEIVRGMYKRFPLAIR 486
            PYS+V  +V  S + + +  E  K       E+AMLLA KNGVIEIV+GM+ RFPL+I 
Sbjct: 448 LPYSLVAGEVRLSNSIESKPKEAEKPKNVQAPETAMLLAAKNGVIEIVKGMFHRFPLSIC 507

Query: 487 DSRKDKKNVVLLAAEYRQPDVYRFLLKKKKEIKSQFRAVDDEGNSGLHLAATAINPELWR 546
           D+RKDKKNVVLLAAEY QPDVYRFLL  K   ++ FRAVDD GNS LHLAA A    +WR
Sbjct: 508 DARKDKKNVVLLAAEYGQPDVYRFLLSPKVYKENLFRAVDDNGNSALHLAAAASKSMIWR 567

Query: 547 ISGVALQMQWEVKWYKYVKKSMPLHFFAHYNNSGKTATTIFHETHKDLMKESGEWLTKTA 606
           I+G ALQMQWE+KWYK+V++S+PL+FFAHYN  GK AT IFHETH DL+K+SGEWLTKT+
Sbjct: 568 ITGAALQMQWEIKWYKFVEESVPLYFFAHYNKEGKNATAIFHETHMDLVKKSGEWLTKTS 627

Query: 607 KSCSVVGTLIVTVAFTSAVSIPGGFDNTDGAPLLEKQPAFLIFAVFSLIALCLSSTSVIM 666
           KSCSVVGTLIVTVAFTS  SIPGGF+  DG+P L+ + AF  FA+FSLIALCLSSTSV M
Sbjct: 628 KSCSVVGTLIVTVAFTSVASIPGGFNPHDGSPFLKDRKAFFTFALFSLIALCLSSTSVTM 687

Query: 667 FLSILTYRFDAHDFKSNLPWKLFLGFSSLFFSIISMLVSFCSGHYFLIDHRLQNVAVLLY 726
           FL+ILTYRFDA+DF++NLPWKLF+GFSSLF SIISMLVSFC+GHYFL+   + + A LLY
Sbjct: 688 FLAILTYRFDANDFRTNLPWKLFIGFSSLFGSIISMLVSFCAGHYFLMHQHIPHHAALLY 747

Query: 727 TLVFLPVVLFFFLSKLPLYIDVLQTIFKIMPRRSSHVLLPADPLPAQNPS-KLFKKGKFE 786
           T+V +PV L F +SKLPLYIDV+Q IFKI+P RS+HV+L +DPLP   PS K F+KGKFE
Sbjct: 748 TIVLVPVALIFIISKLPLYIDVVQAIFKIVPNRSAHVVL-SDPLPPHTPSIKPFQKGKFE 807

Query: 787 VTSIPL---NLTSISNPLFIFTPTTPDSYPLIFFLPGCIQSDYAHFLHLIASHGFLILAP 846
           VTSIP+   +  S SNPL I TPT   SYPL+FFLPGC + DY+HFL LIAS G +I+ P
Sbjct: 808 VTSIPVEDSDAFSPSNPLLILTPTAQGSYPLLFFLPGCAEYDYSHFLQLIASQGLVIVWP 867

Query: 847 QLFDVKSTTCKMDETDQLTTQVKSDREGVEDKLSKLSE-VKGGKPKV-SLSLGHHNNPSN 860
               +++T  +M++T Q  T   SDRE VE++LS + +  +GG+PK  SL+LG ++ P N
Sbjct: 868 ----LQATRSEMNKTRQFKTWDASDREMVEERLSGVVDPSEGGEPKSWSLALG-YDRPWN 927

BLAST of Tan0011111 vs. ExPASy TrEMBL
Match: A0A6J1EX58 (uncharacterized protein LOC111437338 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111437338 PE=3 SV=1)

HSP 1 Score: 902.1 bits (2330), Expect = 1.8e-258
Identity = 521/960 (54.27%), Positives = 638/960 (66.46%), Query Frame = 0

Query: 7   EKVKREVGDFLHANVTRGNWKAVAEEYEKASNVAQSLKLLREENTALHLAVIDNQEEIVE 66
           EK +  + DFL+ N  R  W+ V ++YEK    AQ LKL R  +TALHLAV+DN+EE+V+
Sbjct: 28  EKHEATLRDFLYINTKRTEWEKVIKKYEKHPE-AQGLKLTRNGDTALHLAVLDNREEMVQ 87

Query: 67  KLVKFICRSKDDDDDYKKLLETTNNMLNNPLHLAAIMGSVRMCRAIASAHDELVNKRNKL 126
           KLV  I  SK D+     LLETTN+   NPLHLAA MGS  MC AIASAH +LV KRNK+
Sbjct: 88  KLVNRIKDSKRDE-----LLETTNDRKENPLHLAAQMGSATMCYAIASAHHKLVEKRNKI 147

Query: 127 DQTPLFLAALHGNKDAFYCLYYFSRNFSSHQISSNCR--ANGDTVLHCALRNEQFDLAFQ 186
           D+TPL+LAA  GN+DAF+CLY+F R+  S  I++NCR  +NGDTVLH ALRN+ FDLAF 
Sbjct: 148 DETPLYLAAASGNRDAFFCLYHFCRDLDS-GITANCRLSSNGDTVLHSALRNDHFDLAFH 207

Query: 187 FIHLKEEAMNWVNEQGSTALHVLASKPTSFKSGSNINGWQNIIYYLIFVDQLKPRSIKSL 246
            +HL  EAM+WV + G T LHVLASKPT+FKSGS I GW+NI YY   VDQL P+ I SL
Sbjct: 208 ILHLHNEAMHWVTKDGVTPLHVLASKPTAFKSGSQIRGWRNIAYYCTHVDQLNPQPIDSL 267

Query: 247 SK----ACKKPSTTASTYFPVKYGTCIDFFTRLWDLFLK--------------------- 306
            +        P+ T++  FP  Y TCIDFFT +WD FLK                     
Sbjct: 268 IRDWIDRMSNPN-TSTPCFPANYETCIDFFTWVWDGFLKGSGLKRICHDFKNDESKKDTD 327

Query: 307 ------------------------------------------------------------ 366
                                                                       
Sbjct: 328 DAGRNIMVEGGESSEAAEPHQRLDTQLLKAHGLAKEASITNVPRNYNTCIHFFQIVFSAI 387

Query: 367 ------GSAEIRKIREKKEKHTWSVQVMEKLLEAVKPHKYGDDGRTPMNPTFKTDKEE-- 426
                 GSAE +KIR +KEKHTWSVQVMEKLLE   P +Y  +G  PM+ T +T  +   
Sbjct: 388 LISLGWGSAEFKKIRRQKEKHTWSVQVMEKLLEYAAPDEYDCNGGIPMDSTSQTPDQAGV 447

Query: 427 TQPYSVVDDKVNFSPNYDHELLENSK-NAEDVSESAMLLATKNGVIEIVRGMYKRFPLAI 486
           T PYS  DD V FS + + +  E  K   +D  E+ MLLA KNGVIEIV+GM+ RFPL+I
Sbjct: 448 TLPYSFQDDDVLFSVHIESKPTEAEKPKPKDFQETPMLLAAKNGVIEIVKGMFCRFPLSI 507

Query: 487 RDSRKDKKNVVLLAAEYRQPDVYRFLLKKKKEIKSQFRAVDDEGNSGLHLAATAINPELW 546
            D+ KDKKNVVLLAAEY QPDVYRFLL  K   ++ FRAVDD GNS LHLAA A    +W
Sbjct: 508 YDAGKDKKNVVLLAAEYGQPDVYRFLLSPKVYKENLFRAVDDNGNSALHLAAAASKSMIW 567

Query: 547 RISGVALQMQWEVKWYKYVKKSMPLHFFAHYNNSGKTATTIFHETHKDLMKESGEWLTKT 606
           RI+G ALQMQWE+KWYK+V++S+PL+FFAHYN  GK AT IFHETH DL+++SG+WL KT
Sbjct: 568 RITGAALQMQWEIKWYKFVEESVPLYFFAHYNKEGKNATAIFHETHMDLVQKSGDWLIKT 627

Query: 607 AKSCSVVGTLIVTVAFTSAVSIPGGFDNTDGAPLLEKQPAFLIFAVFSLIALCLSSTSVI 666
           +KSCSVVG LIVTVAFTS  SIPGGF+  DG+P L+ + AF  FA+FSLIALCLSSTSV 
Sbjct: 628 SKSCSVVGALIVTVAFTSVASIPGGFNPRDGSPFLQDREAFFTFALFSLIALCLSSTSVT 687

Query: 667 MFLSILTYRFDAHDFKSNLPWKLFLGFSSLFFSIISMLVSFCSGHYFLIDHRLQNVAVLL 726
           +FL+ILT+RFDA+DF++NLPWKLF+GFSSLF SIISML+SFC+GHYFL+   + + A LL
Sbjct: 688 IFLAILTHRFDANDFRTNLPWKLFIGFSSLFGSIISMLISFCAGHYFLMHRHIPHHAALL 747

Query: 727 YTLVFLPVVLFFFLSKLPLYIDVLQTIFKIMPRRSSHVLLPADPLPAQNPS-KLFKKGKF 786
           YT+V +PV L F +SKLPLYIDV+Q IFKI+P+RS+HV+L +DPLP   PS K F+KGKF
Sbjct: 748 YTIVLVPVALIFIISKLPLYIDVVQAIFKIVPKRSAHVVL-SDPLPLHTPSVKPFRKGKF 807

Query: 787 EVTSIPL---NLTSISNPLFIFTPTTPDSYPLIFFLPGC-IQSDYAHFLHLIASHGFLIL 846
           EVTS  +   +  S S PL I  PT   SYPL+FFLPGC  + DY+HFL  IAS G +I+
Sbjct: 808 EVTSTAMEDSDAFSPSTPLSILAPTAQGSYPLLFFLPGCAAEYDYSHFLQRIASQGLVIV 867

Query: 847 APQLFDVKSTTCKMDETDQLTTQVKSDREGVEDKLS-KLSEVKGGKPKV-SLSLGHHNNP 863
            P     K+T  + +ET Q  T   +DRE VE++LS  ++E+KGGK K  SL+LG ++ P
Sbjct: 868 CPLQMRAKATRSEANETSQFKTWDAADREMVEERLSGVVAELKGGKTKSWSLALG-YDRP 927

BLAST of Tan0011111 vs. ExPASy TrEMBL
Match: A0A6J1ESA5 (uncharacterized protein LOC111437338 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111437338 PE=3 SV=1)

HSP 1 Score: 897.5 bits (2318), Expect = 4.3e-257
Identity = 520/962 (54.05%), Positives = 636/962 (66.11%), Query Frame = 0

Query: 7   EKVKREVGDFLHANVTRGNWKAVAEEYEKASNVAQSLKLLREENTALHLAVIDNQEEIVE 66
           EK +  + DFL+ N  R  W+ V ++YEK    AQ LKL R  +TALHLAV+DN+EE+V+
Sbjct: 28  EKHEATLRDFLYINTKRTEWEKVIKKYEKHPE-AQGLKLTRNGDTALHLAVLDNREEMVQ 87

Query: 67  KLVKFICRSKDDDDDYKKLLETTNNMLNNPLHLAAIMGSVRMCRAIASAHDELVNKRNKL 126
           KLV  I  SK D+     LLETTN+   NPLHLAA MGS  MC AIASAH +LV KRNK+
Sbjct: 88  KLVNRIKDSKRDE-----LLETTNDRKENPLHLAAQMGSATMCYAIASAHHKLVEKRNKI 147

Query: 127 DQTPLFLAALHGNKDAFYCLYYFSRNFSSHQISSNCR--ANGDTVLHCALRNEQFDLAFQ 186
           D+TPL+LAA  GN+DAF+CLY+F R+  S  I++NCR  +NGDTVLH ALRN+ FDLAF 
Sbjct: 148 DETPLYLAAASGNRDAFFCLYHFCRDLDS-GITANCRLSSNGDTVLHSALRNDHFDLAFH 207

Query: 187 FIHLKEEAMNWVNEQGSTALHVLASKPTSFKSGSNINGWQNIIYYLIFVDQLKPRSIKSL 246
            +HL  EAM+WV + G T LHVLASKPT+FKSGS I GW+NI YY   VDQL P+ I SL
Sbjct: 208 ILHLHNEAMHWVTKDGVTPLHVLASKPTAFKSGSQIRGWRNIAYYCTHVDQLNPQPIDSL 267

Query: 247 SK----ACKKPSTTASTYFPVKYGTCIDFFTRLWDLFLK--------------------- 306
            +        P+ T++  FP  Y TCIDFFT +WD FLK                     
Sbjct: 268 IRDWIDRMSNPN-TSTPCFPANYETCIDFFTWVWDGFLKGSGLKRICHDFKNDESKKDTD 327

Query: 307 ------------------------------------------------------------ 366
                                                                       
Sbjct: 328 DAGRNIMVEGGESSEAAEPHQRLDTQLLKAHGLAKEASITNVPRNYNTCIHFFQIVFSAI 387

Query: 367 ------GSAEIRKIREKKEKHTWSVQVMEKLLEAVKPHKYGDDGRTPMNPTFKTDKEE-- 426
                 GSAE +KIR +KEKHTWSVQVMEKLLE   P +Y  +G  PM+ T +T  +   
Sbjct: 388 LISLGWGSAEFKKIRRQKEKHTWSVQVMEKLLEYAAPDEYDCNGGIPMDSTSQTPDQAGV 447

Query: 427 TQPYSVVDDKVNFSPNYDHELLENSKNAE---DVSESAMLLATKNGVIEIVRGMYKRFPL 486
           T PYS  DD V FS + + +  E  K         E+ MLLA KNGVIEIV+GM+ RFPL
Sbjct: 448 TLPYSFQDDDVLFSVHIESKPTEAEKPKPKDFQAPETPMLLAAKNGVIEIVKGMFCRFPL 507

Query: 487 AIRDSRKDKKNVVLLAAEYRQPDVYRFLLKKKKEIKSQFRAVDDEGNSGLHLAATAINPE 546
           +I D+ KDKKNVVLLAAEY QPDVYRFLL  K   ++ FRAVDD GNS LHLAA A    
Sbjct: 508 SIYDAGKDKKNVVLLAAEYGQPDVYRFLLSPKVYKENLFRAVDDNGNSALHLAAAASKSM 567

Query: 547 LWRISGVALQMQWEVKWYKYVKKSMPLHFFAHYNNSGKTATTIFHETHKDLMKESGEWLT 606
           +WRI+G ALQMQWE+KWYK+V++S+PL+FFAHYN  GK AT IFHETH DL+++SG+WL 
Sbjct: 568 IWRITGAALQMQWEIKWYKFVEESVPLYFFAHYNKEGKNATAIFHETHMDLVQKSGDWLI 627

Query: 607 KTAKSCSVVGTLIVTVAFTSAVSIPGGFDNTDGAPLLEKQPAFLIFAVFSLIALCLSSTS 666
           KT+KSCSVVG LIVTVAFTS  SIPGGF+  DG+P L+ + AF  FA+FSLIALCLSSTS
Sbjct: 628 KTSKSCSVVGALIVTVAFTSVASIPGGFNPRDGSPFLQDREAFFTFALFSLIALCLSSTS 687

Query: 667 VIMFLSILTYRFDAHDFKSNLPWKLFLGFSSLFFSIISMLVSFCSGHYFLIDHRLQNVAV 726
           V +FL+ILT+RFDA+DF++NLPWKLF+GFSSLF SIISML+SFC+GHYFL+   + + A 
Sbjct: 688 VTIFLAILTHRFDANDFRTNLPWKLFIGFSSLFGSIISMLISFCAGHYFLMHRHIPHHAA 747

Query: 727 LLYTLVFLPVVLFFFLSKLPLYIDVLQTIFKIMPRRSSHVLLPADPLPAQNPS-KLFKKG 786
           LLYT+V +PV L F +SKLPLYIDV+Q IFKI+P+RS+HV+L +DPLP   PS K F+KG
Sbjct: 748 LLYTIVLVPVALIFIISKLPLYIDVVQAIFKIVPKRSAHVVL-SDPLPLHTPSVKPFRKG 807

Query: 787 KFEVTSIPL---NLTSISNPLFIFTPTTPDSYPLIFFLPGC-IQSDYAHFLHLIASHGFL 846
           KFEVTS  +   +  S S PL I  PT   SYPL+FFLPGC  + DY+HFL  IAS G +
Sbjct: 808 KFEVTSTAMEDSDAFSPSTPLSILAPTAQGSYPLLFFLPGCAAEYDYSHFLQRIASQGLV 867

Query: 847 ILAPQLFDVKSTTCKMDETDQLTTQVKSDREGVEDKLS-KLSEVKGGKPKV-SLSLGHHN 863
           I+ P     K+T  + +ET Q  T   +DRE VE++LS  ++E+KGGK K  SL+LG ++
Sbjct: 868 IVCPLQMRAKATRSEANETSQFKTWDAADREMVEERLSGVVAELKGGKTKSWSLALG-YD 927

BLAST of Tan0011111 vs. TAIR 10
Match: AT3G18670.1 (Ankyrin repeat family protein )

HSP 1 Score: 134.8 bits (338), Expect = 3.3e-31
Identity = 96/298 (32.21%), Positives = 158/298 (53.02%), Query Frame = 0

Query: 366 AMLLATKNGVIEIVRGMYKRFPLAIRDSRKDKKNVVLLAAEYRQPDVYRFLLK---KKKE 425
           A+  A +NG++E +  M + +P  +        N+   A   RQ  ++  +     KK  
Sbjct: 292 ALFKAVENGIVEYIEEMMRHYPDIVWSKNSSGLNIFFYAVSQRQEKIFSLIYNIGAKKNI 351

Query: 426 IKSQFRAVDDEGNSGLHLAA-TAINPELWRISGVALQMQWEVKWYKYVKKSM-PLHFFAH 485
           + + +   D   N+ LH AA  A    L  I G ALQMQ E++W+K V+K + P H    
Sbjct: 352 LATNW---DIFHNNMLHHAAYRAPASRLNLIPGAALQMQRELQWFKEVEKLVQPKHRKMV 411

Query: 486 YNNSGKTATTIFHETHKDLMKESGEWLTKTAKSCSVVGTLIVTVAFTSAVSIPGGFDNTD 545
                KT   +F + HKDL+++  +W+ +TA SC+VV  LI T+ F+SA ++PGG+  +D
Sbjct: 412 NLKQKKTPKALFTDQHKDLVEQGEKWMKETATSCTVVAALITTMMFSSAFTVPGGY-RSD 471

Query: 546 GAPLLEKQPAFLIFAVFSLIALCLSSTSVIMFLSILTYRFDAHDFKSNLPWKLFLGFSSL 605
           G PL   Q  F IF +   I+L  S  S++MFL IL  R+   DF  +LP KL +G  +L
Sbjct: 472 GMPLYIHQHRFKIFLISDAISLFTSCMSLLMFLGILKSRYREEDFLRSLPTKLIVGLLAL 531

Query: 606 FFSIISMLVSFCSGHYFLIDHRLQNVAVLLYTLVFLPVVLFFFLSKLPLYIDVLQTIF 659
           F S+ +M+V+F      L+  ++  V+     L  +P+ +F  L + P+ +++ +  +
Sbjct: 532 FLSMATMIVTFVVTLMTLVGEKISWVSAQFMFLAVIPLGMFVVL-QFPVLLEIFRATY 584

BLAST of Tan0011111 vs. TAIR 10
Match: AT5G04700.1 (Ankyrin repeat family protein )

HSP 1 Score: 132.5 bits (332), Expect = 1.6e-30
Identity = 96/291 (32.99%), Positives = 147/291 (50.52%), Query Frame = 0

Query: 357 KNAEDVSESAMLLATKNGVIEIVRGMYKRFPLAIRDSRKDKKNVV-LLAAEYRQPDVYRF 416
           K   +  + A+L A + G ++ +  M +     +  +R    + + LLA E+RQ  V+  
Sbjct: 353 KERSETVDEALLFAVRYGNVDFLVEMIRNNSELLWSTRTSSSSTLFLLAVEFRQEKVFSL 412

Query: 417 LLKKKKEIKSQFRAVDDEGNSGLHLAATAINP-ELWRISGVALQMQWEVKWYKYVKKSMP 476
           L              D +GN  LHLA     P +L  + G  LQ+Q E++W+K V++  P
Sbjct: 413 LYGLDDRKYLLLADKDCDGNGVLHLAGFPSPPSKLSSVVGAPLQLQRELQWFKEVERIAP 472

Query: 477 LHFFAHYNNSGKTATTIFHETHKDLMKESGEWLTKTAKSCSVVGTLIVTVAFTSAVSIPG 536
                  N   +T   IF + H+ L +E+ +W+  TA SCS+V  LIVTV F +  ++PG
Sbjct: 473 EIEKERVNTEEQTPIEIFTKEHQGLRQEAEKWMKDTAMSCSLVAALIVTVTFAAVFTVPG 532

Query: 537 GF-DNTDGAPLLEKQPAFLIFAVFSLIALCLSSTSVIMFLSILTYRFDAHDFKSNLPWKL 596
           G  DN+ G P   +   F+IF V  LI+   S TSV++FL ILT R+   DF   LP K+
Sbjct: 533 GTDDNSKGKPFHLRDRRFIIFIVSDLISCFASCTSVLIFLGILTARYSFDDFLVFLPTKM 592

Query: 597 FLGFSSLFFSIISMLVSFCSGHYFLIDHRLQNVAVLLYTLVFLPVVLFFFL 645
             G S LF SI +ML++F S  + ++    + +         LP +LF  L
Sbjct: 593 IAGLSILFVSIAAMLIAFSSALFTMMGKEGKWIVAPTILFACLPALLFVLL 643

BLAST of Tan0011111 vs. TAIR 10
Match: AT5G35810.1 (Ankyrin repeat family protein )

HSP 1 Score: 130.2 bits (326), Expect = 8.1e-30
Identity = 101/307 (32.90%), Positives = 163/307 (53.09%), Query Frame = 0

Query: 362 VSESAMLL--ATKNGVIEIVRGMYKRFPLAIRDSRKDKKNVVLLAAEYRQPDVYR--FLL 421
           V  S MLL  A ++G +E++  + + +P  I       +++  +AA  R   ++   + L
Sbjct: 28  VGSSPMLLFDAAQSGNLELLLILIRSYPDLIWTVDHKNQSLFHIAAINRHEKIFNRIYEL 87

Query: 422 KKKKEIKSQFRAVDDEGNSGLHLAATAINP-ELWRISGVALQMQWEVKWYKYVKKSMPLH 481
              K++ + ++  +   N  LHL A    P  L  +SG ALQMQ E+ WYK VK+ +P  
Sbjct: 88  GAIKDLIAMYKEKESNDNL-LHLVARLPPPNRLQVVSGAALQMQREILWYKAVKEIVPRV 147

Query: 482 FFAHYNNSGKTATTIFHETHKDLMKESGEWLTKTAKSCSVVGTLIVTVAFTSAVSIPGGF 541
           +    N   + A  +F + H +L KE  +W+ +TA +C +V TLI TV F +A ++PGG 
Sbjct: 148 YIKTKNKKEEVAHDLFTKEHDNLRKEGEKWMKETATACILVSTLIATVVFAAAFTLPGGN 207

Query: 542 D-----NTDGAPLLEKQPAFLIFAVFSLIALCLSSTSVIMFLSILTYRFDAHDFKSNLPW 601
           D      T G P   K+  F +F +   +AL  S TS+++FLSILT R+    F++ LP 
Sbjct: 208 DTSGDIKTLGFPTFRKEFWFEVFIISDSVALLSSVTSIMIFLSILTSRYAEASFQTTLPT 267

Query: 602 KLFLGFSSLFFSIISMLVSFCSGHYFLIDHRLQNVAVLLYTLVFLPVVLFFFLSKLPLYI 659
           KL LG  +LF SIISM+++F +    LI  +    +++L   V     L F +    L+ 
Sbjct: 268 KLMLGLLALFVSIISMVLAF-TATLILIRDQEPKWSLILLVYVASATALSFVVLHFQLWF 327

BLAST of Tan0011111 vs. TAIR 10
Match: AT5G04730.1 (Ankyrin-repeat containing protein )

HSP 1 Score: 129.4 bits (324), Expect = 1.4e-29
Identity = 91/262 (34.73%), Positives = 136/262 (51.91%), Query Frame = 0

Query: 398 KNVVLLAAEYRQPDVYRFLLKKKKEIKSQFRAVDDEGNSGLHLAATAINP-ELWRISGVA 457
           +N+  LA E+++  ++  +        +  R+ D   N+ LH+A     P +L +ISG A
Sbjct: 330 RNLFQLAVEFKKEKIFNLIHGLDDRKVTLLRSYDKGNNNILHIAGRLSTPDQLSKISGAA 389

Query: 458 LQMQWEVKWYKYVKKSMPLHFFAHYNNSGKTATTIFHETHKDLMKESGEWLTKTAKSCSV 517
           L+MQ E +W+K V+  +        N   KT   IF   H+ L KE  EW+  TA +CS 
Sbjct: 390 LKMQRESQWFKEVESLVSEREVVQKNKDNKTPRQIFEHYHEHLRKEGEEWMKYTATACSF 449

Query: 518 VGTLIVTVAFTSAVSIPGGFDNTDGAPLLEKQPAFLIFAVFSLIALCLSSTSVIMFLSIL 577
           V  LI TV F +  ++PGG D T G+PL+     F  F     +A   S  SV++FLSIL
Sbjct: 450 VAALIATVTFQAIFTVPGGIDGTSGSPLILNDLHFRAFIFTDTLAFFASCISVLIFLSIL 509

Query: 578 TYRFDAHDFKSNLPWKLFLGFSSLFFSIISMLVSFCSGHYFLIDHRLQNVAVLLYTLVFL 637
           T R+   DF  +LP K+ LG S LF SI SMLV+F +     + H+   +   L  L   
Sbjct: 510 TSRYSFDDFIVSLPRKMILGQSILFISIASMLVAFITSLSASMRHK-PALVYPLKPLASF 569

Query: 638 PVVLFFFLSKLPLYIDVLQTIF 659
           P +LF  L + PL  +++ + +
Sbjct: 570 PSLLFLML-QYPLLKEMISSTY 589

BLAST of Tan0011111 vs. TAIR 10
Match: AT3G54070.1 (Ankyrin repeat family protein )

HSP 1 Score: 122.1 bits (305), Expect = 2.2e-27
Identity = 90/265 (33.96%), Positives = 144/265 (54.34%), Query Frame = 0

Query: 403 LAAEYRQPDVYRFL--LKKKKEIKSQFRAVDDEGNSGLHLAATAINPELWRI-SGVALQM 462
           +AA YR  +++  +  L   K++ + ++    + ++ LHL A        ++ SG AL M
Sbjct: 295 VAALYRHENIFSLIYELGGIKDLIASYKEKQSK-DTLLHLVARLPPMNRQQVGSGAALHM 354

Query: 463 QWEVKWYKYVKKSMPLHFFAHYNNSGKTATTIFHETHKDLMKESGEWLTKTAKSCSVVGT 522
           Q E+ W+K VK+ +P  +    N  G+ A  IF E H++L KE   W+ +TA +C +  T
Sbjct: 355 QKELLWFKAVKEIVPRSYIETKNTKGELAHDIFTEQHENLRKEGERWMKETATACMLGAT 414

Query: 523 LIVTVAFTSAVSIPGGFD------NTDGAPLLEKQPAFLIFAVFSLIALCLSSTSVIMFL 582
           LI TV F +A++IPGG D      NT G P   K+  F IF +   +AL  S  S+++FL
Sbjct: 415 LIATVVFAAAITIPGGNDDSGDKANTLGFPNFRKRLLFDIFTLSDSVALFSSMMSIVIFL 474

Query: 583 SILTYRFDAHDFKSNLPWKLFLGFSSLFFSIISMLVSFCSGHYFLIDHRLQNVAVLLYTL 642
           SI T R+   DF+ +LP KL  G S+LF SIISM+++F      +   +     VL+  L
Sbjct: 475 SIFTSRYAEEDFRYDLPTKLMFGLSALFISIISMILAFTFSMILIRVEKASLSLVLISCL 534

Query: 643 VFLPVVLFFFLSKLPLYIDVLQTIF 659
             L  + F +L    L+ + L++++
Sbjct: 535 ASLTALTFAYL-YFHLWFNTLRSVY 557

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
Q9LE89          1.4e-10   34.22     Chlorophyllase type 0 OS=Chenopodium album OX=3559 GN=CACLH PE=1 SV=1
O22527          2.7e-06   28.73     Chlorophyllase-1 OS=Arabidopsis thaliana OX=3702 GN=CLH1 PE=1 SV=1
Q94LX1          2.6e-04   28.05     Chlorophyllase-1, chloroplastic OS=Citrus unshiu OX=55188 PE=2 SV=1
Q9MV14          3.3e-04   28.05     Chlorophyllase-1, chloroplastic OS=Citrus sinensis OX=2711 GN=CHLASE1 PE=1 SV=1
Match Name      E-value   Identity  Description
XP_022995621.1  1.0e-268  55.97     uncharacterized protein LOC111491104 isoform X2 [Cucurbita maxima]
XP_022995620.1  1.1e-267  55.81     uncharacterized protein LOC111491104 isoform X1 [Cucurbita maxima]
XP_022995622.1  1.6e-266  55.81     uncharacterized protein LOC111491104 isoform X3 [Cucurbita maxima]
KAG6606413.1    2.1e-258  54.18     Chlorophyllase type 0, partial [Cucurbita argyrosperma subsp. sororia]
XP_022931013.1  3.6e-258  54.27     uncharacterized protein LOC111437338 isoform X2 [Cucurbita moschata]
Match Name      E-value   Identity  Description
A0A6J1K2F1      4.9e-269  55.97     uncharacterized protein LOC111491104 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1K4G3      5.5e-268  55.81     uncharacterized protein LOC111491104 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1JZF8      7.9e-267  55.81     uncharacterized protein LOC111491104 isoform X3 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1EX58      1.8e-258  54.27     uncharacterized protein LOC111437338 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1ESA5      4.3e-257  54.05     uncharacterized protein LOC111437338 isoform X1 OS=Cucurbita moschata OX=3662 GN...
Match Name      E-value   Identity  Description
AT3G18670.1     3.3e-31   32.21     Ankyrin repeat family protein
AT5G04700.1     1.6e-30   32.99     Ankyrin repeat family protein
AT5G35810.1     8.1e-30   32.90     Ankyrin repeat family protein
AT5G04730.1     1.4e-29   34.73     Ankyrin-repeat containing protein
AT3G54070.1     2.2e-27   33.96     Ankyrin repeat family protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                               Source       Source Term      Source Description      Alignment
IPR002110  Ankyrin repeat                                SMART        SM00248          ANK_2a                  coord: 126..155, e-value: 170.0, score: 10.7
IPR002110  Ankyrin repeat                                SMART        SM00248          ANK_2a                  coord: 48..78, e-value: 0.54, score: 19.3
IPR002110  Ankyrin repeat                                SMART        SM00248          ANK_2a                  coord: 92..122, e-value: 43.0, score: 13.0
IPR002110  Ankyrin repeat                                SMART        SM00248          ANK_2a                  coord: 396..425, e-value: 210.0, score: 10.1
IPR002110  Ankyrin repeat                                SMART        SM00248          ANK_2a                  coord: 165..195, e-value: 260.0, score: 9.4
None       No IPR available                              PFAM         PF13637          Ank_4                   coord: 95..143, e-value: 1.6E-5, score: 25.4
None       No IPR available                              MOBIDB_LITE  mobidb-lite      disorder_prediction     coord: 312..332
None       No IPR available                              PANTHER      PTHR24177:SF289  ANKYRIN REPEAT PROTEIN  coord: 37..666
None       No IPR available                              PANTHER      PTHR24177        CASKIN                  coord: 37..666
IPR017395  Chlorophyllase                                PFAM         PF07224          Chlorophyllase          coord: 681..843, e-value: 2.5E-16, score: 59.6
IPR026961  PGG domain                                    PFAM         PF13962          PGG                     coord: 505..617, e-value: 9.5E-27, score: 93.1
IPR036770  Ankyrin repeat-containing domain superfamily  GENE3D       1.25.40.20       -                       coord: 9..250, e-value: 7.6E-24, score: 86.2
IPR036770  Ankyrin repeat-containing domain superfamily  GENE3D       1.25.40.20       -                       coord: 358..501, e-value: 5.0E-7, score: 31.4
IPR036770  Ankyrin repeat-containing domain superfamily  SUPERFAMILY  48403            Ankyrin repeat          coord: 17..221
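
The InterPro coordinates can be compared with the query ranges of the BLAST alignments to see which predicted domains a hit covers. The sketch below is illustrative only and makes some assumptions: the helper names `parse_coord`, `span_length` and `overlap` are hypothetical, coordinates are treated as 1-based and inclusive, and the query range 366..659 is read off the AT3G18670.1 alignment in the TAIR 10 section.

```python
import re


def parse_coord(text):
    """Extract (start, end) from a string such as 'coord: 505..617'."""
    match = re.search(r"(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError("no coordinate range in: %r" % text)
    return int(match.group(1)), int(match.group(2))


def span_length(span):
    """Length of a 1-based, inclusive residue range."""
    start, end = span
    return end - start + 1


def overlap(a, b):
    """Number of residues shared by two inclusive ranges (0 if disjoint)."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


pgg = parse_coord("coord: 505..617")             # PF13962, PGG domain
chlorophyllase = parse_coord("coord: 681..843")  # PF07224, Chlorophyllase
tair_hit = (366, 659)                            # query range of the AT3G18670.1 HSP

print(span_length(pgg), span_length(chlorophyllase))
print(overlap(tair_hit, pgg), overlap(tair_hit, chlorophyllase))
```

Under these assumptions the Arabidopsis ankyrin-repeat hit spans the whole PGG domain but does not reach the Chlorophyllase domain, which is consistent with the differing descriptions of the Swiss-Prot and TAIR matches.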

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Tan0011111.1   Tan0011111.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015996      chlorophyll catabolic process
molecular_function  GO:0047746      chlorophyllase activity
molecular_function  GO:0005515      protein binding