Tan0014409 (gene) Snake gourd v1

Overview
Name: Tan0014409
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: IENR2 domain-containing protein
Location: LG01: 8337341 .. 8342500 (+)
RNA-Seq Expression: Tan0014409
Synteny: Tan0014409
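
The Location field gives a start and end coordinate on LG01. A minimal sketch of how the gene span in base pairs can be derived from it, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, which this page does not state explicitly). The function name `span_length` is illustrative.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    # Both endpoints belong to the feature, hence the +1.
    return end - start + 1


# Coordinates taken from the Location field: LG01: 8337341 .. 8342500 (+)
gene_length = span_length(8337341, 8342500)
print(gene_length)
```

Under that assumption the span covers exons and introns together, so it is expected to be longer than the CDS.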
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCATACATTCTCATGTAAGTTCTTCTCCATTCTCCTACATTTTCAAGCTCGCCGGAAATTTCTTTGTAACGAAATTTCGTATTCTTTCTCTGTTGTGAGTCTAAGTGTCGCCCAGATTTCCCCCATTAACATCTTTGTTTTAGATTGTAAGGAAGTGGCCATGAAAGTTATGGCACAGTTCTTTCTGTTTATTTTTGTATTCCCCCCTCCCCACATCAAAAGATTAAATAGGCGTCATTTTCCTTTAGTTTTCTGCAATTGTTTTGCGTGCATTCGTTAGTATCGTGAAGGCTGAGTGAAGTTGAAGGGTAATGTGACTCGATATATTTCCTGCATTTTAGAACCTTATTATCTTCTTCGTGGCCACTAGTGTACTTTAACTTTAGGATAATTTGCACTCCGGAGGACAATGAGTTTGATCCTCGGACCTCGACGGGAGACGAAGATGTCTTAACCGCTTAGCTATGCTCACATTTGCTTTGGAGATTTTGTTTTTTTGCTTAATGGTTATGGGAAATCTGTAAATATAGCAAATGTTTATTTTATTGGCAGGTCACATTAATAGAAAAAACATGAATTTAAGCTAGGCTTTAGGACATATTGAAATGCAACCTCCAATATTGTTTGGATTTATGTTACTCATTTTGGAATTGAACCATAAATGACTTTCCGTATTATCTGTGTCACTCCTTGGTGATAGGGTGTACATGTACATCTAAGCAGCAATCAAACGATTGATTGGAACCACGATAAAACTTTTGAAAGTACTAGGATTAAATTGAAACTAAAATTGAAGCATAGAGTTAAAAAAGTGTTTTTTAACCTTTATACATTATTTTTAGTTGACTAATTTATACCATTAATATTTATTAATTACTAGTCAGAAAGAAGTGAAGGATTATGACTCTGTAAACTTGAGAATGTTGTTCTTCTTTTAAGGGTCGAAAGGAAAAACATCAGATGTTGCAACTCTCTTTCACAATACATACGTATCTTATTGCATTTCTGAATGTATAACAGGAGATTATCAGGTGCAACATTATCTGTAAATCTTGCTCCAAATTATGCTATCTGGAAGATTTTCTATTACCCAGTAGCCAACATCAATCTTCCATCAAATGCGCTGCCAGTGAATCAGCAGATATCTACAGTCAGGAACACTTCATTATTTTCTCCTTTCAATATCTTCAACAAAACAAATTCTTCTCAATCCTTGCTGCTCATGGTTGATGAGGGTAGAAATTCCAACTCTGGTGAGTGCTACAAGTCCAAGTGTTCCTCAGGTTCATTTGAGAAGCAGGTTTTGAGCAGAGATGCCGAGGATGATGATTGTCCAGAAAATCTTGAGACTGGAAATTACAAGGAATGGCAAAGACGAAGAAAAATAGGACTGGCAAATAAAGGCAGAGTACCATGGAACAAGGGCAAGAAACACAACTTGGGTAAGAACATTTAGTAGGTCTTCTACTAGGAGTGGTGAATTTTAAATGGCCAGTGGCTACAGTTTATACTTGGTGGAGTTTGGCGCTTGGGATTTTAACCTATAATTATCTGCAGAAACTCGTAAGCGAATCAAGCAGAGAACAATTGAAGCCTTGAGAGACCCTGAGGTGAGGTTCTACCTACTTAAGCCTACTTGTCAGTGTTATCTGTAAATCATACTTTCTTGATGTTGTTGTGTTAAAGTATTGATATAATTAAATTTACACCATAACTCATAAGCTTAAGCTTTTGGGTTGGTTGAAGGAAGTCCTGTGTTCGGACCCCTCATTCCCCCTAATTAATATTGATTTTCACTTGTTTGGGTCTTCTACAAATTTAAAGTCCACAAGTGGGGAAAGTGTTAAAATATTGATAGTTAACCTTAATGTAACTCATATGCTTAAGCTTTTTGGGTTAATTGGTGATTGAACATGTTGGATTGAATATGCTTTCTGCAAATTGTCTAACCTTGTCTCACTTTGAACATCCTACTACTTGTTGCTATACATGGTCTT
CTTCTCACTGCTAGACTTTTTTCCATCTAGTACTCATGAAAAGGAAAATAAGCATCGAAATACAATGGAAAAGAAAAGCCCGTTCTTCTGCACGTTCATTATGGACGTTTATTTTTTAGATTGATTCACCAATCATATTCAGTAACTTCAACCTTTTCATTTTTCTATACTTCAGGTGAGGAGGAAGATGTCCGAATATTCCCGCACTCATAGGTTTCACTCTTAACCTCTCACACTTTTATTTCTTAATGCTGAATGAACTTAAAAATGCTATGTTTCCCATGAACTTAGAAGATGCAAAATAGCGGGTCATATTCATATGATATGCCCATCATATTTTCAGAGTAATGATGGGACTAGGCCATTAAGGAGGGTAGAATAGTTGGATAACAGATTATGAGGATATACACACCAGAAAGTTGTTGCTGTGGTCTTTTAGTATATTAGAAGTTGAACTTTGTGGAGGCAAAAGTGGATGGAATCAGGAGGAAATCTACTGCCGGACATTAAATAATTCAGAAAAAATCAATGGAAAGAACTAAGGTGGGTTTTCATGGAAGTTCAAGGATTAAAACAGAATGTTTGAAAGTTTAAAAACTAATAGAAAAAGTATGAAACTTTGGAGACCTTATTTAATTTTTTAACTAACAGTATCTGTTTTTTCAAAAAAAAGGAAAAAAAGGGCATCTGATCTGACACATGTAAGTAGCAAGCAAGTACATTTTTGATGATTTTCTTTGAGCCCTCGTGACTTTCGAGAACTTGTTTCTAGACTCTGCATGTATAAATTGATCTCACACTGTAGTAACTACGTTGAAGATTGCCGGTTTCCCTCTTGCTTAACAGAAGAATTAACAACACATGATTTCAGATTTTGTTGGATATTTAAGTTGAAAGCATTTTGTAAACCTTACAGTATACTTTCAAGATCTCTTATAAAAGGGGAAAAAAGGAAATTCCTTTTTTAGGATCTGGCTCTGAAATCATAAAAAGCTTATGTTAAATCATCAATTTACCCAAAAGCTTAAGCGGATAGATTGGGGGTAAATTTAATTATATCAATATAGTCCAACACTCTCTCTCATTTATGGGCTTGGAAATTTGAAGAAAAGCCAAACAAGTGGAAATTAATATTAATTGGGGAAGAAACGACTTTACAGGGTGTTTGAACTCAAGATCTCCTGCTCTAATATCATGTTAAACCACCAATCAACCTAAAAGCTTAAGCTGATAGGTTGGGGTAAATTTAATTATATCAATATAGTCTAACAACTTCTTCTCTTTGATCAACTTGTAACCACCCTGGCATATAGTTAAATAAGGCTGTGGGTGAAAATGCTGGACTCTTATGGTCCTAGAATTGATGTCATTGGCTTCAAAATCTCAACATTTAGCATTTGACCATGTGCAATTAAAAAACTCGCTTATGTTCAATATTCCTTAGAATTTTACTGTCTACTGGAAGCAAGTTCTTCGTCTCAAAAACTTATTAGTTTGAGAGAGTTTCCTCGTTATCAAGTGGAATTGTTCAGCTTGATGAGGCAAACCTCGTCTGGTTATTAGGTATAAACATGGTAGGAAGTTGTTAAGATACAGTCGTGCATTATTCTATTTGCAGTGACCAGGTCAAGGTTAAAATTAGCTCTTCACTCAGACGTGTATGGGGAAAGAGATTAATGAAGAAGAGATTAAATGAGACATTCTTTTTTTCCTGGATGGAAAGCATAGCTGTTGCTGCGAAGAAAGGAGGCAAGGAAGAACAAGAACTCGACTGGGATAGCTATGACAAGATAAAACAAGAAATGCTCCACCAAAAGCTTCAATGGGCTGCAGAGAAGGCAAAGTTAAAAGTGGTGAGAGCAGAGAATGCAAAAAAGAGAAAAGTCGAAGGGAGGGTTCGTAAGAAAGAAAAGGGGGATGATAATGCCAAAACAAAGAAAATGAAAATGTGTTCCAGAAGAAGAAATGGGGGGAAAAGGAAGGCAAAAGAGGGAGAAGAC
ATCCAGACAAAAATGAAGAAACTGACTGCCACTGAAAGATCAAGGCTTAAGCAGAGACTAAAAAAGGTAATTATCAGGGAGGTCCTGTAGATTTACTTAATAGTTTGAATGAAATTGAAATTCTTCAAGTAGTAATGTAAAACGTGTGAATGCTCATTTCCACTAGATGGATCTAAATGATAGCATGCATCCAATGGAACTGATCTATAACAAAAGCCAACCTATCATAACTAATTGGAAATGTAAATTTTTAGGCCAGCCTTCTTCATTTTAATCTGTTAGCCCTCTGAAATGTAAAATGAACTGTCGATAATGAGGTTCTTATTATGCTTTTATGTCTCCTTCTGTTTATGTTATCCTTTTTCACTCGTCTTTGAATCTAGATTCGCAAAAAGATTACAATAAATGGTGCAGTTGCTGCTCAAGGAAGCATAGCATCAGTTATACCCCAAAACACATCCTGGGAAAAACTGGATCTAGATCTTATAAAGAAAGGACAAATGAGGAAAGAAGTATCGCTGGCAGATCAAATTCAAGTTGCCAAGAATAGGAAAGCAGAATCTATAGCCTGCAAAGTTCTTGTAGCCTCTACTTTGTCGTACGGATGCACCAGGGAAGCGGAAAGGTAACCTTTGACTGACCTTCATTGCTTGCAAGAAGAACAAATGTCTGATGTCTCTGAAGGTGACTTTTCCTCCCATCCAAGGTGATTTTCATTTTATCAATGGCTTATGTTTTTATATGTTCTTTTGTTTTGTTCAACGTGAAAGATCACTTCCAGTCATTGTTCTATTGCAGAACACAAATATAATACTTTTCTGTAATTCTTGTTTTGACTTCTGCAGATTCTTGGAACATTTAGATCCATTGGAGTCACAGAAGCTTGTTCAATTTTCGAGGCAACGGCTACAGCTCATTAGTTTTGAAAGACTTGAAGAAAGTTACAGTGTATTTGCAGGCTGGATTCTGATAAGCTATTGAGCAACAGCTGCATGTCCATGGCATCAGATAGATGGCTTGCTTGCCTTTTGCTCAATCTGTTCCATAATTGTGTGGGGTATCTAAAGATACTAAATGCCTCTCTGAAGTCAAAGTTGGTATATTTACAAACAAAAAGTTGGATTTTTTTTTTTTCAAAGAAAGATATGGTAGAAAAATAA

mRNA sequence

ATGCCATACATTCTCATTGTCGCCCAGATTTCCCCCATTAACATCTTTGTTTTAGATTGTAAGGAAGTGGCCATGAAAGTTATGGCACAGAGATTATCAGGTGCAACATTATCTGTAAATCTTGCTCCAAATTATGCTATCTGGAAGATTTTCTATTACCCAGTAGCCAACATCAATCTTCCATCAAATGCGCTGCCAGTGAATCAGCAGATATCTACAGTCAGGAACACTTCATTATTTTCTCCTTTCAATATCTTCAACAAAACAAATTCTTCTCAATCCTTGCTGCTCATGGTTGATGAGGGTAGAAATTCCAACTCTGGTGAGTGCTACAAGTCCAAGTGTTCCTCAGGTTCATTTGAGAAGCAGGTTTTGAGCAGAGATGCCGAGGATGATGATTGTCCAGAAAATCTTGAGACTGGAAATTACAAGGAATGGCAAAGACGAAGAAAAATAGGACTGGCAAATAAAGGCAGAGTACCATGGAACAAGGGCAAGAAACACAACTTGGAAACTCGTAAGCGAATCAAGCAGAGAACAATTGAAGCCTTGAGAGACCCTGAGGTGAGGAGGAAGATGTCCGAATATTCCCGCACTCATAGTGACCAGGTCAAGGTTAAAATTAGCTCTTCACTCAGACGTGTATGGGGAAAGAGATTAATGAAGAAGAGATTAAATGAGACATTCTTTTTTTCCTGGATGGAAAGCATAGCTGTTGCTGCGAAGAAAGGAGGCAAGGAAGAACAAGAACTCGACTGGGATAGCTATGACAAGATAAAACAAGAAATGCTCCACCAAAAGCTTCAATGGGCTGCAGAGAAGGCAAAGTTAAAAGTGGTGAGAGCAGAGAATGCAAAAAAGAGAAAAGTCGAAGGGAGGGTTCGTAAGAAAGAAAAGGGGGATGATAATGCCAAAACAAAGAAAATGAAAATGTGTTCCAGAAGAAGAAATGGGGGGAAAAGGAAGGCAAAAGAGGGAGAAGACATCCAGACAAAAATGAAGAAACTGACTGCCACTGAAAGATCAAGGCTTAAGCAGAGACTAAAAAAGATTCGCAAAAAGATTACAATAAATGGTGCAGTTGCTGCTCAAGGAAGCATAGCATCAGTTATACCCCAAAACACATCCTGGGAAAAACTGGATCTAGATCTTATAAAGAAAGGACAAATGAGGAAAGAAGTATCGCTGGCAGATCAAATTCAAGTTGCCAAGAATAGGAAAGCAGAATCTATAGCCTGCAAAGTTCTTGTAGCCTCTACTTTGTCGTACGGATGCACCAGGGAAGCGGAAAGATTCTTGGAACATTTAGATCCATTGGAGTCACAGAAGCTTTGTATTTGCAGGCTGGATTCTGATAAGCTATTGAGCAACAGCTGCATGTCCATGGCATCAGATAGATGGCTTGCTTGCCTTTTGCTCAATCTGTTCCATAATTGTGTGGGGTATCTAAAGATACTAAATGCCTCTCTGAAGTCAAAGTTGGTATATTTACAAACAAAAAGTTGGATTTTTTTTTTTTCAAAGAAAGATATGGTAGAAAAATAA

Coding sequence (CDS)

ATGCCATACATTCTCATTGTCGCCCAGATTTCCCCCATTAACATCTTTGTTTTAGATTGTAAGGAAGTGGCCATGAAAGTTATGGCACAGAGATTATCAGGTGCAACATTATCTGTAAATCTTGCTCCAAATTATGCTATCTGGAAGATTTTCTATTACCCAGTAGCCAACATCAATCTTCCATCAAATGCGCTGCCAGTGAATCAGCAGATATCTACAGTCAGGAACACTTCATTATTTTCTCCTTTCAATATCTTCAACAAAACAAATTCTTCTCAATCCTTGCTGCTCATGGTTGATGAGGGTAGAAATTCCAACTCTGGTGAGTGCTACAAGTCCAAGTGTTCCTCAGGTTCATTTGAGAAGCAGGTTTTGAGCAGAGATGCCGAGGATGATGATTGTCCAGAAAATCTTGAGACTGGAAATTACAAGGAATGGCAAAGACGAAGAAAAATAGGACTGGCAAATAAAGGCAGAGTACCATGGAACAAGGGCAAGAAACACAACTTGGAAACTCGTAAGCGAATCAAGCAGAGAACAATTGAAGCCTTGAGAGACCCTGAGGTGAGGAGGAAGATGTCCGAATATTCCCGCACTCATAGTGACCAGGTCAAGGTTAAAATTAGCTCTTCACTCAGACGTGTATGGGGAAAGAGATTAATGAAGAAGAGATTAAATGAGACATTCTTTTTTTCCTGGATGGAAAGCATAGCTGTTGCTGCGAAGAAAGGAGGCAAGGAAGAACAAGAACTCGACTGGGATAGCTATGACAAGATAAAACAAGAAATGCTCCACCAAAAGCTTCAATGGGCTGCAGAGAAGGCAAAGTTAAAAGTGGTGAGAGCAGAGAATGCAAAAAAGAGAAAAGTCGAAGGGAGGGTTCGTAAGAAAGAAAAGGGGGATGATAATGCCAAAACAAAGAAAATGAAAATGTGTTCCAGAAGAAGAAATGGGGGGAAAAGGAAGGCAAAAGAGGGAGAAGACATCCAGACAAAAATGAAGAAACTGACTGCCACTGAAAGATCAAGGCTTAAGCAGAGACTAAAAAAGATTCGCAAAAAGATTACAATAAATGGTGCAGTTGCTGCTCAAGGAAGCATAGCATCAGTTATACCCCAAAACACATCCTGGGAAAAACTGGATCTAGATCTTATAAAGAAAGGACAAATGAGGAAAGAAGTATCGCTGGCAGATCAAATTCAAGTTGCCAAGAATAGGAAAGCAGAATCTATAGCCTGCAAAGTTCTTGTAGCCTCTACTTTGTCGTACGGATGCACCAGGGAAGCGGAAAGATTCTTGGAACATTTAGATCCATTGGAGTCACAGAAGCTTTGTATTTGCAGGCTGGATTCTGATAAGCTATTGAGCAACAGCTGCATGTCCATGGCATCAGATAGATGGCTTGCTTGCCTTTTGCTCAATCTGTTCCATAATTGTGTGGGGTATCTAAAGATACTAAATGCCTCTCTGAAGTCAAAGTTGGTATATTTACAAACAAAAAGTTGGATTTTTTTTTTTTCAAAGAAAGATATGGTAGAAAAATAA

Protein sequence

MPYILIVAQISPINIFVLDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSSGSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVKVKISSSLRRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRNGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLVASTLSYGCTREAERFLEHLDPLESQKLCICRLDSDKLLSNSCMSMASDRWLACLLLNLFHNCVGYLKILNASLKSKLVYLQTKSWIFFFSKKDMVEK
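
The correspondence between the CDS and the protein sequence above can be spot-checked with a short translation sketch. It assumes the standard nuclear genetic code (NCBI translation table 1); the function name `translate` is illustrative, and the two test fragments are copied from the first and last codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt and last 18 nt of the CDS shown above.
print(translate("ATGCCATACATTCTCATTGTCGCC"))
print(translate("GATATGGTAGAAAAATAA"))
```

The two fragments should reproduce the first eight and last five residues of the protein sequence; running the full CDS through the same function gives a whole-sequence check.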
Homology
BLAST of Tan0014409 vs. NCBI nr
Match: XP_038897968.1 (uncharacterized protein LOC120085828 [Benincasa hispida])

HSP 1 Score: 603.2 bits (1554), Expect = 2.1e-168
Identity = 324/418 (77.51%), Positives = 357/418 (85.41%), Query Frame = 0

Query: 13  INIFVLDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQIS 72
           + + ++DC    M     RL G TLSVNLAPN A+WKI YYP+ANINLP NA P+NQQ++
Sbjct: 38  VGLHLMDCHFTRMPYTHMRLLGTTLSVNLAPNPALWKISYYPIANINLPPNAGPINQQMT 97

Query: 73  TVRNTSLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSSGSFEKQVLSRDAEDD 132
            +R+ S+FSP +IFN+ +SSQ++L MVDEGRNSN  ECYKSKCSSG  EK V+S     +
Sbjct: 98  IIRSDSVFSPLDIFNRRSSSQAMLFMVDEGRNSNFAECYKSKCSSGPIEKLVVS---NKN 157

Query: 133 DCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRK 192
           D PENLET N KE QRR+KIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDP+VRRK
Sbjct: 158 DSPENLETENDKELQRRKKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPKVRRK 217

Query: 193 MSEYSRTHSDQVKVKISSSLRRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELD 252
           MSEY RTHSDQVKVKISSSLR VWGKRLMKKRLNETFF SWMESIAVAAKKGGKEEQELD
Sbjct: 218 MSEYPRTHSDQVKVKISSSLRCVWGKRLMKKRLNETFFLSWMESIAVAAKKGGKEEQELD 277

Query: 253 WDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMC 312
           WDSYDKIKQE+LHQ LQ  AEK KLKV RAEN KK+KV+G V KKEKG+DN+KTKK+KMC
Sbjct: 278 WDSYDKIKQEILHQDLQRVAEKTKLKVTRAENVKKKKVQGMVHKKEKGEDNSKTKKLKMC 337

Query: 313 SRRRNGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIP 372
           SRRRNGGKRK KEG+D   KMKK T  ERS+LKQRLKKIRKKI+ NGAV AQGSIASV P
Sbjct: 338 SRRRNGGKRKGKEGDDTLRKMKKSTTIERSKLKQRLKKIRKKISTNGAVIAQGSIASVAP 397

Query: 373 QNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLVASTLSYGCTREAE 431
           +NTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAES ACKVL+ASTL+Y CT  AE
Sbjct: 398 KNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESTACKVLIASTLTYQCTGFAE 452
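
The percentages in each HSP header are plain ratios of matching columns to alignment length. A minimal sketch that reproduces the figures reported for the first hit (XP_038897968.1); the helper name `percent` is illustrative and not part of any BLAST tool.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of aligned columns that match, as a percentage with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Values from the HSP header above: 324 identical and 357 positive columns out of 418.
identity = percent(324, 418)
positives = percent(357, 418)
print(identity, positives)
```

Positives count identical residues plus conservative substitutions (the `+` columns in the alignment midline), which is why that figure is never lower than the identity.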

BLAST of Tan0014409 vs. NCBI nr
Match: XP_008451721.1 (PREDICTED: uncharacterized protein LOC103492929 isoform X1 [Cucumis melo])

HSP 1 Score: 601.7 bits (1550), Expect = 6.1e-168
Identity = 326/415 (78.55%), Positives = 353/415 (85.06%), Query Frame = 0

Query: 18  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNT 77
           +DC    M  +  RL G T +V LAPN A+WKI YYPVANIN PSNA P+N Q+S +R+ 
Sbjct: 1   MDCHFTRMPYIHMRLLGTTFTVKLAPNPALWKISYYPVANINFPSNATPINHQMSIIRSD 60

Query: 78  SLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSSGSFEKQVLSRDAEDDDCPEN 137
           SLFSPFN+FN+T+SSQ+ L MVDEGRNS+ GECYKSKCSS S EKQVLS     DD PEN
Sbjct: 61  SLFSPFNVFNRTSSSQAFLFMVDEGRNSHFGECYKSKCSSCSIEKQVLS---NKDDSPEN 120

Query: 138 LETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYS 197
           LET N  EWQRR+KIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEY 
Sbjct: 121 LETENDNEWQRRKKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYP 180

Query: 198 RTHSDQVKVKISSSLRRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYD 257
           RTHSDQVKVKISSSLRRVWGKRL+KKRLNETFF SWMESIAVAAKKGGKEEQELDWDSYD
Sbjct: 181 RTHSDQVKVKISSSLRRVWGKRLLKKRLNETFFLSWMESIAVAAKKGGKEEQELDWDSYD 240

Query: 258 KIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRN 317
           KIKQE LHQ+LQ  AEK KLK +RAENAK R+V+ RVRKKEKGDD AKTKK+KMCSRRR+
Sbjct: 241 KIKQETLHQELQRVAEKEKLKAMRAENAKMREVQRRVRKKEKGDDYAKTKKLKMCSRRRD 300

Query: 318 GGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSW 377
            GKRK KE +D   KMKK T  ERS+LKQRLKKIRKKI+INGAV  QGSIASV PQNTSW
Sbjct: 301 AGKRKGKEEDDNLRKMKKSTTIERSKLKQRLKKIRKKISINGAVTTQGSIASVAPQNTSW 360

Query: 378 EKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLV-ASTLSYGCTREAER 432
           E LDLDLIKKGQMRKE SLADQIQVAKNRKAES ACKVL+ ASTL++ CT  AER
Sbjct: 361 ETLDLDLIKKGQMRKEASLADQIQVAKNRKAESTACKVLIAASTLAFQCTGVAER 412

BLAST of Tan0014409 vs. NCBI nr
Match: KAG6600400.1 (hypothetical protein SDJN03_05633, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 596.7 bits (1537), Expect = 2.0e-166
Identity = 328/453 (72.41%), Positives = 365/453 (80.57%), Query Frame = 0

Query: 18  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNT 77
           +D     M  +  +L GATLSVNLAPN AIWK FYYPVAN+NLPSN +P+NQQIS  RN 
Sbjct: 1   MDYHFTRMPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNLPSNVMPMNQQISICRND 60

Query: 78  SLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSSGSFEKQVLSRDAEDDDCPEN 137
           SL SP N+FN+TNSSQSLL +V EGR SNSGECYKSKCSSGSFEKQV SR+  DDDCPEN
Sbjct: 61  SLSSPSNVFNRTNSSQSLLFIVAEGRISNSGECYKSKCSSGSFEKQVSSRNIGDDDCPEN 120

Query: 138 LETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYS 197
            ET N KEWQRRRKIG+ANKG+VPWNKGKKH+LETRKRIKQRTIEAL++P+VRRKMSEY 
Sbjct: 121 HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYP 180

Query: 198 R-THSDQVKVKISSSLRRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSY 257
           R THSDQVK KISSSLRRVWGKRL+KKRLNE FF SW ESIAVAAKKGGKEEQELDWDS+
Sbjct: 181 RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFRSWKESIAVAAKKGGKEEQELDWDSH 240

Query: 258 DKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRR 317
           DKI QEMLHQKL+   EK KLK++RAENAKKRK++GR          AK KK KMCSRRR
Sbjct: 241 DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGR---------GAKIKKRKMCSRRR 300

Query: 318 NGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTS 377
           NGGKR+ KEGEDIQ  MK+LTA ERS LKQRLKKIRKKI IN  VAAQGS+ASV+P+ T+
Sbjct: 301 NGGKRRMKEGEDIQRTMKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTT 360

Query: 378 WEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLVASTLSYGCTREAERFLEHL 437
           WEK+DLD IKKG++R+EVSLADQIQ AKNRKAESIACK+LVASTLSYGC   A       
Sbjct: 361 WEKMDLDRIKKGKLREEVSLADQIQFAKNRKAESIACKILVASTLSYGCAGGA------- 420

Query: 438 DPLESQKLCICRLDSDKLLSNSCMSMASDRWLA 470
                       +DS+K  S S  SMA+DRWLA
Sbjct: 421 ------------MDSEKRSSKSSTSMAADRWLA 425

BLAST of Tan0014409 vs. NCBI nr
Match: KAG7031061.1 (hypothetical protein SDJN02_05100 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 596.3 bits (1536), Expect = 2.5e-166
Identity = 333/475 (70.11%), Positives = 375/475 (78.95%), Query Frame = 0

Query: 18  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNT 77
           +D     M  +  +L GATLSVNLAPN AIWK FYYPVAN+NLPSN +P+NQQIS  RN 
Sbjct: 1   MDYHFTRMPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNLPSNVMPMNQQISICRND 60

Query: 78  SLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSSGSFEKQVLSRDAEDDDCPEN 137
           SL SP N+FN+TNSSQSLL +V EGR SNSGECYKSKCSSGSFEKQV SR+  DDDCPEN
Sbjct: 61  SLSSPSNVFNRTNSSQSLLFIVAEGRISNSGECYKSKCSSGSFEKQVSSRNIGDDDCPEN 120

Query: 138 LETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYS 197
            ET N KEWQRRRKIG+ANKG+VPWNKGKKH+LETRKRIKQRTIEAL++P+VRRKMSEY 
Sbjct: 121 HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYP 180

Query: 198 R-THSDQVKVKISSSLRRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSY 257
           R THSDQVK KISSSLRRVWGKRL+KKRLNE FF SW ESIAVAAKKGGKEEQELDWDS+
Sbjct: 181 RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFRSWKESIAVAAKKGGKEEQELDWDSH 240

Query: 258 DKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRR 317
           DKI QEMLHQKL+   EK KLK++RAENAKKRK++GR          AK KK KMCSRRR
Sbjct: 241 DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGR---------GAKIKKRKMCSRRR 300

Query: 318 NGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTS 377
           NGGKR+ KEGEDIQ  MK+LTA ERS LKQRLKKIRKKI IN  VAAQGS+ASV+P+ T+
Sbjct: 301 NGGKRRMKEGEDIQRTMKELTAIERSGLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTT 360

Query: 378 WEKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLVASTLSYGCTREAERFLEHL 437
           WEK+DLD IKKG++R+EVSLADQIQ AKNRKAESIACK+LVASTLSYGC   A       
Sbjct: 361 WEKMDLDRIKKGKLREEVSLADQIQFAKNRKAESIACKILVASTLSYGCAGGA------- 420

Query: 438 DPLESQKLCICRLDSDKLLSNSCMSMASDRWLACLLLNLFHNCVGYLKILNASLK 492
                       +DS+K  S S  SMA+DRWLA         C  Y+KIL++ ++
Sbjct: 421 ------------MDSEKRSSKSSTSMAADRWLAFCSACFIIVC-RYVKILSSLMQ 446

BLAST of Tan0014409 vs. NCBI nr
Match: XP_011653304.1 (uncharacterized protein LOC101207813 isoform X1 [Cucumis sativus] >KGN53591.1 hypothetical protein Csa_014799 [Cucumis sativus])

HSP 1 Score: 587.0 bits (1512), Expect = 1.5e-163
Identity = 322/414 (77.78%), Positives = 349/414 (84.30%), Query Frame = 0

Query: 18  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNT 77
           +DC    M  +  RL G T +V LAPN A+WKI YYPVANIN PSNA P+N Q+S VRN 
Sbjct: 1   MDCHFTRMPYIHMRLLGTTFTVKLAPNPALWKISYYPVANINFPSNAAPINHQMSIVRND 60

Query: 78  SLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSSGSFEKQVLSRDAEDDDCPEN 137
           S+FSPFNIFN+T+ SQ+ L MVDEGRNSN GECYKSKCSS S EKQVLS     DD PEN
Sbjct: 61  SVFSPFNIFNRTSFSQAFLFMVDEGRNSNFGECYKSKCSSCSIEKQVLS---NKDDSPEN 120

Query: 138 LETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYS 197
           LET N KEWQRR+KIGLANKGRVPWNKGKKHNLETR RIKQRTIEALRDPEVRRKMSEY 
Sbjct: 121 LETENDKEWQRRKKIGLANKGRVPWNKGKKHNLETRTRIKQRTIEALRDPEVRRKMSEYP 180

Query: 198 RTHSDQVKVKISSSLRRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYD 257
           R HSDQVKVKISSSLRRVWGKRLMKKRLNETFF SWMESIAVAAKKGGKEEQELDWDSYD
Sbjct: 181 RIHSDQVKVKISSSLRRVWGKRLMKKRLNETFFLSWMESIAVAAKKGGKEEQELDWDSYD 240

Query: 258 KIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRN 317
           KIKQE LHQ+L+  AEK KLK +R ENAK +KV+ RV KKEKGDDNAKTKK+KMCSRRR+
Sbjct: 241 KIKQETLHQELRRVAEKEKLKAMR-ENAKMKKVQRRVGKKEKGDDNAKTKKLKMCSRRRD 300

Query: 318 GGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSW 377
            GKRK KE ++++ K KK T  ERS+LKQRLKKIRKKI+INGAV AQGSIASV PQN  W
Sbjct: 301 EGKRKGKEDDNLR-KKKKSTTIERSKLKQRLKKIRKKISINGAVTAQGSIASVAPQNPCW 360

Query: 378 EKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLVASTLSYGCTREAER 432
           EKLDLDLIKKGQ  KE SLADQIQVAKNRKAES ACKVL+ASTL++ CT  AER
Sbjct: 361 EKLDLDLIKKGQTWKEASLADQIQVAKNRKAESTACKVLIASTLAFQCTGVAER 409

BLAST of Tan0014409 vs. ExPASy TrEMBL
Match: A0A1S3BS78 (uncharacterized protein LOC103492929 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103492929 PE=4 SV=1)

HSP 1 Score: 601.7 bits (1550), Expect = 2.9e-168
Identity = 326/415 (78.55%), Positives = 353/415 (85.06%), Query Frame = 0

Query: 18  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNT 77
           +DC    M  +  RL G T +V LAPN A+WKI YYPVANIN PSNA P+N Q+S +R+ 
Sbjct: 1   MDCHFTRMPYIHMRLLGTTFTVKLAPNPALWKISYYPVANINFPSNATPINHQMSIIRSD 60

Query: 78  SLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSSGSFEKQVLSRDAEDDDCPEN 137
           SLFSPFN+FN+T+SSQ+ L MVDEGRNS+ GECYKSKCSS S EKQVLS     DD PEN
Sbjct: 61  SLFSPFNVFNRTSSSQAFLFMVDEGRNSHFGECYKSKCSSCSIEKQVLS---NKDDSPEN 120

Query: 138 LETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYS 197
           LET N  EWQRR+KIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEY 
Sbjct: 121 LETENDNEWQRRKKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYP 180

Query: 198 RTHSDQVKVKISSSLRRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYD 257
           RTHSDQVKVKISSSLRRVWGKRL+KKRLNETFF SWMESIAVAAKKGGKEEQELDWDSYD
Sbjct: 181 RTHSDQVKVKISSSLRRVWGKRLLKKRLNETFFLSWMESIAVAAKKGGKEEQELDWDSYD 240

Query: 258 KIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRN 317
           KIKQE LHQ+LQ  AEK KLK +RAENAK R+V+ RVRKKEKGDD AKTKK+KMCSRRR+
Sbjct: 241 KIKQETLHQELQRVAEKEKLKAMRAENAKMREVQRRVRKKEKGDDYAKTKKLKMCSRRRD 300

Query: 318 GGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSW 377
            GKRK KE +D   KMKK T  ERS+LKQRLKKIRKKI+INGAV  QGSIASV PQNTSW
Sbjct: 301 AGKRKGKEEDDNLRKMKKSTTIERSKLKQRLKKIRKKISINGAVTTQGSIASVAPQNTSW 360

Query: 378 EKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLV-ASTLSYGCTREAER 432
           E LDLDLIKKGQMRKE SLADQIQVAKNRKAES ACKVL+ ASTL++ CT  AER
Sbjct: 361 ETLDLDLIKKGQMRKEASLADQIQVAKNRKAESTACKVLIAASTLAFQCTGVAER 412

BLAST of Tan0014409 vs. ExPASy TrEMBL
Match: A0A0A0L0E1 (IENR2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G083650 PE=4 SV=1)

HSP 1 Score: 587.0 bits (1512), Expect = 7.5e-164
Identity = 322/414 (77.78%), Positives = 349/414 (84.30%), Query Frame = 0

Query: 18  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNT 77
           +DC    M  +  RL G T +V LAPN A+WKI YYPVANIN PSNA P+N Q+S VRN 
Sbjct: 1   MDCHFTRMPYIHMRLLGTTFTVKLAPNPALWKISYYPVANINFPSNAAPINHQMSIVRND 60

Query: 78  SLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSSGSFEKQVLSRDAEDDDCPEN 137
           S+FSPFNIFN+T+ SQ+ L MVDEGRNSN GECYKSKCSS S EKQVLS     DD PEN
Sbjct: 61  SVFSPFNIFNRTSFSQAFLFMVDEGRNSNFGECYKSKCSSCSIEKQVLS---NKDDSPEN 120

Query: 138 LETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYS 197
           LET N KEWQRR+KIGLANKGRVPWNKGKKHNLETR RIKQRTIEALRDPEVRRKMSEY 
Sbjct: 121 LETENDKEWQRRKKIGLANKGRVPWNKGKKHNLETRTRIKQRTIEALRDPEVRRKMSEYP 180

Query: 198 RTHSDQVKVKISSSLRRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYD 257
           R HSDQVKVKISSSLRRVWGKRLMKKRLNETFF SWMESIAVAAKKGGKEEQELDWDSYD
Sbjct: 181 RIHSDQVKVKISSSLRRVWGKRLMKKRLNETFFLSWMESIAVAAKKGGKEEQELDWDSYD 240

Query: 258 KIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRN 317
           KIKQE LHQ+L+  AEK KLK +R ENAK +KV+ RV KKEKGDDNAKTKK+KMCSRRR+
Sbjct: 241 KIKQETLHQELRRVAEKEKLKAMR-ENAKMKKVQRRVGKKEKGDDNAKTKKLKMCSRRRD 300

Query: 318 GGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSW 377
            GKRK KE ++++ K KK T  ERS+LKQRLKKIRKKI+INGAV AQGSIASV PQN  W
Sbjct: 301 EGKRKGKEDDNLR-KKKKSTTIERSKLKQRLKKIRKKISINGAVTAQGSIASVAPQNPCW 360

Query: 378 EKLDLDLIKKGQMRKEVSLADQIQVAKNRKAESIACKVLVASTLSYGCTREAER 432
           EKLDLDLIKKGQ  KE SLADQIQVAKNRKAES ACKVL+ASTL++ CT  AER
Sbjct: 361 EKLDLDLIKKGQTWKEASLADQIQVAKNRKAESTACKVLIASTLAFQCTGVAER 409

BLAST of Tan0014409 vs. ExPASy TrEMBL
Match: A0A6J1ITB7 (uncharacterized protein LOC111480410 OS=Cucurbita maxima OX=3661 GN=LOC111480410 PE=4 SV=1)

HSP 1 Score: 583.2 bits (1502), Expect = 1.1e-162
Identity = 314/405 (77.53%), Positives = 347/405 (85.68%), Query Frame = 0

Query: 25  MKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFN 84
           M  +  +L GAT+SVNLAPN AIWK FYYPVAN+NLPSN +P+NQQIS  RN SL SPFN
Sbjct: 1   MPYIHMKLLGATVSVNLAPNSAIWKTFYYPVANVNLPSNVMPMNQQISICRNDSLSSPFN 60

Query: 85  IFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSSGSFEKQVLSRDAEDDDCPENLETGNYK 144
           +FN+TNSSQSLL +V EGR SNSGECYKSKCSSGSFEKQV SR+  DDDCPEN ET N K
Sbjct: 61  VFNRTNSSQSLLFIVAEGRISNSGECYKSKCSSGSFEKQVSSRNIGDDDCPENCETENDK 120

Query: 145 EWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSR-THSDQ 204
           EWQRRRKIG+ANKG+VPWNKGKKH+LETRKRIKQRTIEAL++P+VRRKMSEY R THSDQ
Sbjct: 121 EWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALKNPKVRRKMSEYPRPTHSDQ 180

Query: 205 VKVKISSSLRRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEM 264
           VK KISSSLRRVWGKRL+KKRLNE FF SW ESIAVAAKKGGKEEQELDWDS+DKI QEM
Sbjct: 181 VKTKISSSLRRVWGKRLLKKRLNEAFFRSWKESIAVAAKKGGKEEQELDWDSHDKIIQEM 240

Query: 265 LHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRNGGKRKA 324
           LHQKL+   EK KLK++RAENAKKRK++GR          AK KK KMCSRRRNGGKRK 
Sbjct: 241 LHQKLKMVEEKEKLKLMRAENAKKRKIQGR---------GAKIKKRKMCSRRRNGGKRKM 300

Query: 325 KEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLD 384
           KE EDIQ  +K+LTA ERSRLKQRLKKIRKKI IN  VAAQGS+ASV+P+ T+WEKLDLD
Sbjct: 301 KEVEDIQRTLKELTAIERSRLKQRLKKIRKKIAINSVVAAQGSVASVVPRGTTWEKLDLD 360

Query: 385 LIKKGQMRKEVSLADQIQVAKNRKAESIACKVLVASTLSYGCTRE 429
           LIKKG++R+ VSLADQIQ AK RKAESIACK+LVASTLSYGC  E
Sbjct: 361 LIKKGKLREGVSLADQIQFAKIRKAESIACKILVASTLSYGCAGE 396

BLAST of Tan0014409 vs. ExPASy TrEMBL
Match: A0A6J1CMC2 (uncharacterized protein LOC111012783 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111012783 PE=4 SV=1)

HSP 1 Score: 573.9 bits (1478), Expect = 6.6e-160
Identity = 313/400 (78.25%), Positives = 343/400 (85.75%), Query Frame = 0

Query: 32  LSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNTSLFSPFNIFNKTNS 91
           LSGAT S+NLA N  +WKIF YPVA INLPSN +PVN QIS +++ S  SP +I N+T+ 
Sbjct: 3   LSGATPSINLARNSVLWKIFCYPVA-INLPSNVVPVNHQISVIKHDSSVSPISILNRTSH 62

Query: 92  SQSLLLMVDEGRNSNSGECYKSKCSSGSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRK 151
           S  LL M DEGRNSN G CYKSKCS  S EK+V  R+  DDDCP+NL   N KE QRRR+
Sbjct: 63  SLPLLFMADEGRNSNFGWCYKSKCSLDSLEKRVYYREISDDDCPQNLGKENDKESQRRRR 122

Query: 152 IGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVKVKISSS 211
           IGLANKG VPWNKGKKHN+ETR+RIKQRTIEALRDP+VRRKMSEY RTHSDQVKVKISSS
Sbjct: 123 IGLANKGNVPWNKGKKHNMETRERIKQRTIEALRDPKVRRKMSEYPRTHSDQVKVKISSS 182

Query: 212 LRRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWA 271
           LRRVWGKRLMKKRLNETFF SW ESIAVAAKKGGKE +ELDWDSY KIKQEML QKLQ A
Sbjct: 183 LRRVWGKRLMKKRLNETFFLSWRESIAVAAKKGGKEAEELDWDSYQKIKQEMLRQKLQRA 242

Query: 272 AEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRNGGKRKAKEGEDIQT 331
           AEKA LK  RAENAKKRKVE R+RK+EKGD N K K+MKMCS+ RNG KRKAKEGEDIQ 
Sbjct: 243 AEKANLKETRAENAKKRKVERRIRKEEKGDGNGKLKRMKMCSKGRNGRKRKAKEGEDIQR 302

Query: 332 KMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMR 391
           +MKKLTA ERSRLKQRLK+IRKKI+INGAVAA+GSIASVIPQNTSWEKLDLDLIKKGQMR
Sbjct: 303 EMKKLTAIERSRLKQRLKRIRKKISINGAVAARGSIASVIPQNTSWEKLDLDLIKKGQMR 362

Query: 392 KEVSLADQIQVAKNRKAESIACKVLVASTLSYGCTREAER 432
           K VSLA+QIQVAK+RKAESIACKVL+AST +Y CT  AE+
Sbjct: 363 KGVSLAEQIQVAKSRKAESIACKVLLASTSTYQCTGRAEK 401

BLAST of Tan0014409 vs. ExPASy TrEMBL
Match: A0A6J1FPR6 (uncharacterized protein LOC111447116 OS=Cucurbita moschata OX=3662 GN=LOC111447116 PE=4 SV=1)

HSP 1 Score: 550.4 bits (1417), Expect = 7.8e-153
Identity = 298/391 (76.21%), Positives = 331/391 (84.65%), Query Frame = 0

Query: 18  LDCKEVAMKVMAQRLSGATLSVNLAPNYAIWKIFYYPVANINLPSNALPVNQQISTVRNT 77
           +D     M  +  +L GATLSVNLAPN AIWK FYYPVAN+NLPSN +P+NQQIS  RN 
Sbjct: 1   MDYHFTRMPYIHMKLLGATLSVNLAPNSAIWKTFYYPVANVNLPSNLMPMNQQISICRND 60

Query: 78  SLFSPFNIFNKTNSSQSLLLMVDEGRNSNSGECYKSKCSSGSFEKQVLSRDAEDDDCPEN 137
           SL SP N+FN+TNSSQSLL +V EGR SNSGECYKSKCSSGSFEKQV SR+  DDDCPEN
Sbjct: 61  SLSSPSNVFNRTNSSQSLLFIVAEGRISNSGECYKSKCSSGSFEKQVSSRNIGDDDCPEN 120

Query: 138 LETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYS 197
            ET N KEWQRRRKIG+ANKG+VPWNKGKKH+LETRKRIKQRTIEALR+P+VRRKMSEY 
Sbjct: 121 HETENDKEWQRRRKIGVANKGKVPWNKGKKHSLETRKRIKQRTIEALRNPKVRRKMSEYP 180

Query: 198 R-THSDQVKVKISSSLRRVWGKRLMKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSY 257
           R THSDQVK KISSSLRRVWGKRL+KKRLNE FF SW ESIAVAAKKGGKEEQELDWDS+
Sbjct: 181 RPTHSDQVKTKISSSLRRVWGKRLLKKRLNEAFFRSWKESIAVAAKKGGKEEQELDWDSH 240

Query: 258 DKIKQEMLHQKLQWAAEKAKLKVVRAENAKKRKVEGRVRKKEKGDDNAKTKKMKMCSRRR 317
           DKI QEMLHQKL+   EK KLK++RAENAKKRK++GR          AK KK KM SRRR
Sbjct: 241 DKIIQEMLHQKLKMVEEKEKLKLMRAENAKKRKIQGR---------GAKIKKRKMRSRRR 300

Query: 318 NGGKRKAKEGEDIQTKMKKLTATERSRLKQRLKKIRKKITINGAVAAQGSIASVIPQNTS 377
           NGGKR+ KEGED+Q   K+LTA ERSRLKQRLKKIRKKI ING VAAQGS+ASV+P+ T+
Sbjct: 301 NGGKRRMKEGEDVQRTKKELTAIERSRLKQRLKKIRKKIAINGVVAAQGSVASVVPRGTT 360

Query: 378 WEKLDLDLIKKGQMRKEVSLADQIQVAKNRK 408
           WEK+DLDLIKKG++R+EVSLADQIQ AKNRK
Sbjct: 361 WEKMDLDLIKKGKLREEVSLADQIQFAKNRK 382

BLAST of Tan0014409 vs. TAIR 10
Match: AT1G53250.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G53800.1); Has 11909 Blast hits to 7704 proteins in 757 species: Archae - 51; Bacteria - 1338; Metazoa - 4550; Fungi - 987; Plants - 464; Viruses - 24; Other Eukaryotes - 4495 (source: NCBI BLink). )

HSP 1 Score: 207.2 bits (526), Expect = 3.1e-53
Identity = 138/306 (45.10%), Positives = 190/306 (62.09%), Query Frame = 0

Query: 106 NSGECYKSKCSSGSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRVPWNKG 165
           N  E ++ + +S   E + +++D E D   +       KE +RRRKIGLANKG+VPWNKG
Sbjct: 67  NVFEIHRKEVNSSLLEVKAMNKDTEADSDSDR----KIKEEERRRKIGLANKGKVPWNKG 126

Query: 166 KKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVKVKISSSLRRVWGKRLMKKRL 225
           +KH+ +TR+RIKQRTIEAL +P+VR+KMS++ + HS++ K KI +S+++VW +R   KRL
Sbjct: 127 RKHSEDTRRRIKQRTIEALTNPKVRKKMSDHQQPHSNETKEKIRASVKQVWAERSRSKRL 186

Query: 226 NETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVVRAENA 285
            E F  SW E+IA AA+KGG  E ELDWDSY+KIKQ+   ++LQ A EKA+ K    E  
Sbjct: 187 KEKFMSSWSENIAEAARKGGSGEAELDWDSYEKIKQDFSSEQLQLAEEKARAK----EQT 246

Query: 286 KKRKVEGRVRKKEKGDDNAKTKKMKMCSRRRNGGKRKAKEGEDIQTKMKKLTATERSRLK 345
           K    E    + EK    A+ KK +    RR G  RK K+      + +  T   RS+LK
Sbjct: 247 KMIAKEAAKARTEKMRRAAEKKKEREEKDRREGKIRKPKQ------ERENPTIASRSKLK 306

Query: 346 QRLKKIRKKITINGAVAAQGSIASVIPQNTSWEKLDLDLIKKGQMRKEVSLADQIQVAKN 405
           +RL KI KK T  G +A       V+      EKLDLDLI+K + R ++SLADQIQ AKN
Sbjct: 307 KRLTKIHKKKTSLGKIAI--GTDRVVSVAAKLEKLDLDLIRKERTRGDISLADQIQAAKN 356

Query: 406 RKAESI 412
           ++   +
Sbjct: 367 QRGSDV 356

BLAST of Tan0014409 vs. TAIR 10
Match: AT1G53800.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G53250.1); Has 1136 Blast hits to 882 proteins in 242 species: Archae - 2; Bacteria - 216; Metazoa - 257; Fungi - 77; Plants - 87; Viruses - 4; Other Eukaryotes - 493 (source: NCBI BLink). )

HSP 1 Score: 85.1 bits (209), Expect = 1.8e-16
Identity = 73/274 (26.64%), Positives = 136/274 (49.64%), Query Frame = 0

Query: 101 EGRNSNSGECYKSKCSSGSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRV 160
           E   S+S     SK S+GS            DD  E ++    +E  RR +I  AN+G  
Sbjct: 75  ENERSSSLSSASSKSSNGS-----------ADDGEEQVDD---REKLRRMRISKANRGNT 134

Query: 161 PWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVKVKISSSLRRVWGKRL 220
           PWNKG+KH+ ET ++I++RT  A++DP+++ K++      + + ++KI   +R  W +R 
Sbjct: 135 PWNKGRKHSPETLQKIRERTKIAMQDPKIKMKLANLGHAQNKETRMKIGEGVRMRWARRK 194

Query: 221 MKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVV 280
            ++++ ET  F W   +A AAK+G  +E+EL WDSY+ + Q+    +L+W     + K +
Sbjct: 195 ERRKVQETCHFEWQNLLAEAAKQGYTDEEELQWDSYNILDQQ---NQLEWLESVEQRKAI 254

Query: 281 RAENAKKRKVEGRVRKKEKGDDNA-----KTKKMKMCS-------------RRRNGGKRK 340
           +   + +R  +   +++   +  A      + + ++CS             RRR   +  
Sbjct: 255 KGAKSNRRAPKSPEQRRRIAEAIAAKWADPSYRERVCSGLAKYHGIPVGVERRRRRPRSD 314

Query: 341 AKEGEDIQTKMKKLTATERSRLKQRLKKIRKKIT 357
           A+  +   TK     +    + + ++ K+RK+ T
Sbjct: 315 AEPRKKTPTKKSTRDSEFERQSQVQVVKVRKRKT 331

BLAST of Tan0014409 vs. TAIR 10
Match: AT1G53800.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G53250.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 85.1 bits (209), Expect = 1.8e-16
Identity = 73/274 (26.64%), Positives = 136/274 (49.64%), Query Frame = 0

Query: 101 EGRNSNSGECYKSKCSSGSFEKQVLSRDAEDDDCPENLETGNYKEWQRRRKIGLANKGRV 160
           E   S+S     SK S+GS            DD  E ++    +E  RR +I  AN+G  
Sbjct: 79  ENERSSSLSSASSKSSNGS-----------ADDGEEQVDD---REKLRRMRISKANRGNT 138

Query: 161 PWNKGKKHNLETRKRIKQRTIEALRDPEVRRKMSEYSRTHSDQVKVKISSSLRRVWGKRL 220
           PWNKG+KH+ ET ++I++RT  A++DP+++ K++      + + ++KI   +R  W +R 
Sbjct: 139 PWNKGRKHSPETLQKIRERTKIAMQDPKIKMKLANLGHAQNKETRMKIGEGVRMRWARRK 198

Query: 221 MKKRLNETFFFSWMESIAVAAKKGGKEEQELDWDSYDKIKQEMLHQKLQWAAEKAKLKVV 280
            ++++ ET  F W   +A AAK+G  +E+EL WDSY+ + Q+    +L+W     + K +
Sbjct: 199 ERRKVQETCHFEWQNLLAEAAKQGYTDEEELQWDSYNILDQQ---NQLEWLESVEQRKAI 258

Query: 281 RAENAKKRKVEGRVRKKEKGDDNA-----KTKKMKMCS-------------RRRNGGKRK 340
           +   + +R  +   +++   +  A      + + ++CS             RRR   +  
Sbjct: 259 KGAKSNRRAPKSPEQRRRIAEAIAAKWADPSYRERVCSGLAKYHGIPVGVERRRRRPRSD 318

Query: 341 AKEGEDIQTKMKKLTATERSRLKQRLKKIRKKIT 357
           A+  +   TK     +    + + ++ K+RK+ T
Sbjct: 319 AEPRKKTPTKKSTRDSEFERQSQVQVVKVRKRKT 335

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity  Description
XP_038897968.1  2.1e-168  77.51     uncharacterized protein LOC120085828 [Benincasa hispida]
XP_008451721.1  6.1e-168  78.55     PREDICTED: uncharacterized protein LOC103492929 isoform X1 [Cucumis melo]
KAG6600400.1    2.0e-166  72.41     hypothetical protein SDJN03_05633, partial [Cucurbita argyrosperma subsp. sorori...
KAG7031061.1    2.5e-166  70.11     hypothetical protein SDJN02_05100 [Cucurbita argyrosperma subsp. argyrosperma]
XP_011653304.1  1.5e-163  77.78     uncharacterized protein LOC101207813 isoform X1 [Cucumis sativus] >KGN53591.1 hy...

ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A1S3BS78      2.9e-168  78.55     uncharacterized protein LOC103492929 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A0A0L0E1      7.5e-164  77.78     IENR2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G083650 PE=4 ...
A0A6J1ITB7      1.1e-162  77.53     uncharacterized protein LOC111480410 OS=Cucurbita maxima OX=3661 GN=LOC111480410...
A0A6J1CMC2      6.6e-160  78.25     uncharacterized protein LOC111012783 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1FPR6      7.8e-153  76.21     uncharacterized protein LOC111447116 OS=Cucurbita moschata OX=3662 GN=LOC1114471...

TAIR 10
Match Name      E-value   Identity  Description
AT1G53250.1     3.1e-53   45.10     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G53800.1     1.8e-16   26.64     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G53800.2     1.8e-16   26.64     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                       Source       Source Term    Source Description                                                      Alignment
None       No IPR available                      COILS        Coil           Coil                                                                    coord: 265..285
None       No IPR available                      MOBIDB_LITE  mobidb-lite    disorder_prediction                                                     coord: 293..326
None       No IPR available                      MOBIDB_LITE  mobidb-lite    disorder_prediction                                                     coord: 293..308
None       No IPR available                      PANTHER      PTHR34199:SF1  HISTONE-LYSINE N-METHYLTRANSFERASE, H3 LYSINE-79 SPECIFIC-LIKE PROTEIN  coord: 28..420
None       No IPR available                      PANTHER      PTHR34199      NUMOD3 MOTIF FAMILY PROTEIN, EXPRESSED                                  coord: 28..420
IPR003611  Nuclease associated modular domain 3  PFAM         PF07460        NUMOD3                                                                  coord: 149..176; e-value: 7.1E-8; score: 32.2
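
The `coord` values in the Alignment column can be mapped back onto the protein sequence. A minimal sketch, assuming the coordinates are 1-based and inclusive; the helper `extract_region` is hypothetical, and the fragment used here is the query segment covering residues 133 to 192 as printed in the BLAST alignments above.

```python
def extract_region(fragment: str, fragment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a fragment that
    itself begins at protein position fragment_start."""
    fragment_end = fragment_start + len(fragment) - 1
    if start < fragment_start or end > fragment_end or end < start:
        raise ValueError("requested region lies outside the fragment")
    offset = start - fragment_start
    return fragment[offset:offset + (end - start + 1)]


# Query residues 133..192, copied from the alignment lines above.
QUERY_133_192 = "DCPENLETGNYKEWQRRRKIGLANKGRVPWNKGKKHNLETRKRIKQRTIEALRDPEVRRK"

# Pfam PF07460 (NUMOD3) is reported at coord: 149..176.
numod3 = extract_region(QUERY_133_192, 133, 149, 176)
print(numod3, len(numod3))
```

With the full protein sequence in place of the fragment, `fragment_start` would simply be 1.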

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0014409.1  Tan0014409.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003824      catalytic activity
molecular_function  GO:0003677      DNA binding