Tan0002668 (gene) Snake gourd v1

Overview
NameTan0002668
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDUF4408 domain-containing protein
LocationLG01: 112222166 .. 112224002 (-)
RNA-Seq ExpressionTan0002668
SyntenyTan0002668
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCAAAACCCACTCTCTCTCAAATGCCTTCGCTTCTCTTCATCTTCTTCCCTAACAAATCCCCCATTTCCTCTCCCGTCTCCGTGCCCTATTTCCGCCATGTTCGCTGAATCTGTTTCTTCCACACTCTCCATATGGACCTCCGTCAACAGTTGGTTCACTCCCACTGTTCTCTTCGTCGTCCTCAACCTCGTGATTGGCACCATTGCTATTGCTTCCAACTTAGGCGGCCCTCAAAAACCTAATCACCGCCATCCTTCCGACCCTGACCAGCCTCAGTATCTTCCACGATCTCCTTCCCTCCTTCACAGGCTTAAATCGATTAATCCTTATGCGTATAGATCCGAAGAGCCGGCAACTGTGTTCGAGAAACCGCCTGGAAACGAGACTCATTATGCTAGTTTTGAACATCCGCAATTGGATAGATCTCCTTCTGTGTTTCAGCGGTTTAAGTTTAACTTCCCAGGCTACAAATCGGAGGAGTATTTTCAGTCTCCACCGGCGGCGGCTGTGTCCGAGAAAACGCATGGTACCGAGACTCATTATGCTAATTTTGAACATCCGCAACTGGTTAGATCTCCTTCTATGCTTCAGCGGCTCAAGTTTAATTTCTCAGGCTACAAATCGGAGGAGTCTTTCCAATCTACGCCACCGGCTGCTGTGTCTGAGAAAACGCCTGGACTCGAGACTCATTACACTAATTGTGAACATCCACAACTGGTTAGATCTCCTTCTATGATTCAGCGACTCAAGTTTAACTTCTATGGCTATAAATCGGAGGAATCTTTTCAATCTCCGCCTCCGTCTGTTCGTGCGGTTCAGATCCGTCGGGAGGAGGTGGAGCCGAAACGAGTGGAGGTGGAGGAGGATGGAGAAACGGACGGAGATGAGGAACTGTCTATGGACGAGGTTTACAGTAAACTCCATGGCGATCACTTCACCAGGACGAAATCCGATACGAAGCCGACTGCTGGTGAGCTTCCAACGAGACTGCCGAGAAAGATGAAGAAATCGGCGAGTTCGAAATCCGCGTTCTCGCATTTCGAAGCCGATGAAATTGTAGAGAGTCGCCGTCCGGCCACCGTGAAAGAAGGAAGAGAGAAAATGACGGAGATAGACGACGAAGTGGACGCCAGGGCCGACGACTTCATCAACAAATTCAAGCAACAGTTGAAATTGCAGAGGCTGGAATCCATTCTCAAGTACAAGGAGATGGTCAGCAGAGGGAACGCGAAATAGGCAAGGTAAATAAAAGGTAATTTTCAACTCTCTTGGGGTGGGGTTTTCTAATTTTTGATATTCTCAAGGATTAGTTTGAAATTTTCAAGTTTTATAAATTGGACTCCAGCAAGTAAAGCCTAAAAAGAAAAGAGAAAAAAGAAAAAAGAAAAGCAGATCTATCTACTGTTTTTAATTTCCGTCTTTGTAACTTTCATTGCTGAATTCTTTTCTTTAATGCCCCTGAAGTTTGGCCTTCACCTATTTTCCGTGACCAATGTTTATAAATTCGTTTCCAATTACGTCCCTCCATCACTCCCATCCGATAAGCTGTAATTTAATGCTCCATCTCGATAATCTGACAGACAGGTCACTGTATCAATATAATAAGCCAAATGTCTTATTAATTATAGGGTATTGTGTCTTAAAATTTAATAACCCAGGAGTGGGGAGGGGTAGTTTCGTAAATAGACGTTGGCTCTCATGTTGCACGTGGGGGAAAACGCCTCTTCACATTAGGCTCTGTTTCTGTGCGCCATTTAATATTTTATATATATAAAAAATAATCAAGGAGTAAATGTCAAAATATAACGACAATTTGTATTTATTGTTTAATTCGGC

mRNA sequence

CCAAAACCCACTCTCTCTCAAATGCCTTCGCTTCTCTTCATCTTCTTCCCTAACAAATCCCCCATTTCCTCTCCCGTCTCCGTGCCCTATTTCCGCCATGTTCGCTGAATCTGTTTCTTCCACACTCTCCATATGGACCTCCGTCAACAGTTGGTTCACTCCCACTGTTCTCTTCGTCGTCCTCAACCTCGTGATTGGCACCATTGCTATTGCTTCCAACTTAGGCGGCCCTCAAAAACCTAATCACCGCCATCCTTCCGACCCTGACCAGCCTCAGTATCTTCCACGATCTCCTTCCCTCCTTCACAGGCTTAAATCGATTAATCCTTATGCGTATAGATCCGAAGAGCCGGCAACTGTGTTCGAGAAACCGCCTGGAAACGAGACTCATTATGCTAGTTTTGAACATCCGCAATTGGATAGATCTCCTTCTGTGTTTCAGCGGTTTAAGTTTAACTTCCCAGGCTACAAATCGGAGGAGTATTTTCAGTCTCCACCGGCGGCGGCTGTGTCCGAGAAAACGCATGGTACCGAGACTCATTATGCTAATTTTGAACATCCGCAACTGGTTAGATCTCCTTCTATGCTTCAGCGGCTCAAGTTTAATTTCTCAGGCTACAAATCGGAGGAGTCTTTCCAATCTACGCCACCGGCTGCTGTGTCTGAGAAAACGCCTGGACTCGAGACTCATTACACTAATTGTGAACATCCACAACTGGTTAGATCTCCTTCTATGATTCAGCGACTCAAGTTTAACTTCTATGGCTATAAATCGGAGGAATCTTTTCAATCTCCGCCTCCGTCTGTTCGTGCGGTTCAGATCCGTCGGGAGGAGGTGGAGCCGAAACGAGTGGAGGTGGAGGAGGATGGAGAAACGGACGGAGATGAGGAACTGTCTATGGACGAGGTTTACAGTAAACTCCATGGCGATCACTTCACCAGGACGAAATCCGATACGAAGCCGACTGCTGGTGAGCTTCCAACGAGACTGCCGAGAAAGATGAAGAAATCGGCGAGTTCGAAATCCGCGTTCTCGCATTTCGAAGCCGATGAAATTGTAGAGAGTCGCCGTCCGGCCACCGTGAAAGAAGGAAGAGAGAAAATGACGGAGATAGACGACGAAGTGGACGCCAGGGCCGACGACTTCATCAACAAATTCAAGCAACAGTTGAAATTGCAGAGGCTGGAATCCATTCTCAAGTACAAGGAGATGGTCAGCAGAGGGAACGCGAAATAGGCAAGGTAAATAAAAGGTAATTTTCAACTCTCTTGGGGTGGGGTTTTCTAATTTTTGATATTCTCAAGGATTAGTTTGAAATTTTCAAGTTTTATAAATTGGACTCCAGCAAGTAAAGCCTAAAAAGAAAAGAGAAAAAAGAAAAAAGAAAAGCAGATCTATCTACTGTTTTTAATTTCCGTCTTTGTAACTTTCATTGCTGAATTCTTTTCTTTAATGCCCCTGAAGTTTGGCCTTCACCTATTTTCCGTGACCAATGTTTATAAATTCGTTTCCAATTACGTCCCTCCATCACTCCCATCCGATAAGCTGTAATTTAATGCTCCATCTCGATAATCTGACAGACAGGTCACTGTATCAATATAATAAGCCAAATGTCTTATTAATTATAGGGTATTGTGTCTTAAAATTTAATAACCCAGGAGTGGGGAGGGGTAGTTTCGTAAATAGACGTTGGCTCTCATGTTGCACGTGGGGGAAAACGCCTCTTCACATTAGGCTCTGTTTCTGTGCGCCATTTAATATTTTATATATATAAAAAATAATCAAGGAGTAAATGTCAAAATATAACGACAATTTGTATTTATTGTTTAATTCGGC

Coding sequence (CDS)

ATGTTCGCTGAATCTGTTTCTTCCACACTCTCCATATGGACCTCCGTCAACAGTTGGTTCACTCCCACTGTTCTCTTCGTCGTCCTCAACCTCGTGATTGGCACCATTGCTATTGCTTCCAACTTAGGCGGCCCTCAAAAACCTAATCACCGCCATCCTTCCGACCCTGACCAGCCTCAGTATCTTCCACGATCTCCTTCCCTCCTTCACAGGCTTAAATCGATTAATCCTTATGCGTATAGATCCGAAGAGCCGGCAACTGTGTTCGAGAAACCGCCTGGAAACGAGACTCATTATGCTAGTTTTGAACATCCGCAATTGGATAGATCTCCTTCTGTGTTTCAGCGGTTTAAGTTTAACTTCCCAGGCTACAAATCGGAGGAGTATTTTCAGTCTCCACCGGCGGCGGCTGTGTCCGAGAAAACGCATGGTACCGAGACTCATTATGCTAATTTTGAACATCCGCAACTGGTTAGATCTCCTTCTATGCTTCAGCGGCTCAAGTTTAATTTCTCAGGCTACAAATCGGAGGAGTCTTTCCAATCTACGCCACCGGCTGCTGTGTCTGAGAAAACGCCTGGACTCGAGACTCATTACACTAATTGTGAACATCCACAACTGGTTAGATCTCCTTCTATGATTCAGCGACTCAAGTTTAACTTCTATGGCTATAAATCGGAGGAATCTTTTCAATCTCCGCCTCCGTCTGTTCGTGCGGTTCAGATCCGTCGGGAGGAGGTGGAGCCGAAACGAGTGGAGGTGGAGGAGGATGGAGAAACGGACGGAGATGAGGAACTGTCTATGGACGAGGTTTACAGTAAACTCCATGGCGATCACTTCACCAGGACGAAATCCGATACGAAGCCGACTGCTGGTGAGCTTCCAACGAGACTGCCGAGAAAGATGAAGAAATCGGCGAGTTCGAAATCCGCGTTCTCGCATTTCGAAGCCGATGAAATTGTAGAGAGTCGCCGTCCGGCCACCGTGAAAGAAGGAAGAGAGAAAATGACGGAGATAGACGACGAAGTGGACGCCAGGGCCGACGACTTCATCAACAAATTCAAGCAACAGTTGAAATTGCAGAGGCTGGAATCCATTCTCAAGTACAAGGAGATGGTCAGCAGAGGGAACGCGAAATAG

Protein sequence

MFAESVSSTLSIWTSVNSWFTPTVLFVVLNLVIGTIAIASNLGGPQKPNHRHPSDPDQPQYLPRSPSLLHRLKSINPYAYRSEEPATVFEKPPGNETHYASFEHPQLDRSPSVFQRFKFNFPGYKSEEYFQSPPAAAVSEKTHGTETHYANFEHPQLVRSPSMLQRLKFNFSGYKSEESFQSTPPAAVSEKTPGLETHYTNCEHPQLVRSPSMIQRLKFNFYGYKSEESFQSPPPSVRAVQIRREEVEPKRVEVEEDGETDGDEELSMDEVYSKLHGDHFTRTKSDTKPTAGELPTRLPRKMKKSASSKSAFSHFEADEIVESRRPATVKEGREKMTEIDDEVDARADDFINKFKQQLKLQRLESILKYKEMVSRGNAK
Homology
BLAST of Tan0002668 vs. ExPASy Swiss-Prot
Match: F4K956 (Pathogen-associated molecular patterns-induced protein A70 OS=Arabidopsis thaliana OX=3702 GN=A70 PE=1 SV=1)

HSP 1 Score: 129.8 bits (325), Expect = 6.5e-29
Identity = 136/394 (34.52%), Postives = 192/394 (48.73%), Query Frame = 0

Query: 10  LSIWTSVNSWFTPTVLFVVLNLVIGTIAIASNLGGPQKPNHRHP---SDPDQPQYLPRSP 69
           + + T+V S+FTPT LF++LNL+IGTI + S LG   + +++H         P  L R+P
Sbjct: 1   MELLTTVASFFTPTTLFLLLNLMIGTIVVTSRLGSGSRKHYQHHDGFGSGHAPAPLARAP 60

Query: 70  SLLHRLKSINPYAYRSEEPATVFEKPPGNETHYASFEHPQLDRSPSVFQRFKFNFPGYKS 129
           S++ R+KSIN + Y+   P T       +     S  H   D +P+  QR        KS
Sbjct: 61  SIIDRVKSINFHLYKFPHPETELFSMTAHHDIIGSDLHVYPDPNPAPLQRAPSLLDRVKS 120

Query: 130 --EEYFQSPPAAAVSEKTHGTETHYANFEHP---QLVRSPSMLQRLK-FNFSGYKSEESF 189
               YF+ P     S+    + +H      P    L R+PS+L R+K  N S +K    F
Sbjct: 121 INMSYFKFPHDVTGSDPHSHSHSHLDLHPDPAPAPLQRAPSLLDRVKSINMSYFK----F 180

Query: 190 QSTPPAAVSEKTPGLETHYTNCEHPQLVRS-PSMIQRL-KFNFYGYKSEESFQ------- 249
           Q   P          E  Y +   P    S P+ + R+   +   ++  E  Q       
Sbjct: 181 QQYNPE---------ENDYAHHTEPTRFESIPTRMGRVDPIDISKFRIPEEDQPTGTGVN 240

Query: 250 ---SPPPSVRAVQIRREEVEPKRVEVEEDGETDGDEELSMDEVYSKLHGDH-FTRTKSDT 309
              +PP   RA  I  E V+  ++      + D D++ + D V   LH +H   R+KS++
Sbjct: 241 SQINPPGLTRAPSI-LERVKSIKLSSFYRSDPDLDQKQNPDPV---LHEEHKHVRSKSES 300

Query: 310 KPTAGELPTRLPRKMKKSASSKSAF----SHFEADEIVES---RRPATVK-EGREKMTEI 369
           K    +    L  KM KSAS KS F    SH EA E VES   RRP T + E      + 
Sbjct: 301 KKPVKKKKKAL-TKMTKSASEKSGFGFAGSHAEAPETVESLERRRPDTTRVERSTSFGDG 360

Query: 370 DDEVDARADDFINKFKQQLKLQRLESILKYKEMV 374
           +D VDA+A DFINKFKQQLKLQRL+SIL+YKEM+
Sbjct: 361 EDGVDAKASDFINKFKQQLKLQRLDSILRYKEML 376

BLAST of Tan0002668 vs. NCBI nr
Match: XP_008452727.1 (PREDICTED: uncharacterized protein LOC103493663 [Cucumis melo] >XP_008452729.1 PREDICTED: uncharacterized protein LOC103493663 [Cucumis melo] >KAA0064497.1 myb-like protein AA [Cucumis melo var. makuwa] >TYK20093.1 myb-like protein AA [Cucumis melo var. makuwa])

HSP 1 Score: 582.8 bits (1501), Expect = 2.1e-162
Identity = 298/380 (78.42%), Postives = 333/380 (87.63%), Query Frame = 0

Query: 1   MFAESVSSTLSIWTSVNSWFTPTVLFVVLNLVIGTIAIASNLGGPQKPNHRHPSDPDQPQ 60
           MFAESVSSTLSIWTS+NSWFTPTVLFVVLNLVIGTIAIASNLGG Q+PN RHPSDPD P 
Sbjct: 1   MFAESVSSTLSIWTSLNSWFTPTVLFVVLNLVIGTIAIASNLGGSQRPNQRHPSDPDYPH 60

Query: 61  YLPRSPSLLHRLKSINPYAYRSEEPATVFEKPPGNETHYASFEHPQLDRSPSVFQRFKFN 120
           YL RSPS+L RLKS+NPYAYRSEEPATVFEKPPG + HYA++EHPQL RSPS+ QRFKF+
Sbjct: 61  YLHRSPSVLQRLKSMNPYAYRSEEPATVFEKPPGIDAHYANYEHPQLVRSPSMLQRFKFS 120

Query: 121 FPGYKSEEYFQSPPAAAVSEKTHGTETHYANFEHPQLVRSPSMLQRLKFNFSGYKSEESF 180
           FP YK EE FQSPP+A   EK H  +TH AN++HPQLVRSPS+LQRLKF+FSGYK EESF
Sbjct: 121 FPSYKPEESFQSPPSATTFEKAHEIDTHSANYQHPQLVRSPSVLQRLKFSFSGYKPEESF 180

Query: 181 QSTPPAAVSEKTPGLETHYTNCEHPQLVRSPSMIQRLKFNFYGY-KSEESFQSPPPSVRA 240
           QS PP    EK  G + HY+N EHPQLVRSPSM+QR+KFNFYG+ K+EESFQSPPP+V  
Sbjct: 181 QSPPPVTHVEKPAGGDAHYSNFEHPQLVRSPSMLQRIKFNFYGHNKAEESFQSPPPTVSE 240

Query: 241 VQIRREEVEPKRVEVEEDGETDGDEELSMDEVYSKLHGDHFTRTKSDTKPTAGELPTRLP 300
           VQIRR++ E KR+   ED +TDGD+E +MDEV+SKLHGDHF RTKSDT PT+GE PT+L 
Sbjct: 241 VQIRRKDDESKRM---EDEQTDGDQEPTMDEVFSKLHGDHFNRTKSDTMPTSGEFPTKLS 300

Query: 301 RKMKKSASSKSAFSHFEADEIVESRRPATVKEGREKMTEIDDEVDARADDFINKFKQQLK 360
           +KMKKSASSKS FSHFEAD+IVESRRPATVKEGREK+TEI+DEVDARADDFINKFKQQLK
Sbjct: 301 KKMKKSASSKSTFSHFEADDIVESRRPATVKEGREKITEIEDEVDARADDFINKFKQQLK 360

Query: 361 LQRLESILKYKEMVSRGNAK 380
           LQRLESILKYKEMV RGNAK
Sbjct: 361 LQRLESILKYKEMVGRGNAK 377

BLAST of Tan0002668 vs. NCBI nr
Match: XP_004141316.1 (pathogen-associated molecular patterns-induced protein A70 [Cucumis sativus] >KGN55333.1 hypothetical protein Csa_012010 [Cucumis sativus])

HSP 1 Score: 580.5 bits (1495), Expect = 1.1e-161
Identity = 298/377 (79.05%), Postives = 326/377 (86.47%), Query Frame = 0

Query: 1   MFAESVSSTLSIWTSVNSWFTPTVLFVVLNLVIGTIAIASNLGGPQKPNHRHPSDPDQPQ 60
           M AESVSSTLSIWTS+NSWFTPTVLFVVLNLVIGTIAIASNLGG Q+ N RHPSDPD P 
Sbjct: 1   MLAESVSSTLSIWTSLNSWFTPTVLFVVLNLVIGTIAIASNLGGTQRTNQRHPSDPDYPH 60

Query: 61  YLPRSPSLLHRLKSINPYAYRSEEPATVFEKPPGNETHYASFEHPQLDRSPSVFQRFKFN 120
           YL RSPS+L RLKS+NPY+YRSEEPATV EKPPG + HYA++EHPQL RSPS+ QRFKF+
Sbjct: 61  YLHRSPSVLQRLKSMNPYSYRSEEPATVLEKPPGIDAHYANYEHPQLVRSPSMLQRFKFS 120

Query: 121 FPGYKSEEYFQSPPAAAVSEKTHGTETHYANFEHPQLVRSPSMLQRLKFNFSGYKSEESF 180
           FP YK EE FQSPP+A   EK HG + H AN++HPQLVRSPS+LQRLK +FSGYK EESF
Sbjct: 121 FPSYKPEESFQSPPSATAFEKPHGIDAHSANYQHPQLVRSPSVLQRLKSSFSGYKPEESF 180

Query: 181 QSTPPAAVSEKTPGLETHYTNCEHPQLVRSPSMIQRLKFNFYGYKSEESFQSPPPSVRAV 240
           QS PP    EK+ G +THYTN EHPQLVRSPSM+QRLKFNFYGYKSEESFQSPPP+V   
Sbjct: 181 QSPPPVTHVEKSAGGDTHYTNFEHPQLVRSPSMLQRLKFNFYGYKSEESFQSPPPTVSEA 240

Query: 241 QIRREEVEPKRVEVEEDGETDGDEELSMDEVYSKLHGDHFTRTKSDTKPTAGELPTRLPR 300
           QIRR+E E KRV   ED + D D+E +MDEV+SKLHGDHF RTKSDT PTAGE PT+L R
Sbjct: 241 QIRRKEDESKRV---EDEQMDEDQEPTMDEVFSKLHGDHFNRTKSDTMPTAGEFPTKLSR 300

Query: 301 KMKKSASSKSAFSHFEADEIVESRRPATVKEGREKMTEIDDEVDARADDFINKFKQQLKL 360
           KMKKSASSKS FSHFEADEIVESRRPATVKEG+EKMTEI+DEVDARADDFINKFKQQLKL
Sbjct: 301 KMKKSASSKSTFSHFEADEIVESRRPATVKEGKEKMTEIEDEVDARADDFINKFKQQLKL 360

Query: 361 QRLESILKYKEMVSRGN 378
           QRLESILKYKEMV RGN
Sbjct: 361 QRLESILKYKEMVGRGN 374

BLAST of Tan0002668 vs. NCBI nr
Match: XP_022940047.1 (uncharacterized protein LOC111445796 [Cucurbita moschata])

HSP 1 Score: 578.6 bits (1490), Expect = 4.1e-161
Identity = 305/383 (79.63%), Postives = 328/383 (85.64%), Query Frame = 0

Query: 1   MFAESVSSTLSIWTSVNSWFTPTVLFVVLNLVIGTIAIASNLGGPQKPNHRHPSDPDQPQ 60
           MFAESVSSTLSIWTS+NSWFTPTVLFVVLNLVIGTI IASNLGGPQK NHRHPSDPDQ Q
Sbjct: 1   MFAESVSSTLSIWTSINSWFTPTVLFVVLNLVIGTIVIASNLGGPQK-NHRHPSDPDQFQ 60

Query: 61  YLPRSPSLLHRLKSINPYAYRSEEPATVFEKPPGNETHYASFEHPQLDRSPSVFQRFKFN 120
           YL RSPS+L RLKS NPY+YRSEEPAT+FEK PGNETHYA+FEHPQL RSPSV QRFKF+
Sbjct: 61  YLHRSPSMLQRLKSFNPYSYRSEEPATLFEKLPGNETHYATFEHPQLVRSPSVLQRFKFS 120

Query: 121 FPGYKSEEYFQSPPAAAVSEKTHGTETHYANFEHPQLVRSPSMLQRLKFNFSGYKSEESF 180
           F GYKSEE FQSPP A V EK+   ETHYA+FEHPQLVRSPS+ QR KF+FSGYKSEESF
Sbjct: 121 FAGYKSEESFQSPPPATVVEKSPVNETHYASFEHPQLVRSPSVFQRFKFSFSGYKSEESF 180

Query: 181 QSTPPAAVSEKTPGLETHYTNCEHPQLVRSPSMIQRLKFNFYGYKSEESFQSPPPSVRA- 240
            S  PA V EK P  E HY   EHPQLVRSPSM+QRLKFNFYG++SEES Q   PSV A 
Sbjct: 181 DSPLPATVVEKPPANEIHYAKFEHPQLVRSPSMLQRLKFNFYGFRSEESSQYTHPSVTAV 240

Query: 241 -----VQIRREEVEPKRVEVEEDGETDGDEELSMDEVYSKLHGDHFTRTKSDTKPTAGEL 300
                VQI REE  PKR    ED E D D+EL+M+EVYSKLHGDHFTRTKSDTKPTAGE+
Sbjct: 241 QKIEKVQIGREEAAPKRA---EDEEMDEDQELTMEEVYSKLHGDHFTRTKSDTKPTAGEI 300

Query: 301 PTRLPRKMKKSASSKSAFSHFEADEIVESRRPATVKEGREKMTEIDDEVDARADDFINKF 360
           PT+LPRKMKKSASSKSAFSHFEADEIVESRRPATV EGR KMTEID+ VDARADDFIN+F
Sbjct: 301 PTKLPRKMKKSASSKSAFSHFEADEIVESRRPATVNEGRAKMTEIDEGVDARADDFINRF 360

Query: 361 KQQLKLQRLESILKYKEMVSRGN 378
           KQQLKLQRLES+LKYK+M+SRGN
Sbjct: 361 KQQLKLQRLESVLKYKDMISRGN 379

BLAST of Tan0002668 vs. NCBI nr
Match: XP_023524243.1 (uncharacterized protein LOC111788207 [Cucurbita pepo subsp. pepo] >XP_023524244.1 uncharacterized protein LOC111788207 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 577.0 bits (1486), Expect = 1.2e-160
Identity = 305/383 (79.63%), Postives = 327/383 (85.38%), Query Frame = 0

Query: 1   MFAESVSSTLSIWTSVNSWFTPTVLFVVLNLVIGTIAIASNLGGPQKPNHRHPSDPDQPQ 60
           MFAESVSSTLSIWTS+NSWFTPTVLFVVLNLVIGTI IASNLGGPQK NHRHPSDPDQ Q
Sbjct: 1   MFAESVSSTLSIWTSINSWFTPTVLFVVLNLVIGTIVIASNLGGPQK-NHRHPSDPDQFQ 60

Query: 61  YLPRSPSLLHRLKSINPYAYRSEEPATVFEKPPGNETHYASFEHPQLDRSPSVFQRFKFN 120
           YL RSPS+L RLKS NPY+YRSEEPAT+FEK PGNETHYA+FEHPQL RSPSV QRFKF+
Sbjct: 61  YLHRSPSMLQRLKSFNPYSYRSEEPATLFEKLPGNETHYATFEHPQLVRSPSVLQRFKFS 120

Query: 121 FPGYKSEEYFQSPPAAAVSEKTHGTETHYANFEHPQLVRSPSMLQRLKFNFSGYKSEESF 180
           F GYKSEE FQSPP A V EK+   ETHYA+FEHPQLVRSPS+ QR KF+FSGYKSEESF
Sbjct: 121 FAGYKSEESFQSPPPATVVEKSPVNETHYASFEHPQLVRSPSVFQRFKFSFSGYKSEESF 180

Query: 181 QSTPPAAVSEKTPGLETHYTNCEHPQLVRSPSMIQRLKFNFYGYKSEESFQSPPPSVRA- 240
            S  PA V EK P  E HY   EHPQLVRSPSM+QRLKFNFYG++SE+S Q   PSV A 
Sbjct: 181 DSLLPATVDEKPPANEIHYAKFEHPQLVRSPSMLQRLKFNFYGFRSEDSSQYTHPSVTAV 240

Query: 241 -----VQIRREEVEPKRVEVEEDGETDGDEELSMDEVYSKLHGDHFTRTKSDTKPTAGEL 300
                VQI REE  PKR    ED E D D+E +MDEVYSKLHGDHFTRTKSDTKPTAGE+
Sbjct: 241 QKIEKVQIGREEAVPKRA---EDEEMDEDQEPTMDEVYSKLHGDHFTRTKSDTKPTAGEI 300

Query: 301 PTRLPRKMKKSASSKSAFSHFEADEIVESRRPATVKEGREKMTEIDDEVDARADDFINKF 360
           PT+LPRKMKKSASSKSAFSHFEADEIVESRRPATV EGR KMTEID+ VDARADDFIN+F
Sbjct: 301 PTKLPRKMKKSASSKSAFSHFEADEIVESRRPATVNEGRAKMTEIDEGVDARADDFINRF 360

Query: 361 KQQLKLQRLESILKYKEMVSRGN 378
           KQQLKLQRLESILKYK+M+SRGN
Sbjct: 361 KQQLKLQRLESILKYKDMISRGN 379

BLAST of Tan0002668 vs. NCBI nr
Match: XP_022982373.1 (uncharacterized protein LOC111481220 [Cucurbita maxima])

HSP 1 Score: 573.2 bits (1476), Expect = 1.7e-159
Identity = 305/385 (79.22%), Postives = 327/385 (84.94%), Query Frame = 0

Query: 1   MFAESVSSTLSIWTSVNSWFTPTVLFVVLNLVIGTIAIASNLGGPQKPNHRHPSDPDQPQ 60
           MFAESVSSTLSIWTS+NSWFTPTVLFV+LNLVIGTIAIASNLGGPQK N+RHPSDPDQ Q
Sbjct: 1   MFAESVSSTLSIWTSINSWFTPTVLFVILNLVIGTIAIASNLGGPQK-NYRHPSDPDQLQ 60

Query: 61  YLPRSPSLLHRLKSINPYAYRSEEPATVFEKPPGNETHYASFEHPQLDRSPSVFQRFKFN 120
           YL RSPS+L RLKS NPY+YRSEEPAT+FEK PGNETHYASFEHPQL RSPSV QRFKF+
Sbjct: 61  YLHRSPSMLQRLKSFNPYSYRSEEPATLFEKLPGNETHYASFEHPQLVRSPSVLQRFKFS 120

Query: 121 FPGYKSEEYFQSPPAAAVSEKTHGTETHYANFEHPQLVRSPSMLQRLKFNFSGYKSEESF 180
           F GYKSEE FQS P   V EK+   ETHYA+FEHPQLVRSPS+ QR KF+FSGYKSEESF
Sbjct: 121 FSGYKSEESFQSLPPETVVEKSPVNETHYASFEHPQLVRSPSVFQRFKFSFSGYKSEESF 180

Query: 181 QSTPPAAVSEKTPGLETHYTNCEHPQLVRSPSMIQRLKFNFYGYKSEESFQSPPPSVRA- 240
            S  PA V EK P  E HY   EHPQLVRSPSM+QRLKFNFYG++SEES +   PSV A 
Sbjct: 181 DSPLPATVVEKPPANEIHYAKFEHPQLVRSPSMLQRLKFNFYGFRSEESSKYTHPSVTAV 240

Query: 241 -----VQIRREEVEPKRVEVEEDGETDGDEELSMDEVYSKLHGDHFTRTKSDTKPTAGEL 300
                VQI REE  PKR    ED E D D+E +MDEVYSKLHGDHFTRTKSDTKPTAGE+
Sbjct: 241 QKIEEVQIGREEAAPKRA---EDEEIDEDQEPTMDEVYSKLHGDHFTRTKSDTKPTAGEI 300

Query: 301 PTRLPRKMKKSASSKSAFSHFEADEIVESRRPATVKEGREKMTEIDDEVDARADDFINKF 360
           PT+LPRKMKKSASSKSAFSHFEADEIVESRRPATV EGR KM EID+ VDARADDFINKF
Sbjct: 301 PTKLPRKMKKSASSKSAFSHFEADEIVESRRPATVNEGRAKMAEIDEGVDARADDFINKF 360

Query: 361 KQQLKLQRLESILKYKEMVSRGNAK 380
           KQQLKLQRLESILKYK+M+SRGNAK
Sbjct: 361 KQQLKLQRLESILKYKDMISRGNAK 381

BLAST of Tan0002668 vs. ExPASy TrEMBL
Match: A0A1S3BUJ6 (uncharacterized protein LOC103493663 OS=Cucumis melo OX=3656 GN=LOC103493663 PE=4 SV=1)

HSP 1 Score: 582.8 bits (1501), Expect = 1.0e-162
Identity = 298/380 (78.42%), Postives = 333/380 (87.63%), Query Frame = 0

Query: 1   MFAESVSSTLSIWTSVNSWFTPTVLFVVLNLVIGTIAIASNLGGPQKPNHRHPSDPDQPQ 60
           MFAESVSSTLSIWTS+NSWFTPTVLFVVLNLVIGTIAIASNLGG Q+PN RHPSDPD P 
Sbjct: 1   MFAESVSSTLSIWTSLNSWFTPTVLFVVLNLVIGTIAIASNLGGSQRPNQRHPSDPDYPH 60

Query: 61  YLPRSPSLLHRLKSINPYAYRSEEPATVFEKPPGNETHYASFEHPQLDRSPSVFQRFKFN 120
           YL RSPS+L RLKS+NPYAYRSEEPATVFEKPPG + HYA++EHPQL RSPS+ QRFKF+
Sbjct: 61  YLHRSPSVLQRLKSMNPYAYRSEEPATVFEKPPGIDAHYANYEHPQLVRSPSMLQRFKFS 120

Query: 121 FPGYKSEEYFQSPPAAAVSEKTHGTETHYANFEHPQLVRSPSMLQRLKFNFSGYKSEESF 180
           FP YK EE FQSPP+A   EK H  +TH AN++HPQLVRSPS+LQRLKF+FSGYK EESF
Sbjct: 121 FPSYKPEESFQSPPSATTFEKAHEIDTHSANYQHPQLVRSPSVLQRLKFSFSGYKPEESF 180

Query: 181 QSTPPAAVSEKTPGLETHYTNCEHPQLVRSPSMIQRLKFNFYGY-KSEESFQSPPPSVRA 240
           QS PP    EK  G + HY+N EHPQLVRSPSM+QR+KFNFYG+ K+EESFQSPPP+V  
Sbjct: 181 QSPPPVTHVEKPAGGDAHYSNFEHPQLVRSPSMLQRIKFNFYGHNKAEESFQSPPPTVSE 240

Query: 241 VQIRREEVEPKRVEVEEDGETDGDEELSMDEVYSKLHGDHFTRTKSDTKPTAGELPTRLP 300
           VQIRR++ E KR+   ED +TDGD+E +MDEV+SKLHGDHF RTKSDT PT+GE PT+L 
Sbjct: 241 VQIRRKDDESKRM---EDEQTDGDQEPTMDEVFSKLHGDHFNRTKSDTMPTSGEFPTKLS 300

Query: 301 RKMKKSASSKSAFSHFEADEIVESRRPATVKEGREKMTEIDDEVDARADDFINKFKQQLK 360
           +KMKKSASSKS FSHFEAD+IVESRRPATVKEGREK+TEI+DEVDARADDFINKFKQQLK
Sbjct: 301 KKMKKSASSKSTFSHFEADDIVESRRPATVKEGREKITEIEDEVDARADDFINKFKQQLK 360

Query: 361 LQRLESILKYKEMVSRGNAK 380
           LQRLESILKYKEMV RGNAK
Sbjct: 361 LQRLESILKYKEMVGRGNAK 377

BLAST of Tan0002668 vs. ExPASy TrEMBL
Match: A0A5A7VC03 (Myb-like protein AA OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold134G002170 PE=4 SV=1)

HSP 1 Score: 582.8 bits (1501), Expect = 1.0e-162
Identity = 298/380 (78.42%), Postives = 333/380 (87.63%), Query Frame = 0

Query: 1   MFAESVSSTLSIWTSVNSWFTPTVLFVVLNLVIGTIAIASNLGGPQKPNHRHPSDPDQPQ 60
           MFAESVSSTLSIWTS+NSWFTPTVLFVVLNLVIGTIAIASNLGG Q+PN RHPSDPD P 
Sbjct: 1   MFAESVSSTLSIWTSLNSWFTPTVLFVVLNLVIGTIAIASNLGGSQRPNQRHPSDPDYPH 60

Query: 61  YLPRSPSLLHRLKSINPYAYRSEEPATVFEKPPGNETHYASFEHPQLDRSPSVFQRFKFN 120
           YL RSPS+L RLKS+NPYAYRSEEPATVFEKPPG + HYA++EHPQL RSPS+ QRFKF+
Sbjct: 61  YLHRSPSVLQRLKSMNPYAYRSEEPATVFEKPPGIDAHYANYEHPQLVRSPSMLQRFKFS 120

Query: 121 FPGYKSEEYFQSPPAAAVSEKTHGTETHYANFEHPQLVRSPSMLQRLKFNFSGYKSEESF 180
           FP YK EE FQSPP+A   EK H  +TH AN++HPQLVRSPS+LQRLKF+FSGYK EESF
Sbjct: 121 FPSYKPEESFQSPPSATTFEKAHEIDTHSANYQHPQLVRSPSVLQRLKFSFSGYKPEESF 180

Query: 181 QSTPPAAVSEKTPGLETHYTNCEHPQLVRSPSMIQRLKFNFYGY-KSEESFQSPPPSVRA 240
           QS PP    EK  G + HY+N EHPQLVRSPSM+QR+KFNFYG+ K+EESFQSPPP+V  
Sbjct: 181 QSPPPVTHVEKPAGGDAHYSNFEHPQLVRSPSMLQRIKFNFYGHNKAEESFQSPPPTVSE 240

Query: 241 VQIRREEVEPKRVEVEEDGETDGDEELSMDEVYSKLHGDHFTRTKSDTKPTAGELPTRLP 300
           VQIRR++ E KR+   ED +TDGD+E +MDEV+SKLHGDHF RTKSDT PT+GE PT+L 
Sbjct: 241 VQIRRKDDESKRM---EDEQTDGDQEPTMDEVFSKLHGDHFNRTKSDTMPTSGEFPTKLS 300

Query: 301 RKMKKSASSKSAFSHFEADEIVESRRPATVKEGREKMTEIDDEVDARADDFINKFKQQLK 360
           +KMKKSASSKS FSHFEAD+IVESRRPATVKEGREK+TEI+DEVDARADDFINKFKQQLK
Sbjct: 301 KKMKKSASSKSTFSHFEADDIVESRRPATVKEGREKITEIEDEVDARADDFINKFKQQLK 360

Query: 361 LQRLESILKYKEMVSRGNAK 380
           LQRLESILKYKEMV RGNAK
Sbjct: 361 LQRLESILKYKEMVGRGNAK 377

BLAST of Tan0002668 vs. ExPASy TrEMBL
Match: A0A0A0L0P6 (DUF4408 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G646070 PE=4 SV=1)

HSP 1 Score: 580.5 bits (1495), Expect = 5.2e-162
Identity = 298/377 (79.05%), Postives = 326/377 (86.47%), Query Frame = 0

Query: 1   MFAESVSSTLSIWTSVNSWFTPTVLFVVLNLVIGTIAIASNLGGPQKPNHRHPSDPDQPQ 60
           M AESVSSTLSIWTS+NSWFTPTVLFVVLNLVIGTIAIASNLGG Q+ N RHPSDPD P 
Sbjct: 1   MLAESVSSTLSIWTSLNSWFTPTVLFVVLNLVIGTIAIASNLGGTQRTNQRHPSDPDYPH 60

Query: 61  YLPRSPSLLHRLKSINPYAYRSEEPATVFEKPPGNETHYASFEHPQLDRSPSVFQRFKFN 120
           YL RSPS+L RLKS+NPY+YRSEEPATV EKPPG + HYA++EHPQL RSPS+ QRFKF+
Sbjct: 61  YLHRSPSVLQRLKSMNPYSYRSEEPATVLEKPPGIDAHYANYEHPQLVRSPSMLQRFKFS 120

Query: 121 FPGYKSEEYFQSPPAAAVSEKTHGTETHYANFEHPQLVRSPSMLQRLKFNFSGYKSEESF 180
           FP YK EE FQSPP+A   EK HG + H AN++HPQLVRSPS+LQRLK +FSGYK EESF
Sbjct: 121 FPSYKPEESFQSPPSATAFEKPHGIDAHSANYQHPQLVRSPSVLQRLKSSFSGYKPEESF 180

Query: 181 QSTPPAAVSEKTPGLETHYTNCEHPQLVRSPSMIQRLKFNFYGYKSEESFQSPPPSVRAV 240
           QS PP    EK+ G +THYTN EHPQLVRSPSM+QRLKFNFYGYKSEESFQSPPP+V   
Sbjct: 181 QSPPPVTHVEKSAGGDTHYTNFEHPQLVRSPSMLQRLKFNFYGYKSEESFQSPPPTVSEA 240

Query: 241 QIRREEVEPKRVEVEEDGETDGDEELSMDEVYSKLHGDHFTRTKSDTKPTAGELPTRLPR 300
           QIRR+E E KRV   ED + D D+E +MDEV+SKLHGDHF RTKSDT PTAGE PT+L R
Sbjct: 241 QIRRKEDESKRV---EDEQMDEDQEPTMDEVFSKLHGDHFNRTKSDTMPTAGEFPTKLSR 300

Query: 301 KMKKSASSKSAFSHFEADEIVESRRPATVKEGREKMTEIDDEVDARADDFINKFKQQLKL 360
           KMKKSASSKS FSHFEADEIVESRRPATVKEG+EKMTEI+DEVDARADDFINKFKQQLKL
Sbjct: 301 KMKKSASSKSTFSHFEADEIVESRRPATVKEGKEKMTEIEDEVDARADDFINKFKQQLKL 360

Query: 361 QRLESILKYKEMVSRGN 378
           QRLESILKYKEMV RGN
Sbjct: 361 QRLESILKYKEMVGRGN 374

BLAST of Tan0002668 vs. ExPASy TrEMBL
Match: A0A6J1FIJ5 (uncharacterized protein LOC111445796 OS=Cucurbita moschata OX=3662 GN=LOC111445796 PE=4 SV=1)

HSP 1 Score: 578.6 bits (1490), Expect = 2.0e-161
Identity = 305/383 (79.63%), Postives = 328/383 (85.64%), Query Frame = 0

Query: 1   MFAESVSSTLSIWTSVNSWFTPTVLFVVLNLVIGTIAIASNLGGPQKPNHRHPSDPDQPQ 60
           MFAESVSSTLSIWTS+NSWFTPTVLFVVLNLVIGTI IASNLGGPQK NHRHPSDPDQ Q
Sbjct: 1   MFAESVSSTLSIWTSINSWFTPTVLFVVLNLVIGTIVIASNLGGPQK-NHRHPSDPDQFQ 60

Query: 61  YLPRSPSLLHRLKSINPYAYRSEEPATVFEKPPGNETHYASFEHPQLDRSPSVFQRFKFN 120
           YL RSPS+L RLKS NPY+YRSEEPAT+FEK PGNETHYA+FEHPQL RSPSV QRFKF+
Sbjct: 61  YLHRSPSMLQRLKSFNPYSYRSEEPATLFEKLPGNETHYATFEHPQLVRSPSVLQRFKFS 120

Query: 121 FPGYKSEEYFQSPPAAAVSEKTHGTETHYANFEHPQLVRSPSMLQRLKFNFSGYKSEESF 180
           F GYKSEE FQSPP A V EK+   ETHYA+FEHPQLVRSPS+ QR KF+FSGYKSEESF
Sbjct: 121 FAGYKSEESFQSPPPATVVEKSPVNETHYASFEHPQLVRSPSVFQRFKFSFSGYKSEESF 180

Query: 181 QSTPPAAVSEKTPGLETHYTNCEHPQLVRSPSMIQRLKFNFYGYKSEESFQSPPPSVRA- 240
            S  PA V EK P  E HY   EHPQLVRSPSM+QRLKFNFYG++SEES Q   PSV A 
Sbjct: 181 DSPLPATVVEKPPANEIHYAKFEHPQLVRSPSMLQRLKFNFYGFRSEESSQYTHPSVTAV 240

Query: 241 -----VQIRREEVEPKRVEVEEDGETDGDEELSMDEVYSKLHGDHFTRTKSDTKPTAGEL 300
                VQI REE  PKR    ED E D D+EL+M+EVYSKLHGDHFTRTKSDTKPTAGE+
Sbjct: 241 QKIEKVQIGREEAAPKRA---EDEEMDEDQELTMEEVYSKLHGDHFTRTKSDTKPTAGEI 300

Query: 301 PTRLPRKMKKSASSKSAFSHFEADEIVESRRPATVKEGREKMTEIDDEVDARADDFINKF 360
           PT+LPRKMKKSASSKSAFSHFEADEIVESRRPATV EGR KMTEID+ VDARADDFIN+F
Sbjct: 301 PTKLPRKMKKSASSKSAFSHFEADEIVESRRPATVNEGRAKMTEIDEGVDARADDFINRF 360

Query: 361 KQQLKLQRLESILKYKEMVSRGN 378
           KQQLKLQRLES+LKYK+M+SRGN
Sbjct: 361 KQQLKLQRLESVLKYKDMISRGN 379

BLAST of Tan0002668 vs. ExPASy TrEMBL
Match: A0A6J1J4N9 (uncharacterized protein LOC111481220 OS=Cucurbita maxima OX=3661 GN=LOC111481220 PE=4 SV=1)

HSP 1 Score: 573.2 bits (1476), Expect = 8.2e-160
Identity = 305/385 (79.22%), Postives = 327/385 (84.94%), Query Frame = 0

Query: 1   MFAESVSSTLSIWTSVNSWFTPTVLFVVLNLVIGTIAIASNLGGPQKPNHRHPSDPDQPQ 60
           MFAESVSSTLSIWTS+NSWFTPTVLFV+LNLVIGTIAIASNLGGPQK N+RHPSDPDQ Q
Sbjct: 1   MFAESVSSTLSIWTSINSWFTPTVLFVILNLVIGTIAIASNLGGPQK-NYRHPSDPDQLQ 60

Query: 61  YLPRSPSLLHRLKSINPYAYRSEEPATVFEKPPGNETHYASFEHPQLDRSPSVFQRFKFN 120
           YL RSPS+L RLKS NPY+YRSEEPAT+FEK PGNETHYASFEHPQL RSPSV QRFKF+
Sbjct: 61  YLHRSPSMLQRLKSFNPYSYRSEEPATLFEKLPGNETHYASFEHPQLVRSPSVLQRFKFS 120

Query: 121 FPGYKSEEYFQSPPAAAVSEKTHGTETHYANFEHPQLVRSPSMLQRLKFNFSGYKSEESF 180
           F GYKSEE FQS P   V EK+   ETHYA+FEHPQLVRSPS+ QR KF+FSGYKSEESF
Sbjct: 121 FSGYKSEESFQSLPPETVVEKSPVNETHYASFEHPQLVRSPSVFQRFKFSFSGYKSEESF 180

Query: 181 QSTPPAAVSEKTPGLETHYTNCEHPQLVRSPSMIQRLKFNFYGYKSEESFQSPPPSVRA- 240
            S  PA V EK P  E HY   EHPQLVRSPSM+QRLKFNFYG++SEES +   PSV A 
Sbjct: 181 DSPLPATVVEKPPANEIHYAKFEHPQLVRSPSMLQRLKFNFYGFRSEESSKYTHPSVTAV 240

Query: 241 -----VQIRREEVEPKRVEVEEDGETDGDEELSMDEVYSKLHGDHFTRTKSDTKPTAGEL 300
                VQI REE  PKR    ED E D D+E +MDEVYSKLHGDHFTRTKSDTKPTAGE+
Sbjct: 241 QKIEEVQIGREEAAPKRA---EDEEIDEDQEPTMDEVYSKLHGDHFTRTKSDTKPTAGEI 300

Query: 301 PTRLPRKMKKSASSKSAFSHFEADEIVESRRPATVKEGREKMTEIDDEVDARADDFINKF 360
           PT+LPRKMKKSASSKSAFSHFEADEIVESRRPATV EGR KM EID+ VDARADDFINKF
Sbjct: 301 PTKLPRKMKKSASSKSAFSHFEADEIVESRRPATVNEGRAKMAEIDEGVDARADDFINKF 360

Query: 361 KQQLKLQRLESILKYKEMVSRGNAK 380
           KQQLKLQRLESILKYK+M+SRGNAK
Sbjct: 361 KQQLKLQRLESILKYKDMISRGNAK 381

BLAST of Tan0002668 vs. TAIR 10
Match: AT2G26110.1 (Protein of unknown function (DUF761) )

HSP 1 Score: 195.7 bits (496), Expect = 6.9e-50
Identity = 146/375 (38.93%), Postives = 198/375 (52.80%), Query Frame = 0

Query: 11  SIWTSVNSWFTPTVLFVVLNLVIGTIAIASNLGGPQKPNHRHPSDPDQPQYLPRSPSLLH 70
           S+ T++ SWFTPTVLFV LNL+IGTIAI+S+            +DP+Q Q + RSPS++H
Sbjct: 40  SVLTAMYSWFTPTVLFVFLNLMIGTIAISSSFSSKS-------NDPNQTQ-IQRSPSMIH 99

Query: 71  RLKSINPYAYRSEEPATVFEKPPGNETHYASFEHPQLDRSPSVFQRFKFNFPGYKSEEYF 130
           RLKSIN                      ++SF  P                     + + 
Sbjct: 100 RLKSIN----------------------FSSFTSP--------------------DKSHL 159

Query: 131 QSPPAAAVSEKTHGTETHYANFEHPQLVRSPSMLQRLKFNFSGYKSEESFQSTPPAAVSE 190
           + PP+       +       NF  P  +                                
Sbjct: 160 EFPPSTPEDNSNN-------NFHQPASIEQ------------------------------ 219

Query: 191 KTPGLETHYTNCEHPQLVRSPSMIQRLK-FNFYGYKSEES---FQSPPPSVRAVQIRREE 250
                         P L RSPS++ R+K FN Y Y S+E     ++ PPSV  V+ ++E+
Sbjct: 220 ------------NQPFLSRSPSVLHRIKSFNLYNYISQEPTNIIEASPPSV-TVETKQEQ 279

Query: 251 VEPKRVEVEEDGETDGDEELSMDEVYSKLHGDHFTRTKSDTKPTAGELPTRLPRKMKKSA 310
           V+ + V+ E++     +EE S++EVYSKL+ +H  RTKSDT+P AG  P +LP+KMKKSA
Sbjct: 280 VQEQEVKEEQE-----EEEQSLEEVYSKLNLNHVARTKSDTEPAAGIRPPKLPKKMKKSA 309

Query: 311 SSKSAFSHFEADEI-VESRRPATVKEGR-EKMTEIDDEVDARADDFINKFKQQLKLQRLE 370
           S+KS FSHF+ DEI VE+RRPATVK  R   + E D+EVDA+ADDFIN+FK QLKLQR++
Sbjct: 340 STKSPFSHFQEDEISVEARRPATVKVPRVTTVEEADEEVDAKADDFINRFKHQLKLQRID 309

Query: 371 SILKYKEMVSRGNAK 380
           SI KYKEMV + N K
Sbjct: 400 SITKYKEMVKKRNDK 309

BLAST of Tan0002668 vs. TAIR 10
Match: AT5G56980.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 18 plant structures; EXPRESSED DURING: 12 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G26130.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 129.8 bits (325), Expect = 4.6e-30
Identity = 136/394 (34.52%), Postives = 192/394 (48.73%), Query Frame = 0

Query: 10  LSIWTSVNSWFTPTVLFVVLNLVIGTIAIASNLGGPQKPNHRHP---SDPDQPQYLPRSP 69
           + + T+V S+FTPT LF++LNL+IGTI + S LG   + +++H         P  L R+P
Sbjct: 1   MELLTTVASFFTPTTLFLLLNLMIGTIVVTSRLGSGSRKHYQHHDGFGSGHAPAPLARAP 60

Query: 70  SLLHRLKSINPYAYRSEEPATVFEKPPGNETHYASFEHPQLDRSPSVFQRFKFNFPGYKS 129
           S++ R+KSIN + Y+   P T       +     S  H   D +P+  QR        KS
Sbjct: 61  SIIDRVKSINFHLYKFPHPETELFSMTAHHDIIGSDLHVYPDPNPAPLQRAPSLLDRVKS 120

Query: 130 --EEYFQSPPAAAVSEKTHGTETHYANFEHP---QLVRSPSMLQRLK-FNFSGYKSEESF 189
               YF+ P     S+    + +H      P    L R+PS+L R+K  N S +K    F
Sbjct: 121 INMSYFKFPHDVTGSDPHSHSHSHLDLHPDPAPAPLQRAPSLLDRVKSINMSYFK----F 180

Query: 190 QSTPPAAVSEKTPGLETHYTNCEHPQLVRS-PSMIQRL-KFNFYGYKSEESFQ------- 249
           Q   P          E  Y +   P    S P+ + R+   +   ++  E  Q       
Sbjct: 181 QQYNPE---------ENDYAHHTEPTRFESIPTRMGRVDPIDISKFRIPEEDQPTGTGVN 240

Query: 250 ---SPPPSVRAVQIRREEVEPKRVEVEEDGETDGDEELSMDEVYSKLHGDH-FTRTKSDT 309
              +PP   RA  I  E V+  ++      + D D++ + D V   LH +H   R+KS++
Sbjct: 241 SQINPPGLTRAPSI-LERVKSIKLSSFYRSDPDLDQKQNPDPV---LHEEHKHVRSKSES 300

Query: 310 KPTAGELPTRLPRKMKKSASSKSAF----SHFEADEIVES---RRPATVK-EGREKMTEI 369
           K    +    L  KM KSAS KS F    SH EA E VES   RRP T + E      + 
Sbjct: 301 KKPVKKKKKAL-TKMTKSASEKSGFGFAGSHAEAPETVESLERRRPDTTRVERSTSFGDG 360

Query: 370 DDEVDARADDFINKFKQQLKLQRLESILKYKEMV 374
           +D VDA+A DFINKFKQQLKLQRL+SIL+YKEM+
Sbjct: 361 EDGVDAKASDFINKFKQQLKLQRLDSILRYKEML 376

BLAST of Tan0002668 vs. TAIR 10
Match: AT4G26130.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G56980.1); Has 121 Blast hits to 116 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 113; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 98.2 bits (243), Expect = 1.5e-20
Identity = 112/376 (29.79%), Postives = 161/376 (42.82%), Query Frame = 0

Query: 10  LSIWTSVNSWFTPTVLFVVLNLVIGTIAIASNLGGPQKPNHRH-----PSDPDQPQ-YLP 69
           + + TS+ +W TPT LF++LN  I TI I +      + +++H      S  DQ Q    
Sbjct: 1   MELLTSLTNWLTPTTLFLLLNFTIATIFITNRFSSCSRKHNQHQDGYGSSGHDQNQARFG 60

Query: 70  RSPSLLHRLKSINPYAYRSEEPATVFEKPPGNETHYASFEHPQLDRSPSVFQRFK-FNFP 129
           R PSL+ R+KSIN + Y S  P         +E HY+  + P  +  PS+ QR K  N P
Sbjct: 61  RPPSLIDRVKSINFHLYNSPSPE--------SEIHYSGSD-PNPNPPPSLLQRVKSINMP 120

Query: 130 GYKSEEYFQSPPAAAVSEKTHGTETHYANFEHPQLVRSPSMLQRLKFNFSGYKSEESFQS 189
                 YF+ P         H +E  YA +E   L+  P               +E+ + 
Sbjct: 121 ------YFKFP--------QHNSEGDYAAYE---LMTQP---------------DETNRV 180

Query: 190 TPPAAVSEKTPGLETHYTNCEHPQLVRSPSMIQRLKFNFYGYKSEESFQSPPPSVRAVQI 249
            P   + E     E  +          +PS++QR+K      K    ++S P     VQ 
Sbjct: 181 DPIDKIPEDDVTTEPRF---------GAPSLLQRVK----SIKLPSLYRSDPDPTPEVQT 240

Query: 250 RREEVEPKRVEVEEDGETDGDEELSMDEVYSKLHGDHFTRTKSDTKPTAGELPTRLPRKM 309
                                                 TRTKS++   A +   +  +KM
Sbjct: 241 H-------------------------------------TRTKSESSKPATKKKKKATKKM 283

Query: 310 KKSASSKS-AFSHFEADEIVESRRPATVKEGREKMTEIDD----EVDARADDFINKFKQQ 369
            KSAS +       E  E VE RRP T++   E+ T I D     VD +A +FINKFKQQ
Sbjct: 301 MKSASERHIGREEEETVEAVEKRRPETMRV--ERTTSIGDGGEEGVDDKASNFINKFKQQ 283

Query: 370 LKLQRLESILKYKEMV 374
           LKLQRL+S L+Y+EM+
Sbjct: 361 LKLQRLDSFLRYREML 283

BLAST of Tan0002668 vs. TAIR 10
Match: AT4G04990.1 (Protein of unknown function (DUF761) )

HSP 1 Score: 47.8 bits (112), Expect = 2.3e-05
Identity = 51/186 (27.42%), Postives = 84/186 (45.16%), Query Frame = 0

Query: 222 YGYKSEESFQSPPPSVRAVQIRREEVEPKRVEVEEDGETDGDEELSMDEVYSKLHG---- 281
           YG  S E    P    + V +RR    P  V+      T GDE  +M+E++ ++      
Sbjct: 125 YGVSSPEVRFFPTAPEKPVGLRRPPTVP--VKTFPQDNTSGDESETMEEMWERVKAEKQP 184

Query: 282 -------DHFTRTKSDTKPTAGE--LPTRLPRKMKKSASSKSAFSHFEADEIVESRRP-- 341
                  DH   ++ DTK +     LP+R P + ++   S S+ S   +     +RRP  
Sbjct: 185 KKPNSLQDHVI-SRGDTKMSTSSWPLPSRSPSRARRPTPSLSSLSPSSS----RARRPPS 244

Query: 342 ATVKEGREKMTEID-------------DEVDARADDFINKFKQQLKLQRLESILKYKEMV 380
           +  + G++ M  I              +E+++R + FI KFK ++KLQRLES+ +YK   
Sbjct: 245 SPARPGKKLMERIPSWVKLKKELSMGREELNSRVEAFITKFKDEMKLQRLESVRRYKSFR 303

BLAST of Tan0002668 vs. TAIR 10
Match: AT5G47920.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G13880.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 43.9 bits (102), Expect = 3.3e-04
Identity = 22/49 (44.90%), Postives = 36/49 (73.47%), Query Frame = 0

Query: 328 TVKEGREKMTEIDDEVDARADDFINKFKQQLKLQRLESILKYKEMVSRG 377
           T KE   +  +++DE+D  AD FI++F +Q+KLQ+L S  +Y+EM++RG
Sbjct: 138 TSKEEEGENFKLEDEIDHVADLFISRFHKQMKLQKLLSFKRYQEMLARG 186

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
F4K9566.5e-2934.52Pathogen-associated molecular patterns-induced protein A70 OS=Arabidopsis thalia... [more]
Match NameE-valueIdentityDescription
XP_008452727.12.1e-16278.42PREDICTED: uncharacterized protein LOC103493663 [Cucumis melo] >XP_008452729.1 P... [more]
XP_004141316.11.1e-16179.05pathogen-associated molecular patterns-induced protein A70 [Cucumis sativus] >KG... [more]
XP_022940047.14.1e-16179.63uncharacterized protein LOC111445796 [Cucurbita moschata][more]
XP_023524243.11.2e-16079.63uncharacterized protein LOC111788207 [Cucurbita pepo subsp. pepo] >XP_023524244.... [more]
XP_022982373.11.7e-15979.22uncharacterized protein LOC111481220 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A1S3BUJ61.0e-16278.42uncharacterized protein LOC103493663 OS=Cucumis melo OX=3656 GN=LOC103493663 PE=... [more]
A0A5A7VC031.0e-16278.42Myb-like protein AA OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold134G... [more]
A0A0A0L0P65.2e-16279.05DUF4408 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G646070 PE=... [more]
A0A6J1FIJ52.0e-16179.63uncharacterized protein LOC111445796 OS=Cucurbita moschata OX=3662 GN=LOC1114457... [more]
A0A6J1J4N98.2e-16079.22uncharacterized protein LOC111481220 OS=Cucurbita maxima OX=3661 GN=LOC111481220... [more]
Match NameE-valueIdentityDescription
AT2G26110.16.9e-5038.93Protein of unknown function (DUF761) [more]
AT5G56980.14.6e-3034.52unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT4G26130.11.5e-2029.79unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT4G04990.12.3e-0527.42Protein of unknown function (DUF761) [more]
AT5G47920.13.3e-0444.90unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR008480Protein of unknown function DUF761, plantPFAMPF05553DUF761coord: 340..375
e-value: 6.1E-18
score: 64.1
IPR025520Domain of unknown function DUF4408PFAMPF14364DUF4408coord: 11..40
e-value: 5.2E-10
score: 38.9
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 43..63
NoneNo IPR availablePANTHERPTHR33098COTTON FIBER (DUF761)coord: 1..101
coord: 143..377
NoneNo IPR availablePANTHERPTHR33098:SF53OS06G0566500 PROTEINcoord: 1..101
coord: 143..377

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0002668.1Tan0002668.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane