Tan0007298 (gene) Snake gourd v1

Overview
Name: Tan0007298
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Hybrid signal transduction protein dokA isoform X2
Location: LG01: 84099830 .. 84102694 (-)
RNA-Seq Expression: Tan0007298
Synteny: Tan0007298
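
The Location field packs the linkage group, start, end, and strand into one string. A minimal sketch of splitting it apart is shown here; the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

def parse_location(text):
    """Split a location string such as 'LG01: 84099830 .. 84102694 (-)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("LG01: 84099830 .. 84102694 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 2865 bp on the minus strand of LG01.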
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CCTAAAAATACGGCGTCGTTTTAATACAGCGGTCGTCCTCGAGTTCAATCTCCGAGTTGAAAGCTTCGCCGACTCGGTGATTGTGGCGTACGGCGCTGAAGATCACAGCCGAAAAAATCTTTGGGGCTTCAATTTCCATGGAGGGAATCGGATCCAGATTGGGCCGAGCCTCGTCCCGCTACGGTCCGGCGGCCACCGTCTTCAATGGCCCCGTCCGGAAGTGGAAGAAGAAGTGGGTCTTGGCTTCTTCATCTTCCTCCGGTGTCGCTTACCAGTCCTCTTCTCAGTCCCAATCCAACGCTCAGAAGCTTCTTCTCTGCCGCTGGACCCCGATTCCTCCCCCTACTTCCTCGGAAGCAGACGGAGCCTCCGCGCCGTCCGACGAACCGCCCAGACGGAAGTTCCGTTACACTCCTGTAAGTTTAAACAACTATCCGAGTTTGTTTGTATTGTATGTCCGATTGATGCTCTTTGTGTGGTGACTGGTGAGTGAGCGGTTGCGATTGCGATTGTTGCATGTGAATTGGTTTTGACCTTGGGAGTGTCACCGATTTCAGAGGGAAATTAAGTGAGGGGGTTTTGAAGTTCTTTGTTAATCTCCTCTGTTGTTGATTTTTGAAGGACGATGCAATTCTCGGTACCGGTTTATACTAACGCTTTTGGTTTCTAGTTACTGATTTTTGTATGCTCTTCGATTTTGTATAATTTATCTTCTGAATTGATTACGATCATCTCGTATTCTGTTCTTATTGTTGCCTGTTCTTCTAGATTAGGATTTAACTACGAGCAAATAATTTGGATGTTTTGGTTGTGATGTGAAGATTGCCGTGCTGGAGGAACAGAAAAATGCAGAGTTGAGAAGTGTCAAAGAGGAAGTTAGAATGAAAGAAATGGACCAGTCTGCTGTGAGGACTGCAAATCAGGGCAATGAGGCTCAGGGGGAGCTAAATGTCAATGAGGTTTTCAAGGAAGAAACTCAGGTAACTCACTGATTTCAAATTGGTTTGTTTATTGTTTTCTTGTGAAGTTTCTTTGTTCTTGTTCTTGTTCATGCTCTTGCTCTCTACCAATGGCTGTTTTCTCATTGATTGCTGATCAGCCTGATTGTGTTGTTGAATGTTCGTCTGGATTGGATCATGTCAGACTGTAAAGAGCTCTACCCTGTGATACTTGTAAGGGGCAGTTTAGGGCGCTGGGTGGGTTATAATAACAAGGGTTATAATTGTCGGTGGGTTATTATAATCTGTGGAATCATATAATACTATTAAAATGCAGAATAATATAGTCGTGGGTTATAATAGTCTGTGTTTGGGGTACAAAGTATCCCACAGGTTATAATAACACAAGTTATTATAATATGTGCCCCAAACAAGCTCTAAAAGAGAATTTACTATTCTTTATGATCTGGAGGATATAGGCTTATAGTTGACATAGTTATTGACCATGCACAAGTGTAACATGGCATTGGGGGCTCTTTTATTTAATTGAGATAGTTACATGCTCATGTTGTTTATCAACAAACAGTTGACATAGTTATTGACCATGCACAAGTGTAACATGGCATTGGGGGCTCTTTTATTTAATTGAGATAGTTACATGCTCATGTTGTTTATCAACAAACAATTAAGAACATGGAACTTGAGTGGTTTTTTTGTTTGGCCTCTCAAATTTGTAATTTTGAAATCTCATATTGAAAGCAAATGTAGGGAGGGTCAAACAGGTGTCTAGTGAGAATAGTTCTGGTGCATACAATTTGGTCCAAAAAATCTCGTATTCAAGGGTTCGTGAGACATGAGTTGGTTGAAAGATTATAGGAGAGAAGAATTCTTCGTTATAAAACTGGGATTTCTCTTTATCATTTTGGTTTCAGGGAACCTTGAGCGGTGAGCATTTGGTAAAATTAAAGCCAATCTTTACTTACCACCATGGCAGACTCTAGGGATAAGTTTATCAATTCACACACTCATGAACTGTAACATAGTTTTCTAGATTAGCATAAT
CATAGAAAAGTTAGCTAGTTCAGTACAAAACGGTAGAATCTTCTCAAGAACTCGATGAACCTGAACCATTCTCTTTGCAGGAAAAGAGCAAGATTCTAAACAGCCCGCGGGACGCAAGCAGGAACAATTTAGATTTGGCATTGTGCTTGAACGGTCAGAACCAGAACCAGAAGCAAGATTCTGCTGGTAAGAGCTCTTAACATCTGCAAACAAGACCGAGCTCGAGCTGGTTTACTCTATGGACAACGGTATAGCAGAAATTAGTCTCTAATGTGATAGTATCTCATCCAGTGGTAACTTGGAAGCACATTAAATTATAGTTACTGTTTGTATATGATCTAGGGTTAGTGGGGGATGGATCAGTACATAATCTAGTAGAACCTTTTGGGAGCTTGCTTGTTGTTGTGGAAAGTATATTTATTCTTCCTGAATAATGTGCAATCAGAATGCTGTAGGAAGAACCTGCTCAGTATTTTTTTTTTTTTTTTTGGTTCATGAAAATGTTTATAGTGGTTGAAACTGAAGTTGGTAGAGGCGTTTGTGGACTGTTGAAGATTAAGAGAAGCTGTCTGGTCTCCCTTCCTAAGATGAAAAGCATAGAAGAAATTCTAACCTTAAGTTCATTGATATTGGGGGGAAAGAGCAGATTGGAGTGACAAGTGTAATAAATCAAAGATTGTGTTCGTTGAGCATATAGTGGTTAAGGCACTCGTATGTCGTATGTCGTATGTCCTCTAAGGTTGAAAGTTCATCATCGATCTTACATTTGTTGAACTAAAAAGATGGTTGTATAGGATAACGTTTTTGTTTCCTTTTTTAGTTCTTCGTTCTGTTATTTAAAAATACAAACTATTATGATTTTGAA

mRNA sequence

CCTAAAAATACGGCGTCGTTTTAATACAGCGGTCGTCCTCGAGTTCAATCTCCGAGTTGAAAGCTTCGCCGACTCGGTGATTGTGGCGTACGGCGCTGAAGATCACAGCCGAAAAAATCTTTGGGGCTTCAATTTCCATGGAGGGAATCGGATCCAGATTGGGCCGAGCCTCGTCCCGCTACGGTCCGGCGGCCACCGTCTTCAATGGCCCCGTCCGGAAGTGGAAGAAGAAGTGGGTCTTGGCTTCTTCATCTTCCTCCGGTGTCGCTTACCAGTCCTCTTCTCAGTCCCAATCCAACGCTCAGAAGCTTCTTCTCTGCCGCTGGACCCCGATTCCTCCCCCTACTTCCTCGGAAGCAGACGGAGCCTCCGCGCCGTCCGACGAACCGCCCAGACGGAAGTTCCGTTACACTCCTATTGCCGTGCTGGAGGAACAGAAAAATGCAGAGTTGAGAAGTGTCAAAGAGGAAGTTAGAATGAAAGAAATGGACCAGTCTGCTGTGAGGACTGCAAATCAGGGCAATGAGGCTCAGGGGGAGCTAAATGTCAATGAGGTTTTCAAGGAAGAAACTCAGGAAAAGAGCAAGATTCTAAACAGCCCGCGGGACGCAAGCAGGAACAATTTAGATTTGGCATTGTGCTTGAACGGTCAGAACCAGAACCAGAAGCAAGATTCTGCTGGTAAGAGCTCTTAACATCTGCAAACAAGACCGAGCTCGAGCTGGTTTACTCTATGGACAACGGTATAGCAGAAATTAGTCTCTAATGTGATAGTATCTCATCCAGTGGTAACTTGGAAGCACATTAAATTATAGTTACTGTTTGTATATGATCTAGGGTTAGTGGGGGATGGATCAGTACATAATCTAGTAGAACCTTTTGGGAGCTTGCTTGTTGTTGTGGAAAGTATATTTATTCTTCCTGAATAATGTGCAATCAGAATGCTGTAGGAAGAACCTGCTCAGTATTTTTTTTTTTTTTTTTGGTTCATGAAAATGTTTATAGTGGTTGAAACTGAAGTTGGTAGAGGCGTTTGTGGACTGTTGAAGATTAAGAGAAGCTGTCTGGTCTCCCTTCCTAAGATGAAAAGCATAGAAGAAATTCTAACCTTAAGTTCATTGATATTGGGGGGAAAGAGCAGATTGGAGTGACAAGTGTAATAAATCAAAGATTGTGTTCGTTGAGCATATAGTGGTTAAGGCACTCGTATGTCGTATGTCGTATGTCCTCTAAGGTTGAAAGTTCATCATCGATCTTACATTTGTTGAACTAAAAAGATGGTTGTATAGGATAACGTTTTTGTTTCCTTTTTTAGTTCTTCGTTCTGTTATTTAAAAATACAAACTATTATGATTTTGAA

Coding sequence (CDS)

ATGGAGGGAATCGGATCCAGATTGGGCCGAGCCTCGTCCCGCTACGGTCCGGCGGCCACCGTCTTCAATGGCCCCGTCCGGAAGTGGAAGAAGAAGTGGGTCTTGGCTTCTTCATCTTCCTCCGGTGTCGCTTACCAGTCCTCTTCTCAGTCCCAATCCAACGCTCAGAAGCTTCTTCTCTGCCGCTGGACCCCGATTCCTCCCCCTACTTCCTCGGAAGCAGACGGAGCCTCCGCGCCGTCCGACGAACCGCCCAGACGGAAGTTCCGTTACACTCCTATTGCCGTGCTGGAGGAACAGAAAAATGCAGAGTTGAGAAGTGTCAAAGAGGAAGTTAGAATGAAAGAAATGGACCAGTCTGCTGTGAGGACTGCAAATCAGGGCAATGAGGCTCAGGGGGAGCTAAATGTCAATGAGGTTTTCAAGGAAGAAACTCAGGAAAAGAGCAAGATTCTAAACAGCCCGCGGGACGCAAGCAGGAACAATTTAGATTTGGCATTGTGCTTGAACGGTCAGAACCAGAACCAGAAGCAAGATTCTGCTGGTAAGAGCTCTTAA

Protein sequence

MEGIGSRLGRASSRYGPAATVFNGPVRKWKKKWVLASSSSSGVAYQSSSQSQSNAQKLLLCRWTPIPPPTSSEADGASAPSDEPPRRKFRYTPIAVLEEQKNAELRSVKEEVRMKEMDQSAVRTANQGNEAQGELNVNEVFKEETQEKSKILNSPRDASRNNLDLALCLNGQNQNQKQDSAGKSS
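
The protein sequence can be cross-checked against the CDS by translating it with the standard genetic code. The sketch below is illustrative only: `translate` is a hypothetical helper, the standard (nuclear) codon table is assumed, and only the first and last few codons of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip(("".join(c) for c in product(BASES, repeat=3)), AMINO))

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS listed above.
print(translate("ATGGAGGGAATCGGATCCAGATTGGGCCGA"))  # MEGIGSRLGR
# Last six codons, including the TAA stop.
print(translate("GCTGGTAAGAGCTCTTAA"))              # AGKSS
```

Both fragments agree with the start (MEGIGSRLGR) and end (AGKSS) of the protein sequence given above.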
Homology
BLAST of Tan0007298 vs. NCBI nr
Match: XP_022936275.1 (uncharacterized protein LOC111442943 [Cucurbita moschata])

HSP 1 Score: 306.2 bits (783), Expect = 1.9e-79
Identity = 164/185 (88.65%), Positives = 172/185 (92.97%), Query Frame = 0

Query: 1   MEGIGSRLGRASSRYGPAATVFNGPVRKWKKKWVLASSSSSGVAYQSSSQSQSNAQKLLL 60
           MEGIGSRLGRASSRYGPAATVFNGPVR+WKKKWVLASSSSSGV YQ+SSQSQSNAQKLLL
Sbjct: 1   MEGIGSRLGRASSRYGPAATVFNGPVRRWKKKWVLASSSSSGVNYQTSSQSQSNAQKLLL 60

Query: 61  CRWTPIPPPTSSEADGASAPSDEPPRRKFRYTPIAVLEEQKNAELRSVKEEVRMKEMDQS 120
           CRWTPI P TSSEADGA+AP +EPPRRKFRYTPIAVLEEQKNA LRSVK+EVR+KEMDQS
Sbjct: 61  CRWTPIHPSTSSEADGAAAPPEEPPRRKFRYTPIAVLEEQKNAGLRSVKDEVRVKEMDQS 120

Query: 121 AVRTANQGNEAQGELNVNEVFKEETQEKSKILNSPRDASRNNLDLALCLNGQNQNQKQDS 180
           A +TA  GNEAQGELNVNEVFKEETQE SKILNSPRD +RNNLDLALCLNG  QN KQDS
Sbjct: 121 ASKTATVGNEAQGELNVNEVFKEETQETSKILNSPRDGNRNNLDLALCLNG--QNPKQDS 180

Query: 181 AGKSS 186
           AGKSS
Sbjct: 181 AGKSS 183
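
The percentages on an HSP summary line are the match counts divided by the alignment length. A small sketch of recomputing them from such a line follows; `hsp_fractions` is a hypothetical helper, and the regular expression assumes the `label = matches/length` layout used in the alignment reports on this page.

```python
import re

def hsp_fractions(line):
    """Return (label, matches, length, percent) for each 'X = a/b' item."""
    out = []
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(matches), int(length)
        out.append((label, matches, length, round(100 * matches / length, 2)))
    return out

line = "Identity = 164/185 (88.65%), Positives = 172/185 (92.97%), Query Frame = 0"
for label, matches, length, pct in hsp_fractions(line):
    print(f"{label}: {matches}/{length} = {pct}%")
```

For the alignment above this reproduces 88.65% identity and 92.97% positives over 185 aligned positions.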

BLAST of Tan0007298 vs. NCBI nr
Match: XP_023536003.1 (uncharacterized protein LOC111797276 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 302.4 bits (773), Expect = 2.7e-78
Identity = 161/185 (87.03%), Positives = 171/185 (92.43%), Query Frame = 0

Query: 1   MEGIGSRLGRASSRYGPAATVFNGPVRKWKKKWVLASSSSSGVAYQSSSQSQSNAQKLLL 60
           MEGIGSRLGRASSRYGPAATVFNGPVR+WKKKWVLASSSSSGV YQ+SSQSQSNAQKLLL
Sbjct: 1   MEGIGSRLGRASSRYGPAATVFNGPVRRWKKKWVLASSSSSGVNYQTSSQSQSNAQKLLL 60

Query: 61  CRWTPIPPPTSSEADGASAPSDEPPRRKFRYTPIAVLEEQKNAELRSVKEEVRMKEMDQS 120
           CRWTPI P TSSEADG +AP +EPPRRKFRYTPIAVLEEQKNA LRSVK+EVR+KEMDQS
Sbjct: 61  CRWTPIHPSTSSEADGTAAPPEEPPRRKFRYTPIAVLEEQKNAGLRSVKDEVRVKEMDQS 120

Query: 121 AVRTANQGNEAQGELNVNEVFKEETQEKSKILNSPRDASRNNLDLALCLNGQNQNQKQDS 180
           A +TA  GN+AQG+LNVNEVFKEETQE SKILNSPRD +RNNLDLALCLNG  QN KQDS
Sbjct: 121 ASKTATVGNDAQGDLNVNEVFKEETQETSKILNSPRDGNRNNLDLALCLNG--QNPKQDS 180

Query: 181 AGKSS 186
           AGKSS
Sbjct: 181 AGKSS 183

BLAST of Tan0007298 vs. NCBI nr
Match: KAG6592060.1 (hypothetical protein SDJN03_14406, partial [Cucurbita argyrosperma subsp. sororia] >KAG7024937.1 hypothetical protein SDJN02_13757, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 302.0 bits (772), Expect = 3.6e-78
Identity = 162/185 (87.57%), Positives = 171/185 (92.43%), Query Frame = 0

Query: 1   MEGIGSRLGRASSRYGPAATVFNGPVRKWKKKWVLASSSSSGVAYQSSSQSQSNAQKLLL 60
           MEGIGSRLGRASSRYGPAATVFNGPVR+WKKKWVLASSSSSGV YQ+SSQSQSN+QKLLL
Sbjct: 1   MEGIGSRLGRASSRYGPAATVFNGPVRRWKKKWVLASSSSSGVNYQTSSQSQSNSQKLLL 60

Query: 61  CRWTPIPPPTSSEADGASAPSDEPPRRKFRYTPIAVLEEQKNAELRSVKEEVRMKEMDQS 120
           CRWTPI P TSSEADGA+A  +EPPRRKFRYTPIAVLEEQKNA LRSVK+EVR+KEMDQS
Sbjct: 61  CRWTPIHPSTSSEADGAAAAPEEPPRRKFRYTPIAVLEEQKNAGLRSVKDEVRVKEMDQS 120

Query: 121 AVRTANQGNEAQGELNVNEVFKEETQEKSKILNSPRDASRNNLDLALCLNGQNQNQKQDS 180
           A +TA  GNEAQGELNVNEVFKEETQE SKILNSPRD +RNNLDLALCLNG  QN KQDS
Sbjct: 121 ASKTATVGNEAQGELNVNEVFKEETQETSKILNSPRDGNRNNLDLALCLNG--QNPKQDS 180

Query: 181 AGKSS 186
           AGKSS
Sbjct: 181 AGKSS 183

BLAST of Tan0007298 vs. NCBI nr
Match: XP_038898202.1 (uncharacterized protein LOC120085942 [Benincasa hispida])

HSP 1 Score: 301.2 bits (770), Expect = 6.1e-78
Identity = 160/185 (86.49%), Positives = 168/185 (90.81%), Query Frame = 0

Query: 1   MEGIGSRLGRASSRYGPAATVFNGPVRKWKKKWVLASSSSSGVAYQSSSQSQSNAQKLLL 60
           MEGIGSRLGRASSRYGPAATVFNGPVRKWKKKWVLASSSSSG+ Y  SSQSQSNAQKLLL
Sbjct: 1   MEGIGSRLGRASSRYGPAATVFNGPVRKWKKKWVLASSSSSGLTYHPSSQSQSNAQKLLL 60

Query: 61  CRWTPIPPPTSSEADGASAPSDEPPRRKFRYTPIAVLEEQKNAELRSVKEEVRMKEMDQS 120
           CRWTPI PPTSSEADG +AP DEPP+RKFRYTPIAVLEEQKNAELRS+K+EVR+KEMDQ 
Sbjct: 61  CRWTPIHPPTSSEADGTTAPPDEPPKRKFRYTPIAVLEEQKNAELRSIKDEVRVKEMDQL 120

Query: 121 AVRTANQGNEAQGELNVNEVFKEETQEKSKILNSPRDASRNNLDLALCLNGQNQNQKQDS 180
           A +TA  GNEAQGELNVNEVFKEETQE SKILNSP D SRNNLDLALCLNG  + Q QDS
Sbjct: 121 AAKTATVGNEAQGELNVNEVFKEETQETSKILNSPSDGSRNNLDLALCLNG--RKQIQDS 180

Query: 181 AGKSS 186
           AGKSS
Sbjct: 181 AGKSS 183

BLAST of Tan0007298 vs. NCBI nr
Match: XP_022976368.1 (uncharacterized protein LOC111476793 [Cucurbita maxima])

HSP 1 Score: 299.3 bits (765), Expect = 2.3e-77
Identity = 161/185 (87.03%), Positives = 168/185 (90.81%), Query Frame = 0

Query: 1   MEGIGSRLGRASSRYGPAATVFNGPVRKWKKKWVLASSSSSGVAYQSSSQSQSNAQKLLL 60
           MEGIGSRLGRASSRYGPAATVFNGPVR+WKKKWVLASSSSSGV YQ+SSQSQSNA KLLL
Sbjct: 1   MEGIGSRLGRASSRYGPAATVFNGPVRRWKKKWVLASSSSSGVNYQTSSQSQSNAHKLLL 60

Query: 61  CRWTPIPPPTSSEADGASAPSDEPPRRKFRYTPIAVLEEQKNAELRSVKEEVRMKEMDQS 120
           CRWTPI P TSSEADGA+AP +EPPRRKFRYTPIAVLEEQKNA LRSVK+EVR+KE DQS
Sbjct: 61  CRWTPIHPSTSSEADGAAAPPEEPPRRKFRYTPIAVLEEQKNAGLRSVKDEVRVKETDQS 120

Query: 121 AVRTANQGNEAQGELNVNEVFKEETQEKSKILNSPRDASRNNLDLALCLNGQNQNQKQDS 180
           A + A  GNEAQGELNVNEVFKEETQE SKILNSPRD  RNNLDLALCLNG  QN KQDS
Sbjct: 121 ASKMATVGNEAQGELNVNEVFKEETQETSKILNSPRDGDRNNLDLALCLNG--QNPKQDS 180

Query: 181 AGKSS 186
           AGKSS
Sbjct: 181 AGKSS 183

BLAST of Tan0007298 vs. ExPASy TrEMBL
Match: A0A6J1FD66 (uncharacterized protein LOC111442943 OS=Cucurbita moschata OX=3662 GN=LOC111442943 PE=4 SV=1)

HSP 1 Score: 306.2 bits (783), Expect = 9.2e-80
Identity = 164/185 (88.65%), Positives = 172/185 (92.97%), Query Frame = 0

Query: 1   MEGIGSRLGRASSRYGPAATVFNGPVRKWKKKWVLASSSSSGVAYQSSSQSQSNAQKLLL 60
           MEGIGSRLGRASSRYGPAATVFNGPVR+WKKKWVLASSSSSGV YQ+SSQSQSNAQKLLL
Sbjct: 1   MEGIGSRLGRASSRYGPAATVFNGPVRRWKKKWVLASSSSSGVNYQTSSQSQSNAQKLLL 60

Query: 61  CRWTPIPPPTSSEADGASAPSDEPPRRKFRYTPIAVLEEQKNAELRSVKEEVRMKEMDQS 120
           CRWTPI P TSSEADGA+AP +EPPRRKFRYTPIAVLEEQKNA LRSVK+EVR+KEMDQS
Sbjct: 61  CRWTPIHPSTSSEADGAAAPPEEPPRRKFRYTPIAVLEEQKNAGLRSVKDEVRVKEMDQS 120

Query: 121 AVRTANQGNEAQGELNVNEVFKEETQEKSKILNSPRDASRNNLDLALCLNGQNQNQKQDS 180
           A +TA  GNEAQGELNVNEVFKEETQE SKILNSPRD +RNNLDLALCLNG  QN KQDS
Sbjct: 121 ASKTATVGNEAQGELNVNEVFKEETQETSKILNSPRDGNRNNLDLALCLNG--QNPKQDS 180

Query: 181 AGKSS 186
           AGKSS
Sbjct: 181 AGKSS 183

BLAST of Tan0007298 vs. ExPASy TrEMBL
Match: A0A6J1ILX5 (uncharacterized protein LOC111476793 OS=Cucurbita maxima OX=3661 GN=LOC111476793 PE=4 SV=1)

HSP 1 Score: 299.3 bits (765), Expect = 1.1e-77
Identity = 161/185 (87.03%), Positives = 168/185 (90.81%), Query Frame = 0

Query: 1   MEGIGSRLGRASSRYGPAATVFNGPVRKWKKKWVLASSSSSGVAYQSSSQSQSNAQKLLL 60
           MEGIGSRLGRASSRYGPAATVFNGPVR+WKKKWVLASSSSSGV YQ+SSQSQSNA KLLL
Sbjct: 1   MEGIGSRLGRASSRYGPAATVFNGPVRRWKKKWVLASSSSSGVNYQTSSQSQSNAHKLLL 60

Query: 61  CRWTPIPPPTSSEADGASAPSDEPPRRKFRYTPIAVLEEQKNAELRSVKEEVRMKEMDQS 120
           CRWTPI P TSSEADGA+AP +EPPRRKFRYTPIAVLEEQKNA LRSVK+EVR+KE DQS
Sbjct: 61  CRWTPIHPSTSSEADGAAAPPEEPPRRKFRYTPIAVLEEQKNAGLRSVKDEVRVKETDQS 120

Query: 121 AVRTANQGNEAQGELNVNEVFKEETQEKSKILNSPRDASRNNLDLALCLNGQNQNQKQDS 180
           A + A  GNEAQGELNVNEVFKEETQE SKILNSPRD  RNNLDLALCLNG  QN KQDS
Sbjct: 121 ASKMATVGNEAQGELNVNEVFKEETQETSKILNSPRDGDRNNLDLALCLNG--QNPKQDS 180

Query: 181 AGKSS 186
           AGKSS
Sbjct: 181 AGKSS 183

BLAST of Tan0007298 vs. ExPASy TrEMBL
Match: A0A6J1FHM4 (uncharacterized protein LOC111445863 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445863 PE=4 SV=1)

HSP 1 Score: 295.0 bits (754), Expect = 2.1e-76
Identity = 156/185 (84.32%), Positives = 166/185 (89.73%), Query Frame = 0

Query: 1   MEGIGSRLGRASSRYGPAATVFNGPVRKWKKKWVLASSSSSGVAYQSSSQSQSNAQKLLL 60
           MEGIGSRLGR SSRYGPAATVFNGPVRKWKKKWVLASSSSSGVA QSSS SQSNAQKLLL
Sbjct: 1   MEGIGSRLGRVSSRYGPAATVFNGPVRKWKKKWVLASSSSSGVANQSSSHSQSNAQKLLL 60

Query: 61  CRWTPIPPPTSSEADGASAPSDEPPRRKFRYTPIAVLEEQKNAELRSVKEEVRMKEMDQS 120
           CRWTPIPPPTSSE DGA+AP DEPPRRKFRYTPIAVLEEQ+NAELRSV++E+R KEMDQS
Sbjct: 61  CRWTPIPPPTSSEDDGAAAPPDEPPRRKFRYTPIAVLEEQRNAELRSVRDEIRTKEMDQS 120

Query: 121 AVRTANQGNEAQGELNVNEVFKEETQEKSKILNSPRDASRNNLDLALCLNGQNQNQKQDS 180
           A +TA   NEAQGELNVNE+FKEE+QEK KILNS  D  ++NLDLALCLNGQN    QDS
Sbjct: 121 AAKTATVSNEAQGELNVNEIFKEESQEKGKILNSLHDVGKSNLDLALCLNGQN----QDS 180

Query: 181 AGKSS 186
           AGKSS
Sbjct: 181 AGKSS 181

BLAST of Tan0007298 vs. ExPASy TrEMBL
Match: A0A6J1CD90 (uncharacterized protein LOC111010600 OS=Momordica charantia OX=3673 GN=LOC111010600 PE=4 SV=1)

HSP 1 Score: 294.3 bits (752), Expect = 3.6e-76
Identity = 156/185 (84.32%), Positives = 167/185 (90.27%), Query Frame = 0

Query: 1   MEGIGSRLGRASSRYGPAATVFNGPVRKWKKKWVLASSSSSGVAYQSSSQSQSNAQKLLL 60
           MEGIGSRLGRASSRYGPAATVFNGPVRKWKKKWVLASSSSSG++YQ SS SQSNA KLLL
Sbjct: 1   MEGIGSRLGRASSRYGPAATVFNGPVRKWKKKWVLASSSSSGLSYQPSSHSQSNAHKLLL 60

Query: 61  CRWTPIPPPTSSEADGASAPSDEPPRRKFRYTPIAVLEEQKNAELRSVKEEVRMKEMDQS 120
           CRWTPIPPP S+E DG +AP DEPPRRKFRYTPIAVLEEQKNAELRSVK+EVRMKE+DQ+
Sbjct: 61  CRWTPIPPPPSTETDGPTAPPDEPPRRKFRYTPIAVLEEQKNAELRSVKDEVRMKEIDQA 120

Query: 121 AVRTANQGNEAQGELNVNEVFKEETQEKSKILNSPRDASRNNLDLALCLNGQNQNQKQDS 180
           AV TA   NE  GELNVNEVFKEETQE SKILNSPR+A++N LDLALCLNG NQN  Q+S
Sbjct: 121 AVFTATVSNETHGELNVNEVFKEETQETSKILNSPRNANKNRLDLALCLNGGNQN--QES 180

Query: 181 AGKSS 186
           AGKSS
Sbjct: 181 AGKSS 183

BLAST of Tan0007298 vs. ExPASy TrEMBL
Match: A0A0A0K8E5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G145940 PE=4 SV=1)

HSP 1 Score: 293.9 bits (751), Expect = 4.7e-76
Identity = 157/184 (85.33%), Positives = 166/184 (90.22%), Query Frame = 0

Query: 1   MEGIGSRLGRASSRYGPAATVFNGPVRKWKKKWVLASSSSSGVAYQSSSQSQSNAQKLLL 60
           MEGIGSRLGR SSRYGPAATVFNGPVRKWKKKWVLASSSSSG+ YQ+SS SQSNAQKLLL
Sbjct: 1   MEGIGSRLGRVSSRYGPAATVFNGPVRKWKKKWVLASSSSSGLNYQTSSHSQSNAQKLLL 60

Query: 61  CRWTPIPPPTSSEADGASAPSDEPPRRKFRYTPIAVLEEQKNAELRSVKEEVRMKEMDQS 120
           CRWTPI PPTSSEAD A+AP DEPP+RKFRYTPIAVLEEQK+AELRSVK+EVRMKEMDQ 
Sbjct: 61  CRWTPIHPPTSSEADEATAPPDEPPKRKFRYTPIAVLEEQKSAELRSVKDEVRMKEMDQL 120

Query: 121 AVRTANQGNEAQGELNVNEVFKEETQEKSKILNSPRDASRNNLDLALCLNGQNQNQKQDS 180
           A +TA  GNEA GELNVNE+FKEETQE SK LNSPRD SRNNLDLALCLNG  Q + QDS
Sbjct: 121 AAKTATVGNEALGELNVNEIFKEETQETSKNLNSPRDGSRNNLDLALCLNG--QKKVQDS 180

Query: 181 AGKS 185
           AGKS
Sbjct: 181 AGKS 182

BLAST of Tan0007298 vs. TAIR 10
Match: AT4G22320.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G55210.1). )

HSP 1 Score: 103.2 bits (256), Expect = 2.3e-22
Identity = 80/230 (34.78%), Positives = 120/230 (52.17%), Query Frame = 0

Query: 1   MEGIGSRLGRASSRYGPAATVFNGPVRKWKKKWVLASSSS---------------SGVAY 60
           MEG+G+RLGR+S+RYGP ATVF GPVRKWKKKWV  S SS               + V  
Sbjct: 1   MEGVGARLGRSSTRYGP-ATVFTGPVRKWKKKWVHVSPSSKKDNNNSSSGSAAAAASVVN 60

Query: 61  QSSSQSQSNAQKLLLCRWTPIPPPTSSEADGAS---APS--------DEPPRRKFRYTPI 120
             S+   SN   LLL +W P+    +   DG S   +PS        ++PPRR+F+Y PI
Sbjct: 61  GGSNSDGSNGSHLLLYKWAPLSQNGNGNEDGKSESNSPSEDTVATVAEDPPRRRFKYVPI 120

Query: 121 AVLEEQKNAELRSVKEEVRMKEMDQ-------------SAVRTANQGNEAQGEL------ 180
           AVLEEQK  E+  ++E+ +++E D+                +T  + +E + E+      
Sbjct: 121 AVLEEQKK-EITEIEEDDKIEEDDKIDEDNKVEQEDKVDEDKTVEESSEKKAEVEVEEKP 180

BLAST of Tan0007298 vs. TAIR 10
Match: AT4G22320.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G55210.1); Has 8953 Blast hits to 5363 proteins in 542 species: Archae - 33; Bacteria - 806; Metazoa - 2454; Fungi - 831; Plants - 279; Viruses - 151; Other Eukaryotes - 4399 (source: NCBI BLink). )

HSP 1 Score: 100.1 bits (248), Expect = 1.9e-21
Identity = 80/231 (34.63%), Positives = 121/231 (52.38%), Query Frame = 0

Query: 1   MEGIGSRLGRASSRYGPAATVFNGPVRKWKKKWVLASSSS---------------SGVAY 60
           MEG+G+RLGR+S+RYGP ATVF GPVRKWKKKWV  S SS               + V  
Sbjct: 1   MEGVGARLGRSSTRYGP-ATVFTGPVRKWKKKWVHVSPSSKKDNNNSSSGSAAAAASVVN 60

Query: 61  QSSSQSQSNAQKLLLCRWTPIPPPTSSEADGAS---APS--------DEPPRRKFRYTPI 120
             S+   SN   LLL +W P+    +   DG S   +PS        ++PPRR+F+Y PI
Sbjct: 61  GGSNSDGSNGSHLLLYKWAPLSQNGNGNEDGKSESNSPSEDTVATVAEDPPRRRFKYVPI 120

Query: 121 AVLEEQKNAELRSVKEEVRMKEMDQ-------------SAVRTANQGNEAQGEL------ 180
           AVLEEQK  E+  ++E+ +++E D+                +T  + +E + E+      
Sbjct: 121 AVLEEQKK-EITEIEEDDKIEEDDKIDEDNKVEQEDKVDEDKTVEESSEKKAEVEVEEKP 180

BLAST of Tan0007298 vs. TAIR 10
Match: AT5G55210.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G22320.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 97.1 bits (240), Expect = 1.6e-20
Identity = 73/185 (39.46%), Positives = 103/185 (55.68%), Query Frame = 0

Query: 1   MEGIGSRLGRASSRY-GPAAT-VFNGPVRKWKKKWVLASSSSSGVAYQSSSQSQSNAQK- 60
           MEG+GSRL R SSRY GPAAT VF+G VRKWKKKWV  S+SS GV   S S  ++N+   
Sbjct: 1   MEGVGSRLSRTSSRYSGPAATAVFSGRVRKWKKKWVRVSTSSVGVFRASKSNGRNNSNNS 60

Query: 61  -----LLLCRWTPIPPPTSSEAD-GASAPSDEPPRRKFRYTPIAVLEEQKNAELRSVKEE 120
                LLL +WTP+   T + +D   S  ++E P+R+FRY PIA+LE ++    +  + E
Sbjct: 61  NSPHHLLLHKWTPLTSATVTASDANGSGETEESPKRRFRYAPIAMLEHREKVISKDSEIE 120

Query: 121 VRMKEMDQSAVRTANQGNEAQGELNVNEVFKEETQEKSKILNSPRDASRNNLDLALCLNG 177
              +   +S +  A        EL++N    ++T+E          A   NL+L LCLN 
Sbjct: 121 ETEEFDTESPLPKA-------VELDMNLTDSDQTKE----------AKTGNLNLGLCLNS 168

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_022936275.1  1.9e-79  88.65         uncharacterized protein LOC111442943 [Cucurbita moschata]
XP_023536003.1  2.7e-78  87.03         uncharacterized protein LOC111797276 [Cucurbita pepo subsp. pepo]
KAG6592060.1    3.6e-78  87.57         hypothetical protein SDJN03_14406, partial [Cucurbita argyrosperma subsp. sorori...
XP_038898202.1  6.1e-78  86.49         uncharacterized protein LOC120085942 [Benincasa hispida]
XP_022976368.1  2.3e-77  87.03         uncharacterized protein LOC111476793 [Cucurbita maxima]

vs. ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1FD66      9.2e-80  88.65         uncharacterized protein LOC111442943 OS=Cucurbita moschata OX=3662 GN=LOC1114429...
A0A6J1ILX5      1.1e-77  87.03         uncharacterized protein LOC111476793 OS=Cucurbita maxima OX=3661 GN=LOC111476793...
A0A6J1FHM4      2.1e-76  84.32         uncharacterized protein LOC111445863 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1CD90      3.6e-76  84.32         uncharacterized protein LOC111010600 OS=Momordica charantia OX=3673 GN=LOC111010...
A0A0A0K8E5      4.7e-76  85.33         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G145940 PE=4 SV=1

vs. TAIR 10
Match Name      E-value  Identity (%)  Description
AT4G22320.2     2.3e-22  34.78         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G22320.1     1.9e-21  34.63         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G55210.1     1.6e-20  39.46         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description       Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction      coord: 117..185
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction      coord: 155..185
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction      coord: 59..93
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction      coord: 120..134
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction      coord: 140..154
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction      coord: 1..20
None      No IPR available  PANTHER      PTHR34572      GOLGIN FAMILY A PROTEIN  coord: 1..183
None      No IPR available  PANTHER      PTHR34572:SF8  BNAC09G31710D PROTEIN    coord: 1..183

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0007298.1  Tan0007298.1  mRNA