Tan0022094 (gene) Snake gourd v1

Overview
Name: Tan0022094
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrovirus-related Pol polyprotein from transposon 17.6
Location: LG02: 20007605 .. 20012728 (+)
RNA-Seq Expression: Tan0022094
Synteny: Tan0022094
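The Location field can be turned into a gene span with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the page does not say so explicitly), and `parse_location` is a hypothetical helper written for this example, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string like 'LG02: 20007605 .. 20012728 (+)'.

    Returns (seqid, start, end, strand). The layout is assumed from the
    Location field shown in the Overview.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("LG02: 20007605 .. 20012728 (+)")
# Assumption: coordinates are 1-based and inclusive, hence the +1.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 5124 bp on the plus strand of LG02.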
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCTTGCATGAGATTCACTTTCCTCGAGAGAGGTTATTTGTTATCTTAGTTTCTTGGTTTATAAAAGCAGTGCAATAACTCATTAGCATGACCTCAATAAAATAGTTAATTAATTGGTGAAATCTAGTCCCTAAGATATTCGTCTCTCCTGATTGTTCTCAACTTTCACTCTTTGATTTTGTGTTTTCATCTCACTTTAAAGTCTAATTTTCTAGATAAAATTGAGTGGTGTAATTGATAGGAATTAATTGAATTAAATACCCAATCCTATGGACGATACTCGTACTCAAATTCACTATATTAAAACTTGACCCGTACACTTGCGGTATTGCAAAATTTACGCAACAAGTTTTTTGTAGTTCTGTAGGGATTGACGTTTATTTAATTCTTTTGATACTATTACGAAGTCACTTAGTCTTAATCTAGAGTTTGTGTTTCTTTTTGTTGTCTTTGTTTATTTTGATTTTGTGTTGAGTTTTGTGTTTCTTTTAGTTTTGTGTGTTTTTTTTTTGTTAGTTTTAAATTTTATGTTTTTCTATGTTGTGTGTTGCATGCGAAGTAACTCAGAGACTAATCTAGTTCCTTTTGATCTTGAAGCTGATAGAACTTTCAGAAGGAAGCTTAAGGAAACTAAGGACAAAGACAGATTCATGGAGGAGCAACAGGGAGAGTATAAGCCCATTAGAGAGCACTTCCATCCCACTTTTGCTGGAAACCAATCTGGAATTATTTATGCCCCAATTCAGGCTAATAATTTTGAGATAAAGACGGGTCTCATCCAGATGGTTCGAGAAAATGCCTTCAGAGGACATGCCACTGAGGATCCCAATCTGCATCTGAGATCATTTGCTGAAATTGTGGCATGGTAAAAATTAATGGTGTATCTCCTAATGCTATTCGCTTACGATTGTTTCCTTTTTCTCTGCAGGATAAGACAAAATTCTGGTTGGAGTCATTGCCCATAGGTAGTATTAGTACTTGGGAGGAATTAACTCAGGCATTCTTGGCCAAATACTTCCCACCTGCCAAGACTACGAGGCTGCGCACTGAGATTGACACTTTCAGACAGCAGGATGGAGAACAGATGTTTGAGGCTTGGGAAAGATACAAAGATTTACTTAGAAAATGCCCACAACATGGTTATCCTGACTGGTTGCAAATTCAATTATTTTATAATGGATTATCTGGAAATAATAAATCTCTTTTGGATGCAGCTGCTGGGGGATCAATATTTTCTAAGTCTCCAGAGGATGCTCGTACTTTGCTTGAGGAGATGGCTGCCACAAGTTATAATTGGCATTACGAAAGATCAAGCGCCAAACTCAGAAAGAGGATGTACGAGATAGATGAGGTAAGTTCCATCAAGTTCCATATGGCTTCCTAACTAGTGCATTTGAATAAACTTGCATGTACAGTAATGTTGAATCCATAGCCTGCAGTTGCATTACCGCTAACAAACTTGCAGGTGTTGAAGATCTGGAACCTGAGCAGGCTCAATATGTGAACAACAGAGGTTATGGATACCGAGGAAATCAGAATCAAAATTAGTTGCCCACTCATTATCATCCATGTTTGCGCAATCATGAAATTTTTTCTTATGCTAATCAGAAGAATGTGTTGCAGCCTCCTCCAGGTTATGCATCTACATCTACAGCTGAGGGAAAACCATCTCTGGAGGACATGATGAGTTCATTCATCACTGAATCTAGGAATCGAACCAACCAGTTGGAAAATGTAGTGCTAAGTTTGAGCAACACAGTGAACTCACATGGAGCAACCATTAAGAACATAGAGGTACAGATTAGTTAGATGGAAAACACCCTAAACACGCTGCAGAAAGGTAAGTTTCCTAGTGACACTGAGGTGAATCCTAGGGAACAGTGCAAAGCTGTGACCTTAAGAAGTGGTAAGGAGCTTGCTGAAACTGAAAAGGTAAATTCTGAAGAGCAACTCAATTATAATGCTAAGAAGGATACATAGGAAGTGGTATAAGAAGCG
AGCACATCGAAACAAACAGAGAAGATTTCTGATTTCTCTATTCCTATTGTTCCTAACTCTATTCCTTATCCTCAGCGCTTTCAAAGAAAAAGACTGATTTTGATTTTGCTAAATTTCTTGATATGTTTAAGGAACTGCACATTAACATTCCTTTTCCAGATGCTTTGGAACAAATGCCTAAATATGCAAAATTCATGAAGGAAGTCATGTCCAGAAAGAAGAAGTTTGACAAATATGAGACAGTCAATCTCACTGAAGAATGCAGTGCTGTACTCCAGAGGAAGCTACCACAAAAATTGAAAGATCCAGAGAGTTTTACTATTCCTTGCAATATTGGTAGTATTTTTGTTGATAGAGCTTTATGTGATCTAGGGGCTACTATTAATTTGATGCCTTTATTTGTTTACAGAAAACTAGGTCTTGGGGAGGTGCAACCTACAACCATCTCACTACAGTTGGCGGACCGTTCTATCAAATATCCCAGAGGTATTGTAGAGGATGTGTTGGTAAAAGTTGAACTGTTTATTTTCCCTGCAGATTTTGTGGTTCTGGACATGGAAGAAGATTATGAGATCCCAATTATTTTGGGGCGACCTTTCCTGGCCACTGGAAGAGCATTGATAGATGTGCAGGATGGTCAGTTAACTCTAAGGGTTGGTGAGGAAAAGGTTGTATTTGACATTTTTCGATCCTTGAAGTACCCTAATGAGGTAAGTACCTGTCACATGATCGATATTCTGGATAATGCTGTGAATACACCTAGAGAGCTAGTTTTATCTGTTGACACCTTAGAGACTTGTATGACAAATTCAATGGTAGATCAGTTAGAAATTTTGGATCATGAAGTTACTTATTGTAATGAATCACTGAATCAATTGCCTAAACTTCAATATGAACAATTTTCACAACTGCATGTGACTATGGTAGAAGAAACAAAACCTTCTGTTCTAGAGTTGAAACAATTACTTGCACACCTGCATTACACTTTCTTAGAAAATTTATCTACGTATCCTGTGATTATTTCTGCTTTATTTGGTTATTTGATCTTCGTTGTTCTTTTTGTTTTCAGATGTGAAAGAAGAAAAAAAAAGAAAAAAAAGGTCTCGAAACGGCGTCGCAACGCTTATCCAAGCGTTGCAACCATTCTTATTTGGCGTTAAGTCGCTGCGCTTTGTCGATACCACGCGACCCATGAGGACAGCGTTGCCCCTCAGCGTTGCAACGCTGTCGCGAAATCTGGGATTTTAAAATTCCAGATTTCGCCAAAACAATCTCAAATTTCATCTTCTCCTCTCTTTTCTCTCTTCTTTCTTTCAATCTCTCTCAATATTTCCCCAAGATTGTTCACTTTAACCATTCCCCACTCACTTTTCTTCCTATTTTGTGCTAGATTGAGCATTTTGGGGTAAGATTCATATCCATTTGAAGGCAAATCGTGGGTATCTATTTCTCTACCATTTTCTTTCCGATTTTTGTGTTGTTTCTGTAATTAGACCTAAGATTTATTGTGTGTTTTGCTATATTATTGTTTAGGGTGATGAATCTTGGGCTTAGATGAGATTAGGAGATTTGGATAGTTTAATGGAAGTGCCCTAGATGTGTTTTCATGGTAAATTGATGTTGGTTTTCAATTTTTTCCATCTTTGGGCATTATTACTCCTTAAGGCATGAATTTTAAGTTTGGGGTATTGAATTCCTTGATTTTGATAACATAAGTCCCTTCTCCTTAGTTGTGCATTTTGATAAATACATGATTTGGTTGTGTGGTTGTATCTTTGCTCATTAATTTTGGTAAGACAAGTTCTTGATAATTAGGTGCATTTTGATCTTGGTAAGACACTTCCTTGCTCTTTATTTGTGTTTGTGCTTTCTCTTTGGACTACTCTTCCTACATGAGCATGAGATGAGTTGATTGACCTTGTAGTTGAATTAGTATGTCAATTTGGTCATTTGGATGTCTTCACTTACTAATTGGGAACTTATTCTTATTGTTGGATGGG
GCATGTATAATTGTTTTGCTTGGATTGTCTTTGGGCTTATGCTTATTTTTCCCTTGCTCCTAGTAAGCCTATGACGAAGTGGTCCACTTTTATATATGGCTCTCTTGTGTTGTAGTTCTACTTCTTGTTCATGATTGATAGGTGGCGAGCGTCATTGGGTTTTTATCTTTCCCCCCATTTACATCATCCTTGGGTAGAGAACTTCAAAGTTCATTTTCTTTTACACCTTATGCACCATCCATAGGACATCACCTTGGCTTCCCTTTTACTTTGCACTAACTTTGTTACTTGCCCTTGTCTAGTTCTTTCTTTTACACTGTTCCCATATGGCACCAAAAAGGGAAAAGACAACAGCTGACTCTAGTTCTGAATATGATGGACAACGGTTCGTGGATGCTGAAGCTGTGAGTCGGTACCAGAGTTATGTAGTGCATCACGCTCTGGTTCCTGAGCGAGGCATTGTTCCTGATGACTCCCATCACCAGGACATCTACACTACCATTCAGCAACGACATTGAGAGTTGTTGGTGCAACAACCCGAGGCAGGAGTGGTCCTGCTTGTGAGGGAATTTTATGCCAACATGCCTCTGGAGGCTTACCATACCACCGTGCGAGGGAAACACATCCCGTTCGACGTTGTGTCCATTAACAGGTATTGTCACTTACCTGATTATGAGACTGATGAGTACACTATCTATGCCAGGACACAGTTCAACTCTGCAGAGGTACTTGCTACGCTAGCACGACCTGGGGCTCAATGGGTAACAAGGAGAGGCGAGGTCTCCAAACTCCAGACAGCGGACCTCATCGTATCCAGCAGAGTGTGGCAAAGTTTTATCTGTGCTAGATTCCAGCCGGTTGCATACATGAGTGACGTAACGAAGGAGCGCACCATCCTCTTGTACGCCATTGTCACTGGCAAGACGATTGATGTAGGCAAAGTCATCAGTCGATACATCAGGCACGTGCGACGAGGCACGACCACGGGCGGACTAGGCCATCCCTCTCTCATCACCGCCCTATGTCGTCTTGCCAGAGTCACCTGGACCTCCGAGGAGAAGCTGATTCACCCCAAGGGGGTGATGGATAAGAACTACATCATGAGACTTCAGGATCATGATTAA

mRNA sequence

ATGTCTTGCATGAGATTCACTTTCCTCGAGAGAGAGGTTATGGATACCGAGGAAATCAGAATCAAAATTAGTTGCCCACTCATTATCATCCATGTTTGCGCAATCATGAAATTTTTTCTTATGCTAATCAGAAGAATGTGTTGCAGCCTCCTCCAGAAGAAAAAAAAAGAAAAAAAAGGTCTCGAAACGGCGTCGCAACGCTTATCCAAGCGTTGCAACCATTCTTATTTGGCGTTAAAGTTGTTGGTGCAACAACCCGAGGCAGGAGTGGTCCTGCTTGTGAGGGAATTTTATGCCAACATGCCTCTGGAGGCTTACCATACCACCGTGCGAGGGAAACACATCCCGTTCGACGTTGTCAGAGTGTGGCAAAGTTTTATCTGTGCTAGATTCCAGCCGGTTGCATACATGAGTGACGTAACGAAGGAGCGCACCATCCTCTTGTACGCCATTGTCACTGGCAAGACGATTGATGTAGGCAAAGTCATCAGTCGATACATCAGGCACGTGCGACGAGGCACGACCACGGGCGGACTAGGCCATCCCTCTCTCATCACCGCCCTATGTCGTCTTGCCAGAGTCACCTGGACCTCCGAGGAGAAGCTGATTCACCCCAAGGGGGTGATGGATAAGAACTACATCATGAGACTTCAGGATCATGATTAA

Coding sequence (CDS)

ATGTCTTGCATGAGATTCACTTTCCTCGAGAGAGAGGTTATGGATACCGAGGAAATCAGAATCAAAATTAGTTGCCCACTCATTATCATCCATGTTTGCGCAATCATGAAATTTTTTCTTATGCTAATCAGAAGAATGTGTTGCAGCCTCCTCCAGAAGAAAAAAAAAGAAAAAAAAGGTCTCGAAACGGCGTCGCAACGCTTATCCAAGCGTTGCAACCATTCTTATTTGGCGTTAAAGTTGTTGGTGCAACAACCCGAGGCAGGAGTGGTCCTGCTTGTGAGGGAATTTTATGCCAACATGCCTCTGGAGGCTTACCATACCACCGTGCGAGGGAAACACATCCCGTTCGACGTTGTCAGAGTGTGGCAAAGTTTTATCTGTGCTAGATTCCAGCCGGTTGCATACATGAGTGACGTAACGAAGGAGCGCACCATCCTCTTGTACGCCATTGTCACTGGCAAGACGATTGATGTAGGCAAAGTCATCAGTCGATACATCAGGCACGTGCGACGAGGCACGACCACGGGCGGACTAGGCCATCCCTCTCTCATCACCGCCCTATGTCGTCTTGCCAGAGTCACCTGGACCTCCGAGGAGAAGCTGATTCACCCCAAGGGGGTGATGGATAAGAACTACATCATGAGACTTCAGGATCATGATTAA

Protein sequence

MSCMRFTFLEREVMDTEEIRIKISCPLIIIHVCAIMKFFLMLIRRMCCSLLQKKKKEKKGLETASQRLSKRCNHSYLALKLLVQQPEAGVVLLVREFYANMPLEAYHTTVRGKHIPFDVVRVWQSFICARFQPVAYMSDVTKERTILLYAIVTGKTIDVGKVISRYIRHVRRGTTTGGLGHPSLITALCRLARVTWTSEEKLIHPKGVMDKNYIMRLQDHD
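The protein sequence follows from the CDS by codon-wise translation. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper for illustration and is not the pipeline that produced this annotation. It is shown on the first 13 codons and the last 3 codons of the CDS above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS (length divisible by 3) and drop one terminal stop."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    # Remove the stop symbol only if it is the final residue.
    return protein[:-1] if protein.endswith("*") else protein

# First 39 nt of the CDS and its final 9 nt (including the TAA stop codon).
print(translate("ATGTCTTGCATGAGATTCACTTTCCTCGAGAGAGAGGTT"))
print(translate("CATGATTAA"))
```

The two fragments translate to `MSCMRFTFLEREV` and `HD`, matching the start and end of the protein sequence listed above.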
Homology
BLAST of Tan0022094 vs. NCBI nr
Match: KAA0033353.1 (putative S-locus lectin protein kinase family protein [Cucumis melo var. makuwa] >TYJ96637.1 putative S-locus lectin protein kinase family protein [Cucumis melo var. makuwa])

HSP 1 Score: 129.0 bits (323), Expect = 5.0e-26
Identity = 75/192 (39.06%), Positives = 98/192 (51.04%), Query Frame = 0

Query: 83  VQQPEAGVVLLVREFYANMPLEAYHTTVRGKHIPFD------------------------ 142
           V+QPE  VV +VREFYANM   +  + VRG+ + FD                        
Sbjct: 71  VKQPEPAVVSIVREFYANMVEGSSRSFVRGRQVSFDYGTINRYYHLPNFERDEYAIYASE 130

Query: 143 ------VVR-----------------------------VWQSFICARFQPVAYMSDVTKE 202
                 ++R                             VW  FICA+  PVA+ S VTKE
Sbjct: 131 HVDVHQIIRELCQPGAEWIINPGEPIRFKSSNLTVSNQVWHKFICAKLLPVAHTSSVTKE 190

Query: 203 RTILLYAIVTGKTIDVGKVISRYIRHVRRGTTTGGLGHPSLITALCRLARVTWTSEEKLI 216
           R ILLYAI T +++DVGKVI + + ++R+   TGGLGH SLITALCR   V W  +E+L+
Sbjct: 191 RAILLYAIATKRSVDVGKVIHKSLCNIRKSGMTGGLGHSSLITALCRNEGVVWNEKEELV 250

BLAST of Tan0022094 vs. NCBI nr
Match: XP_008458668.1 (PREDICTED: uncharacterized protein LOC103497996 [Cucumis melo])

HSP 1 Score: 129.0 bits (323), Expect = 5.0e-26
Identity = 75/192 (39.06%), Positives = 98/192 (51.04%), Query Frame = 0

Query: 83  VQQPEAGVVLLVREFYANMPLEAYHTTVRGKHIPFD------------------------ 142
           V+QPE  VV +VREFYANM   +  + VRG+ + FD                        
Sbjct: 9   VKQPEPAVVSIVREFYANMVEGSSRSFVRGRQVSFDYGTINRYYHLPNFERDEYAIYASE 68

Query: 143 ------VVR-----------------------------VWQSFICARFQPVAYMSDVTKE 202
                 ++R                             VW  FICA+  PVA+ S VTKE
Sbjct: 69  HVDVHQIIRELCQPGAEWIINPGEPIRFKSSNLTVSNQVWHKFICAKLLPVAHTSSVTKE 128

Query: 203 RTILLYAIVTGKTIDVGKVISRYIRHVRRGTTTGGLGHPSLITALCRLARVTWTSEEKLI 216
           R ILLYAI T +++DVGKVI + + ++R+   TGGLGH SLITALCR   V W  +E+L+
Sbjct: 129 RAILLYAIATKRSVDVGKVIHKSLCNIRKSGMTGGLGHSSLITALCRNEGVVWNEKEELV 188

BLAST of Tan0022094 vs. NCBI nr
Match: KGN46897.1 (hypothetical protein Csa_020731 [Cucumis sativus])

HSP 1 Score: 127.1 bits (318), Expect = 1.9e-25
Identity = 73/194 (37.63%), Positives = 99/194 (51.03%), Query Frame = 0

Query: 83  VQQPEAGVVLLVREFYANMPLEAYHTTVRGKHIPFD------------------------ 142
           V+QPE  V+ +VREFYANM   +  + VRG+ + FD                        
Sbjct: 71  VKQPEPAVLSIVREFYANMVEGSSRSFVRGRQVSFDYGTINRYYHLPNFERDEYDIYASE 130

Query: 143 ------VVR-----------------------------VWQSFICARFQPVAYMSDVTKE 202
                 ++R                             VW  FICA+  PVA+ S VTKE
Sbjct: 131 HVDVHQIIRELCQPGAEWVINPGEPIRFKSSNLTVSNQVWHKFICAKLLPVAHTSSVTKE 190

Query: 203 RTILLYAIVTGKTIDVGKVISRYIRHVRRGTTTGGLGHPSLITALCRLARVTWTSEEKLI 218
           R ILLYAI T +++DVGKVI + + ++R+   TGGLGH SLITALCR   V W  +E+L+
Sbjct: 191 RAILLYAIATKRSVDVGKVIQKSLCNIRKSGMTGGLGHSSLITALCRNEGVVWNEKEELV 250

BLAST of Tan0022094 vs. NCBI nr
Match: EOY01634.1 (Uncharacterized protein TCM_011481 [Theobroma cacao])

HSP 1 Score: 106.7 bits (265), Expect = 2.6e-19
Identity = 52/100 (52.00%), Positives = 73/100 (73.00%), Query Frame = 0

Query: 121 RVWQSFICARFQPVAYMSDVTKERTILLYAIVTGKTIDVGKVISRYIRHVRRGTTTGGLG 180
           +VW  F+ AR  PV ++S +TK+R +LLYA+VTGKTI+VGK+I   I HV  G+   G+ 
Sbjct: 97  KVWYHFLTARLLPVKHVSVITKDRAVLLYAMVTGKTINVGKLIFENILHV-AGSAKEGIW 156

Query: 181 HPSLITALCRLARVTWTSEEKLIHPKGVMDKNYIMRLQDH 221
           +PSLITALC+ ARV W+S E+L+HPK  +D N + RL ++
Sbjct: 157 YPSLITALCKQARVQWSSVEELLHPKVPLDANIVNRLYNY 195

BLAST of Tan0022094 vs. NCBI nr
Match: EOY08849.1 (Uncharacterized protein TCM_024087 [Theobroma cacao])

HSP 1 Score: 102.8 bits (255), Expect = 3.8e-18
Identity = 63/150 (42.00%), Positives = 85/150 (56.67%), Query Frame = 0

Query: 85  QPEAGVVLLVREFYANMPLEAYHTT-VRG--------------KHIPFDVVRVWQSFICA 144
           QP+A VV +VREFYAN+         VRG              + +    ++VW  F+ A
Sbjct: 68  QPDATVVPVVREFYANVVEHVDGVAFVRGAQWKTSHDEPVSFKRSVMKKELQVWLHFVAA 127

Query: 145 RFQPVAYMSDVTKERTILLYAIVTGKTIDVGKVISRYIRHVRRGTTTGGLGHPSLITALC 204
           R     ++SDVTK+R +L+YAIV  K+IDVGKVIS  I H  R T   G+G PSLITALC
Sbjct: 128 RLLSSTHISDVTKDRAVLIYAIVAHKSIDVGKVISHAILHTGR-TKRDGIGFPSLITALC 187

Query: 205 RLARVTWTSEEKLIHPKGVMDKNYIMRLQD 220
             A V W+ +E+L  PK  +    + RL++
Sbjct: 188 ARAGVQWSDKEQLQQPKLPITMGILQRLEE 216

BLAST of Tan0022094 vs. ExPASy TrEMBL
Match: A0A5D3BBY3 (Putative S-locus lectin protein kinase family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold26G00020 PE=4 SV=1)

HSP 1 Score: 129.0 bits (323), Expect = 2.4e-26
Identity = 75/192 (39.06%), Positives = 98/192 (51.04%), Query Frame = 0

Query: 83  VQQPEAGVVLLVREFYANMPLEAYHTTVRGKHIPFD------------------------ 142
           V+QPE  VV +VREFYANM   +  + VRG+ + FD                        
Sbjct: 71  VKQPEPAVVSIVREFYANMVEGSSRSFVRGRQVSFDYGTINRYYHLPNFERDEYAIYASE 130

Query: 143 ------VVR-----------------------------VWQSFICARFQPVAYMSDVTKE 202
                 ++R                             VW  FICA+  PVA+ S VTKE
Sbjct: 131 HVDVHQIIRELCQPGAEWIINPGEPIRFKSSNLTVSNQVWHKFICAKLLPVAHTSSVTKE 190

Query: 203 RTILLYAIVTGKTIDVGKVISRYIRHVRRGTTTGGLGHPSLITALCRLARVTWTSEEKLI 216
           R ILLYAI T +++DVGKVI + + ++R+   TGGLGH SLITALCR   V W  +E+L+
Sbjct: 191 RAILLYAIATKRSVDVGKVIHKSLCNIRKSGMTGGLGHSSLITALCRNEGVVWNEKEELV 250

BLAST of Tan0022094 vs. ExPASy TrEMBL
Match: A0A1S3C7Y0 (uncharacterized protein LOC103497996 OS=Cucumis melo OX=3656 GN=LOC103497996 PE=4 SV=1)

HSP 1 Score: 129.0 bits (323), Expect = 2.4e-26
Identity = 75/192 (39.06%), Positives = 98/192 (51.04%), Query Frame = 0

Query: 83  VQQPEAGVVLLVREFYANMPLEAYHTTVRGKHIPFD------------------------ 142
           V+QPE  VV +VREFYANM   +  + VRG+ + FD                        
Sbjct: 9   VKQPEPAVVSIVREFYANMVEGSSRSFVRGRQVSFDYGTINRYYHLPNFERDEYAIYASE 68

Query: 143 ------VVR-----------------------------VWQSFICARFQPVAYMSDVTKE 202
                 ++R                             VW  FICA+  PVA+ S VTKE
Sbjct: 69  HVDVHQIIRELCQPGAEWIINPGEPIRFKSSNLTVSNQVWHKFICAKLLPVAHTSSVTKE 128

Query: 203 RTILLYAIVTGKTIDVGKVISRYIRHVRRGTTTGGLGHPSLITALCRLARVTWTSEEKLI 216
           R ILLYAI T +++DVGKVI + + ++R+   TGGLGH SLITALCR   V W  +E+L+
Sbjct: 129 RAILLYAIATKRSVDVGKVIHKSLCNIRKSGMTGGLGHSSLITALCRNEGVVWNEKEELV 188

BLAST of Tan0022094 vs. ExPASy TrEMBL
Match: A0A0A0KER1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G149380 PE=4 SV=1)

HSP 1 Score: 127.1 bits (318), Expect = 9.1e-26
Identity = 73/194 (37.63%), Positives = 99/194 (51.03%), Query Frame = 0

Query: 83  VQQPEAGVVLLVREFYANMPLEAYHTTVRGKHIPFD------------------------ 142
           V+QPE  V+ +VREFYANM   +  + VRG+ + FD                        
Sbjct: 71  VKQPEPAVLSIVREFYANMVEGSSRSFVRGRQVSFDYGTINRYYHLPNFERDEYDIYASE 130

Query: 143 ------VVR-----------------------------VWQSFICARFQPVAYMSDVTKE 202
                 ++R                             VW  FICA+  PVA+ S VTKE
Sbjct: 131 HVDVHQIIRELCQPGAEWVINPGEPIRFKSSNLTVSNQVWHKFICAKLLPVAHTSSVTKE 190

Query: 203 RTILLYAIVTGKTIDVGKVISRYIRHVRRGTTTGGLGHPSLITALCRLARVTWTSEEKLI 218
           R ILLYAI T +++DVGKVI + + ++R+   TGGLGH SLITALCR   V W  +E+L+
Sbjct: 191 RAILLYAIATKRSVDVGKVIQKSLCNIRKSGMTGGLGHSSLITALCRNEGVVWNEKEELV 250

BLAST of Tan0022094 vs. ExPASy TrEMBL
Match: A0A0A0KNI1 (AA_kinase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G468460 PE=4 SV=1)

HSP 1 Score: 122.1 bits (305), Expect = 2.9e-24
Identity = 66/165 (40.00%), Positives = 94/165 (56.97%), Query Frame = 0

Query: 83  VQQPEAGVVLLVREFYANMPLEAYHTTVRGKHIPFDVVRVWQSFICARFQ---------- 142
           V+QPE  V+ +VREFYANM   +  + VRG+ + FD   + + +    F+          
Sbjct: 71  VKQPEPAVLSIVREFYANMVEGSSRSFVRGRQVSFDYGTINRYYHLPNFERDEYDIYASE 130

Query: 143 --------------------PVAYMSDVTKERTILLYAIVTGKTIDVGKVISRYIRHVRR 202
                               P+A+ S VTKER ILLYAI T +++DVGKVI + + ++R+
Sbjct: 131 HVDVHQIIRELCQPGAEWLLPMAHTSSVTKERAILLYAIATKRSVDVGKVIQKSLCNIRK 190

Query: 203 GTTTGGLGHPSLITALCRLARVTWTSEEKLIHPKGVMDKNYIMRL 218
              TGGLGH SLITALCR   V W  +E+L+ PK +MDK++IM +
Sbjct: 191 SGMTGGLGHSSLITALCRNEGVVWNEKEELVDPKPIMDKSFIMEI 235

BLAST of Tan0022094 vs. ExPASy TrEMBL
Match: A0A061EH53 (Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_011481 PE=4 SV=1)

HSP 1 Score: 106.7 bits (265), Expect = 1.3e-19
Identity = 52/100 (52.00%), Positives = 73/100 (73.00%), Query Frame = 0

Query: 121 RVWQSFICARFQPVAYMSDVTKERTILLYAIVTGKTIDVGKVISRYIRHVRRGTTTGGLG 180
           +VW  F+ AR  PV ++S +TK+R +LLYA+VTGKTI+VGK+I   I HV  G+   G+ 
Sbjct: 97  KVWYHFLTARLLPVKHVSVITKDRAVLLYAMVTGKTINVGKLIFENILHV-AGSAKEGIW 156

Query: 181 HPSLITALCRLARVTWTSEEKLIHPKGVMDKNYIMRLQDH 221
           +PSLITALC+ ARV W+S E+L+HPK  +D N + RL ++
Sbjct: 157 YPSLITALCKQARVQWSSVEELLHPKVPLDANIVNRLYNY 195
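The percentages in the HSP headers above are plain ratios of matching columns to alignment length. A minimal sketch that reproduces them from the reported counts follows; `percent` is a hypothetical helper, and the counts are copied from the headers rather than recomputed from the alignments.

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage string with 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100.0 * matches / aligned_length:.2f}"

# (identical, positive, alignment length) as reported in the HSP headers.
hsps = {
    "KAA0033353.1": (75, 98, 192),
    "KGN46897.1": (73, 99, 194),
    "EOY01634.1": (52, 73, 100),
    "EOY08849.1": (63, 85, 150),
    "A0A0A0KNI1": (66, 94, 165),
}

for name, (identical, positive, length) in hsps.items():
    print(name, percent(identical, length), percent(positive, length))
```

For example, 75 identical columns out of 192 gives 39.06% identity, and 98 positive columns out of 192 gives 51.04%.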

The following BLAST results are available for this feature:
BLAST hits vs. NCBI nr
Match Name      E-value   Identity (%)  Description
KAA0033353.1    5.0e-26   39.06         putative S-locus lectin protein kinase family protein [Cucumis melo var. makuwa]...
XP_008458668.1  5.0e-26   39.06         PREDICTED: uncharacterized protein LOC103497996 [Cucumis melo]
KGN46897.1      1.9e-25   37.63         hypothetical protein Csa_020731 [Cucumis sativus]
EOY01634.1      2.6e-19   52.00         Uncharacterized protein TCM_011481 [Theobroma cacao]
EOY08849.1      3.8e-18   42.00         Uncharacterized protein TCM_024087 [Theobroma cacao]

BLAST hits vs. ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A5D3BBY3      2.4e-26   39.06         Putative S-locus lectin protein kinase family protein OS=Cucumis melo var. makuw...
A0A1S3C7Y0      2.4e-26   39.06         uncharacterized protein LOC103497996 OS=Cucumis melo OX=3656 GN=LOC103497996 PE=...
A0A0A0KER1      9.1e-26   37.63         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G149380 PE=4 SV=1
A0A0A0KNI1      2.9e-24   40.00         AA_kinase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G468460 P...
A0A061EH53      1.3e-19   52.00         Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_011481 PE=4 SV=1
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source  Source Term  Source Description  Alignment
None      No IPR available  COILS   Coil         Coil                coord: 51..71

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0022094.1  Tan0022094.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:1901564      organonitrogen compound metabolic process
biological_process  GO:0016310      phosphorylation
biological_process  GO:0044238      primary metabolic process
cellular_component  GO:0016020      membrane
molecular_function  GO:0005488      binding
molecular_function  GO:0004672      protein kinase activity