Tan0014035.1 (mRNA) Snake gourd v1

Overview
NameTan0014035.1
TypemRNA
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
LocationLG11: 1066637 .. 1067559 (+)
Sequence length923
RNA-Seq ExpressionTan0014035.1
SyntenyTan0014035.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTAATTTGGTCGGTTTTCAATTGTCTTCATCTTCTTTCTTCTTTCCTCTTTCCTTTTTCTCAGAACCCATTTCTCCTTCTTCTTCCATTCACACATTGTTCGTTTATGCCCCATTCCCATTTCCATGGCGATCGAGGCCGTTTCTCCTGAGATTCCCGTTGTCGCAATGAGCCCTAGAATTTCATTCTCTCACGATTTCATCCAGACCGAAGCTATTCCGGTAGAACAACGCCCTAATTCCCGATCCAATTCCTCCGGTTTCAATTCCAGCTTCGATTTCGATTTCTGCATTCGCGAATGTTCCGATCAGGAATCCTCCTCCGCGGATGAAATTTTCTCTCACGGCAAAATTCTGCCCCTCGAAATCAAGAAGAAATCGGACGAGCCTAATGTGCGAGTCGATCAGTCTTCTTCTTCTTCTTCTTCTAATTATTCTCCTCCATTGACACGAGCGAAATCGCTCGATGCTAATGCGGAGAAATTTTTGAAGAAGGATCGATCGATGAAGGAAATCAAGGCCGCGAGTACTAGTGATTCTGAAGAGAAACAAAGTCCTAATTCCAATTACAAATCATTTTGGCGTTTCAAAAGAAGTAGCAGCTGTGGCTCTGGATATACTCGTAGTTTATGTCCTTTGCCGCTTCTATCACGAAGCAATTCGACTGGCTCTGAGCCCAATATTAAGCGAACGTCATTGTCAAAGGACGGTGTAACTCAAAAACAGAGCTCTCATAGAAATGCGCCAAAAAATTCACAACAGTTTTCGTCTTCAATGGGATATCAGAAACCTCCATTGAAGAAGGTACATGGTTCGTACGGTAATGGAGTTCAAGTAAACCCTATTCTAAATGTTCATTCGGGGAATCTTTTCGGTTTGGGCTCAATATTCTCCTCTGGCAAGGATAGGAGCAAGAAAAAGTGA

mRNA sequence

GTTAATTTGGTCGGTTTTCAATTGTCTTCATCTTCTTTCTTCTTTCCTCTTTCCTTTTTCTCAGAACCCATTTCTCCTTCTTCTTCCATTCACACATTGTTCGTTTATGCCCCATTCCCATTTCCATGGCGATCGAGGCCGTTTCTCCTGAGATTCCCGTTGTCGCAATGAGCCCTAGAATTTCATTCTCTCACGATTTCATCCAGACCGAAGCTATTCCGGTAGAACAACGCCCTAATTCCCGATCCAATTCCTCCGGTTTCAATTCCAGCTTCGATTTCGATTTCTGCATTCGCGAATGTTCCGATCAGGAATCCTCCTCCGCGGATGAAATTTTCTCTCACGGCAAAATTCTGCCCCTCGAAATCAAGAAGAAATCGGACGAGCCTAATGTGCGAGTCGATCAGTCTTCTTCTTCTTCTTCTTCTAATTATTCTCCTCCATTGACACGAGCGAAATCGCTCGATGCTAATGCGGAGAAATTTTTGAAGAAGGATCGATCGATGAAGGAAATCAAGGCCGCGAGTACTAGTGATTCTGAAGAGAAACAAAGTCCTAATTCCAATTACAAATCATTTTGGCGTTTCAAAAGAAGTAGCAGCTGTGGCTCTGGATATACTCGTAGTTTATGTCCTTTGCCGCTTCTATCACGAAGCAATTCGACTGGCTCTGAGCCCAATATTAAGCGAACGTCATTGTCAAAGGACGGTGTAACTCAAAAACAGAGCTCTCATAGAAATGCGCCAAAAAATTCACAACAGTTTTCGTCTTCAATGGGATATCAGAAACCTCCATTGAAGAAGGTACATGGTTCGTACGGTAATGGAGTTCAAGTAAACCCTATTCTAAATGTTCATTCGGGGAATCTTTTCGGTTTGGGCTCAATATTCTCCTCTGGCAAGGATAGGAGCAAGAAAAAGTGA

Coding sequence (CDS)

ATGGCGATCGAGGCCGTTTCTCCTGAGATTCCCGTTGTCGCAATGAGCCCTAGAATTTCATTCTCTCACGATTTCATCCAGACCGAAGCTATTCCGGTAGAACAACGCCCTAATTCCCGATCCAATTCCTCCGGTTTCAATTCCAGCTTCGATTTCGATTTCTGCATTCGCGAATGTTCCGATCAGGAATCCTCCTCCGCGGATGAAATTTTCTCTCACGGCAAAATTCTGCCCCTCGAAATCAAGAAGAAATCGGACGAGCCTAATGTGCGAGTCGATCAGTCTTCTTCTTCTTCTTCTTCTAATTATTCTCCTCCATTGACACGAGCGAAATCGCTCGATGCTAATGCGGAGAAATTTTTGAAGAAGGATCGATCGATGAAGGAAATCAAGGCCGCGAGTACTAGTGATTCTGAAGAGAAACAAAGTCCTAATTCCAATTACAAATCATTTTGGCGTTTCAAAAGAAGTAGCAGCTGTGGCTCTGGATATACTCGTAGTTTATGTCCTTTGCCGCTTCTATCACGAAGCAATTCGACTGGCTCTGAGCCCAATATTAAGCGAACGTCATTGTCAAAGGACGGTGTAACTCAAAAACAGAGCTCTCATAGAAATGCGCCAAAAAATTCACAACAGTTTTCGTCTTCAATGGGATATCAGAAACCTCCATTGAAGAAGGTACATGGTTCGTACGGTAATGGAGTTCAAGTAAACCCTATTCTAAATGTTCATTCGGGGAATCTTTTCGGTTTGGGCTCAATATTCTCCTCTGGCAAGGATAGGAGCAAGAAAAAGTGA

Protein sequence

MAIEAVSPEIPVVAMSPRISFSHDFIQTEAIPVEQRPNSRSNSSGFNSSFDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKSDEPNVRVDQSSSSSSSNYSPPLTRAKSLDANAEKFLKKDRSMKEIKAASTSDSEEKQSPNSNYKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSEPNIKRTSLSKDGVTQKQSSHRNAPKNSQQFSSSMGYQKPPLKKVHGSYGNGVQVNPILNVHSGNLFGLGSIFSSGKDRSKKK
Homology
BLAST of Tan0014035.1 vs. NCBI nr
Match: XP_022990991.1 (uncharacterized protein LOC111487716 isoform X1 [Cucurbita maxima])

HSP 1 Score: 429.1 bits (1102), Expect = 2.8e-116
Identity = 233/265 (87.92%), Postives = 244/265 (92.08%), Query Frame = 0

Query: 1   MAIEAVSPEIPVVAMSPRISFSHDFIQTEAIPVEQRPNSRSNSSGFNSSFDFDFCIRECS 60
           MAIEAVSP+IPVVA+SPRISFSHDFI  EAIPVEQR NSRS+SS FNSSFDFDFCIRECS
Sbjct: 1   MAIEAVSPDIPVVALSPRISFSHDFIHAEAIPVEQRSNSRSSSSAFNSSFDFDFCIRECS 60

Query: 61  DQESSSADEIFSHGKILPLEIKKKSDEPNVRVDQSSSSSSSNYSPPLTRAKSLDANAEKF 120
            QESSSADEIFSHGKILPLEIKKKS+EP++RVDQ   SS SN+SPPLTRAKSLD+NAEK 
Sbjct: 61  HQESSSADEIFSHGKILPLEIKKKSEEPHLRVDQ---SSFSNHSPPLTRAKSLDSNAEKC 120

Query: 121 LKKDRSMKEIKAASTSDSEEKQSPNSNYKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNST 180
           LKKDRS KEIK A +SDSEEKQS  SN+KSFW FKRSSSCGSGYTRSLCPLPLLSRSNST
Sbjct: 121 LKKDRSPKEIKEAVSSDSEEKQS--SNFKSFWGFKRSSSCGSGYTRSLCPLPLLSRSNST 180

Query: 181 GSEPNIKRTSLSKDGVTQKQSSHRNAPKNSQQFSSSMGYQKPPLKKVHGSYGNGVQVNPI 240
           GS PNIKRT+LSKDGVTQKQSSHRNAPKNSQ  SSSMGYQKPPLKKVHGSY N VQVNPI
Sbjct: 181 GSAPNIKRTTLSKDGVTQKQSSHRNAPKNSQHCSSSMGYQKPPLKKVHGSYSNVVQVNPI 240

Query: 241 LNVHSGNLFGLGSIFSSGKDRSKKK 266
           LNVHSGNLFGLGSIFSSGKDRSKKK
Sbjct: 241 LNVHSGNLFGLGSIFSSGKDRSKKK 260

BLAST of Tan0014035.1 vs. NCBI nr
Match: XP_022953381.1 (uncharacterized protein LOC111455949 isoform X1 [Cucurbita moschata])

HSP 1 Score: 428.3 bits (1100), Expect = 4.7e-116
Identity = 232/265 (87.55%), Postives = 244/265 (92.08%), Query Frame = 0

Query: 1   MAIEAVSPEIPVVAMSPRISFSHDFIQTEAIPVEQRPNSRSNSSGFNSSFDFDFCIRECS 60
           MAIEAVSP+IPVVA+SPRISFSHDFI  EAIPVEQRPNSRS+SS FNSSFDFDFCIRECS
Sbjct: 1   MAIEAVSPDIPVVALSPRISFSHDFIHAEAIPVEQRPNSRSSSSAFNSSFDFDFCIRECS 60

Query: 61  DQESSSADEIFSHGKILPLEIKKKSDEPNVRVDQSSSSSSSNYSPPLTRAKSLDANAEKF 120
            QESSSADEIFSHGKILPLEIKKK +EP++R+DQ   SS SN+SPPLTRAKSLD+NAEK 
Sbjct: 61  HQESSSADEIFSHGKILPLEIKKKLEEPHLRLDQ---SSFSNHSPPLTRAKSLDSNAEKC 120

Query: 121 LKKDRSMKEIKAASTSDSEEKQSPNSNYKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNST 180
           LKKDRS KEIK A +SDSEEKQS  SN+KSFW FKRSSSCGSGYTRSLCPLPLLSRSNST
Sbjct: 121 LKKDRSPKEIKDAVSSDSEEKQS--SNFKSFWGFKRSSSCGSGYTRSLCPLPLLSRSNST 180

Query: 181 GSEPNIKRTSLSKDGVTQKQSSHRNAPKNSQQFSSSMGYQKPPLKKVHGSYGNGVQVNPI 240
           GS PNIKRT+LSKDGVTQKQSSHRNAPKNSQ  SSSMGYQKPPLKKVHGSY N VQVNPI
Sbjct: 181 GSAPNIKRTTLSKDGVTQKQSSHRNAPKNSQHCSSSMGYQKPPLKKVHGSYSNVVQVNPI 240

Query: 241 LNVHSGNLFGLGSIFSSGKDRSKKK 266
           LNVHSGNLFGLGSIFSSGKDRSKKK
Sbjct: 241 LNVHSGNLFGLGSIFSSGKDRSKKK 260

BLAST of Tan0014035.1 vs. NCBI nr
Match: XP_023517703.1 (uncharacterized protein LOC111781353 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 427.9 bits (1099), Expect = 6.2e-116
Identity = 231/265 (87.17%), Postives = 243/265 (91.70%), Query Frame = 0

Query: 1   MAIEAVSPEIPVVAMSPRISFSHDFIQTEAIPVEQRPNSRSNSSGFNSSFDFDFCIRECS 60
           MAIEAVSP+IP VA+SPRISFSHDFI  EAIPVEQRPNSRS+SS FNSSFDFDFCIRECS
Sbjct: 1   MAIEAVSPDIPAVALSPRISFSHDFIHAEAIPVEQRPNSRSSSSAFNSSFDFDFCIRECS 60

Query: 61  DQESSSADEIFSHGKILPLEIKKKSDEPNVRVDQSSSSSSSNYSPPLTRAKSLDANAEKF 120
            QESSSADEIFSHGKILPLEIKK+ +EP++RVDQ   SS SN+SPPLTRAKSLD+NAEK 
Sbjct: 61  HQESSSADEIFSHGKILPLEIKKRLEEPHLRVDQ---SSFSNHSPPLTRAKSLDSNAEKC 120

Query: 121 LKKDRSMKEIKAASTSDSEEKQSPNSNYKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNST 180
           LKKDRS KEIK A +SDSEEKQS  SN+KSFW FKRSSSCGSGYTRSLCPLPLLSRSNST
Sbjct: 121 LKKDRSPKEIKDAVSSDSEEKQS--SNFKSFWGFKRSSSCGSGYTRSLCPLPLLSRSNST 180

Query: 181 GSEPNIKRTSLSKDGVTQKQSSHRNAPKNSQQFSSSMGYQKPPLKKVHGSYGNGVQVNPI 240
           GS PNIKRT+LSKDGVTQKQSSHRNAPKNSQ  SSSMGYQKPPLKKVHGSY N VQVNPI
Sbjct: 181 GSAPNIKRTTLSKDGVTQKQSSHRNAPKNSQHCSSSMGYQKPPLKKVHGSYSNAVQVNPI 240

Query: 241 LNVHSGNLFGLGSIFSSGKDRSKKK 266
           LNVHSGNLFGLGSIFSSGKDRSKKK
Sbjct: 241 LNVHSGNLFGLGSIFSSGKDRSKKK 260

BLAST of Tan0014035.1 vs. NCBI nr
Match: KAG7033387.1 (hypothetical protein SDJN02_07443, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 427.6 bits (1098), Expect = 8.1e-116
Identity = 232/265 (87.55%), Postives = 244/265 (92.08%), Query Frame = 0

Query: 1   MAIEAVSPEIPVVAMSPRISFSHDFIQTEAIPVEQRPNSRSNSSGFNSSFDFDFCIRECS 60
           MAIEAVSP+IPVVA+SPRISFSHDFI  EAIPVEQRPNSRS+SS FNSSFDFDFCIRECS
Sbjct: 1   MAIEAVSPDIPVVALSPRISFSHDFIHAEAIPVEQRPNSRSSSSAFNSSFDFDFCIRECS 60

Query: 61  DQESSSADEIFSHGKILPLEIKKKSDEPNVRVDQSSSSSSSNYSPPLTRAKSLDANAEKF 120
            QESSSADEIFSHGKILPLEIKKK +EP++RVDQ   SS SN+SPPLTRAKSLD+NAEK 
Sbjct: 61  HQESSSADEIFSHGKILPLEIKKKLEEPHLRVDQ---SSFSNHSPPLTRAKSLDSNAEKC 120

Query: 121 LKKDRSMKEIKAASTSDSEEKQSPNSNYKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNST 180
           LKKDRS KEIK A +SDSEEKQS  SN+KSFW FKRSSSCGSGYTRSLCPLPLLSRSNST
Sbjct: 121 LKKDRSPKEIKDAVSSDSEEKQS--SNFKSFWGFKRSSSCGSGYTRSLCPLPLLSRSNST 180

Query: 181 GSEPNIKRTSLSKDGVTQKQSSHRNAPKNSQQFSSSMGYQKPPLKKVHGSYGNGVQVNPI 240
           GS PNIKRT+LSKDGVTQKQSSHRNAPKNSQ  SSSMG+QKPPLKKVHGSY N VQVNPI
Sbjct: 181 GSAPNIKRTTLSKDGVTQKQSSHRNAPKNSQHCSSSMGHQKPPLKKVHGSYSNVVQVNPI 240

Query: 241 LNVHSGNLFGLGSIFSSGKDRSKKK 266
           LNVHSGNLFGLGSIFSSGKDRSKKK
Sbjct: 241 LNVHSGNLFGLGSIFSSGKDRSKKK 260

BLAST of Tan0014035.1 vs. NCBI nr
Match: KAG6602701.1 (hypothetical protein SDJN03_07934, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 427.2 bits (1097), Expect = 1.1e-115
Identity = 232/265 (87.55%), Postives = 243/265 (91.70%), Query Frame = 0

Query: 1   MAIEAVSPEIPVVAMSPRISFSHDFIQTEAIPVEQRPNSRSNSSGFNSSFDFDFCIRECS 60
           MAIEAVSP+IPVVA+SPRISFSHDFI  EAIPVEQRPNSRS+SS FNSSFDFDFCIRECS
Sbjct: 1   MAIEAVSPDIPVVALSPRISFSHDFIHAEAIPVEQRPNSRSSSSAFNSSFDFDFCIRECS 60

Query: 61  DQESSSADEIFSHGKILPLEIKKKSDEPNVRVDQSSSSSSSNYSPPLTRAKSLDANAEKF 120
            QESSSADEIFSHGKILPLEIKKK +EP++RVDQ   SS SN+SPPLTRAKSLD+NAEK 
Sbjct: 61  HQESSSADEIFSHGKILPLEIKKKLEEPHLRVDQ---SSFSNHSPPLTRAKSLDSNAEKC 120

Query: 121 LKKDRSMKEIKAASTSDSEEKQSPNSNYKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNST 180
           LKKDRS KEIK A +SDSEEKQS  SN+KSFW FKRSSSCGSGYTRSLCPLPLLSRSNST
Sbjct: 121 LKKDRSPKEIKDAVSSDSEEKQS--SNFKSFWGFKRSSSCGSGYTRSLCPLPLLSRSNST 180

Query: 181 GSEPNIKRTSLSKDGVTQKQSSHRNAPKNSQQFSSSMGYQKPPLKKVHGSYGNGVQVNPI 240
           GS PNIKRT+LSKDGVTQKQSSHRNAPKNSQ  SSSMGYQKPPLKKVHGSY N VQVNPI
Sbjct: 181 GSAPNIKRTTLSKDGVTQKQSSHRNAPKNSQHCSSSMGYQKPPLKKVHGSYSNVVQVNPI 240

Query: 241 LNVHSGNLFGLGSIFSSGKDRSKKK 266
           LNVHSGNLFGLGSIFSS KDRSKKK
Sbjct: 241 LNVHSGNLFGLGSIFSSAKDRSKKK 260

BLAST of Tan0014035.1 vs. ExPASy TrEMBL
Match: A0A6J1JPH3 (uncharacterized protein LOC111487716 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111487716 PE=4 SV=1)

HSP 1 Score: 429.1 bits (1102), Expect = 1.3e-116
Identity = 233/265 (87.92%), Postives = 244/265 (92.08%), Query Frame = 0

Query: 1   MAIEAVSPEIPVVAMSPRISFSHDFIQTEAIPVEQRPNSRSNSSGFNSSFDFDFCIRECS 60
           MAIEAVSP+IPVVA+SPRISFSHDFI  EAIPVEQR NSRS+SS FNSSFDFDFCIRECS
Sbjct: 1   MAIEAVSPDIPVVALSPRISFSHDFIHAEAIPVEQRSNSRSSSSAFNSSFDFDFCIRECS 60

Query: 61  DQESSSADEIFSHGKILPLEIKKKSDEPNVRVDQSSSSSSSNYSPPLTRAKSLDANAEKF 120
            QESSSADEIFSHGKILPLEIKKKS+EP++RVDQ   SS SN+SPPLTRAKSLD+NAEK 
Sbjct: 61  HQESSSADEIFSHGKILPLEIKKKSEEPHLRVDQ---SSFSNHSPPLTRAKSLDSNAEKC 120

Query: 121 LKKDRSMKEIKAASTSDSEEKQSPNSNYKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNST 180
           LKKDRS KEIK A +SDSEEKQS  SN+KSFW FKRSSSCGSGYTRSLCPLPLLSRSNST
Sbjct: 121 LKKDRSPKEIKEAVSSDSEEKQS--SNFKSFWGFKRSSSCGSGYTRSLCPLPLLSRSNST 180

Query: 181 GSEPNIKRTSLSKDGVTQKQSSHRNAPKNSQQFSSSMGYQKPPLKKVHGSYGNGVQVNPI 240
           GS PNIKRT+LSKDGVTQKQSSHRNAPKNSQ  SSSMGYQKPPLKKVHGSY N VQVNPI
Sbjct: 181 GSAPNIKRTTLSKDGVTQKQSSHRNAPKNSQHCSSSMGYQKPPLKKVHGSYSNVVQVNPI 240

Query: 241 LNVHSGNLFGLGSIFSSGKDRSKKK 266
           LNVHSGNLFGLGSIFSSGKDRSKKK
Sbjct: 241 LNVHSGNLFGLGSIFSSGKDRSKKK 260

BLAST of Tan0014035.1 vs. ExPASy TrEMBL
Match: A0A6J1GMV2 (uncharacterized protein LOC111455949 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111455949 PE=4 SV=1)

HSP 1 Score: 428.3 bits (1100), Expect = 2.3e-116
Identity = 232/265 (87.55%), Postives = 244/265 (92.08%), Query Frame = 0

Query: 1   MAIEAVSPEIPVVAMSPRISFSHDFIQTEAIPVEQRPNSRSNSSGFNSSFDFDFCIRECS 60
           MAIEAVSP+IPVVA+SPRISFSHDFI  EAIPVEQRPNSRS+SS FNSSFDFDFCIRECS
Sbjct: 1   MAIEAVSPDIPVVALSPRISFSHDFIHAEAIPVEQRPNSRSSSSAFNSSFDFDFCIRECS 60

Query: 61  DQESSSADEIFSHGKILPLEIKKKSDEPNVRVDQSSSSSSSNYSPPLTRAKSLDANAEKF 120
            QESSSADEIFSHGKILPLEIKKK +EP++R+DQ   SS SN+SPPLTRAKSLD+NAEK 
Sbjct: 61  HQESSSADEIFSHGKILPLEIKKKLEEPHLRLDQ---SSFSNHSPPLTRAKSLDSNAEKC 120

Query: 121 LKKDRSMKEIKAASTSDSEEKQSPNSNYKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNST 180
           LKKDRS KEIK A +SDSEEKQS  SN+KSFW FKRSSSCGSGYTRSLCPLPLLSRSNST
Sbjct: 121 LKKDRSPKEIKDAVSSDSEEKQS--SNFKSFWGFKRSSSCGSGYTRSLCPLPLLSRSNST 180

Query: 181 GSEPNIKRTSLSKDGVTQKQSSHRNAPKNSQQFSSSMGYQKPPLKKVHGSYGNGVQVNPI 240
           GS PNIKRT+LSKDGVTQKQSSHRNAPKNSQ  SSSMGYQKPPLKKVHGSY N VQVNPI
Sbjct: 181 GSAPNIKRTTLSKDGVTQKQSSHRNAPKNSQHCSSSMGYQKPPLKKVHGSYSNVVQVNPI 240

Query: 241 LNVHSGNLFGLGSIFSSGKDRSKKK 266
           LNVHSGNLFGLGSIFSSGKDRSKKK
Sbjct: 241 LNVHSGNLFGLGSIFSSGKDRSKKK 260

BLAST of Tan0014035.1 vs. ExPASy TrEMBL
Match: A0A6J1BVI8 (uncharacterized protein LOC111006095 OS=Momordica charantia OX=3673 GN=LOC111006095 PE=4 SV=1)

HSP 1 Score: 388.7 bits (997), Expect = 2.0e-104
Identity = 217/268 (80.97%), Postives = 230/268 (85.82%), Query Frame = 0

Query: 1   MAIEAVSPEIPVVAMSPRISFSHDFIQTEAIPVEQRPNSRSNSSGFNSSFDFDFCIRECS 60
           MAIEAV P+I V AMSPRISFSHDF Q+EAIPVEQRP SRSNSSG NSS DFDFCIRECS
Sbjct: 1   MAIEAVCPDISVPAMSPRISFSHDFCQSEAIPVEQRPKSRSNSSGLNSSIDFDFCIRECS 60

Query: 61  DQESSSADEIFSHGKILPLEIKKKSDEPNVRVDQSSSSSSSNYSPPLTRAKSLDANAEKF 120
           DQESSSADEIFSHG+ILPLEIKKK ++P V +DQSSS+ +     PL R +SLDA+ EK 
Sbjct: 61  DQESSSADEIFSHGRILPLEIKKKPEDPPVLIDQSSSAPA-----PLARTRSLDADVEKC 120

Query: 121 LKKDRSMKEIKAASTSDSEEKQSPNSNYKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNST 180
           LKKDRS KEIKAA+ SDSEEKQS NS  KSFWRFKRSSSCGSGYT SLCPLPLLSRSNST
Sbjct: 121 LKKDRSSKEIKAAN-SDSEEKQSSNS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNST 180

Query: 181 GSEPNIKRTSLSKDGVTQKQSSHRNAPKNS---QQFSSSMGYQKPPLKKVHGSYGNGVQV 240
           GS PNIKRT LSKDG + KQSSHRN+ K S    Q SSSMGYQKPPLKKVHGSYGNGVQV
Sbjct: 181 GSAPNIKRTPLSKDGASHKQSSHRNSSKTSSSHSQCSSSMGYQKPPLKKVHGSYGNGVQV 240

Query: 241 NPILNVHSGNLFGLGSIFSSGKDRSKKK 266
           NPILNVHSGNLFGLGSIFSS KDRSKKK
Sbjct: 241 NPILNVHSGNLFGLGSIFSSAKDRSKKK 260

BLAST of Tan0014035.1 vs. ExPASy TrEMBL
Match: E5GB42 (Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)

HSP 1 Score: 359.8 bits (922), Expect = 1.0e-95
Identity = 213/268 (79.48%), Postives = 223/268 (83.21%), Query Frame = 0

Query: 1   MAIE-AVSPEIPVVAMSPRISFSHDFIQTEAIPVEQRPNSRSNSSGFNSSFDFDFCIREC 60
           MAIE AVSP+IPV AMSPRISFSHDF  TE IPVEQRPNSRS SSGF+SSFDFDFCI EC
Sbjct: 1   MAIEAAVSPDIPVPAMSPRISFSHDFSLTEPIPVEQRPNSRSKSSGFSSSFDFDFCIPEC 60

Query: 61  SDQESSSADEIFSHGKILPLEIKKKSDEPNVRVDQSSSSSSSNYSPPLTRAKSLDANAEK 120
           SD ESSSADEIFSHGKILPLEIKKK ++   R++ SS +  S   PPLTR KSLD N EK
Sbjct: 61  SDHESSSADEIFSHGKILPLEIKKKPEDQ--RLEHSSLNHHS--PPPLTRTKSLDLNPEK 120

Query: 121 FLKKDRSMKEIKAASTSDSEEKQSPNSNYKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNS 180
            LKKD S KEIKA   SDSEEKQ+ NSN KSFWRFKRSSSCGSGYTRSLCPLPLLSRSNS
Sbjct: 121 CLKKDPSFKEIKATG-SDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNS 180

Query: 181 TGSEP-NIKRTSLSKDGVTQKQSSHRNAPKNSQQ-FSSSMGYQKPPLKKVHGSYGNGVQV 240
           TGS   NIKRT LSKDGV QKQSS RN  KN QQ  SSS G+QKPPLKKVHGSYGNGV+V
Sbjct: 181 TGSSSNNIKRTPLSKDGVNQKQSS-RNGLKNLQQCSSSSTGFQKPPLKKVHGSYGNGVKV 240

Query: 241 NPILNVHSGNLFGLGSIFSSGKDRSKKK 266
           NPILNVHS NLFGLGSIFSS  DRSKKK
Sbjct: 241 NPILNVHSANLFGLGSIFSSAIDRSKKK 262

BLAST of Tan0014035.1 vs. ExPASy TrEMBL
Match: A0A1S3B315 (uncharacterized protein LOC103485440 OS=Cucumis melo OX=3656 GN=LOC103485440 PE=4 SV=1)

HSP 1 Score: 359.8 bits (922), Expect = 1.0e-95
Identity = 213/268 (79.48%), Postives = 223/268 (83.21%), Query Frame = 0

Query: 1   MAIE-AVSPEIPVVAMSPRISFSHDFIQTEAIPVEQRPNSRSNSSGFNSSFDFDFCIREC 60
           MAIE AVSP+IPV AMSPRISFSHDF  TE IPVEQRPNSRS SSGF+SSFDFDFCI EC
Sbjct: 1   MAIEAAVSPDIPVPAMSPRISFSHDFSLTEPIPVEQRPNSRSKSSGFSSSFDFDFCIPEC 60

Query: 61  SDQESSSADEIFSHGKILPLEIKKKSDEPNVRVDQSSSSSSSNYSPPLTRAKSLDANAEK 120
           SD ESSSADEIFSHGKILPLEIKKK ++   R++ SS +  S   PPLTR KSLD N EK
Sbjct: 61  SDHESSSADEIFSHGKILPLEIKKKPEDQ--RLEHSSLNHHS--PPPLTRTKSLDLNPEK 120

Query: 121 FLKKDRSMKEIKAASTSDSEEKQSPNSNYKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNS 180
            LKKD S KEIKA   SDSEEKQ+ NSN KSFWRFKRSSSCGSGYTRSLCPLPLLSRSNS
Sbjct: 121 CLKKDPSFKEIKATG-SDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNS 180

Query: 181 TGSEP-NIKRTSLSKDGVTQKQSSHRNAPKNSQQ-FSSSMGYQKPPLKKVHGSYGNGVQV 240
           TGS   NIKRT LSKDGV QKQSS RN  KN QQ  SSS G+QKPPLKKVHGSYGNGV+V
Sbjct: 181 TGSSSNNIKRTPLSKDGVNQKQSS-RNGLKNLQQCSSSSTGFQKPPLKKVHGSYGNGVKV 240

Query: 241 NPILNVHSGNLFGLGSIFSSGKDRSKKK 266
           NPILNVHS NLFGLGSIFSS  DRSKKK
Sbjct: 241 NPILNVHSANLFGLGSIFSSAIDRSKKK 262

BLAST of Tan0014035.1 vs. TAIR 10
Match: AT1G67050.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G38320.1); Has 617 Blast hits to 318 proteins in 80 species: Archae - 0; Bacteria - 16; Metazoa - 141; Fungi - 62; Plants - 128; Viruses - 2; Other Eukaryotes - 268 (source: NCBI BLink). )

HSP 1 Score: 172.2 bits (435), Expect = 5.7e-43
Identity = 129/272 (47.43%), Postives = 168/272 (61.76%), Query Frame = 0

Query: 15  MSPRISFSHDFIQTEAIPVEQRPNSRSNS--SGFNSSFDFDFCI------RECSDQESSS 74
           MSPRISFS DF Q++AIP+E+RP   SNS  S  NSS DFDFCI       E  DQ S S
Sbjct: 12  MSPRISFSRDFCQSDAIPIEKRPLRSSNSKPSSLNSSIDFDFCIPGGVNSGESFDQGSWS 71

Query: 75  ADEIFSHGKILPLEIKKKSDEPNVRVDQSSSSSSSNYSPPLTRAKSLDANAEKFLKKDRS 134
           ADE+FS+GKILP EIKKK  EP  +  +     S     P +R +    N E+       
Sbjct: 72  ADELFSNGKILPTEIKKK-PEPGKKEPEPKPVKSK----PDSRKQRKQPNEEQ------- 131

Query: 135 MKEIKAASTSDSEEKQSPNSNYKSFWRFKRSSS--CGSGYTRSLCPLPLLSRSNSTGSEP 194
            +E     T  +EEK    +N KSFW FKRSSS  CGS Y RSLCPLPLL+RSNSTGS  
Sbjct: 132 -QEDDVIIT--TEEK----TNTKSFWGFKRSSSLNCGSTYGRSLCPLPLLNRSNSTGSTS 191

Query: 195 NIKRTSLSK---DGVTQKQSSHRNAPKNSQQFSSSMGYQKPPLKKVHGSY------GNGV 254
           + ++ S S+   + V  +QSS  ++  ++    S+ G+ KPPLKK +G Y      G G+
Sbjct: 192 SKQKQSSSRKHNEHVKLQQSSSLSSSSSASSSLSNNGFSKPPLKKSYGGYSYGSHGGGGI 251

Query: 255 QVNPILN-VHSGNLFGLGSIFS-SGKDRSKKK 266
           +V+P++N V SGNLFG GS+FS +G+D++KK+
Sbjct: 252 RVSPVINVVPSGNLFGFGSMFSGNGRDKNKKR 264

BLAST of Tan0014035.1 vs. TAIR 10
Match: AT1G48780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G18300.1); Has 89 Blast hits to 89 proteins in 11 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants - 86; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 80.9 bits (198), Expect = 1.7e-15
Identity = 92/256 (35.94%), Postives = 124/256 (48.44%), Query Frame = 0

Query: 18  RISFSHDFIQTEAIP---VEQRPNSRSNSSGFNSS-FDFDFCIRECSDQ-ESSSADEIFS 77
           RISFS D  Q++  P   +E     R + +  +SS  DF+F I    D  +SS ADEIF+
Sbjct: 9   RISFSSDLGQSDKAPPPVIEPSGLIRRDETLLDSSNSDFEFHISNSFDPGDSSPADEIFA 68

Query: 78  HGKILPLEIKKKSDEPNVRVDQSSSSSSSNYSPPLTRAKSLDANAEKFLKKDRSMKEIKA 137
            G ILP  +   S  P                PP+T + S    + + L    S KE   
Sbjct: 69  DGMILPFHVTAASTVPKRLYKYE--------LPPITSSLSPSPLSPQPLPTKHSEKETNG 128

Query: 138 ASTSDSEEKQSPNSNYKSFWRFKRSSSCGSGYTRSL-CPLPLLSRSNSTGSEPNIKRTSL 197
            ++  + + ++  S+ KSFW FKRSSS      +SL C  P L+RSNSTGS  N KR  L
Sbjct: 129 RASGANSDSEAEKSS-KSFWSFKRSSSLNCDIKKSLICSFPRLTRSNSTGSVTNSKRAML 188

Query: 198 SKDGVTQKQSSHRNAPKNSQQFSSSMGYQKPPLKKVHGSYGNGVQVNPILNVHSGNLFGL 257
               V   + S R++  N+ QF      QK   KK  G  G    V P+LN    + FGL
Sbjct: 189 R--DVNNHRPSSRSSCCNAYQFRP----QKHTGKK--GEGGGSFSVIPVLN--GPSTFGL 245

Query: 258 GSIF--SSGKDRSKKK 266
           GSI   S+ KD++K K
Sbjct: 249 GSILRHSNSKDKTKTK 245

BLAST of Tan0014035.1 vs. TAIR 10
Match: AT1G68330.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G48780.1); Has 155 Blast hits to 147 proteins in 23 species: Archae - 0; Bacteria - 0; Metazoa - 19; Fungi - 3; Plants - 126; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )

HSP 1 Score: 80.5 bits (197), Expect = 2.3e-15
Identity = 100/290 (34.48%), Postives = 140/290 (48.28%), Query Frame = 0

Query: 1   MAIEAVSPEIPVVAMSPRISFSHDFIQTEAIPVEQRPNSRSNSSGFNSSFDFDFCI-REC 60
           MAI+    E     +SPRISFS+D   T+   V      R +S+  +S  +FDFC    C
Sbjct: 1   MAIDVCCSEASGSGISPRISFSYDLDSTDDGEV------RLDSTLLDSGSEFDFCFGSSC 60

Query: 61  SDQESSSADEIFSHGKILPLEIKKKSDEPNV---RVDQSSSSSSSNYSPPLTRAKSLDAN 120
           S QE S ADE+FS GKILP++IKK+   P     RV +S+S SSS+ S       S  ++
Sbjct: 61  SVQEVSPADELFSEGKILPVQIKKEESLPQTVTFRVPRSASLSSSSSS------SSSSSS 120

Query: 121 AEKFLKKDRSMKEIKAASTSDSEEKQSPNSNYKSFWRFKRSSSCGSGYTRS----LCPLP 180
           + +  +K   +KE+     SD E+K         F +FKRS S     +R+    +    
Sbjct: 121 SSRAPEKKMRLKELLLNPESDFEDKPR-----GLFLQFKRSISLNYDKSRNSKGLIRSFH 180

Query: 181 LLSRSNSTGSEPNIKRTSLSKD----GVTQKQSSHRNAPKNSQQFSSSM--GYQKPPLKK 240
            LSRSNST   PN     L K+      T     H+   + S   SSS    Y K PL +
Sbjct: 181 FLSRSNST---PNPNLDLLPKETHHPHKTHNLPKHKPPLRRSSSLSSSSVPFYSKKPLGR 240

Query: 241 VHGSYGN---GVQVNPILNVH--------SGNLFGLGSIFSSGKDRSKKK 266
              S+GN   GV+V+P+LN          +   F +GS+  +GK  +K K
Sbjct: 241 --NSFGNGNGGVRVSPVLNFPPPAFISNVADGFFSIGSL-CNGKTNTKTK 267

BLAST of Tan0014035.1 vs. TAIR 10
Match: AT3G18300.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G48780.1); Has 69 Blast hits to 69 proteins in 7 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 69; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 70.5 bits (171), Expect = 2.3e-12
Identity = 92/275 (33.45%), Postives = 129/275 (46.91%), Query Frame = 0

Query: 18  RISFSHDFIQTE-AIPVEQRPNS---RSNSSGFNSSFDFDFCIRECSDQ-ESSSADEIFS 77
           R SF+ D  Q++   P+EQ+P+    R  +   +S+ DF+F I    D  +SS ADEIF+
Sbjct: 10  RFSFAGDLGQSDKGTPMEQQPSGPVRRDTTLLDSSNSDFEFHISSNFDPGDSSPADEIFA 69

Query: 78  HGKILPL---EIKKKSDEPNVRVDQS-----SSSSSSNYSPPLTRAKSLDANAEKFLKKD 137
            G ILP+   ++   S  P            S+ + S+Y PPL     L  ++ K+  K+
Sbjct: 70  DGMILPVLPFQVTATSTMPKRLYKYELPPIVSAPTLSSYLPPL--PLPLPEHSRKYSVKE 129

Query: 138 R--SMKEIKAASTSDSEEKQSPNSNYKSFWRFKRSSSCGSGYTRSL-CPLPLLSRSNSTG 197
              S+    + + SDSE ++S     KSFW FKRSSS      +SL C  P L+RSNSTG
Sbjct: 130 TRGSLNGRGSGANSDSEAEKSS----KSFWSFKRSSSLNCDIKKSLICSFPRLTRSNSTG 189

Query: 198 SEPNIKRTSLSKDGVTQKQSSHRNAPKNSQQFSSSM---------GYQKPPLK---KVHG 257
           S    KR  L    + +  S     P+     SS M          YQ  P K   K  G
Sbjct: 190 SVAISKREMLR--DINKHSSQRHGVPRPGVNPSSHMRPPSSFCCSSYQFRPQKHAGKNGG 249

Query: 258 SYGNGVQVNPILNVHSGNLFGLGSIFSSGKDRSKK 265
             G    + P++   S   FGLGSI    K++ KK
Sbjct: 250 GRGGSFWIAPVIGGPSP--FGLGSILRLTKEKKKK 274

BLAST of Tan0014035.1 vs. TAIR 10
Match: AT5G38320.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G67050.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 62.8 bits (151), Expect = 4.9e-10
Identity = 82/258 (31.78%), Postives = 115/258 (44.57%), Query Frame = 0

Query: 16  SPRISFSHDFIQTEAIPVEQRPN-SRSNSSGFNSSFDFDFCIRE--CSDQESSSADEIFS 75
           SPRISFS+DF   E+IP+EQR + S  + S F   F  +F I     S + S SA+E F+
Sbjct: 8   SPRISFSNDFCHHESIPIEQRTSQSPYDISNFYWGFPLEFSIPRGAISGESSWSAEEFFN 67

Query: 76  HGKILPLEIKKKSDEPNVRVDQSSSSSSSNYSPPLTRAKSLDANAEKFLKKDRSMKEIKA 135
            GKILP+E+ KK  EP  R      S +  Y   L R + +       ++    + EI+ 
Sbjct: 68  DGKILPIEM-KKIPEPIYR------SKTDKYKTGLPRPEIIP------IEDFEPVLEIEE 127

Query: 136 ASTSDSEEKQSPNSNYKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSEPNIKRTSLS 195
               + E K                             LPLL   NSTGS          
Sbjct: 128 IGDQEYEVK-----------------------------LPLLP-YNSTGS---------- 187

Query: 196 KDGVTQKQSSHRNAPKNSQQFSSSMGYQKPPLKKVHGSY-----GNGVQVNPILN-VHSG 255
                   +S ++   +S   S +  + KP LK  HG Y     G  ++V+  L+ V SG
Sbjct: 188 --------NSIKSQVSSSSSSSFNGSFPKPILKNNHGGYNYKGHGGVIRVSSFLDMVPSG 204

Query: 256 NLFGLGSI-FSSGKDRSK 264
           NLFGLGSI F  G++++K
Sbjct: 248 NLFGLGSINFDGGRNKNK 204

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022990991.12.8e-11687.92uncharacterized protein LOC111487716 isoform X1 [Cucurbita maxima][more]
XP_022953381.14.7e-11687.55uncharacterized protein LOC111455949 isoform X1 [Cucurbita moschata][more]
XP_023517703.16.2e-11687.17uncharacterized protein LOC111781353 [Cucurbita pepo subsp. pepo][more]
KAG7033387.18.1e-11687.55hypothetical protein SDJN02_07443, partial [Cucurbita argyrosperma subsp. argyro... [more]
KAG6602701.11.1e-11587.55hypothetical protein SDJN03_07934, partial [Cucurbita argyrosperma subsp. sorori... [more]
Match NameE-valueIdentityDescription
A0A6J1JPH31.3e-11687.92uncharacterized protein LOC111487716 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1GMV22.3e-11687.55uncharacterized protein LOC111455949 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1BVI82.0e-10480.97uncharacterized protein LOC111006095 OS=Momordica charantia OX=3673 GN=LOC111006... [more]
E5GB421.0e-9579.48Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1[more]
A0A1S3B3151.0e-9579.48uncharacterized protein LOC103485440 OS=Cucumis melo OX=3656 GN=LOC103485440 PE=... [more]
Match NameE-valueIdentityDescription
AT1G67050.15.7e-4347.43unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G48780.11.7e-1535.94unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G68330.12.3e-1534.48unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT3G18300.12.3e-1233.45unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G38320.14.9e-1031.78unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 124..146
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 86..109
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 171..219
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 171..227
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 91..109
NoneNo IPR availablePANTHERPTHR36757BNAANNG22500D PROTEINcoord: 15..265

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Tan0014035Tan0014035gene


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Tan0014035.1-five_prime_utrTan0014035.1-five_prime_utr-LG11:1066637..1066761five_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Tan0014035.1-exonTan0014035.1-exon-LG11:1066637..1067559exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Tan0014035.1-cdsTan0014035.1-cds-LG11:1066762..1067559CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Tan0014035.1Tan0014035.1-proteinpolypeptide