Tan0004788 (gene) Snake gourd v1

Overview
NameTan0004788
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase domain-containing protein
LocationLG05: 33785222 .. 33786498 (+)
RNA-Seq ExpressionTan0004788
SyntenyTan0004788
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCCAAGTCAACCATGAAGCAATTGGGCATCCTGGTAGAAGAGTTATCAAGCAGCAAACTTGTGATCCAAGGCTTCAACCAAGGAGGCCAGCGAGCTATTGGCATGATACGTTTAGAGCTTATCATTGGGGACCTCAAGGCCGACACTCTGTTCCACGTCATAGACTCCAAGACTACCTATAAGTTGCTACTAGGTCGTCCTTGGATTCATGGAAATGGAGTTGTAACTTCTACGTTACACCAGTGCTTCAAGTTTTATCAAGATGGCGTCAAGAAGGTGGAGGCAGATACCAAGCCATTTTCAGAAGTTGAATCCCATTTTGTTGACGCGAAATTTTACATGAAGGGTGACAGTATAGGGGAGACTGTACCAACAAAGATCCCTTTAATAAAAGCGACAGTCGGCCTAAGGAGGTGCCGCAGATTGACGTGAAAGAAGAAACTATCAAATGTACAAATGTGCCTGCTCTGAGAAATAGCGAAGTCTCTACGAATTTCACAAAGTCTGAAATTTCAAAAGATGAAGGAAGCGCGACCTCCCCTGTTTTACGTTACGTTCCTTTGTCTCGACAAAAGAAGGGTGAGTCACCATTTGCAGAGTGCACTAAAAGCCTGACTGTGGGTGAAATCGAAGTTTTGAAGGGAAGCTTCACAATGTCGCTCACGAAGATAACGAAGCAAGAGGTCAAGAAACTTGAAGATGATCGATTGGAAGCAAGTTTACCTAAGAGTCGAACGAAAGATTGGTTTGACCCTAAAGCATATAAACTCCTATCAAAGGCAGGATACGACATCACAACTCATACAGAGTTCAAAAGTATAAAGATCTTCGATGAGAGATCTGAGCTCTCATTAACACAAAAAAAGCTTTTAAAGGAAGGTTATACTATCCCTGCGTCAAGGAAAGGACTGGGCTATAAGTCTCCTGAGTCGGTCTGCATAATAAGAAAAGGGAAGGCAAAGGTGGCAAACGCAAACCACATAACAGTGGAAGAGGTAGACGATTCAGATGAAAAGGAAAACGTTACCCAAAGGACTTTTGTTTTTAGCCGCGTCGGGTCGTTGGTGGCACGACCTTCAGTCCTCCAACGATTTGGCACCACTCAAGTAGAAGAAGAGCGATCATCTCCCGTTTCTGGTTCCACTCGAACCTCAGCCCTCATGAGGATAAGGATGCCCACTGAAAAAGAAGGAAGCATTTTACCAGCCCTTACACCAGACTCCGTTCGACCTTCCGTCCGTCAGAGGCTAGGTGTGCCTGCTGGTGAAAGATAA

mRNA sequence

ATGCCCAAGTCAACCATGAAGCAATTGGGCATCCTGGTAGAAGAGTTATCAAGCAGCAAACTTGTGATCCAAGGCTTCAACCAAGGAGGCCAGCGAGCTATTGGCATGATACGTTTAGAGCTTATCATTGGGGACCTCAAGGCCGACACTCTGTTCCACGTCATAGACTCCAAGACTACCTATAAGTTGCTACTAGGTCGTCCTTGGATTCATGGAAATGGAGTTGTAACTTCTACGTTACACCAGTGCTTCAAGTTTTATCAAGATGGCGTCAAGAAGGTGGAGGCAGATACCAAGCCATTTTCAGAAGTTGAATCCCATTTTGTTGACGCGAAATTTTACATGAAGGGTGACAAAACTATCAAATGTACAAATGTGCCTGCTCTGAGAAATAGCGAAGTCTCTACGAATTTCACAAAGTCTGAAATTTCAAAAGATGAAGGAAGCGCGACCTCCCCTGTTTTACGTTACGTTCCTTTGTCTCGACAAAAGAAGGGTGAGTCACCATTTGCAGAGTGCACTAAAAGCCTGACTGTGGGTGAAATCGAAGTTTTGAAGGGAAGCTTCACAATGTCGCTCACGAAGATAACGAAGCAAGAGGTCAAGAAACTTGAAGATGATCGATTGGAAGCAAGTTTACCTAAGAGTCGAACGAAAGATTGGTTTGACCCTAAAGCATATAAACTCCTATCAAAGGCAGGATACGACATCACAACTCATACAGAGTTCAAAAGTATAAAGATCTTCGATGAGAGATCTGAGCTCTCATTAACACAAAAAAAGCTTTTAAAGGAAGGTTATACTATCCCTGCGTCAAGGAAAGGACTGGGCTATAAGTCTCCTGAGTCGGTCTGCATAATAAGAAAAGGGAAGGCAAAGGTGGCAAACGCAAACCACATAACAGTGGAAGAGGTAGACGATTCAGATGAAAAGGAAAACGTTACCCAAAGGACTTTTGTTTTTAGCCGCGTCGGGTCGTTGGTGGCACGACCTTCAGTCCTCCAACGATTTGGCACCACTCAAGTAGAAGAAGAGCGATCATCTCCCGTTTCTGGTTCCACTCGAACCTCAGCCCTCATGAGGATAAGGATGCCCACTGAAAAAGAAGGAAGCATTTTACCAGCCCTTACACCAGACTCCGTTCGACCTTCCGTCCGTCAGAGGCTAGGTGTGCCTGCTGGTGAAAGATAA

Coding sequence (CDS)

ATGCCCAAGTCAACCATGAAGCAATTGGGCATCCTGGTAGAAGAGTTATCAAGCAGCAAACTTGTGATCCAAGGCTTCAACCAAGGAGGCCAGCGAGCTATTGGCATGATACGTTTAGAGCTTATCATTGGGGACCTCAAGGCCGACACTCTGTTCCACGTCATAGACTCCAAGACTACCTATAAGTTGCTACTAGGTCGTCCTTGGATTCATGGAAATGGAGTTGTAACTTCTACGTTACACCAGTGCTTCAAGTTTTATCAAGATGGCGTCAAGAAGGTGGAGGCAGATACCAAGCCATTTTCAGAAGTTGAATCCCATTTTGTTGACGCGAAATTTTACATGAAGGGTGACAAAACTATCAAATGTACAAATGTGCCTGCTCTGAGAAATAGCGAAGTCTCTACGAATTTCACAAAGTCTGAAATTTCAAAAGATGAAGGAAGCGCGACCTCCCCTGTTTTACGTTACGTTCCTTTGTCTCGACAAAAGAAGGGTGAGTCACCATTTGCAGAGTGCACTAAAAGCCTGACTGTGGGTGAAATCGAAGTTTTGAAGGGAAGCTTCACAATGTCGCTCACGAAGATAACGAAGCAAGAGGTCAAGAAACTTGAAGATGATCGATTGGAAGCAAGTTTACCTAAGAGTCGAACGAAAGATTGGTTTGACCCTAAAGCATATAAACTCCTATCAAAGGCAGGATACGACATCACAACTCATACAGAGTTCAAAAGTATAAAGATCTTCGATGAGAGATCTGAGCTCTCATTAACACAAAAAAAGCTTTTAAAGGAAGGTTATACTATCCCTGCGTCAAGGAAAGGACTGGGCTATAAGTCTCCTGAGTCGGTCTGCATAATAAGAAAAGGGAAGGCAAAGGTGGCAAACGCAAACCACATAACAGTGGAAGAGGTAGACGATTCAGATGAAAAGGAAAACGTTACCCAAAGGACTTTTGTTTTTAGCCGCGTCGGGTCGTTGGTGGCACGACCTTCAGTCCTCCAACGATTTGGCACCACTCAAGTAGAAGAAGAGCGATCATCTCCCGTTTCTGGTTCCACTCGAACCTCAGCCCTCATGAGGATAAGGATGCCCACTGAAAAAGAAGGAAGCATTTTACCAGCCCTTACACCAGACTCCGTTCGACCTTCCGTCCGTCAGAGGCTAGGTGTGCCTGCTGGTGAAAGATAA

Protein sequence

MPKSTMKQLGILVEELSSSKLVIQGFNQGGQRAIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEADTKPFSEVESHFVDAKFYMKGDKTIKCTNVPALRNSEVSTNFTKSEISKDEGSATSPVLRYVPLSRQKKGESPFAECTKSLTVGEIEVLKGSFTMSLTKITKQEVKKLEDDRLEASLPKSRTKDWFDPKAYKLLSKAGYDITTHTEFKSIKIFDERSELSLTQKKLLKEGYTIPASRKGLGYKSPESVCIIRKGKAKVANANHITVEEVDDSDEKENVTQRTFVFSRVGSLVARPSVLQRFGTTQVEEERSSPVSGSTRTSALMRIRMPTEKEGSILPALTPDSVRPSVRQRLGVPAGER
Homology
BLAST of Tan0004788 vs. NCBI nr
Match: XP_031739134.1 (uncharacterized protein LOC116402863 [Cucumis sativus])

HSP 1 Score: 455.7 bits (1171), Expect = 4.1e-124
Identity = 254/418 (60.77%), Postives = 292/418 (69.86%), Query Frame = 0

Query: 1    MPKSTMKQLGILVEELSSSKLVIQGFNQGGQRAIGMIRLELIIGDLKADTLFHVIDSKTT 60
            MPKSTM QLGIL++ELS+SKLVIQGFNQG QRAIGMIRLELIIGDLKA  LFHVIDS+TT
Sbjct: 774  MPKSTMWQLGILMDELSNSKLVIQGFNQGSQRAIGMIRLELIIGDLKASALFHVIDSRTT 833

Query: 61   YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEADTKPFSEVESHFVDAKFYMKGDKT 120
            YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEAD+ PFSE ESHF DAKFY K +  
Sbjct: 834  YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEADSNPFSEAESHFADAKFYSKNNNI 893

Query: 121  IKC--TNVPALR-------------------------NSEVSTNFTKSEISKDEGSATSP 180
            ++      P  +                           E  T+ TK  I KDE +A +P
Sbjct: 894  LEVLPAETPLTKGEDNSQLKSLATTEPHESARTFNSGKGEAYTSSTKGMILKDENAANTP 953

Query: 181  VLRYVPLSRQKKGESPFAECTKSLTVGEIEVLKGSFTMSLTKITKQEVKKLEDDRLEASL 240
            VLRYVPLSR+KKGESPF E  K L VG+IE++K SFT  LTKI KQEVK    D +EA+L
Sbjct: 954  VLRYVPLSRRKKGESPFMESPKGLKVGDIEIIKESFTTPLTKIAKQEVKV---DLVEANL 1013

Query: 241  PKSRTKDWFDPKAYKLLSKAGYDITTHTEFKSIKIFDERSELSLTQKKLLKEGYTIPASR 300
            P+ RTKD FDPKAYKL++KAGYD T HTEFKS++I D R ELS TQKKLL+EG++IP SR
Sbjct: 1014 PQRRTKDGFDPKAYKLMAKAGYDFTAHTEFKSLEIHD-RPELSSTQKKLLREGHSIPVSR 1073

Query: 301  KGLGYKSPESVCIIRKGKAKVANANHITVEEVDDSDEKENVTQRTFVFSRVGSLVARPSV 360
            KGLGYKSPE + I +KGK KV + NHIT+EE D++D KE   QR  VF R+   VARP V
Sbjct: 1074 KGLGYKSPEPIRITKKGKEKVVDINHITIEEDDNTDVKEGDNQRISVFDRIRPSVARPVV 1133

Query: 361  LQRFGTTQVEEERSSPVSGSTRTSALMRIRMPTEKEGSILPALTPDSVRPSVRQRLGV 392
             +R   T+ E ER   V    R S   R+     KE S   ALT  + RPS  +RLGV
Sbjct: 1134 FERLSMTEAERERLQSVPNLERHSVFRRLTTTPIKEESTCHALT--TTRPSAFERLGV 1185

BLAST of Tan0004788 vs. NCBI nr
Match: XP_031737372.1 (uncharacterized protein LOC116402244 [Cucumis sativus])

HSP 1 Score: 455.7 bits (1171), Expect = 4.1e-124
Identity = 254/418 (60.77%), Postives = 292/418 (69.86%), Query Frame = 0

Query: 1   MPKSTMKQLGILVEELSSSKLVIQGFNQGGQRAIGMIRLELIIGDLKADTLFHVIDSKTT 60
           MPKSTM QLGIL++ELS+SKLVIQGFNQG QRAIGMIRLELIIGDLKA  LFHVIDS+TT
Sbjct: 149 MPKSTMWQLGILMDELSNSKLVIQGFNQGSQRAIGMIRLELIIGDLKASALFHVIDSRTT 208

Query: 61  YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEADTKPFSEVESHFVDAKFYMKGDKT 120
           YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEAD+ PFSE ESHF DAKFY K +  
Sbjct: 209 YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEADSNPFSEAESHFADAKFYSKNNNI 268

Query: 121 IKC--TNVPALR-------------------------NSEVSTNFTKSEISKDEGSATSP 180
           ++      P  +                           E  T+ TK  I KDE +A +P
Sbjct: 269 LEVLPAETPLTKGEDNSQLKSLATTEPHESARTFNSGKGEAYTSSTKGMILKDENAANTP 328

Query: 181 VLRYVPLSRQKKGESPFAECTKSLTVGEIEVLKGSFTMSLTKITKQEVKKLEDDRLEASL 240
           VLRYVPLSR+KKGESPF E  K L VG+IE++K SFT  LTKI KQEVK    D +EA+L
Sbjct: 329 VLRYVPLSRRKKGESPFMESPKGLKVGDIEIIKESFTTPLTKIAKQEVKV---DLVEANL 388

Query: 241 PKSRTKDWFDPKAYKLLSKAGYDITTHTEFKSIKIFDERSELSLTQKKLLKEGYTIPASR 300
           P+ RTKD FDPKAYKL++KAGYD T HTEFKS++I D R ELS TQKKLL+EG++IP SR
Sbjct: 389 PQRRTKDGFDPKAYKLMAKAGYDFTAHTEFKSLEIHD-RPELSSTQKKLLREGHSIPVSR 448

Query: 301 KGLGYKSPESVCIIRKGKAKVANANHITVEEVDDSDEKENVTQRTFVFSRVGSLVARPSV 360
           KGLGYKSPE + I +KGK KV + NHIT+EE D++D KE   QR  VF R+   VARP V
Sbjct: 449 KGLGYKSPEPIRITKKGKEKVVDINHITIEEDDNTDVKEGDNQRISVFDRIRPSVARPVV 508

Query: 361 LQRFGTTQVEEERSSPVSGSTRTSALMRIRMPTEKEGSILPALTPDSVRPSVRQRLGV 392
            +R   T+ E ER   V    R S   R+     KE S   ALT  + RPS  +RLGV
Sbjct: 509 FERLSMTEAERERLQSVPNLERHSVFRRLTTTPIKEESTCHALT--TTRPSAFERLGV 560

BLAST of Tan0004788 vs. NCBI nr
Match: XP_031735972.1 (uncharacterized protein LOC116401693 [Cucumis sativus])

HSP 1 Score: 455.7 bits (1171), Expect = 4.1e-124
Identity = 254/418 (60.77%), Postives = 292/418 (69.86%), Query Frame = 0

Query: 1    MPKSTMKQLGILVEELSSSKLVIQGFNQGGQRAIGMIRLELIIGDLKADTLFHVIDSKTT 60
            MPKSTM QLGIL++ELS+SKLVIQGFNQG QRAIGMIRLELIIGDLKA  LFHVIDS+TT
Sbjct: 774  MPKSTMWQLGILMDELSNSKLVIQGFNQGSQRAIGMIRLELIIGDLKASALFHVIDSRTT 833

Query: 61   YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEADTKPFSEVESHFVDAKFYMKGDKT 120
            YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEAD+ PFSE ESHF DAKFY K +  
Sbjct: 834  YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEADSNPFSEAESHFADAKFYSKNNNI 893

Query: 121  IKC--TNVPALR-------------------------NSEVSTNFTKSEISKDEGSATSP 180
            ++      P  +                           E  T+ TK  I KDE +A +P
Sbjct: 894  LEVLPAETPLTKGEDNSQLKSLATTEPHESARTFNSGKGEAYTSSTKGMILKDENAANTP 953

Query: 181  VLRYVPLSRQKKGESPFAECTKSLTVGEIEVLKGSFTMSLTKITKQEVKKLEDDRLEASL 240
            VLRYVPLSR+KKGESPF E  K L VG+IE++K SFT  LTKI KQEVK    D +EA+L
Sbjct: 954  VLRYVPLSRRKKGESPFMESPKGLKVGDIEIIKESFTTPLTKIAKQEVKV---DLVEANL 1013

Query: 241  PKSRTKDWFDPKAYKLLSKAGYDITTHTEFKSIKIFDERSELSLTQKKLLKEGYTIPASR 300
            P+ RTKD FDPKAYKL++KAGYD T HTEFKS++I D R ELS TQKKLL+EG++IP SR
Sbjct: 1014 PQRRTKDGFDPKAYKLMAKAGYDFTAHTEFKSLEIHD-RPELSSTQKKLLREGHSIPVSR 1073

Query: 301  KGLGYKSPESVCIIRKGKAKVANANHITVEEVDDSDEKENVTQRTFVFSRVGSLVARPSV 360
            KGLGYKSPE + I +KGK KV + NHIT+EE D++D KE   QR  VF R+   VARP V
Sbjct: 1074 KGLGYKSPEPIRITKKGKEKVVDINHITIEEDDNTDVKEGDNQRISVFDRIRPSVARPVV 1133

Query: 361  LQRFGTTQVEEERSSPVSGSTRTSALMRIRMPTEKEGSILPALTPDSVRPSVRQRLGV 392
             +R   T+ E ER   V    R S   R+     KE S   ALT  + RPS  +RLGV
Sbjct: 1134 FERLSMTEAERERLQSVPSLERHSVFRRLTTTPIKEESTCHALT--TTRPSAFERLGV 1185

BLAST of Tan0004788 vs. NCBI nr
Match: XP_031740568.1 (uncharacterized protein LOC116403508 [Cucumis sativus])

HSP 1 Score: 455.7 bits (1171), Expect = 4.1e-124
Identity = 254/418 (60.77%), Postives = 292/418 (69.86%), Query Frame = 0

Query: 1    MPKSTMKQLGILVEELSSSKLVIQGFNQGGQRAIGMIRLELIIGDLKADTLFHVIDSKTT 60
            MPKSTM QLGIL++ELS+SKLVIQGFNQG QRAIGMIRLELIIGDLKA  LFHVIDS+TT
Sbjct: 774  MPKSTMWQLGILMDELSNSKLVIQGFNQGSQRAIGMIRLELIIGDLKASALFHVIDSRTT 833

Query: 61   YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEADTKPFSEVESHFVDAKFYMKGDKT 120
            YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEAD+ PFSE ESHF DAKFY K +  
Sbjct: 834  YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEADSNPFSEAESHFADAKFYSKNNNI 893

Query: 121  IKC--TNVPALR-------------------------NSEVSTNFTKSEISKDEGSATSP 180
            ++      P  +                           E  T+ TK  I KDE +A +P
Sbjct: 894  LEVLPAETPLTKGEDNSQLKSLATTEPHESARTFNSGKGEAYTSSTKGMILKDENAANTP 953

Query: 181  VLRYVPLSRQKKGESPFAECTKSLTVGEIEVLKGSFTMSLTKITKQEVKKLEDDRLEASL 240
            VLRYVPLSR+KKGESPF E  K L VG+IE++K SFT  LTKI KQEVK    D +EA+L
Sbjct: 954  VLRYVPLSRRKKGESPFMESPKGLKVGDIEIIKESFTTPLTKIAKQEVKV---DLVEANL 1013

Query: 241  PKSRTKDWFDPKAYKLLSKAGYDITTHTEFKSIKIFDERSELSLTQKKLLKEGYTIPASR 300
            P+ RTKD FDPKAYKL++KAGYD T HTEFKS++I D R ELS TQKKLL+EG++IP SR
Sbjct: 1014 PQRRTKDGFDPKAYKLMAKAGYDFTAHTEFKSLEIHD-RPELSSTQKKLLREGHSIPVSR 1073

Query: 301  KGLGYKSPESVCIIRKGKAKVANANHITVEEVDDSDEKENVTQRTFVFSRVGSLVARPSV 360
            KGLGYKSPE + I +KGK KV + NHIT+EE D++D KE   QR  VF R+   VARP V
Sbjct: 1074 KGLGYKSPEPIRITKKGKEKVVDINHITIEEDDNTDVKEGDNQRISVFDRIRPSVARPVV 1133

Query: 361  LQRFGTTQVEEERSSPVSGSTRTSALMRIRMPTEKEGSILPALTPDSVRPSVRQRLGV 392
             +R   T+ E ER   V    R S   R+     KE S   ALT  + RPS  +RLGV
Sbjct: 1134 FERLSMTEAERERLQSVPNLERHSVFRRLTTTPIKEESTCHALT--TTRPSAFERLGV 1185

BLAST of Tan0004788 vs. NCBI nr
Match: XP_031742032.1 (uncharacterized protein LOC116404025 [Cucumis sativus])

HSP 1 Score: 455.3 bits (1170), Expect = 5.4e-124
Identity = 254/418 (60.77%), Postives = 292/418 (69.86%), Query Frame = 0

Query: 1    MPKSTMKQLGILVEELSSSKLVIQGFNQGGQRAIGMIRLELIIGDLKADTLFHVIDSKTT 60
            MPKSTM QLGIL++ELS+SKLVIQGFNQG QRAIGMIRLELIIGDLKA  LFHVIDS+TT
Sbjct: 774  MPKSTMWQLGILMDELSNSKLVIQGFNQGSQRAIGMIRLELIIGDLKASALFHVIDSRTT 833

Query: 61   YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEADTKPFSEVESHFVDAKFYMKGDKT 120
            YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEAD+ PFSE ESHF DAKFY K +  
Sbjct: 834  YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEADSNPFSEAESHFADAKFYSKNNNI 893

Query: 121  IKC--TNVPALR-------------------------NSEVSTNFTKSEISKDEGSATSP 180
            ++      P  +                           E  T+ TK  I KDE +A +P
Sbjct: 894  LEVLPAETPLTKGEDNSQLKSLATTEPHESARTFNSGKGEAYTSNTKGMILKDENAANTP 953

Query: 181  VLRYVPLSRQKKGESPFAECTKSLTVGEIEVLKGSFTMSLTKITKQEVKKLEDDRLEASL 240
            VLRYVPLSR+KKGESPF E  K L VG+IE++K SFT  LTKI KQEVK    D +EA+L
Sbjct: 954  VLRYVPLSRRKKGESPFMESPKGLKVGDIEIIKESFTTPLTKIAKQEVKV---DLVEANL 1013

Query: 241  PKSRTKDWFDPKAYKLLSKAGYDITTHTEFKSIKIFDERSELSLTQKKLLKEGYTIPASR 300
            P+ RTKD FDPKAYKL++KAGYD T HTEFKS++I D R ELS TQKKLL+EG++IP SR
Sbjct: 1014 PQRRTKDGFDPKAYKLMAKAGYDFTAHTEFKSLEIHD-RPELSSTQKKLLREGHSIPVSR 1073

Query: 301  KGLGYKSPESVCIIRKGKAKVANANHITVEEVDDSDEKENVTQRTFVFSRVGSLVARPSV 360
            KGLGYKSPE + I +KGK KV + NHIT+EE D++D KE   QR  VF R+   VARP V
Sbjct: 1074 KGLGYKSPEPIRITKKGKEKVVDINHITIEEDDNTDVKEGDNQRISVFDRIRPSVARPVV 1133

Query: 361  LQRFGTTQVEEERSSPVSGSTRTSALMRIRMPTEKEGSILPALTPDSVRPSVRQRLGV 392
             +R   T+ E ER   V    R S   R+     KE S   ALT  + RPS  +RLGV
Sbjct: 1134 FERLSMTEAERERLQSVPNLERHSVFRRLTTTPIKEESTCHALT--TTRPSAFERLGV 1185

BLAST of Tan0004788 vs. ExPASy TrEMBL
Match: A0A5A7UD46 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold174G001280 PE=4 SV=1)

HSP 1 Score: 455.3 bits (1170), Expect = 2.6e-124
Identity = 248/418 (59.33%), Postives = 294/418 (70.33%), Query Frame = 0

Query: 1   MPKSTMKQLGILVEELSSSKLVIQGFNQGGQRAIGMIRLELIIGDLKADTLFHVIDSKTT 60
           MPKSTM+QLGIL+EELS+SKL+IQGFNQG QR IGMIRLELIIGDLK   LFHVIDS+TT
Sbjct: 154 MPKSTMRQLGILMEELSNSKLIIQGFNQGSQRIIGMIRLELIIGDLKDSALFHVIDSRTT 213

Query: 61  YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEADTKPFSEVESHFVDAKFYMKGDKT 120
           YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEAD+ PFSE ESHF DAKFY+K D +
Sbjct: 214 YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEADSNPFSEAESHFADAKFYLKNDSS 273

Query: 121 IKCTNVPAL---------------------------RNSEVSTNFTKSEISKDEGSATSP 180
           ++  +V  L                             SE STN  KS I  DE ++  P
Sbjct: 274 LEVVSVEVLLVNREDNLQLKSLASKEPHKSIGTFHSGKSEASTNTAKSVILMDEKTSNPP 333

Query: 181 VLRYVPLSRQKKGESPFAECTKSLTVGEIEVLKGSFTMSLTKITKQEVKKLEDDRLEASL 240
           +LRYVPLSR+KKGESPF E  + L VGEIEVLK SFT  LTKITKQE+K    D  EASL
Sbjct: 334 ILRYVPLSRRKKGESPFVESPQGLKVGEIEVLKESFTTPLTKITKQEIK---IDLTEASL 393

Query: 241 PKSRTKDWFDPKAYKLLSKAGYDITTHTEFKSIKIFDERSELSLTQKKLLKEGYTIPASR 300
           P+ RTKD FDPKAYKL++KAGYD TTHTEFKS+KI+ E+ +LS TQKKLL+EG+ IP SR
Sbjct: 394 PQRRTKDGFDPKAYKLMAKAGYDFTTHTEFKSLKIY-EQPKLSSTQKKLLREGHVIPMSR 453

Query: 301 KGLGYKSPESVCIIRKGKAKVANANHITVEEVDDSDEKENVTQRTFVFSRVGSLVARPSV 360
           KGLGYKSPE + I RKGK KV ++NHIT++E D  +EKE  +QRT  F R+   VAR  V
Sbjct: 454 KGLGYKSPEPIRITRKGKEKVVDSNHITLKEADSMEEKEGDSQRTSAFDRISPHVARAPV 513

Query: 361 LQRFGTTQVEEERSSPVSGSTRTSALMRIRMPTEKEGSILPALTPDSVRPSVRQRLGV 392
            ++   T+ E +     S   R SA  R+ +  ++E  I    T  + +PS  +RL +
Sbjct: 514 FEKLSMTEAERKDHQSTSNLDRRSAFQRLTITFKEEKGI--CQTSMTTKPSAFERLSI 565

BLAST of Tan0004788 vs. ExPASy TrEMBL
Match: A0A5A7TJZ7 (Retrotransposon gag protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold306G003220 PE=4 SV=1)

HSP 1 Score: 443.4 bits (1139), Expect = 1.0e-120
Identity = 242/391 (61.89%), Postives = 284/391 (72.63%), Query Frame = 0

Query: 1   MPKSTMKQLGILVEELSSSKLVIQGFNQGGQRAIGMIRLELIIGDLKADTLFHVIDSKTT 60
           MPKSTM+QLGIL++ELS+SKLVIQGFNQG +R IGMIRLELIIGDLKA  LFHVID +TT
Sbjct: 593 MPKSTMRQLGILIDELSNSKLVIQGFNQGSKRVIGMIRLELIIGDLKASALFHVIDLRTT 652

Query: 61  YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEADTKPFSEVESHFVDAKFYMKGDKT 120
           YKLLL RPWIHGNGVVTS LHQCFKFYQDG+KKVEAD  PFSE ESHF DAKFY+K D +
Sbjct: 653 YKLLLDRPWIHGNGVVTSALHQCFKFYQDGIKKVEADPNPFSEAESHFADAKFYLKNDNS 712

Query: 121 IKCTNVPALRNSEVSTNFTKSEISKDEGSATSPVLRYVPLSRQKKGESPFAECTKSLTVG 180
            +  +V  +   + ST+  KS I  DE ++  P+LRYVPLSR KKGESPF +  + L VG
Sbjct: 713 PEAVSV-EVPLGKASTSTRKSMILMDEKTSNPPILRYVPLSRCKKGESPFVKSPQGLKVG 772

Query: 181 EIEVLKGSFTMSLTKITKQEVKKLEDDRLEASLPKSRTKDWFDPKAYKLLSKAGYDITTH 240
           +IEVLK SFT   TKITKQE+K    D  EASLP+S TKD FDPKAYKL++K GYD TTH
Sbjct: 773 DIEVLKESFTTPFTKITKQEIK---IDLTEASLPQSWTKDGFDPKAYKLMAKVGYDFTTH 832

Query: 241 TEFKSIKIFDERSELSLTQKKLLKEGYTIPASRKGLGYKSPESVCIIRKGKAKVANANHI 300
            EFKS+KI  E+ +LS TQKKLL+EG+ IP SRKGLGYKSPE + I RKGK KV + NHI
Sbjct: 833 IEFKSLKI-HEQPKLSSTQKKLLREGHAIPMSRKGLGYKSPEPIRITRKGKEKVVDNNHI 892

Query: 301 TVEEVDDSDEKENVTQRTFVFSRVGSLVARPSVLQRFGTTQVEEERSSPVSGSTRTSALM 360
           TV+EVD   EKE   QRT  F R+   VAR  V +R   T+VE +     S   R SA  
Sbjct: 893 TVKEVDSMKEKEGDGQRTSAFDRISPHVARTPVFERLSMTEVERKDHQSTSNLDRRSAFQ 952

Query: 361 RIRMPTEKEGSILPALTPDSVRPSVRQRLGV 392
           R+ M ++KE  I  A    + RPS  +RL +
Sbjct: 953 RLTMTSKKEKGICQAWM--TTRPSAFERLSM 976

BLAST of Tan0004788 vs. ExPASy TrEMBL
Match: A0A5A7UEC9 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold131G00760 PE=4 SV=1)

HSP 1 Score: 423.3 bits (1087), Expect = 1.1e-114
Identity = 229/372 (61.56%), Postives = 269/372 (72.31%), Query Frame = 0

Query: 1   MPKSTMKQLGILVEELSSSKLVIQGFNQGGQRAIGMIRLELIIGDLKADTLFHVIDSKTT 60
           MPKSTM+Q GIL+EEL +SKLVIQGFNQG QR IG+IRLELIIGDLKA  LFHVI+S+TT
Sbjct: 1   MPKSTMRQSGILMEELLNSKLVIQGFNQGSQRVIGIIRLELIIGDLKASALFHVINSRTT 60

Query: 61  YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEADTKPFSEVESHFVDAKFYMKGDKT 120
           YKLLLGRPWIHGNGVVTSTLH CFKFYQDGVKKVE D+ PFSE ESHF DAKFY+K D +
Sbjct: 61  YKLLLGRPWIHGNGVVTSTLHHCFKFYQDGVKKVEVDSNPFSEAESHFADAKFYLKNDSS 120

Query: 121 IKCTN--VPALR-------------------------NSEVSTNFTKSEISKDEGSATSP 180
            +  +  VP +                           SEVST+  KS I  DE ++  P
Sbjct: 121 PEAVSKEVPLVNREDNLQLKSLASKEPHKSIGTFHSGKSEVSTSTAKSVILMDEKTSNPP 180

Query: 181 VLRYVPLSRQKKGESPFAECTKSLTVGEIEVLKGSFTMSLTKITKQEVKKLEDDRLEASL 240
           +LRYVPLSR+KKGESPF E  + L VG+IEVLK SFT  LTKI K+E+K    D  EASL
Sbjct: 181 ILRYVPLSRRKKGESPFVESPQGLKVGDIEVLKESFTTPLTKINKKEIK---IDLTEASL 240

Query: 241 PKSRTKDWFDPKAYKLLSKAGYDITTHTEFKSIKIFDERSELSLTQKKLLKEGYTIPASR 300
           P+ RTKD FDPKAYKL++KAGYD  THTEFK +KI  E+ +LS TQKKLL+EG+ IP SR
Sbjct: 241 PQRRTKDGFDPKAYKLMAKAGYDFITHTEFKILKI-HEQPKLSSTQKKLLQEGHAIPMSR 300

Query: 301 KGLGYKSPESVCIIRKGKAKVANANHITVEEVDDSDEKENVTQRTFVFSRVGSLVARPSV 346
           KGLGYKSPE + I RKGK KV ++NHITV+EVD  +EKE+ +QRT  F R+   VAR  V
Sbjct: 301 KGLGYKSPEPIRITRKGKEKVVDSNHITVKEVDSMEEKEDDSQRTSAFDRISPHVARAPV 360

BLAST of Tan0004788 vs. ExPASy TrEMBL
Match: A0A5A7TZU9 (Ribonuclease H OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold498G00940 PE=4 SV=1)

HSP 1 Score: 417.2 bits (1071), Expect = 7.9e-113
Identity = 237/426 (55.63%), Postives = 288/426 (67.61%), Query Frame = 0

Query: 1    MPKSTMKQLGILVEELSSSKLVIQGFNQGGQRAIGMIRLELIIGDLKADTLFHVIDSKTT 60
            +PKSTM QLGI VEELS+SKLVIQGFNQG QRAIG +RLE++IGDL+A T+FHVIDS+TT
Sbjct: 766  LPKSTMNQLGISVEELSNSKLVIQGFNQGAQRAIGTVRLEVVIGDLQASTIFHVIDSRTT 825

Query: 61   YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEADTKPFSEVESHFVDAKFYMKGD-- 120
            YK+LLGRPWIH NG+VTSTLHQCFKFY+ G+KKV+AD++PF++ ESHF DAKFY K +  
Sbjct: 826  YKMLLGRPWIHENGIVTSTLHQCFKFYKQGIKKVDADSRPFTKAESHFADAKFYTKSEDV 885

Query: 121  KTIKCTNVPALR-------------------------NSEVSTNFTKSEISKDEGSAT-- 180
              I  T VP  +                         N E++T  TK    + E  AT  
Sbjct: 886  SEIISTEVPVTKGTFKNEQEMITSKKSSKGDALNSQQNGELTTE-TKLRAPEAEKIATLQ 945

Query: 181  -----SPVLRYVPLSRQKKGESPFAECTKSLTVGEIEVLKGSFTMSLTKITKQEVKKLED 240
                  PVLRY+PLSR+KKGESPF EC+K+LTV   E+LK +FT  LTKI K E KK+E 
Sbjct: 946  KEVSNPPVLRYIPLSRRKKGESPFTECSKNLTVKNTEILKENFTAPLTKIEKGEAKKIEK 1005

Query: 241  DRLEASLPKSRTKDWFDPKAYKLLSKAGYDITTHTEFKSIKIFDERSELSLTQKKLLKEG 300
              L+A LP+ RT + FDPKAYKL++KAGYD TT TE KS+KIFDER ELS TQKKL K+G
Sbjct: 1006 KDLQAYLPERRTVEGFDPKAYKLMAKAGYDFTTRTELKSVKIFDERPELSPTQKKLQKQG 1065

Query: 301  YTIPASRKGLGYKSPESVCIIRKGKAKVANANHITVEEVDDSDEKENV-TQRTFVFSRVG 360
            Y+IP SR G+GY+S E V I  KGKAKVAN  HITVEE  DS+E + V +QR+ VF R+ 
Sbjct: 1066 YSIPNSRAGIGYQSSEPVRITGKGKAKVANTCHITVEESKDSEEGKKVRSQRSSVFDRIA 1125

Query: 361  SLVARPSVLQRFGTTQVEEERSSPVSGSTRTSALMRIRMPTEKEGSILPALTPDSVRPSV 392
                RPSV QR  T+  ++        STR SA  R+    +K  SI P  TP + R S 
Sbjct: 1126 FSAIRPSVFQRVSTSIAKDSNQVSTCSSTRLSAFQRLNTSAKKVRSISP--TP-TTRKSA 1185

BLAST of Tan0004788 vs. ExPASy TrEMBL
Match: A0A5D3BIH8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold180G001270 PE=4 SV=1)

HSP 1 Score: 417.2 bits (1071), Expect = 7.9e-113
Identity = 237/426 (55.63%), Postives = 288/426 (67.61%), Query Frame = 0

Query: 1    MPKSTMKQLGILVEELSSSKLVIQGFNQGGQRAIGMIRLELIIGDLKADTLFHVIDSKTT 60
            +PKSTM QLGI VEELS+SKLVIQGFNQG QRAIG +RLE++IGDL+A T+FHVIDS+TT
Sbjct: 744  LPKSTMNQLGISVEELSNSKLVIQGFNQGAQRAIGTVRLEVVIGDLQASTIFHVIDSRTT 803

Query: 61   YKLLLGRPWIHGNGVVTSTLHQCFKFYQDGVKKVEADTKPFSEVESHFVDAKFYMKGD-- 120
            YK+LLGRPWIH NG+VTSTLHQCFKFY+ G+KKV+AD++PF++ ESHF DAKFY K +  
Sbjct: 804  YKMLLGRPWIHENGIVTSTLHQCFKFYKQGIKKVDADSRPFTKAESHFADAKFYTKSEDV 863

Query: 121  KTIKCTNVPALR-------------------------NSEVSTNFTKSEISKDEGSAT-- 180
              I  T VP  +                         N E++T  TK    + E  AT  
Sbjct: 864  SEIISTEVPVTKGTFKNEQEMITSKKSSKGDALNSQQNGELTTE-TKLRAPEAEKIATLQ 923

Query: 181  -----SPVLRYVPLSRQKKGESPFAECTKSLTVGEIEVLKGSFTMSLTKITKQEVKKLED 240
                  PVLRY+PLSR+KKGESPF EC+K+LTV   E+LK +FT  LTKI K E KK+E 
Sbjct: 924  KEVSNPPVLRYIPLSRRKKGESPFTECSKNLTVKNTEILKENFTAPLTKIEKGEAKKIEK 983

Query: 241  DRLEASLPKSRTKDWFDPKAYKLLSKAGYDITTHTEFKSIKIFDERSELSLTQKKLLKEG 300
              L+A LP+ RT + FDPKAYKL++KAGYD TT TE KS+KIFDER ELS TQKKL K+G
Sbjct: 984  KDLQAYLPERRTVEGFDPKAYKLMAKAGYDFTTRTELKSVKIFDERPELSPTQKKLQKQG 1043

Query: 301  YTIPASRKGLGYKSPESVCIIRKGKAKVANANHITVEEVDDSDEKENV-TQRTFVFSRVG 360
            Y+IP SR G+GY+S E V I  KGKAKVAN  HITVEE  DS+E + V +QR+ VF R+ 
Sbjct: 1044 YSIPNSRAGIGYQSSEPVRITGKGKAKVANTCHITVEESKDSEEGKKVRSQRSSVFDRIA 1103

Query: 361  SLVARPSVLQRFGTTQVEEERSSPVSGSTRTSALMRIRMPTEKEGSILPALTPDSVRPSV 392
                RPSV QR  T+  ++        STR SA  R+    +K  SI P  TP + R S 
Sbjct: 1104 FSAIRPSVFQRVSTSIAKDSNQVSTCSSTRLSAFQRLNTSAKKVRSISP--TP-TTRKSA 1163

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_031739134.14.1e-12460.77uncharacterized protein LOC116402863 [Cucumis sativus][more]
XP_031737372.14.1e-12460.77uncharacterized protein LOC116402244 [Cucumis sativus][more]
XP_031735972.14.1e-12460.77uncharacterized protein LOC116401693 [Cucumis sativus][more]
XP_031740568.14.1e-12460.77uncharacterized protein LOC116403508 [Cucumis sativus][more]
XP_031742032.15.4e-12460.77uncharacterized protein LOC116404025 [Cucumis sativus][more]
Match NameE-valueIdentityDescription
A0A5A7UD462.6e-12459.33Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... [more]
A0A5A7TJZ71.0e-12061.89Retrotransposon gag protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaf... [more]
A0A5A7UEC91.1e-11461.56Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... [more]
A0A5A7TZU97.9e-11355.63Ribonuclease H OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold498G00940... [more]
A0A5D3BIH87.9e-11355.63Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 1..97
e-value: 4.6E-7
score: 31.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 365..396
NoneNo IPR availableCDDcd00303retropepsin_likecoord: 1..71
e-value: 5.00579E-7
score: 45.4052

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0004788.1Tan0004788.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006259 DNA metabolic process