Tan0013718 (gene) Snake gourd v1

Overview
NameTan0013718
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrotransposon protein
LocationLG03: 57733088 .. 57734084 (-)
RNA-Seq ExpressionTan0013718
SyntenyTan0013718
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACAGGTACTTCAAAACACTCCAAGCATACATGGACGAAGGTTGAGGATGCGAGATTGGTCGAGTCATTGGTCTATTTGGTACATAATGGGTGGCGGTCAGACAATGGAACATTCAGGCCTGGATATCTCCAACATCTTCAAAAGATGCTAGCAGAGAAACTGCCAAATTCATCATTAGAACTGAATACAATAGACTGCAAAGTGAGAACTTTGAAAAAGCAATACAATGTTGTTGCAGAGATGCTTGGGAATGGTTGTAGCGGATTTGGATGGAACGAAGAATTTAAACGTGTTGAGGCAGAGAAGGAGGTATTTGATGCATGGGTCAAGGTGAGGTAGTACTTTTAATAAAAAAATTATCATTAGCATTCGTTGTAGATATACGTAATATATTTTCTCTCACATGCAGAGCCATACAAATGCCAAGGGGATGACGAACAAACCATTTCCACACTATGATGATCTTGCATTTGTATTTGGAAAAGACAGAGCAACGGGGATGGGCGCGGAGACCCTAGGGAAAATGGCCTCTAACGCTGCAGAACAACTGGAGGAGGAGATCCGACTGGGATCGCAAGACTTCTTCGGGACGGAGCAACGACCAATGGAGAATCCATGCACTACTGATGTAGGGGAGGAAGAATTGCCAGAGACTCCTACTAATAGACGTAATACATCTGGCACGTCTTCTCGATGTACTGGTAGCAAAAGAAAGAGATCATGCTTCCAAACTGAAATGATTGATGTTGTGCGGACAACAATGGACATCCAAACAACTCACATGCAACACCTCCTATCGTGGCAGAAGGAGAAGTACGAGTTGGAGGCTGCACGACGGAAGGAAGTGGTCGACCTGTTGTACCAAATAGAAGGGTTGACCGAGCATGATCGTGTATCTCTGATAGACATGCTTGTCACTGATATACAGAAGACAGACTACTTCCTACAGGTCCCACCTCAATCGAGGAGGGCGTCTTTGCTTTTGCGTCTCTAG

mRNA sequence

ATGACAGGTACTTCAAAACACTCCAAGCATACATGGACGAAGGTTGAGGATGCGAGATTGGTCGAGTCATTGGTCTATTTGGTACATAATGGGTGGCGGTCAGACAATGGAACATTCAGGCCTGGATATCTCCAACATCTTCAAAAGATGCTAGCAGAGAAACTGCCAAATTCATCATTAGAACTGAATACAATAGACTGCAAAGTGAGAACTTTGAAAAAGCAATACAATGTTGTTGCAGAGATGCTTGGGAATGGTTGTAGCGGATTTGGATGGAACGAAGAATTTAAACGTGTTGAGGCAGAGAAGGAGGTATTTGATGCATGGGTCAAGAGCCATACAAATGCCAAGGGGATGACGAACAAACCATTTCCACACTATGATGATCTTGCATTTGTATTTGGAAAAGACAGAGCAACGGGGATGGGCGCGGAGACCCTAGGGAAAATGGCCTCTAACGCTGCAGAACAACTGGAGGAGGAGATCCGACTGGGATCGCAAGACTTCTTCGGGACGGAGCAACGACCAATGGAGAATCCATGCACTACTGATGTAGGGGAGGAAGAATTGCCAGAGACTCCTACTAATAGACGTAATACATCTGGCACGTCTTCTCGATGTACTGGTAGCAAAAGAAAGAGATCATGCTTCCAAACTGAAATGATTGATGTTGTGCGGACAACAATGGACATCCAAACAACTCACATGCAACACCTCCTATCGTGGCAGAAGGAGAAGTACGAGTTGGAGGCTGCACGACGGAAGGAAGTGGTCGACCTGTTGTACCAAATAGAAGGGTTGACCGAGCATGATCGTGTATCTCTGATAGACATGCTTGTCACTGATATACAGAAGACAGACTACTTCCTACAGGTCCCACCTCAATCGAGGAGGGCGTCTTTGCTTTTGCGTCTCTAG

Coding sequence (CDS)

ATGACAGGTACTTCAAAACACTCCAAGCATACATGGACGAAGGTTGAGGATGCGAGATTGGTCGAGTCATTGGTCTATTTGGTACATAATGGGTGGCGGTCAGACAATGGAACATTCAGGCCTGGATATCTCCAACATCTTCAAAAGATGCTAGCAGAGAAACTGCCAAATTCATCATTAGAACTGAATACAATAGACTGCAAAGTGAGAACTTTGAAAAAGCAATACAATGTTGTTGCAGAGATGCTTGGGAATGGTTGTAGCGGATTTGGATGGAACGAAGAATTTAAACGTGTTGAGGCAGAGAAGGAGGTATTTGATGCATGGGTCAAGAGCCATACAAATGCCAAGGGGATGACGAACAAACCATTTCCACACTATGATGATCTTGCATTTGTATTTGGAAAAGACAGAGCAACGGGGATGGGCGCGGAGACCCTAGGGAAAATGGCCTCTAACGCTGCAGAACAACTGGAGGAGGAGATCCGACTGGGATCGCAAGACTTCTTCGGGACGGAGCAACGACCAATGGAGAATCCATGCACTACTGATGTAGGGGAGGAAGAATTGCCAGAGACTCCTACTAATAGACGTAATACATCTGGCACGTCTTCTCGATGTACTGGTAGCAAAAGAAAGAGATCATGCTTCCAAACTGAAATGATTGATGTTGTGCGGACAACAATGGACATCCAAACAACTCACATGCAACACCTCCTATCGTGGCAGAAGGAGAAGTACGAGTTGGAGGCTGCACGACGGAAGGAAGTGGTCGACCTGTTGTACCAAATAGAAGGGTTGACCGAGCATGATCGTGTATCTCTGATAGACATGCTTGTCACTGATATACAGAAGACAGACTACTTCCTACAGGTCCCACCTCAATCGAGGAGGGCGTCTTTGCTTTTGCGTCTCTAG

Protein sequence

MTGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVRTLKKQYNVVAEMLGNGCSGFGWNEEFKRVEAEKEVFDAWVKSHTNAKGMTNKPFPHYDDLAFVFGKDRATGMGAETLGKMASNAAEQLEEEIRLGSQDFFGTEQRPMENPCTTDVGEEELPETPTNRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTTHMQHLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVSLIDMLVTDIQKTDYFLQVPPQSRRASLLLRL
Homology
BLAST of Tan0013718 vs. NCBI nr
Match: XP_038887234.1 (uncharacterized protein LOC120077425 [Benincasa hispida])

HSP 1 Score: 318.2 bits (814), Expect = 7.9e-83
Identity = 169/298 (56.71%), Postives = 205/298 (68.79%), Query Frame = 0

Query: 1   MTGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSL 60
           MTG SK SKH W+KVEDARLVE+L+YLV  GWRSDNGTFRPGYLQHL+++L EK+P  +L
Sbjct: 1   MTGNSKRSKHVWSKVEDARLVEALLYLVETGWRSDNGTFRPGYLQHLEQILHEKVPGCAL 60

Query: 61  ELNTIDCKVRTLKKQYNVVAEMLGNGCSGFGWNEEFKRVEAEKEVFDAWVKSHTNAKGMT 120
             NTI+CKVR+LKKQYN V+EML    SGF WNEEFK V+ E+E+FD WV+SH NAKGM 
Sbjct: 61  NKNTIECKVRSLKKQYNAVSEMLSQ--SGFNWNEEFKCVQVEREIFDLWVRSHPNAKGMW 120

Query: 121 NKPFPHYDDLAFVFGKDRATGMGAETLGKMASNAAEQLEEEIRLGSQDFFGTEQRPMENP 180
            KPFPHYDDL+ VFGKDRA                            D    E R  E+P
Sbjct: 121 KKPFPHYDDLSAVFGKDRA----------------------------DCHTPEVRQTESP 180

Query: 181 CTTDVGEEELPETPTNRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTTHMQHLL 240
              D  +EE  E  T R +    SSR  GSKRKRS FQ EMID+V++T+++Q+THM  L 
Sbjct: 181 LNQDEIDEEPAEQSTGRASVPTESSR--GSKRKRSSFQVEMIDIVKSTVEMQSTHMGRLA 240

Query: 241 SWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVSLIDMLVTDIQKTDYFLQVPPQSRR 299
           SWQ EKYELE    KEVV+ +Y I+ L E+D+V+LID++VTDIQKTD FL VP  +R+
Sbjct: 241 SWQNEKYELEL---KEVVNAIYNIDDLEENDQVTLIDLIVTDIQKTDCFLAVPEHARK 263

BLAST of Tan0013718 vs. NCBI nr
Match: XP_038896380.1 (uncharacterized protein LOC120084641 [Benincasa hispida])

HSP 1 Score: 311.6 bits (797), Expect = 7.4e-81
Identity = 165/298 (55.37%), Postives = 201/298 (67.45%), Query Frame = 0

Query: 1   MTGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSL 60
           M G+ K SKH W+KVED +LVE+L+YLV  GWRSDNGTFR GYLQ+L+++L EK+P  +L
Sbjct: 1   MAGSGKRSKHVWSKVEDTKLVEALLYLVETGWRSDNGTFRLGYLQYLERILHEKVPGCAL 60

Query: 61  ELNTIDCKVRTLKKQYNVVAEMLGNGCSGFGWNEEFKRVEAEKEVFDAWVKSHTNAKGMT 120
             NTI+CKVR+LKKQYN V+EML    SGFGWNEEFK V+ EKE+FD WV+SH NAKGM 
Sbjct: 61  NQNTIECKVRSLKKQYNAVSEMLSQ--SGFGWNEEFKCVQVEKEIFDLWVRSHLNAKGMW 120

Query: 121 NKPFPHYDDLAFVFGKDRATGMGAETLGKMASNAAEQLEEEIRLGSQDFFGTEQRPMENP 180
           NK F HYDDL+ VFGKDRA     E                                E+P
Sbjct: 121 NKSFLHYDDLSTVFGKDRANCHTPEVC----------------------------QAESP 180

Query: 181 CTTDVGEEELPETPTNRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTTHMQHLL 240
              D  +EE  E  T R +    SSR  GSKRKR  FQ EMID++R+T+++Q+THM  L 
Sbjct: 181 LNQDEIDEEPAEQSTGRASVLAESSR--GSKRKRPSFQAEMIDIMRSTVEMQSTHMGRLA 240

Query: 241 SWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVSLIDMLVTDIQKTDYFLQVPPQSRR 299
           SWQKEKYELE  RRKEVV+ +Y I+GL E D+V+ ID+LVTDIQKTD FL VP  +R+
Sbjct: 241 SWQKEKYELEFGRRKEVVNAIYSIDGLDEDDQVTFIDLLVTDIQKTDCFLAVPEHARK 266

BLAST of Tan0013718 vs. NCBI nr
Match: XP_038895773.1 (uncharacterized protein LOC120083935 [Benincasa hispida])

HSP 1 Score: 270.4 bits (690), Expect = 1.9e-68
Identity = 147/293 (50.17%), Postives = 180/293 (61.43%), Query Frame = 0

Query: 6   KHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTI 65
           K SKH W+KVEDA+ VE+L+YLV  GWRSDNGTFR  YLQHL+++  EK+   +L  NTI
Sbjct: 44  KRSKHVWSKVEDAKFVEALLYLVDTGWRSDNGTFRLEYLQHLERIHHEKVLGCALNQNTI 103

Query: 66  DCKVRTLKKQYNVVAEMLGNGCSGFGWNEEFKRVEAEKEVFDAWVKSHTNAKGMTNKPFP 125
           +CKVR+LKKQ N V+EML    SGF WNEEFK V+ E+E+FD WV+SH NAKGM NKPFP
Sbjct: 104 ECKVRSLKKQCNAVSEMLSQ--SGFDWNEEFKCVQVEREIFDPWVRSHPNAKGMWNKPFP 163

Query: 126 HYDDLAFVFGKDRATGMGAETLGKMASNAAEQLEEEIRLGSQDFFGTEQRPMENPCTTDV 185
           HYDDL+ VFGK +A G  +E    M +NA  + E+EIRLGSQD            C T  
Sbjct: 164 HYDDLSTVFGKYKAVGQSSEDPYVMTTNAFREFEDEIRLGSQD------------CHTP- 223

Query: 186 GEEELPETPTNRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTTHMQHLLSWQKE 245
                                                         ++THM  L SWQKE
Sbjct: 224 ----------------------------------------------ESTHMGRLASWQKE 275

Query: 246 KYELEAARRKEVVDLLYQIEGLTEHDRVSLIDMLVTDIQKTDYFLQVPPQSRR 299
           KYELE  RRKEVV+ +Y I+GL E D+V+LID+LVTDIQKT+ FL VP  +R+
Sbjct: 284 KYELEFGRRKEVVNAIYNIDGLDEDDQVTLIDLLVTDIQKTNCFLAVPEHARK 275

BLAST of Tan0013718 vs. NCBI nr
Match: XP_038902479.1 (uncharacterized protein At2g29880-like [Benincasa hispida])

HSP 1 Score: 254.6 bits (649), Expect = 1.1e-63
Identity = 130/217 (59.91%), Postives = 154/217 (70.97%), Query Frame = 0

Query: 1   MTGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSL 60
           MT   K SKH W+KVEDA+LVE+L+YLV  GWRSDNGTFRPGYLQHL+++L EK+P  +L
Sbjct: 1   MTSNGKRSKHIWSKVEDAKLVEALLYLVETGWRSDNGTFRPGYLQHLERILHEKVPGCTL 60

Query: 61  ELNTIDCKVRTLKKQYNVVAEMLGNGCSGFGWNEEFKRVEAEKEVFDAWVKSHTNAKGMT 120
             NTI+CKVR+LKKQYN+V+EML    SGF WNEEFK V+ E+E+FD WV SH NAK M 
Sbjct: 61  NQNTIECKVRSLKKQYNIVSEMLSQ--SGFDWNEEFKCVQVEREIFDLWVLSHPNAKRMW 120

Query: 121 NKPFPHYDDLAFVFGKDRATGMGAETLGKMASNAAEQLEEEIRLGSQDFFGTEQRPMENP 180
           NKPFPHYDD + VFGKDR  G  +E    MA+NA  + E+EIRLGSQD    E R  E+P
Sbjct: 121 NKPFPHYDDFSTVFGKDRVVGKSSEDPYVMATNAFREFEDEIRLGSQDCQTPEVRQTESP 180

Query: 181 CTTDVGEEELPETPTNRRNTSGTSSRCTGSKRKRSCF 218
              D  +EE  E  T R +    SSR  GSKRKR  F
Sbjct: 181 LNQDEIDEEPAEQSTGRASVPAKSSR--GSKRKRPSF 213

BLAST of Tan0013718 vs. NCBI nr
Match: XP_008441954.1 (PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo] >KAA0047736.1 retrotransposon protein [Cucumis melo var. makuwa] >TYK08388.1 retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 236.5 bits (602), Expect = 3.0e-58
Identity = 127/299 (42.47%), Postives = 181/299 (60.54%), Query Frame = 0

Query: 1   MTGTSKHSKHTWTKVEDARLVESLVYLVHN-GWRSDNGTFRPGYLQHLQKMLAEKLPNSS 60
           M   S+  KHTWTK E+ + VE LV LV + GWRSDNGTF+PGYL  LQ+M+AEKLP ++
Sbjct: 1   MASLSRAPKHTWTKEEEEKFVECLVELVSSGGWRSDNGTFQPGYLAQLQRMMAEKLPGTN 60

Query: 61  L-ELNTIDCKVRTLKKQYNVVAEMLGNGCSGFGWNEEFKRVEAEKEVFDAWVKSHTNAKG 120
           + E +TIDC V++LKK Y+ +AEM G  CSGFGWNEEF+ + AE+++FD+W+KSH  AKG
Sbjct: 61  IQESSTIDCHVKSLKKTYHAIAEMRGPSCSGFGWNEEFQCIIAERDLFDSWIKSHPAAKG 120

Query: 121 MTNKPFPHYDDLAFVFGKDRATGMGAETLGKMASNAAEQLEEEIRLGSQDFFGTEQRPME 180
           + +K FP+YDDL++VFGKDRATG  +ET   + SN +    + I LG       +    +
Sbjct: 121 LLHKSFPYYDDLSYVFGKDRATGARSETFPNVGSNVSNMFNDTIPLG-------DSHDED 180

Query: 181 NPCTTDVGEEELPETPTNRRNTSGTSSR-CTG-SKRKRSCFQTEMIDVVRTTMDIQTTHM 240
            P     G    P+     R    +  R C+  SKRKR   + E ++V+R+ M+     +
Sbjct: 181 IPTMYSQGVHMSPDEMFGIRAGQASERRNCSSVSKRKRGSERYETVEVIRSVMEFGNEQL 240

Query: 241 QHLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVSLIDMLVTDIQKTDYFLQVPPQ 296
           + +  W KEK  +E   R +VV  L  I  L   DR  L+ +L   ++  + FL +P +
Sbjct: 241 KAIADWPKEKRAMEVEMRAQVVKQLQDIPKLRSQDRAKLMQILFRSLEAIEGFLSIPTE 292

BLAST of Tan0013718 vs. ExPASy TrEMBL
Match: A0A5A7U0H7 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold648G002060 PE=4 SV=1)

HSP 1 Score: 236.5 bits (602), Expect = 1.5e-58
Identity = 127/299 (42.47%), Postives = 181/299 (60.54%), Query Frame = 0

Query: 1   MTGTSKHSKHTWTKVEDARLVESLVYLVHN-GWRSDNGTFRPGYLQHLQKMLAEKLPNSS 60
           M   S+  KHTWTK E+ + VE LV LV + GWRSDNGTF+PGYL  LQ+M+AEKLP ++
Sbjct: 1   MASLSRAPKHTWTKEEEEKFVECLVELVSSGGWRSDNGTFQPGYLAQLQRMMAEKLPGTN 60

Query: 61  L-ELNTIDCKVRTLKKQYNVVAEMLGNGCSGFGWNEEFKRVEAEKEVFDAWVKSHTNAKG 120
           + E +TIDC V++LKK Y+ +AEM G  CSGFGWNEEF+ + AE+++FD+W+KSH  AKG
Sbjct: 61  IQESSTIDCHVKSLKKTYHAIAEMRGPSCSGFGWNEEFQCIIAERDLFDSWIKSHPAAKG 120

Query: 121 MTNKPFPHYDDLAFVFGKDRATGMGAETLGKMASNAAEQLEEEIRLGSQDFFGTEQRPME 180
           + +K FP+YDDL++VFGKDRATG  +ET   + SN +    + I LG       +    +
Sbjct: 121 LLHKSFPYYDDLSYVFGKDRATGARSETFPNVGSNVSNMFNDTIPLG-------DSHDED 180

Query: 181 NPCTTDVGEEELPETPTNRRNTSGTSSR-CTG-SKRKRSCFQTEMIDVVRTTMDIQTTHM 240
            P     G    P+     R    +  R C+  SKRKR   + E ++V+R+ M+     +
Sbjct: 181 IPTMYSQGVHMSPDEMFGIRAGQASERRNCSSVSKRKRGSERYETVEVIRSVMEFGNEQL 240

Query: 241 QHLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVSLIDMLVTDIQKTDYFLQVPPQ 296
           + +  W KEK  +E   R +VV  L  I  L   DR  L+ +L   ++  + FL +P +
Sbjct: 241 KAIADWPKEKRAMEVEMRAQVVKQLQDIPKLRSQDRAKLMQILFRSLEAIEGFLSIPTE 292

BLAST of Tan0013718 vs. ExPASy TrEMBL
Match: A0A1S3B4L3 (uncharacterized protein LOC103485953 OS=Cucumis melo OX=3656 GN=LOC103485953 PE=4 SV=1)

HSP 1 Score: 236.5 bits (602), Expect = 1.5e-58
Identity = 127/299 (42.47%), Postives = 181/299 (60.54%), Query Frame = 0

Query: 1   MTGTSKHSKHTWTKVEDARLVESLVYLVHN-GWRSDNGTFRPGYLQHLQKMLAEKLPNSS 60
           M   S+  KHTWTK E+ + VE LV LV + GWRSDNGTF+PGYL  LQ+M+AEKLP ++
Sbjct: 1   MASLSRAPKHTWTKEEEEKFVECLVELVSSGGWRSDNGTFQPGYLAQLQRMMAEKLPGTN 60

Query: 61  L-ELNTIDCKVRTLKKQYNVVAEMLGNGCSGFGWNEEFKRVEAEKEVFDAWVKSHTNAKG 120
           + E +TIDC V++LKK Y+ +AEM G  CSGFGWNEEF+ + AE+++FD+W+KSH  AKG
Sbjct: 61  IQESSTIDCHVKSLKKTYHAIAEMRGPSCSGFGWNEEFQCIIAERDLFDSWIKSHPAAKG 120

Query: 121 MTNKPFPHYDDLAFVFGKDRATGMGAETLGKMASNAAEQLEEEIRLGSQDFFGTEQRPME 180
           + +K FP+YDDL++VFGKDRATG  +ET   + SN +    + I LG       +    +
Sbjct: 121 LLHKSFPYYDDLSYVFGKDRATGARSETFPNVGSNVSNMFNDTIPLG-------DSHDED 180

Query: 181 NPCTTDVGEEELPETPTNRRNTSGTSSR-CTG-SKRKRSCFQTEMIDVVRTTMDIQTTHM 240
            P     G    P+     R    +  R C+  SKRKR   + E ++V+R+ M+     +
Sbjct: 181 IPTMYSQGVHMSPDEMFGIRAGQASERRNCSSVSKRKRGSERYETVEVIRSVMEFGNEQL 240

Query: 241 QHLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVSLIDMLVTDIQKTDYFLQVPPQ 296
           + +  W KEK  +E   R +VV  L  I  L   DR  L+ +L   ++  + FL +P +
Sbjct: 241 KAIADWPKEKRAMEVEMRAQVVKQLQDIPKLRSQDRAKLMQILFRSLEAIEGFLSIPTE 292

BLAST of Tan0013718 vs. ExPASy TrEMBL
Match: A0A5A7UME4 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold615G00290 PE=4 SV=1)

HSP 1 Score: 216.5 bits (550), Expect = 1.6e-52
Identity = 129/298 (43.29%), Postives = 179/298 (60.07%), Query Frame = 0

Query: 1   MTGTSKHSKHTWTKVEDARLVESLVYLVH-NGWRSDNGTFRPGYLQHLQKMLAEKLPNSS 60
           MT +S+  KHTWTK E+A LVE LV LV+  GWRSDNGTFRPGYL  L +M+A K+P S+
Sbjct: 1   MTSSSRLPKHTWTKEEEAGLVECLVELVNAGGWRSDNGTFRPGYLNQLARMMAFKIPGSN 60

Query: 61  LELNTIDCKVRTLKKQYNVVAEMLGNGCSGFGWNEEFKRVEAEKEVFDAWVKSHTNAKGM 120
           +  +TID +++ +K+ ++ +AEM G  CSGFGWN+E K + AEKEVFD W  SH  AKG+
Sbjct: 61  IHASTIDSRIKLMKRMFHALAEMRGPNCSGFGWNDEKKCIVAEKEVFDDW--SHPAAKGL 120

Query: 121 TNKPFPHYDDLAFVFGKDRATGMGAETLGKMASNAAEQLEEEIRLGSQDFFGTEQRPMEN 180
            NK F HYD+L++VFGKDRATG  AE+   + SN     + E      D   T+  PM +
Sbjct: 121 LNKSFVHYDELSYVFGKDRATGGRAESFADIGSNDPPGYDAEAADAMPD---TDFPPMYS 180

Query: 181 PCTTDVGEEELPETPT----NRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTTH 240
           P   ++  ++L ET T     RRN S      +GSKRKR    T+  D+VRT ++     
Sbjct: 181 P-GLNMSPDDLMETRTARVSERRNVS------SGSKRKRPGHATDSGDIVRTAIEYGNEQ 240

Query: 241 MQHLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVSLIDMLVTDIQKTDYFLQVP 294
           +  +  W   + +     R+E+V  L  I  LT  DR  L+ +L+ ++     FL+VP
Sbjct: 241 LHRIAEWPILQRQDATQTRQEIVRHLEAIPELTLMDRCRLMRILMRNVDDMKAFLEVP 286

BLAST of Tan0013718 vs. ExPASy TrEMBL
Match: A0A5D3CBF7 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1112G00350 PE=4 SV=1)

HSP 1 Score: 214.5 bits (545), Expect = 6.0e-52
Identity = 129/299 (43.14%), Postives = 180/299 (60.20%), Query Frame = 0

Query: 1   MTGTSKHSKHTWTKVEDARLVESLVYLVH-NGWRSDNGTFRPGYLQHLQKMLAEKLPNSS 60
           MT +S+  KHTWTK E+A LVE LV LV+  GWRSDNGTFRPGYL  L +M+A K+P S+
Sbjct: 1   MTSSSRLPKHTWTKEEEAGLVECLVELVNAGGWRSDNGTFRPGYLNQLARMMAFKIPGSN 60

Query: 61  LELNTIDCKVRTLKKQYNVVAEMLGNGCSGFGWNEEFKRVEAEKEVFDAWVKSHTNAKGM 120
           +  +TID +++ +K+ ++ +AEM G  CSGFGWN+E K + AEKEVFD W  SH  AKG+
Sbjct: 61  IHASTIDSRIKLMKRMFHALAEMRGPNCSGFGWNDEKKCIVAEKEVFDDW--SHPAAKGL 120

Query: 121 TNKPFPHYDDLAFVFGKDRATGMGAETLGKMASNAAEQLEEEIRLGSQDFF-GTEQRPME 180
            NK F HYD+L++VFGKDRATG  AE+   + SN     +     G+ D    T+  PM 
Sbjct: 121 LNKSFVHYDELSYVFGKDRATGGRAESFADIGSNDPPGYD----AGAADAMPDTDFPPMY 180

Query: 181 NPCTTDVGEEELPETPT----NRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTT 240
           +P   ++  ++L ET T     RRN S      +GSKRKR    T+  D+VRT ++    
Sbjct: 181 SP-GLNMSPDDLMETRTARVSERRNVS------SGSKRKRPGHATDSGDIVRTAIEYGNE 240

Query: 241 HMQHLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVSLIDMLVTDIQKTDYFLQVP 294
            +  +  W   + +     R+E+V  L  I  LT  DR  L+ +L+ ++     FL+VP
Sbjct: 241 QLHRIAEWPILQRQDATQTRQEIVRHLEAIPELTLMDRCRLMRILMRNVDDMKAFLEVP 286

BLAST of Tan0013718 vs. ExPASy TrEMBL
Match: E5GCB5 (Retrotransposon protein OS=Cucumis melo subsp. melo OX=412675 PE=3 SV=1)

HSP 1 Score: 214.5 bits (545), Expect = 6.0e-52
Identity = 129/299 (43.14%), Postives = 180/299 (60.20%), Query Frame = 0

Query: 1   MTGTSKHSKHTWTKVEDARLVESLVYLVH-NGWRSDNGTFRPGYLQHLQKMLAEKLPNSS 60
           MT +S+  KHTWTK E+A LVE LV LV+  GWRSDNGTFRPGYL  L +M+A K+P S+
Sbjct: 356 MTSSSRLPKHTWTKEEEAGLVECLVELVNAGGWRSDNGTFRPGYLNQLARMMAFKIPGSN 415

Query: 61  LELNTIDCKVRTLKKQYNVVAEMLGNGCSGFGWNEEFKRVEAEKEVFDAWVKSHTNAKGM 120
           +  +TID +++ +K+ ++ +AEM G  CSGFGWN+E K + AEKEVFD W  SH  AKG+
Sbjct: 416 IHASTIDSRIKLMKRMFHALAEMRGPNCSGFGWNDEKKCIVAEKEVFDDW--SHPAAKGL 475

Query: 121 TNKPFPHYDDLAFVFGKDRATGMGAETLGKMASNAAEQLEEEIRLGSQDFF-GTEQRPME 180
            NK F HYD+L++VFGKDRATG  AE+   + SN     +     G+ D    T+  PM 
Sbjct: 476 LNKSFVHYDELSYVFGKDRATGGRAESFADIGSNDPPGYD----AGAADAMPDTDFPPMY 535

Query: 181 NPCTTDVGEEELPETPT----NRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMDIQTT 240
           +P   ++  ++L ET T     RRN S      +GSKRKR    T+  D+VRT ++    
Sbjct: 536 SP-GLNMSPDDLMETRTARVSERRNVS------SGSKRKRPGHATDSGDIVRTAIEYGNE 595

Query: 241 HMQHLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVSLIDMLVTDIQKTDYFLQVP 294
            +  +  W   + +     R+E+V  L  I  LT  DR  L+ +L+ ++     FL+VP
Sbjct: 596 QLHRIAEWPILQRQDATQTRQEIVRHLEAIPELTLMDRCRLMRILMRNVDDMKAFLEVP 641

BLAST of Tan0013718 vs. TAIR 10
Match: AT1G30140.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G27260.1); Has 313 Blast hits to 256 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 8; Plants - 295; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 60.1 bits (144), Expect = 3.6e-09
Identity = 61/223 (27.35%), Postives = 86/223 (38.57%), Query Frame = 0

Query: 3   GTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKML--AEKLPNSSL 62
           G  K   + WT  E   L+E    L+   WR  +G    G L    K+L    K    + 
Sbjct: 8   GKEKGPYNQWTPDETDVLIE----LIRQNWRDSSGII--GKLTVESKLLPALNKRLGCNK 67

Query: 63  ELNTIDCKVRTLKKQYNVVAEMLGNGCSGFGWNEEFKRVEAEKEVFDAWVKSHTNAKGMT 122
                  +++ LK  Y    + L    SGFGW+ E K+  A  EV+  ++K+H N K M 
Sbjct: 68  NHKNYMSRLKFLKNLYQSYLD-LKRFSSGFGWDPETKKFTAPDEVWRDYLKAHPNHKHMQ 127

Query: 123 NKPFPHYDDLAFVFGKDRATG----------------MGAETLGKMASNAAEQLEEEIRL 182
            +   H++DL  +FG   ATG                +G  + GK   N  E +EE    
Sbjct: 128 TESIDHFEDLQIIFGDVVATGSFAVGMSDSTCPRIYTVGERSQGKETVNQDENIEEVYEF 187

Query: 183 GSQDFFGTEQRPMENPCTTDVGEEELPETPTNRRNTSGTSSRC 208
             Q     E     +P T D       E    R+ T G   RC
Sbjct: 188 SFQHPSSAEY--STSPFTFDPTTRGRSEKLLPRKRTKG--GRC 219

BLAST of Tan0013718 vs. TAIR 10
Match: AT4G02210.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes - 26 (source: NCBI BLink). )

HSP 1 Score: 53.5 bits (127), Expect = 3.4e-07
Identity = 50/231 (21.65%), Postives = 98/231 (42.42%), Query Frame = 0

Query: 11  TWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVR 70
           TW    D   ++ ++     G     G FR      +  +   K   S+ +++ +  + +
Sbjct: 185 TWHPPMDRYFIDLMLDQARRG-NQIEGVFRKQAWTEMVNLFNAKF-ESNFDVDVLKNRYK 244

Query: 71  TLKKQYNVVAEMLGNGCSGFGWNEEFKRVEAEKEVFDAWVKSHTNAKGMTNKPFPHYDDL 130
           +L++Q+N +  +L +   GF W+ E + V A+  V+  ++K+H +A+    +P P+Y DL
Sbjct: 245 SLRRQFNAIKSILRS--DGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDL 304

Query: 131 AFVFGKDRATGMGAETLGKMASNAAEQLEEEIRLGSQDFFG--TEQRPMENPCTTDV--- 190
             + G                      +EE     + D+F   TE +  ++  TTD+   
Sbjct: 305 CVLCGD-------------------SGIEENECFVAMDWFDPETEFQEFKSSGTTDLSIS 364

Query: 191 GEEE----LPETPTNRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMDIQ 233
            EEE    L   P N+R+    +     + +K    +T+ + +  T   IQ
Sbjct: 365 AEEEDSNSLLFDPKNKRDQLANTDTSPINPKKPRVDETQTMSIEDTVEAIQ 392

BLAST of Tan0013718 vs. TAIR 10
Match: AT4G02210.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24960.2). )

HSP 1 Score: 53.5 bits (127), Expect = 3.4e-07
Identity = 50/231 (21.65%), Postives = 98/231 (42.42%), Query Frame = 0

Query: 11  TWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVR 70
           TW    D   ++ ++     G     G FR      +  +   K   S+ +++ +  + +
Sbjct: 185 TWHPPMDRYFIDLMLDQARRG-NQIEGVFRKQAWTEMVNLFNAKF-ESNFDVDVLKNRYK 244

Query: 71  TLKKQYNVVAEMLGNGCSGFGWNEEFKRVEAEKEVFDAWVKSHTNAKGMTNKPFPHYDDL 130
           +L++Q+N +  +L +   GF W+ E + V A+  V+  ++K+H +A+    +P P+Y DL
Sbjct: 245 SLRRQFNAIKSILRS--DGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDL 304

Query: 131 AFVFGKDRATGMGAETLGKMASNAAEQLEEEIRLGSQDFFG--TEQRPMENPCTTDV--- 190
             + G                      +EE     + D+F   TE +  ++  TTD+   
Sbjct: 305 CVLCGD-------------------SGIEENECFVAMDWFDPETEFQEFKSSGTTDLSIS 364

Query: 191 GEEE----LPETPTNRRNTSGTSSRCTGSKRKRSCFQTEMIDVVRTTMDIQ 233
            EEE    L   P N+R+    +     + +K    +T+ + +  T   IQ
Sbjct: 365 AEEEDSNSLLFDPKNKRDQLANTDTSPINPKKPRVDETQTMSIEDTVEAIQ 392

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_038887234.17.9e-8356.71uncharacterized protein LOC120077425 [Benincasa hispida][more]
XP_038896380.17.4e-8155.37uncharacterized protein LOC120084641 [Benincasa hispida][more]
XP_038895773.11.9e-6850.17uncharacterized protein LOC120083935 [Benincasa hispida][more]
XP_038902479.11.1e-6359.91uncharacterized protein At2g29880-like [Benincasa hispida][more]
XP_008441954.13.0e-5842.47PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo] >KAA0047736.1 ret... [more]
Match NameE-valueIdentityDescription
A0A5A7U0H71.5e-5842.47Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A1S3B4L31.5e-5842.47uncharacterized protein LOC103485953 OS=Cucumis melo OX=3656 GN=LOC103485953 PE=... [more]
A0A5A7UME41.6e-5243.29Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A5D3CBF76.0e-5243.14Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
E5GCB56.0e-5243.14Retrotransposon protein OS=Cucumis melo subsp. melo OX=412675 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G30140.13.6e-0927.35unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G02210.13.4e-0721.65unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G02210.23.4e-0721.65unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR024752Myb/SANT-like domainPFAMPF12776Myb_DNA-bind_3coord: 11..107
e-value: 8.0E-11
score: 42.8
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 195..211
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 171..211
NoneNo IPR availablePANTHERPTHR46250MYB/SANT-LIKE DNA-BINDING DOMAIN PROTEIN-RELATEDcoord: 5..300

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0013718.1Tan0013718.1mRNA