Tan0012844 (gene) Snake gourd v1

Overview
Name: Tan0012844
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotransposon protein
Location: LG01: 14001412 .. 14006369 (+)
RNA-Seq Expression: Tan0012844
Synteny: Tan0012844
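
The Location field can be read as a linkage group, a start, an end, and a strand. The sketch below parses that string and derives the genomic span. It assumes the coordinates are 1-based and inclusive, which the record does not state; `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'LG01: 14001412 .. 14006369 (+)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("LG01: 14001412 .. 14006369 (+)")

# Assumption: 1-based inclusive coordinates, so both endpoints count.
span = end - start + 1
print(seqid, start, end, strand, span)  # LG01 14001412 14006369 + 4958
```

Under that assumption the gene spans 4958 bp on the forward strand of LG01.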
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AAATAACCCTTTGTCACAAGCATTAATTTTCGCTATCTTTCTCTCATTACTCTATGGCGAAACGAAACGTCTCACCTTCCTCTCGAAGAACTTCTCTCCTTCCTCTCATAGAGCTTCTCTCATTCGTCTCTCTTTCATCTCCATCTTCCCTACTGTGAATCGTCTTTCAATGGAGTTGAGGTTAGGGTTTGCTATATTCCTTCTACTTTTCTCTGACAGGTACGCATCTTCTTCTTACTTGTTCGATTTTTCGTCTTCGAATCCATACCCTTTACTCCGTCGATCGACTTTTCTTCATGGATCTAGTGTCCAACTATCCATGCAACGAAATCTATCTTATTTTACCTCTGAATCGTCTCGTTTCGTGGCGATTTTTCGAAAAATTCTTCCAAATGGTTTAGGATTTGACTAGATCCTTAGCCCTAATCTTTCACTCTTTTTTCCAGAACCCTAATCTCAACATCCTAAGGGGAGGACTCACCTCCCGCCGCTTTAATGCCCCATGTCGGTCCCTCTTTTGTTTCCATTTAATCCTTTCTTCTGTACTTATCTCTGTCATGATATGGCTAGTGCGATTGATTCGACAAATAGGAAAACAAAATTGTATGGGACTGGTGAGGGGTTTTAGAGCTTGGTGGGGGAAGGGTCACAGTTGTCATACACTGCAGGTTTGTCTTCTTTCCCTATCGGCCGACCAATGATGAACGATTGTTCTGTAAAGCCCAACGATAAACTCTTGCTTTCTTTCGTATTTTTTATTTGTTCTGTATAATGTATGTTTCTTATGATTTATGAGAAGTTTATTATATTTGTAAATTCTCTCATTCTGGACTCTGTTTTTTGTTTTTGCCATGGTTTGATATGTGTTTCTACTATGTTTGTCACGAATTCAATCTTCTGTTTGATATACTTAATCAACAAGTGTTGGGCCTCTGCTTTTATCTTACGATTTTTCCTCTATTTGTTAAATTTATGTCCCATTGTTCTGCTTCTAATTTACGATTTATCCTCTGTTTGTACAACAAGTGGTGGGTCTCTGCTTATTGGATATGACTTGATTTTTTTTTTTTTTAGGATTTGTTTCTTCTGTCACGTTGGATTTAGAAAAGTTTTGAGTTCTTTGTCTGTGCATTAACTTCTTGACAAAACACATGTATTGTTAATATTTTTTTTGTAATAAATAAATATTTATATACTTATATTCCATGAATATTATCTTCTCAACTTCACCTACCCGAGAGTTGGTTTTCTCCTTCTGTTCAACTTATTAATTGCATTACTTAATCTAATTGTATACCTTTTTAGCTGAACCGTAAGCCTACTGCCTTGTTGAGTAATGAGGTTTAAATGCACTTTAATTTCCACCACTTCCCCATTCCAACTACAATTCCTATTCTCCTATTATATAAACATATGTCACATGCTTTTGTTCTTTGAGAATAACTCCACTGCAAACATCCCCTATTGTCACCTGTGAGTACTTTTTGTTTCCCTTTTTCATGTTCTCATACTATTTTCTCTTCTGTGTTTATTCTTCAGTTCTATTATAGGTTGGGATATCTTATATTTTGTCTTGTATTCAATAAACTCATATGTTTTTTCCACCACCATTTCATTGTTATCGTTACTGTCGTGAGGAAAGCTTACATCAATCCCTGAGGTCGGCTTTAATATGCTTTATCTTTCACACTTGTGGTTGTTCTTGTTTTAGGTTTCACCCATTACCCGTGCAATATTCAGTTTGACAGGTAATGTACATTCCAAACCATTGTATTCGATTTTCTCATGCATGTACTAATGTCATTTTTGTTTTGTTATTATTCAGCTCCAAGTCCATAACATCCCTGAAGGTCATAGGTTAAAGGACTATCCTTAACCTATTTTTTTTAGCATTTCTAAAGGTTAGTTGTCGTCTATGTTTAAAGTATGATTATATTATATATGGTTATACTATCACTATTTAACTTCAATACTTGTTCTTCCTTGTAGGCATGAACCAG
TCATCTTCACCAATCTTAGTTTGAGGTTCGTTGGTTTAGGATTGTAGTTATGTCATGCTTGTGGTTTTTTCTGTTGTATGCTAATTAAAATTAGTATGGAACTGTGTTCATTTTGGTATTTCTGTTGAACATGTTCATGTTGATAACATTGAATATGTGGTTTAACATTGATATTTTGTATGAGTGATTAGAAATGTTTATTATTTTTTTACTTTTCTGAACTAATATCAAATTTCATATTATGATACCAATAGTCAGTAGAACAAATCTACTTTTAGAACAAAGTTTTTAATTACAAAGAAAATTTTAGAACAAATATGGCATCTAGAAAATGTATGTATATTATGGTTTTGGAAAATAAAAGGATTTAAGAAAAAAAAAACAAAGAACAAAAGTTGCTAGAAACATACTCGAACACATTGAACAAAGTTTGGGGAATCAACTAACACATTTGAATACATTTTCAATTTGGGGATATCAACTACTTTTTTTTTTTTTTTAAAAAAAAGTGATCTTTTCTTCAATCTAGGAACACACTTGAACAAAGTTTGTACCATTTTTTTTGTATTGAAATTTGGGGATATCAACCTTTATGAAAGAAAATGATATTTTGTTCAACCAAGATAATGACCATACTTGTAAAAACTTGGGGTGTCTAATTGATTTAACATGATATCAACTTTCACCAATTAAAAAAACTAGAACCAATCACCAGCGCATGCACTGATGAGCGATGGAAAAAGTTTGAGGTAATGAAATGAGGCTAAGTATGTCCATTGTTGGTGAAGATAAAAGGACCATACTTATTTGACATGTTTAATGATGGTTGTGATGTCTTTATTCCAAACAGAATTGTCTGGCCGCCCTAGATGGTACATATATCAAAGTGAATGTCAAGGCTGCTGATCGACTGCGATATAGGACGCGAAAAGGTGAAATTGCAACGAACGTACTAGGGGTTTGTTCTCCGATCGAAGAATTTATATTTGTCATGCCAGGATGGGAAGGTCGACAATCGATTCTAGAGTTCTTAGGGATGCCATCTCTCGTTCTAACGGTTTGAAGGTTCCTAGGGGTAAACCATAAACAATGTTACTTTTAATTAAGAAACAAATAATGTTGTCATTTAACTGGTAGCATCTTATACAGGGTACTATTATTTGTGTGATGTTGGTTACCCTAATGCTGAGGGATTCCTTGCTCCTTATAGTGGAGAACGATACCACCTTTCAGAGTGGCGTGGATCATGGAATGCACCGAGAATTCTACGAGAATTTTTCAACATGAAACACTATTCTGCAAGGAACGTGATCGAAAGAGAGTTCATAGTCTTGAAGGGGAGGTGGGCCATTCTACGAGAAAAGTCATTTTACCCGGTCCAAATACAATGTCGAACTATAACAACATATTGTCTCATTCACAACCTAATCTGTAGGGAGATGAGTTCGAGCACTCTATTGGATGAGGTGGAGGAAGTCGACTCTGGTCAAATAGTAGCGAATGGGGAAAATATACAATTCATTGAAAGCTCCAACGAATGGACCTAATTTAGGGATGGCTTGACAAATCAAATGTTTAATGTTTGGGAACAATCATGATTACTTATGGACTGTAATAATTATTATGTCAAACACTAACTTATTTTCATGTACTCACAATTGCTGATATTTTATGACGATCCTCTTGCTCATATGTTATGTTTTCTATTGTGCAACACTTACTTGACTTGCCTTCGTTGTGACATAAACCTAATTTGGCAATACACATGCATGTAGCATTATGGCAGGTACTTCGAAAAACTCCAAACATACATGGACGAAGGTCGAGGATGCGAGGTTGGTGGAGTCACTTGTACCTTTAGAATATAATGGGTGATGATCTGACAACGGGACCTTCAGGTCTGGCTATTTACACCATCTCCAGAAGATGCTAGTTGAGAAATTGCCAAATTCATGCCTATAACAAAACACAATCGATTGTAAGGTCAGAACTCTAAAAAA
ATAATACAATGCTATTGCAGAGATGCTTAGTAATGCATGTAGTGACTTCAGCTGAAATGAAGAGTTCAAATGTGTTGAGGCAGAGAAGGAGGTGTTTGATGCATGGGTTAAGGTGAGATAATAATATTATAGCATATTGTTATTATCGTTCATTGTACGTATGTGTAATATATCTATTCACATGCAGAACCATACAAACGCAAAGGGGATGAAAAATAAGCCATTTCCGCACTATGATGACCTCGCATTTGTCTTCGGAAAGGATAGAGCTACAGGAATAGGTGCAGAGACTCCAATGGAAATGGCATCTAGCTCTGCAGAACAAATGGATGAAGAGATTCGTTTGAGATCACAAGACTTCATGGGGGTAGAACAACGAACAATGGAGAATCCATGAATTTGTGACGTAGGGGAAGATGACTTGCCAGACACTTCTACTAGTAGGCGTAATACATCTGGTATGTCTTCTAGATGTACTGGGAGAAAAAGAAAATGATCATCCTTCCAAATTGAATTAATTGATGTTGTGCGCACAACAATGGATATGCAAACCAATCACATGCAAAAACTTCTATCCTGGCAGAAGGAGAAGTATGAGTTGAAGGCTGCACGAAGGAAGGAAGTAGCCGATCTCTTGTATCAGATAGAAGGATTGACTGAACATGATCGTGTCTCCTTGATAGACTTGCTTGTGACTTATATCCAGAAGACTGACTACTTTCTACAGGTTCCACCTTAATCAAGGAGGACATATTGCATGCGCCTACTGGGAATGACTGGATGATTAGACATTGATCATTTTGTTGTTGTATTTTGTGGACATTACTTTGTATCTACTACTACTTTGTATTTTTTTATGGGTTTTTAACTAATTTTTTTGTTAACAATGTACGCATAAGCCATTGAATTGATATATTATAAATTTGGTGATATGCAAGTAATACATGTCACTGAACTA

mRNA sequence

AAATAACCCTTTGTCACAAGCATTAATTTTCGCTATCTTTCTCTCATTACTCTATGGCGAAACGAAACGTCTCACCTTCCTCTCGAAGAACTTCTCTCCTTCCTCTCATAGAGCTTCTCTCATTCGTCTCTCTTTCATCTCCATCTTCCCTACTGTGAATCGTCTTTCAATGGAGTTGAGGTTAGGGTTTGCTATATTCCTTCTACTTTTCTCTGACAGGTACGCATCTTCTTCTTACTTGTTCGATTTTTCGTCTTCGAATCCATACCCTTTACTCCGTCGATCGACTTTTCTTCATGGATCTAGTGTCCAACTATCCATGCAACGAAATCTATCTTATTTTACCTCTGAATCGTCTCGTTTCGTGGCGATTTTTCGAAAAATTCTTCCAAATGGTTTAGGATTTGACTAGATCCTTAGCCCTAATCTTTCACTCTTTTTTCCAGAACCCTAATCTCAACATCCTAAGGGGAGGACTCACCTCCCGCCGCTTTAATGCCCCATGTCGGTCCCTCTTTTGTTTCCATTTAATCCTTTCTTCTGTACTTATCTCTGTCATGATATGGCTAGTGCGATTGATTCGACAAATAGGAAAACAAAATTGTATGGGACTGGTGAGGGGTTTTAGAGCTTGGTGGGGGAAGGGTCACAGTTGTCATACACTGCAGGTTTGTCTTCTTTCCCTATCGGCCGACCAATGATGAACGATTGTTCTGTAAAGCCCAACGATAAACTCTTGCTTTCTTTCGTATTTTTTATTTGTTCTGTATAATGTATGTTTCTTATGATTTATGAGAAGTTTATTATATTTGTAAATTCTCTCATTCTGGACTCTGTTTTTTGTTTTTGCCATGGTTTGATATGTGTTTCTACTATGTTTGTCACGAATTCAATCTTCTGTTTGATATACTTAATCAACAAGTGTTGGGCCTCTGCTTTTATCTTACGATTTTTCCTCTATTTGTTAAATTTATGTCCCATTGTTCTGCTTCTAATTTACGATTTATCCTCTGTTTGTACAACAAGTGGTGGGTCTCTGCTTATTGGATATGACTTGATTTTTTTTTTTTTTAGGATTTGTTTCTTCTGTCACGTTGGATTTAGAAAAGTTTTGAGTTCTTTGTCTGTGCATTAACTTCTTGACAAAACACATGTATTGTTAATATTTTTTTTGTAATAAATAAATATTTATATACTTATATTCCATGAATATTATCTTCTCAACTTCACCTACCCGAGAGTTGGTTTTCTCCTTCTGTTCAACTTATTAATTGCATTACTTAATCTAATTGTATACCTTTTTAGCTGAACCGTAAGCCTACTGCCTTGTTGAGTAATGAGGTTTAAATGCACTTTAATTTCCACCACTTCCCCATTCCAACTACAATTCCTATTCTCCTATTATATAAACATATGTCACATGCTTTTGTTCTTTGAGAATAACTCCACTGCAAACATCCCCTATTGTCACCTGTGAGTACTTTTTGTTTCCCTTTTTCATGTTCTCATACTATTTTCTCTTCTGTGTTTATTCTTCAGTTCTATTATAGGTTGGGATATCTTATATTTTGTCTTGTATTCAATAAACTCATATGTTTTTTCCACCACCATTTCATTGTTATCGTTACTGTCGTGAGGAAAGCTTACATCAATCCCTGAGGTCGGCTTTAATATGCTTTATCTTTCACACTTGTGGTTGTTCTTGTTTTAGGTTTCACCCATTACCCGTGCAATATTCAGTTTGACAGCTCCAAGTCCATAACATCCCTGAAGGTCATAGGTTAAAGGACTATCCTTAACCTATTTTTTTTAGCATTTCTAAAGGCATGAACCAGTCATCTTCACCAATCTTAGTTTGAGGTTCGTTGGTTTAGGATTGTAGTTATGTCATGCTTGTGGTTTTTTCTGTTGTATGCTAATTAAAATTAGTATGGAACTGTGTTCATTTTGGTATTTCTGTTGAACATGTTCATGTTGATAACATTGAATATGTGGTTTAA
CATTGATATTTTGTATGAGTGATTAGAAATGTTTATTATTTTTTTACTTTTCTGAACTAATATCAAATTTCATATTATGATACCAATAGTCAGTAGAACAAATCTACTTTTAGAACAAAGTTTTTAATTACAAAGAAAATTTTAGAACAAATATGGCATCTAGAAAATGTATGTATATTATGGTTTTGGAAAATAAAAGGATTTAAGAAAAAAAAAACAAAGAACAAAAGTTGCTAGAAACATACTCGAACACATTGAACAAAGTTTGGGGAATCAACTAACACATTTGAATACATTTTCAATTTGGGGATATCAACTACTTTTTTTTTTTTTTTAAAAAAAAGTGATCTTTTCTTCAATCTAGGAACACACTTGAACAAAGTTTGTACCATTTTTTTTGTATTGAAATTTGGGGATATCAACCTTTATGAAAGAAAATGATATTTTGTTCAACCAAGATAATGACCATACTTGTAAAAACTTGGGGTGTCTAATTGATTTAACATGATATCAACTTTCACCAATTAAAAAAACTAGAACCAATCACCAGCGCATGCACTGATGAGCGATGGAAAAAGTTTGAGGTAATGAAATGAGGCTAAGTATGTCCATTGTTGGTGAAGATAAAAGGACCATACTTATTTGACATGTTTAATGATGGTTGTGATGTCTTTATTCCAAACAGAATTGTCTGGCCGCCCTAGATGGTACATATATCAAAGTGAATGTCAAGGCTGCTGATCGACTGCGATATAGGACGCGAAAAGGTGAAATTGCAACGAACGTACTAGGGGTTTGTTCTCCGATCGAAGAATTTATATTTGTCATGCCAGGATGGGAAGGTCGACAATCGATTCTAGAGTTCTTAGGGATGCCATCTCTCGTTCTAACGGTTTGAAGGTTCCTAGGGGGTACTATTATTTGTGTGATGTTGGTTACCCTAATGCTGAGGGATTCCTTGCTCCTTATAGTGGAGAACGATACCACCTTTCAGAGTGGCGTGGATCATGGAATGCACCGAGAATTCTACGAGAATTTTTCAACATGAAACACTATTCTGCAAGGAACGTGATCGAAAGAGAGTTCATAGTCTTGAAGGGGAGGTGGGCCATTCTACGAGAAAAGTCATTTTACCCGGTCCAAATACAATGTCGAACTATAACAACATATTGTCTCATTCACAACCTAATCTGTAGGGAGATGAGTTCGAGCACTCTATTGGATGAGGTGGAGGAAGTCGACTCTGGTCAAATAGTAGCGAATGGGGAAAATATACAATTCATTGAAAGCTCCAACGAATGGACCTAATTTAGGGATGGCTTGACAAATCAAATGTTTAATGTTTGGGAACAATCATGATTACTTATGGACTGTAATAATTATTATGTCAAACACTAACTTATTTTCATGTACTCACAATTGCTGATATTTTATGACGATCCTCTTGCTCATATGTTATGTTTTCTATTGTGCAACACTTACTTGACTTGCCTTCGTTGTGACATAAACCTAATTTGGCAATACACATGCATGTAGCATTATGGCAGGTACTTCGAAAAACTCCAAACATACATGGACGAAGGTCGAGGATGCGAGGTTGGTGGAGTCACTTGTACCTTTAGAATATAATGGGTGATGATCTGACAACGGGACCTTCAGGTCTGGCTATTTACACCATCTCCAGAAGATGCTAGTTGAGAAATTGCCAAATTCATGCCTATAACAAAACACAATCGATTGTAAGGTCAGAACTCTAAAAAAATAATACAATGCTATTGCAGAGATGCTTAGTAATGCATGTAGTGACTTCAGCTGAAATGAAGAGTTCAAATGTGTTGAGGCAGAGAAGGAGGTGTTTGATGCATGGGTTAAGAACCATACAAACGCAAAGGGGATGAAAAATAAGCCATTTCCGCACTATGATGACCTCGCATTTGTCTTCGGAAAGGATAGAGCTACAGGAATAGGTGCAGAGACTCCAATGGAAATGGCATCTAGCT
CTGCAGAACAAATGGATGAAGAGATTCGTTTGAGATCACAAGACTTCATGGGGGTAGAACAACGAACAATGGAGAATCCATGAATTTGTGACGTAGGGGAAGATGACTTGCCAGACACTTCTACTAGTAGGCGTAATACATCTGGTATGTCTTCTAGATGTACTGGGAGAAAAAGAAAATGATCATCCTTCCAAATTGAATTAATTGATGTTGTGCGCACAACAATGGATATGCAAACCAATCACATGCAAAAACTTCTATCCTGGCAGAAGGAGAAGTATGAGTTGAAGGCTGCACGAAGGAAGGAAGTAGCCGATCTCTTGTATCAGATAGAAGGATTGACTGAACATGATCGTGTCTCCTTGATAGACTTGCTTGTGACTTATATCCAGAAGACTGACTACTTTCTACAGGTTCCACCTTAATCAAGGAGGACATATTGCATGCGCCTACTGGGAATGACTGGATGATTAGACATTGATCATTTTGTTGTTGTATTTTGTGGACATTACTTTGTATCTACTACTACTTTGTATTTTTTTATGGGTTTTTAACTAATTTTTTTGTTAACAATGTACGCATAAGCCATTGAATTGATATATTATAAATTTGGTGATATGCAAGTAATACATGTCACTGAACTA

Coding sequence (CDS)

ATGGGAAGGTCGACAATCGATTCTAGAGTTCTTAGGGATGCCATCTCTCGTTCTAACGGTTTGAAGGTTCCTAGGGGGTACTATTATTTGTGTGATGTTGGTTACCCTAATGCTGAGGGATTCCTTGCTCCTTATAGTGGAGAACGATACCACCTTTCAGAGTGGCGTGGATCATGGAATGCACCGAGAATTCTACGAGAATTTTTCAACATGAAACACTATTCTGCAAGGAACGTGATCGAAAGAGAGTTCATAGTCTTGAAGGGGAGGTGGGCCATTCTACGAGAAAAGTCATTTTACCCGGTCCAAATACAATGTCGAACTATAACAACATATTGTCTCATTCACAACCTAATCTGTAGGGAGATGAGTTCGAGCACTCTATTGGATGAGGTGGAGGAAGTCGACTCTGGTCAAATAGTAGCGAATGGGGAAAATATACAATTCATTGAAAGCTCCAACGAATGGACCTAA

Protein sequence

MGRSTIDSRVLRDAISRSNGLKVPRGYYYLCDVGYPNAEGFLAPYSGERYHLSEWRGSWNAPRILREFFNMKHYSARNVIEREFIVLKGRWAILREKSFYPVQIQCRTITTYCLIHNLICREMSSSTLLDEVEEVDSGQIVANGENIQFIESSNEWT
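
The protein sequence follows from the CDS by reading it codon by codon. As a minimal sketch, assuming the standard nuclear genetic code, the snippet below translates the first 30 and the last 15 nucleotides of the CDS listed above; `translate` and `CODON_TABLE` are illustrative names, not the pipeline that produced this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt of the CDS: the start of the protein.
print(translate("ATGGGAAGGTCGACAATCGATTCTAGAGTT"))  # MGRSTIDSRV
# Last 15 nt of the CDS, ending in the TAA stop codon.
print(translate("AACGAATGGACCTAA"))  # NEWT
```

Both fragments agree with the start (MGRSTIDSRV) and the end (NEWT) of the protein sequence shown above.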
Homology
BLAST of Tan0012844 vs. NCBI nr
Match: XP_038885881.1 (protein ALP1-like [Benincasa hispida])

HSP 1 Score: 229.6 bits (584), Expect = 1.9e-56
Identity = 109/153 (71.24%), Positives = 122/153 (79.74%), Query Frame = 0

Query: 5   TIDSRVLRDAISRSNGLKVPRGYYYLCDVGYPNAEGFLAPYSGERYHLSEWRGSWNAPRI 64
           T DSRV RD ISR NGLKVP+GYYYLCDVGYPNAEGFLAPY GERYHLSEWRG  NAP  
Sbjct: 86  TADSRVSRDVISRPNGLKVPKGYYYLCDVGYPNAEGFLAPYKGERYHLSEWRGGGNAPTA 145

Query: 65  LREFFNMKHYSARNVIEREFIVLKGRWAILREKSFYPVQIQCRTITTYCLIHNLICREMS 124
            REFFNMKH SA NVIER   +LKGRWAILR +S+YPVQIQCRTI   CL+HN I REM+
Sbjct: 146 PREFFNMKHSSAWNVIERTLGLLKGRWAILRGQSYYPVQIQCRTIMACCLLHNFINREMT 205

Query: 125 SSTLLDEVEEVDSGQIVANGENIQFIESSNEWT 158
           +S L+++++EVDS      G  I +IESSNEWT
Sbjct: 206 NSELIEDLDEVDSSFATTRGNEINYIESSNEWT 238
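
The percentage on each HSP summary line is the number of matching columns divided by the alignment length. The sketch below parses such a field and recomputes the value for the hit above; `parse_fractions` is a hypothetical helper written for this illustration, not part of BLAST.

```python
import re

# Matches fields of the form "Label = matches/length (pct%)".
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")

def parse_fractions(line):
    """Return {label: (matches, length, reported_pct, recomputed_pct)}."""
    out = {}
    for label, m, n, pct in FIELD.findall(line):
        m, n = int(m), int(n)
        out[label] = (m, n, float(pct), round(100.0 * m / n, 2))
    return out

hsp = parse_fractions("Identity = 109/153 (71.24%)")
matches, length, reported, recomputed = hsp["Identity"]
print(matches, length, reported, recomputed)  # 109 153 71.24 71.24
```

The same helper can be applied to the other fraction fields on these summary lines to check the reported percentages.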

BLAST of Tan0012844 vs. NCBI nr
Match: KAA0062747.1 (retrotransposon protein [Cucumis melo var. makuwa] >TYK22546.1 retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 229.2 bits (583), Expect = 2.5e-56
Identity = 106/154 (68.83%), Positives = 124/154 (80.52%), Query Frame = 0

Query: 4   STIDSRVLRDAISRSNGLKVPRGYYYLCDVGYPNAEGFLAPYSGERYHLSEWRGSWNAPR 63
           S  DSR+LRDAISR NGLKVP+GYYYLCD GYPNAEGFLAPY GERYHLSEWRG  NAP 
Sbjct: 152 SAADSRILRDAISRHNGLKVPKGYYYLCDAGYPNAEGFLAPYRGERYHLSEWRGESNAPT 211

Query: 64  ILREFFNMKHYSARNVIEREFIVLKGRWAILREKSFYPVQIQCRTITTYCLIHNLICREM 123
             REFFNMKH S+RNVIER F +LKG WAILR KS+YPV +QCRTI   CL+HNLI REM
Sbjct: 212 TTREFFNMKHSSSRNVIERAFGLLKGCWAILRGKSYYPVDVQCRTIMACCLLHNLINREM 271

Query: 124 SSSTLLDEVEEVDSGQIVANGENIQFIESSNEWT 158
           ++S ++D+++E DS      G+ I +IE+SNEW+
Sbjct: 272 TNSEIIDDLDEGDSTYATTGGDEINYIEASNEWS 305

BLAST of Tan0012844 vs. NCBI nr
Match: KAA0044844.1 (retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 224.2 bits (570), Expect = 8.0e-55
Identity = 105/154 (68.18%), Positives = 122/154 (79.22%), Query Frame = 0

Query: 4   STIDSRVLRDAISRSNGLKVPRGYYYLCDVGYPNAEGFLAPYSGERYHLSEWRGSWNAPR 63
           S  DSR+LRDA+SR N LKVP+GYYYL DVGYPNAEGFLAPY G+RYHL EWRG  NAP 
Sbjct: 150 SAADSRILRDALSRPNELKVPKGYYYLVDVGYPNAEGFLAPYKGQRYHLQEWRGPENAPS 209

Query: 64  ILREFFNMKHYSARNVIEREFIVLKGRWAILREKSFYPVQIQCRTITTYCLIHNLICREM 123
             +EFFNMKH SARNVIER F VLKGRWAILREKS+YPV++QCRTI   CL+HNLI REM
Sbjct: 210 TSKEFFNMKHSSARNVIERAFGVLKGRWAILREKSYYPVEVQCRTILACCLLHNLINREM 269

Query: 124 SSSTLLDEVEEVDSGQIVANGENIQFIESSNEWT 158
           ++  + D ++EVDS        NI +IE+SNEW+
Sbjct: 270 TNFDIEDNIDEVDSTHATTAAGNIHYIETSNEWS 303

BLAST of Tan0012844 vs. NCBI nr
Match: ADN34114.1 (retrotransposon protein [Cucumis melo subsp. melo])

HSP 1 Score: 224.2 bits (570), Expect = 8.0e-55
Identity = 104/154 (67.53%), Positives = 123/154 (79.87%), Query Frame = 0

Query: 4   STIDSRVLRDAISRSNGLKVPRGYYYLCDVGYPNAEGFLAPYSGERYHLSEWRGSWNAPR 63
           S  DSR+LRDA+SR N LKVP+GYYYL DVGYPNAEGFLAPY G+RYHL EWRG  NAP 
Sbjct: 192 SAADSRILRDALSRPNRLKVPKGYYYLVDVGYPNAEGFLAPYRGQRYHLQEWRGPENAPS 251

Query: 64  ILREFFNMKHYSARNVIEREFIVLKGRWAILREKSFYPVQIQCRTITTYCLIHNLICREM 123
             +EFFNMKHYSARNVIER F VLKGRWAILR KS+YPV++QCRTI   CL+HNLI REM
Sbjct: 252 TSKEFFNMKHYSARNVIERAFGVLKGRWAILRGKSYYPVEVQCRTILACCLLHNLINREM 311

Query: 124 SSSTLLDEVEEVDSGQIVANGENIQFIESSNEWT 158
           ++  + D ++EVDS       ++I +IE+SNEW+
Sbjct: 312 TNFDIEDNIDEVDSTHATTAADDIHYIETSNEWS 345

BLAST of Tan0012844 vs. NCBI nr
Match: KAA0068124.1 (retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 222.2 bits (565), Expect = 3.1e-54
Identity = 103/154 (66.88%), Positives = 122/154 (79.22%), Query Frame = 0

Query: 4   STIDSRVLRDAISRSNGLKVPRGYYYLCDVGYPNAEGFLAPYSGERYHLSEWRGSWNAPR 63
           S  DSR+LRDA+SR NGLKVP+GYYYL D GYPNAEGFLAPY G+RYHL EWRG  NAP 
Sbjct: 171 SAADSRILRDALSRPNGLKVPKGYYYLVDAGYPNAEGFLAPYRGQRYHLQEWRGPENAPS 230

Query: 64  ILREFFNMKHYSARNVIEREFIVLKGRWAILREKSFYPVQIQCRTITTYCLIHNLICREM 123
             +EFFNMKH SARNVIER F VLKGRWAILR KS+YPV++QCRTI   CL+HNLI REM
Sbjct: 231 TSKEFFNMKHSSARNVIERAFGVLKGRWAILRGKSYYPVEVQCRTILACCLLHNLINREM 290

Query: 124 SSSTLLDEVEEVDSGQIVANGENIQFIESSNEWT 158
           ++  + D ++EVDS       ++I +IE+SNEW+
Sbjct: 291 TNFDIEDNIDEVDSTHATTAADDIHYIETSNEWS 324

BLAST of Tan0012844 vs. ExPASy TrEMBL
Match: A0A5D3DG22 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold523G00290 PE=3 SV=1)

HSP 1 Score: 229.2 bits (583), Expect = 1.2e-56
Identity = 106/154 (68.83%), Positives = 124/154 (80.52%), Query Frame = 0

Query: 4   STIDSRVLRDAISRSNGLKVPRGYYYLCDVGYPNAEGFLAPYSGERYHLSEWRGSWNAPR 63
           S  DSR+LRDAISR NGLKVP+GYYYLCD GYPNAEGFLAPY GERYHLSEWRG  NAP 
Sbjct: 152 SAADSRILRDAISRHNGLKVPKGYYYLCDAGYPNAEGFLAPYRGERYHLSEWRGESNAPT 211

Query: 64  ILREFFNMKHYSARNVIEREFIVLKGRWAILREKSFYPVQIQCRTITTYCLIHNLICREM 123
             REFFNMKH S+RNVIER F +LKG WAILR KS+YPV +QCRTI   CL+HNLI REM
Sbjct: 212 TTREFFNMKHSSSRNVIERAFGLLKGCWAILRGKSYYPVDVQCRTIMACCLLHNLINREM 271

Query: 124 SSSTLLDEVEEVDSGQIVANGENIQFIESSNEWT 158
           ++S ++D+++E DS      G+ I +IE+SNEW+
Sbjct: 272 TNSEIIDDLDEGDSTYATTGGDEINYIEASNEWS 305

BLAST of Tan0012844 vs. ExPASy TrEMBL
Match: A0A5A7TNY6 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G001360 PE=3 SV=1)

HSP 1 Score: 224.2 bits (570), Expect = 3.9e-55
Identity = 105/154 (68.18%), Positives = 122/154 (79.22%), Query Frame = 0

Query: 4   STIDSRVLRDAISRSNGLKVPRGYYYLCDVGYPNAEGFLAPYSGERYHLSEWRGSWNAPR 63
           S  DSR+LRDA+SR N LKVP+GYYYL DVGYPNAEGFLAPY G+RYHL EWRG  NAP 
Sbjct: 150 SAADSRILRDALSRPNELKVPKGYYYLVDVGYPNAEGFLAPYKGQRYHLQEWRGPENAPS 209

Query: 64  ILREFFNMKHYSARNVIEREFIVLKGRWAILREKSFYPVQIQCRTITTYCLIHNLICREM 123
             +EFFNMKH SARNVIER F VLKGRWAILREKS+YPV++QCRTI   CL+HNLI REM
Sbjct: 210 TSKEFFNMKHSSARNVIERAFGVLKGRWAILREKSYYPVEVQCRTILACCLLHNLINREM 269

Query: 124 SSSTLLDEVEEVDSGQIVANGENIQFIESSNEWT 158
           ++  + D ++EVDS        NI +IE+SNEW+
Sbjct: 270 TNFDIEDNIDEVDSTHATTAAGNIHYIETSNEWS 303

BLAST of Tan0012844 vs. ExPASy TrEMBL
Match: E5GCB5 (Retrotransposon protein OS=Cucumis melo subsp. melo OX=412675 PE=3 SV=1)

HSP 1 Score: 224.2 bits (570), Expect = 3.9e-55
Identity = 104/154 (67.53%), Positives = 123/154 (79.87%), Query Frame = 0

Query: 4   STIDSRVLRDAISRSNGLKVPRGYYYLCDVGYPNAEGFLAPYSGERYHLSEWRGSWNAPR 63
           S  DSR+LRDA+SR N LKVP+GYYYL DVGYPNAEGFLAPY G+RYHL EWRG  NAP 
Sbjct: 192 SAADSRILRDALSRPNRLKVPKGYYYLVDVGYPNAEGFLAPYRGQRYHLQEWRGPENAPS 251

Query: 64  ILREFFNMKHYSARNVIEREFIVLKGRWAILREKSFYPVQIQCRTITTYCLIHNLICREM 123
             +EFFNMKHYSARNVIER F VLKGRWAILR KS+YPV++QCRTI   CL+HNLI REM
Sbjct: 252 TSKEFFNMKHYSARNVIERAFGVLKGRWAILRGKSYYPVEVQCRTILACCLLHNLINREM 311

Query: 124 SSSTLLDEVEEVDSGQIVANGENIQFIESSNEWT 158
           ++  + D ++EVDS       ++I +IE+SNEW+
Sbjct: 312 TNFDIEDNIDEVDSTHATTAADDIHYIETSNEWS 345

BLAST of Tan0012844 vs. ExPASy TrEMBL
Match: A0A5A7VL29 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold238G00850 PE=3 SV=1)

HSP 1 Score: 222.2 bits (565), Expect = 1.5e-54
Identity = 103/154 (66.88%), Positives = 122/154 (79.22%), Query Frame = 0

Query: 4   STIDSRVLRDAISRSNGLKVPRGYYYLCDVGYPNAEGFLAPYSGERYHLSEWRGSWNAPR 63
           S  DSR+LRDA+SR NGLKVP+GYYYL D GYPNAEGFLAPY G+RYHL EWRG  NAP 
Sbjct: 171 SAADSRILRDALSRPNGLKVPKGYYYLVDAGYPNAEGFLAPYRGQRYHLQEWRGPENAPS 230

Query: 64  ILREFFNMKHYSARNVIEREFIVLKGRWAILREKSFYPVQIQCRTITTYCLIHNLICREM 123
             +EFFNMKH SARNVIER F VLKGRWAILR KS+YPV++QCRTI   CL+HNLI REM
Sbjct: 231 TSKEFFNMKHSSARNVIERAFGVLKGRWAILRGKSYYPVEVQCRTILACCLLHNLINREM 290

Query: 124 SSSTLLDEVEEVDSGQIVANGENIQFIESSNEWT 158
           ++  + D ++EVDS       ++I +IE+SNEW+
Sbjct: 291 TNFDIEDNIDEVDSTHATTAADDIHYIETSNEWS 324

BLAST of Tan0012844 vs. ExPASy TrEMBL
Match: A0A5A7UPP3 (Putative nuclease HARBI1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold280G00020 PE=3 SV=1)

HSP 1 Score: 220.3 bits (560), Expect = 5.6e-54
Identity = 102/154 (66.23%), Positives = 122/154 (79.22%), Query Frame = 0

Query: 4   STIDSRVLRDAISRSNGLKVPRGYYYLCDVGYPNAEGFLAPYSGERYHLSEWRGSWNAPR 63
           S +DSR+LRDAISR NGLKVP+GYYY  D GYPNA+GFLAPY G+RYHL EWRG+ N P 
Sbjct: 76  SAVDSRILRDAISRPNGLKVPKGYYYQVDAGYPNADGFLAPYRGQRYHLQEWRGAENVPS 135

Query: 64  ILREFFNMKHYSARNVIEREFIVLKGRWAILREKSFYPVQIQCRTITTYCLIHNLICREM 123
             +EFFNMKH SARNVIER F VLKGRWAILR KS+YPV++QCRTI   CL+HNLI REM
Sbjct: 136 TSKEFFNMKHSSARNVIERAFGVLKGRWAILRGKSYYPVEVQCRTILACCLLHNLINREM 195

Query: 124 SSSTLLDEVEEVDSGQIVANGENIQFIESSNEWT 158
           ++  + D + EVDS     + ++I +IE+SNEWT
Sbjct: 196 TNFDIEDNIVEVDSTHTTTSVDDIHYIETSNEWT 229

BLAST of Tan0012844 vs. TAIR 10
Match: AT5G35695.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G41980.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 87.8 bits (216), Expect = 8.4e-18
Identity = 52/143 (36.36%), Positives = 71/143 (49.65%), Query Frame = 0

Query: 4   STIDSRVLRDAISRSNGLKVPRGYYYLCDVGYPNAEGFLAPYSGERYHLSEWRGSWNAPR 63
           S  DSRVL DA+ +          +YL D G+ N   FLAP+ G RYHL E+ G    P 
Sbjct: 35  SAHDSRVLSDALRK----------FYLVDCGFANRLNFLAPFRGVRYHLQEFAGQRRDPE 94

Query: 64  ILREFFNMKHYSARNVIEREFIVLKGRWAILREKSFYPVQIQCRTITTYCLIHNLICREM 123
              E FN++H S RNVIER F + K R+AI +    +  + Q   + T   +HN + +E 
Sbjct: 95  TPHELFNLRHVSLRNVIERIFGIFKSRFAIFKSAPPFSYKKQAGLVLTCAALHNFLRKEC 154

Query: 124 SSSTLLDEVEEVDSGQIVANGEN 147
            S       E  + G +V N  N
Sbjct: 155 RSDEADFPDEVGNEGDVVNNEGN 167

BLAST of Tan0012844 vs. TAIR 10
Match: AT5G41980.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G43722.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 82.8 bits (203), Expect = 2.7e-16
Identity = 47/131 (35.88%), Positives = 64/131 (48.85%), Query Frame = 0

Query: 4   STIDSRVLRDAISRSNGLKVPRGYYYLCDVGYPNAEGFLAPYSGERYHLSEWRGSWNAPR 63
           S  D +VL  A++R N L+VP+G YY+ D  YPN  GF+APY G          S N+  
Sbjct: 195 SASDQQVLNAALTRRNKLQVPQGKYYIVDNKYPNLPGFIAPYHGV---------STNSRE 254

Query: 64  ILREFFNMKHYSARNVIEREFIVLKGRWAILREKSFYPVQIQCRTITTYCLIHNLICREM 123
             +E FN +H      I R F  LK R+ IL     YP+Q Q + +   C +HN +  E 
Sbjct: 255 EAKEMFNERHKLLHRAIHRTFGALKERFPILLSAPPYPLQTQVKLVIAACALHNYVRLEK 314

Query: 124 SSSTLLDEVEE 135
               +    EE
Sbjct: 315 PDDLVFRMFEE 316

BLAST of Tan0012844 vs. TAIR 10
Match: AT1G43722.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G28730.1); Has 924 Blast hits to 912 proteins in 109 species: Archae - 0; Bacteria - 0; Metazoa - 222; Fungi - 31; Plants - 661; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 58.9 bits (141), Expect = 4.2e-09
Identity = 37/93 (39.78%), Positives = 48/93 (51.61%), Query Frame = 0

Query: 4   STIDSRVLRDAISRSNGLKVPRG-YYYLCDVGYPNAEGFLAPYSGE-----RYHLSEWRG 63
           S  D+ VL+ A    +   +P    YYL D GYPN +G LAPY        RYH+S++  
Sbjct: 228 SCYDTAVLQIAQQSDSEFPLPPSEKYYLVDSGYPNKQGLLAPYRSSRNRVVRYHMSQFYY 287

Query: 64  SWNAPRILREFFNMKHYSARNVIEREFIVLKGR 91
               PR   E FN  H S R+VIER F + K +
Sbjct: 288 G-PRPRNKHELFNQCHTSLRSVIERTFRIWKNK 319

BLAST of Tan0012844 vs. TAIR 10
Match: AT4G10890.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2439 (InterPro:IPR018838); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G43722.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 56.6 bits (135), Expect = 2.1e-08
Identity = 28/67 (41.79%), Positives = 38/67 (56.72%), Query Frame = 0

Query: 28  YYLCDVGYPNAEGFLAPYSGERYHLSEWRGSWNAPRILREFFNMKHYSARNVIEREFIVL 87
           YYL +  YP   G+L P+    YHL ++ G    P  ++E FN KH   R+VI+R F V 
Sbjct: 95  YYLVNSVYPTTTGYLGPHRRILYHLGQF-GRGGPPVTVQELFNRKHLDLRSVIDRTFGVW 154

Query: 88  KGRWAIL 95
           K +W IL
Sbjct: 155 KAKWRIL 160

BLAST of Tan0012844 vs. TAIR 10
Match: AT5G28730.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G43722.1); Has 496 Blast hits to 496 proteins in 68 species: Archae - 0; Bacteria - 0; Metazoa - 3; Fungi - 23; Plants - 470; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 45.4 bits (106), Expect = 4.8e-05
Identity = 33/87 (37.93%), Positives = 39/87 (44.83%), Query Frame = 0

Query: 1   MGRSTIDSRVLRDAISRSNGLKV-PRGYYYLCDVGYPNAEGFLAPYSGERYHLSEWRGSW 60
           M  ST D+RVL  AIS      V P   YYL D GY N  G+LAPY  E     +   + 
Sbjct: 159 MAGSTHDARVLSAAISDDPLFHVPPDSKYYLVDSGYANKRGYLAPYRREHREAQDIISNN 218

Query: 61  NAPRILREFFNMKHYSARNVIEREFIV 87
                L E  N+K Y   NV     +V
Sbjct: 219 FLTVNLFETHNIKDYDFDNVDSENNVV 245

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038885881.1 | 1.9e-56 | 71.24 | protein ALP1-like [Benincasa hispida]
KAA0062747.1 | 2.5e-56 | 68.83 | retrotransposon protein [Cucumis melo var. makuwa] >TYK22546.1 retrotransposon p...
KAA0044844.1 | 8.0e-55 | 68.18 | retrotransposon protein [Cucumis melo var. makuwa]
ADN34114.1 | 8.0e-55 | 67.53 | retrotransposon protein [Cucumis melo subsp. melo]
KAA0068124.1 | 3.1e-54 | 66.88 | retrotransposon protein [Cucumis melo var. makuwa]

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5D3DG22 | 1.2e-56 | 68.83 | Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A5A7TNY6 | 3.9e-55 | 68.18 | Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
E5GCB5 | 3.9e-55 | 67.53 | Retrotransposon protein OS=Cucumis melo subsp. melo OX=412675 PE=3 SV=1
A0A5A7VL29 | 1.5e-54 | 66.88 | Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A5A7UPP3 | 5.6e-54 | 66.23 | Putative nuclease HARBI1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffol...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G35695.1 | 8.4e-18 | 36.36 | CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int...
AT5G41980.1 | 2.7e-16 | 35.88 | CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int...
AT1G43722.1 | 4.2e-09 | 39.78 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G10890.1 | 2.1e-08 | 41.79 | unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2439...
AT5G28730.1 | 4.8e-05 | 37.93 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR027806 | Harbinger transposase-derived nuclease domain | PFAM | PF13359 | DDE_Tnp_4 | coord: 7..117; e-value: 2.1E-7; score: 30.8
None | No IPR available | PANTHER | PTHR22930:SF211 | NUCLEASE HARBI1-RELATED | coord: 4..120
None | No IPR available | PANTHER | PTHR22930 | UNCHARACTERIZED | coord: 4..120

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0012844.1 | Tan0012844.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0046872 | metal ion binding