Sed0005350 (gene) Chayote v1

Overview
Name: Sed0005350
Type: gene
Organism: Sechium edule (Chayote v1)
Description: Retrotransposon protein
Location: LG07: 14940033 .. 14941081 (+)
RNA-Seq Expression: Sed0005350
Synteny: Sed0005350
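The location string can be split into linkage group, start, end and strand programmatically. The following is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the page does not state.

```python
import re

# Hypothetical helper: matches the layout "SEQID: START .. END (STRAND)".
LOCATION = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Return (seqid, start, end, strand) parsed from a location string."""
    m = LOCATION.fullmatch(text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1


seqid, start, end, strand = parse_location("LG07: 14940033 .. 14941081 (+)")
length = span_length(start, end)
print(seqid, strand, length)
```

Under that assumption the feature spans 1049 bp, a value that can be compared with the length of the gene sequence listed under Sequences.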
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTACAAGAGGTGCAAAACATTTGTGGACTGCTGAGGAGGATCGTGTCTTAATAGAATCATTGCTACACTTAAGCCAACTAAAGAAAGGGAGGGCTGACAATGGGACCTTTAGGTATGGGGTTTTTGGAGAAATCCAAAAGATGATAGTTGAAAAAATTCGTAACTGTTATATCGAAGTGAACCCTCATCTGGAGTCAAGGGTTCGTCATCTTCAGAAACAATATAATGCAATCCAAGAGATGTTGGGACAAAGTGCTAGTGGTTTTGGATGGAATGATGAGAAAAAATGTGTGGAAGTTGAGAAGTCAATTTTTGATGGTTGGGTGAAGGTTGGTAAATTTTATTATTTATTTTGTAGGCAGTTTTATACAAAAACATTAATATAAAGTATTTTAATGTTAAATTACAGAGTCATCCAACAGCTCGAGGGTTGAGGAACAAACCTTTCCCTTATTACGATGATCTAGCTGTAATATTTGGTAAAGACCGAGCAGTAGGTGGTGGAAGAAAAAGACTCTATCATCAGGGTCTTCAAGTAGCAGCTGACTTGGATTCATACTACACCCTTGAATTCGACCTTAATGATGACGTTCCTACTTCTGACTACTATCCGCATAGTGAAGACTTCATTCCTACTTACGTGTCCATGGGGGAAGAAAAACAAACAACACCTAGGACTCGACCATCCAGCTCAGGGATTCCTAGGCCTTCTCGGAAAAGGAAGGTCTCTATAGGAGAGAACGTGCAAGATAACATGACTATTGCCCTCGAAGAGACAGCGGAGGGGATCGCAAAAATATTTGCATGGCCAGAAAATAAGGTAAGGTTAGAAGTTGAACGGTCGAAAATGCTTATGTTAGAACTTAGGTCATTGGAAGGAATGTCAAGAGCTGATTGTACAACAGTGTCAAATACCCTACTTGCCAACCCAACCATGTATACAACATACATCGGTTATGATGATGATTGGAAGTACAAATTTTGCATGGAAGTGTTAGGTGGAACTCCTAACATGAACGAGGATCCTTCGTCGTACACTCCTTAG

mRNA sequence

ATGGCTACAAGAGGTGCAAAACATTTGTGGACTGCTGAGGAGGATCGTGTCTTAATAGAATCATTGCTACACTTAAGCCAACTAAAGAAAGGGAGGGCTGACAATGGGACCTTTAGGTATGGGGTTTTTGGAGAAATCCAAAAGATGATAGTTGAAAAAATTCGTAACTGTTATATCGAAGTGAACCCTCATCTGGAGTCAAGGGTTCGTCATCTTCAGAAACAATATAATGCAATCCAAGAGATGTTGGGACAAAGTGCTAGTGGTTTTGGATGGAATGATGAGAAAAAATGTGTGGAAGTTGAGAAGTCAATTTTTGATGGTTGGGTGAAGAGTCATCCAACAGCTCGAGGGTTGAGGAACAAACCTTTCCCTTATTACGATGATCTAGCTGTAATATTTGGTAAAGACCGAGCAGTAGGTGGTGGAAGAAAAAGACTCTATCATCAGGGTCTTCAAGTAGCAGCTGACTTGGATTCATACTACACCCTTGAATTCGACCTTAATGATGACGTTCCTACTTCTGACTACTATCCGCATAGTGAAGACTTCATTCCTACTTACGTGTCCATGGGGGAAGAAAAACAAACAACACCTAGGACTCGACCATCCAGCTCAGGGATTCCTAGGCCTTCTCGGAAAAGGAAGGTCTCTATAGGAGAGAACGTGCAAGATAACATGACTATTGCCCTCGAAGAGACAGCGGAGGGGATCGCAAAAATATTTGCATGGCCAGAAAATAAGGTAAGGTTAGAAGTTGAACGGTCGAAAATGCTTATGTTAGAACTTAGGTCATTGGAAGGAATGTCAAGAGCTGATTGTACAACAGTGTCAAATACCCTACTTGCCAACCCAACCATGTATACAACATACATCGGTTATGATGATGATTGGAAGTACAAATTTTGCATGGAAGTGTTAGGTGGAACTCCTAACATGAACGAGGATCCTTCGTCGTACACTCCTTAG

Coding sequence (CDS)

ATGGCTACAAGAGGTGCAAAACATTTGTGGACTGCTGAGGAGGATCGTGTCTTAATAGAATCATTGCTACACTTAAGCCAACTAAAGAAAGGGAGGGCTGACAATGGGACCTTTAGGTATGGGGTTTTTGGAGAAATCCAAAAGATGATAGTTGAAAAAATTCGTAACTGTTATATCGAAGTGAACCCTCATCTGGAGTCAAGGGTTCGTCATCTTCAGAAACAATATAATGCAATCCAAGAGATGTTGGGACAAAGTGCTAGTGGTTTTGGATGGAATGATGAGAAAAAATGTGTGGAAGTTGAGAAGTCAATTTTTGATGGTTGGGTGAAGAGTCATCCAACAGCTCGAGGGTTGAGGAACAAACCTTTCCCTTATTACGATGATCTAGCTGTAATATTTGGTAAAGACCGAGCAGTAGGTGGTGGAAGAAAAAGACTCTATCATCAGGGTCTTCAAGTAGCAGCTGACTTGGATTCATACTACACCCTTGAATTCGACCTTAATGATGACGTTCCTACTTCTGACTACTATCCGCATAGTGAAGACTTCATTCCTACTTACGTGTCCATGGGGGAAGAAAAACAAACAACACCTAGGACTCGACCATCCAGCTCAGGGATTCCTAGGCCTTCTCGGAAAAGGAAGGTCTCTATAGGAGAGAACGTGCAAGATAACATGACTATTGCCCTCGAAGAGACAGCGGAGGGGATCGCAAAAATATTTGCATGGCCAGAAAATAAGGTAAGGTTAGAAGTTGAACGGTCGAAAATGCTTATGTTAGAACTTAGGTCATTGGAAGGAATGTCAAGAGCTGATTGTACAACAGTGTCAAATACCCTACTTGCCAACCCAACCATGTATACAACATACATCGGTTATGATGATGATTGGAAGTACAAATTTTGCATGGAAGTGTTAGGTGGAACTCCTAACATGAACGAGGATCCTTCGTCGTACACTCCTTAG

Protein sequence

MATRGAKHLWTAEEDRVLIESLLHLSQLKKGRADNGTFRYGVFGEIQKMIVEKIRNCYIEVNPHLESRVRHLQKQYNAIQEMLGQSASGFGWNDEKKCVEVEKSIFDGWVKSHPTARGLRNKPFPYYDDLAVIFGKDRAVGGGRKRLYHQGLQVAADLDSYYTLEFDLNDDVPTSDYYPHSEDFIPTYVSMGEEKQTTPRTRPSSSGIPRPSRKRKVSIGENVQDNMTIALEETAEGIAKIFAWPENKVRLEVERSKMLMLELRSLEGMSRADCTTVSNTLLANPTMYTTYIGYDDDWKYKFCMEVLGGTPNMNEDPSSYTP
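The protein sequence corresponds to a translation of the CDS. As a hedged illustration, the sketch below translates a nucleotide string with the standard genetic code (an assumption, since the page does not name the translation table) and stops at the first stop codon. The function name `translate` is illustrative, and only the first 30 nucleotides of the CDS are used here.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
cds_start = "ATGGCTACAAGAGGTGCAAAACATTTGTGG"
print(translate(cds_start))
```

Passing the full CDS string to `translate` gives a sequence that can be compared with the protein sequence above.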
Homology
BLAST of Sed0005350 vs. NCBI nr
Match: TYK07921.1 (hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa])

HSP 1 Score: 195.3 bits (495), Expect = 8.2e-46
Identity = 111/293 (37.88%), Positives = 154/293 (52.56%), Query Frame = 0

Query: 3   TRGAKHLWTAEEDRVLIESLLHLSQLKKGRADNGTFRYGVFGEIQKMIVEKIRNCYIEVN 62
           ++  KH WT  ED VL+E LL L +    RADNGTF+ G                     
Sbjct: 143 SKATKHRWTTIEDEVLVECLLQLVEEGGWRADNGTFKLGYL------------------- 202

Query: 63  PHLESRVRHLQKQYNAIQEMLGQSASGFGWNDEKKCVEVEKSIFDGWVKSHPTARGLRNK 122
                      KQY AI EM+G + SGFGWN+ +KC+EVEK +FD WVK HP A+GL NK
Sbjct: 203 -----------KQYTAIAEMMGPACSGFGWNEGQKCIEVEKPVFDDWVKGHPNAQGLLNK 262

Query: 123 PFPYYDDLAVIFGKDRAVGGGRKRLYHQGLQVAADLDSYYTLEFDLND-DVPTSDYYPHS 182
           PFPY+ DL V+FG+DRA GG  K       Q A D +    ++ +L D D+P     PH 
Sbjct: 263 PFPYFYDLEVVFGRDRATGGRCKTPVEMSSQTARDTEE-DDMDINLEDFDIPN----PHG 322

Query: 183 EDFIPTYVSMGEEKQTTPRTRPSSSGIPRPSRKRKVSIGENVQDNMTIALEETAEGIAKI 242
            +        GE+  +TP +    +G  RPS+KR+ S   ++ D    ++ ET++ I KI
Sbjct: 323 LE-----PPSGEDMPSTPTSMTHDAGSSRPSKKRR-SYSGDLMDTFRASMRETSKEIGKI 382

Query: 243 FAWPENKVRLEVERSKMLMLELRSLEGMSRADCTTVSNTLLANPTMYTTYIGY 295
             W   K+ +E    K L  EL+++ GM   DC  V+ +LL +PTM   ++ Y
Sbjct: 383 ATWQREKMEIESSLHKRLYAELQTIPGMDVDDCLIVAESLLPDPTMLHAFLDY 394
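The percentages in each HSP header follow from the residue counts, for example 111 identical positions over an alignment length of 293. The sketch below parses the two header lines of an HSP and recomputes the percentages. The function name `parse_hsp` and the regular expressions are assumptions tailored to the layout on this page, not an official BLAST parser.

```python
import re

# Patterns written for the header layout shown on this page (an assumption).
HSP_SCORE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
# "Pos\w*" tolerates spelling variants of the "Positives" label.
HSP_COUNTS = re.compile(r"Identity = (\d+)/(\d+).*?Pos\w* = (\d+)/(\d+)")


def parse_hsp(score_line, counts_line):
    """Extract score, E-value and recomputed percentages from an HSP header."""
    s = HSP_SCORE.search(score_line)
    c = HSP_COUNTS.search(counts_line)
    if s is None or c is None:
        raise ValueError("unrecognized HSP header")
    identical, length, positives, _ = map(int, c.groups())
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 195.3 bits (495), Expect = 8.2e-46",
    "Identity = 111/293 (37.88%), Positives = 154/293 (52.56%), Query Frame = 0",
)
print(hsp)
```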

BLAST of Sed0005350 vs. NCBI nr
Match: XP_024022021.1 (uncharacterized protein LOC112091787 [Morus notabilis])

HSP 1 Score: 191.4 bits (485), Expect = 1.2e-44
Identity = 120/313 (38.34%), Positives = 169/313 (53.99%), Query Frame = 0

Query: 7   KHLWTAEEDRVLIESLLHLSQLKKGRADNGTFRYGVFGEIQKMIVEKIRNCYIEVNPHLE 66
           KH WT  ED  L+E LL ++   K +ADNGTF+ G   +++KM+ EKI  C ++  PH++
Sbjct: 54  KHQWTTLEDSKLVECLLDMANSGKWKADNGTFKPGYLQQLEKMMNEKIPQCGLKAQPHID 113

Query: 67  SRVRHLQKQYNAIQEMLGQSASGFGWNDEKKCVEVEKSIFDGWVKSHPTARGLRNKPFPY 126
           SRV+ L+KQY+AI EMLG + SGFGWND+ KCV VEK +FD WVKSHP+A+GLRNKPFPY
Sbjct: 114 SRVKILKKQYHAISEMLGPAGSGFGWNDKDKCVVVEKDVFDEWVKSHPSAKGLRNKPFPY 173

Query: 127 YDDLAVIFGKDRAVGGGRKRLYHQGLQVAADLDSYYTLEFDLNDDVPTSDYYP--HSEDF 186
           +D+L ++FG DRA G G   L      +  D+D          + V   DY P   S++ 
Sbjct: 174 HDELGLVFGNDRANGQGAMGL----TDMVDDID---------KETVNDLDYDPLLMSDEN 233

Query: 187 IPTYVSMGEEKQTTPRTRPSSSGIPRPSRKRKVSIGENVQDNMTIALEETAEGIAKIFAW 246
           + T    G   QTT  + P + G  R  RKR         D +  AL E     + ++A 
Sbjct: 234 MDTASIGGPSVQTT--STPLALG--RKKRKR-----SQRGDVLVDALTEIVHKFSDMYAM 293

Query: 247 P-ENKVRLE---------VERSKMLMLELRSLEGMSRADCTTVSNTLLANPTMYTTYIGY 306
             EN  RL            R   +  E++ +EG++ A    V   L+ N      +   
Sbjct: 294 AGENIGRLANCFQYEADGAARRMQVFDEVKKVEGLTNAQRVRVGKLLVQNHDYTNYFFTL 344

Query: 307 DDDWKYKFCMEVL 308
           DD++K  F + +L
Sbjct: 354 DDEFKLDFLLSLL 344

BLAST of Sed0005350 vs. NCBI nr
Match: KAA0050106.1 (retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 187.2 bits (474), Expect = 2.2e-43
Identity = 112/310 (36.13%), Positives = 165/310 (53.23%), Query Frame = 0

Query: 3   TRGAKHLWTAEEDRVLIESLLHLSQLKKGRADNGTFRYGVFGEIQKMIVEKIRNCYIEVN 62
           ++  KH WT   D VL+E LL L +    RADNGTF+ G   ++QK++ EKI    I+V 
Sbjct: 6   SKATKHRWTT-IDEVLVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKILGSNIQVT 65

Query: 63  PHLESRVRHLQKQYNAIQEMLGQSASGFGWNDEKKCVEVEKSIFDGWVKSHPTARGLRNK 122
           P+L+SRV+ L+KQY AI EM+G + SGFGWN+E+KC+E EKS+FD WVK   TAR +   
Sbjct: 66  PNLKSRVKILKKQYIAIAEMMGPACSGFGWNEERKCIEAEKSVFDDWVK---TARDIEE- 125

Query: 123 PFPYYDDLAVIFGKDRAVGGGRKRLYHQGLQVAADLDSYYTLEFDLND-DVPTSDYYPHS 182
                DD                                  ++ +L D D+P     PH 
Sbjct: 126 -----DD----------------------------------MDINLEDFDIPN----PHG 185

Query: 183 EDFIPTYVSMGEEKQTTPRTRPSSSGIPRPSRKRKVSIGENVQDNMTIALEETAEGIAKI 242
            +        GE+  +TP +    +G  RPS+KR+ S   ++ D    ++ ET++ I KI
Sbjct: 186 LE-----PPSGEDMPSTPTSMAHDAGSSRPSKKRR-SYSGDLMDTFRASMRETSKEIGKI 245

Query: 243 FAWPENKVRLEVERSKMLMLELRSLEGMSRADCTTVSNTLLANPTMYTTYIGYDDDWKYK 302
            AW   K+ +E    K L ++L+++ GM   DC  V+ +LL +PTM   ++ Y  +WKY+
Sbjct: 246 AAWQREKMEIESSLHKRLYVDLQTIPGMDVDDCLIVAESLLPDPTMLHAFLDYPAEWKYR 261

Query: 303 FCMEVLGGTP 312
            CM +LG  P
Sbjct: 306 KCMRILGRQP 261

BLAST of Sed0005350 vs. NCBI nr
Match: XP_030483301.1 (uncharacterized protein LOC115699898 [Cannabis sativa])

HSP 1 Score: 184.9 bits (468), Expect = 1.1e-42
Identity = 102/304 (33.55%), Positives = 175/304 (57.57%), Query Frame = 0

Query: 5   GAKHLWTAEEDRVLIESLLHLSQLKKGRADNGTFRYGVFGEIQKMIVEKIRNCYIEVNPH 64
           G KH WT+ +D  L+E L+ +    K +ADNGTF+ G   +++KM+ ++I N  I+  PH
Sbjct: 12  GRKHQWTSIQDSKLVECLVDMCNSGKWKADNGTFKPGYLQQLEKMMNDRIPNSGIKAQPH 71

Query: 65  LESRVRHLQKQYNAIQEMLGQSASGFGWNDEKKCVEVEKSIFDGWVKSHPTARGLRNKPF 124
           ++SR++ L++QY AI +MLG SASGFGWN++ KCV  +K +FD WVKSHPTA+GL +KPF
Sbjct: 72  IDSRLKILKRQYTAISDMLGPSASGFGWNEQLKCVVADKIVFDEWVKSHPTAKGLLHKPF 131

Query: 125 PYYDDLAVIFGKDRAVGGGRKRLYHQGLQVAADLDSYYTLEFDLNDDVPTSDYYPHSEDF 184
           PYYD+LA+++GKDRA G G         ++A ++++ +  +FD  D +   +        
Sbjct: 132 PYYDELAIVYGKDRATGDGAMGFSETLDEIAEEINNGWNDDFDPFDPLDEMNANASMNSS 191

Query: 185 IPTYVSMGEEKQTTPRT-RPSSSGIPRPSRKRKVSIGENVQDNMTIALEETAEGIAKIFA 244
           IP+        QTT +  R S++G P         + ++VQ+  T+     ++ I K+  
Sbjct: 192 IPS-------SQTTRKAKRKSNNGDPLVE-----LLSKSVQEFSTMQ-ASASDSIKKLAD 251

Query: 245 WPENKVRLEVERSKMLMLELRSLEGMSRADCTTVSNTLLANPTMYTTYIGYDDDWKYKFC 304
             +++      R K L  E++ ++G++ +    +   L++N      +   ++++K  F 
Sbjct: 252 CFQHEADGAARRMK-LYEEIKKVDGLTNSQRLKIGKLLVSNQPHIDYFFTLEEEFKLDFL 301

Query: 305 MEVL 308
           + +L
Sbjct: 312 LGML 301

BLAST of Sed0005350 vs. NCBI nr
Match: CAD1831704.1 (unnamed protein product [Ananas comosus var. bracteatus])

HSP 1 Score: 184.9 bits (468), Expect = 1.1e-42
Identity = 117/316 (37.03%), Positives = 166/316 (52.53%), Query Frame = 0

Query: 4   RGAKHLWTAEEDRVLIESLLHLSQLKKGRADNGTFRYGVFGEIQKMIVEKIRNCYIEVNP 63
           +G KH WT EED  LIE LL L+     +ADNGTFR G   +++K + E++  C ++  P
Sbjct: 15  KGLKHQWTKEEDEKLIECLLELTTSGNWKADNGTFRNGYLQQLEKWMHERLPGCQLKGVP 74

Query: 64  HLESRVRHLQKQYNAIQEMLGQSASGFGWNDEKKCVEVEKSIFDGWVKSHPTARGLRNKP 123
           H+ESR +  ++QYNAI EMLG +ASGFGWND +KC+  EK++FD WVKSHPTA GLR K 
Sbjct: 75  HIESRFKLWKRQYNAISEMLGPAASGFGWNDAEKCIICEKTVFDAWVKSHPTAAGLRGKS 134

Query: 124 FPYYDDLAVIFGKDRAVGGGRKRLYHQGLQVAADLDSYYTLEFDLNDDVPTSDYYPHSED 183
           FPY + L+V+FGKD A G         G + AAD       E +L     T D  P  E 
Sbjct: 135 FPYLEQLSVVFGKDCATGA--------GAESAADAARNVEDE-ELMTHASTQD--PDVEI 194

Query: 184 FIPTYVSMGEEKQTTPRTRPSSSGIPRPSRKRKVSIGE---NVQDNMTIALEETAEG--- 243
           F          +Q++ +   S+ G    S+KRK S+GE   N Q   T+   ET+     
Sbjct: 195 FF--------MEQSSTQVNTSNGGTSAKSKKRKSSLGEDSINEQFCSTLRSLETSFNDAN 254

Query: 244 -----IAKIFAWPENKVRLEVERSKMLMLELRSLEGMSRADCTTVSNTLLANPTMYTTYI 303
                +A  F +  N    E ++ K L  E+  +EG+        +  L  + T   T+ 
Sbjct: 255 QHLGRLANCFQFMANN---ETKKEK-LFEEILKIEGLEDKQIMDATEILTGDSTKLHTFY 307

Query: 304 GYDDDWKYKFCMEVLG 309
              D ++ ++   +LG
Sbjct: 315 CMPDHFRKRYIFRILG 307

BLAST of Sed0005350 vs. ExPASy TrEMBL
Match: A0A5D3C7T4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold265G00330 PE=4 SV=1)

HSP 1 Score: 195.3 bits (495), Expect = 4.0e-46
Identity = 111/293 (37.88%), Positives = 154/293 (52.56%), Query Frame = 0

Query: 3   TRGAKHLWTAEEDRVLIESLLHLSQLKKGRADNGTFRYGVFGEIQKMIVEKIRNCYIEVN 62
           ++  KH WT  ED VL+E LL L +    RADNGTF+ G                     
Sbjct: 143 SKATKHRWTTIEDEVLVECLLQLVEEGGWRADNGTFKLGYL------------------- 202

Query: 63  PHLESRVRHLQKQYNAIQEMLGQSASGFGWNDEKKCVEVEKSIFDGWVKSHPTARGLRNK 122
                      KQY AI EM+G + SGFGWN+ +KC+EVEK +FD WVK HP A+GL NK
Sbjct: 203 -----------KQYTAIAEMMGPACSGFGWNEGQKCIEVEKPVFDDWVKGHPNAQGLLNK 262

Query: 123 PFPYYDDLAVIFGKDRAVGGGRKRLYHQGLQVAADLDSYYTLEFDLND-DVPTSDYYPHS 182
           PFPY+ DL V+FG+DRA GG  K       Q A D +    ++ +L D D+P     PH 
Sbjct: 263 PFPYFYDLEVVFGRDRATGGRCKTPVEMSSQTARDTEE-DDMDINLEDFDIPN----PHG 322

Query: 183 EDFIPTYVSMGEEKQTTPRTRPSSSGIPRPSRKRKVSIGENVQDNMTIALEETAEGIAKI 242
            +        GE+  +TP +    +G  RPS+KR+ S   ++ D    ++ ET++ I KI
Sbjct: 323 LE-----PPSGEDMPSTPTSMTHDAGSSRPSKKRR-SYSGDLMDTFRASMRETSKEIGKI 382

Query: 243 FAWPENKVRLEVERSKMLMLELRSLEGMSRADCTTVSNTLLANPTMYTTYIGY 295
             W   K+ +E    K L  EL+++ GM   DC  V+ +LL +PTM   ++ Y
Sbjct: 383 ATWQREKMEIESSLHKRLYAELQTIPGMDVDDCLIVAESLLPDPTMLHAFLDY 394

BLAST of Sed0005350 vs. ExPASy TrEMBL
Match: A0A5A7U7F7 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold675G001200 PE=4 SV=1)

HSP 1 Score: 187.2 bits (474), Expect = 1.1e-43
Identity = 112/310 (36.13%), Positives = 165/310 (53.23%), Query Frame = 0

Query: 3   TRGAKHLWTAEEDRVLIESLLHLSQLKKGRADNGTFRYGVFGEIQKMIVEKIRNCYIEVN 62
           ++  KH WT   D VL+E LL L +    RADNGTF+ G   ++QK++ EKI    I+V 
Sbjct: 6   SKATKHRWTT-IDEVLVECLLQLVEEGGWRADNGTFKLGYLVQVQKLMKEKILGSNIQVT 65

Query: 63  PHLESRVRHLQKQYNAIQEMLGQSASGFGWNDEKKCVEVEKSIFDGWVKSHPTARGLRNK 122
           P+L+SRV+ L+KQY AI EM+G + SGFGWN+E+KC+E EKS+FD WVK   TAR +   
Sbjct: 66  PNLKSRVKILKKQYIAIAEMMGPACSGFGWNEERKCIEAEKSVFDDWVK---TARDIEE- 125

Query: 123 PFPYYDDLAVIFGKDRAVGGGRKRLYHQGLQVAADLDSYYTLEFDLND-DVPTSDYYPHS 182
                DD                                  ++ +L D D+P     PH 
Sbjct: 126 -----DD----------------------------------MDINLEDFDIPN----PHG 185

Query: 183 EDFIPTYVSMGEEKQTTPRTRPSSSGIPRPSRKRKVSIGENVQDNMTIALEETAEGIAKI 242
            +        GE+  +TP +    +G  RPS+KR+ S   ++ D    ++ ET++ I KI
Sbjct: 186 LE-----PPSGEDMPSTPTSMAHDAGSSRPSKKRR-SYSGDLMDTFRASMRETSKEIGKI 245

Query: 243 FAWPENKVRLEVERSKMLMLELRSLEGMSRADCTTVSNTLLANPTMYTTYIGYDDDWKYK 302
            AW   K+ +E    K L ++L+++ GM   DC  V+ +LL +PTM   ++ Y  +WKY+
Sbjct: 246 AAWQREKMEIESSLHKRLYVDLQTIPGMDVDDCLIVAESLLPDPTMLHAFLDYPAEWKYR 261

Query: 303 FCMEVLGGTP 312
            CM +LG  P
Sbjct: 306 KCMRILGRQP 261

BLAST of Sed0005350 vs. ExPASy TrEMBL
Match: A0A803QNC5 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 184.9 bits (468), Expect = 5.4e-43
Identity = 102/304 (33.55%), Positives = 175/304 (57.57%), Query Frame = 0

Query: 5   GAKHLWTAEEDRVLIESLLHLSQLKKGRADNGTFRYGVFGEIQKMIVEKIRNCYIEVNPH 64
           G KH WT+ +D  L+E L+ +    K +ADNGTF+ G   +++KM+ ++I N  I+  PH
Sbjct: 267 GRKHQWTSIQDSKLVECLVDMCNSGKWKADNGTFKPGYLQQLEKMMNDRIPNSGIKAQPH 326

Query: 65  LESRVRHLQKQYNAIQEMLGQSASGFGWNDEKKCVEVEKSIFDGWVKSHPTARGLRNKPF 124
           ++SR++ L++QY AI +MLG SASGFGWN++ KCV  +K +FD WVKSHPTA+GL +KPF
Sbjct: 327 IDSRLKILKRQYTAISDMLGPSASGFGWNEQLKCVVADKIVFDEWVKSHPTAKGLLHKPF 386

Query: 125 PYYDDLAVIFGKDRAVGGGRKRLYHQGLQVAADLDSYYTLEFDLNDDVPTSDYYPHSEDF 184
           PYYD+LA+++GKDRA G G         ++A ++++ +  +FD  D +   +        
Sbjct: 387 PYYDELAIVYGKDRATGDGAMGFSETLDEIAEEINNGWNDDFDPFDPLDEMNANASMNSS 446

Query: 185 IPTYVSMGEEKQTTPRT-RPSSSGIPRPSRKRKVSIGENVQDNMTIALEETAEGIAKIFA 244
           IP+        QTT +  R S++G P         + ++VQ+  T+     ++ I K+  
Sbjct: 447 IPS-------SQTTRKAKRKSNNGDPLVE-----LLSKSVQEFSTMQ-ASASDSIKKLAD 506

Query: 245 WPENKVRLEVERSKMLMLELRSLEGMSRADCTTVSNTLLANPTMYTTYIGYDDDWKYKFC 304
             +++      R K L  E++ ++G++ +    +   L++N      +   ++++K  F 
Sbjct: 507 CFQHEADGAARRMK-LYEEIKKVDGLTNSQRLKIGKLLVSNQPHIDYFFTLEEEFKLDFL 556

Query: 305 MEVL 308
           + +L
Sbjct: 567 LGML 556

BLAST of Sed0005350 vs. ExPASy TrEMBL
Match: A0A6V7PLJ8 (Myb_DNA-bind_3 domain-containing protein OS=Ananas comosus var. bracteatus OX=296719 GN=CB5_LOCUS14915 PE=4 SV=1)

HSP 1 Score: 184.9 bits (468), Expect = 5.4e-43
Identity = 117/316 (37.03%), Positives = 166/316 (52.53%), Query Frame = 0

Query: 4   RGAKHLWTAEEDRVLIESLLHLSQLKKGRADNGTFRYGVFGEIQKMIVEKIRNCYIEVNP 63
           +G KH WT EED  LIE LL L+     +ADNGTFR G   +++K + E++  C ++  P
Sbjct: 15  KGLKHQWTKEEDEKLIECLLELTTSGNWKADNGTFRNGYLQQLEKWMHERLPGCQLKGVP 74

Query: 64  HLESRVRHLQKQYNAIQEMLGQSASGFGWNDEKKCVEVEKSIFDGWVKSHPTARGLRNKP 123
           H+ESR +  ++QYNAI EMLG +ASGFGWND +KC+  EK++FD WVKSHPTA GLR K 
Sbjct: 75  HIESRFKLWKRQYNAISEMLGPAASGFGWNDAEKCIICEKTVFDAWVKSHPTAAGLRGKS 134

Query: 124 FPYYDDLAVIFGKDRAVGGGRKRLYHQGLQVAADLDSYYTLEFDLNDDVPTSDYYPHSED 183
           FPY + L+V+FGKD A G         G + AAD       E +L     T D  P  E 
Sbjct: 135 FPYLEQLSVVFGKDCATGA--------GAESAADAARNVEDE-ELMTHASTQD--PDVEI 194

Query: 184 FIPTYVSMGEEKQTTPRTRPSSSGIPRPSRKRKVSIGE---NVQDNMTIALEETAEG--- 243
           F          +Q++ +   S+ G    S+KRK S+GE   N Q   T+   ET+     
Sbjct: 195 FF--------MEQSSTQVNTSNGGTSAKSKKRKSSLGEDSINEQFCSTLRSLETSFNDAN 254

Query: 244 -----IAKIFAWPENKVRLEVERSKMLMLELRSLEGMSRADCTTVSNTLLANPTMYTTYI 303
                +A  F +  N    E ++ K L  E+  +EG+        +  L  + T   T+ 
Sbjct: 255 QHLGRLANCFQFMANN---ETKKEK-LFEEILKIEGLEDKQIMDATEILTGDSTKLHTFY 307

Query: 304 GYDDDWKYKFCMEVLG 309
              D ++ ++   +LG
Sbjct: 315 CMPDHFRKRYIFRILG 307

BLAST of Sed0005350 vs. ExPASy TrEMBL
Match: A0A5D3BC95 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold220G00380 PE=4 SV=1)

HSP 1 Score: 184.1 bits (466), Expect = 9.2e-43
Identity = 99/243 (40.74%), Positives = 145/243 (59.67%), Query Frame = 0

Query: 3   TRGAKHLWTAEEDRVLIESLLHLSQLKKGRADNGTFRYGVFGEIQKMIVEKIRNCYIEVN 62
           ++  KH WT  +D  L+E LL L +    RA+N TF+     ++QK++ EKI    I+V 
Sbjct: 6   SKATKHRWTTIKDDALVECLLQLVEEGGWRANNETFKPRYLVQVQKLMKEKIPRSNIQVT 65

Query: 63  PHLESRVRHLQKQYNAIQEMLGQSASGFGWNDEKKCVEVEKSIFDGWVKSHPTARGLRNK 122
            +LESRV+ L+KQY AI +M+G + S FGWN+E+KC+E EKS+FD WVK HP ARGL NK
Sbjct: 66  LNLESRVKFLKKQYTAIAKMMGPACSRFGWNEERKCIEAEKSVFDDWVKGHPNARGLLNK 125

Query: 123 PFPYYDDLAVIFGKDRAVGGGRKRLYHQGLQVAADLDSYYTLEFDLND-DVPTSDYYPHS 182
           PF Y+ DL ++FG+D+A GG  K       Q A D +    ++ +L D D+P     PH 
Sbjct: 126 PFAYFYDLEIVFGRDKATGGRCKPFVEMASQTARDTEE-DDMDINLEDFDIPN----PHG 185

Query: 183 EDFIPTYVSMGEEKQTTPRTRPSSSGIPRPSRKRKVSIGENVQDNMTIALEETAEGIAKI 242
            +        GE+  +T  +    +G  RPS+KR+   G+ + D    +++ET++ I KI
Sbjct: 186 LE-----PPSGEDMPSTLISMTHDAGSSRPSKKRRSYPGD-LMDTFRASMQETSKEIGKI 237

Query: 243 FAW 245
            AW
Sbjct: 246 AAW 237

BLAST of Sed0005350 vs. TAIR 10
Match: AT4G02210.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes - 26 (source: NCBI BLink). )

HSP 1 Score: 75.5 bits (184), Expect = 8.8e-14
Identity = 40/127 (31.50%), Positives = 70/127 (55.12%), Query Frame = 0

Query: 10  WTAEEDRVLIESLLHLSQLKKGRADNGTFRYGVFGEIQKMIVEKIR-NCYIEVNPHLESR 69
           W    DR  I+  L L Q ++G    G FR   + E+  +   K   N  ++V   L++R
Sbjct: 186 WHPPMDRYFID--LMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDV---LKNR 245

Query: 70  VRHLQKQYNAIQEMLGQSASGFGWNDEKKCVEVEKSIFDGWVKSHPTARGLRNKPFPYYD 129
            + L++Q+NAI+ +L   + GF W++E++ V  + +++  ++K+H  AR    +P PYY 
Sbjct: 246 YKSLRRQFNAIKSIL--RSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYK 305

Query: 130 DLAVIFG 136
           DL V+ G
Sbjct: 306 DLCVLCG 305

BLAST of Sed0005350 vs. TAIR 10
Match: AT4G02210.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24960.2). )

HSP 1 Score: 75.5 bits (184), Expect = 8.8e-14
Identity = 40/127 (31.50%), Positives = 70/127 (55.12%), Query Frame = 0

Query: 10  WTAEEDRVLIESLLHLSQLKKGRADNGTFRYGVFGEIQKMIVEKIR-NCYIEVNPHLESR 69
           W    DR  I+  L L Q ++G    G FR   + E+  +   K   N  ++V   L++R
Sbjct: 186 WHPPMDRYFID--LMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDV---LKNR 245

Query: 70  VRHLQKQYNAIQEMLGQSASGFGWNDEKKCVEVEKSIFDGWVKSHPTARGLRNKPFPYYD 129
            + L++Q+NAI+ +L   + GF W++E++ V  + +++  ++K+H  AR    +P PYY 
Sbjct: 246 YKSLRRQFNAIKSIL--RSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYK 305

Query: 130 DLAVIFG 136
           DL V+ G
Sbjct: 306 DLCVLCG 305

BLAST of Sed0005350 vs. TAIR 10
Match: AT5G27260.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G29880.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 66.2 bits (160), Expect = 5.4e-11
Identity = 59/215 (27.44%), Positives = 99/215 (46.05%), Query Frame = 0

Query: 4   RGAKHLWTAEEDRVLIESLLHLSQLKKGRADNGTFRYGVFGEIQKMIVE-------KIRN 63
           +G  + W+ EE ++L++ L+        R  NGT        I K+ VE           
Sbjct: 11  KGDYNPWSPEETKLLVQLLVE-GINNNWRDSNGT--------ISKLTVETKFMPEINKEF 70

Query: 64  CYIEVNPHLESRVRHLQKQYNAIQEMLGQSASGFGWNDEKKCVEVEKSIFDGWVKSHPTA 123
           C  +   H  SR+++L+ QY +  + L + +SGFGW+   K       ++  ++K+HP  
Sbjct: 71  CRSKNYNHYLSRMKYLKIQYQSCLD-LQRFSSGFGWDPLTKRFTASDEVWSDYLKAHPNN 130

Query: 124 RGLRNKPFPYYDDLAVIFGKDRAVGGGRKRLYH--QGLQVAADLD--SYYTLEFDLNDDV 183
           + LR   F ++D+L +IFG+  A G     L     GL   A  +    Y  +FD   + 
Sbjct: 131 KQLRYDTFEFFDELQIIFGEGVATGKNAIGLCDSTDGLTYRAGENPRKEYVDDFDNVYEY 190

Query: 184 PTSDYYPHSEDFIPTYVSMG--EEKQTTPRTRPSS 206
            T+ ++  SE + P ++S G  E  +  PR R  S
Sbjct: 191 DTTTHHESSEHYAP-FMSHGTSESPKLPPRKRTRS 214

BLAST of Sed0005350 vs. TAIR 10
Match: AT2G24960.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes - 50 (source: NCBI BLink). )

HSP 1 Score: 58.5 bits (140), Expect = 1.1e-08
Identity = 34/129 (26.36%), Positives = 60/129 (46.51%), Query Frame = 0

Query: 6   AKHLWTAEEDRVLIESLLHLSQLKKGRADNGTFRYGVFGEIQKMIVEKIRNCYIEVNPHL 65
           +K  WT E D+  +E  + + Q+ +G      F    +  I  +++   R         L
Sbjct: 168 SKTEWTLEMDQYFVE--IMVDQIGRGNKTGNAFSKQAW--IDMLVLFNARFSGQYGKRVL 227

Query: 66  ESRVRHLQKQYNAIQEMLGQSASGFGWNDEKKCVEVEKSIFDGWVKSHPTARGLRNKPFP 125
             R   L K Y  ++ +L +   GF W++ +  +  + +++D ++K HP AR  R K  P
Sbjct: 228 RHRYNKLLKYYKDMEAILKE--DGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLP 287

Query: 126 YYDDLAVIF 135
            Y+DL  IF
Sbjct: 288 SYNDLDTIF 290

BLAST of Sed0005350 vs. TAIR 10
Match: AT2G24960.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 12 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 58.5 bits (140), Expect = 1.1e-08
Identity = 34/129 (26.36%), Positives = 60/129 (46.51%), Query Frame = 0

Query: 6   AKHLWTAEEDRVLIESLLHLSQLKKGRADNGTFRYGVFGEIQKMIVEKIRNCYIEVNPHL 65
           +K  WT E D+  +E  + + Q+ +G      F    +  I  +++   R         L
Sbjct: 168 SKTEWTLEMDQYFVE--IMVDQIGRGNKTGNAFSKQAW--IDMLVLFNARFSGQYGKRVL 227

Query: 66  ESRVRHLQKQYNAIQEMLGQSASGFGWNDEKKCVEVEKSIFDGWVKSHPTARGLRNKPFP 125
             R   L K Y  ++ +L +   GF W++ +  +  + +++D ++K HP AR  R K  P
Sbjct: 228 RHRYNKLLKYYKDMEAILKE--DGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLP 287

Query: 126 YYDDLAVIF 135
            Y+DL  IF
Sbjct: 288 SYNDLDTIF 290

The following BLAST results are available for this feature:

NCBI nr
Match Name	E-value	Identity (%)	Description
TYK07921.1	8.2e-46	37.88	hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa]
XP_024022021.1	1.2e-44	38.34	uncharacterized protein LOC112091787 [Morus notabilis]
KAA0050106.1	2.2e-43	36.13	retrotransposon protein [Cucumis melo var. makuwa]
XP_030483301.1	1.1e-42	33.55	uncharacterized protein LOC115699898 [Cannabis sativa]
CAD1831704.1	1.1e-42	37.03	unnamed protein product [Ananas comosus var. bracteatus]

ExPASy TrEMBL
Match Name	E-value	Identity (%)	Description
A0A5D3C7T4	4.0e-46	37.88	Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A5A7U7F7	1.1e-43	36.13	Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A803QNC5	5.4e-43	33.55	Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A6V7PLJ8	5.4e-43	37.03	Myb_DNA-bind_3 domain-containing protein OS=Ananas comosus var. bracteatus OX=29...
A0A5D3BC95	9.2e-43	40.74	Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...

TAIR 10
Match Name	E-value	Identity (%)	Description
AT4G02210.1	8.8e-14	31.50	unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G02210.2	8.8e-14	31.50	unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G27260.1	5.4e-11	27.44	unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G24960.1	1.1e-08	26.36	unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G24960.2	1.1e-08	26.36	unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR024752	Myb/SANT-like domain	PFAM	PF12776	Myb_DNA-bind_3	coord: 10..103; e-value: 1.2E-8; score: 35.8
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 193..218
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 193..208
None	No IPR available	PANTHER	PTHR46250:SF3	MYB/SANT-LIKE DNA-BINDING DOMAIN PROTEIN	coord: 3..304
None	No IPR available	PANTHER	PTHR46250	MYB/SANT-LIKE DNA-BINDING DOMAIN PROTEIN-RELATED	coord: 3..304
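
The alignment coordinates refer to positions in the protein sequence. The sketch below extracts the residues covered by the PF12776 hit (coord: 10..103). It assumes the coordinates are 1-based and inclusive, which the page does not state. The helper name `slice_feature` is hypothetical, and only the first 112 residues of the protein are included to keep the example short.

```python
# First 112 residues of the Sed0005350 protein sequence listed above.
PROTEIN_PREFIX = (
    "MATRGAKHLWTAEEDRVLIESLLHLSQLKKGRADNGTFRYGVFGEIQKMIVEKIRNCYIEVN"
    "PHLESRVRHLQKQYNAIQEMLGQSASGFGWNDEKKCVEVEKSIFDGWVKS"
)


def slice_feature(protein, start, end):
    """Return the residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError(f"coordinates {start}..{end} fall outside the sequence")
    return protein[start - 1:end]


# Pfam PF12776 (Myb_DNA-bind_3) hit reported at coord: 10..103.
domain = slice_feature(PROTEIN_PREFIX, 10, 103)
print(len(domain), domain)
```

The extracted region starts at the tryptophan in position 10, which is also where the TAIR 10 alignments of this protein begin (Query: 10).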

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Sed0005350.1	Sed0005350.1	mRNA