Sed0023566 (gene) Chayote v1

Overview
Name: Sed0023566
Type: gene
Organism: Sechium edule (Chayote v1)
Description: Integrase catalytic domain-containing protein
Location: LG13: 1889381 .. 1891471 (-)
RNA-Seq Expression: Sed0023566
Synteny: Sed0023566
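
The Location field gives a start and end coordinate on LG13 plus a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page), the span length can be computed as below. The helper names `span_length` and `reverse_complement` are hypothetical and only illustrate how a minus-strand (-) feature is typically read off the reference.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def reverse_complement(seq: str) -> str:
    """Reverse-complement a DNA string; needed for features on the (-) strand."""
    complement = str.maketrans("ACGTacgt", "TGCAtgca")
    return seq.translate(complement)[::-1]


# Coordinates copied from the Location field above.
gene_span = span_length(1889381, 1891471)
print(gene_span)  # 2091 under the inclusive-coordinate assumption

# Toy example only: a (-) strand feature is the reverse complement
# of the corresponding reference slice.
print(reverse_complement("ATGC"))  # GCAT
```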
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GGTAGAGAGAAATATGATCTTTTATATTAGATTGTATAAATAGCCCCATGTATATTCCTTTTTAATTAATAATAATAACTCTTTCTCTAATTTCTGAAGTTCAAATTGGTATCAGAGCAATTCTATTGCGCAACAATAAGAAATTCCCAAAATCCAAAACCCTTCTTTCACCGTCAATCTCACAACCTCCATCCTGAGTTCTTCGCCATCAATCTCTGCTTCTTCCGCCGTCAACCTTCTTCCCAAAATTCATCGCCATCAATACCAATCTCAATCTTCGAGCAATGGATCCCAAAAATCCTAAAGATCCAGAAAAAGAATCAAAGATTGAAGAACCATTAAAAGTACCGCCGATTCAGATAGATCCGTCACTGACTATCGTTCCAAGTCGTCAAGAATCGCTGATCGGAGAAAGCTCACAAAGTCAATATTACGTACATCACACCGATACAACGAATTTGGTACTGGTGTCCGAACTGCTAACCAATGATAACTATGTAACTTGGAGCAGATCGATGATCATTGCCCTATCAATCAGGAATAAGCTGGGATTTGTGAATGGGACGATACAGAGGCCGAAAGAAGAAGAGTTTCTACACCTGTGGACAAGAAACAATCATATAGTCATTTCTTGGATCCTAAACTCCATATCAAAAGGTATTTCATCTTCCATTATCTTTACAGATTATGCAAGGGCAATTTGGTTAGATCTTAAAGATCGCTTTGAAAGGAAAAATGGACCTAGGATTTTTCATTTAAAGAAAGGATTGACTACTATCAAACAAGGTCAGGATTCTGTAACAACATATTTCTCAAGAATCAAATCTTTATGGGATGAATATGCTTGCTATAGGCCTAGTTGTACTTGTGGGTTGTGCAGCTGTGGAGGTTTGAAGTCTATTCAAGAATTTGTACATTTTGAGTATCTTTTAGTCTTCTTGATGGGTCTAAATGATGAGTTTTCTACTACTAGATCACAAATACTTCTCATGGATCCATTGCCACCAGCCAATAAGGCTTTTTCTTTAATCGTACAAGAAGAGGAACATAAGGGAGATACAAATATTAAGAGTAATAGTACCTTAGCTGCCACTCAGTCTAAAACCACATACAAAGGGAAGGATTCTAAGCCAGTATGCAAGCATTGTGGTCTCATAGGACACACAATTGATGTTTGCTATAGAATACATGGATATCCGGATAATAGACCTGTGTGCAAGCATTGTGGGTTACAAGGACACACCATCGATGTATGTTATAAAATACATGGGTATCCACCTAGTAACAAGCAAAGGAAAAATAACTACAAGCAAACCAATGATAACCAAGGTTCTGTACAACCTGAAAACAAATCTTGCAAATCAGCAACAGTTGCAGCTAGCAATATTGAAAGTGATCCTTTTCAACAATGTCATGATATATTGACTCTTCTTCAATCCAAGTTAGCTGGCATCAAGAATGACAATGGAGCGAACCTAACGCAACATATGGCAGGTATGACACAAACATATGATTATTTTAAAGATAGATGGATACTAGATTCAGGTGCAGCAGCCCATATATGTCATAACAAAGATATATTCATGAATTTAAAAAGGATTGATACCTCTGTGATATTACCTAATAAGGATAGGATTATAGTCACTCATGCTGGATCTATATTGTTGTGTGGATCTATCATTTTAGATAGAGTCTTGTATGTTCCAAGTTTTAAATACAATTTATTGTCTATTAGTGCACTAACCTTGAATGATGCAGTGTTAGTAAATTTCACAACTAATGCTTGTATTATCCAGGACAAGCGCACTTTGAAGATGATTGGGAAGGGTAATCTTGAGCAAGGATTATATGCGTTAGAGGAGGTACCTTTATCTGCAGCATTGAATACTGTTTGTAGTGTAAGGAGTGGCTCACCATCCCTATGGCATAGGAGATTAGGTCATCCAGCTGATTTACCTTTAGTTGCTTTAAAAAATGTACTTTCTTTTGATGCAAATTGT
AAAGGGGCTGAAAATTGTACTATATGCCCTTTGGCTAAACAAAACAGATTGAGATTCATTTCAAATAATAATAAATCAGATGCTATTTTTG

mRNA sequence

GGTAGAGAGAAATATGATCTTTTATATTAGATTGTATAAATAGCCCCATGTATATTCCTTTTTAATTAATAATAATAACTCTTTCTCTAATTTCTGAAGTTCAAATTGGTATCAGAGCAATTCTATTGCGCAACAATAAGAAATTCCCAAAATCCAAAACCCTTCTTTCACCGTCAATCTCACAACCTCCATCCTGAGTTCTTCGCCATCAATCTCTGCTTCTTCCGCCGTCAACCTTCTTCCCAAAATTCATCGCCATCAATACCAATCTCAATCTTCGAGCAATGGATCCCAAAAATCCTAAAGATCCAGAAAAAGAATCAAAGATTGAAGAACCATTAAAAGTACCGCCGATTCAGATAGATCCGTCACTGACTATCGTTCCAAGTCGTCAAGAATCGCTGATCGGAGAAAGCTCACAAAGTCAATATTACGTACATCACACCGATACAACGAATTTGGTACTGGTGTCCGAACTGCTAACCAATGATAACTATGTAACTTGGAGCAGATCGATGATCATTGCCCTATCAATCAGGAATAAGCTGGGATTTGTGAATGGGACGATACAGAGGCCGAAAGAAGAAGAGTTTCTACACCTGTGGACAAGAAACAATCATATAGTCATTTCTTGGATCCTAAACTCCATATCAAAAGGTATTTCATCTTCCATTATCTTTACAGATTATGCAAGGGCAATTTGGTTAGATCTTAAAGATCGCTTTGAAAGGAAAAATGGACCTAGGATTTTTCATTTAAAGAAAGGATTGACTACTATCAAACAAGGTCAGGATTCTGTAACAACATATTTCTCAAGAATCAAATCTTTATGGGATGAATATGCTTGCTATAGGCCTAGTTGTACTTGTGGGTTGTGCAGCTGTGGAGGTTTGAAGTCTATTCAAGAATTTGTACATTTTGAGTATCTTTTAGTCTTCTTGATGGGTCTAAATGATGAGTTTTCTACTACTAGATCACAAATACTTCTCATGGATCCATTGCCACCAGCCAATAAGGCTTTTTCTTTAATCGTACAAGAAGAGGAACATAAGGGAGATACAAATATTAAGAGTAATAGTACCTTAGCTGCCACTCAGTCTAAAACCACATACAAAGGGAAGGATTCTAAGCCAGTATGCAAGCATTGTGGTCTCATAGGACACACAATTGATGTTTGCTATAGAATACATGGATATCCGGATAATAGACCTGTGTGCAAGCATTGTGGGTTACAAGGACACACCATCGATGTATGTTATAAAATACATGGGTATCCACCTAGTAACAAGCAAAGGAAAAATAACTACAAGCAAACCAATGATAACCAAGGTTCTGTACAACCTGAAAACAAATCTTGCAAATCAGCAACAGTTGCAGCTAGCAATATTGAAAGTGATCCTTTTCAACAATGTCATGATATATTGACTCTTCTTCAATCCAAGTTAGCTGGCATCAAGAATGACAATGGAGCGAACCTAACGCAACATATGGCAGGACAAGCGCACTTTGAAGATGATTGGGAAGGGTAATCTTGAGCAAGGATTATATGCGTTAGAGGAGGTACCTTTATCTGCAGCATTGAATACTGTTTGTAGTGTAAGGAGTGGCTCACCATCCCTATGGCATAGGAGATTAGGTCATCCAGCTGATTTACCTTTAGTTGCTTTAAAAAATGTACTTTCTTTTGATGCAAATTGTAAAGGGGCTGAAAATTGTACTATATGCCCTTTGGCTAAACAAAACAGATTGAGATTCATTTCAAATAATAATAAATCAGATGCTATTTTTG

Coding sequence (CDS)

ATGGATCCCAAAAATCCTAAAGATCCAGAAAAAGAATCAAAGATTGAAGAACCATTAAAAGTACCGCCGATTCAGATAGATCCGTCACTGACTATCGTTCCAAGTCGTCAAGAATCGCTGATCGGAGAAAGCTCACAAAGTCAATATTACGTACATCACACCGATACAACGAATTTGGTACTGGTGTCCGAACTGCTAACCAATGATAACTATGTAACTTGGAGCAGATCGATGATCATTGCCCTATCAATCAGGAATAAGCTGGGATTTGTGAATGGGACGATACAGAGGCCGAAAGAAGAAGAGTTTCTACACCTGTGGACAAGAAACAATCATATAGTCATTTCTTGGATCCTAAACTCCATATCAAAAGGTATTTCATCTTCCATTATCTTTACAGATTATGCAAGGGCAATTTGGTTAGATCTTAAAGATCGCTTTGAAAGGAAAAATGGACCTAGGATTTTTCATTTAAAGAAAGGATTGACTACTATCAAACAAGGTCAGGATTCTGTAACAACATATTTCTCAAGAATCAAATCTTTATGGGATGAATATGCTTGCTATAGGCCTAGTTGTACTTGTGGGTTGTGCAGCTGTGGAGGTTTGAAGTCTATTCAAGAATTTGTACATTTTGAGTATCTTTTAGTCTTCTTGATGGGTCTAAATGATGAGTTTTCTACTACTAGATCACAAATACTTCTCATGGATCCATTGCCACCAGCCAATAAGGCTTTTTCTTTAATCGTACAAGAAGAGGAACATAAGGGAGATACAAATATTAAGAGTAATAGTACCTTAGCTGCCACTCAGTCTAAAACCACATACAAAGGGAAGGATTCTAAGCCAGTATGCAAGCATTGTGGTCTCATAGGACACACAATTGATGTTTGCTATAGAATACATGGATATCCGGATAATAGACCTGTGTGCAAGCATTGTGGGTTACAAGGACACACCATCGATGTATGTTATAAAATACATGGGTATCCACCTAGTAACAAGCAAAGGAAAAATAACTACAAGCAAACCAATGATAACCAAGGTTCTGTACAACCTGAAAACAAATCTTGCAAATCAGCAACAGTTGCAGCTAGCAATATTGAAAGTGATCCTTTTCAACAATGTCATGATATATTGACTCTTCTTCAATCCAAGTTAGCTGGCATCAAGAATGACAATGGAGCGAACCTAACGCAACATATGGCAGGACAAGCGCACTTTGAAGATGATTGGGAAGGGTAA

Protein sequence

MDPKNPKDPEKESKIEEPLKVPPIQIDPSLTIVPSRQESLIGESSQSQYYVHHTDTTNLVLVSELLTNDNYVTWSRSMIIALSIRNKLGFVNGTIQRPKEEEFLHLWTRNNHIVISWILNSISKGISSSIIFTDYARAIWLDLKDRFERKNGPRIFHLKKGLTTIKQGQDSVTTYFSRIKSLWDEYACYRPSCTCGLCSCGGLKSIQEFVHFEYLLVFLMGLNDEFSTTRSQILLMDPLPPANKAFSLIVQEEEHKGDTNIKSNSTLAATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHTIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGQAHFEDDWEG
Homology
BLAST of Sed0023566 vs. NCBI nr
Match: XP_022154973.1 (uncharacterized protein LOC111022117 [Momordica charantia])

HSP 1 Score: 340.5 bits (872), Expect = 2.0e-89
Identity = 177/379 (46.70%), Positives = 242/379 (63.85%), Query Frame = 0

Query: 51  VHHTDTTNLVLVSELLTNDNYVTWSRSMIIALSIRNKLGFVNGTIQRPKEEEFLHLWTRN 110
           +HH DT+NLVLVS+ LTN NYV+WSRSM IALSI+NKLGF+NG++ +P   + L +W RN
Sbjct: 1   MHHNDTSNLVLVSKPLTNSNYVSWSRSMTIALSIKNKLGFINGSLPKP-AGDLLPVWIRN 60

Query: 111 NHIVISWILNSISKGISSSIIFTDYARAIWLDLKDRFERKNGPRIFHLKKGLTTIKQGQD 170
            H+VI+W LNS+SK IS+S+IFT+    IWLDLKDRF+ +NGP+IF L++ L T+ Q Q 
Sbjct: 61  KHVVIAWFLNSVSKPISASLIFTNSTHEIWLDLKDRFQLQNGPQIFQLRRDLATLTQDQL 120

Query: 171 SVTTYFSRIKSLWDEYACYRPSCTCGLCSCGGLKSIQEFVHFEYLLVFLMGLNDEFSTTR 230
           SVT Y++++K+LWDEY  YRP CTCG CSCGG + +++FV FE+L+ FLMGLN+ F+  R
Sbjct: 121 SVTMYYTKLKALWDEYVSYRPGCTCGSCSCGGYRLVEKFVQFEHLMKFLMGLNESFAHIR 180

Query: 231 SQILLMDPLPPANKAFSLIVQEEEHKGDTNIKSNST---LAATQSKT-------TYKGKD 290
           +QILLMDP P   KAFSLI QEE+ +      + S    LA  QS++       + +   
Sbjct: 181 AQILLMDPPPSIGKAFSLISQEEQQRVIPLFSTPSPAVGLAVNQSRSSSASNSGSRQRNS 240

Query: 291 SKPVCKHCGLIGHTIDVCYRIHGYPD--NRPVCKHCGLQGHTIDVCYKIHGYPPSNKQRK 350
           S P C +CG+ GHT+D CYR+HG+P        +H      T  V       P SN    
Sbjct: 241 SCPYCTNCGIRGHTVDKCYRLHGFPSGYRSKGNQHSSTPSMTSSVSSTTSSSPASNSPST 300

Query: 351 ---NNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNG 410
              N+  QT+ +   + P              + SD F QCH+IL +LQS+L   K D+ 
Sbjct: 301 AIVNSISQTSASSSLISP--------------MTSDAFSQCHNILNMLQSQLNAAKTDSE 360

Query: 411 ANLTQHMAGQAHFEDDWEG 415
           A  + ++AG+ HF+DDW+G
Sbjct: 361 A--SPYLAGKVHFQDDWQG 362
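
The percentages in each HSP summary line are the matched count divided by the alignment length. A small sketch of that arithmetic, using the first HSP's numbers; the parser `parse_fractions` is a hypothetical helper, and the regular expression matches any `label = n/m` pair without depending on how the label is spelled.

```python
import re

# Summary line of the first HSP (counts copied from the record above).
hsp_line = "Identity = 177/379 (46.70%), Positives = 242/379 (63.85%), Query Frame = 0"


def parse_fractions(line: str) -> dict:
    """Return {label: percentage} for every 'label = n/m' pair in the line."""
    result = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        result[label] = round(100 * int(num) / int(den), 2)
    return result


percentages = parse_fractions(hsp_line)
print(percentages)
```

The recomputed values match the percentages printed in the HSP line (46.70 and 63.85).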

BLAST of Sed0023566 vs. NCBI nr
Match: XP_022154919.1 (uncharacterized protein LOC111022065 [Momordica charantia])

HSP 1 Score: 307.4 bits (786), Expect = 1.9e-79
Identity = 161/372 (43.28%), Positives = 237/372 (63.71%), Query Frame = 0

Query: 40  LIGESSQSQYYVHHTDTTNLVLVSELLTNDNYVTWSRSMIIALSIRNKLGFVNGTIQRPK 99
           ++ E   + Y++HH+D T+LVLVS+LLT++NY +WSRS++IAL+++NK+GFV+G+I RP 
Sbjct: 20  IVVEQHANPYFLHHSDNTSLVLVSDLLTDENYTSWSRSIVIALTVKNKIGFVDGSISRPT 79

Query: 100 EEEFLHLWTRNNHIVISWILNSISKGISSSIIFTDYARAIWLDLKDRFERKNGPRIFHLK 159
           +   LH W   N++VISWI NS+SK IS+S++F+D A  IWLDLK+RF+R+N PRIF L+
Sbjct: 80  DGR-LHSWIICNNVVISWIFNSLSKKISASVLFSDSAHEIWLDLKERFQRQNRPRIFQLR 139

Query: 160 KGLTTIKQGQDSVTTYFSRIKSLWDEYACYRPSCTCGLCSCGGLKSIQEFVHFEYLLVFL 219
           + L+ + Q Q SVT YF+R+K+LW E A YRP+C+CG CS GG+KSI+     EY++ FL
Sbjct: 140 RELSNLTQDQLSVTAYFTRLKTLWSELALYRPACSCGRCSYGGVKSIEAHYQQEYVMAFL 199

Query: 220 MGLNDEFSTTRSQILLMDPLPPANKAFSLIVQEEEHKGDTNIKSNSTLAATQSKTTYKGK 279
           MGLN  FS  R+Q+LLM+P P  N+AF+L+ QE + +   ++ S ++  A+  + T    
Sbjct: 200 MGLNVSFSQIRAQLLLMEPAPTINRAFALVAQEMQQR-SISLPSVTSPTASAVRATSNSS 259

Query: 280 DSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHTIDVCYKIHGYPPSNKQRKN 339
           +S+            ++     H    ++ +C HCG+ GHT+D CYK+H YPP    R +
Sbjct: 260 NSR------------LNSSSASHMKRKDKSLCTHCGIYGHTVDKCYKLHEYPPG--YRSS 319

Query: 340 NYKQTNDNQGSVQPENKSCKSATVAASNIESD----PFQQCHDILTLLQSKLAGIK---- 399
            +K T+ N  S +      KS +   S I +        QC  +LTLLQS L   K    
Sbjct: 320 VHKTTSSNATSSRSAEAPSKSVSATPSGISNSLATLTADQCQRLLTLLQSHLTTTKTASD 372

Query: 400 NDNGANLTQHMA 404
           ND+G   T H+A
Sbjct: 380 NDSG---TSHVA 372

BLAST of Sed0023566 vs. NCBI nr
Match: GFY98609.1 (haloacid dehalogenase-like hydrolase (HAD) superfamily protein [Actinidia rufa])

HSP 1 Score: 305.8 bits (782), Expect = 5.5e-79
Identity = 165/395 (41.77%), Positives = 237/395 (60.00%), Query Frame = 0

Query: 40  LIGESSQSQYYVHHTDTTNLVLVSELLTNDNYVTWSRSMIIALSIRNKLGFVNGTIQRPK 99
           L  +   S Y++HH+D   LVLVS+ LT DNY +W+R+MIIALS++NKLGF++G+I +P+
Sbjct: 262 LATDDPSSPYFLHHSDGPELVLVSQSLTGDNYASWNRAMIIALSVKNKLGFIDGSITKPE 321

Query: 100 --EEEFLHLWTRNNHIVISWILNSISKGISSSIIFTDYARAIWLDLKDRFERKNGPRIFH 159
             +   L+ W RNN++VISWILNS+SK IS+SIIF+  A  IW+DLKDRF++ NGPRIF 
Sbjct: 322 GNDTNLLNSWIRNNNVVISWILNSVSKEISASIIFSASANEIWIDLKDRFQQSNGPRIFQ 381

Query: 160 LKKGLTTIKQGQDSVTTYFSRIKSLWDEYACYRPSCTCGLCSCGGLKSIQEFVHFEYLLV 219
           L++ L    Q Q  V+ YF+++K++W+E   YRP+C+CG C+CGG+K +      EY++ 
Sbjct: 382 LRRELMNHVQDQSPVSVYFTKLKTIWEELNNYRPACSCGNCTCGGVKKLNSHYQMEYIMS 441

Query: 220 FLMGLNDEFSTTRSQILLMDPLPPANKAFSLIVQEEEHKG---DTNIKSNS--TLA-ATQ 279
           FLM L+  F+  R Q+LLMDPLPP NK FSLI QEE  +      N  SNS  T+A A +
Sbjct: 442 FLMVLHYSFAQIRGQLLLMDPLPPINKVFSLISQEEHQRKIGIHVNSISNSADTMAFAIK 501

Query: 280 SKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHTIDVCYKIHGYP 339
           ++   +  D        G  G+     Y+  G   +R  C HC   GHTI+ CYK HGYP
Sbjct: 502 NENLKRFSDKSGSSNSGGYRGNQNSASYK--GQKKDRAFCTHCNFHGHTIEKCYKRHGYP 561

Query: 340 PSNKQR-KNNYKQTNDNQGSVQPENKSCKSATVA----------ASNIESDPFQQCHDIL 399
           P  K R ++ Y  +N +    Q  N SC  +              +N+ S+ +QQ   ++
Sbjct: 562 PGFKPRSRDAYTTSNSHNAVNQVSNHSCSISEARNDQQDNVGNFVTNLNSNQYQQ---LM 621

Query: 400 TLLQSKLA-GIKNDNGANLTQHMAGQAHFEDDWEG 415
            +L + +A  +K+      T +  G    EDDW+G
Sbjct: 622 CMLSNHMASSVKDQQDNPSTSYTTGHQQQEDDWQG 651

BLAST of Sed0023566 vs. NCBI nr
Match: KAA8536734.1 (hypothetical protein F0562_029212 [Nyssa sinensis])

HSP 1 Score: 303.5 bits (776), Expect = 2.7e-78
Identity = 158/357 (44.26%), Positives = 224/357 (62.75%), Query Frame = 0

Query: 43  ESSQSQYYVHHTDTTNLVLVSELLTNDNYVTWSRSMIIALSIRNKLGFVNGTIQRPK--E 102
           E   + YY+HH+++   VLVS+ LT +NY  WSR+M+IALS++NKLGFV+G I  P+  +
Sbjct: 24  EEPSNPYYLHHSNSPGQVLVSQQLTGENYTNWSRAMLIALSVKNKLGFVDGFIPEPQGTD 83

Query: 103 EEFLHLWTRNNHIVISWILNSISKGISSSIIFTDYARAIWLDLKDRFERKNGPRIFHLKK 162
              L  W RNN+IVISWILNSISK IS+SIIF  +AR IWLDL+DRF+++NGPRIF LK+
Sbjct: 84  NNLLDSWIRNNNIVISWILNSISKEISASIIFAAFAREIWLDLRDRFQQRNGPRIFQLKR 143

Query: 163 GLTTIKQGQDSVTTYFSRIKSLWDEYACYRPSCTCGLCSCGGLKSIQEFVHFEYLLVFLM 222
            L  ++Q Q SV+ YF+++K++W+E + YRP+C+CG C CGG+K++ ++   EY++ FLM
Sbjct: 144 ELMNLRQEQSSVSIYFTKVKTIWEELSNYRPNCSCGKCYCGGVKNLNDYHQTEYIMSFLM 203

Query: 223 GLNDEFSTTRSQILLMDPLPPANKAFSLIVQEEEHKGDTNIKSNSTLAATQSKTTYKGKD 282
           GL+D FS    Q+LLMD +PP N+ FSLIVQEE+ +  TN+ S+S+ +        K   
Sbjct: 204 GLDDSFSQVSGQLLLMDSMPPINRVFSLIVQEEQQR-RTNLSSDSSNSTGTMAFVVKTDV 263

Query: 283 SKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHTIDVCYKIHGYPPSNKQRKNN 342
           +K      G                 +RP C HC + GHT+D CYKIHGYPP  K R NN
Sbjct: 264 AK--SGGSGSQNSQNSNSSASKNQKRDRPYCTHCKILGHTVDRCYKIHGYPPGYKFRSNN 323

Query: 343 YKQTNDNQGSVQPE-NKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGA 397
                  Q S   + +    S      N+ S+ +QQ   ++++L + L+  K    A
Sbjct: 324 NSNAAAYQVSTSDDRSDQSNSFEGFVQNLNSNQYQQ---LMSMLSTHLSSSKKVTNA 374

BLAST of Sed0023566 vs. NCBI nr
Match: XP_038895765.1 (uncharacterized protein LOC120083929 [Benincasa hispida])

HSP 1 Score: 300.8 bits (769), Expect = 1.8e-77
Identity = 144/256 (56.25%), Positives = 191/256 (74.61%), Query Frame = 0

Query: 21  VPPIQIDPSL-----TIVPSRQESLIGESSQSQYYVHHTDTTNLVLVSELLTNDNYVTWS 80
           +P I + PS      T  P+    L      + Y +HH+DT+NLVLVSELLT+DNYV+WS
Sbjct: 3   IPEIPLRPSADPSSSTDFPTTGLPLSHLDQYTPYALHHSDTSNLVLVSELLTDDNYVSWS 62

Query: 81  RSMIIALSIRNKLGFVNGTIQRPKEEEFLHLWTRNNHIVISWILNSISKGISSSIIFTDY 140
           RSM++ L I+NKLGF++G++ RP   + LHLW  NN++V+SWIL S+SK ISSSI+FT+ 
Sbjct: 63  RSMVLTLFIQNKLGFIDGSLPRP-TGDLLHLWIHNNNVVVSWILKSVSKSISSSILFTES 122

Query: 141 ARAIWLDLKDRFERKNGPRIFHLKKGLTTIKQGQDSVTTYFSRIKSLWDEYACYRPSCTC 200
           A+AIWLDL+D F+R+NGPRIFHLK+ L+++KQ QDSVT YF+++KS  DEY  YRP CTC
Sbjct: 123 AQAIWLDLQDCFQRRNGPRIFHLKRELSSLKQDQDSVTMYFTKMKSFCDEYVSYRPGCTC 182

Query: 201 GLCSCGGLKSIQEFVHFEYLLVFLMGLNDEFSTTRSQILLMDPLPPANKAFSLIVQEEEH 260
           G C+CGG+KS+++F+ FEYLL F MGLND F+ TRSQ+LLMDP PP NKAFS + Q+E+H
Sbjct: 183 GQCTCGGIKSMEDFLQFEYLLCFFMGLNDSFNHTRSQLLLMDPPPPLNKAFSFVFQQEQH 242

Query: 261 KGDTNIKSNSTLAATQ 272
           +   N  S+      Q
Sbjct: 243 QSLANPPSSVVTLTVQ 257

BLAST of Sed0023566 vs. ExPASy TrEMBL
Match: A0A6J1DLQ9 (uncharacterized protein LOC111022117 OS=Momordica charantia OX=3673 GN=LOC111022117 PE=4 SV=1)

HSP 1 Score: 340.5 bits (872), Expect = 9.8e-90
Identity = 177/379 (46.70%), Positives = 242/379 (63.85%), Query Frame = 0

Query: 51  VHHTDTTNLVLVSELLTNDNYVTWSRSMIIALSIRNKLGFVNGTIQRPKEEEFLHLWTRN 110
           +HH DT+NLVLVS+ LTN NYV+WSRSM IALSI+NKLGF+NG++ +P   + L +W RN
Sbjct: 1   MHHNDTSNLVLVSKPLTNSNYVSWSRSMTIALSIKNKLGFINGSLPKP-AGDLLPVWIRN 60

Query: 111 NHIVISWILNSISKGISSSIIFTDYARAIWLDLKDRFERKNGPRIFHLKKGLTTIKQGQD 170
            H+VI+W LNS+SK IS+S+IFT+    IWLDLKDRF+ +NGP+IF L++ L T+ Q Q 
Sbjct: 61  KHVVIAWFLNSVSKPISASLIFTNSTHEIWLDLKDRFQLQNGPQIFQLRRDLATLTQDQL 120

Query: 171 SVTTYFSRIKSLWDEYACYRPSCTCGLCSCGGLKSIQEFVHFEYLLVFLMGLNDEFSTTR 230
           SVT Y++++K+LWDEY  YRP CTCG CSCGG + +++FV FE+L+ FLMGLN+ F+  R
Sbjct: 121 SVTMYYTKLKALWDEYVSYRPGCTCGSCSCGGYRLVEKFVQFEHLMKFLMGLNESFAHIR 180

Query: 231 SQILLMDPLPPANKAFSLIVQEEEHKGDTNIKSNST---LAATQSKT-------TYKGKD 290
           +QILLMDP P   KAFSLI QEE+ +      + S    LA  QS++       + +   
Sbjct: 181 AQILLMDPPPSIGKAFSLISQEEQQRVIPLFSTPSPAVGLAVNQSRSSSASNSGSRQRNS 240

Query: 291 SKPVCKHCGLIGHTIDVCYRIHGYPD--NRPVCKHCGLQGHTIDVCYKIHGYPPSNKQRK 350
           S P C +CG+ GHT+D CYR+HG+P        +H      T  V       P SN    
Sbjct: 241 SCPYCTNCGIRGHTVDKCYRLHGFPSGYRSKGNQHSSTPSMTSSVSSTTSSSPASNSPST 300

Query: 351 ---NNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNG 410
              N+  QT+ +   + P              + SD F QCH+IL +LQS+L   K D+ 
Sbjct: 301 AIVNSISQTSASSSLISP--------------MTSDAFSQCHNILNMLQSQLNAAKTDSE 360

Query: 411 ANLTQHMAGQAHFEDDWEG 415
           A  + ++AG+ HF+DDW+G
Sbjct: 361 A--SPYLAGKVHFQDDWQG 362

BLAST of Sed0023566 vs. ExPASy TrEMBL
Match: A0A6J1DNP7 (uncharacterized protein LOC111022065 OS=Momordica charantia OX=3673 GN=LOC111022065 PE=4 SV=1)

HSP 1 Score: 307.4 bits (786), Expect = 9.2e-80
Identity = 161/372 (43.28%), Positives = 237/372 (63.71%), Query Frame = 0

Query: 40  LIGESSQSQYYVHHTDTTNLVLVSELLTNDNYVTWSRSMIIALSIRNKLGFVNGTIQRPK 99
           ++ E   + Y++HH+D T+LVLVS+LLT++NY +WSRS++IAL+++NK+GFV+G+I RP 
Sbjct: 20  IVVEQHANPYFLHHSDNTSLVLVSDLLTDENYTSWSRSIVIALTVKNKIGFVDGSISRPT 79

Query: 100 EEEFLHLWTRNNHIVISWILNSISKGISSSIIFTDYARAIWLDLKDRFERKNGPRIFHLK 159
           +   LH W   N++VISWI NS+SK IS+S++F+D A  IWLDLK+RF+R+N PRIF L+
Sbjct: 80  DGR-LHSWIICNNVVISWIFNSLSKKISASVLFSDSAHEIWLDLKERFQRQNRPRIFQLR 139

Query: 160 KGLTTIKQGQDSVTTYFSRIKSLWDEYACYRPSCTCGLCSCGGLKSIQEFVHFEYLLVFL 219
           + L+ + Q Q SVT YF+R+K+LW E A YRP+C+CG CS GG+KSI+     EY++ FL
Sbjct: 140 RELSNLTQDQLSVTAYFTRLKTLWSELALYRPACSCGRCSYGGVKSIEAHYQQEYVMAFL 199

Query: 220 MGLNDEFSTTRSQILLMDPLPPANKAFSLIVQEEEHKGDTNIKSNSTLAATQSKTTYKGK 279
           MGLN  FS  R+Q+LLM+P P  N+AF+L+ QE + +   ++ S ++  A+  + T    
Sbjct: 200 MGLNVSFSQIRAQLLLMEPAPTINRAFALVAQEMQQR-SISLPSVTSPTASAVRATSNSS 259

Query: 280 DSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHTIDVCYKIHGYPPSNKQRKN 339
           +S+            ++     H    ++ +C HCG+ GHT+D CYK+H YPP    R +
Sbjct: 260 NSR------------LNSSSASHMKRKDKSLCTHCGIYGHTVDKCYKLHEYPPG--YRSS 319

Query: 340 NYKQTNDNQGSVQPENKSCKSATVAASNIESD----PFQQCHDILTLLQSKLAGIK---- 399
            +K T+ N  S +      KS +   S I +        QC  +LTLLQS L   K    
Sbjct: 320 VHKTTSSNATSSRSAEAPSKSVSATPSGISNSLATLTADQCQRLLTLLQSHLTTTKTASD 372

Query: 400 NDNGANLTQHMA 404
           ND+G   T H+A
Sbjct: 380 NDSG---TSHVA 372

BLAST of Sed0023566 vs. ExPASy TrEMBL
Match: A0A7J0FKC9 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein OS=Actinidia rufa OX=165716 GN=Acr_13g0000100 PE=4 SV=1)

HSP 1 Score: 305.8 bits (782), Expect = 2.7e-79
Identity = 165/395 (41.77%), Positives = 237/395 (60.00%), Query Frame = 0

Query: 40  LIGESSQSQYYVHHTDTTNLVLVSELLTNDNYVTWSRSMIIALSIRNKLGFVNGTIQRPK 99
           L  +   S Y++HH+D   LVLVS+ LT DNY +W+R+MIIALS++NKLGF++G+I +P+
Sbjct: 262 LATDDPSSPYFLHHSDGPELVLVSQSLTGDNYASWNRAMIIALSVKNKLGFIDGSITKPE 321

Query: 100 --EEEFLHLWTRNNHIVISWILNSISKGISSSIIFTDYARAIWLDLKDRFERKNGPRIFH 159
             +   L+ W RNN++VISWILNS+SK IS+SIIF+  A  IW+DLKDRF++ NGPRIF 
Sbjct: 322 GNDTNLLNSWIRNNNVVISWILNSVSKEISASIIFSASANEIWIDLKDRFQQSNGPRIFQ 381

Query: 160 LKKGLTTIKQGQDSVTTYFSRIKSLWDEYACYRPSCTCGLCSCGGLKSIQEFVHFEYLLV 219
           L++ L    Q Q  V+ YF+++K++W+E   YRP+C+CG C+CGG+K +      EY++ 
Sbjct: 382 LRRELMNHVQDQSPVSVYFTKLKTIWEELNNYRPACSCGNCTCGGVKKLNSHYQMEYIMS 441

Query: 220 FLMGLNDEFSTTRSQILLMDPLPPANKAFSLIVQEEEHKG---DTNIKSNS--TLA-ATQ 279
           FLM L+  F+  R Q+LLMDPLPP NK FSLI QEE  +      N  SNS  T+A A +
Sbjct: 442 FLMVLHYSFAQIRGQLLLMDPLPPINKVFSLISQEEHQRKIGIHVNSISNSADTMAFAIK 501

Query: 280 SKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHTIDVCYKIHGYP 339
           ++   +  D        G  G+     Y+  G   +R  C HC   GHTI+ CYK HGYP
Sbjct: 502 NENLKRFSDKSGSSNSGGYRGNQNSASYK--GQKKDRAFCTHCNFHGHTIEKCYKRHGYP 561

Query: 340 PSNKQR-KNNYKQTNDNQGSVQPENKSCKSATVA----------ASNIESDPFQQCHDIL 399
           P  K R ++ Y  +N +    Q  N SC  +              +N+ S+ +QQ   ++
Sbjct: 562 PGFKPRSRDAYTTSNSHNAVNQVSNHSCSISEARNDQQDNVGNFVTNLNSNQYQQ---LM 621

Query: 400 TLLQSKLA-GIKNDNGANLTQHMAGQAHFEDDWEG 415
            +L + +A  +K+      T +  G    EDDW+G
Sbjct: 622 CMLSNHMASSVKDQQDNPSTSYTTGHQQQEDDWQG 651

BLAST of Sed0023566 vs. ExPASy TrEMBL
Match: A0A5J5B2C5 (Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_029212 PE=4 SV=1)

HSP 1 Score: 303.5 bits (776), Expect = 1.3e-78
Identity = 158/357 (44.26%), Positives = 224/357 (62.75%), Query Frame = 0

Query: 43  ESSQSQYYVHHTDTTNLVLVSELLTNDNYVTWSRSMIIALSIRNKLGFVNGTIQRPK--E 102
           E   + YY+HH+++   VLVS+ LT +NY  WSR+M+IALS++NKLGFV+G I  P+  +
Sbjct: 24  EEPSNPYYLHHSNSPGQVLVSQQLTGENYTNWSRAMLIALSVKNKLGFVDGFIPEPQGTD 83

Query: 103 EEFLHLWTRNNHIVISWILNSISKGISSSIIFTDYARAIWLDLKDRFERKNGPRIFHLKK 162
              L  W RNN+IVISWILNSISK IS+SIIF  +AR IWLDL+DRF+++NGPRIF LK+
Sbjct: 84  NNLLDSWIRNNNIVISWILNSISKEISASIIFAAFAREIWLDLRDRFQQRNGPRIFQLKR 143

Query: 163 GLTTIKQGQDSVTTYFSRIKSLWDEYACYRPSCTCGLCSCGGLKSIQEFVHFEYLLVFLM 222
            L  ++Q Q SV+ YF+++K++W+E + YRP+C+CG C CGG+K++ ++   EY++ FLM
Sbjct: 144 ELMNLRQEQSSVSIYFTKVKTIWEELSNYRPNCSCGKCYCGGVKNLNDYHQTEYIMSFLM 203

Query: 223 GLNDEFSTTRSQILLMDPLPPANKAFSLIVQEEEHKGDTNIKSNSTLAATQSKTTYKGKD 282
           GL+D FS    Q+LLMD +PP N+ FSLIVQEE+ +  TN+ S+S+ +        K   
Sbjct: 204 GLDDSFSQVSGQLLLMDSMPPINRVFSLIVQEEQQR-RTNLSSDSSNSTGTMAFVVKTDV 263

Query: 283 SKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHTIDVCYKIHGYPPSNKQRKNN 342
           +K      G                 +RP C HC + GHT+D CYKIHGYPP  K R NN
Sbjct: 264 AK--SGGSGSQNSQNSNSSASKNQKRDRPYCTHCKILGHTVDRCYKIHGYPPGYKFRSNN 323

Query: 343 YKQTNDNQGSVQPE-NKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGA 397
                  Q S   + +    S      N+ S+ +QQ   ++++L + L+  K    A
Sbjct: 324 NSNAAAYQVSTSDDRSDQSNSFEGFVQNLNSNQYQQ---LMSMLSTHLSSSKKVTNA 374

BLAST of Sed0023566 vs. ExPASy TrEMBL
Match: A0A5J5BKC2 (Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_021321 PE=4 SV=1)

HSP 1 Score: 293.5 bits (750), Expect = 1.4e-75
Identity = 145/302 (48.01%), Positives = 200/302 (66.23%), Query Frame = 0

Query: 43  ESSQSQYYVHHTDTTNLVLVSELLTNDNYVTWSRSMIIALSIRNKLGFVNGTIQRPK--E 102
           E   + YY+HH+D+   +LVS+ LT +NY  WSR+M+IALS++NKLGFV+G+I  P+   
Sbjct: 24  EEPSNPYYLHHSDSLRQMLVSQQLTGENYTNWSRAMLIALSVKNKLGFVDGSILEPQGTG 83

Query: 103 EEFLHLWTRNNHIVISWILNSISKGISSSIIFTDYARAIWLDLKDRFERKNGPRIFHLKK 162
              L+ W RNN+IVISWILNS+SK IS+SIIF   AR IWLDL+DRF+++N PRIF LK+
Sbjct: 84  NNLLNSWIRNNNIVISWILNSVSKEISASIIFAASAREIWLDLRDRFQQRNRPRIFQLKR 143

Query: 163 GLTTIKQGQDSVTTYFSRIKSLWDEYACYRPSCTCGLCSCGGLKSIQEFVHFEYLLVFLM 222
            L  + Q Q SV+ YF+++K++W+E + YR +C+CG CSCGG+K++ +    EY++ FLM
Sbjct: 144 ELMNLHQEQSSVSIYFTKLKTIWEELSNYRLNCSCGKCSCGGVKNLNDHHQMEYIMSFLM 203

Query: 223 GLNDEFSTTRSQILLMDPLPPANKAFSLIVQEEEHKGDTNIKSNSTLAATQSKTTYKGKD 282
           GL+D FS  R Q+LLMDP+PP N+ FSLIVQEE+ +     ++NS+  ++ S  T     
Sbjct: 204 GLDDSFSQVRGQLLLMDPMPPINRVFSLIVQEEQQR-----RTNSSSDSSNSTGTMAFAV 263

Query: 283 SKPVCKH--CGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHTIDVCYKIHGYPPSNKQRK 341
              V K    G                 +R  C HC + GHT+D CYKIHGYPP  K + 
Sbjct: 264 KTDVAKSGGSGSQNSQNSNSSASKNQKRDRLYCMHCKILGHTVDRCYKIHGYPPGYKFKS 320

BLAST of Sed0023566 vs. TAIR 10
Match: AT1G21280.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 117.5 bits (293), Expect = 2.6e-26
Identity = 67/216 (31.02%), Positives = 112/216 (51.85%), Query Frame = 0

Query: 47  SQYY----VHHTDTTNLVLVSELLTNDNYVTWSRSMIIALSIRNKLGFVNGTIQRPKEEE 106
           S YY    +HH    ++  +S+    DNYV W       L +  K GF++GT+ +P    
Sbjct: 16  SPYYLPPDIHHPSDFSIQKLSK--DEDNYVAWKIRFRSFLRVTKKFGFIDGTLPKPDPFS 75

Query: 107 FLHL-WTRNNHIVISWILNSISKGISSSIIFTDYARAIWLDLKDRFERKNGPRIFHLKKG 166
            L+  W + N +V+ W++NS++  +  S+++ + A  +W DL+  F      +I+ L++ 
Sbjct: 76  PLYQPWEQCNAMVMYWLMNSMTDKLLESVMYAETAHKMWEDLRRVFVPCVDLKIYQLRRR 135

Query: 167 LTTIKQGQDSVTTYFSRIKSLWDEYACYR--PSCTCGLCSCGGLKSIQEFVHFEYLLVFL 226
           L T++QG DSV  YF ++  +W E + Y   P C CG C+C   K  +E    E    FL
Sbjct: 136 LATLRQGGDSVEEYFGKLSKVWMELSEYAPIPECKCGGCNCECTKRAEEAREKEQRYEFL 195

Query: 227 MG--LNDEFSTTRSQILLMDPLPPANKAFSLIVQEE 254
           MG  LN  F    ++I+   P P  ++AF+++   E
Sbjct: 196 MGLKLNQGFEAVTTKIMFQKPPPSLHEAFAMVKDAE 229

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity  Description
XP_022154973.1  2.0e-89  46.70     uncharacterized protein LOC111022117 [Momordica charantia]
XP_022154919.1  1.9e-79  43.28     uncharacterized protein LOC111022065 [Momordica charantia]
GFY98609.1      5.5e-79  41.77     haloacid dehalogenase-like hydrolase (HAD) superfamily protein [Actinidia rufa]
KAA8536734.1    2.7e-78  44.26     hypothetical protein F0562_029212 [Nyssa sinensis]
XP_038895765.1  1.8e-77  56.25     uncharacterized protein LOC120083929 [Benincasa hispida]

ExPASy TrEMBL
Match Name      E-value  Identity  Description
A0A6J1DLQ9      9.8e-90  46.70     uncharacterized protein LOC111022117 OS=Momordica charantia OX=3673 GN=LOC111022...
A0A6J1DNP7      9.2e-80  43.28     uncharacterized protein LOC111022065 OS=Momordica charantia OX=3673 GN=LOC111022...
A0A7J0FKC9      2.7e-79  41.77     Haloacid dehalogenase-like hydrolase (HAD) superfamily protein OS=Actinidia rufa...
A0A5J5B2C5      1.3e-78  44.26     Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_029212 PE=4 SV=1
A0A5J5BKC2      1.4e-75  48.01     Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_021321 PE=4 SV=1

TAIR 10
Match Name      E-value  Identity  Description
AT1G21280.1     2.6e-26  31.02     CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Ha...

InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                         Source       Source Term    Source Description    Alignment
IPR005162  Retrotransposon gag domain              PFAM         PF03732        Retrotrans_gag        coord: 116..187; e-value: 2.1E-7; score: 31.1
IPR029472  Retrotransposon Copia-like, N-terminal  PFAM         PF14244        Retrotran_gag_3       coord: 52..99; e-value: 2.2E-18; score: 65.7
None       No IPR available                        GENE3D       4.10.60.10     -                     coord: 261..329; e-value: 4.6E-5; score: 25.1
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 1..20
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 335..358
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 1..18
None       No IPR available                        PANTHER      PTHR34222      FAMILY NOT NAMED      coord: 307..386; coord: 86..308
None       No IPR available                        PANTHER      PTHR34222:SF6  OS02G0671800 PROTEIN  coord: 307..386; coord: 86..308

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Sed0023566.1  Sed0023566.1  mRNA