Homology
BLAST of HG10020559 vs. NCBI nr
Match:
XP_038893558.1 (putative pentatricopeptide repeat-containing protein At1g74580 [Benincasa hispida])
HSP 1 Score: 1524.6 bits (3946), Expect = 0.0e+00
Identity = 735/772 (95.21%), Positives = 756/772 (97.93%), Query Frame = 0
Query: 1 MNRALQPKHVAAVIRYQNDPLKALQMFNQVKTEDGFKHTLATYKCMIEKLGLHGQFEAME 60
MNRALQPKHVAAVIRYQNDPLKAL+MFNQVKTEDGFKHTL TYKCMIEKLGLHGQFEAME
Sbjct: 1 MNRALQPKHVAAVIRYQNDPLKALRMFNQVKTEDGFKHTLETYKCMIEKLGLHGQFEAME 60
Query: 61 DVLADMRKNVDNKMLEGVYIRIMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYNVIMNI 120
DVLA+MRKN DNKMLEGVYI IMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYNVIMNI
Sbjct: 61 DVLAEMRKNFDNKMLEGVYIGIMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYNVIMNI 120
Query: 121 LVEYGYFNQAHKVYMRMKYIGIYPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN 180
LVEYGYFNQAHKVYMRMK IGIYPDVYTHTIRIKSFCRTGRPSAA+RLLNNMP QGCEFN
Sbjct: 121 LVEYGYFNQAHKVYMRMKDIGIYPDVYTHTIRIKSFCRTGRPSAAMRLLNNMPAQGCEFN 180
Query: 181 AVSYCTVIGGFYEENCQIEAYHLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFN 240
AVSYCTVIGGFYEENCQIEAYHLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFN
Sbjct: 181 AVSYCTVIGGFYEENCQIEAYHLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFN 240
Query: 241 KVLKRGVCPNLFTFNIFIQGLCRKGRIDEAARLLESILSEGLTPDVVSYNTLICGFCKHS 300
KVLKRGVCPNLFTFNIFIQGLCRKG ID AARLLESI+SEGLTPDVVSYNTLICGFCKHS
Sbjct: 241 KVLKRGVCPNLFTFNIFIQGLCRKGAIDGAARLLESIISEGLTPDVVSYNTLICGFCKHS 300
Query: 301 KLVEAECYLRKMVNSGFEPNEFTYNTIINGFCKMGMMQNADKILCDAMFKGFMPDEFTYS 360
KLVEAECYL KMVN+GFEPNEFTYNTIINGFCKMGMMQNADKIL +AMFKGFMPDEFTYS
Sbjct: 301 KLVEAECYLHKMVNNGFEPNEFTYNTIINGFCKMGMMQNADKILREAMFKGFMPDEFTYS 360
Query: 361 SLINGLCDDGDMNQAMAVFNEAMEKGFKHSIILYNTVVKGFSKQGLVLQALQLMKDMMGH 420
SLINGLCD GDMN+AMAVFNEAMEKGFKHSIILYNT+VKG SKQGLVLQALQLMKDMM H
Sbjct: 361 SLINGLCDYGDMNRAMAVFNEAMEKGFKHSIILYNTLVKGLSKQGLVLQALQLMKDMMEH 420
Query: 421 GCSPDIWTYNLVVNGLCKMGCLSDASELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKA 480
GCSPDIWTYNLVVNGLCKMGCLSDA+ELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKA
Sbjct: 421 GCSPDIWTYNLVVNGLCKMGCLSDANELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKA 480
Query: 481 IEILDTMLSHGITPDVITYNTLLNGLCKARKLDNVVETFKAMLEKGCTPNIITYNILIES 540
+EILDTMLSHGITPDVITYNTLLNGLCKARKLDNVV+TF MLEKGCTPNIITYNILIES
Sbjct: 481 LEILDTMLSHGITPDVITYNTLLNGLCKARKLDNVVDTFTVMLEKGCTPNIITYNILIES 540
Query: 541 FCKDRKVSEAMDLFEEMKTRGLTPDIVTLCTLICGLCSNGELDKAYQLFLTLEKEYKFSY 600
FCKDRKVS+AMD+FEEMKTRGLTPDIVTLCTLICGLC+NGELDKAYQLF+TLEKEYKFSY
Sbjct: 541 FCKDRKVSKAMDMFEEMKTRGLTPDIVTLCTLICGLCNNGELDKAYQLFVTLEKEYKFSY 600
Query: 601 STAIFNIMINAFCEKLNINMAEKLFHKMGGCDCAPDNYTYRVMIDSYCKTGNIDPAHTFL 660
STAIFNIMINAFCEKLNI+MAEKLFHK+GGCDCAPDNYTYRVMIDSYCKTGNIDPAH FL
Sbjct: 601 STAIFNIMINAFCEKLNISMAEKLFHKLGGCDCAPDNYTYRVMIDSYCKTGNIDPAHAFL 660
Query: 661 LEKINKGFVPSFTTCGRVLNCLCVKHRLSEAVDIINLMVQNGIVPEEVNSIFEADKKEIA 720
LEKINKG VPSFTTCGRVLNCLCVKH+LSEAVDIINLMVQNGIVPEEVNSIFEADKKE+A
Sbjct: 661 LEKINKGLVPSFTTCGRVLNCLCVKHKLSEAVDIINLMVQNGIVPEEVNSIFEADKKEVA 720
Query: 721 APKIVVEYLMKKSHITYYSYELLYDGIRDRKLEKKKFKRSPSLGSGKRHVNL 773
APKIVVEYL+KKSHITYYSYELLYDGIRDRKL+KKKFKRSPSLGSGKRHVNL
Sbjct: 721 APKIVVEYLLKKSHITYYSYELLYDGIRDRKLDKKKFKRSPSLGSGKRHVNL 772
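Note: the percentages on each HSP statistics line are the match count divided by the alignment length (for the hit above, 735/772 for identity). The following is a minimal sketch, not part of the BLAST output, for recomputing them from a statistics line; the function name parse_hsp_stats and the regular expression are illustrative assumptions, and the pattern accepts any single-word label before the "=" sign.

```python
import re

# Hypothetical helper: matches fragments such as "Identity = 735/772 (95.21%)".
HSP_STATS = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, reported_percent)}."""
    return {
        label: (int(n), int(total), float(pct))
        for label, n, total, pct in HSP_STATS.findall(line)
    }


# Statistics line of the first hit (XP_038893558.1).
line = "Identity = 735/772 (95.21%), Positives = 756/772 (97.93%), Query Frame = 0"
stats = parse_hsp_stats(line)

for label, (n, total, pct) in stats.items():
    # Recompute the percentage from the raw counts for comparison.
    recomputed = round(100 * n / total, 2)
    print(f"{label}: {n}/{total} reported {pct}%, recomputed {recomputed}%")
```

The "Query Frame = 0" field is ignored because it does not have the count/length form.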
BLAST of HG10020559 vs. NCBI nr
Match:
KAG6584456.1 (putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1482.6 bits (3837), Expect = 0.0e+00
Identity = 716/778 (92.03%), Positives = 748/778 (96.14%), Query Frame = 0
Query: 1 MNRALQPKHVAAVIRYQNDPLKALQMFNQVKTEDGFKHTLATYKCMIEKLGLHGQFEAME 60
MNRALQPKHVAAVIRYQNDPLKAL+MFNQVKTEDGFKHTLATYKCMIEKLGLHG+FEAME
Sbjct: 1 MNRALQPKHVAAVIRYQNDPLKALKMFNQVKTEDGFKHTLATYKCMIEKLGLHGEFEAME 60
Query: 61 DVLADMRKNVDNKMLEGVYIRIMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYNVIMNI 120
DVLA+MRKNVDNKMLEGVYI IMRDYGRKGK+QEAVNVFERMDFYDC PSVQSYNVIMNI
Sbjct: 61 DVLAEMRKNVDNKMLEGVYIGIMRDYGRKGKIQEAVNVFERMDFYDCVPSVQSYNVIMNI 120
Query: 121 LVEYGYFNQAHKVYMRMKYIGIYPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN 180
LV+YGYFNQAHKVYMRMK IGI PDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN
Sbjct: 121 LVDYGYFNQAHKVYMRMKDIGILPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN 180
Query: 181 AVSYCTVIGGFYEENCQIEAYHLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFN 240
AVSYC VIGG+YEENCQIEAYHLF EML+QGICPDILTFNKLIHVLCKKGNVQESEKLFN
Sbjct: 181 AVSYCAVIGGYYEENCQIEAYHLFHEMLQQGICPDILTFNKLIHVLCKKGNVQESEKLFN 240
Query: 241 KVLKRGVCPNLFTFNIFIQGLCRKGRIDEAARLLESILSEGLTPDVVSYNTLICGFCKHS 300
KVLKRGVCPNLFTFNIFIQGLCRKG IDEAARLLESI+SEGLTPDVVSYNTLICGFCKHS
Sbjct: 241 KVLKRGVCPNLFTFNIFIQGLCRKGAIDEAARLLESIISEGLTPDVVSYNTLICGFCKHS 300
Query: 301 KLVEAECYLRKMVNSGFEPNEFTYNTIINGFCKMGMMQNADKILCDAMFKGFMPDEFTYS 360
KLVEAECYLRKMVN+GFEPNEFTYNTII+GFCKMGMMQNADKIL DAMFKGF+PDEFTYS
Sbjct: 301 KLVEAECYLRKMVNNGFEPNEFTYNTIIDGFCKMGMMQNADKILRDAMFKGFVPDEFTYS 360
Query: 361 SLINGLCDDGDMNQAMAVFNEAMEKGFKHSIILYNTVVKGFSKQGLVLQALQLMKDMMGH 420
SLINGLCDDGDMN+AMAVF+EAMEKGFKHSI+LYNT+VKG S+QGLVLQALQLMKDM+ H
Sbjct: 361 SLINGLCDDGDMNRAMAVFSEAMEKGFKHSIVLYNTLVKGLSQQGLVLQALQLMKDMLEH 420
Query: 421 GCSPDIWTYNLVVNGLCKMGCLSDASELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKA 480
GCSPDIWTYNLVVN LCKMGCLSDASE LNDAIAKGCIPDIFTFNTLIDGYCK LNLDKA
Sbjct: 421 GCSPDIWTYNLVVNALCKMGCLSDASEFLNDAIAKGCIPDIFTFNTLIDGYCKHLNLDKA 480
Query: 481 IEILDTMLSHGITPDVITYNTLLNGLCKARKLDNVVETFKAMLEKGCTPNIITYNILIES 540
IE LDTMLSHGITPDVITYNTLLNGLCKA+KL+NVV+TFKAMLEKGC PNIITYNILIES
Sbjct: 481 IETLDTMLSHGITPDVITYNTLLNGLCKAKKLNNVVDTFKAMLEKGCIPNIITYNILIES 540
Query: 541 FCKDRKVSEAMDLFEEMKTRGLTPDIVTLCTLICGLCSNGELDKAYQLFLTLEKEYKFSY 600
FCK RKV EAMD FEEMKTRGLTPDIVTLCTLICGLCSNGEL+KAYQLF+T+EKEYKFSY
Sbjct: 541 FCKSRKVGEAMDWFEEMKTRGLTPDIVTLCTLICGLCSNGELEKAYQLFVTIEKEYKFSY 600
Query: 601 STAIFNIMINAFCEKLNINMAEKLFHKMGGCDCAPDNYTYRVMIDSYCKTGNIDPAHTFL 660
STAIFNIMINAFCEKLN++MAE+LFHKMGGC CAPD+YTYRVMID+YCKTGNIDPA TFL
Sbjct: 601 STAIFNIMINAFCEKLNVSMAERLFHKMGGCGCAPDSYTYRVMIDTYCKTGNIDPAQTFL 660
Query: 661 LEKINKGFVPSFTTCGRVLNCLCVKHRLSEAVDIINLMVQNGIVPEEVNSIFEADKKEIA 720
LEKINKG VPSFTTCGRVLNCLCVKHRLSEAV IINLMVQNGIVPEEVNSIFEADKKE+A
Sbjct: 661 LEKINKGLVPSFTTCGRVLNCLCVKHRLSEAVGIINLMVQNGIVPEEVNSIFEADKKEVA 720
Query: 721 APKIVVEYLMKKSHITYYSYELLYDGIRDRK------LEKKKFKRSPSLGSGKRHVNL 773
APKIVVE+LMKKSHITYYSYELLYDGIRDRK L+KKKFKRSPSLG GK H+NL
Sbjct: 721 APKIVVEHLMKKSHITYYSYELLYDGIRDRKLDKKKLLDKKKFKRSPSLGPGKGHLNL 778
BLAST of HG10020559 vs. NCBI nr
Match:
XP_022923786.1 (uncharacterized protein LOC111431396 [Cucurbita moschata])
HSP 1 Score: 1482.6 bits (3837), Expect = 0.0e+00
Identity = 716/778 (92.03%), Positives = 748/778 (96.14%), Query Frame = 0
Query: 1 MNRALQPKHVAAVIRYQNDPLKALQMFNQVKTEDGFKHTLATYKCMIEKLGLHGQFEAME 60
MNRALQPKHVAAVIRYQNDPLKAL+MFNQVKTEDGFKHTLATYKCMIEKLGLHG+FEAME
Sbjct: 1 MNRALQPKHVAAVIRYQNDPLKALKMFNQVKTEDGFKHTLATYKCMIEKLGLHGEFEAME 60
Query: 61 DVLADMRKNVDNKMLEGVYIRIMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYNVIMNI 120
DVLA+MRKNVDNKMLEGVYI IMRDYGRKGK+QEAVNVFERMDFYDC PSVQSYNVIMNI
Sbjct: 61 DVLAEMRKNVDNKMLEGVYIGIMRDYGRKGKIQEAVNVFERMDFYDCVPSVQSYNVIMNI 120
Query: 121 LVEYGYFNQAHKVYMRMKYIGIYPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN 180
LV+YGYFNQAHKVYMRMK IGI PDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN
Sbjct: 121 LVDYGYFNQAHKVYMRMKDIGILPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN 180
Query: 181 AVSYCTVIGGFYEENCQIEAYHLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFN 240
AVSYC VIGG+YEENCQIEAYHLF EML+QGICPDILTFNKLIHVLCKKGNVQESEKLFN
Sbjct: 181 AVSYCAVIGGYYEENCQIEAYHLFHEMLQQGICPDILTFNKLIHVLCKKGNVQESEKLFN 240
Query: 241 KVLKRGVCPNLFTFNIFIQGLCRKGRIDEAARLLESILSEGLTPDVVSYNTLICGFCKHS 300
KVLKRGVCPNLFTFNIFIQGLCRKG IDEAARLLESI+SEGLTPDVVSYNTLICGFCKHS
Sbjct: 241 KVLKRGVCPNLFTFNIFIQGLCRKGAIDEAARLLESIISEGLTPDVVSYNTLICGFCKHS 300
Query: 301 KLVEAECYLRKMVNSGFEPNEFTYNTIINGFCKMGMMQNADKILCDAMFKGFMPDEFTYS 360
KLVEAECYLRKMVN+GFEPNEFTYNTII+GFCKMGMMQNADKIL DAMFKGF+PDEFTYS
Sbjct: 301 KLVEAECYLRKMVNNGFEPNEFTYNTIIDGFCKMGMMQNADKILRDAMFKGFVPDEFTYS 360
Query: 361 SLINGLCDDGDMNQAMAVFNEAMEKGFKHSIILYNTVVKGFSKQGLVLQALQLMKDMMGH 420
SLINGLCDDGDMN+AMAVF+EAMEKGFKHSI+LYNT+VKG S+QGLVLQALQLMKDM+ H
Sbjct: 361 SLINGLCDDGDMNRAMAVFSEAMEKGFKHSIVLYNTLVKGLSQQGLVLQALQLMKDMLEH 420
Query: 421 GCSPDIWTYNLVVNGLCKMGCLSDASELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKA 480
GCSPDIWTYNLVVN LCKMGCLSDASE LNDAIAKGCIPDIFTFNTLIDGYCK LNLDKA
Sbjct: 421 GCSPDIWTYNLVVNALCKMGCLSDASEFLNDAIAKGCIPDIFTFNTLIDGYCKHLNLDKA 480
Query: 481 IEILDTMLSHGITPDVITYNTLLNGLCKARKLDNVVETFKAMLEKGCTPNIITYNILIES 540
IE LDTMLSHGITPDVITYNTLLNGLCKA+KL+NVV+TFKAMLEKGC PNIITYNILIES
Sbjct: 481 IETLDTMLSHGITPDVITYNTLLNGLCKAKKLNNVVDTFKAMLEKGCIPNIITYNILIES 540
Query: 541 FCKDRKVSEAMDLFEEMKTRGLTPDIVTLCTLICGLCSNGELDKAYQLFLTLEKEYKFSY 600
FCK RKV EAMD FEEMKTRGLTPDIVTLCTLICGLCSNGEL+KAYQLF+T+EKEYKFSY
Sbjct: 541 FCKSRKVGEAMDWFEEMKTRGLTPDIVTLCTLICGLCSNGELEKAYQLFVTIEKEYKFSY 600
Query: 601 STAIFNIMINAFCEKLNINMAEKLFHKMGGCDCAPDNYTYRVMIDSYCKTGNIDPAHTFL 660
STAIFNIMINAFCEKLN++MAE+LFHKMGGC CAPD+YTYRVMID+YCKTGNIDPA TFL
Sbjct: 601 STAIFNIMINAFCEKLNVSMAERLFHKMGGCGCAPDSYTYRVMIDTYCKTGNIDPAQTFL 660
Query: 661 LEKINKGFVPSFTTCGRVLNCLCVKHRLSEAVDIINLMVQNGIVPEEVNSIFEADKKEIA 720
LEKINKG VPSFTTCGRVLNCLCVKHRLSEAV IINLMVQNGIVPEEVNSIFEADKKE+A
Sbjct: 661 LEKINKGLVPSFTTCGRVLNCLCVKHRLSEAVGIINLMVQNGIVPEEVNSIFEADKKEVA 720
Query: 721 APKIVVEYLMKKSHITYYSYELLYDGIRDRK------LEKKKFKRSPSLGSGKRHVNL 773
APKIVVE+LMKKSHITYYSYELLYDGIRDRK L+KKKFKRSPSLG GK H+NL
Sbjct: 721 APKIVVEHLMKKSHITYYSYELLYDGIRDRKLDKKKLLDKKKFKRSPSLGPGKGHLNL 778
BLAST of HG10020559 vs. NCBI nr
Match:
XP_023519594.1 (putative pentatricopeptide repeat-containing protein At1g74580 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1471.8 bits (3809), Expect = 0.0e+00
Identity = 711/778 (91.39%), Positives = 743/778 (95.50%), Query Frame = 0
Query: 1 MNRALQPKHVAAVIRYQNDPLKALQMFNQVKTEDGFKHTLATYKCMIEKLGLHGQFEAME 60
MNRALQPKHVAAVIRYQNDPLKAL+MFNQVKTEDGFKHTL TYKCMIEKLGLHG+FEAME
Sbjct: 1 MNRALQPKHVAAVIRYQNDPLKALKMFNQVKTEDGFKHTLVTYKCMIEKLGLHGEFEAME 60
Query: 61 DVLADMRKNVDNKMLEGVYIRIMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYNVIMNI 120
DVLA+MRKNVDNKMLEGVYI IMRDYGRKGK+QEAVNVFERMDFYDC PSVQSYNVIMNI
Sbjct: 61 DVLAEMRKNVDNKMLEGVYIGIMRDYGRKGKIQEAVNVFERMDFYDCVPSVQSYNVIMNI 120
Query: 121 LVEYGYFNQAHKVYMRMKYIGIYPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN 180
LV+YGYFNQAHKVYMRMK IGI PDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN
Sbjct: 121 LVDYGYFNQAHKVYMRMKDIGILPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN 180
Query: 181 AVSYCTVIGGFYEENCQIEAYHLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFN 240
AVSYC VIGG+YEENCQIEAYHLF EML+QGICPDILTFNKLIHVLCKKGNVQESEKLFN
Sbjct: 181 AVSYCAVIGGYYEENCQIEAYHLFHEMLQQGICPDILTFNKLIHVLCKKGNVQESEKLFN 240
Query: 241 KVLKRGVCPNLFTFNIFIQGLCRKGRIDEAARLLESILSEGLTPDVVSYNTLICGFCKHS 300
KVLKRGVCPNLFTFNIFIQGLCRKG IDEAARLLESI+SEGLTPDVVSYNTLICGFCKHS
Sbjct: 241 KVLKRGVCPNLFTFNIFIQGLCRKGAIDEAARLLESIISEGLTPDVVSYNTLICGFCKHS 300
Query: 301 KLVEAECYLRKMVNSGFEPNEFTYNTIINGFCKMGMMQNADKILCDAMFKGFMPDEFTYS 360
KLVEAECYLRKMVN+GFEPNEFTYNTII+GFCKMGMM NADKIL DAMFKGF+PDEFTYS
Sbjct: 301 KLVEAECYLRKMVNNGFEPNEFTYNTIIDGFCKMGMMPNADKILRDAMFKGFVPDEFTYS 360
Query: 361 SLINGLCDDGDMNQAMAVFNEAMEKGFKHSIILYNTVVKGFSKQGLVLQALQLMKDMMGH 420
SLINGLC+DGDMN+AMAVFNEAMEKGFKHSIILYNT+VKG S+QGLVLQALQLMKDM+ H
Sbjct: 361 SLINGLCNDGDMNRAMAVFNEAMEKGFKHSIILYNTLVKGLSQQGLVLQALQLMKDMLEH 420
Query: 421 GCSPDIWTYNLVVNGLCKMGCLSDASELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKA 480
GCSPDIWTYNLVVN LCKMGCLSDASE LNDAIAKGCIPDIFTFNTLIDGYCK LNLDKA
Sbjct: 421 GCSPDIWTYNLVVNALCKMGCLSDASEFLNDAIAKGCIPDIFTFNTLIDGYCKHLNLDKA 480
Query: 481 IEILDTMLSHGITPDVITYNTLLNGLCKARKLDNVVETFKAMLEKGCTPNIITYNILIES 540
IE LDTMLSHGITPDVITYNTLLNGLCKA+KL+NVV+TFKAMLEKGC PNIITYNILIES
Sbjct: 481 IETLDTMLSHGITPDVITYNTLLNGLCKAKKLNNVVDTFKAMLEKGCIPNIITYNILIES 540
Query: 541 FCKDRKVSEAMDLFEEMKTRGLTPDIVTLCTLICGLCSNGELDKAYQLFLTLEKEYKFSY 600
FCK RKV EAMD FEEMKTRGL PDIVTLCTLICGLCSNGEL+KAYQLF+ +EKEYKFSY
Sbjct: 541 FCKARKVGEAMDWFEEMKTRGLNPDIVTLCTLICGLCSNGELEKAYQLFVKIEKEYKFSY 600
Query: 601 STAIFNIMINAFCEKLNINMAEKLFHKMGGCDCAPDNYTYRVMIDSYCKTGNIDPAHTFL 660
STAIFNIMINAFCEKLN++MAE+LFHKMGGC CAPD+YTYRVMID+YCKTGNIDPA TFL
Sbjct: 601 STAIFNIMINAFCEKLNVSMAERLFHKMGGCGCAPDSYTYRVMIDTYCKTGNIDPAQTFL 660
Query: 661 LEKINKGFVPSFTTCGRVLNCLCVKHRLSEAVDIINLMVQNGIVPEEVNSIFEADKKEIA 720
LE +NKG VPSFTTCGRVLNCLCVKHRLSEAV IINLMVQNGIVPEEVNSIFEADKKE+A
Sbjct: 661 LENVNKGLVPSFTTCGRVLNCLCVKHRLSEAVGIINLMVQNGIVPEEVNSIFEADKKEVA 720
Query: 721 APKIVVEYLMKKSHITYYSYELLYDGIRDRK------LEKKKFKRSPSLGSGKRHVNL 773
APKIVVE+LMKKSHITYYSYELLYDGIRDRK L+KKKFKRSPSLG GK H+NL
Sbjct: 721 APKIVVEHLMKKSHITYYSYELLYDGIRDRKLDKKKLLDKKKFKRSPSLGPGKGHLNL 778
BLAST of HG10020559 vs. NCBI nr
Match:
XP_011649732.1 (putative pentatricopeptide repeat-containing protein At1g74580 [Cucumis sativus] >XP_031736299.1 putative pentatricopeptide repeat-containing protein At1g74580 [Cucumis sativus] >KAE8652508.1 hypothetical protein Csa_014110 [Cucumis sativus])
HSP 1 Score: 1467.6 bits (3798), Expect = 0.0e+00
Identity = 709/771 (91.96%), Positives = 742/771 (96.24%), Query Frame = 0
Query: 1 MNRALQPKHVAAVIRYQNDPLKALQMFNQVKTEDGFKHTLATYKCMIEKLGLHGQFEAME 60
MNRALQPKHVAAVIRYQNDPL AL+MFNQVKTEDGFKHTL TYKCMIEKLGLHG+FEAME
Sbjct: 1 MNRALQPKHVAAVIRYQNDPLNALKMFNQVKTEDGFKHTLETYKCMIEKLGLHGKFEAME 60
Query: 61 DVLADMRKNVDNKMLEGVYIRIMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYNVIMNI 120
DVLA+MRKNVD+KMLEGVYI IMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYN IMNI
Sbjct: 61 DVLAEMRKNVDSKMLEGVYIGIMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYNAIMNI 120
Query: 121 LVEYGYFNQAHKVYMRMKYIGIYPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN 180
LVEYGYF+QAHKVYMRMK IGIYPDVYTHTIR+KSFC TGRP+AALRLLNNMPGQGCEFN
Sbjct: 121 LVEYGYFSQAHKVYMRMKDIGIYPDVYTHTIRMKSFCITGRPTAALRLLNNMPGQGCEFN 180
Query: 181 AVSYCTVIGGFYEENCQIEAYHLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFN 240
AVSYC VI GFY+ENCQIEAYHLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLF+
Sbjct: 181 AVSYCAVISGFYKENCQIEAYHLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFS 240
Query: 241 KVLKRGVCPNLFTFNIFIQGLCRKGRIDEAARLLESILSEGLTPDVVSYNTLICGFCKHS 300
KV+KRGVCPNLFTFNIFIQGLCRKG IDEAARLLESI+SEGLTPDV+SYNTLICGFCKHS
Sbjct: 241 KVMKRGVCPNLFTFNIFIQGLCRKGAIDEAARLLESIVSEGLTPDVISYNTLICGFCKHS 300
Query: 301 KLVEAECYLRKMVNSGFEPNEFTYNTIINGFCKMGMMQNADKILCDAMFKGFMPDEFTYS 360
KLVEAECYL KMVNSG EPNEFTYNTIINGFCK GMMQNADKIL DAMFKGF+PDEFTYS
Sbjct: 301 KLVEAECYLHKMVNSGVEPNEFTYNTIINGFCKAGMMQNADKILRDAMFKGFIPDEFTYS 360
Query: 361 SLINGLCDDGDMNQAMAVFNEAMEKGFKHSIILYNTVVKGFSKQGLVLQALQLMKDMMGH 420
SLINGLC+DGDMN+AMAVF EAMEKGFKHSIILYNT+VKG SKQGLVLQALQLMKDMM H
Sbjct: 361 SLINGLCNDGDMNRAMAVFYEAMEKGFKHSIILYNTLVKGLSKQGLVLQALQLMKDMMEH 420
Query: 421 GCSPDIWTYNLVVNGLCKMGCLSDASELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKA 480
GCSPDIWTYNLVVNGLCKMGCLSDA+ +LNDAIAKGCIPDIFTFNTLIDGYCKQ N+DKA
Sbjct: 421 GCSPDIWTYNLVVNGLCKMGCLSDANGILNDAIAKGCIPDIFTFNTLIDGYCKQRNMDKA 480
Query: 481 IEILDTMLSHGITPDVITYNTLLNGLCKARKLDNVVETFKAMLEKGCTPNIITYNILIES 540
IEILDTMLSHGITPDVITYNTLLNGLCKARKLDNVV+TFKAMLEKGCTPNIITYNILIES
Sbjct: 481 IEILDTMLSHGITPDVITYNTLLNGLCKARKLDNVVDTFKAMLEKGCTPNIITYNILIES 540
Query: 541 FCKDRKVSEAMDLFEEMKTRGLTPDIVTLCTLICGLCSNGELDKAYQLFLTLEKEYKFSY 600
FCKDRKVSEAM+LF+EMKTRGLTPDIVTLCTLICGLCSNGELDKAY+LF+T+EKEYKFSY
Sbjct: 541 FCKDRKVSEAMELFKEMKTRGLTPDIVTLCTLICGLCSNGELDKAYELFVTIEKEYKFSY 600
Query: 601 STAIFNIMINAFCEKLNINMAEKLFHKMGGCDCAPDNYTYRVMIDSYCKTGNIDPAHTFL 660
STAIFNIMINAFCEKLN++MAEKLFHKMGG DCAPDNYTYRVMIDSYCKTGNID AHTFL
Sbjct: 601 STAIFNIMINAFCEKLNVSMAEKLFHKMGGSDCAPDNYTYRVMIDSYCKTGNIDLAHTFL 660
Query: 661 LEKINKGFVPSFTTCGRVLNCLCVKHRLSEAVDIINLMVQNGIVPEEVNSIFEADKKEIA 720
LE I+KG VPSFTTCG+VLNCLCV HRLSEAV IINLMVQNGIVPEEVNSIFEADKKE+A
Sbjct: 661 LENISKGLVPSFTTCGKVLNCLCVTHRLSEAVVIINLMVQNGIVPEEVNSIFEADKKEVA 720
Query: 721 APKIVVEYLMKKSHITYYSYELLYDGIRDRKLEKKKFKRSPSLGSGKRHVN 772
APKIVVEYL+KKSHITYYSYELLYDGIR+RKL+ KKFKRS SL SGKR N
Sbjct: 721 APKIVVEYLLKKSHITYYSYELLYDGIRNRKLDNKKFKRSTSLVSGKRVAN 771
BLAST of HG10020559 vs. ExPASy Swiss-Prot
Match:
Q9CA58 (Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis thaliana OX=3702 GN=At1g74580 PE=3 SV=1)
HSP 1 Score: 984.2 bits (2543), Expect = 8.6e-286
Identity = 461/756 (60.98%), Positives = 594/756 (78.57%), Query Frame = 0
Query: 1 MNRALQPKHVAAVIRYQNDPLKALQMFNQVKTEDGFKHTLATYKCMIEKLGLHGQFEAME 60
M L PKHV AVI+ Q DP+KAL+MFN ++ E GFKHTL+TY+ +IEKLG +G+FEAME
Sbjct: 1 MGPPLLPKHVTAVIKCQKDPMKALEMFNSMRKEVGFKHTLSTYRSVIEKLGYYGKFEAME 60
Query: 61 DVLADMRKNVDNKMLEGVYIRIMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYNVIMNI 120
+VL DMR+NV N MLEGVY+ M++YGRKGKVQEAVNVFERMDFYDCEP+V SYN IM++
Sbjct: 61 EVLVDMRENVGNHMLEGVYVGAMKNYGRKGKVQEAVNVFERMDFYDCEPTVFSYNAIMSV 120
Query: 121 LVEYGYFNQAHKVYMRMKYIGIYPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN 180
LV+ GYF+QAHKVYMRM+ GI PDVY+ TIR+KSFC+T RP AALRLLNNM QGCE N
Sbjct: 121 LVDSGYFDQAHKVYMRMRDRGITPDVYSFTIRMKSFCKTSRPHAALRLLNNMSSQGCEMN 180
Query: 181 AVSYCTVIGGFYEENCQIEAYHLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFN 240
V+YCTV+GGFYEEN + E Y LF +ML G+ + TFNKL+ VLCKKG+V+E EKL +
Sbjct: 181 VVAYCTVVGGFYEENFKAEGYELFGKMLASGVSLCLSTFNKLLRVLCKKGDVKECEKLLD 240
Query: 241 KVLKRGVCPNLFTFNIFIQGLCRKGRIDEAARLLESILSEGLTPDVVSYNTLICGFCKHS 300
KV+KRGV PNLFT+N+FIQGLC++G +D A R++ ++ +G PDV++YN LI G CK+S
Sbjct: 241 KVIKRGVLPNLFTYNLFIQGLCQRGELDGAVRMVGCLIEQGPKPDVITYNNLIYGLCKNS 300
Query: 301 KLVEAECYLRKMVNSGFEPNEFTYNTIINGFCKMGMMQNADKILCDAMFKGFMPDEFTYS 360
K EAE YL KMVN G EP+ +TYNT+I G+CK GM+Q A++I+ DA+F GF+PD+FTY
Sbjct: 301 KFQEAEVYLGKMVNEGLEPDSYTYNTLIAGYCKGGMVQLAERIVGDAVFNGFVPDQFTYR 360
Query: 361 SLINGLCDDGDMNQAMAVFNEAMEKGFKHSIILYNTVVKGFSKQGLVLQALQLMKDMMGH 420
SLI+GLC +G+ N+A+A+FNEA+ KG K ++ILYNT++KG S QG++L+A QL +M
Sbjct: 361 SLIDGLCHEGETNRALALFNEALGKGIKPNVILYNTLIKGLSNQGMILEAAQLANEMSEK 420
Query: 421 GCSPDIWTYNLVVNGLCKMGCLSDASELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKA 480
G P++ T+N++VNGLCKMGC+SDA L+ I+KG PDIFTFN LI GY QL ++ A
Sbjct: 421 GLIPEVQTFNILVNGLCKMGCVSDADGLVKVMISKGYFPDIFTFNILIHGYSTQLKMENA 480
Query: 481 IEILDTMLSHGITPDVITYNTLLNGLCKARKLDNVVETFKAMLEKGCTPNIITYNILIES 540
+EILD ML +G+ PDV TYN+LLNGLCK K ++V+ET+K M+EKGC PN+ T+NIL+ES
Sbjct: 481 LEILDVMLDNGVDPDVYTYNSLLNGLCKTSKFEDVMETYKTMVEKGCAPNLFTFNILLES 540
Query: 541 FCKDRKVSEAMDLFEEMKTRGLTPDIVTLCTLICGLCSNGELDKAYQLFLTLEKEYKFSY 600
C+ RK+ EA+ L EEMK + + PD VT TLI G C NG+LD AY LF +E+ YK S
Sbjct: 541 LCRYRKLDEALGLLEEMKNKSVNPDAVTFGTLIDGFCKNGDLDGAYTLFRKMEEAYKVSS 600
Query: 601 STAIFNIMINAFCEKLNINMAEKLFHKMGGCDCAPDNYTYRVMIDSYCKTGNIDPAHTFL 660
ST +NI+I+AF EKLN+ MAEKLF +M PD YTYR+M+D +CKTGN++ + FL
Sbjct: 601 STPTYNIIIHAFTEKLNVTMAEKLFQEMVDRCLGPDGYTYRLMVDGFCKTGNVNLGYKFL 660
Query: 661 LEKINKGFVPSFTTCGRVLNCLCVKHRLSEAVDIINLMVQNGIVPEEVNSIFEADKKEIA 720
LE + GF+PS TT GRV+NCLCV+ R+ EA II+ MVQ G+VPE VN+I + DKKE+A
Sbjct: 661 LEMMENGFIPSLTTLGRVINCLCVEDRVYEAAGIIHRMVQKGLVPEAVNTICDVDKKEVA 720
Query: 721 APKIVVEYLMKKSHITYYSYELLYDGIRDRKLEKKK 757
APK+V+E L+KKS ITYY+YELL+DG+RD++L KKK
Sbjct: 721 APKLVLEDLLKKSCITYYAYELLFDGLRDKRLRKKK 756
BLAST of HG10020559 vs. ExPASy Swiss-Prot
Match:
Q9FMF6 (Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g64320 PE=2 SV=1)
HSP 1 Score: 377.1 bits (967), Expect = 4.8e-103
Identity = 210/634 (33.12%), Positives = 354/634 (55.84%), Query Frame = 0
Query: 23 ALQMFNQVKTEDGFKHTLATYKCMIEKLGLHGQFEAMEDVLADMRKNVDNKMLEGVYIRI 82
++++F+ +++G++H+ Y+ +I KLG +G+F+ ++ +L M K+ E ++I I
Sbjct: 94 SMELFSWTGSQNGYRHSFDVYQVLIGKLGANGEFKTIDRLLIQM-KDEGIVFKESLFISI 153
Query: 83 MRDYGRKGKVQEAVN-VFERMDFYDCEPSVQSYNVIMNILVEYGYFNQAHKVYMRMKYIG 142
MRDY + G + + E + Y CEP+ +SYNV++ ILV A V+ M
Sbjct: 154 MRDYDKAGFPGQTTRLMLEMRNVYSCEPTFKSYNVVLEILVSGNCHKVAANVFYDMLSRK 213
Query: 143 IYPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFNAVSYCTVIGGFYEENCQIEAY 202
I P ++T + +K+FC +AL LL +M GC N+V Y T+I + N EA
Sbjct: 214 IPPTLFTFGVVMKAFCAVNEIDSALSLLRDMTKHGCVPNSVIYQTLIHSLSKCNRVNEAL 273
Query: 203 HLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFNKVLKRGVCPNLFTFNIFIQGL 262
L +EM G PD TFN +I LCK + E+ K+ N++L RG P+ T+ + GL
Sbjct: 274 QLLEEMFLMGCVPDAETFNDVILGLCKFDRINEAAKMVNRMLIRGFAPDDITYGYLMNGL 333
Query: 263 CRKGRIDEAARLLESILSEGLTPDVVSYNTLICGFCKHSKLVEAECYLRKMVNS-GFEPN 322
C+ GR+D A L I P++V +NTLI GF H +L +A+ L MV S G P+
Sbjct: 334 CKIGRVDAAKDLFYRIPK----PEIVIFNTLIHGFVTHGRLDDAKAVLSDMVTSYGIVPD 393
Query: 323 EFTYNTIINGFCKMGMMQNADKILCDAMFKGFMPDEFTYSSLINGLCDDGDMNQAMAVFN 382
TYN++I G+ K G++ A ++L D KG P+ ++Y+ L++G C G +++A V N
Sbjct: 394 VCTYNSLIYGYWKEGLVGLALEVLHDMRNKGCKPNVYSYTILVDGFCKLGKIDEAYNVLN 453
Query: 383 EAMEKGFKHSIILYNTVVKGFSKQGLVLQALQLMKDMMGHGCSPDIWTYNLVVNGLCKMG 442
E G K + + +N ++ F K+ + +A+++ ++M GC PD++T+N +++GLC++
Sbjct: 454 EMSADGLKPNTVGFNCLISAFCKEHRIPEAVEIFREMPRKGCKPDVYTFNSLISGLCEVD 513
Query: 443 CLSDASELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKAIEILDTMLSHGITPDVITYN 502
+ A LL D I++G + + T+NTLI+ + ++ + +A ++++ M+ G D ITYN
Sbjct: 514 EIKHALWLLRDMISEGVVANTVTYNTLINAFLRRGEIKEARKLVNEMVFQGSPLDEITYN 573
Query: 503 TLLNGLCKARKLDNVVETFKAMLEKGCTPNIITYNILIESFCKDRKVSEAMDLFEEMKTR 562
+L+ GLC+A ++D F+ ML G P+ I+ NILI C+ V EA++ +EM R
Sbjct: 574 SLIKGLCRAGEVDKARSLFEKMLRDGHAPSNISCNILINGLCRSGMVEEAVEFQKEMVLR 633
Query: 563 GLTPDIVTLCTLICGLCSNGELDKAYQLFLTLEKEYKFSYSTAIFNIMINAFCEKLNINM 622
G TPDIVT +LI GLC G ++ +F L+ E T FN +++ C+ +
Sbjct: 634 GSTPDIVTFNSLINGLCRAGRIEDGLTMFRKLQAE-GIPPDTVTFNTLMSWLCKGGFVYD 693
Query: 623 AEKLFHKMGGCDCAPDNYTYRVMIDSYCKTGNID 655
A L + P++ T+ +++ S +D
Sbjct: 694 ACLLLDEGIEDGFVPNHRTWSILLQSIIPQETLD 721
BLAST of HG10020559 vs. ExPASy Swiss-Prot
Match:
Q9LFF1 (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MEE40 PE=2 SV=1)
HSP 1 Score: 363.6 bits (932), Expect = 5.5e-99
Identity = 211/698 (30.23%), Positives = 353/698 (50.57%), Query Frame = 0
Query: 14 IRYQNDPLKALQMFNQVKTEDGFKHTLATYKCMIEKLGLHGQFEAMEDVLADMRKNVDNK 73
+R Q D AL++FN + F A Y+ ++ +LG G F+ M+ +L DM K+ +
Sbjct: 57 LRSQPDDSAALRLFNLASKKPNFSPEPALYEEILLRLGRSGSFDDMKKILEDM-KSSRCE 116
Query: 74 MLEGVYIRIMRDYGRKGKVQEAVNVFERM-DFYDCEPSVQSYNVIMNILVEYGYFNQAHK 133
M ++ ++ Y + E ++V + M D + +P YN ++N+LV+
Sbjct: 117 MGTSTFLILIESYAQFELQDEILSVVDWMIDEFGLKPDTHFYNRMLNLLVDGNSLKLVEI 176
Query: 134 VYMRMKYIGIYPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFNAVSYCTVIGGFY 193
+ +M GI PDV T + IK+ CR + A+ +L +MP G + ++ TV+ G+
Sbjct: 177 SHAKMSVWGIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYI 236
Query: 194 EENCQIEAYHLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFNKVLKR-GVCPNL 253
EE A + ++M++ G ++ N ++H CK+G V+++ ++ + G P+
Sbjct: 237 EEGDLDGALRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQ 296
Query: 254 FTFNIFIQGLCRKGRIDEAARLLESILSEGLTPDVVSYNTLICGFCKHSKLVEAECYLRK 313
+TFN + GLC+ G + A +++ +L EG PDV +YN++I G CK ++ EA L +
Sbjct: 297 YTFNTLVNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQ 356
Query: 314 MVNSGFEPNEFTYNTIINGFCKMGMMQNADKILCDAMFKGFMPDEFTYSSLINGLCDDGD 373
M+ PN TYNT+I+ CK ++ A ++ KG +PD T++SLI GLC +
Sbjct: 357 MITRDCSPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRN 416
Query: 374 MNQAMAVFNEAMEKGFKHSIILYNTVVKGFSKQGLVLQALQLMKDMMGHGCSPDIWTYNL 433
AM +F E M GC PD +TYN+
Sbjct: 417 HRVAMELFEE-----------------------------------MRSKGCEPDEFTYNM 476
Query: 434 VVNGLCKMGCLSDASELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKAIEILDTMLSHG 493
+++ LC G L +A +L GC + T+NTLIDG+CK +A EI D M HG
Sbjct: 477 LIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKANKTREAEEIFDEMEVHG 536
Query: 494 ITPDVITYNTLLNGLCKARKLDNVVETFKAMLEKGCTPNIITYNILIESFCKDRKVSEAM 553
++ + +TYNTL++GLCK+R++++ + M+ +G P+ TYN L+ FC+ + +A
Sbjct: 537 VSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYNSLLTHFCRGGDIKKAA 596
Query: 554 DLFEEMKTRGLTPDIVTLCTLICGLCSNGELDKAYQLFLTLEKEYKFSYSTAIFNIMINA 613
D+ + M + G PDIVT TLI GLC G ++ A +L +++ + + + +N +I
Sbjct: 597 DIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSIQMK-GINLTPHAYNPVIQG 656
Query: 614 FCEKLNINMAEKLFHKM-GGCDCAPDNYTYRVMIDSYCKTGN-IDPAHTFLLEKINKGFV 673
K A LF +M + PD +YR++ C G I A FL+E + KGFV
Sbjct: 657 LFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFRGLCNGGGPIREAVDFLVELLEKGFV 716
Query: 674 PSFTTCGRVLNCLCVKHRLSEAVDIINLMVQNGIVPEE 708
P F++ + L V ++N+++Q EE
Sbjct: 717 PEFSSLYMLAEGLLTLSMEETLVKLVNMVMQKARFSEE 717
BLAST of HG10020559 vs. ExPASy Swiss-Prot
Match:
Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)
HSP 1 Score: 339.0 bits (868), Expect = 1.4e-91
Identity = 177/558 (31.72%), Positives = 299/558 (53.58%), Query Frame = 0
Query: 153 IKSFCRTGRPSAALRLLNNMPGQGCEFNAVSYCTVIGGFYEENCQIE-AYHLFDEMLKQG 212
+KS+ R AL +++ G +SY V+ I A ++F EML+
Sbjct: 141 VKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQ 200
Query: 213 ICPDILTFNKLIHVLCKKGNVQESEKLFNKVLKRGVCPNLFTFNIFIQGLCRKGRIDEAA 272
+ P++ T+N LI C GN+ + LF+K+ +G PN+ T+N I G C+ +ID+
Sbjct: 201 VSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGF 260
Query: 273 RLLESILSEGLTPDVVSYNTLICGFCKHSKLVEAECYLRKMVNSGFEPNEFTYNTIINGF 332
+LL S+ +GL P+++SYN +I G C+ ++ E L +M G+ +E TYNT+I G+
Sbjct: 261 KLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGY 320
Query: 333 CKMGMMQNADKILCDAMFKGFMPDEFTYSSLINGLCDDGDMNQAMAVFNEAMEKGFKHSI 392
CK G A + + + G P TY+SLI+ +C G+MN+AM ++ +G +
Sbjct: 321 CKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNE 380
Query: 393 ILYNTVVKGFSKQGLVLQALQLMKDMMGHGCSPDIWTYNLVVNGLCKMGCLSDASELLND 452
Y T+V GFS++G + +A +++++M +G SP + TYN ++NG C G + DA +L D
Sbjct: 381 RTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLED 440
Query: 453 AIAKGCIPDIFTFNTLIDGYCKQLNLDKAIEILDTMLSHGITPDVITYNTLLNGLCKARK 512
KG PD+ +++T++ G+C+ ++D+A+ + M+ GI PD ITY++L+ G C+ R+
Sbjct: 441 MKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRR 500
Query: 513 LDNVVETFKAMLEKGCTPNIITYNILIESFCKDRKVSEAMDLFEEMKTRGLTPDIVTLCT 572
+ ++ ML G P+ TY LI ++C + + +A+ L EM +G+ PD+VT
Sbjct: 501 TKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSV 560
Query: 573 LICGLCSNGELDKAYQLFLTLEKEYK----FSYSTAIFNI----------MINAFCEKLN 632
LI GL +A +L L L E +Y T I N +I FC K
Sbjct: 561 LINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTLIENCSNIEFKSVVSLIKGFCMKGM 620
Query: 633 INMAEKLFHKMGGCDCAPDNYTYRVMIDSYCKTGNIDPAHTFLLEKINKGFVPSFTTCGR 692
+ A+++F M G + PD Y +MI +C+ G+I A+T E + GF+ T
Sbjct: 621 MTEADQVFESMLGKNHKPDGTAYNIMIHGHCRAGDIRKAYTLYKEMVKSGFLLHTVTVIA 680
Query: 693 VLNCLCVKHRLSEAVDII 696
++ L + +++E +I
Sbjct: 681 LVKALHKEGKVNELNSVI 698
BLAST of HG10020559 vs. ExPASy Swiss-Prot
Match:
Q9LPX2 (Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g12775 PE=2 SV=1)
HSP 1 Score: 331.3 bits (848), Expect = 3.0e-89
Identity = 168/509 (33.01%), Positives = 283/509 (55.60%), Query Frame = 0
Query: 203 LFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFNKVLKRGVCPNLFTFNIFIQGLC 262
L +M +GI I T + +I+ C+ + + K++K G P+ FN + GLC
Sbjct: 110 LCKQMESKGIAHSIYTLSIMINCFCRCRKLSYAFSTMGKIMKLGYEPDTVIFNTLLNGLC 169
Query: 263 RKGRIDEAARLLESILSEGLTPDVVSYNTLICGFCKHSKLVEAECYLRKMVNSGFEPNEF 322
+ R+ EA L++ ++ G P +++ NTL+ G C + K+ +A + +MV +GF+PNE
Sbjct: 170 LECRVSEALELVDRMVEMGHKPTLITLNTLVNGLCLNGKVSDAVVLIDRMVETGFQPNEV 229
Query: 323 TYNTIINGFCKMGMMQNADKILCDAMFKGFMPDEFTYSSLINGLCDDGDMNQAMAVFNEA 382
TY ++N CK G A ++L + D YS +I+GLC DG ++ A +FNE
Sbjct: 230 TYGPVLNVMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDGLCKDGSLDNAFNLFNEM 289
Query: 383 MEKGFKHSIILYNTVVKGFSKQGLVLQALQLMKDMMGHGCSPDIWTYNLVVNGLCKMGCL 442
KGFK II YNT++ GF G +L++DM+ SP++ T++++++ K G L
Sbjct: 290 EIKGFKADIITYNTLIGGFCNAGRWDDGAKLLRDMIKRKISPNVVTFSVLIDSFVKEGKL 349
Query: 443 SDASELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKAIEILDTMLSHGITPDVITYNTL 502
+A +LL + + +G P+ T+N+LIDG+CK+ L++AI+++D M+S G PD++T+N L
Sbjct: 350 READQLLKEMMQRGIAPNTITYNSLIDGFCKENRLEEAIQMVDLMISKGCDPDIMTFNIL 409
Query: 503 LNGLCKARKLDNVVETFKAMLEKGCTPNIITYNILIESFCKDRKVSEAMDLFEEMKTRGL 562
+NG CKA ++D+ +E F+ M +G N +TYN L++ FC+ K+ A LF+EM +R +
Sbjct: 410 INGYCKANRIDDGLELFREMSLRGVIANTVTYNTLVQGFCQSGKLEVAKKLFQEMVSRRV 469
Query: 563 TPDIVTLCTLICGLCSNGELDKAYQLFLTLEKEYKFSYSTAIFNIMINAFCEKLNINMAE 622
PDIV+ L+ GLC NGEL+KA ++F +EK K I+ I+I+ C ++ A
Sbjct: 470 RPDIVSYKILLDGLCDNGELEKALEIFGKIEKS-KMELDIGIYMIIIHGMCNASKVDDAW 529
Query: 623 KLFHKMGGCDCAPDNYTYRVMIDSYCKTGNIDPAHTFLLEKINKGFVPSFTTCGRVLNCL 682
LF + D Y +MI C+ ++ A + +G P T ++
Sbjct: 530 DLFCSLPLKGVKLDARAYNIMISELCRKDSLSKADILFRKMTEEGHAPDELTYNILIRAH 589
Query: 683 CVKHRLSEAVDIINLMVQNGIVPEEVNSI 712
+ A ++I M +G P +V+++
Sbjct: 590 LGDDDATTAAELIEEMKSSGF-PADVSTV 616
BLAST of HG10020559 vs. ExPASy TrEMBL
Match:
A0A6J1E7P0 (uncharacterized protein LOC111431396 OS=Cucurbita moschata OX=3662 GN=LOC111431396 PE=3 SV=1)
HSP 1 Score: 1482.6 bits (3837), Expect = 0.0e+00
Identity = 716/778 (92.03%), Positives = 748/778 (96.14%), Query Frame = 0
Query: 1 MNRALQPKHVAAVIRYQNDPLKALQMFNQVKTEDGFKHTLATYKCMIEKLGLHGQFEAME 60
MNRALQPKHVAAVIRYQNDPLKAL+MFNQVKTEDGFKHTLATYKCMIEKLGLHG+FEAME
Sbjct: 1 MNRALQPKHVAAVIRYQNDPLKALKMFNQVKTEDGFKHTLATYKCMIEKLGLHGEFEAME 60
Query: 61 DVLADMRKNVDNKMLEGVYIRIMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYNVIMNI 120
DVLA+MRKNVDNKMLEGVYI IMRDYGRKGK+QEAVNVFERMDFYDC PSVQSYNVIMNI
Sbjct: 61 DVLAEMRKNVDNKMLEGVYIGIMRDYGRKGKIQEAVNVFERMDFYDCVPSVQSYNVIMNI 120
Query: 121 LVEYGYFNQAHKVYMRMKYIGIYPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN 180
LV+YGYFNQAHKVYMRMK IGI PDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN
Sbjct: 121 LVDYGYFNQAHKVYMRMKDIGILPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN 180
Query: 181 AVSYCTVIGGFYEENCQIEAYHLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFN 240
AVSYC VIGG+YEENCQIEAYHLF EML+QGICPDILTFNKLIHVLCKKGNVQESEKLFN
Sbjct: 181 AVSYCAVIGGYYEENCQIEAYHLFHEMLQQGICPDILTFNKLIHVLCKKGNVQESEKLFN 240
Query: 241 KVLKRGVCPNLFTFNIFIQGLCRKGRIDEAARLLESILSEGLTPDVVSYNTLICGFCKHS 300
KVLKRGVCPNLFTFNIFIQGLCRKG IDEAARLLESI+SEGLTPDVVSYNTLICGFCKHS
Sbjct: 241 KVLKRGVCPNLFTFNIFIQGLCRKGAIDEAARLLESIISEGLTPDVVSYNTLICGFCKHS 300
Query: 301 KLVEAECYLRKMVNSGFEPNEFTYNTIINGFCKMGMMQNADKILCDAMFKGFMPDEFTYS 360
KLVEAECYLRKMVN+GFEPNEFTYNTII+GFCKMGMMQNADKIL DAMFKGF+PDEFTYS
Sbjct: 301 KLVEAECYLRKMVNNGFEPNEFTYNTIIDGFCKMGMMQNADKILRDAMFKGFVPDEFTYS 360
Query: 361 SLINGLCDDGDMNQAMAVFNEAMEKGFKHSIILYNTVVKGFSKQGLVLQALQLMKDMMGH 420
SLINGLCDDGDMN+AMAVF+EAMEKGFKHSI+LYNT+VKG S+QGLVLQALQLMKDM+ H
Sbjct: 361 SLINGLCDDGDMNRAMAVFSEAMEKGFKHSIVLYNTLVKGLSQQGLVLQALQLMKDMLEH 420
Query: 421 GCSPDIWTYNLVVNGLCKMGCLSDASELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKA 480
GCSPDIWTYNLVVN LCKMGCLSDASE LNDAIAKGCIPDIFTFNTLIDGYCK LNLDKA
Sbjct: 421 GCSPDIWTYNLVVNALCKMGCLSDASEFLNDAIAKGCIPDIFTFNTLIDGYCKHLNLDKA 480
Query: 481 IEILDTMLSHGITPDVITYNTLLNGLCKARKLDNVVETFKAMLEKGCTPNIITYNILIES 540
IE LDTMLSHGITPDVITYNTLLNGLCKA+KL+NVV+TFKAMLEKGC PNIITYNILIES
Sbjct: 481 IETLDTMLSHGITPDVITYNTLLNGLCKAKKLNNVVDTFKAMLEKGCIPNIITYNILIES 540
Query: 541 FCKDRKVSEAMDLFEEMKTRGLTPDIVTLCTLICGLCSNGELDKAYQLFLTLEKEYKFSY 600
FCK RKV EAMD FEEMKTRGLTPDIVTLCTLICGLCSNGEL+KAYQLF+T+EKEYKFSY
Sbjct: 541 FCKSRKVGEAMDWFEEMKTRGLTPDIVTLCTLICGLCSNGELEKAYQLFVTIEKEYKFSY 600
Query: 601 STAIFNIMINAFCEKLNINMAEKLFHKMGGCDCAPDNYTYRVMIDSYCKTGNIDPAHTFL 660
STAIFNIMINAFCEKLN++MAE+LFHKMGGC CAPD+YTYRVMID+YCKTGNIDPA TFL
Sbjct: 601 STAIFNIMINAFCEKLNVSMAERLFHKMGGCGCAPDSYTYRVMIDTYCKTGNIDPAQTFL 660
Query: 661 LEKINKGFVPSFTTCGRVLNCLCVKHRLSEAVDIINLMVQNGIVPEEVNSIFEADKKEIA 720
LEKINKG VPSFTTCGRVLNCLCVKHRLSEAV IINLMVQNGIVPEEVNSIFEADKKE+A
Sbjct: 661 LEKINKGLVPSFTTCGRVLNCLCVKHRLSEAVGIINLMVQNGIVPEEVNSIFEADKKEVA 720
Query: 721 APKIVVEYLMKKSHITYYSYELLYDGIRDRK------LEKKKFKRSPSLGSGKRHVNL 773
APKIVVE+LMKKSHITYYSYELLYDGIRDRK L+KKKFKRSPSLG GK H+NL
Sbjct: 721 APKIVVEHLMKKSHITYYSYELLYDGIRDRKLDKKKLLDKKKFKRSPSLGPGKGHLNL 778
BLAST of HG10020559 vs. ExPASy TrEMBL
Match:
A0A6J1KND0 (uncharacterized protein LOC111495738 OS=Cucurbita maxima OX=3661 GN=LOC111495738 PE=3 SV=1)
HSP 1 Score: 1464.1 bits (3789), Expect = 0.0e+00
Identity = 707/769 (91.94%), Positives = 738/769 (95.97%), Query Frame = 0
Query: 1 MNRALQPKHVAAVIRYQNDPLKALQMFNQVKTEDGFKHTLATYKCMIEKLGLHGQFEAME 60
MNRALQPKHVAAVIRYQNDPLKAL+MFNQVKTEDGFKHTLATYKCMIEKLGLHG+FEAME
Sbjct: 1 MNRALQPKHVAAVIRYQNDPLKALKMFNQVKTEDGFKHTLATYKCMIEKLGLHGEFEAME 60
Query: 61 DVLADMRKNVDNKMLEGVYIRIMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYNVIMNI 120
DVLA+MRKN+DNKMLEGVYI IMRDYGRKGK+QEAVNVFERMDFYDC PSVQSYNVIMNI
Sbjct: 61 DVLAEMRKNIDNKMLEGVYIGIMRDYGRKGKIQEAVNVFERMDFYDCVPSVQSYNVIMNI 120
Query: 121 LVEYGYFNQAHKVYMRMKYIGIYPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN 180
LV+YGYFNQAHKVYMRMK IGI PDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN
Sbjct: 121 LVDYGYFNQAHKVYMRMKDIGILPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN 180
Query: 181 AVSYCTVIGGFYEENCQIEAYHLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFN 240
AVSYC VIGG+YEENCQIEAYHLF EML+QGICPDILTFNKLIHVLCKKGNVQESEKLFN
Sbjct: 181 AVSYCAVIGGYYEENCQIEAYHLFHEMLQQGICPDILTFNKLIHVLCKKGNVQESEKLFN 240
Query: 241 KVLKRGVCPNLFTFNIFIQGLCRKGRIDEAARLLESILSEGLTPDVVSYNTLICGFCKHS 300
KVLKRGVCPNLFTFNIFIQGLCRKG IDEAARLLESI+SEGLTPDVVSYNTLICGFCKHS
Sbjct: 241 KVLKRGVCPNLFTFNIFIQGLCRKGAIDEAARLLESIISEGLTPDVVSYNTLICGFCKHS 300
Query: 301 KLVEAECYLRKMVNSGFEPNEFTYNTIINGFCKMGMMQNADKILCDAMFKGFMPDEFTYS 360
KLVEAECYLRKMVN+GFEPNEFTYNTII+GFCKMGMMQNADKIL DAMFKGF+PDEFTYS
Sbjct: 301 KLVEAECYLRKMVNNGFEPNEFTYNTIIDGFCKMGMMQNADKILHDAMFKGFVPDEFTYS 360
Query: 361 SLINGLCDDGDMNQAMAVFNEAMEKGFKHSIILYNTVVKGFSKQGLVLQALQLMKDMMGH 420
SLINGLCDDGDMN+AMAVFNEAMEKGFKHSIILYNT+VKG S+QGLVLQALQLMKDM+ H
Sbjct: 361 SLINGLCDDGDMNRAMAVFNEAMEKGFKHSIILYNTLVKGLSQQGLVLQALQLMKDMLEH 420
Query: 421 GCSPDIWTYNLVVNGLCKMGCLSDASELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKA 480
GC PDIWTYNLVVN LCKMGCLSDASE LNDAIAKGCIPDIFTFNTLIDGYCK LNLDKA
Sbjct: 421 GCGPDIWTYNLVVNALCKMGCLSDASEFLNDAIAKGCIPDIFTFNTLIDGYCKHLNLDKA 480
Query: 481 IEILDTMLSHGITPDVITYNTLLNGLCKARKLDNVVETFKAMLEKGCTPNIITYNILIES 540
IE LDTMLSHGITPDVITYNTLLNGLCKA+KL++VV+TFKAMLEKGC PNIITYNILIES
Sbjct: 481 IETLDTMLSHGITPDVITYNTLLNGLCKAKKLNSVVDTFKAMLEKGCIPNIITYNILIES 540
Query: 541 FCKDRKVSEAMDLFEEMKTRGLTPDIVTLCTLICGLCSNGELDKAYQLFLTLEKEYKFSY 600
FCK RKV EAMD FEEMKTRGLTPDIVTLCTLICGLCSNGEL+KAYQLF+T+EKEYKFSY
Sbjct: 541 FCKARKVGEAMDWFEEMKTRGLTPDIVTLCTLICGLCSNGELEKAYQLFVTIEKEYKFSY 600
Query: 601 STAIFNIMINAFCEKLNINMAEKLFHKMGGCDCAPDNYTYRVMIDSYCKTGNIDPAHTFL 660
STAIFNIMINAFCEKLN++MAE+LFHKMGGC CAPD+YTYRVMID+YCKTGNIDPA TFL
Sbjct: 601 STAIFNIMINAFCEKLNVSMAERLFHKMGGCGCAPDSYTYRVMIDTYCKTGNIDPAQTFL 660
Query: 661 LEKINKGFVPSFTTCGRVLNCLCVKHRLSEAVDIINLMVQNGIVPEEVNSIFEADKKEIA 720
LEKINKG VPSFT CGRVLNCLCVKHRL EAV IINLMVQNGIVPEEVNSIFEADKKE+A
Sbjct: 661 LEKINKGLVPSFTICGRVLNCLCVKHRLGEAVGIINLMVQNGIVPEEVNSIFEADKKEVA 720
Query: 721 APKIVVEYLMKKSHITYYSYELLYDGIRDRK------LEKKKFKRSPSL 764
APKIVVE+LMKKSHITYYSYELLYDGIRDRK L+KKKFKRSPSL
Sbjct: 721 APKIVVEHLMKKSHITYYSYELLYDGIRDRKLDKKKLLDKKKFKRSPSL 769
BLAST of HG10020559 vs. ExPASy TrEMBL
Match:
A0A1S3B3U5 (putative pentatricopeptide repeat-containing protein At1g74580 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103485663 PE=4 SV=1)
HSP 1 Score: 1438.3 bits (3722), Expect = 0.0e+00
Identity = 697/763 (91.35%), Positives = 733/763 (96.07%), Query Frame = 0
Query: 6 QPKHVAAVIRYQNDPLKALQMFNQVKTEDGFKHTLATYKCMIEKLGLHGQFEAMEDVLAD 65
+PKHVAAVIRYQNDPLKAL+ FNQVKTEDGFKHTL TYKCMIEKLGLHGQFEAMEDVLA+
Sbjct: 12 EPKHVAAVIRYQNDPLKALKTFNQVKTEDGFKHTLETYKCMIEKLGLHGQFEAMEDVLAE 71
Query: 66 MRKNVDNKMLEGVYIRIMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYNVIMNILVEYG 125
+RKNVDNKMLEGVYI IMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYN IMNILVEYG
Sbjct: 72 LRKNVDNKMLEGVYIGIMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYNAIMNILVEYG 131
Query: 126 YFNQAHKVYMRMKYIGIYPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFNAVSYC 185
YF+QAHKVYMRMK IGIYPDVYT+TIR+KSFCRTGRPSAALRLLNNMPGQGCEFNAVSYC
Sbjct: 132 YFSQAHKVYMRMKDIGIYPDVYTYTIRMKSFCRTGRPSAALRLLNNMPGQGCEFNAVSYC 191
Query: 186 TVIGGFYEENCQIEAYHLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFNKVLKR 245
VI GFYEENCQIEAYHLF+EMLKQGICPDILTFNKLIHVLCKKGNVQESEKLF+KV+KR
Sbjct: 192 AVISGFYEENCQIEAYHLFNEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFSKVMKR 251
Query: 246 GVCPNLFTFNIFIQGLCRKGRIDEAARLLESILSEGLTPDVVSYNTLICGFCKHSKLVEA 305
GVCPNLFTFNIF+QGLCRKG IDEAARLLESI+SEGLTPDV+SYNTLICGFCKHSKLVEA
Sbjct: 252 GVCPNLFTFNIFMQGLCRKGAIDEAARLLESIVSEGLTPDVISYNTLICGFCKHSKLVEA 311
Query: 306 ECYLRKMVNSGFEPNEFTYNTIINGFCKMGMMQNADKILCDAMFKGFMPDEFTYSSLING 365
EC LRKMVN+G EPNEFTYNTIINGFCK GMMQNADKIL DAMFKGF+PDEFTYS+LING
Sbjct: 312 ECCLRKMVNNGVEPNEFTYNTIINGFCKAGMMQNADKILRDAMFKGFIPDEFTYSALING 371
Query: 366 LCDDGDMNQAMAVFNEAMEKGFKHSIILYNTVVKGFSKQGLVLQALQLMKDMMGHGCSPD 425
LC+DGDM++AMAVF EAMEKGFKHSIILYNT+VKG SKQGLVLQALQLMKDMM HGCSPD
Sbjct: 372 LCNDGDMSRAMAVFYEAMEKGFKHSIILYNTLVKGLSKQGLVLQALQLMKDMMEHGCSPD 431
Query: 426 IWTYNLVVNGLCKMGCLSDASELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKAIEILD 485
IWTYNLVVNGLCKMGCLSDA+ +LNDAIAKGCI DIFTFNTLIDGYCKQ NLDKAIEILD
Sbjct: 432 IWTYNLVVNGLCKMGCLSDANGILNDAIAKGCILDIFTFNTLIDGYCKQRNLDKAIEILD 491
Query: 486 TMLSHGITPDVITYNTLLNGLCKARKLDNVVETFKAMLEKGCTPNIITYNILIESFCKDR 545
TMLSHGITPDVITYNT+LNGLCKARKLDNVV+TF+AMLEKGCTPNIITYNILIESFCKDR
Sbjct: 492 TMLSHGITPDVITYNTILNGLCKARKLDNVVDTFRAMLEKGCTPNIITYNILIESFCKDR 551
Query: 546 KVSEAMDLFEEMKTRGLTPDIVTLCTLICGLCSNGELDKAYQLFLTLEKEYKFSYSTAIF 605
KVSEAM+LFEEMKTRGLTPDIVTLCTL CGLCSNG+LDKAY+LF+TLEKEYKFSYSTAIF
Sbjct: 552 KVSEAMELFEEMKTRGLTPDIVTLCTLTCGLCSNGQLDKAYELFVTLEKEYKFSYSTAIF 611
Query: 606 NIMINAFCEKLNINMAEKLFHKMGGCDCAPDNYTYRVMIDSYCKTGNIDPAHTFLLEKIN 665
NIMINAF EKLN++M EKLFHKMGG DCAPDNYTYRVMIDSYCKTGNID AHTFLLEKI+
Sbjct: 612 NIMINAFSEKLNVSMVEKLFHKMGGSDCAPDNYTYRVMIDSYCKTGNIDLAHTFLLEKIS 671
Query: 666 KGFVPSFTTCGRVLNCLCVKHRLSEAVDIINLMVQNGIVPEEVNSIFEADKKEIAAPKIV 725
KG VPSFTTCG+VLNCLCVKHRL+EAVDIINLMVQNGIVPEEVNSIFEADKKE+AAPKIV
Sbjct: 672 KGLVPSFTTCGKVLNCLCVKHRLNEAVDIINLMVQNGIVPEEVNSIFEADKKEVAAPKIV 731
Query: 726 VEYLMKKSHITYYSYELLYDGIRDRKLEKKKFKRSPSLGSGKR 769
VEYL+KKSHITYYSYELLYDGIR RKL KKFKRS SL S KR
Sbjct: 732 VEYLLKKSHITYYSYELLYDGIRGRKL-NKKFKRSTSLVSRKR 773
BLAST of HG10020559 vs. ExPASy TrEMBL
Match:
A0A1S4DTL9 (putative pentatricopeptide repeat-containing protein At1g74580 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103485663 PE=4 SV=1)
HSP 1 Score: 1437.9 bits (3721), Expect = 0.0e+00
Identity = 697/762 (91.47%), Positives = 732/762 (96.06%), Query Frame = 0
Query: 7 PKHVAAVIRYQNDPLKALQMFNQVKTEDGFKHTLATYKCMIEKLGLHGQFEAMEDVLADM 66
PKHVAAVIRYQNDPLKAL+ FNQVKTEDGFKHTL TYKCMIEKLGLHGQFEAMEDVLA++
Sbjct: 2 PKHVAAVIRYQNDPLKALKTFNQVKTEDGFKHTLETYKCMIEKLGLHGQFEAMEDVLAEL 61
Query: 67 RKNVDNKMLEGVYIRIMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYNVIMNILVEYGY 126
RKNVDNKMLEGVYI IMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYN IMNILVEYGY
Sbjct: 62 RKNVDNKMLEGVYIGIMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYNAIMNILVEYGY 121
Query: 127 FNQAHKVYMRMKYIGIYPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFNAVSYCT 186
F+QAHKVYMRMK IGIYPDVYT+TIR+KSFCRTGRPSAALRLLNNMPGQGCEFNAVSYC
Sbjct: 122 FSQAHKVYMRMKDIGIYPDVYTYTIRMKSFCRTGRPSAALRLLNNMPGQGCEFNAVSYCA 181
Query: 187 VIGGFYEENCQIEAYHLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFNKVLKRG 246
VI GFYEENCQIEAYHLF+EMLKQGICPDILTFNKLIHVLCKKGNVQESEKLF+KV+KRG
Sbjct: 182 VISGFYEENCQIEAYHLFNEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFSKVMKRG 241
Query: 247 VCPNLFTFNIFIQGLCRKGRIDEAARLLESILSEGLTPDVVSYNTLICGFCKHSKLVEAE 306
VCPNLFTFNIF+QGLCRKG IDEAARLLESI+SEGLTPDV+SYNTLICGFCKHSKLVEAE
Sbjct: 242 VCPNLFTFNIFMQGLCRKGAIDEAARLLESIVSEGLTPDVISYNTLICGFCKHSKLVEAE 301
Query: 307 CYLRKMVNSGFEPNEFTYNTIINGFCKMGMMQNADKILCDAMFKGFMPDEFTYSSLINGL 366
C LRKMVN+G EPNEFTYNTIINGFCK GMMQNADKIL DAMFKGF+PDEFTYS+LINGL
Sbjct: 302 CCLRKMVNNGVEPNEFTYNTIINGFCKAGMMQNADKILRDAMFKGFIPDEFTYSALINGL 361
Query: 367 CDDGDMNQAMAVFNEAMEKGFKHSIILYNTVVKGFSKQGLVLQALQLMKDMMGHGCSPDI 426
C+DGDM++AMAVF EAMEKGFKHSIILYNT+VKG SKQGLVLQALQLMKDMM HGCSPDI
Sbjct: 362 CNDGDMSRAMAVFYEAMEKGFKHSIILYNTLVKGLSKQGLVLQALQLMKDMMEHGCSPDI 421
Query: 427 WTYNLVVNGLCKMGCLSDASELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKAIEILDT 486
WTYNLVVNGLCKMGCLSDA+ +LNDAIAKGCI DIFTFNTLIDGYCKQ NLDKAIEILDT
Sbjct: 422 WTYNLVVNGLCKMGCLSDANGILNDAIAKGCILDIFTFNTLIDGYCKQRNLDKAIEILDT 481
Query: 487 MLSHGITPDVITYNTLLNGLCKARKLDNVVETFKAMLEKGCTPNIITYNILIESFCKDRK 546
MLSHGITPDVITYNT+LNGLCKARKLDNVV+TF+AMLEKGCTPNIITYNILIESFCKDRK
Sbjct: 482 MLSHGITPDVITYNTILNGLCKARKLDNVVDTFRAMLEKGCTPNIITYNILIESFCKDRK 541
Query: 547 VSEAMDLFEEMKTRGLTPDIVTLCTLICGLCSNGELDKAYQLFLTLEKEYKFSYSTAIFN 606
VSEAM+LFEEMKTRGLTPDIVTLCTL CGLCSNG+LDKAY+LF+TLEKEYKFSYSTAIFN
Sbjct: 542 VSEAMELFEEMKTRGLTPDIVTLCTLTCGLCSNGQLDKAYELFVTLEKEYKFSYSTAIFN 601
Query: 607 IMINAFCEKLNINMAEKLFHKMGGCDCAPDNYTYRVMIDSYCKTGNIDPAHTFLLEKINK 666
IMINAF EKLN++M EKLFHKMGG DCAPDNYTYRVMIDSYCKTGNID AHTFLLEKI+K
Sbjct: 602 IMINAFSEKLNVSMVEKLFHKMGGSDCAPDNYTYRVMIDSYCKTGNIDLAHTFLLEKISK 661
Query: 667 GFVPSFTTCGRVLNCLCVKHRLSEAVDIINLMVQNGIVPEEVNSIFEADKKEIAAPKIVV 726
G VPSFTTCG+VLNCLCVKHRL+EAVDIINLMVQNGIVPEEVNSIFEADKKE+AAPKIVV
Sbjct: 662 GLVPSFTTCGKVLNCLCVKHRLNEAVDIINLMVQNGIVPEEVNSIFEADKKEVAAPKIVV 721
Query: 727 EYLMKKSHITYYSYELLYDGIRDRKLEKKKFKRSPSLGSGKR 769
EYL+KKSHITYYSYELLYDGIR RKL KKFKRS SL S KR
Sbjct: 722 EYLLKKSHITYYSYELLYDGIRGRKL-NKKFKRSTSLVSRKR 762
BLAST of HG10020559 vs. ExPASy TrEMBL
Match:
A0A5A7SZQ7 (Putative pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold112G00720 PE=4 SV=1)
HSP 1 Score: 1434.1 bits (3711), Expect = 0.0e+00
Identity = 691/749 (92.26%), Positives = 725/749 (96.80%), Query Frame = 0
Query: 1 MNRALQPKHVAAVIRYQNDPLKALQMFNQVKTEDGFKHTLATYKCMIEKLGLHGQFEAME 60
M RALQPKHVAAVIRYQNDPLKAL+ FNQVKTEDGFKHTL TYKCMIEKLGLHGQFEAME
Sbjct: 1 MIRALQPKHVAAVIRYQNDPLKALKTFNQVKTEDGFKHTLETYKCMIEKLGLHGQFEAME 60
Query: 61 DVLADMRKNVDNKMLEGVYIRIMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYNVIMNI 120
DVLA++RKNVDNKMLEGVYI IMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYN IMNI
Sbjct: 61 DVLAELRKNVDNKMLEGVYIGIMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYNAIMNI 120
Query: 121 LVEYGYFNQAHKVYMRMKYIGIYPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN 180
LVEYGYF+QAHKVYMRMK IGIYPDVYT+TIR+KSFCRTGRPSAALRLLNNMPGQGCEFN
Sbjct: 121 LVEYGYFSQAHKVYMRMKDIGIYPDVYTYTIRMKSFCRTGRPSAALRLLNNMPGQGCEFN 180
Query: 181 AVSYCTVIGGFYEENCQIEAYHLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFN 240
AVSYC VI GFYEENCQIEAYHLF+EMLKQGICPDILTFNKLIHVLCKKGNVQESEKLF+
Sbjct: 181 AVSYCAVISGFYEENCQIEAYHLFNEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFS 240
Query: 241 KVLKRGVCPNLFTFNIFIQGLCRKGRIDEAARLLESILSEGLTPDVVSYNTLICGFCKHS 300
KV+KRGVCPNLFTFNIFIQGLCRKG IDEAARLLESI+SEGLTPDV+SYNTLICGFCKHS
Sbjct: 241 KVMKRGVCPNLFTFNIFIQGLCRKGAIDEAARLLESIVSEGLTPDVISYNTLICGFCKHS 300
Query: 301 KLVEAECYLRKMVNSGFEPNEFTYNTIINGFCKMGMMQNADKILCDAMFKGFMPDEFTYS 360
KLVEAEC LRKMVN+G EPNEFTYNTIINGFCK GMMQNADKIL DAMFKGF+PDEFTYS
Sbjct: 301 KLVEAECCLRKMVNNGVEPNEFTYNTIINGFCKAGMMQNADKILRDAMFKGFIPDEFTYS 360
Query: 361 SLINGLCDDGDMNQAMAVFNEAMEKGFKHSIILYNTVVKGFSKQGLVLQALQLMKDMMGH 420
+LINGLC+DGDM++AMAVF EAMEKGFKHSIILYNT+VKG SKQGLVLQALQLMKDMM H
Sbjct: 361 ALINGLCNDGDMSRAMAVFYEAMEKGFKHSIILYNTLVKGLSKQGLVLQALQLMKDMMEH 420
Query: 421 GCSPDIWTYNLVVNGLCKMGCLSDASELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKA 480
GCSPDIWTYNLVVNGLCKMGCLSDA+ +LNDAIAKGCI DIFTFNTLIDGYCKQ NLDKA
Sbjct: 421 GCSPDIWTYNLVVNGLCKMGCLSDANGILNDAIAKGCILDIFTFNTLIDGYCKQRNLDKA 480
Query: 481 IEILDTMLSHGITPDVITYNTLLNGLCKARKLDNVVETFKAMLEKGCTPNIITYNILIES 540
IEILDTMLSHGITPDVITYNT+LNGLCKARKLDNVV+TF+AMLEKGCTPNIITYNILIES
Sbjct: 481 IEILDTMLSHGITPDVITYNTILNGLCKARKLDNVVDTFRAMLEKGCTPNIITYNILIES 540
Query: 541 FCKDRKVSEAMDLFEEMKTRGLTPDIVTLCTLICGLCSNGELDKAYQLFLTLEKEYKFSY 600
FCKDRKVSEAM+LFEEMKTRGLTPDIVTLCTLICGLCSNGELDKAY+LF+TLEKEYKFSY
Sbjct: 541 FCKDRKVSEAMELFEEMKTRGLTPDIVTLCTLICGLCSNGELDKAYELFVTLEKEYKFSY 600
Query: 601 STAIFNIMINAFCEKLNINMAEKLFHKMGGCDCAPDNYTYRVMIDSYCKTGNIDPAHTFL 660
STAIFNIMINAF EKLN++M EKLFHKMGG DCAPDNYTYRVMIDSYCKTGNID AHTFL
Sbjct: 601 STAIFNIMINAFSEKLNVSMVEKLFHKMGGSDCAPDNYTYRVMIDSYCKTGNIDLAHTFL 660
Query: 661 LEKINKGFVPSFTTCGRVLNCLCVKHRLSEAVDIINLMVQNGIVPEEVNSIFEADKKEIA 720
LEKI+KG VPSFTTCG+VLNCLCVKHRL+EAVDIINLMVQNGIVPEEVNSIFEADKKE+A
Sbjct: 661 LEKISKGLVPSFTTCGKVLNCLCVKHRLNEAVDIINLMVQNGIVPEEVNSIFEADKKEVA 720
Query: 721 APKIVVEYLMKKSHITYYSYELLYDGIRD 750
APKIVVEYL+KKSHITYYSYELLYDGIR+
Sbjct: 721 APKIVVEYLLKKSHITYYSYELLYDGIRE 749
BLAST of HG10020559 vs. TAIR 10
Match:
AT1G74580.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 984.2 bits (2543), Expect = 6.1e-287
Identity = 461/756 (60.98%), Positives = 594/756 (78.57%), Query Frame = 0
Query: 1 MNRALQPKHVAAVIRYQNDPLKALQMFNQVKTEDGFKHTLATYKCMIEKLGLHGQFEAME 60
M L PKHV AVI+ Q DP+KAL+MFN ++ E GFKHTL+TY+ +IEKLG +G+FEAME
Sbjct: 1 MGPPLLPKHVTAVIKCQKDPMKALEMFNSMRKEVGFKHTLSTYRSVIEKLGYYGKFEAME 60
Query: 61 DVLADMRKNVDNKMLEGVYIRIMRDYGRKGKVQEAVNVFERMDFYDCEPSVQSYNVIMNI 120
+VL DMR+NV N MLEGVY+ M++YGRKGKVQEAVNVFERMDFYDCEP+V SYN IM++
Sbjct: 61 EVLVDMRENVGNHMLEGVYVGAMKNYGRKGKVQEAVNVFERMDFYDCEPTVFSYNAIMSV 120
Query: 121 LVEYGYFNQAHKVYMRMKYIGIYPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFN 180
LV+ GYF+QAHKVYMRM+ GI PDVY+ TIR+KSFC+T RP AALRLLNNM QGCE N
Sbjct: 121 LVDSGYFDQAHKVYMRMRDRGITPDVYSFTIRMKSFCKTSRPHAALRLLNNMSSQGCEMN 180
Query: 181 AVSYCTVIGGFYEENCQIEAYHLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFN 240
V+YCTV+GGFYEEN + E Y LF +ML G+ + TFNKL+ VLCKKG+V+E EKL +
Sbjct: 181 VVAYCTVVGGFYEENFKAEGYELFGKMLASGVSLCLSTFNKLLRVLCKKGDVKECEKLLD 240
Query: 241 KVLKRGVCPNLFTFNIFIQGLCRKGRIDEAARLLESILSEGLTPDVVSYNTLICGFCKHS 300
KV+KRGV PNLFT+N+FIQGLC++G +D A R++ ++ +G PDV++YN LI G CK+S
Sbjct: 241 KVIKRGVLPNLFTYNLFIQGLCQRGELDGAVRMVGCLIEQGPKPDVITYNNLIYGLCKNS 300
Query: 301 KLVEAECYLRKMVNSGFEPNEFTYNTIINGFCKMGMMQNADKILCDAMFKGFMPDEFTYS 360
K EAE YL KMVN G EP+ +TYNT+I G+CK GM+Q A++I+ DA+F GF+PD+FTY
Sbjct: 301 KFQEAEVYLGKMVNEGLEPDSYTYNTLIAGYCKGGMVQLAERIVGDAVFNGFVPDQFTYR 360
Query: 361 SLINGLCDDGDMNQAMAVFNEAMEKGFKHSIILYNTVVKGFSKQGLVLQALQLMKDMMGH 420
SLI+GLC +G+ N+A+A+FNEA+ KG K ++ILYNT++KG S QG++L+A QL +M
Sbjct: 361 SLIDGLCHEGETNRALALFNEALGKGIKPNVILYNTLIKGLSNQGMILEAAQLANEMSEK 420
Query: 421 GCSPDIWTYNLVVNGLCKMGCLSDASELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKA 480
G P++ T+N++VNGLCKMGC+SDA L+ I+KG PDIFTFN LI GY QL ++ A
Sbjct: 421 GLIPEVQTFNILVNGLCKMGCVSDADGLVKVMISKGYFPDIFTFNILIHGYSTQLKMENA 480
Query: 481 IEILDTMLSHGITPDVITYNTLLNGLCKARKLDNVVETFKAMLEKGCTPNIITYNILIES 540
+EILD ML +G+ PDV TYN+LLNGLCK K ++V+ET+K M+EKGC PN+ T+NIL+ES
Sbjct: 481 LEILDVMLDNGVDPDVYTYNSLLNGLCKTSKFEDVMETYKTMVEKGCAPNLFTFNILLES 540
Query: 541 FCKDRKVSEAMDLFEEMKTRGLTPDIVTLCTLICGLCSNGELDKAYQLFLTLEKEYKFSY 600
C+ RK+ EA+ L EEMK + + PD VT TLI G C NG+LD AY LF +E+ YK S
Sbjct: 541 LCRYRKLDEALGLLEEMKNKSVNPDAVTFGTLIDGFCKNGDLDGAYTLFRKMEEAYKVSS 600
Query: 601 STAIFNIMINAFCEKLNINMAEKLFHKMGGCDCAPDNYTYRVMIDSYCKTGNIDPAHTFL 660
ST +NI+I+AF EKLN+ MAEKLF +M PD YTYR+M+D +CKTGN++ + FL
Sbjct: 601 STPTYNIIIHAFTEKLNVTMAEKLFQEMVDRCLGPDGYTYRLMVDGFCKTGNVNLGYKFL 660
Query: 661 LEKINKGFVPSFTTCGRVLNCLCVKHRLSEAVDIINLMVQNGIVPEEVNSIFEADKKEIA 720
LE + GF+PS TT GRV+NCLCV+ R+ EA II+ MVQ G+VPE VN+I + DKKE+A
Sbjct: 661 LEMMENGFIPSLTTLGRVINCLCVEDRVYEAAGIIHRMVQKGLVPEAVNTICDVDKKEVA 720
Query: 721 APKIVVEYLMKKSHITYYSYELLYDGIRDRKLEKKK 757
APK+V+E L+KKS ITYY+YELL+DG+RD++L KKK
Sbjct: 721 APKLVLEDLLKKSCITYYAYELLFDGLRDKRLRKKK 756
BLAST of HG10020559 vs. TAIR 10
Match:
AT5G64320.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 377.1 bits (967), Expect = 3.4e-104
Identity = 210/634 (33.12%), Positives = 354/634 (55.84%), Query Frame = 0
Query: 23 ALQMFNQVKTEDGFKHTLATYKCMIEKLGLHGQFEAMEDVLADMRKNVDNKMLEGVYIRI 82
++++F+ +++G++H+ Y+ +I KLG +G+F+ ++ +L M K+ E ++I I
Sbjct: 94 SMELFSWTGSQNGYRHSFDVYQVLIGKLGANGEFKTIDRLLIQM-KDEGIVFKESLFISI 153
Query: 83 MRDYGRKGKVQEAVN-VFERMDFYDCEPSVQSYNVIMNILVEYGYFNQAHKVYMRMKYIG 142
MRDY + G + + E + Y CEP+ +SYNV++ ILV A V+ M
Sbjct: 154 MRDYDKAGFPGQTTRLMLEMRNVYSCEPTFKSYNVVLEILVSGNCHKVAANVFYDMLSRK 213
Query: 143 IYPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFNAVSYCTVIGGFYEENCQIEAY 202
I P ++T + +K+FC +AL LL +M GC N+V Y T+I + N EA
Sbjct: 214 IPPTLFTFGVVMKAFCAVNEIDSALSLLRDMTKHGCVPNSVIYQTLIHSLSKCNRVNEAL 273
Query: 203 HLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFNKVLKRGVCPNLFTFNIFIQGL 262
L +EM G PD TFN +I LCK + E+ K+ N++L RG P+ T+ + GL
Sbjct: 274 QLLEEMFLMGCVPDAETFNDVILGLCKFDRINEAAKMVNRMLIRGFAPDDITYGYLMNGL 333
Query: 263 CRKGRIDEAARLLESILSEGLTPDVVSYNTLICGFCKHSKLVEAECYLRKMVNS-GFEPN 322
C+ GR+D A L I P++V +NTLI GF H +L +A+ L MV S G P+
Sbjct: 334 CKIGRVDAAKDLFYRIPK----PEIVIFNTLIHGFVTHGRLDDAKAVLSDMVTSYGIVPD 393
Query: 323 EFTYNTIINGFCKMGMMQNADKILCDAMFKGFMPDEFTYSSLINGLCDDGDMNQAMAVFN 382
TYN++I G+ K G++ A ++L D KG P+ ++Y+ L++G C G +++A V N
Sbjct: 394 VCTYNSLIYGYWKEGLVGLALEVLHDMRNKGCKPNVYSYTILVDGFCKLGKIDEAYNVLN 453
Query: 383 EAMEKGFKHSIILYNTVVKGFSKQGLVLQALQLMKDMMGHGCSPDIWTYNLVVNGLCKMG 442
E G K + + +N ++ F K+ + +A+++ ++M GC PD++T+N +++GLC++
Sbjct: 454 EMSADGLKPNTVGFNCLISAFCKEHRIPEAVEIFREMPRKGCKPDVYTFNSLISGLCEVD 513
Query: 443 CLSDASELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKAIEILDTMLSHGITPDVITYN 502
+ A LL D I++G + + T+NTLI+ + ++ + +A ++++ M+ G D ITYN
Sbjct: 514 EIKHALWLLRDMISEGVVANTVTYNTLINAFLRRGEIKEARKLVNEMVFQGSPLDEITYN 573
Query: 503 TLLNGLCKARKLDNVVETFKAMLEKGCTPNIITYNILIESFCKDRKVSEAMDLFEEMKTR 562
+L+ GLC+A ++D F+ ML G P+ I+ NILI C+ V EA++ +EM R
Sbjct: 574 SLIKGLCRAGEVDKARSLFEKMLRDGHAPSNISCNILINGLCRSGMVEEAVEFQKEMVLR 633
Query: 563 GLTPDIVTLCTLICGLCSNGELDKAYQLFLTLEKEYKFSYSTAIFNIMINAFCEKLNINM 622
G TPDIVT +LI GLC G ++ +F L+ E T FN +++ C+ +
Sbjct: 634 GSTPDIVTFNSLINGLCRAGRIEDGLTMFRKLQAE-GIPPDTVTFNTLMSWLCKGGFVYD 693
Query: 623 AEKLFHKMGGCDCAPDNYTYRVMIDSYCKTGNID 655
A L + P++ T+ +++ S +D
Sbjct: 694 ACLLLDEGIEDGFVPNHRTWSILLQSIIPQETLD 721
BLAST of HG10020559 vs. TAIR 10
Match:
AT3G53700.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 363.6 bits (932), Expect = 3.9e-100
Identity = 211/698 (30.23%), Postives = 353/698 (50.57%), Query Frame = 0
Query: 14 IRYQNDPLKALQMFNQVKTEDGFKHTLATYKCMIEKLGLHGQFEAMEDVLADMRKNVDNK 73
+R Q D AL++FN + F A Y+ ++ +LG G F+ M+ +L DM K+ +
Sbjct: 57 LRSQPDDSAALRLFNLASKKPNFSPEPALYEEILLRLGRSGSFDDMKKILEDM-KSSRCE 116
Query: 74 MLEGVYIRIMRDYGRKGKVQEAVNVFERM-DFYDCEPSVQSYNVIMNILVEYGYFNQAHK 133
M ++ ++ Y + E ++V + M D + +P YN ++N+LV+
Sbjct: 117 MGTSTFLILIESYAQFELQDEILSVVDWMIDEFGLKPDTHFYNRMLNLLVDGNSLKLVEI 176
Query: 134 VYMRMKYIGIYPDVYTHTIRIKSFCRTGRPSAALRLLNNMPGQGCEFNAVSYCTVIGGFY 193
+ +M GI PDV T + IK+ CR + A+ +L +MP G + ++ TV+ G+
Sbjct: 177 SHAKMSVWGIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYI 236
Query: 194 EENCQIEAYHLFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFNKVLKR-GVCPNL 253
EE A + ++M++ G ++ N ++H CK+G V+++ ++ + G P+
Sbjct: 237 EEGDLDGALRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQ 296
Query: 254 FTFNIFIQGLCRKGRIDEAARLLESILSEGLTPDVVSYNTLICGFCKHSKLVEAECYLRK 313
+TFN + GLC+ G + A +++ +L EG PDV +YN++I G CK ++ EA L +
Sbjct: 297 YTFNTLVNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQ 356
Query: 314 MVNSGFEPNEFTYNTIINGFCKMGMMQNADKILCDAMFKGFMPDEFTYSSLINGLCDDGD 373
M+ PN TYNT+I+ CK ++ A ++ KG +PD T++SLI GLC +
Sbjct: 357 MITRDCSPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRN 416
Query: 374 MNQAMAVFNEAMEKGFKHSIILYNTVVKGFSKQGLVLQALQLMKDMMGHGCSPDIWTYNL 433
AM +F E M GC PD +TYN+
Sbjct: 417 HRVAMELFEE-----------------------------------MRSKGCEPDEFTYNM 476
Query: 434 VVNGLCKMGCLSDASELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKAIEILDTMLSHG 493
+++ LC G L +A +L GC + T+NTLIDG+CK +A EI D M HG
Sbjct: 477 LIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKANKTREAEEIFDEMEVHG 536
Query: 494 ITPDVITYNTLLNGLCKARKLDNVVETFKAMLEKGCTPNIITYNILIESFCKDRKVSEAM 553
++ + +TYNTL++GLCK+R++++ + M+ +G P+ TYN L+ FC+ + +A
Sbjct: 537 VSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYNSLLTHFCRGGDIKKAA 596
Query: 554 DLFEEMKTRGLTPDIVTLCTLICGLCSNGELDKAYQLFLTLEKEYKFSYSTAIFNIMINA 613
D+ + M + G PDIVT TLI GLC G ++ A +L +++ + + + +N +I
Sbjct: 597 DIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSIQMK-GINLTPHAYNPVIQG 656
Query: 614 FCEKLNINMAEKLFHKM-GGCDCAPDNYTYRVMIDSYCKTGN-IDPAHTFLLEKINKGFV 673
K A LF +M + PD +YR++ C G I A FL+E + KGFV
Sbjct: 657 LFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFRGLCNGGGPIREAVDFLVELLEKGFV 716
Query: 674 PSFTTCGRVLNCLCVKHRLSEAVDIINLMVQNGIVPEE 708
P F++ + L V ++N+++Q EE
Sbjct: 717 PEFSSLYMLAEGLLTLSMEETLVKLVNMVMQKARFSEE 717
BLAST of HG10020559 vs. TAIR 10
Match:
AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 339.0 bits (868), Expect = 1.0e-92
Identity = 177/558 (31.72%), Positives = 299/558 (53.58%), Query Frame = 0
Query: 153 IKSFCRTGRPSAALRLLNNMPGQGCEFNAVSYCTVIGGFYEENCQIE-AYHLFDEMLKQG 212
+KS+ R AL +++ G +SY V+ I A ++F EML+
Sbjct: 141 VKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQ 200
Query: 213 ICPDILTFNKLIHVLCKKGNVQESEKLFNKVLKRGVCPNLFTFNIFIQGLCRKGRIDEAA 272
+ P++ T+N LI C GN+ + LF+K+ +G PN+ T+N I G C+ +ID+
Sbjct: 201 VSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGF 260
Query: 273 RLLESILSEGLTPDVVSYNTLICGFCKHSKLVEAECYLRKMVNSGFEPNEFTYNTIINGF 332
+LL S+ +GL P+++SYN +I G C+ ++ E L +M G+ +E TYNT+I G+
Sbjct: 261 KLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGY 320
Query: 333 CKMGMMQNADKILCDAMFKGFMPDEFTYSSLINGLCDDGDMNQAMAVFNEAMEKGFKHSI 392
CK G A + + + G P TY+SLI+ +C G+MN+AM ++ +G +
Sbjct: 321 CKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNE 380
Query: 393 ILYNTVVKGFSKQGLVLQALQLMKDMMGHGCSPDIWTYNLVVNGLCKMGCLSDASELLND 452
Y T+V GFS++G + +A +++++M +G SP + TYN ++NG C G + DA +L D
Sbjct: 381 RTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLED 440
Query: 453 AIAKGCIPDIFTFNTLIDGYCKQLNLDKAIEILDTMLSHGITPDVITYNTLLNGLCKARK 512
KG PD+ +++T++ G+C+ ++D+A+ + M+ GI PD ITY++L+ G C+ R+
Sbjct: 441 MKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRR 500
Query: 513 LDNVVETFKAMLEKGCTPNIITYNILIESFCKDRKVSEAMDLFEEMKTRGLTPDIVTLCT 572
+ ++ ML G P+ TY LI ++C + + +A+ L EM +G+ PD+VT
Sbjct: 501 TKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSV 560
Query: 573 LICGLCSNGELDKAYQLFLTLEKEYK----FSYSTAIFNI----------MINAFCEKLN 632
LI GL +A +L L L E +Y T I N +I FC K
Sbjct: 561 LINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTLIENCSNIEFKSVVSLIKGFCMKGM 620
Query: 633 INMAEKLFHKMGGCDCAPDNYTYRVMIDSYCKTGNIDPAHTFLLEKINKGFVPSFTTCGR 692
+ A+++F M G + PD Y +MI +C+ G+I A+T E + GF+ T
Sbjct: 621 MTEADQVFESMLGKNHKPDGTAYNIMIHGHCRAGDIRKAYTLYKEMVKSGFLLHTVTVIA 680
Query: 693 VLNCLCVKHRLSEAVDII 696
++ L + +++E +I
Sbjct: 681 LVKALHKEGKVNELNSVI 698
BLAST of HG10020559 vs. TAIR 10
Match:
AT1G12775.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 331.3 bits (848), Expect = 2.1e-90
Identity = 168/509 (33.01%), Positives = 283/509 (55.60%), Query Frame = 0
Query: 203 LFDEMLKQGICPDILTFNKLIHVLCKKGNVQESEKLFNKVLKRGVCPNLFTFNIFIQGLC 262
L +M +GI I T + +I+ C+ + + K++K G P+ FN + GLC
Sbjct: 110 LCKQMESKGIAHSIYTLSIMINCFCRCRKLSYAFSTMGKIMKLGYEPDTVIFNTLLNGLC 169
Query: 263 RKGRIDEAARLLESILSEGLTPDVVSYNTLICGFCKHSKLVEAECYLRKMVNSGFEPNEF 322
+ R+ EA L++ ++ G P +++ NTL+ G C + K+ +A + +MV +GF+PNE
Sbjct: 170 LECRVSEALELVDRMVEMGHKPTLITLNTLVNGLCLNGKVSDAVVLIDRMVETGFQPNEV 229
Query: 323 TYNTIINGFCKMGMMQNADKILCDAMFKGFMPDEFTYSSLINGLCDDGDMNQAMAVFNEA 382
TY ++N CK G A ++L + D YS +I+GLC DG ++ A +FNE
Sbjct: 230 TYGPVLNVMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDGLCKDGSLDNAFNLFNEM 289
Query: 383 MEKGFKHSIILYNTVVKGFSKQGLVLQALQLMKDMMGHGCSPDIWTYNLVVNGLCKMGCL 442
KGFK II YNT++ GF G +L++DM+ SP++ T++++++ K G L
Sbjct: 290 EIKGFKADIITYNTLIGGFCNAGRWDDGAKLLRDMIKRKISPNVVTFSVLIDSFVKEGKL 349
Query: 443 SDASELLNDAIAKGCIPDIFTFNTLIDGYCKQLNLDKAIEILDTMLSHGITPDVITYNTL 502
+A +LL + + +G P+ T+N+LIDG+CK+ L++AI+++D M+S G PD++T+N L
Sbjct: 350 READQLLKEMMQRGIAPNTITYNSLIDGFCKENRLEEAIQMVDLMISKGCDPDIMTFNIL 409
Query: 503 LNGLCKARKLDNVVETFKAMLEKGCTPNIITYNILIESFCKDRKVSEAMDLFEEMKTRGL 562
+NG CKA ++D+ +E F+ M +G N +TYN L++ FC+ K+ A LF+EM +R +
Sbjct: 410 INGYCKANRIDDGLELFREMSLRGVIANTVTYNTLVQGFCQSGKLEVAKKLFQEMVSRRV 469
Query: 563 TPDIVTLCTLICGLCSNGELDKAYQLFLTLEKEYKFSYSTAIFNIMINAFCEKLNINMAE 622
PDIV+ L+ GLC NGEL+KA ++F +EK K I+ I+I+ C ++ A
Sbjct: 470 RPDIVSYKILLDGLCDNGELEKALEIFGKIEKS-KMELDIGIYMIIIHGMCNASKVDDAW 529
Query: 623 KLFHKMGGCDCAPDNYTYRVMIDSYCKTGNIDPAHTFLLEKINKGFVPSFTTCGRVLNCL 682
LF + D Y +MI C+ ++ A + +G P T ++
Sbjct: 530 DLFCSLPLKGVKLDARAYNIMISELCRKDSLSKADILFRKMTEEGHAPDELTYNILIRAH 589
Query: 683 CVKHRLSEAVDIINLMVQNGIVPEEVNSI 712
+ A ++I M +G P +V+++
Sbjct: 590 LGDDDATTAAELIEEMKSSGF-PADVSTV 616
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038893558.1 | 0.0e+00 | 95.21 | putative pentatricopeptide repeat-containing protein At1g74580 [Benincasa hispid... | [more] |
KAG6584456.1 | 0.0e+00 | 92.03 | putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyros... | [more] |
XP_022923786.1 | 0.0e+00 | 92.03 | uncharacterized protein LOC111431396 [Cucurbita moschata] | [more] |
XP_023519594.1 | 0.0e+00 | 91.39 | putative pentatricopeptide repeat-containing protein At1g74580 [Cucurbita pepo s... | [more] |
XP_011649732.1 | 0.0e+00 | 91.96 | putative pentatricopeptide repeat-containing protein At1g74580 [Cucumis sativus]... | [more] |
Match Name | E-value | Identity | Description | |
Q9CA58 | 8.6e-286 | 60.98 | Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis th... | [more] |
Q9FMF6 | 4.8e-103 | 33.12 | Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidop... | [more] |
Q9LFF1 | 5.5e-99 | 30.23 | Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop... | [more] |
Q9FIX3 | 1.4e-91 | 31.72 | Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX... | [more] |
Q9LPX2 | 3.0e-89 | 33.01 | Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidop... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1E7P0 | 0.0e+00 | 92.03 | uncharacterized protein LOC111431396 OS=Cucurbita moschata OX=3662 GN=LOC1114313... | [more] |
A0A6J1KND0 | 0.0e+00 | 91.94 | uncharacterized protein LOC111495738 OS=Cucurbita maxima OX=3661 GN=LOC111495738... | [more] |
A0A1S3B3U5 | 0.0e+00 | 91.35 | putative pentatricopeptide repeat-containing protein At1g74580 isoform X1 OS=Cuc... | [more] |
A0A1S4DTL9 | 0.0e+00 | 91.47 | putative pentatricopeptide repeat-containing protein At1g74580 isoform X2 OS=Cuc... | [more] |
A0A5A7SZQ7 | 0.0e+00 | 92.26 | Putative pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa... | [more] |
Match Name | E-value | Identity | Description | |
AT1G74580.1 | 6.1e-287 | 60.98 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT5G64320.1 | 3.4e-104 | 33.12 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT3G53700.1 | 3.9e-100 | 30.23 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT5G39710.1 | 1.0e-92 | 31.72 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT1G12775.1 | 2.1e-90 | 33.01 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
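The Identity column in the tables above is the percentage printed on each hit's "Identity = n/len (x%)" line, that is, identical residues divided by alignment length. The sketch below is a minimal Python illustration of that arithmetic, not part of the database's own tooling: the function name parse_hsp_stats and the returned dictionary keys are hypothetical, and the pattern assumes the statistics line is laid out exactly as in this listing. It accepts both the "Postives" spelling and "Positives".

```python
import re

# Pattern for the per-HSP statistics line as laid out in this listing (assumed format).
# "Pos(?:i)?tives" matches both the "Postives" spelling and "Positives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos(?:i)?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts and recomputed percentages from one statistics line, or None."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Percentages recomputed from the raw counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages exactly as printed in the report, kept for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


# Statistics line of the AT1G74580.1 hit from the TAIR 10 section.
stats = parse_hsp_stats(
    "Identity = 461/756 (60.98%), Postives = 594/756 (78.57%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported percentages is a cheap sanity check when these lines are scraped from a page rather than read from the original BLAST output.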