HG10022270 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10022270
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr05: 22497195 .. 22499144 (-)
RNA-Seq Expression: HG10022270
Synteny: HG10022270
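
The Location field packs a chromosome, a start, an end and a strand into one string. The sketch below shows one way to split it and derive the feature length. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the helper name `parse_location` is hypothetical rather than part of any database API.

```python
import re

def parse_location(text):
    """Split a string such as 'Chr05: 22497195 .. 22499144 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so both ends are counted.
        "length": end - start + 1,
    }

loc = parse_location("Chr05: 22497195 .. 22499144 (-)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr05 - 1950
```

Under that assumption the feature spans 1,950 bp, which divides evenly into 650 codons.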
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTGAGGACTCAATTCCCACTCAAAAGTGTTCTTGTTCGTATTGGACGTCATGGGTCCATGCTTCAGGTTGTTTCTCTGTCATCTTTAACATCTGATAGTCTCATTACCACCGTACTTAACTGTAGAAGCCCCGAGAGAGCACTTGAATTATTCAATGCGGCACCAGAAAAGAATACTCAGCTTTACTCAGCCATCATTCACGTCTTGGTTGGATCCAAGCTATTTTCCCATGCCAGATGTTTGCTGAAAGGTCTCGTAGAAAACCTCCTCAGAAAATCTCACAAGCGTTACCATGTATGTCAGTTGGCATTCAATGCGTTGAGTCAATTAAAATCCTCAAAATTTACTCCCAATGTATATGGTGAGTTACTTATTGTCTTATCTAAAATGGGGCTTGTAGAAGAAGCTTTGTGGATGTACCGCAAAGTTGGGGCGGCACTCGCAAAGCAGGCTTGTAATGTGCTTTTAGATGTCTTGGTTAAGACTGGGAAATTTGAATTGTTGTGGAGGATTTTTGAAGAAATGATTTCCAATGGGCTATCTCCTGATGTTATCACTTATGGCATCCTAATCGATGGTTGCTGCCGTCAGGGAGATCTTTTAAGGGCACATGAGATGTTTGATGAAATGAGAGCAAAAGGAATTGAACCAACAGTTGTCGTGTACACCATTCTTATTCGTGGCCTTTGCTCAGAAAACAAAATGGAGGAAGCGGAGAGTATGCATAGAGCAATGAGGGAAGTCGGGGTGCTTCCAAACGTGTATACTTACAACACTTTGATGGATGGGTACTGCAAGTTGGCCAATGCAAAACAAGCACTTAGATTGTATCAAGATATGATTGGTGAAGGTTTAGTGCCAGACGATGTTACATTTGGCATTTTAATTGATGGACTCTGCAAATTTGGTGACATCAAGGCTGCTCGGAATCTTTCTGTGAATATAGTAAAATTTAGTGTTACTCCTAGCATAGCTGTATATAATTCTTTGATCGATGGTTACTGTAGGGCAGGAGATGTTTCTGAAGCAATTGCTTTCCTTTTAGAGTTGGAAAGATTTAAAGTTTCGCCTGATGTCATTACATACAGTATACTTATTAGAGGTCTCTGTTCTGCGGGTCGAACTGAAGAAGCGGATAACATATTTGAGAAAATGATGAAAGATGGAATTCCTGCTAACTCTGTTACATATAATTCATTAATCAATGGGTGCTGCAAAGAAGGCAACATGAAAAAAGCGCTGGAAATATGCTCTCGAATGACTGAAAATGGTGTTGAACCAAATGTGATCACTTTCTCAATCCTGATTGATGGGTATTGCAAGATAAGGAACTTGCGAGCTGCTATGGGCATATATTCAGAAATGGTTATCAAAAGCCTTTCTCCTGATGTGGTTGCTTATACAGCTATGATAGATGGGCATTGCAAATATGGTAGCATGAAAGAGGCTCTTAAACTATATAGTGATATGCTGGATAATGGCCTTACCCCTAATTGTTACACTATCAGTTGCTTATTAGATGGACTTTGTAAAGATGGCAGAATTTCAAATGCACTCAAACTTTTCACGGAAAACACTGGATTTCAGACTACCAGGTGCGATGTCAACGCGGGAGGAAGCAAGCCTTCGTTCACAAATCATGTCGTGTATACTGCTCTAATCCATGGATTGTGTGAGGATGGGCAAATTTTTAAGGCAGCAAAGTTGTTTTCAGACATGAGATGTTACGGTTTGCAACCAGATGAAGTGATTTATGTAGTCATGTTACAAGGGTACTTTCAAGTTAAACGCATCCTCGACGTGATGATGCTACATGCAGACATGTTAAAATTTGGTGTTATCCCAAATTCAGCCATCTACTCAATATTATGTAAGGGTTATCGAGAGAGCGGATTTCTGAAATCAGCTCTGAATTGTTCCAAGGATCTTGAGGAACTACATAAATGA

mRNA sequence

ATGTTGAGGACTCAATTCCCACTCAAAAGTGTTCTTGTTCGTATTGGACGTCATGGGTCCATGCTTCAGGTTGTTTCTCTGTCATCTTTAACATCTGATAGTCTCATTACCACCGTACTTAACTGTAGAAGCCCCGAGAGAGCACTTGAATTATTCAATGCGGCACCAGAAAAGAATACTCAGCTTTACTCAGCCATCATTCACGTCTTGGTTGGATCCAAGCTATTTTCCCATGCCAGATGTTTGCTGAAAGGTCTCGTAGAAAACCTCCTCAGAAAATCTCACAAGCGTTACCATGTATGTCAGTTGGCATTCAATGCGTTGAGTCAATTAAAATCCTCAAAATTTACTCCCAATGTATATGGTGAGTTACTTATTGTCTTATCTAAAATGGGGCTTGTAGAAGAAGCTTTGTGGATGTACCGCAAAGTTGGGGCGGCACTCGCAAAGCAGGCTTGTAATGTGCTTTTAGATGTCTTGGTTAAGACTGGGAAATTTGAATTGTTGTGGAGGATTTTTGAAGAAATGATTTCCAATGGGCTATCTCCTGATGTTATCACTTATGGCATCCTAATCGATGGTTGCTGCCGTCAGGGAGATCTTTTAAGGGCACATGAGATGTTTGATGAAATGAGAGCAAAAGGAATTGAACCAACAGTTGTCGTGTACACCATTCTTATTCGTGGCCTTTGCTCAGAAAACAAAATGGAGGAAGCGGAGAGTATGCATAGAGCAATGAGGGAAGTCGGGGTGCTTCCAAACGTGTATACTTACAACACTTTGATGGATGGGTACTGCAAGTTGGCCAATGCAAAACAAGCACTTAGATTGTATCAAGATATGATTGGTGAAGGTTTAGTGCCAGACGATGTTACATTTGGCATTTTAATTGATGGACTCTGCAAATTTGGTGACATCAAGGCTGCTCGGAATCTTTCTGTGAATATAGTAAAATTTAGTGTTACTCCTAGCATAGCTGTATATAATTCTTTGATCGATGGTTACTGTAGGGCAGGAGATGTTTCTGAAGCAATTGCTTTCCTTTTAGAGTTGGAAAGATTTAAAGTTTCGCCTGATGTCATTACATACAGTATACTTATTAGAGGTCTCTGTTCTGCGGGTCGAACTGAAGAAGCGGATAACATATTTGAGAAAATGATGAAAGATGGAATTCCTGCTAACTCTGTTACATATAATTCATTAATCAATGGGTGCTGCAAAGAAGGCAACATGAAAAAAGCGCTGGAAATATGCTCTCGAATGACTGAAAATGGTGTTGAACCAAATGTGATCACTTTCTCAATCCTGATTGATGGGTATTGCAAGATAAGGAACTTGCGAGCTGCTATGGGCATATATTCAGAAATGGTTATCAAAAGCCTTTCTCCTGATGTGGTTGCTTATACAGCTATGATAGATGGGCATTGCAAATATGGTAGCATGAAAGAGGCTCTTAAACTATATAGTGATATGCTGGATAATGGCCTTACCCCTAATTGTTACACTATCAGTTGCTTATTAGATGGACTTTGTAAAGATGGCAGAATTTCAAATGCACTCAAACTTTTCACGGAAAACACTGGATTTCAGACTACCAGGTGCGATGTCAACGCGGGAGGAAGCAAGCCTTCGTTCACAAATCATGTCGTGTATACTGCTCTAATCCATGGATTGTGTGAGGATGGGCAAATTTTTAAGGCAGCAAAGTTGTTTTCAGACATGAGATGTTACGGTTTGCAACCAGATGAAGTGATTTATGTAGTCATGTTACAAGGGTACTTTCAAGTTAAACGCATCCTCGACGTGATGATGCTACATGCAGACATGTTAAAATTTGGTGTTATCCCAAATTCAGCCATCTACTCAATATTATGTAAGGGTTATCGAGAGAGCGGATTTCTGAAATCAGCTCTGAATTGTTCCAAGGATCTTGAGGAACTACATAAATGA

Coding sequence (CDS)

ATGTTGAGGACTCAATTCCCACTCAAAAGTGTTCTTGTTCGTATTGGACGTCATGGGTCCATGCTTCAGGTTGTTTCTCTGTCATCTTTAACATCTGATAGTCTCATTACCACCGTACTTAACTGTAGAAGCCCCGAGAGAGCACTTGAATTATTCAATGCGGCACCAGAAAAGAATACTCAGCTTTACTCAGCCATCATTCACGTCTTGGTTGGATCCAAGCTATTTTCCCATGCCAGATGTTTGCTGAAAGGTCTCGTAGAAAACCTCCTCAGAAAATCTCACAAGCGTTACCATGTATGTCAGTTGGCATTCAATGCGTTGAGTCAATTAAAATCCTCAAAATTTACTCCCAATGTATATGGTGAGTTACTTATTGTCTTATCTAAAATGGGGCTTGTAGAAGAAGCTTTGTGGATGTACCGCAAAGTTGGGGCGGCACTCGCAAAGCAGGCTTGTAATGTGCTTTTAGATGTCTTGGTTAAGACTGGGAAATTTGAATTGTTGTGGAGGATTTTTGAAGAAATGATTTCCAATGGGCTATCTCCTGATGTTATCACTTATGGCATCCTAATCGATGGTTGCTGCCGTCAGGGAGATCTTTTAAGGGCACATGAGATGTTTGATGAAATGAGAGCAAAAGGAATTGAACCAACAGTTGTCGTGTACACCATTCTTATTCGTGGCCTTTGCTCAGAAAACAAAATGGAGGAAGCGGAGAGTATGCATAGAGCAATGAGGGAAGTCGGGGTGCTTCCAAACGTGTATACTTACAACACTTTGATGGATGGGTACTGCAAGTTGGCCAATGCAAAACAAGCACTTAGATTGTATCAAGATATGATTGGTGAAGGTTTAGTGCCAGACGATGTTACATTTGGCATTTTAATTGATGGACTCTGCAAATTTGGTGACATCAAGGCTGCTCGGAATCTTTCTGTGAATATAGTAAAATTTAGTGTTACTCCTAGCATAGCTGTATATAATTCTTTGATCGATGGTTACTGTAGGGCAGGAGATGTTTCTGAAGCAATTGCTTTCCTTTTAGAGTTGGAAAGATTTAAAGTTTCGCCTGATGTCATTACATACAGTATACTTATTAGAGGTCTCTGTTCTGCGGGTCGAACTGAAGAAGCGGATAACATATTTGAGAAAATGATGAAAGATGGAATTCCTGCTAACTCTGTTACATATAATTCATTAATCAATGGGTGCTGCAAAGAAGGCAACATGAAAAAAGCGCTGGAAATATGCTCTCGAATGACTGAAAATGGTGTTGAACCAAATGTGATCACTTTCTCAATCCTGATTGATGGGTATTGCAAGATAAGGAACTTGCGAGCTGCTATGGGCATATATTCAGAAATGGTTATCAAAAGCCTTTCTCCTGATGTGGTTGCTTATACAGCTATGATAGATGGGCATTGCAAATATGGTAGCATGAAAGAGGCTCTTAAACTATATAGTGATATGCTGGATAATGGCCTTACCCCTAATTGTTACACTATCAGTTGCTTATTAGATGGACTTTGTAAAGATGGCAGAATTTCAAATGCACTCAAACTTTTCACGGAAAACACTGGATTTCAGACTACCAGGTGCGATGTCAACGCGGGAGGAAGCAAGCCTTCGTTCACAAATCATGTCGTGTATACTGCTCTAATCCATGGATTGTGTGAGGATGGGCAAATTTTTAAGGCAGCAAAGTTGTTTTCAGACATGAGATGTTACGGTTTGCAACCAGATGAAGTGATTTATGTAGTCATGTTACAAGGGTACTTTCAAGTTAAACGCATCCTCGACGTGATGATGCTACATGCAGACATGTTAAAATTTGGTGTTATCCCAAATTCAGCCATCTACTCAATATTATGTAAGGGTTATCGAGAGAGCGGATTTCTGAAATCAGCTCTGAATTGTTCCAAGGATCTTGAGGAACTACATAAATGA

Protein sequence

MLRTQFPLKSVLVRIGRHGSMLQVVSLSSLTSDSLITTVLNCRSPERALELFNAAPEKNTQLYSAIIHVLVGSKLFSHARCLLKGLVENLLRKSHKRYHVCQLAFNALSQLKSSKFTPNVYGELLIVLSKMGLVEEALWMYRKVGAALAKQACNVLLDVLVKTGKFELLWRIFEEMISNGLSPDVITYGILIDGCCRQGDLLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSENKMEEAESMHRAMREVGVLPNVYTYNTLMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFGILIDGLCKFGDIKAARNLSVNIVKFSVTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERFKVSPDVITYSILIRGLCSAGRTEEADNIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKALEICSRMTENGVEPNVITFSILIDGYCKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDGHCKYGSMKEALKLYSDMLDNGLTPNCYTISCLLDGLCKDGRISNALKLFTENTGFQTTRCDVNAGGSKPSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYFQVKRILDVMMLHADMLKFGVIPNSAIYSILCKGYRESGFLKSALNCSKDLEELHK
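
The protein sequence above should follow from the CDS under the standard genetic code. The sketch below is a pure-Python spot check of the first and last codons; it is an illustration, not the pipeline that produced this annotation. The names `CODON_TABLE` and `translate` are hypothetical, and the standard code (NCBI translation table 1) is assumed.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTTGAGGACTCAATTC"))  # first six codons of the CDS -> MLRTQF
print(translate("CATAAATGA"))           # last three codons of the CDS -> HK
```

Both fragments agree with the start (MLRTQF) and end (HK) of the listed protein sequence. Passing the full CDS string to `translate` would extend the check to every residue.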
Homology
BLAST of HG10022270 vs. NCBI nr
Match: XP_038890299.1 (pentatricopeptide repeat-containing protein At5g61400, partial [Benincasa hispida])

HSP 1 Score: 1166.0 bits (3015), Expect = 0.0e+00
Identity = 578/646 (89.47%), Positives = 614/646 (95.05%), Query Frame = 0

Query: 1   MLRTQFPLKSVLVRIGRHGSMLQVVSLSSLTSDSLITTVLNCRSPERALELFNAAPEKNT 60
           ML TQ PLKS LVRIG  G+MLQVVSLSS T DS ITTVLNCRSP +ALE FNAAPEKNT
Sbjct: 17  MLMTQLPLKSALVRIGCQGTMLQVVSLSSSTPDSPITTVLNCRSPRKALEFFNAAPEKNT 76

Query: 61  QLYSAIIHVLVGSKLFSHARCLLKGLVENLLRKSHKRYHVCQLAFNALSQLKSSKFTPNV 120
           QLYSAIIHVLVG+KLFS+AR LLK LVE+L+ KSHK Y+VCQLAFNALS+LKSSKFTPNV
Sbjct: 77  QLYSAIIHVLVGAKLFSNARYLLKDLVEDLVAKSHKPYYVCQLAFNALSRLKSSKFTPNV 136

Query: 121 YGELLIVLSKMGLVEEALWMYRKVGAALAKQACNVLLDVLVKTGKFELLWRIFEEMISNG 180
           YGEL+IVLSKM LVEEALWMYRKVG  L KQACNVLLDVLVKTG+FELLWRI+EEMISNG
Sbjct: 137 YGELIIVLSKMRLVEEALWMYRKVGTTLTKQACNVLLDVLVKTGRFELLWRIYEEMISNG 196

Query: 181 LSPDVITYGILIDGCCRQGDLLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSENKMEEAE 240
           LS DVITYGILIDGCCRQGDLLRAHEMFDEMR KGIEPTVVVYTILIRGLCSENKMEEAE
Sbjct: 197 LSHDVITYGILIDGCCRQGDLLRAHEMFDEMRVKGIEPTVVVYTILIRGLCSENKMEEAE 256

Query: 241 SMHRAMREVGVLPNVYTYNTLMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFGILIDGL 300
           S+HR MREVGVLPNVY+YNTLMDGYCKLANAKQALRLYQDM+GEGLVPDDVTFGILIDGL
Sbjct: 257 SLHRVMREVGVLPNVYSYNTLMDGYCKLANAKQALRLYQDMLGEGLVPDDVTFGILIDGL 316

Query: 301 CKFGDIKAARNLSVNIVKFSVTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERFKVSPDV 360
           CKFGD KAARNLSVN+VKFSVTPSIAVYNSLIDGYC+AGDVSEA+AF LEL RFKVSPDV
Sbjct: 317 CKFGDSKAARNLSVNMVKFSVTPSIAVYNSLIDGYCKAGDVSEAMAFFLELGRFKVSPDV 376

Query: 361 ITYSILIRGLCSAGRTEEADNIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKALEICSR 420
           ITYSILIRGLCSAGR EEADNIFEKM KDGIPANSVTYNSLI+G CKEGN+KKALEICSR
Sbjct: 377 ITYSILIRGLCSAGRIEEADNIFEKMTKDGIPANSVTYNSLIDGYCKEGNIKKALEICSR 436

Query: 421 MTENGVEPNVITFSILIDGYCKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDGHCKYGS 480
           MTENGVEPNVITFS+LIDGYCKIR+LRAAMGIYSEMVIKSLSPDVV YT MIDGHCKYGS
Sbjct: 437 MTENGVEPNVITFSMLIDGYCKIRDLRAAMGIYSEMVIKSLSPDVVVYTTMIDGHCKYGS 496

Query: 481 MKEALKLYSDMLDNGLTPNCYTISCLLDGLCKDGRISNALKLFTENTGFQTTRCDVNAGG 540
           +KEALKLY+DML+NGLTPNC+TISCLLDGLCKDGRIS+AL+LFT+  GF TTRCDV+AGG
Sbjct: 497 IKEALKLYNDMLENGLTPNCFTISCLLDGLCKDGRISDALELFTKKIGFGTTRCDVDAGG 556

Query: 541 SKPSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYFQVKRIL 600
           ++PSFTN+VVYTALIHGLCEDGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYFQVKR+L
Sbjct: 557 TEPSFTNNVVYTALIHGLCEDGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYFQVKRVL 616

Query: 601 DVMMLHADMLKFGVIPNSAIYSILCKGYRESGFLKSALNCSKDLEE 647
           ++MMLHADMLKFGVIPNSAIYSILCKGYR+SGFLKSALNCSKDLEE
Sbjct: 617 NMMMLHADMLKFGVIPNSAIYSILCKGYRKSGFLKSALNCSKDLEE 662
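
Each HSP summary line reports its counts as fractions followed by a rounded percentage. The sketch below parses such a line and recomputes the percentages from the counts. The function name `parse_hsp_stats` is hypothetical, and the regular expression is label-agnostic, so it does not depend on how the label is spelled.

```python
import re

def parse_hsp_stats(line):
    """Return {label: (matches, alignment_length, percent)} for each 'X = a/b' field."""
    stats = {}
    for label, num, den in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        num, den = int(num), int(den)
        stats[label] = (num, den, round(100 * num / den, 2))
    return stats

line = "Identity = 578/646 (89.47%), Positives = 614/646 (95.05%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["Identity"])  # (578, 646, 89.47)
```

The denominator is the length of the alignment, not the full length of the query protein, so it can differ from one hit to the next.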

BLAST of HG10022270 vs. NCBI nr
Match: XP_022990473.1 (pentatricopeptide repeat-containing protein At5g61400 [Cucurbita maxima])

HSP 1 Score: 1119.0 bits (2893), Expect = 0.0e+00
Identity = 548/648 (84.57%), Positives = 603/648 (93.06%), Query Frame = 0

Query: 1   MLRTQFPLKSVLVRIGRHGSMLQVVSLSSLTSDSLITTVLNCRSPERALELFNAAPEKNT 60
           ML  Q PLKSVLV IGR+GS+LQ V+LSS T DSLITTVLNC+SP++ALELFNAAPEKNT
Sbjct: 1   MLMNQLPLKSVLVHIGRYGSILQAVALSSSTPDSLITTVLNCKSPKKALELFNAAPEKNT 60

Query: 61  QLYSAIIHVLVGSKLFSHARCLLKGLVENLLRKSHKRYHVCQLAFNALSQLKSSKFTPNV 120
           +LYSAIIHVLVGSKLFSHARCLLK L+++LL KS + YHVCQLAFNALS LK+SKF+PNV
Sbjct: 61  RLYSAIIHVLVGSKLFSHARCLLKDLIQDLLVKSRRPYHVCQLAFNALSSLKTSKFSPNV 120

Query: 121 YGELLIVLSKMGLVEEALWMYRKVGAALAKQACNVLLDVLVKTGKFELLWRIFEEMISNG 180
           Y EL+IVLSKMGLV+EALWMYRKVG A+A+QACNVLLDVLVKTG+FELLW I+EEM+SNG
Sbjct: 121 YSELIIVLSKMGLVDEALWMYRKVGVAVARQACNVLLDVLVKTGRFELLWGIYEEMVSNG 180

Query: 181 LSPDVITYGILIDGCCRQGDLLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSENKMEEAE 240
           LSPDVITYGILIDG CRQGDLLRAHE+FDEMR KGIEPTVVVYTI IRGLCS+NKMEEAE
Sbjct: 181 LSPDVITYGILIDGRCRQGDLLRAHEIFDEMRVKGIEPTVVVYTIFIRGLCSDNKMEEAE 240

Query: 241 SMHRAMREVGVLPNVYTYNTLMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFGILIDGL 300
            +HR MRE+GVLPNVYTYNTLM+G+CK+AN KQALRLY DM+GE LVPD+VTFGILIDGL
Sbjct: 241 GIHRLMRELGVLPNVYTYNTLMNGHCKVANVKQALRLYHDMLGEDLVPDNVTFGILIDGL 300

Query: 301 CKFGDIKAARNLSVNIVKFSVTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERFKVSPDV 360
           CKFGDIKAARNL VN+VKFSVTPSIAVYNSLIDGYC+AGD+SEA+AFL ELERFKVSPDV
Sbjct: 301 CKFGDIKAARNLLVNMVKFSVTPSIAVYNSLIDGYCKAGDISEAMAFLSELERFKVSPDV 360

Query: 361 ITYSILIRGLCSAGRTEEADNIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKALEICSR 420
           +TYSILIRGLCSAGR EEADN+ EKMMK+GIPANSVTYNSLI+GCCKEGNM KALEICSR
Sbjct: 361 VTYSILIRGLCSAGRIEEADNMLEKMMKEGIPANSVTYNSLIDGCCKEGNMNKALEICSR 420

Query: 421 MTENGVEPNVITFSILIDGYCKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDGHCKYGS 480
           M ENGVEPNVITFS+LIDGYCKIRN+ AAMGIYSEM IKSLSPDVVAYTAMIDGHCK+G+
Sbjct: 421 MIENGVEPNVITFSMLIDGYCKIRNIEAAMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGN 480

Query: 481 MKEALKLYSDMLDNGLTPNCYTISCLLDGLCKDGRISNALKLFTENTGFQTTRCDVNAGG 540
           MKEALKLY+DMLDNGLTPN YT+SCLLDGLCKDGR+ +AL+LFTE   F TT+C V+A  
Sbjct: 481 MKEALKLYNDMLDNGLTPNSYTLSCLLDGLCKDGRVLDALELFTEKAEFGTTKCKVDAAE 540

Query: 541 SKPSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYFQVKRIL 600
           SK SFTNHVVYTALIHGLCEDGQIFKAAKLFSDMR YGLQPDEVIYVVML+GYFQVKRIL
Sbjct: 541 SKLSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRSYGLQPDEVIYVVMLKGYFQVKRIL 600

Query: 601 DVMMLHADMLKFGVIPNSAIYSILCKGYRESGFLKSALNCSKDLEELH 649
           D+ MLHADMLKFG++PNSAIYS L KGYRESGFLKSALNCSK+L+EL+
Sbjct: 601 DMTMLHADMLKFGIVPNSAIYSTLSKGYRESGFLKSALNCSKELKELY 648

BLAST of HG10022270 vs. NCBI nr
Match: XP_023517041.1 (pentatricopeptide repeat-containing protein At5g61400 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1117.1 bits (2888), Expect = 0.0e+00
Identity = 551/648 (85.03%), Positives = 603/648 (93.06%), Query Frame = 0

Query: 1   MLRTQFPLKSVLVRIGRHGSMLQVVSLSSLTSDSLITTVLNCRSPERALELFNAAPEKNT 60
           ML  Q PLKSVLV IGRHGS+LQ V+LSS T D+LITTVLNC+SP++ALELFNAAPEKNT
Sbjct: 1   MLMNQLPLKSVLVHIGRHGSILQAVALSSSTPDNLITTVLNCKSPKKALELFNAAPEKNT 60

Query: 61  QLYSAIIHVLVGSKLFSHARCLLKGLVENLLRKSHKRYHVCQLAFNALSQLKSSKFTPNV 120
           +LYSAIIHVLVGSKLFSHARCLLK L+++LL KS + YHVCQLAFNALS LK+SKF+PNV
Sbjct: 61  RLYSAIIHVLVGSKLFSHARCLLKELIQDLLVKSRRPYHVCQLAFNALSSLKTSKFSPNV 120

Query: 121 YGELLIVLSKMGLVEEALWMYRKVGAALAKQACNVLLDVLVKTGKFELLWRIFEEMISNG 180
           Y EL+IVLSKMGLV+EALWMYRKVG A+A+QACNVLLDVLVKTG+FELLW I+EEM+SNG
Sbjct: 121 YSELIIVLSKMGLVDEALWMYRKVGVAVARQACNVLLDVLVKTGRFELLWGIYEEMVSNG 180

Query: 181 LSPDVITYGILIDGCCRQGDLLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSENKMEEAE 240
           LSPDVITYGILIDG CRQGDLLRAHE+FDEMR KGIEPTVVVYTILIRGLCSENKMEEAE
Sbjct: 181 LSPDVITYGILIDGRCRQGDLLRAHEIFDEMRVKGIEPTVVVYTILIRGLCSENKMEEAE 240

Query: 241 SMHRAMREVGVLPNVYTYNTLMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFGILIDGL 300
           S+HR MRE+GVLPNVYTYNTLM+G+CK+AN KQALRLY +M+GE LVPD+VTFGILIDGL
Sbjct: 241 SIHRLMRELGVLPNVYTYNTLMNGHCKVANVKQALRLYHNMLGEDLVPDNVTFGILIDGL 300

Query: 301 CKFGDIKAARNLSVNIVKFSVTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERFKVSPDV 360
           CKFGDIKAARNLSVN+VKFSVTPSIAVYNSLIDGYC+AGD+SEA+AFL ELERFKVSPDV
Sbjct: 301 CKFGDIKAARNLSVNMVKFSVTPSIAVYNSLIDGYCKAGDISEAMAFLSELERFKVSPDV 360

Query: 361 ITYSILIRGLCSAGRTEEADNIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKALEICSR 420
           +TYSILIRGLCSAGR EEADN+ EKMMK+GIPANSVTYNSLI+GCCKEGNM KALEICSR
Sbjct: 361 VTYSILIRGLCSAGRIEEADNMLEKMMKEGIPANSVTYNSLIDGCCKEGNMNKALEICSR 420

Query: 421 MTENGVEPNVITFSILIDGYCKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDGHCKYGS 480
           M ENGVEPNVITFS+LIDGYCKIRN+ AAMGIYSEM IKSLSPDVVAYTAMIDGHCK+GS
Sbjct: 421 MIENGVEPNVITFSMLIDGYCKIRNVEAAMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGS 480

Query: 481 MKEALKLYSDMLDNGLTPNCYTISCLLDGLCKDGRISNALKLFTENTGFQTTRCDVNAGG 540
           MKEALKLY+DMLDNGLTPN YT+SCLLDGLCKDGR+S+AL+LFTE   F TT+C      
Sbjct: 481 MKEALKLYNDMLDNGLTPNSYTLSCLLDGLCKDGRVSDALELFTEKAEFGTTKC------ 540

Query: 541 SKPSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYFQVKRIL 600
            K SFTNHVVYTALIHGLCEDGQIFKAAKLFSDMR YGLQPDEVIYVVML+GYFQVKRIL
Sbjct: 541 -KLSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRSYGLQPDEVIYVVMLKGYFQVKRIL 600

Query: 601 DVMMLHADMLKFGVIPNSAIYSILCKGYRESGFLKSALNCSKDLEELH 649
           D+ MLHADMLKFG+IPNSAIYS L KGYRESGFLKSALNCSK+L+EL+
Sbjct: 601 DMTMLHADMLKFGIIPNSAIYSTLSKGYRESGFLKSALNCSKELKELY 641

BLAST of HG10022270 vs. NCBI nr
Match: XP_022959787.1 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g61400 [Cucurbita moschata])

HSP 1 Score: 1111.3 bits (2873), Expect = 0.0e+00
Identity = 548/648 (84.57%), Positives = 599/648 (92.44%), Query Frame = 0

Query: 1   MLRTQFPLKSVLVRIGRHGSMLQVVSLSSLTSDSLITTVLNCRSPERALELFNAAPEKNT 60
           ML  Q PL+SVLV IGR+GS+LQ V+LSS T DSLITTVLNC+SP++ALELFNAAPEKNT
Sbjct: 33  MLMNQLPLRSVLVHIGRYGSILQAVALSSSTPDSLITTVLNCKSPQKALELFNAAPEKNT 92

Query: 61  QLYSAIIHVLVGSKLFSHARCLLKGLVENLLRKSHKRYHVCQLAFNALSQLKSSKFTPNV 120
           +LYSAIIHVLVGSKLFSHARCLLK L+++LL KS + YHVCQLAFN LS LK+SKF+PNV
Sbjct: 93  RLYSAIIHVLVGSKLFSHARCLLKELIQDLLVKSRRPYHVCQLAFNVLSSLKTSKFSPNV 152

Query: 121 YGELLIVLSKMGLVEEALWMYRKVGAALAKQACNVLLDVLVKTGKFELLWRIFEEMISNG 180
           Y EL+IVLSKMGLV+EALWMYRKVG A+A+QACNVLLDVLVKTG+FELLW I+EEM+SNG
Sbjct: 153 YSELIIVLSKMGLVDEALWMYRKVGVAVARQACNVLLDVLVKTGRFELLWGIYEEMVSNG 212

Query: 181 LSPDVITYGILIDGCCRQGDLLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSENKMEEAE 240
           LSPDVITYGILIDG CRQGDLLRAHE+FDEMR KGIEPTVVVYTILIRGLCSENKMEEAE
Sbjct: 213 LSPDVITYGILIDGRCRQGDLLRAHEIFDEMRVKGIEPTVVVYTILIRGLCSENKMEEAE 272

Query: 241 SMHRAMREVGVLPNVYTYNTLMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFGILIDGL 300
           S+HR MRE+GVLPNVYTYNTLM+G+CK+AN KQALRLY DM+GE LVPD+VTFGILIDGL
Sbjct: 273 SIHRLMRELGVLPNVYTYNTLMNGHCKVANVKQALRLYHDMLGEDLVPDNVTFGILIDGL 332

Query: 301 CKFGDIKAARNLSVNIVKFSVTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERFKVSPDV 360
           CKFGDIKAARNL VN+VKFSVTPSIAVYNSLIDGYC+AGD+SEA+AFL ELERFKVSPDV
Sbjct: 333 CKFGDIKAARNLLVNMVKFSVTPSIAVYNSLIDGYCKAGDISEAMAFLSELERFKVSPDV 392

Query: 361 ITYSILIRGLCSAGRTEEADNIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKALEICSR 420
           +TYSILIRG CSAGR EEADN+ EKMMK+GIPANSVTYNSLI+GCCKEGNM KALEICSR
Sbjct: 393 VTYSILIRGFCSAGRIEEADNMLEKMMKEGIPANSVTYNSLIDGCCKEGNMNKALEICSR 452

Query: 421 MTENGVEPNVITFSILIDGYCKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDGHCKYGS 480
           M ENGVEPNVITFS+LIDGYCKIRN+ AAMGIYSEM IKSLSPDVVAYTAMIDGHCK+GS
Sbjct: 453 MIENGVEPNVITFSMLIDGYCKIRNIEAAMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGS 512

Query: 481 MKEALKLYSDMLDNGLTPNCYTISCLLDGLCKDGRISNALKLFTENTGFQTTRCDVNAGG 540
           MKEALKLY+DML NGLTPN YT+SCLLDGLCKDGR+S+AL+LFTE   F TT+C      
Sbjct: 513 MKEALKLYNDMLQNGLTPNSYTLSCLLDGLCKDGRVSDALELFTEKAEFGTTKC------ 572

Query: 541 SKPSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYFQVKRIL 600
            K SFTNHVVYTALIHGLCEDGQIFKAAKLFSDMR YGLQPDEVIYVVML+GYFQVKRIL
Sbjct: 573 -KLSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRSYGLQPDEVIYVVMLKGYFQVKRIL 632

Query: 601 DVMMLHADMLKFGVIPNSAIYSILCKGYRESGFLKSALNCSKDLEELH 649
           D+ MLHADMLKFG+IPNSAIYS L KGYRESGFLKSALNCSK+LEEL+
Sbjct: 633 DMTMLHADMLKFGIIPNSAIYSTLSKGYRESGFLKSALNCSKELEELY 673

BLAST of HG10022270 vs. NCBI nr
Match: XP_011655326.1 (pentatricopeptide repeat-containing protein At5g61400 [Cucumis sativus] >XP_011655327.1 pentatricopeptide repeat-containing protein At5g61400 [Cucumis sativus] >XP_011655328.1 pentatricopeptide repeat-containing protein At5g61400 [Cucumis sativus] >KGN51232.1 hypothetical protein Csa_008737 [Cucumis sativus])

HSP 1 Score: 1086.2 bits (2808), Expect = 0.0e+00
Identity = 547/646 (84.67%), Positives = 586/646 (90.71%), Query Frame = 0

Query: 1   MLRTQFPLKSVLVRIGRHGSMLQVVSLSSLTSDSLITTVLNCRSPERALELFNAAPEKNT 60
           ML TQFPLKSVLVRIG +G+MLQVVSLSSLT DSLITTVLNCRSP +ALE FNAAPEKN 
Sbjct: 12  MLMTQFPLKSVLVRIGLNGTMLQVVSLSSLTPDSLITTVLNCRSPWKALEFFNAAPEKNI 71

Query: 61  QLYSAIIHVLVGSKLFSHARCLLKGLVENLLRKSHKRYHVCQLAFNALSQLKSSKFTPNV 120
           QLYSAIIHVLVGSKL SHAR LL  LV+NL+ KSHK YH CQLAF+ LS+LKSSKFTPNV
Sbjct: 72  QLYSAIIHVLVGSKLLSHARYLLNDLVQNLV-KSHKPYHACQLAFSELSRLKSSKFTPNV 131

Query: 121 YGELLIVLSKMGLVEEALWMYRKVGAALAKQACNVLLDVLVKTGKFELLWRIFEEMISNG 180
           YGEL+IVL KM LVEEAL MY KVGAAL  QACNVLL VLVKTG+FELLWRI+EEMISNG
Sbjct: 132 YGELIIVLCKMELVEEALSMYHKVGAALTIQACNVLLYVLVKTGRFELLWRIYEEMISNG 191

Query: 181 LSPDVITYGILIDGCCRQGDLLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSENKMEEAE 240
           LSP VIT+G LIDGCCRQGDLLRA EMFDEMR KGI PTV+VYTILIRGLCS+NK+EEAE
Sbjct: 192 LSPSVITFGTLIDGCCRQGDLLRAQEMFDEMRVKGIVPTVIVYTILIRGLCSDNKIEEAE 251

Query: 241 SMHRAMREVGVLPNVYTYNTLMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFGILIDGL 300
           SMHRAMREVGV PNVYTYNTLMDGYCKLANAKQALRLYQDM+GEGLVPD VTFGILIDGL
Sbjct: 252 SMHRAMREVGVYPNVYTYNTLMDGYCKLANAKQALRLYQDMLGEGLVPDVVTFGILIDGL 311

Query: 301 CKFGDIKAARNLSVNIVKFSVTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERFKVSPDV 360
           CKFG++KAARNL VN++KFSVTP+IAVYNSLID YC+ GDVSEA+A  LELERF+VSPDV
Sbjct: 312 CKFGEMKAARNLFVNMIKFSVTPNIAVYNSLIDAYCKVGDVSEAMALFLELERFEVSPDV 371

Query: 361 ITYSILIRGLCSAGRTEEADNIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKALEICSR 420
            TYSILIRGLCS  RTEEA NIFEKM K+GI ANSVTYNSLI+GCCKEG M KALEICS+
Sbjct: 372 FTYSILIRGLCSVSRTEEAGNIFEKMTKEGILANSVTYNSLIDGCCKEGKMDKALEICSQ 431

Query: 421 MTENGVEPNVITFSILIDGYCKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDGHCKYGS 480
           MTENGVEPNVITFS LIDGYCKIRNL+AAMGIYSEMVIKSLSPDVV YTAMIDGHCKYGS
Sbjct: 432 MTENGVEPNVITFSTLIDGYCKIRNLQAAMGIYSEMVIKSLSPDVVTYTAMIDGHCKYGS 491

Query: 481 MKEALKLYSDMLDNGLTPNCYTISCLLDGLCKDGRISNALKLFTENTGFQTTRCDVNAGG 540
           MKEALKLYSDMLDNG+TPNCYTISCLLDGLCKDG+IS+AL+LFTE   FQT RC+V+AGG
Sbjct: 492 MKEALKLYSDMLDNGITPNCYTISCLLDGLCKDGKISDALELFTEKIEFQTPRCNVDAGG 551

Query: 541 SKPSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYFQVKRIL 600
           SKPS TNHV YTALIHGLC+DGQ  KA KLFSDMR YGLQPDEVIYVVML+G FQVK IL
Sbjct: 552 SKPSLTNHVAYTALIHGLCQDGQFSKAVKLFSDMRRYGLQPDEVIYVVMLRGLFQVKYIL 611

Query: 601 DVMMLHADMLKFGVIPNSAIYSILCKGYRESGFLKSALNCSKDLEE 647
             MMLHADMLKFGVIPNSA++ ILC+ Y+ESGFLKSA NCSKDLEE
Sbjct: 612 --MMLHADMLKFGVIPNSAVHVILCECYQESGFLKSAQNCSKDLEE 654

BLAST of HG10022270 vs. ExPASy Swiss-Prot
Match: Q9FLJ4 (Pentatricopeptide repeat-containing protein At5g61400 OS=Arabidopsis thaliana OX=3702 GN=At5g61400 PE=2 SV=1)

HSP 1 Score: 573.5 bits (1477), Expect = 2.9e-162
Identity = 287/618 (46.44%), Positives = 411/618 (66.50%), Query Frame = 0

Query: 26  SLSSLTSDSLITTVLNCRSPERALELF------NAAPEKNTQLYSAIIHVLVGSKLFSHA 85
           S SS +S SL   +L CRS E A +LF        +   + Q +SA+IHVL G+  ++ A
Sbjct: 35  SASSFSSSSLAEAILKCRSAEEAFKLFETSSRSRVSKSNDLQSFSAVIHVLTGAHKYTLA 94

Query: 86  RCLLKGLVENLLRKSHKRYHVCQLAFNALSQLKSSKFTPNVYGELLIVLSKMGLVEEALW 145
           RCL+K L+E L R S    ++    FNAL  ++S KF+  V+  L++   +MGL EEALW
Sbjct: 95  RCLIKSLIERLKRHSEPS-NMSHRLFNALEDIQSPKFSIGVFSLLIMEFLEMGLFEEALW 154

Query: 146 MYRKVGAALAKQACNVLLDVLVKTGKFELLWRIFEEMISNGLSPDVITYGILIDGCCRQG 205
           + R++  +   +AC  +L+ LV+  +F+ +W  ++ MIS GL PDV  Y +L   C +QG
Sbjct: 155 VSREMKCSPDSKACLSILNGLVRRRRFDSVWVDYQLMISRGLVPDVHIYFVLFQCCFKQG 214

Query: 206 DLLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSENKMEEAESMHRAMREVGVLPNVYTYN 265
              +  ++ DEM + GI+P V +YTI I  LC +NKMEEAE M   M++ GVLPN+YTY+
Sbjct: 215 LYSKKEKLLDEMTSLGIKPNVYIYTIYILDLCRDNKMEEAEKMFELMKKHGVLPNLYTYS 274

Query: 266 TLMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFGILIDGLCKFGDIKAARNLSVNIVKF 325
            ++DGYCK  N +QA  LY++++   L+P+ V FG L+DG CK  ++  AR+L V++VKF
Sbjct: 275 AMIDGYCKTGNVRQAYGLYKEILVAELLPNVVVFGTLVDGFCKARELVTARSLFVHMVKF 334

Query: 326 SVTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERFKVSPDVITYSILIRGLCSAGRTEEA 385
            V P++ VYN LI G+C++G++ EA+  L E+E   +SPDV TY+ILI GLC   +  EA
Sbjct: 335 GVDPNLYVYNCLIHGHCKSGNMLEAVGLLSEMESLNLSPDVFTYTILINGLCIEDQVAEA 394

Query: 386 DNIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKALEICSRMTENGVEPNVITFSILIDG 445
           + +F+KM  + I  +S TYNSLI+G CKE NM++AL++CS MT +GVEPN+ITFS LIDG
Sbjct: 395 NRLFQKMKNERIFPSSATYNSLIHGYCKEYNMEQALDLCSEMTASGVEPNIITFSTLIDG 454

Query: 446 YCKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDGHCKYGSMKEALKLYSDMLDNGLTPN 505
           YC +R+++AAMG+Y EM IK + PDVV YTA+ID H K  +MKEAL+LYSDML+ G+ PN
Sbjct: 455 YCNVRDIKAAMGLYFEMTIKGIVPDVVTYTALIDAHFKEANMKEALRLYSDMLEAGIHPN 514

Query: 506 CYTISCLLDGLCKDGRISNALKLFTENTGFQTTRCDVNAGGSKPSFTNHVVYTALIHGLC 565
            +T +CL+DG  K+GR+S A+  + EN               + S  NHV +T LI GLC
Sbjct: 515 DHTFACLVDGFWKEGRLSVAIDFYQEN-------------NQQRSCWNHVGFTCLIEGLC 574

Query: 566 EDGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYFQVKRILDVMMLHADMLKFGVIPNSA 625
           ++G I +A++ FSDMR  G+ PD   YV ML+G+ Q KRI D MML  DM+K G++PN  
Sbjct: 575 QNGYILRASRFFSDMRSCGITPDICSYVSMLKGHLQEKRITDTMMLQCDMIKTGILPNLL 634

Query: 626 IYSILCKGYRESGFLKSA 638
           +  +L + Y+ +G++KSA
Sbjct: 635 VNQLLARFYQANGYVKSA 638

BLAST of HG10022270 vs. ExPASy Swiss-Prot
Match: P0C894 (Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis thaliana OX=3702 GN=At2g02150 PE=3 SV=1)

HSP 1 Score: 297.4 bits (760), Expect = 4.0e-79
Identity = 180/605 (29.75%), Positives = 307/605 (50.74%), Query Frame = 0

Query: 45  PERALELFNAAPEKN-----TQLYSAIIHVLVGSKLFSHARCLLKGLVENLLRKSHKRYH 104
           P+ A + F  +  +N      + Y  + H+L  ++++  A  +LK   E +L K+     
Sbjct: 122 PKLAFKFFKWSMTRNGFKHSVESYCIVAHILFCARMYYDANSVLK---EMVLSKAD---- 181

Query: 105 VCQLAFNALSQLKSSKFTP-NVYGELLIVLSKMGLVEEALWMYRKVGAALA---KQACNV 164
            C + F+ L   ++       V+  L  VL  +G++EEA+  + K+         ++CN 
Sbjct: 182 -CDV-FDVLWSTRNVCVPGFGVFDALFSVLIDLGMLEEAIQCFSKMKRFRVFPKTRSCNG 241

Query: 165 LLDVLVKTGKFELLWRIFEEMISNGLSPDVITYGILIDGCCRQGDLLRAHEMFDEMRAKG 224
           LL    K GK + + R F++MI  G  P V TY I+ID  C++GD+  A  +F+EM+ +G
Sbjct: 242 LLHRFAKLGKTDDVKRFFKDMIGAGARPTVFTYNIMIDCMCKEGDVEAARGLFEEMKFRG 301

Query: 225 IEPTVVVYTILIRGLCSENKMEEAESMHRAMREVGVLPNVYTYNTLMDGYCKLANAKQAL 284
           + P  V Y  +I G     ++++       M+++   P+V TYN L++ +CK       L
Sbjct: 302 LVPDTVTYNSMIDGFGKVGRLDDTVCFFEEMKDMCCEPDVITYNALINCFCKFGKLPIGL 361

Query: 285 RLYQDMIGEGLVPDDVTFGILIDGLCKFGDIKAARNLSVNIVKFSVTPSIAVYNSLIDGY 344
             Y++M G GL P+ V++  L+D  CK G ++ A    V++ +  + P+   Y SLID  
Sbjct: 362 EFYREMKGNGLKPNVVSYSTLVDAFCKEGMMQQAIKFYVDMRRVGLVPNEYTYTSLIDAN 421

Query: 345 CRAGDVSEAIAFLLELERFKVSPDVITYSILIRGLCSAGRTEEADNIFEKMMKDGIPANS 404
           C+ G++S+A     E+ +  V  +V+TY+ LI GLC A R +EA+ +F KM   G+  N 
Sbjct: 422 CKIGNLSDAFRLGNEMLQVGVEWNVVTYTALIDGLCDAERMKEAEELFGKMDTAGVIPNL 481

Query: 405 VTYNSLINGCCKEGNMKKALEICSRMTENGVEPNVITFSILIDGYCKIRNLRAAMGIYSE 464
            +YN+LI+G  K  NM +ALE+ + +   G++P+++ +   I G C +  + AA  + +E
Sbjct: 482 ASYNALIHGFVKAKNMDRALELLNELKGRGIKPDLLLYGTFIWGLCSLEKIEAAKVVMNE 541

Query: 465 MVIKSLSPDVVAYTAMIDGHCKYGSMKEALKLYSDMLDNGLTPNCYTISCLLDGLCKDGR 524
           M    +  + + YT ++D + K G+  E L L  +M +  +     T   L+DGLCK+  
Sbjct: 542 MKECGIKANSLIYTTLMDAYFKSGNPTEGLHLLDEMKELDIEVTVVTFCVLIDGLCKNKL 601

Query: 525 ISNALKLF---TENTGFQTTRCDVNAGGSKPSFTNHVVYTALIHGLCEDGQIFKAAKLFS 584
           +S A+  F   + + G Q                N  ++TA+I GLC+D Q+  A  LF 
Sbjct: 602 VSKAVDYFNRISNDFGLQ---------------ANAAIFTAMIDGLCKDNQVEAATTLFE 661

Query: 585 DMRCYGLQPDEVIYVVMLQGYFQVKRILDVMMLHADMLKFGVIPNSAIYSILCKGYRESG 638
            M   GL PD   Y  ++ G F+   +L+ + L   M + G+  +   Y+ L  G     
Sbjct: 662 QMVQKGLVPDRTAYTSLMDGNFKQGNVLEALALRDKMAEIGMKLDLLAYTSLVWGLSHCN 702

BLAST of HG10022270 vs. ExPASy Swiss-Prot
Match: Q9LSL9 (Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX=3702 GN=At5g65560 PE=2 SV=1)

HSP 1 Score: 293.1 bits (749), Expect = 7.6e-78
Identity = 173/532 (32.52%), Positives = 273/532 (51.32%), Query Frame = 0

Query: 121 YGELLIVLSKMGLVEEALWMYRKVGAALAKQAC------NVLLDVLVKTGKFELLWRIFE 180
           Y  LL  L++ GLV+E   +Y ++   L  + C      N +++   K G  E   +   
Sbjct: 186 YNTLLNSLARFGLVDEMKQVYMEM---LEDKVCPNIYTYNKMVNGYCKLGNVEEANQYVS 245

Query: 181 EMISNGLSPDVITYGILIDGCCRQGDLLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSEN 240
           +++  GL PD  TY  LI G C++ DL  A ++F+EM  KG     V YT LI GLC   
Sbjct: 246 KIVEAGLDPDFFTYTSLIMGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVAR 305

Query: 241 KMEEAESMHRAMREVGVLPNVYTYNTLMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFG 300
           +++EA  +   M++    P V TY  L+   C      +AL L ++M   G+ P+  T+ 
Sbjct: 306 RIDEAMDLFVKMKDDECFPTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYT 365

Query: 301 ILIDGLCKFGDIKAARNLSVNIVKFSVTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERF 360
           +LID LC     + AR L   +++  + P++  YN+LI+GYC+ G + +A+  +  +E  
Sbjct: 366 VLIDSLCSQCKFEKARELLGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDVVELMESR 425

Query: 361 KVSPDVITYSILIRGLCSAGRTEEADNIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKA 420
           K+SP+  TY+ LI+G C +    +A  +  KM++  +  + VTYNSLI+G C+ GN   A
Sbjct: 426 KLSPNTRTYNELIKGYCKS-NVHKAMGVLNKMLERKVLPDVVTYNSLIDGQCRSGNFDSA 485

Query: 421 LEICSRMTENGVEPNVITFSILIDGYCKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDG 480
             + S M + G+ P+  T++ +ID  CK + +  A  ++  +  K ++P+VV YTA+IDG
Sbjct: 486 YRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALIDG 545

Query: 481 HCKYGSMKEALKLYSDMLDNGLTPNCYTISCLLDGLCKDGRISNALKLFTENTGFQTTRC 540
           +CK G + EA  +   ML     PN  T + L+ GLC DG++  A  L  +         
Sbjct: 546 YCKAGKVDEAHLMLEKMLSKNCLPNSLTFNALIHGLCADGKLKEATLLEEKMVKI----- 605

Query: 541 DVNAGGSKPSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYF 600
                G +P+ +     T LIH L +DG    A   F  M   G +PD   Y   +Q Y 
Sbjct: 606 -----GLQPTVSTD---TILIHRLLKDGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYC 665

Query: 601 QVKRILDVMMLHADMLKFGVIPNSAIYSILCKGYRESGFLKSALNCSKDLEE 647
           +  R+LD   + A M + GV P+   YS L KGY + G    A +  K + +
Sbjct: 666 REGRLLDAEDMMAKMRENGVSPDLFTYSSLIKGYGDLGQTNFAFDVLKRMRD 700

BLAST of HG10022270 vs. ExPASy Swiss-Prot
Match: Q0WVK7 (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 287.0 bits (733), Expect = 5.5e-76
Identity = 173/608 (28.45%), Positives = 299/608 (49.18%), Query Frame = 0

Query: 32  SDSLITTVLNCRSPER-ALELFNAA---PEKNTQLYSAIIHVLVGSKLFSHARCLLKGLV 91
           +D LI  ++  +   R  L+ F+ A    + N +    +IH+ V SK    A+ L+    
Sbjct: 87  TDHLIWVLMKIKCDYRLVLDFFDWARSRRDSNLESLCIVIHLAVASKDLKVAQSLISSFW 146

Query: 92  ENLLRKSHKRYHVCQLAFNALSQLKSSKFTPNVYGELLIVLSKMGLVEEALWMYRKV--- 151
           E    K +      Q     +   K     P V+     VL   GL+ EA  ++ K+   
Sbjct: 147 ER--PKLNVTDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNY 206

Query: 152 GAALAKQACNVLLDVL----VKTGKFELLWRIFEEMISNGLSPDVITYGILIDGCCRQGD 211
           G  L+  +CNV L  L     KT    +++R F E+   G+  +V +Y I+I   C+ G 
Sbjct: 207 GLVLSVDSCNVYLTRLSKDCYKTATAIIVFREFPEV---GVCWNVASYNIVIHFVCQLGR 266

Query: 212 LLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSENKMEEAESMHRAMREVGVLPNVYTYNT 271
           +  AH +   M  KG  P V+ Y+ ++ G C   ++++   +   M+  G+ PN Y Y +
Sbjct: 267 IKEAHHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGS 326

Query: 272 LMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFGILIDGLCKFGDIKAARNLSVNIVKFS 331
           ++   C++    +A   + +MI +G++PD V +  LIDG CK GDI+AA      +    
Sbjct: 327 IIGLLCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRD 386

Query: 332 VTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERFKVSPDVITYSILIRGLCSAGRTEEAD 391
           +TP +  Y ++I G+C+ GD+ EA     E+    + PD +T++ LI G C AG  ++A 
Sbjct: 387 ITPDVLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAF 446

Query: 392 NIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKALEICSRMTENGVEPNVITFSILIDGY 451
            +   M++ G   N VTY +LI+G CKEG++  A E+   M + G++PN+ T++ +++G 
Sbjct: 447 RVHNHMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGL 506

Query: 452 CKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDGHCKYGSMKEALKLYSDMLDNGLTPNC 511
           CK  N+  A+ +  E     L+ D V YT ++D +CK G M +A ++  +ML  GL P  
Sbjct: 507 CKSGNIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTI 566

Query: 512 YTISCLLDGLCKDGRISNALKLFTENTGFQTTRCDVNAGGSKPSFTNHVVYTALIHGLCE 571
            T + L++G C  G + +  KL             +N   +K    N   + +L+   C 
Sbjct: 567 VTFNVLMNGFCLHGMLEDGEKL-------------LNWMLAKGIAPNATTFNSLVKQYCI 626

Query: 572 DGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYFQVKRILDVMMLHADMLKFGVIPNSAI 629
              +  A  ++ DM   G+ PD   Y  +++G+ + + + +   L  +M   G   + + 
Sbjct: 627 RNNLKAATAIYKDMCSRGVGPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKGFSVSVST 676

BLAST of HG10022270 vs. ExPASy Swiss-Prot
Match: Q9LFC5 (Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana OX=3702 GN=At5g01110 PE=2 SV=1)

HSP 1 Score: 280.8 bits (717), Expect = 3.9e-74
Identity = 169/613 (27.57%), Positives = 305/613 (49.76%), Query Frame = 0

Query: 58  KNTQL-YSAIIHVLVGSKLFSHAR-CLLKGLVENLLRKSHKRYHVCQLAFNALSQLKSSK 117
           K+T L  SA+IH+LV S   S A+ CLL+     + R    R  +     +  S   S+ 
Sbjct: 110 KHTSLSLSAMIHILVRSGRLSDAQSCLLR----MIRRSGVSRLEIVNSLDSTFSNCGSND 169

Query: 118 FTPNVYGELLIVLSKMGLVEEALWMYRKVGAALAKQACNVLLDVLVKTGKFELLWRIFEE 177
              ++     +   K+    EA  + R  G  ++  ACN L+  LV+ G  EL W +++E
Sbjct: 170 SVFDLLIRTYVQARKLREAHEAFTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQE 229

Query: 178 MISNGLSPDVITYGILIDGCCRQGDLLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSENK 237
           +  +G+  +V T  I+++  C+ G + +      +++ KG+ P +V Y  LI    S+  
Sbjct: 230 ISRSGVGINVYTLNIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKGL 289

Query: 238 MEEAESMHRAMREVGVLPNVYTYNTLMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFGI 297
           MEEA  +  AM   G  P VYTYNT+++G CK    ++A  ++ +M+  GL PD  T+  
Sbjct: 290 MEEAFELMNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEMLRSGLSPDSTTYRS 349

Query: 298 LIDGLCKFGDIKAARNLSVNIVKFSVTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERFK 357
           L+   CK GD+     +  ++    V P +  ++S++  + R+G++ +A+ +   ++   
Sbjct: 350 LLMEACKKGDVVETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEAG 409

Query: 358 VSPDVITYSILIRGLCSAGRTEEADNIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKAL 417
           + PD + Y+ILI+G C  G    A N+  +M++ G   + VTYN++++G CK   + +A 
Sbjct: 410 LIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGEAD 469

Query: 418 EICSRMTENGVEPNVITFSILIDGYCKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDGH 477
           ++ + MTE  + P+  T +ILIDG+CK+ NL+ AM ++ +M  K +  DVV Y  ++DG 
Sbjct: 470 KLFNEMTERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRIRLDVVTYNTLLDGF 529

Query: 478 CKYGSMKEALKLYSDMLDNGLTPNCYTISCLLDGLCKDGRISNALKLFTE--NTGFQTT- 537
            K G +  A ++++DM+   + P   + S L++ LC  G ++ A +++ E  +   + T 
Sbjct: 530 GKVGDIDTAKEIWADMVSKEILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPTV 589

Query: 538 ---------RCDVNAGGSKPSFTNHVV----------YTALIHGLCEDGQIFKAAKLFSD 597
                     C         SF   ++          Y  LI+G   +  + KA  L   
Sbjct: 590 MICNSMIKGYCRSGNASDGESFLEKMISEGFVPDCISYNTLIYGFVREENMSKAFGLVKK 649

Query: 598 M--RCYGLQPDEVIYVVMLQGYFQVKRILDVMMLHADMLKFGVIPNSAIYSILCKGYRES 645
           M     GL PD   Y  +L G+ +  ++ +  ++   M++ GV P+ + Y+ +  G+   
Sbjct: 650 MEEEQGGLVPDVFTYNSILHGFCRQNQMKEAEVVLRKMIERGVNPDRSTYTCMINGFVSQ 709
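Each hit above opens with two summary lines: a score line (bit score, raw score, E-value) and an identity line (identical and similar positions over the aligned length). The percentages are simply those counts divided by the aligned length. The following Python sketch shows one way to read the two lines and recompute the percentages. The function name `parse_hsp` and the returned dictionary keys are hypothetical, chosen only for illustration, and the patterns assume the exact line layout shown on this page.

```python
import re

# Patterns assume the layout used on this page, e.g.
#   "HSP 1 Score: 280.8 bits (717), Expect = 3.9e-74"
#   "Identity = 169/613 (27.57%), Positives = 305/613 (49.76%), Query Frame = 0"
SCORE_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*Expect\s*=\s*(?P<evalue>\S+)"
)
# "Posi?tives" also accepts the misspelling "Postives" seen in some exports.
IDENT_RE = re.compile(
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<length>\d+).*?Posi?tives\s*=\s*(?P<pos>\d+)/\d+"
)


def parse_hsp(score_line, identity_line):
    """Return the HSP statistics from its two summary lines (hypothetical helper)."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP summary")
    length = int(i["length"])
    return {
        "bits": float(s["bits"]),
        "raw_score": int(s["raw"]),
        "evalue": float(s["evalue"]),
        "aligned_length": length,
        # Percentages are recomputed from the counts rather than read from the text.
        "identity_pct": round(100 * int(i["ident"]) / length, 2),
        "positives_pct": round(100 * int(i["pos"]) / length, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 280.8 bits (717), Expect = 3.9e-74",
    "Identity = 169/613 (27.57%), Positives = 305/613 (49.76%), Query Frame = 0",
)
print(hsp)
```

Recomputing the percentages from the raw counts gives a quick consistency check against the values printed in parentheses.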

BLAST of HG10022270 vs. ExPASy TrEMBL
Match: A0A6J1JTD6 (pentatricopeptide repeat-containing protein At5g61400 OS=Cucurbita maxima OX=3661 GN=LOC111487325 PE=4 SV=1)

HSP 1 Score: 1119.0 bits (2893), Expect = 0.0e+00
Identity = 548/648 (84.57%), Positives = 603/648 (93.06%), Query Frame = 0

Query: 1   MLRTQFPLKSVLVRIGRHGSMLQVVSLSSLTSDSLITTVLNCRSPERALELFNAAPEKNT 60
           ML  Q PLKSVLV IGR+GS+LQ V+LSS T DSLITTVLNC+SP++ALELFNAAPEKNT
Sbjct: 1   MLMNQLPLKSVLVHIGRYGSILQAVALSSSTPDSLITTVLNCKSPKKALELFNAAPEKNT 60

Query: 61  QLYSAIIHVLVGSKLFSHARCLLKGLVENLLRKSHKRYHVCQLAFNALSQLKSSKFTPNV 120
           +LYSAIIHVLVGSKLFSHARCLLK L+++LL KS + YHVCQLAFNALS LK+SKF+PNV
Sbjct: 61  RLYSAIIHVLVGSKLFSHARCLLKDLIQDLLVKSRRPYHVCQLAFNALSSLKTSKFSPNV 120

Query: 121 YGELLIVLSKMGLVEEALWMYRKVGAALAKQACNVLLDVLVKTGKFELLWRIFEEMISNG 180
           Y EL+IVLSKMGLV+EALWMYRKVG A+A+QACNVLLDVLVKTG+FELLW I+EEM+SNG
Sbjct: 121 YSELIIVLSKMGLVDEALWMYRKVGVAVARQACNVLLDVLVKTGRFELLWGIYEEMVSNG 180

Query: 181 LSPDVITYGILIDGCCRQGDLLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSENKMEEAE 240
           LSPDVITYGILIDG CRQGDLLRAHE+FDEMR KGIEPTVVVYTI IRGLCS+NKMEEAE
Sbjct: 181 LSPDVITYGILIDGRCRQGDLLRAHEIFDEMRVKGIEPTVVVYTIFIRGLCSDNKMEEAE 240

Query: 241 SMHRAMREVGVLPNVYTYNTLMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFGILIDGL 300
            +HR MRE+GVLPNVYTYNTLM+G+CK+AN KQALRLY DM+GE LVPD+VTFGILIDGL
Sbjct: 241 GIHRLMRELGVLPNVYTYNTLMNGHCKVANVKQALRLYHDMLGEDLVPDNVTFGILIDGL 300

Query: 301 CKFGDIKAARNLSVNIVKFSVTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERFKVSPDV 360
           CKFGDIKAARNL VN+VKFSVTPSIAVYNSLIDGYC+AGD+SEA+AFL ELERFKVSPDV
Sbjct: 301 CKFGDIKAARNLLVNMVKFSVTPSIAVYNSLIDGYCKAGDISEAMAFLSELERFKVSPDV 360

Query: 361 ITYSILIRGLCSAGRTEEADNIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKALEICSR 420
           +TYSILIRGLCSAGR EEADN+ EKMMK+GIPANSVTYNSLI+GCCKEGNM KALEICSR
Sbjct: 361 VTYSILIRGLCSAGRIEEADNMLEKMMKEGIPANSVTYNSLIDGCCKEGNMNKALEICSR 420

Query: 421 MTENGVEPNVITFSILIDGYCKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDGHCKYGS 480
           M ENGVEPNVITFS+LIDGYCKIRN+ AAMGIYSEM IKSLSPDVVAYTAMIDGHCK+G+
Sbjct: 421 MIENGVEPNVITFSMLIDGYCKIRNIEAAMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGN 480

Query: 481 MKEALKLYSDMLDNGLTPNCYTISCLLDGLCKDGRISNALKLFTENTGFQTTRCDVNAGG 540
           MKEALKLY+DMLDNGLTPN YT+SCLLDGLCKDGR+ +AL+LFTE   F TT+C V+A  
Sbjct: 481 MKEALKLYNDMLDNGLTPNSYTLSCLLDGLCKDGRVLDALELFTEKAEFGTTKCKVDAAE 540

Query: 541 SKPSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYFQVKRIL 600
           SK SFTNHVVYTALIHGLCEDGQIFKAAKLFSDMR YGLQPDEVIYVVML+GYFQVKRIL
Sbjct: 541 SKLSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRSYGLQPDEVIYVVMLKGYFQVKRIL 600

Query: 601 DVMMLHADMLKFGVIPNSAIYSILCKGYRESGFLKSALNCSKDLEELH 649
           D+ MLHADMLKFG++PNSAIYS L KGYRESGFLKSALNCSK+L+EL+
Sbjct: 601 DMTMLHADMLKFGIVPNSAIYSTLSKGYRESGFLKSALNCSKELKELY 648

BLAST of HG10022270 vs. ExPASy TrEMBL
Match: A0A6J1H5I4 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g61400 OS=Cucurbita moschata OX=3662 GN=LOC111460748 PE=4 SV=1)

HSP 1 Score: 1111.3 bits (2873), Expect = 0.0e+00
Identity = 548/648 (84.57%), Positives = 599/648 (92.44%), Query Frame = 0

Query: 1   MLRTQFPLKSVLVRIGRHGSMLQVVSLSSLTSDSLITTVLNCRSPERALELFNAAPEKNT 60
           ML  Q PL+SVLV IGR+GS+LQ V+LSS T DSLITTVLNC+SP++ALELFNAAPEKNT
Sbjct: 33  MLMNQLPLRSVLVHIGRYGSILQAVALSSSTPDSLITTVLNCKSPQKALELFNAAPEKNT 92

Query: 61  QLYSAIIHVLVGSKLFSHARCLLKGLVENLLRKSHKRYHVCQLAFNALSQLKSSKFTPNV 120
           +LYSAIIHVLVGSKLFSHARCLLK L+++LL KS + YHVCQLAFN LS LK+SKF+PNV
Sbjct: 93  RLYSAIIHVLVGSKLFSHARCLLKELIQDLLVKSRRPYHVCQLAFNVLSSLKTSKFSPNV 152

Query: 121 YGELLIVLSKMGLVEEALWMYRKVGAALAKQACNVLLDVLVKTGKFELLWRIFEEMISNG 180
           Y EL+IVLSKMGLV+EALWMYRKVG A+A+QACNVLLDVLVKTG+FELLW I+EEM+SNG
Sbjct: 153 YSELIIVLSKMGLVDEALWMYRKVGVAVARQACNVLLDVLVKTGRFELLWGIYEEMVSNG 212

Query: 181 LSPDVITYGILIDGCCRQGDLLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSENKMEEAE 240
           LSPDVITYGILIDG CRQGDLLRAHE+FDEMR KGIEPTVVVYTILIRGLCSENKMEEAE
Sbjct: 213 LSPDVITYGILIDGRCRQGDLLRAHEIFDEMRVKGIEPTVVVYTILIRGLCSENKMEEAE 272

Query: 241 SMHRAMREVGVLPNVYTYNTLMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFGILIDGL 300
           S+HR MRE+GVLPNVYTYNTLM+G+CK+AN KQALRLY DM+GE LVPD+VTFGILIDGL
Sbjct: 273 SIHRLMRELGVLPNVYTYNTLMNGHCKVANVKQALRLYHDMLGEDLVPDNVTFGILIDGL 332

Query: 301 CKFGDIKAARNLSVNIVKFSVTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERFKVSPDV 360
           CKFGDIKAARNL VN+VKFSVTPSIAVYNSLIDGYC+AGD+SEA+AFL ELERFKVSPDV
Sbjct: 333 CKFGDIKAARNLLVNMVKFSVTPSIAVYNSLIDGYCKAGDISEAMAFLSELERFKVSPDV 392

Query: 361 ITYSILIRGLCSAGRTEEADNIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKALEICSR 420
           +TYSILIRG CSAGR EEADN+ EKMMK+GIPANSVTYNSLI+GCCKEGNM KALEICSR
Sbjct: 393 VTYSILIRGFCSAGRIEEADNMLEKMMKEGIPANSVTYNSLIDGCCKEGNMNKALEICSR 452

Query: 421 MTENGVEPNVITFSILIDGYCKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDGHCKYGS 480
           M ENGVEPNVITFS+LIDGYCKIRN+ AAMGIYSEM IKSLSPDVVAYTAMIDGHCK+GS
Sbjct: 453 MIENGVEPNVITFSMLIDGYCKIRNIEAAMGIYSEMGIKSLSPDVVAYTAMIDGHCKHGS 512

Query: 481 MKEALKLYSDMLDNGLTPNCYTISCLLDGLCKDGRISNALKLFTENTGFQTTRCDVNAGG 540
           MKEALKLY+DML NGLTPN YT+SCLLDGLCKDGR+S+AL+LFTE   F TT+C      
Sbjct: 513 MKEALKLYNDMLQNGLTPNSYTLSCLLDGLCKDGRVSDALELFTEKAEFGTTKC------ 572

Query: 541 SKPSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYFQVKRIL 600
            K SFTNHVVYTALIHGLCEDGQIFKAAKLFSDMR YGLQPDEVIYVVML+GYFQVKRIL
Sbjct: 573 -KLSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRSYGLQPDEVIYVVMLKGYFQVKRIL 632

Query: 601 DVMMLHADMLKFGVIPNSAIYSILCKGYRESGFLKSALNCSKDLEELH 649
           D+ MLHADMLKFG+IPNSAIYS L KGYRESGFLKSALNCSK+LEEL+
Sbjct: 633 DMTMLHADMLKFGIIPNSAIYSTLSKGYRESGFLKSALNCSKELEELY 673

BLAST of HG10022270 vs. ExPASy TrEMBL
Match: A0A0A0KS30 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G496480 PE=4 SV=1)

HSP 1 Score: 1086.2 bits (2808), Expect = 0.0e+00
Identity = 547/646 (84.67%), Positives = 586/646 (90.71%), Query Frame = 0

Query: 1   MLRTQFPLKSVLVRIGRHGSMLQVVSLSSLTSDSLITTVLNCRSPERALELFNAAPEKNT 60
           ML TQFPLKSVLVRIG +G+MLQVVSLSSLT DSLITTVLNCRSP +ALE FNAAPEKN 
Sbjct: 12  MLMTQFPLKSVLVRIGLNGTMLQVVSLSSLTPDSLITTVLNCRSPWKALEFFNAAPEKNI 71

Query: 61  QLYSAIIHVLVGSKLFSHARCLLKGLVENLLRKSHKRYHVCQLAFNALSQLKSSKFTPNV 120
           QLYSAIIHVLVGSKL SHAR LL  LV+NL+ KSHK YH CQLAF+ LS+LKSSKFTPNV
Sbjct: 72  QLYSAIIHVLVGSKLLSHARYLLNDLVQNLV-KSHKPYHACQLAFSELSRLKSSKFTPNV 131

Query: 121 YGELLIVLSKMGLVEEALWMYRKVGAALAKQACNVLLDVLVKTGKFELLWRIFEEMISNG 180
           YGEL+IVL KM LVEEAL MY KVGAAL  QACNVLL VLVKTG+FELLWRI+EEMISNG
Sbjct: 132 YGELIIVLCKMELVEEALSMYHKVGAALTIQACNVLLYVLVKTGRFELLWRIYEEMISNG 191

Query: 181 LSPDVITYGILIDGCCRQGDLLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSENKMEEAE 240
           LSP VIT+G LIDGCCRQGDLLRA EMFDEMR KGI PTV+VYTILIRGLCS+NK+EEAE
Sbjct: 192 LSPSVITFGTLIDGCCRQGDLLRAQEMFDEMRVKGIVPTVIVYTILIRGLCSDNKIEEAE 251

Query: 241 SMHRAMREVGVLPNVYTYNTLMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFGILIDGL 300
           SMHRAMREVGV PNVYTYNTLMDGYCKLANAKQALRLYQDM+GEGLVPD VTFGILIDGL
Sbjct: 252 SMHRAMREVGVYPNVYTYNTLMDGYCKLANAKQALRLYQDMLGEGLVPDVVTFGILIDGL 311

Query: 301 CKFGDIKAARNLSVNIVKFSVTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERFKVSPDV 360
           CKFG++KAARNL VN++KFSVTP+IAVYNSLID YC+ GDVSEA+A  LELERF+VSPDV
Sbjct: 312 CKFGEMKAARNLFVNMIKFSVTPNIAVYNSLIDAYCKVGDVSEAMALFLELERFEVSPDV 371

Query: 361 ITYSILIRGLCSAGRTEEADNIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKALEICSR 420
            TYSILIRGLCS  RTEEA NIFEKM K+GI ANSVTYNSLI+GCCKEG M KALEICS+
Sbjct: 372 FTYSILIRGLCSVSRTEEAGNIFEKMTKEGILANSVTYNSLIDGCCKEGKMDKALEICSQ 431

Query: 421 MTENGVEPNVITFSILIDGYCKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDGHCKYGS 480
           MTENGVEPNVITFS LIDGYCKIRNL+AAMGIYSEMVIKSLSPDVV YTAMIDGHCKYGS
Sbjct: 432 MTENGVEPNVITFSTLIDGYCKIRNLQAAMGIYSEMVIKSLSPDVVTYTAMIDGHCKYGS 491

Query: 481 MKEALKLYSDMLDNGLTPNCYTISCLLDGLCKDGRISNALKLFTENTGFQTTRCDVNAGG 540
           MKEALKLYSDMLDNG+TPNCYTISCLLDGLCKDG+IS+AL+LFTE   FQT RC+V+AGG
Sbjct: 492 MKEALKLYSDMLDNGITPNCYTISCLLDGLCKDGKISDALELFTEKIEFQTPRCNVDAGG 551

Query: 541 SKPSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYFQVKRIL 600
           SKPS TNHV YTALIHGLC+DGQ  KA KLFSDMR YGLQPDEVIYVVML+G FQVK IL
Sbjct: 552 SKPSLTNHVAYTALIHGLCQDGQFSKAVKLFSDMRRYGLQPDEVIYVVMLRGLFQVKYIL 611

Query: 601 DVMMLHADMLKFGVIPNSAIYSILCKGYRESGFLKSALNCSKDLEE 647
             MMLHADMLKFGVIPNSA++ ILC+ Y+ESGFLKSA NCSKDLEE
Sbjct: 612 --MMLHADMLKFGVIPNSAVHVILCECYQESGFLKSAQNCSKDLEE 654

BLAST of HG10022270 vs. ExPASy TrEMBL
Match: A0A5D3C3E1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold98G002130 PE=4 SV=1)

HSP 1 Score: 1083.6 bits (2801), Expect = 0.0e+00
Identity = 545/646 (84.37%), Positives = 584/646 (90.40%), Query Frame = 0

Query: 1   MLRTQFPLKSVLVRIGRHGSMLQVVSLSSLTSDSLITTVLNCRSPERALELFNAAPEKNT 60
           ML TQFPLKSVLVRIG +G+MLQVVSLSSLTSDSLITTVLNCRSP +ALE FNAAPEK  
Sbjct: 1   MLMTQFPLKSVLVRIGLNGTMLQVVSLSSLTSDSLITTVLNCRSPRKALEFFNAAPEKTI 60

Query: 61  QLYSAIIHVLVGSKLFSHARCLLKGLVENLLRKSHKRYHVCQLAFNALSQLKSSKFTPNV 120
           QLYSAIIHVLVGS+L SHAR LLK LV+NL+ KSHK YH CQL F+ LS+LKSSKF+PNV
Sbjct: 61  QLYSAIIHVLVGSELLSHARYLLKDLVQNLV-KSHKPYHACQLVFSELSRLKSSKFSPNV 120

Query: 121 YGELLIVLSKMGLVEEALWMYRKVGAALAKQACNVLLDVLVKTGKFELLWRIFEEMISNG 180
           YGEL+IVL KM LVEEAL MY KVGA L  QACNVLL+VLVKTG+FELLWRI+EEMISNG
Sbjct: 121 YGELIIVLCKMELVEEALSMYHKVGATLTIQACNVLLNVLVKTGRFELLWRIYEEMISNG 180

Query: 181 LSPDVITYGILIDGCCRQGDLLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSENKMEEAE 240
           LSP VIT+G LIDGCCRQGDLLRA EMFDEMR KGI PTVVVYTILIRGLCS++KMEEAE
Sbjct: 181 LSPSVITFGTLIDGCCRQGDLLRAQEMFDEMRVKGIVPTVVVYTILIRGLCSDSKMEEAE 240

Query: 241 SMHRAMREVGVLPNVYTYNTLMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFGILIDGL 300
           SMHRAMREVGV PN+YTYNTLMDGYCKLANAKQALRLYQDM+GEGLVPD VTFGILIDGL
Sbjct: 241 SMHRAMREVGVYPNLYTYNTLMDGYCKLANAKQALRLYQDMLGEGLVPDVVTFGILIDGL 300

Query: 301 CKFGDIKAARNLSVNIVKFSVTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERFKVSPDV 360
           CKFG++KAARNL VN++KF VTP+I VYNSLID YC+ GDVSEA+AF LELER+KVSPDV
Sbjct: 301 CKFGEMKAARNLFVNMIKFCVTPNINVYNSLIDAYCKVGDVSEAMAFFLELERYKVSPDV 360

Query: 361 ITYSILIRGLCSAGRTEEADNIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKALEICSR 420
            TYSILIRGLCS  RTEEA NIFEKM K+GI ANSVTYNSLI+G CKEG M+KALEICS+
Sbjct: 361 FTYSILIRGLCSVTRTEEAGNIFEKMTKEGILANSVTYNSLIDGYCKEGKMEKALEICSQ 420

Query: 421 MTENGVEPNVITFSILIDGYCKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDGHCKYGS 480
           MTENGVEPNVITFS LIDGYCKIRNL+AAMGIYSEMVIKSLSPDVV YTAMIDGHCKYGS
Sbjct: 421 MTENGVEPNVITFSTLIDGYCKIRNLQAAMGIYSEMVIKSLSPDVVTYTAMIDGHCKYGS 480

Query: 481 MKEALKLYSDMLDNGLTPNCYTISCLLDGLCKDGRISNALKLFTENTGFQTTRCDVNAGG 540
           MKEALKLYSDMLDNG+TPNCYTISCLLDGLCKDGRIS+AL+LFTE   FQT RC+V+AGG
Sbjct: 481 MKEALKLYSDMLDNGITPNCYTISCLLDGLCKDGRISDALRLFTEKIEFQTPRCNVDAGG 540

Query: 541 SKPSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYFQVKRIL 600
           SKPS TNHV YTALIHGLC+DGQ FKA KLFSDMR YGLQPDEVIYVVMLQG  QVK IL
Sbjct: 541 SKPSLTNHVAYTALIHGLCQDGQFFKAVKLFSDMRRYGLQPDEVIYVVMLQGLLQVKHIL 600

Query: 601 DVMMLHADMLKFGVIPNSAIYSILCKGYRESGFLKSALNCSKDLEE 647
             MMLHADMLKFG IPNSA+Y ILCK Y+ESGFLKSA NCSKDLEE
Sbjct: 601 --MMLHADMLKFGFIPNSAVYVILCKCYQESGFLKSAQNCSKDLEE 643

BLAST of HG10022270 vs. ExPASy TrEMBL
Match: A0A1S4E4Z4 (pentatricopeptide repeat-containing protein At5g61400 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501683 PE=4 SV=1)

HSP 1 Score: 1080.1 bits (2792), Expect = 0.0e+00
Identity = 543/646 (84.06%), Positives = 583/646 (90.25%), Query Frame = 0

Query: 1   MLRTQFPLKSVLVRIGRHGSMLQVVSLSSLTSDSLITTVLNCRSPERALELFNAAPEKNT 60
           ML TQFPLKSVLVRIG +G+MLQVVSLSSLTSDSL+TTVLNCRSP +ALE FNAAPEK  
Sbjct: 1   MLMTQFPLKSVLVRIGLNGTMLQVVSLSSLTSDSLLTTVLNCRSPRKALEFFNAAPEKTI 60

Query: 61  QLYSAIIHVLVGSKLFSHARCLLKGLVENLLRKSHKRYHVCQLAFNALSQLKSSKFTPNV 120
           QLYSAIIHVLVGS+L SHAR LLK LV+NL+ KSHK YH CQL F+ LS+LKSSKF+PNV
Sbjct: 61  QLYSAIIHVLVGSELLSHARYLLKDLVQNLV-KSHKPYHACQLVFSELSRLKSSKFSPNV 120

Query: 121 YGELLIVLSKMGLVEEALWMYRKVGAALAKQACNVLLDVLVKTGKFELLWRIFEEMISNG 180
           YGEL+IVL KM LVEEAL MY KVGA L  QACNVLL+VLVKTG+FELLWRI+EEMISNG
Sbjct: 121 YGELIIVLCKMELVEEALSMYHKVGATLTIQACNVLLNVLVKTGRFELLWRIYEEMISNG 180

Query: 181 LSPDVITYGILIDGCCRQGDLLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSENKMEEAE 240
           LSP VIT+G LIDGCCRQGDLLRA EMFDEMR KGI PTVVVYTILIRGLCS++KMEEAE
Sbjct: 181 LSPSVITFGTLIDGCCRQGDLLRAQEMFDEMRVKGIVPTVVVYTILIRGLCSDSKMEEAE 240

Query: 241 SMHRAMREVGVLPNVYTYNTLMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFGILIDGL 300
           SMHRAMREVGV PN+YTYNTLMDGYCKLANAKQALRLYQDM+GEGLVPD VTFGILIDGL
Sbjct: 241 SMHRAMREVGVYPNLYTYNTLMDGYCKLANAKQALRLYQDMLGEGLVPDVVTFGILIDGL 300

Query: 301 CKFGDIKAARNLSVNIVKFSVTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERFKVSPDV 360
           CKFG++KAARNL VN++KF VTP+I VYNSLID YC+ GDVSEA+AF LELER+KVSPDV
Sbjct: 301 CKFGEMKAARNLFVNMIKFCVTPNINVYNSLIDAYCKVGDVSEAMAFFLELERYKVSPDV 360

Query: 361 ITYSILIRGLCSAGRTEEADNIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKALEICSR 420
            TYSILIRGLCS  RTEEA NIFEKM K+GI ANSVTYNSLI+G CKEG M+KALEICS+
Sbjct: 361 FTYSILIRGLCSVTRTEEAGNIFEKMTKEGILANSVTYNSLIDGYCKEGKMEKALEICSQ 420

Query: 421 MTENGVEPNVITFSILIDGYCKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDGHCKYGS 480
           MTENGVEPNVITFS LIDGYCKIRNL+AAMGIYSEMVIKSLSPDVV YTAMIDGHCKYGS
Sbjct: 421 MTENGVEPNVITFSTLIDGYCKIRNLQAAMGIYSEMVIKSLSPDVVTYTAMIDGHCKYGS 480

Query: 481 MKEALKLYSDMLDNGLTPNCYTISCLLDGLCKDGRISNALKLFTENTGFQTTRCDVNAGG 540
           MKEALKLYSDMLDNG+TPNCYTISCLLDGLCKDGRIS+AL+LFTE   FQT RC+V+AGG
Sbjct: 481 MKEALKLYSDMLDNGITPNCYTISCLLDGLCKDGRISDALRLFTEKIEFQTPRCNVDAGG 540

Query: 541 SKPSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYFQVKRIL 600
           SKPS TNHV YTALIHGLC+DGQ FKA KLFSDMR YGLQPDEVIYVVMLQG  QVK IL
Sbjct: 541 SKPSLTNHVAYTALIHGLCQDGQFFKAVKLFSDMRRYGLQPDEVIYVVMLQGLLQVKHIL 600

Query: 601 DVMMLHADMLKFGVIPNSAIYSILCKGYRESGFLKSALNCSKDLEE 647
             MMLHADMLKFG IPNSA+Y ILCK Y+ SGFLKSA NCSKDLEE
Sbjct: 601 --MMLHADMLKFGFIPNSAVYVILCKCYQGSGFLKSAQNCSKDLEE 643

BLAST of HG10022270 vs. TAIR 10
Match: AT5G61400.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 573.5 bits (1477), Expect = 2.1e-163
Identity = 287/618 (46.44%), Positives = 411/618 (66.50%), Query Frame = 0

Query: 26  SLSSLTSDSLITTVLNCRSPERALELF------NAAPEKNTQLYSAIIHVLVGSKLFSHA 85
           S SS +S SL   +L CRS E A +LF        +   + Q +SA+IHVL G+  ++ A
Sbjct: 35  SASSFSSSSLAEAILKCRSAEEAFKLFETSSRSRVSKSNDLQSFSAVIHVLTGAHKYTLA 94

Query: 86  RCLLKGLVENLLRKSHKRYHVCQLAFNALSQLKSSKFTPNVYGELLIVLSKMGLVEEALW 145
           RCL+K L+E L R S    ++    FNAL  ++S KF+  V+  L++   +MGL EEALW
Sbjct: 95  RCLIKSLIERLKRHSEPS-NMSHRLFNALEDIQSPKFSIGVFSLLIMEFLEMGLFEEALW 154

Query: 146 MYRKVGAALAKQACNVLLDVLVKTGKFELLWRIFEEMISNGLSPDVITYGILIDGCCRQG 205
           + R++  +   +AC  +L+ LV+  +F+ +W  ++ MIS GL PDV  Y +L   C +QG
Sbjct: 155 VSREMKCSPDSKACLSILNGLVRRRRFDSVWVDYQLMISRGLVPDVHIYFVLFQCCFKQG 214

Query: 206 DLLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSENKMEEAESMHRAMREVGVLPNVYTYN 265
              +  ++ DEM + GI+P V +YTI I  LC +NKMEEAE M   M++ GVLPN+YTY+
Sbjct: 215 LYSKKEKLLDEMTSLGIKPNVYIYTIYILDLCRDNKMEEAEKMFELMKKHGVLPNLYTYS 274

Query: 266 TLMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFGILIDGLCKFGDIKAARNLSVNIVKF 325
            ++DGYCK  N +QA  LY++++   L+P+ V FG L+DG CK  ++  AR+L V++VKF
Sbjct: 275 AMIDGYCKTGNVRQAYGLYKEILVAELLPNVVVFGTLVDGFCKARELVTARSLFVHMVKF 334

Query: 326 SVTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERFKVSPDVITYSILIRGLCSAGRTEEA 385
            V P++ VYN LI G+C++G++ EA+  L E+E   +SPDV TY+ILI GLC   +  EA
Sbjct: 335 GVDPNLYVYNCLIHGHCKSGNMLEAVGLLSEMESLNLSPDVFTYTILINGLCIEDQVAEA 394

Query: 386 DNIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKALEICSRMTENGVEPNVITFSILIDG 445
           + +F+KM  + I  +S TYNSLI+G CKE NM++AL++CS MT +GVEPN+ITFS LIDG
Sbjct: 395 NRLFQKMKNERIFPSSATYNSLIHGYCKEYNMEQALDLCSEMTASGVEPNIITFSTLIDG 454

Query: 446 YCKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDGHCKYGSMKEALKLYSDMLDNGLTPN 505
           YC +R+++AAMG+Y EM IK + PDVV YTA+ID H K  +MKEAL+LYSDML+ G+ PN
Sbjct: 455 YCNVRDIKAAMGLYFEMTIKGIVPDVVTYTALIDAHFKEANMKEALRLYSDMLEAGIHPN 514

Query: 506 CYTISCLLDGLCKDGRISNALKLFTENTGFQTTRCDVNAGGSKPSFTNHVVYTALIHGLC 565
            +T +CL+DG  K+GR+S A+  + EN               + S  NHV +T LI GLC
Sbjct: 515 DHTFACLVDGFWKEGRLSVAIDFYQEN-------------NQQRSCWNHVGFTCLIEGLC 574

Query: 566 EDGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYFQVKRILDVMMLHADMLKFGVIPNSA 625
           ++G I +A++ FSDMR  G+ PD   YV ML+G+ Q KRI D MML  DM+K G++PN  
Sbjct: 575 QNGYILRASRFFSDMRSCGITPDICSYVSMLKGHLQEKRITDTMMLQCDMIKTGILPNLL 634

Query: 626 IYSILCKGYRESGFLKSA 638
           +  +L + Y+ +G++KSA
Sbjct: 635 VNQLLARFYQANGYVKSA 638

BLAST of HG10022270 vs. TAIR 10
Match: AT2G02150.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 297.4 bits (760), Expect = 2.9e-80
Identity = 180/605 (29.75%), Positives = 307/605 (50.74%), Query Frame = 0

Query: 45  PERALELFNAAPEKN-----TQLYSAIIHVLVGSKLFSHARCLLKGLVENLLRKSHKRYH 104
           P+ A + F  +  +N      + Y  + H+L  ++++  A  +LK   E +L K+     
Sbjct: 122 PKLAFKFFKWSMTRNGFKHSVESYCIVAHILFCARMYYDANSVLK---EMVLSKAD---- 181

Query: 105 VCQLAFNALSQLKSSKFTP-NVYGELLIVLSKMGLVEEALWMYRKVGAALA---KQACNV 164
            C + F+ L   ++       V+  L  VL  +G++EEA+  + K+         ++CN 
Sbjct: 182 -CDV-FDVLWSTRNVCVPGFGVFDALFSVLIDLGMLEEAIQCFSKMKRFRVFPKTRSCNG 241

Query: 165 LLDVLVKTGKFELLWRIFEEMISNGLSPDVITYGILIDGCCRQGDLLRAHEMFDEMRAKG 224
           LL    K GK + + R F++MI  G  P V TY I+ID  C++GD+  A  +F+EM+ +G
Sbjct: 242 LLHRFAKLGKTDDVKRFFKDMIGAGARPTVFTYNIMIDCMCKEGDVEAARGLFEEMKFRG 301

Query: 225 IEPTVVVYTILIRGLCSENKMEEAESMHRAMREVGVLPNVYTYNTLMDGYCKLANAKQAL 284
           + P  V Y  +I G     ++++       M+++   P+V TYN L++ +CK       L
Sbjct: 302 LVPDTVTYNSMIDGFGKVGRLDDTVCFFEEMKDMCCEPDVITYNALINCFCKFGKLPIGL 361

Query: 285 RLYQDMIGEGLVPDDVTFGILIDGLCKFGDIKAARNLSVNIVKFSVTPSIAVYNSLIDGY 344
             Y++M G GL P+ V++  L+D  CK G ++ A    V++ +  + P+   Y SLID  
Sbjct: 362 EFYREMKGNGLKPNVVSYSTLVDAFCKEGMMQQAIKFYVDMRRVGLVPNEYTYTSLIDAN 421

Query: 345 CRAGDVSEAIAFLLELERFKVSPDVITYSILIRGLCSAGRTEEADNIFEKMMKDGIPANS 404
           C+ G++S+A     E+ +  V  +V+TY+ LI GLC A R +EA+ +F KM   G+  N 
Sbjct: 422 CKIGNLSDAFRLGNEMLQVGVEWNVVTYTALIDGLCDAERMKEAEELFGKMDTAGVIPNL 481

Query: 405 VTYNSLINGCCKEGNMKKALEICSRMTENGVEPNVITFSILIDGYCKIRNLRAAMGIYSE 464
            +YN+LI+G  K  NM +ALE+ + +   G++P+++ +   I G C +  + AA  + +E
Sbjct: 482 ASYNALIHGFVKAKNMDRALELLNELKGRGIKPDLLLYGTFIWGLCSLEKIEAAKVVMNE 541

Query: 465 MVIKSLSPDVVAYTAMIDGHCKYGSMKEALKLYSDMLDNGLTPNCYTISCLLDGLCKDGR 524
           M    +  + + YT ++D + K G+  E L L  +M +  +     T   L+DGLCK+  
Sbjct: 542 MKECGIKANSLIYTTLMDAYFKSGNPTEGLHLLDEMKELDIEVTVVTFCVLIDGLCKNKL 601

Query: 525 ISNALKLF---TENTGFQTTRCDVNAGGSKPSFTNHVVYTALIHGLCEDGQIFKAAKLFS 584
           +S A+  F   + + G Q                N  ++TA+I GLC+D Q+  A  LF 
Sbjct: 602 VSKAVDYFNRISNDFGLQ---------------ANAAIFTAMIDGLCKDNQVEAATTLFE 661

Query: 585 DMRCYGLQPDEVIYVVMLQGYFQVKRILDVMMLHADMLKFGVIPNSAIYSILCKGYRESG 638
            M   GL PD   Y  ++ G F+   +L+ + L   M + G+  +   Y+ L  G     
Sbjct: 662 QMVQKGLVPDRTAYTSLMDGNFKQGNVLEALALRDKMAEIGMKLDLLAYTSLVWGLSHCN 702

BLAST of HG10022270 vs. TAIR 10
Match: AT5G65560.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 293.1 bits (749), Expect = 5.4e-79
Identity = 173/532 (32.52%), Positives = 273/532 (51.32%), Query Frame = 0

Query: 121 YGELLIVLSKMGLVEEALWMYRKVGAALAKQAC------NVLLDVLVKTGKFELLWRIFE 180
           Y  LL  L++ GLV+E   +Y ++   L  + C      N +++   K G  E   +   
Sbjct: 186 YNTLLNSLARFGLVDEMKQVYMEM---LEDKVCPNIYTYNKMVNGYCKLGNVEEANQYVS 245

Query: 181 EMISNGLSPDVITYGILIDGCCRQGDLLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSEN 240
           +++  GL PD  TY  LI G C++ DL  A ++F+EM  KG     V YT LI GLC   
Sbjct: 246 KIVEAGLDPDFFTYTSLIMGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVAR 305

Query: 241 KMEEAESMHRAMREVGVLPNVYTYNTLMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFG 300
           +++EA  +   M++    P V TY  L+   C      +AL L ++M   G+ P+  T+ 
Sbjct: 306 RIDEAMDLFVKMKDDECFPTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYT 365

Query: 301 ILIDGLCKFGDIKAARNLSVNIVKFSVTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERF 360
           +LID LC     + AR L   +++  + P++  YN+LI+GYC+ G + +A+  +  +E  
Sbjct: 366 VLIDSLCSQCKFEKARELLGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDVVELMESR 425

Query: 361 KVSPDVITYSILIRGLCSAGRTEEADNIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKA 420
           K+SP+  TY+ LI+G C +    +A  +  KM++  +  + VTYNSLI+G C+ GN   A
Sbjct: 426 KLSPNTRTYNELIKGYCKS-NVHKAMGVLNKMLERKVLPDVVTYNSLIDGQCRSGNFDSA 485

Query: 421 LEICSRMTENGVEPNVITFSILIDGYCKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDG 480
             + S M + G+ P+  T++ +ID  CK + +  A  ++  +  K ++P+VV YTA+IDG
Sbjct: 486 YRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALIDG 545

Query: 481 HCKYGSMKEALKLYSDMLDNGLTPNCYTISCLLDGLCKDGRISNALKLFTENTGFQTTRC 540
           +CK G + EA  +   ML     PN  T + L+ GLC DG++  A  L  +         
Sbjct: 546 YCKAGKVDEAHLMLEKMLSKNCLPNSLTFNALIHGLCADGKLKEATLLEEKMVKI----- 605

Query: 541 DVNAGGSKPSFTNHVVYTALIHGLCEDGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYF 600
                G +P+ +     T LIH L +DG    A   F  M   G +PD   Y   +Q Y 
Sbjct: 606 -----GLQPTVSTD---TILIHRLLKDGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYC 665

Query: 601 QVKRILDVMMLHADMLKFGVIPNSAIYSILCKGYRESGFLKSALNCSKDLEE 647
           +  R+LD   + A M + GV P+   YS L KGY + G    A +  K + +
Sbjct: 666 REGRLLDAEDMMAKMRENGVSPDLFTYSSLIKGYGDLGQTNFAFDVLKRMRD 700

BLAST of HG10022270 vs. TAIR 10
Match: AT1G05670.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 287.0 bits (733), Expect = 3.9e-77
Identity = 173/608 (28.45%), Positives = 299/608 (49.18%), Query Frame = 0

Query: 32  SDSLITTVLNCRSPER-ALELFNAA---PEKNTQLYSAIIHVLVGSKLFSHARCLLKGLV 91
           +D LI  ++  +   R  L+ F+ A    + N +    +IH+ V SK    A+ L+    
Sbjct: 87  TDHLIWVLMKIKCDYRLVLDFFDWARSRRDSNLESLCIVIHLAVASKDLKVAQSLISSFW 146

Query: 92  ENLLRKSHKRYHVCQLAFNALSQLKSSKFTPNVYGELLIVLSKMGLVEEALWMYRKV--- 151
           E    K +      Q     +   K     P V+     VL   GL+ EA  ++ K+   
Sbjct: 147 ER--PKLNVTDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNY 206

Query: 152 GAALAKQACNVLLDVL----VKTGKFELLWRIFEEMISNGLSPDVITYGILIDGCCRQGD 211
           G  L+  +CNV L  L     KT    +++R F E+   G+  +V +Y I+I   C+ G 
Sbjct: 207 GLVLSVDSCNVYLTRLSKDCYKTATAIIVFREFPEV---GVCWNVASYNIVIHFVCQLGR 266

Query: 212 LLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSENKMEEAESMHRAMREVGVLPNVYTYNT 271
           +  AH +   M  KG  P V+ Y+ ++ G C   ++++   +   M+  G+ PN Y Y +
Sbjct: 267 IKEAHHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGS 326

Query: 272 LMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFGILIDGLCKFGDIKAARNLSVNIVKFS 331
           ++   C++    +A   + +MI +G++PD V +  LIDG CK GDI+AA      +    
Sbjct: 327 IIGLLCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRD 386

Query: 332 VTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERFKVSPDVITYSILIRGLCSAGRTEEAD 391
           +TP +  Y ++I G+C+ GD+ EA     E+    + PD +T++ LI G C AG  ++A 
Sbjct: 387 ITPDVLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAF 446

Query: 392 NIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKALEICSRMTENGVEPNVITFSILIDGY 451
            +   M++ G   N VTY +LI+G CKEG++  A E+   M + G++PN+ T++ +++G 
Sbjct: 447 RVHNHMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGL 506

Query: 452 CKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDGHCKYGSMKEALKLYSDMLDNGLTPNC 511
           CK  N+  A+ +  E     L+ D V YT ++D +CK G M +A ++  +ML  GL P  
Sbjct: 507 CKSGNIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTI 566

Query: 512 YTISCLLDGLCKDGRISNALKLFTENTGFQTTRCDVNAGGSKPSFTNHVVYTALIHGLCE 571
            T + L++G C  G + +  KL             +N   +K    N   + +L+   C 
Sbjct: 567 VTFNVLMNGFCLHGMLEDGEKL-------------LNWMLAKGIAPNATTFNSLVKQYCI 626

Query: 572 DGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYFQVKRILDVMMLHADMLKFGVIPNSAI 629
              +  A  ++ DM   G+ PD   Y  +++G+ + + + +   L  +M   G   + + 
Sbjct: 627 RNNLKAATAIYKDMCSRGVGPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKGFSVSVST 676

BLAST of HG10022270 vs. TAIR 10
Match: AT1G05670.2 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 287.0 bits (733), Expect = 3.9e-77
Identity = 173/608 (28.45%), Positives = 299/608 (49.18%), Query Frame = 0

Query: 32  SDSLITTVLNCRSPER-ALELFNAA---PEKNTQLYSAIIHVLVGSKLFSHARCLLKGLV 91
           +D LI  ++  +   R  L+ F+ A    + N +    +IH+ V SK    A+ L+    
Sbjct: 87  TDHLIWVLMKIKCDYRLVLDFFDWARSRRDSNLESLCIVIHLAVASKDLKVAQSLISSFW 146

Query: 92  ENLLRKSHKRYHVCQLAFNALSQLKSSKFTPNVYGELLIVLSKMGLVEEALWMYRKV--- 151
           E    K +      Q     +   K     P V+     VL   GL+ EA  ++ K+   
Sbjct: 147 ER--PKLNVTDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNY 206

Query: 152 GAALAKQACNVLLDVL----VKTGKFELLWRIFEEMISNGLSPDVITYGILIDGCCRQGD 211
           G  L+  +CNV L  L     KT    +++R F E+   G+  +V +Y I+I   C+ G 
Sbjct: 207 GLVLSVDSCNVYLTRLSKDCYKTATAIIVFREFPEV---GVCWNVASYNIVIHFVCQLGR 266

Query: 212 LLRAHEMFDEMRAKGIEPTVVVYTILIRGLCSENKMEEAESMHRAMREVGVLPNVYTYNT 271
           +  AH +   M  KG  P V+ Y+ ++ G C   ++++   +   M+  G+ PN Y Y +
Sbjct: 267 IKEAHHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGS 326

Query: 272 LMDGYCKLANAKQALRLYQDMIGEGLVPDDVTFGILIDGLCKFGDIKAARNLSVNIVKFS 331
           ++   C++    +A   + +MI +G++PD V +  LIDG CK GDI+AA      +    
Sbjct: 327 IIGLLCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRD 386

Query: 332 VTPSIAVYNSLIDGYCRAGDVSEAIAFLLELERFKVSPDVITYSILIRGLCSAGRTEEAD 391
           +TP +  Y ++I G+C+ GD+ EA     E+    + PD +T++ LI G C AG  ++A 
Sbjct: 387 ITPDVLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAF 446

Query: 392 NIFEKMMKDGIPANSVTYNSLINGCCKEGNMKKALEICSRMTENGVEPNVITFSILIDGY 451
            +   M++ G   N VTY +LI+G CKEG++  A E+   M + G++PN+ T++ +++G 
Sbjct: 447 RVHNHMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGL 506

Query: 452 CKIRNLRAAMGIYSEMVIKSLSPDVVAYTAMIDGHCKYGSMKEALKLYSDMLDNGLTPNC 511
           CK  N+  A+ +  E     L+ D V YT ++D +CK G M +A ++  +ML  GL P  
Sbjct: 507 CKSGNIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTI 566

Query: 512 YTISCLLDGLCKDGRISNALKLFTENTGFQTTRCDVNAGGSKPSFTNHVVYTALIHGLCE 571
            T + L++G C  G + +  KL             +N   +K    N   + +L+   C 
Sbjct: 567 VTFNVLMNGFCLHGMLEDGEKL-------------LNWMLAKGIAPNATTFNSLVKQYCI 626

Query: 572 DGQIFKAAKLFSDMRCYGLQPDEVIYVVMLQGYFQVKRILDVMMLHADMLKFGVIPNSAI 629
              +  A  ++ DM   G+ PD   Y  +++G+ + + + +   L  +M   G   + + 
Sbjct: 627 RNNLKAATAIYKDMCSRGVGPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKGFSVSVST 676

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match Name      E-value   Identity (%)  Description
XP_038890299.1  0.0e+00   89.47         pentatricopeptide repeat-containing protein At5g61400, partial [Benincasa hispid...
XP_022990473.1  0.0e+00   84.57         pentatricopeptide repeat-containing protein At5g61400 [Cucurbita maxima]
XP_023517041.1  0.0e+00   85.03         pentatricopeptide repeat-containing protein At5g61400 [Cucurbita pepo subsp. pep...
XP_022959787.1  0.0e+00   84.57         LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g61400 [Cucu...
XP_011655326.1  0.0e+00   84.67         pentatricopeptide repeat-containing protein At5g61400 [Cucumis sativus] >XP_0116...

Match Name      E-value   Identity (%)  Description
Q9FLJ4          2.9e-162  46.44         Pentatricopeptide repeat-containing protein At5g61400 OS=Arabidopsis thaliana OX...
P0C894          4.0e-79   29.75         Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis th...
Q9LSL9          7.6e-78   32.52         Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX...
Q0WVK7          5.5e-76   28.45         Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop...
Q9LFC5          3.9e-74   27.57         Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana OX...

Match Name      E-value   Identity (%)  Description
A0A6J1JTD6      0.0e+00   84.57         pentatricopeptide repeat-containing protein At5g61400 OS=Cucurbita maxima OX=366...
A0A6J1H5I4      0.0e+00   84.57         LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g61400 OS=Cu...
A0A0A0KS30      0.0e+00   84.67         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G496480 PE=4 SV=1
A0A5D3C3E1      0.0e+00   84.37         Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S4E4Z4      0.0e+00   84.06         pentatricopeptide repeat-containing protein At5g61400 isoform X2 OS=Cucumis melo...

Match Name      E-value   Identity (%)  Description
AT5G61400.1     2.1e-163  46.44         Pentatricopeptide repeat (PPR) superfamily protein
AT2G02150.1     2.9e-80   29.75         Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G65560.1     5.4e-79   32.52         Pentatricopeptide repeat (PPR) superfamily protein
AT1G05670.1     3.9e-77   28.45         Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G05670.2     3.9e-77   28.45         Pentatricopeptide repeat (PPR-like) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description   Source   Source Term   Source Description   Alignment

IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 253..302   e-value: 3.0E-17   score: 62.6
    coord: 463..512   e-value: 8.4E-19   score: 67.5
    coord: 323..372   e-value: 1.3E-15   score: 57.3
    coord: 547..593   e-value: 1.4E-11   score: 44.4
    coord: 152..197   e-value: 7.9E-14   score: 51.6
    coord: 394..442   e-value: 2.1E-19   score: 69.5

IPR002885  Pentatricopeptide repeat  PFAM  PF12854  PPR_1
    coord: 214..247   e-value: 1.1E-9    score: 37.9

IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 221..255   e-value: 2.5E-8    score: 31.6
    coord: 186..220   e-value: 4.1E-11   score: 40.4
    coord: 466..500   e-value: 5.3E-10   score: 36.8
    coord: 361..394   e-value: 3.8E-10   score: 37.3
    coord: 584..617   e-value: 0.0019    score: 16.3
    coord: 431..465   e-value: 4.1E-7    score: 27.8
    coord: 327..360   e-value: 1.3E-8    score: 32.5
    coord: 152..185   e-value: 8.9E-8    score: 29.8
    coord: 256..289   e-value: 1.1E-9    score: 35.9
    coord: 396..430   e-value: 6.4E-12   score: 42.9
    coord: 549..583   e-value: 2.6E-8    score: 31.5

IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 429..463   score: 11.246351
    coord: 547..581   score: 11.586152
    coord: 149..183   score: 10.500983
    coord: 464..498   score: 13.350921
    coord: 582..616   score: 9.481582
    coord: 254..288   score: 13.285153
    coord: 184..218   score: 14.194941
    coord: 289..323   score: 9.196589
    coord: 394..428   score: 14.447051
    coord: 219..253   score: 11.925952
    coord: 324..358   score: 12.002681
    coord: 359..393   score: 13.318037

IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 316..386   e-value: 5.3E-21   score: 77.0
    coord: 242..315   e-value: 1.1E-21   score: 79.3
    coord: 387..458   e-value: 2.0E-23   score: 84.9
    coord: 459..531   e-value: 2.3E-20   score: 74.9
    coord: 536..649   e-value: 7.3E-21   score: 76.8
    coord: 92..237    e-value: 1.1E-31   score: 111.7

None  No IPR available  PANTHER  PTHR47932  ATPASE EXPRESSION PROTEIN 3
    coord: 22..633

None  No IPR available  PANTHER  PTHR47932:SF42  OSJNBA0060P14.6 PROTEIN
    coord: 22..633

None  No IPR available  SUPERFAMILY  81901  HCP-like
    coord: 303..496
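The InterPro hits above give residue ranges in the form `coord: start..end`. One thing a reader may want to check is how the individual repeat hits tile the protein. The following Python sketch takes the twelve PROSITE PS51375 ranges listed above, sorts them, and merges ranges that touch or overlap. The helper names `parse_coords` and `merge_ranges` and the constant `PROSITE_PPR` are hypothetical and are not part of any InterPro tooling.

```python
import re

# The PROSITE PS51375 (PPR) ranges exactly as listed in the table above.
PROSITE_PPR = """
coord: 429..463  coord: 547..581  coord: 149..183  coord: 464..498
coord: 582..616  coord: 254..288  coord: 184..218  coord: 289..323
coord: 394..428  coord: 219..253  coord: 324..358  coord: 359..393
"""

COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Extract (start, end) residue ranges, sorted by start position."""
    return sorted((int(a), int(b)) for a, b in COORD_RE.findall(text))


def merge_ranges(ranges):
    """Merge ranges that overlap or are directly adjacent (end + 1 == next start)."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


repeats = parse_coords(PROSITE_PPR)
lengths = {end - start + 1 for start, end in repeats}
blocks = merge_ranges(repeats)

print(len(repeats), "repeats; lengths:", lengths)
print("contiguous blocks:", blocks)
```

With these ranges, each PROSITE hit spans 35 residues, and the hits fall into two contiguous runs separated by a gap between residues 499 and 546.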

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10022270.1  HG10022270.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding