HG10010338 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10010338
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr06: 21252980 .. 21254791 (-)
RNA-Seq ExpressionHG10010338
SyntenyHG10010338
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTGAAACGAAATCTCCATTTCGCCCTCACCAGGCCCTGCCTTTCCCGCGAAAACGAACGGCTATCTTTTCAAACCATTCAAAATCTCAGTCGCCATCTTCGAAATTGCAACGATTTGGTTTCTTCAATTTCGACTCACTCCATAGCCCTAAAGCTTGGATTCTTAAACCATACTGTCGCCGTCAACCATCTCATCAATTGCTATGTTAGATTCCGCAATGTTGCAACCGCACACCGACTGTTCGACGAAATGCCCAACCCAAATGTTGTGTCATGGACCTCACTCATGGCTGGTTACATCGACAATGGTCGGCCGAATACCGCTCTTTTATTGTTCAGGGCAATGTCGAGAAGTTCCGTTGTTCCGAATGACTTCACTTTCGCAACTGCGATTAAGGCCTGTTCGATCCTTTCGAATTTAAGACATGGTGAAAAGTTCCATGCCTATGTTGAGATTTTTGGTTATGGAGGTAATATTGTGGTTTGTTCTTCTCTTATTGCTATGTATGGGAAATGTAATGATGTTGTTAAAGCTAGGAGTGTCTTTAATTCCATGTCTTGTAAGAATATTGTTTCTTGGAATTCAATGATTGCTGCTTATGCTCAGAATGCTCACGGCGACGATGCATTAAAAGTATTTAGGGAATTCAGTAGTTTGAGTTCTGAACATCCAAATCCTTACATGTTAGCTAGTGTAATTAGTGCTTGTGCAAGCTTGGGAAGGCTGGTTTCCGGGAAAGTTGTGCACGGTGCAGCGATCCGTCTTGGGTGTGATTCGAGCGACGTAGTTGCGAGTGTGTTGGTTGATATGTATGCTAAATGTGGGAGTCTTGAGTATTCTGATAAGGTTTTTAACAGAATTTCAAACCCTTCTGTTATTCCTTATACTTCAATAATTGTGAGCACAGCAAAGTATGGATTTGGGAGAAAGTCTCTTCAACTCTTTGAAGAAATGGTGAGAAAAGGATTGAAACCTAATCATATCACTTTTGTTGGAGTCTTGCATGCTTGTAGCCATTCAGGGCTTCCTAATGAGGGCCTTGATTATTTGACATCCATGTATGAGAGATATGGAATAATGCCCGAGACTAAGCATTATACGTGCGTTGTCGACATGCTAGCACGAGCCGGGCAGCTAGATAAAGCCTACCAACTAGCAAAGTCGATGGAGGTAGCATCCGATGACAAGGCATTGTTGTGGGGAGCATTGCTTTCAGCTAGTAGGTGTCATGGCAGGGTAGATATTGCAGCTAAAGCTTGTCAGCAACTTGTGAATTCCAATAGACAAGTCGCCGGTGCATACGTTACTTTGTCAAATGCTTATGCATCTGCTGGGGATATGGAGAAGGCTCATAGACTCCGAGTTGAGATGAAACATACCGGAGTTCAGAAAGAACCAGGCTGCAGTTGGATCGAAATAAAAGATTCGAGTTATGTATTCTATGCTGGGGATATAACGTCGTGCCCACGAGGCGATGAAGTGTTGCGTTTACTAAGAGAGTTGGACCGGAAAATGAAGGAGCGAGGTTACGTAAGAGGAAGCAAAGGGTTGGTGTTTGTTGATATAGAAGAGGAGGCAGAGGAAGAAAAAGTTTGGTTGCACAGTGAAAGATTGGCATTGGGATTTGGTTTGATTAGCATTTCAAAAGGACTTACAATAAGAATAATGAAGAACTTGAGAATGTGCAGTGATTGTCATGAGGCTTTCAAGCTTATAAGTGAGATTGTGGAAAGAGAGTTTGTAGTTAGAGATATCAATAGATTTCATCATTTCAAAAGTGGTTGTTGTACTTGCAATGATTTCTGGTAA

mRNA sequence

ATGTTGAAACGAAATCTCCATTTCGCCCTCACCAGGCCCTGCCTTTCCCGCGAAAACGAACGGCTATCTTTTCAAACCATTCAAAATCTCAGTCGCCATCTTCGAAATTGCAACGATTTGGTTTCTTCAATTTCGACTCACTCCATAGCCCTAAAGCTTGGATTCTTAAACCATACTGTCGCCGTCAACCATCTCATCAATTGCTATGTTAGATTCCGCAATGTTGCAACCGCACACCGACTGTTCGACGAAATGCCCAACCCAAATGTTGTGTCATGGACCTCACTCATGGCTGGTTACATCGACAATGGTCGGCCGAATACCGCTCTTTTATTGTTCAGGGCAATGTCGAGAAGTTCCGTTGTTCCGAATGACTTCACTTTCGCAACTGCGATTAAGGCCTGTTCGATCCTTTCGAATTTAAGACATGGTGAAAAGTTCCATGCCTATGTTGAGATTTTTGGTTATGGAGGTAATATTGTGGTTTGTTCTTCTCTTATTGCTATGTATGGGAAATGTAATGATGTTGTTAAAGCTAGGAGTGTCTTTAATTCCATGTCTTGTAAGAATATTGTTTCTTGGAATTCAATGATTGCTGCTTATGCTCAGAATGCTCACGGCGACGATGCATTAAAAGTATTTAGGGAATTCAGTAGTTTGAGTTCTGAACATCCAAATCCTTACATGTTAGCTAGTGTAATTAGTGCTTGTGCAAGCTTGGGAAGGCTGGTTTCCGGGAAAGTTGTGCACGGTGCAGCGATCCGTCTTGGGTGTGATTCGAGCGACGTAGTTGCGAGTGTGTTGGTTGATATGTATGCTAAATGTGGGAGTCTTGAGTATTCTGATAAGGTTTTTAACAGAATTTCAAACCCTTCTGTTATTCCTTATACTTCAATAATTGTGAGCACAGCAAAGTATGGATTTGGGAGAAAGTCTCTTCAACTCTTTGAAGAAATGGTGAGAAAAGGATTGAAACCTAATCATATCACTTTTGTTGGAGTCTTGCATGCTTGTAGCCATTCAGGGCTTCCTAATGAGGGCCTTGATTATTTGACATCCATGTATGAGAGATATGGAATAATGCCCGAGACTAAGCATTATACGTGCGTTGTCGACATGCTAGCACGAGCCGGGCAGCTAGATAAAGCCTACCAACTAGCAAAGTCGATGGAGGTAGCATCCGATGACAAGGCATTGTTGTGGGGAGCATTGCTTTCAGCTAGTAGGTGTCATGGCAGGGTAGATATTGCAGCTAAAGCTTGTCAGCAACTTGTGAATTCCAATAGACAAGTCGCCGGTGCATACGTTACTTTGTCAAATGCTTATGCATCTGCTGGGGATATGGAGAAGGCTCATAGACTCCGAGTTGAGATGAAACATACCGGAGTTCAGAAAGAACCAGGCTGCAGTTGGATCGAAATAAAAGATTCGAGTTATGTATTCTATGCTGGGGATATAACGTCGTGCCCACGAGGCGATGAAGTGTTGCGTTTACTAAGAGAGTTGGACCGGAAAATGAAGGAGCGAGGTTACGTAAGAGGAAGCAAAGGGTTGGTGTTTGTTGATATAGAAGAGGAGGCAGAGGAAGAAAAAGTTTGGTTGCACAGTGAAAGATTGGCATTGGGATTTGGTTTGATTAGCATTTCAAAAGGACTTACAATAAGAATAATGAAGAACTTGAGAATGTGCAGTGATTGTCATGAGGCTTTCAAGCTTATAAGTGAGATTGTGGAAAGAGAGTTTGTAGTTAGAGATATCAATAGATTTCATCATTTCAAAAGTGGTTGTTGTACTTGCAATGATTTCTGGTAA

Coding sequence (CDS)

ATGTTGAAACGAAATCTCCATTTCGCCCTCACCAGGCCCTGCCTTTCCCGCGAAAACGAACGGCTATCTTTTCAAACCATTCAAAATCTCAGTCGCCATCTTCGAAATTGCAACGATTTGGTTTCTTCAATTTCGACTCACTCCATAGCCCTAAAGCTTGGATTCTTAAACCATACTGTCGCCGTCAACCATCTCATCAATTGCTATGTTAGATTCCGCAATGTTGCAACCGCACACCGACTGTTCGACGAAATGCCCAACCCAAATGTTGTGTCATGGACCTCACTCATGGCTGGTTACATCGACAATGGTCGGCCGAATACCGCTCTTTTATTGTTCAGGGCAATGTCGAGAAGTTCCGTTGTTCCGAATGACTTCACTTTCGCAACTGCGATTAAGGCCTGTTCGATCCTTTCGAATTTAAGACATGGTGAAAAGTTCCATGCCTATGTTGAGATTTTTGGTTATGGAGGTAATATTGTGGTTTGTTCTTCTCTTATTGCTATGTATGGGAAATGTAATGATGTTGTTAAAGCTAGGAGTGTCTTTAATTCCATGTCTTGTAAGAATATTGTTTCTTGGAATTCAATGATTGCTGCTTATGCTCAGAATGCTCACGGCGACGATGCATTAAAAGTATTTAGGGAATTCAGTAGTTTGAGTTCTGAACATCCAAATCCTTACATGTTAGCTAGTGTAATTAGTGCTTGTGCAAGCTTGGGAAGGCTGGTTTCCGGGAAAGTTGTGCACGGTGCAGCGATCCGTCTTGGGTGTGATTCGAGCGACGTAGTTGCGAGTGTGTTGGTTGATATGTATGCTAAATGTGGGAGTCTTGAGTATTCTGATAAGGTTTTTAACAGAATTTCAAACCCTTCTGTTATTCCTTATACTTCAATAATTGTGAGCACAGCAAAGTATGGATTTGGGAGAAAGTCTCTTCAACTCTTTGAAGAAATGGTGAGAAAAGGATTGAAACCTAATCATATCACTTTTGTTGGAGTCTTGCATGCTTGTAGCCATTCAGGGCTTCCTAATGAGGGCCTTGATTATTTGACATCCATGTATGAGAGATATGGAATAATGCCCGAGACTAAGCATTATACGTGCGTTGTCGACATGCTAGCACGAGCCGGGCAGCTAGATAAAGCCTACCAACTAGCAAAGTCGATGGAGGTAGCATCCGATGACAAGGCATTGTTGTGGGGAGCATTGCTTTCAGCTAGTAGGTGTCATGGCAGGGTAGATATTGCAGCTAAAGCTTGTCAGCAACTTGTGAATTCCAATAGACAAGTCGCCGGTGCATACGTTACTTTGTCAAATGCTTATGCATCTGCTGGGGATATGGAGAAGGCTCATAGACTCCGAGTTGAGATGAAACATACCGGAGTTCAGAAAGAACCAGGCTGCAGTTGGATCGAAATAAAAGATTCGAGTTATGTATTCTATGCTGGGGATATAACGTCGTGCCCACGAGGCGATGAAGTGTTGCGTTTACTAAGAGAGTTGGACCGGAAAATGAAGGAGCGAGGTTACGTAAGAGGAAGCAAAGGGTTGGTGTTTGTTGATATAGAAGAGGAGGCAGAGGAAGAAAAAGTTTGGTTGCACAGTGAAAGATTGGCATTGGGATTTGGTTTGATTAGCATTTCAAAAGGACTTACAATAAGAATAATGAAGAACTTGAGAATGTGCAGTGATTGTCATGAGGCTTTCAAGCTTATAAGTGAGATTGTGGAAAGAGAGTTTGTAGTTAGAGATATCAATAGATTTCATCATTTCAAAAGTGGTTGTTGTACTTGCAATGATTTCTGGTAA

Protein sequence

MLKRNLHFALTRPCLSRENERLSFQTIQNLSRHLRNCNDLVSSISTHSIALKLGFLNHTVAVNHLINCYVRFRNVATAHRLFDEMPNPNVVSWTSLMAGYIDNGRPNTALLLFRAMSRSSVVPNDFTFATAIKACSILSNLRHGEKFHAYVEIFGYGGNIVVCSSLIAMYGKCNDVVKARSVFNSMSCKNIVSWNSMIAAYAQNAHGDDALKVFREFSSLSSEHPNPYMLASVISACASLGRLVSGKVVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSIIVSTAKYGFGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGIMPETKHYTCVVDMLARAGQLDKAYQLAKSMEVASDDKALLWGALLSASRCHGRVDIAAKACQQLVNSNRQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSYVFYAGDITSCPRGDEVLRLLRELDRKMKERGYVRGSKGLVFVDIEEEAEEEKVWLHSERLALGFGLISISKGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTCNDFW
Homology
BLAST of HG10010338 vs. NCBI nr
Match: XP_038875319.1 (pentatricopeptide repeat-containing protein At4g15720 isoform X1 [Benincasa hispida])

HSP 1 Score: 1124.4 bits (2907), Expect = 0.0e+00
Identity = 552/603 (91.54%), Postives = 580/603 (96.19%), Query Frame = 0

Query: 1   MLKRNLHFALTRPCLSRENERLSFQTIQNLSRHLRNCNDLVSSISTHSIALKLGFLNHTV 60
           MLKRNLHFALTRPCLSR+NER SFQTI+NLSR+LR CNDL+SSISTHSIALKLGFLN+TV
Sbjct: 1   MLKRNLHFALTRPCLSRKNERPSFQTIENLSRYLRECNDLMSSISTHSIALKLGFLNYTV 60

Query: 61  AVNHLINCYVRFRNVATAHRLFDEMPNPNVVSWTSLMAGYIDNGRPNTALLLFRAMSRSS 120
            VNHLINCYVRFRNVATAH+LFDEMPNPNVVSWTSLMAGY++NGRP TALLLFRAMSRS 
Sbjct: 61  PVNHLINCYVRFRNVATAHQLFDEMPNPNVVSWTSLMAGYVNNGRPTTALLLFRAMSRSP 120

Query: 121 VVPNDFTFATAIKACSILSNLRHGEKFHAYVEIFGYGGNIVVCSSLIAMYGKCNDVVKAR 180
           +VPNDFTF TAIKACSILSNLRHGEK HA+VEIFGYGGN+VVCSSLI MYGKCNDVV+AR
Sbjct: 121 IVPNDFTFTTAIKACSILSNLRHGEKLHAHVEIFGYGGNVVVCSSLIDMYGKCNDVVRAR 180

Query: 181 SVFNSMSCKNIVSWNSMIAAYAQNAHGDDALKVFREFSSLSSEHPNPYMLASVISACASL 240
           SVFNSMSCKNIVSW SMIAAYAQNA+GDDALKVFREFSS SSEHPNP+MLASVISACASL
Sbjct: 181 SVFNSMSCKNIVSWTSMIAAYAQNAYGDDALKVFREFSSWSSEHPNPHMLASVISACASL 240

Query: 241 GRLVSGKVVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSII 300
           GRL+SGKV+HGAA+RLG DSS+VVASVLVDMYAKCG+LEYSDKVFNRISNPSVIPYTS+I
Sbjct: 241 GRLISGKVMHGAAVRLGSDSSNVVASVLVDMYAKCGNLEYSDKVFNRISNPSVIPYTSMI 300

Query: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGI 360
           VSTAKYGFGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYE+YGI
Sbjct: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYEKYGI 360

Query: 361 MPETKHYTCVVDMLARAGQLDKAYQLAKSMEVASDDKALLWGALLSASRCHGRVDIAAKA 420
           MPE KHYTCVVDMLARAGQLDKAY+LAKSMEV SDDKALLWGALLSASRCHGRVDIAA+A
Sbjct: 361 MPEIKHYTCVVDMLARAGQLDKAYELAKSMEVVSDDKALLWGALLSASRCHGRVDIAAEA 420

Query: 421 CQQLVNSNRQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSYV 480
           C+QLVNSN QVAGAYVTLSNAYA AGDMEKA RLRVEMKHTGVQKEPGCSWIEIKDSSYV
Sbjct: 421 CKQLVNSNEQVAGAYVTLSNAYALAGDMEKAQRLRVEMKHTGVQKEPGCSWIEIKDSSYV 480

Query: 481 FYAGDITSCPRGDEVLRLLRELDRKMKERGYVRGSKGLVFVDIEEEAEEEKVWLHSERLA 540
           FYAG+ITS PRGDEVL LLRELDRKM +RGYVRGS+GL FVDIEEEAEEEKVWLHSERLA
Sbjct: 481 FYAGEITSSPRGDEVLCLLRELDRKMNDRGYVRGSRGLAFVDIEEEAEEEKVWLHSERLA 540

Query: 541 LGFGLISISKGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTCN 600
           LGFGLISI KGL IRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFK+GCCTCN
Sbjct: 541 LGFGLISIPKGLIIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKNGCCTCN 600

Query: 601 DFW 604
            FW
Sbjct: 601 GFW 603

BLAST of HG10010338 vs. NCBI nr
Match: XP_004143199.1 (pentatricopeptide repeat-containing protein At4g15720 [Cucumis sativus] >KGN47194.1 hypothetical protein Csa_021072 [Cucumis sativus])

HSP 1 Score: 1114.8 bits (2882), Expect = 0.0e+00
Identity = 547/603 (90.71%), Postives = 576/603 (95.52%), Query Frame = 0

Query: 1   MLKRNLHFALTRPCLSRENERLSFQTIQNLSRHLRNCNDLVSSISTHSIALKLGFLNHTV 60
           MLKRNL+FALTRP LSRENER   QTI+NLSRHLRNCNDL+SSISTHSIALKLGFLN+TV
Sbjct: 1   MLKRNLYFALTRPYLSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTV 60

Query: 61  AVNHLINCYVRFRNVATAHRLFDEMPNPNVVSWTSLMAGYIDNGRPNTALLLFRAMSRSS 120
            VNHLINCYVRFR++ATAH+LFDEMPNPNVVSWTSLMAGY+DNG+P+TAL LF  M RS 
Sbjct: 61  NVNHLINCYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSP 120

Query: 121 VVPNDFTFATAIKACSILSNLRHGEKFHAYVEIFGYGGNIVVCSSLIAMYGKCNDVVKAR 180
           VVPNDFTFATAIKACSILSNLRHGE FHA+VEIFGYGGNIVVCSSLI MYGKCNDVVKAR
Sbjct: 121 VVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKAR 180

Query: 181 SVFNSMSCKNIVSWNSMIAAYAQNAHGDDALKVFREFSSLSSEHPNPYMLASVISACASL 240
            VFNSMSCKNIVSW SMIAAYAQNAHGD+ALKVFREF+SLSSEHPNPYMLASVISACASL
Sbjct: 181 GVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASL 240

Query: 241 GRLVSGKVVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSII 300
           GRLVSGKV+HGAAI LGCDSS+VVASVLVDMYAKCGSL+YSDKVFNRISNPSVIPYTS+I
Sbjct: 241 GRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMI 300

Query: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGI 360
           VSTAKYGFGRKSLQLFEEMVRKGLKPNHIT VGVLHACSHSGLPNEGL YLTSMYE+YGI
Sbjct: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGI 360

Query: 361 MPETKHYTCVVDMLARAGQLDKAYQLAKSMEVASDDKALLWGALLSASRCHGRVDIAAKA 420
           MPETKHYTCVVDMLAR G+LDKA+ LAKSM+VA DDKALLWGALLSASRCHGRVDIAA+A
Sbjct: 361 MPETKHYTCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEA 420

Query: 421 CQQLVNSNRQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSYV 480
           CQQLVNSNRQVAGAYVTLSN YASAGDMEKAH+LRVEMK TGV KEPGCSWIEIKDSSY+
Sbjct: 421 CQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYI 480

Query: 481 FYAGDITSCPRGDEVLRLLRELDRKMKERGYVRGSKGLVFVDIEEEAEEEKVWLHSERLA 540
           FYAG+ITSCPRGDEVL LLRELD+KMK+RGYVRG KGLVFVDIEEEAEEEKVWLHSERLA
Sbjct: 481 FYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLA 540

Query: 541 LGFGLISISKGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTCN 600
           LGFGLISI KGLTIRIMKNLRMCSDCHEAFKLISEI+EREFVVRDINRFHHFK+GCCTCN
Sbjct: 541 LGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCN 600

Query: 601 DFW 604
            FW
Sbjct: 601 GFW 603

BLAST of HG10010338 vs. NCBI nr
Match: XP_008458473.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g15720 [Cucumis melo])

HSP 1 Score: 1110.9 bits (2872), Expect = 0.0e+00
Identity = 546/603 (90.55%), Postives = 577/603 (95.69%), Query Frame = 0

Query: 1   MLKRNLHFALTRPCLSRENERLSFQTIQNLSRHLRNCNDLVSSISTHSIALKLGFLNHTV 60
           MLKRNLHFALTRP LSRENER   QTI+NLS HLRNCNDL+SSI THSIALKLGFLN+TV
Sbjct: 1   MLKRNLHFALTRPYLSRENERPFLQTIENLSLHLRNCNDLISSIFTHSIALKLGFLNNTV 60

Query: 61  AVNHLINCYVRFRNVATAHRLFDEMPNPNVVSWTSLMAGYIDNGRPNTALLLFRAMSRSS 120
            VNHLINCYVRFR++ATAHRLFDEMPNPNVVSWTSLMAGY+DNG+P+TALLLF  M RS 
Sbjct: 61  TVNHLINCYVRFRSIATAHRLFDEMPNPNVVSWTSLMAGYVDNGQPSTALLLFGEMLRSP 120

Query: 121 VVPNDFTFATAIKACSILSNLRHGEKFHAYVEIFGYGGNIVVCSSLIAMYGKCNDVVKAR 180
           VVPNDFTFATAIKACSILSNLR+GE+FHA+VEIFGYG NIVVCSSLI MYGKCNDVVKAR
Sbjct: 121 VVPNDFTFATAIKACSILSNLRYGERFHAHVEIFGYGCNIVVCSSLIDMYGKCNDVVKAR 180

Query: 181 SVFNSMSCKNIVSWNSMIAAYAQNAHGDDALKVFREFSSLSSEHPNPYMLASVISACASL 240
           SVFNSMSCKNIVSW SMIAAYAQNAHGD+ALKVFREF+SLSSE PNPYMLASVISACASL
Sbjct: 181 SVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSERPNPYMLASVISACASL 240

Query: 241 GRLVSGKVVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSII 300
           GRLVSGKVVHGAAI LGCDSS+VVASVLVDMYAKCGSL+YSD VFNRISNPSVIPYTS+I
Sbjct: 241 GRLVSGKVVHGAAICLGCDSSEVVASVLVDMYAKCGSLDYSDNVFNRISNPSVIPYTSMI 300

Query: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGI 360
           VSTAKYGFGRKSLQLFEEMVRKGLKPNHIT +GVLHACSHSGLPNEGL YLTSMYE+YGI
Sbjct: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITLLGVLHACSHSGLPNEGLYYLTSMYEKYGI 360

Query: 361 MPETKHYTCVVDMLARAGQLDKAYQLAKSMEVASDDKALLWGALLSASRCHGRVDIAAKA 420
           MPETKHYTCVVDMLAR+G+LDKA+ +AKSM+VA DDKALLWGALLSASRCHGRVDIAA+A
Sbjct: 361 MPETKHYTCVVDMLARSGELDKAFDIAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEA 420

Query: 421 CQQLVNSNRQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSYV 480
           CQQLVNSNRQVAGAYVTLSNAYASAGDMEKAH+LRVEMKHTGV KEPGCSWIEIKDSSY+
Sbjct: 421 CQQLVNSNRQVAGAYVTLSNAYASAGDMEKAHKLRVEMKHTGVHKEPGCSWIEIKDSSYI 480

Query: 481 FYAGDITSCPRGDEVLRLLRELDRKMKERGYVRGSKGLVFVDIEEEAEEEKVWLHSERLA 540
           FYAG+ITSCPRGDEVL LLRELD+KMK+RGYVRG KGLVFVDIEEEAEEEKVWLHSERLA
Sbjct: 481 FYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLA 540

Query: 541 LGFGLISISKGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTCN 600
           LGFGLISI KGLTIRIMKNLRMCSDCHEAFKLISEI+EREFVVRDINRFHHF+SGCCTCN
Sbjct: 541 LGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFQSGCCTCN 600

Query: 601 DFW 604
            FW
Sbjct: 601 GFW 603

BLAST of HG10010338 vs. NCBI nr
Match: XP_022138437.1 (pentatricopeptide repeat-containing protein At4g15720 [Momordica charantia])

HSP 1 Score: 1020.8 bits (2638), Expect = 4.9e-294
Identity = 502/603 (83.25%), Postives = 545/603 (90.38%), Query Frame = 0

Query: 1   MLKRNLHFALTRPCLSRENERLSFQTIQNLSRHLRNCNDLVSSISTHSIALKLGFLNHTV 60
           MLK N+HFALTRP LSRENERLSFQTI N SRHLRNCNDL+S+ S H IALKLGFL  T+
Sbjct: 1   MLKPNIHFALTRPRLSRENERLSFQTIANFSRHLRNCNDLISATSAHPIALKLGFLTQTL 60

Query: 61  AVNHLINCYVRFRNVATAHRLFDEMPNPNVVSWTSLMAGYIDNGRPNTALLLFRAMSRSS 120
            VNHLINCYVR R V+ AH LFDEMP PNVVSWTSLMAGYID  +P+TAL LF  MSRS 
Sbjct: 61  PVNHLINCYVRLRRVSIAHHLFDEMPTPNVVSWTSLMAGYIDACQPSTALSLFGEMSRSP 120

Query: 121 VVPNDFTFATAIKACSILSNLRHGEKFHAYVEIFGYGGNIVVCSSLIAMYGKCNDVVKAR 180
           VVPNDFTFATAIKACSILSNLR G+KFHAYVEI GYGGN+VVCSSLI MYGKCNDVV+AR
Sbjct: 121 VVPNDFTFATAIKACSILSNLREGKKFHAYVEISGYGGNVVVCSSLIDMYGKCNDVVRAR 180

Query: 181 SVFNSMSCKNIVSWNSMIAAYAQNAHGDDALKVFREFSSLSSEHPNPYMLASVISACASL 240
            VF+SM+CKNIVSW SMIA YAQNAHGDDALK+FREFSSL+ EHPN +MLAS ISACASL
Sbjct: 181 RVFDSMACKNIVSWTSMIATYAQNAHGDDALKLFREFSSLNYEHPNHFMLASAISACASL 240

Query: 241 GRLVSGKVVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSII 300
           GRLV G+VVHGA IRLG DS+DV++SVLVDMYAKCGSL  SDKVF+RI NPSVIPYTS+I
Sbjct: 241 GRLVLGRVVHGAVIRLGHDSNDVISSVLVDMYAKCGSLNCSDKVFSRILNPSVIPYTSMI 300

Query: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGI 360
           VS AKYG GR+SLQLFEEMVRKGLKPNH+TFVGVL+ACSHSGL +EGL YLTSMYER+GI
Sbjct: 301 VSRAKYGLGRQSLQLFEEMVRKGLKPNHVTFVGVLYACSHSGLLDEGLSYLTSMYERHGI 360

Query: 361 MPETKHYTCVVDMLARAGQLDKAYQLAKSMEVASDDKALLWGALLSASRCHGRVDIAAKA 420
           MPE+KHYTCVVDMLAR GQLD+AYQLAKSMEV  DD+ALLWGALLSASR HGRVDIA +A
Sbjct: 361 MPESKHYTCVVDMLARVGQLDRAYQLAKSMEVEPDDEALLWGALLSASRYHGRVDIAVEA 420

Query: 421 CQQLVNSNRQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSYV 480
           CQ+LVNSN+QVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTG+ KEPGCSW+EIK+ +YV
Sbjct: 421 CQRLVNSNQQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGINKEPGCSWLEIKNLTYV 480

Query: 481 FYAGDITSCPRGDEVLRLLRELDRKMKERGYVRGSKGLVFVDIEEEAEEEKVWLHSERLA 540
           FYAGD+ SCP   EVL LLREL+RKMK+RGY RGSKGLVFVD+EEEAEEE V LHSERLA
Sbjct: 481 FYAGDVESCPHSKEVLCLLRELERKMKDRGYGRGSKGLVFVDVEEEAEEEAVGLHSERLA 540

Query: 541 LGFGLISISKGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTCN 600
           LGFGLISI KG+TIRIMKNLRMCSDCHEAFKLISEIVER+FVVRD+NRFHHF  GCCTCN
Sbjct: 541 LGFGLISIPKGITIRIMKNLRMCSDCHEAFKLISEIVERDFVVRDVNRFHHFTGGCCTCN 600

Query: 601 DFW 604
           DFW
Sbjct: 601 DFW 603

BLAST of HG10010338 vs. NCBI nr
Match: KAG7026300.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1015.0 bits (2623), Expect = 2.7e-292
Identity = 505/604 (83.61%), Postives = 548/604 (90.73%), Query Frame = 0

Query: 1   MLKRNLHFALTRPCLSRENERLSFQTIQNLSRHLRNCNDLVSSISTHSIALKLGFLNHTV 60
           MLKR+L FAL        NER   QTI+NLS HLR+C DL+SS S HSIALKLGFLN T+
Sbjct: 1   MLKRSLPFAL--------NERPPLQTIENLSHHLRSCKDLISSTSIHSIALKLGFLNQTL 60

Query: 61  AVNHLINCYVRFRNVATAHRLFDEMPNPNVVSWTSLMAGYIDNGRPNTALLLFRAMSRSS 120
             NHLINCYVRFR VA AH+LFDEMP  NVVSWTSLMAGY+D+G+P+ AL LF AMSR+S
Sbjct: 61  TANHLINCYVRFRRVAIAHQLFDEMPTRNVVSWTSLMAGYVDDGQPSIALSLFGAMSRTS 120

Query: 121 VVPNDFTFATAIKACSILSNLRHGEKFHAYVEIFGYGGNIVVCSSLIAMYGKCNDVVKAR 180
           VVPNDFTFATAIKACSILSNLR GEKFHA +EIFGYGGN+VVCSSLI MYGKCNDV++AR
Sbjct: 121 VVPNDFTFATAIKACSILSNLRDGEKFHACMEIFGYGGNVVVCSSLIDMYGKCNDVIRAR 180

Query: 181 SVFNSMSCKNIVSWNSMIAAYAQNAHGDDALKVFREFSSLSSEHPNPYMLASVISACASL 240
            VF+SMSCKNIVSW SMIAAYAQNAHGDDALK+FREF S S E PN +MLASVISACA L
Sbjct: 181 RVFDSMSCKNIVSWTSMIAAYAQNAHGDDALKLFREFGSSSWERPNHFMLASVISACACL 240

Query: 241 GRLVSGKVVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSII 300
           GRLVSGKVVH AAIRLGCDS+DVVASVLVDMYAKCGSLEYSD+VF RI NP VI YTS+I
Sbjct: 241 GRLVSGKVVHSAAIRLGCDSNDVVASVLVDMYAKCGSLEYSDRVFKRILNPCVISYTSMI 300

Query: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGI 360
           VSTAKYG GR+SL+LFEEMVRKGLKPNH+TFVGVLHACSHSGLP+EGL YLTSMYE++GI
Sbjct: 301 VSTAKYGLGRQSLELFEEMVRKGLKPNHVTFVGVLHACSHSGLPDEGLHYLTSMYEKHGI 360

Query: 361 MPETKHYTCVVDMLARAGQLDKAYQLAKSME-VASDDKALLWGALLSASRCHGRVDIAAK 420
           MPETKHYTCVVDMLARAG+LDKAYQLAKSM+ +  DD+ALLWGALLS SR HGRVDIAA+
Sbjct: 361 MPETKHYTCVVDMLARAGELDKAYQLAKSMKAMPDDDQALLWGALLSTSRLHGRVDIAAE 420

Query: 421 ACQQLVNSNRQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSY 480
           ACQ+LV++NRQVAGAYVTLSNAYASAGDMEKA RLRVEMKH+GV KEPGCSW+EIKDSSY
Sbjct: 421 ACQRLVDANRQVAGAYVTLSNAYASAGDMEKAQRLRVEMKHSGVYKEPGCSWVEIKDSSY 480

Query: 481 VFYAGDITSCPRGDEVLRLLRELDRKMKERGYVRGSKGLVFVDIEEEAEEEKVWLHSERL 540
           VFYAGD+ SCPRGDEVLRLLREL+RKMK+RGY RGSKGLVFVDIEEEAEEEKVWLHSERL
Sbjct: 481 VFYAGDVMSCPRGDEVLRLLRELERKMKDRGYSRGSKGLVFVDIEEEAEEEKVWLHSERL 540

Query: 541 ALGFGLISISKGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTC 600
           ALGF LISI +GLTIRIMKNLRMCSDCHEAFKLISEIVER+FVVRDINRFHHFK+G CTC
Sbjct: 541 ALGFCLISIPEGLTIRIMKNLRMCSDCHEAFKLISEIVERDFVVRDINRFHHFKTGYCTC 596

Query: 601 NDFW 604
           NDFW
Sbjct: 601 NDFW 596

BLAST of HG10010338 vs. ExPASy Swiss-Prot
Match: Q8VYH0 (Pentatricopeptide repeat-containing protein At4g15720 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H1 PE=2 SV=1)

HSP 1 Score: 646.4 bits (1666), Expect = 3.3e-184
Identity = 325/567 (57.32%), Postives = 426/567 (75.13%), Query Frame = 0

Query: 47  HSIALKLGFLNHTVAVNHLINCYVRFRNVATAHRLFDEMPNPNVVSWTSLMAGYIDNGRP 106
           H++ LKLGF + T  VNHL+  YV+ + + TA +LFDEM  PNVVSWTS+++GY D G+P
Sbjct: 52  HTLTLKLGFASDTFTVNHLVISYVKLKEINTARKLFDEMCEPNVVSWTSVISGYNDMGKP 111

Query: 107 NTALLLFRAMSRS-SVVPNDFTFATAIKACSILSNLRHGEKFHAYVEIFGYGGNIVVCSS 166
             AL +F+ M     V PN++TFA+  KACS L+  R G+  HA +EI G   NIVV SS
Sbjct: 112 QNALSMFQKMHEDRPVPPNEYTFASVFKACSALAESRIGKNIHARLEISGLRRNIVVSSS 171

Query: 167 LIAMYGKCNDVVKARSVFNSM--SCKNIVSWNSMIAAYAQNAHGDDALKVFREF-SSLSS 226
           L+ MYGKCNDV  AR VF+SM    +N+VSW SMI AYAQNA G +A+++FR F ++L+S
Sbjct: 172 LVDMYGKCNDVETARRVFDSMIGYGRNVVSWTSMITAYAQNARGHEAIELFRSFNAALTS 231

Query: 227 EHPNPYMLASVISACASLGRLVSGKVVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSD 286
           +  N +MLASVISAC+SLGRL  GKV HG   R G +S+ VVA+ L+DMYAKCGSL  ++
Sbjct: 232 DRANQFMLASVISACSSLGRLQWGKVAHGLVTRGGYESNTVVATSLLDMYAKCGSLSCAE 291

Query: 287 KVFNRISNPSVIPYTSIIVSTAKYGFGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSG 346
           K+F RI   SVI YTS+I++ AK+G G  +++LF+EMV   + PN++T +GVLHACSHSG
Sbjct: 292 KIFLRIRCHSVISYTSMIMAKAKHGLGEAAVKLFDEMVAGRINPNYVTLLGVLHACSHSG 351

Query: 347 LPNEGLDYLTSMYERYGIMPETKHYTCVVDMLARAGQLDKAYQLAKSMEVASDDKALLWG 406
           L NEGL+YL+ M E+YG++P+++HYTCVVDML R G++D+AY+LAK++EV ++  ALLWG
Sbjct: 352 LVNEGLEYLSLMAEKYGVVPDSRHYTCVVDMLGRFGRVDEAYELAKTIEVGAEQGALLWG 411

Query: 407 ALLSASRCHGRVDIAAKACQQLVNSNRQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTG 466
           ALLSA R HGRV+I ++A ++L+ SN+QV  AY+ LSNAYA +G  E +  LR+EMK +G
Sbjct: 412 ALLSAGRLHGRVEIVSEASKRLIQSNQQVTSAYIALSNAYAVSGGWEDSESLRLEMKRSG 471

Query: 467 VQKEPGCSWIEIKDSSYVFYAGDITSCPRGDEVLRLLRELDRKMKERGYVRGSKGL---- 526
             KE  CSWIE KDS YVF+AGD+ SC    E+ R L++L+++MKERG+ RGS  +    
Sbjct: 472 NVKERACSWIENKDSVYVFHAGDL-SCDESGEIERFLKDLEKRMKERGH-RGSSSMITTS 531

Query: 527 --VFVDIEEEAEEEKVWLHSERLALGFGLISISKGLTIRIMKNLRMCSDCHEAFKLISEI 586
             VFVD++EEA++E V LH ERLAL +GL+ +  G TIRIM NLRMC DCHEAFKLISEI
Sbjct: 532 SSVFVDVDEEAKDEMVSLHCERLALAYGLLHLPAGSTIRIMNNLRMCRDCHEAFKLISEI 591

Query: 587 VEREFVVRDINRFHHFKSGCCTCNDFW 604
           VERE VVRD+NRFH FK+G CTC D+W
Sbjct: 592 VEREIVVRDVNRFHCFKNGSCTCRDYW 616

BLAST of HG10010338 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 412.1 bits (1058), Expect = 1.0e-113
Identity = 208/536 (38.81%), Postives = 333/536 (62.13%), Query Frame = 0

Query: 68  CYVRFRNVATAHRLFDEMPNPNVVSWTSLMAGYIDNGRPNTALLLFRAMSRSSVVPNDFT 127
           C + F  + +  R+F+ MP  +VVS+ +++AGY  +G    AL + R M  + + P+ FT
Sbjct: 186 CIMPF-GIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFT 245

Query: 128 FATAIKACSILSNLRHGEKFHAYVEIFGYGGNIVVCSSLIAMYGKCNDVVKARSVFNSMS 187
            ++ +   S   ++  G++ H YV   G   ++ + SSL+ MY K   +  +  VF+ + 
Sbjct: 246 LSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLY 305

Query: 188 CKNIVSWNSMIAAYAQNAHGDDALKVFREFSSLSSEHPNPYMLASVISACASLGRLVSGK 247
           C++ +SWNS++A Y QN   ++AL++FR+  + +   P     +SVI ACA L  L  GK
Sbjct: 306 CRDGISWNSLVAGYVQNGRYNEALRLFRQMVT-AKVKPGAVAFSSVIPACAHLATLHLGK 365

Query: 248 VVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSIIVSTAKYG 307
            +HG  +R G  S+  +AS LVDMY+KCG+++ + K+F+R++    + +T+II+  A +G
Sbjct: 366 QLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHG 425

Query: 308 FGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGIMPETKHY 367
            G +++ LFEEM R+G+KPN + FV VL ACSH GL +E   Y  SM + YG+  E +HY
Sbjct: 426 HGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHY 485

Query: 368 TCVVDMLARAGQLDKAYQLAKSMEVASDDKALLWGALLSASRCHGRVDIAAKACQQLVNS 427
             V D+L RAG+L++AY     M V  +    +W  LLS+   H  +++A K  +++   
Sbjct: 486 AAVADLLGRAGKLEEAYNFISKMCV--EPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTV 545

Query: 428 NRQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSYVFYAGDIT 487
           + +  GAYV + N YAS G  ++  +LR+ M+  G++K+P CSWIE+K+ ++ F +GD  
Sbjct: 546 DSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGD-R 605

Query: 488 SCPRGDEVLRLLRELDRKMKERGYVRGSKGLVFVDIEEEAEEEKVWLHSERLALGFGLIS 547
           S P  D++   L+ +  +M++ GYV  + G V  D++EE + E ++ HSERLA+ FG+I+
Sbjct: 606 SHPSMDKINEFLKAVMEQMEKEGYVADTSG-VLHDVDEEHKRELLFGHSERLAVAFGIIN 665

Query: 548 ISKGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTCNDFW 604
              G TIR+ KN+R+C+DCH A K IS+I ERE +VRD +RFHHF  G C+C D+W
Sbjct: 666 TEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDYW 715

BLAST of HG10010338 vs. ExPASy Swiss-Prot
Match: O23266 (Pentatricopeptide repeat-containing protein At4g14050, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H13 PE=1 SV=3)

HSP 1 Score: 409.1 bits (1050), Expect = 8.9e-113
Identity = 219/598 (36.62%), Postives = 354/598 (59.20%), Query Frame = 0

Query: 40  LVSSISTHSIALKLGFLNHTVAVNHLINCYVRFRNVATAHRLFDEMPNPNVVSWTSLMAG 99
           L ++ + H+  +KLG +      N L+N Y +    + A ++FDEMP+ + ++W S++  
Sbjct: 19  LTTAKALHAHIVKLGIVQCCPLANTLVNVYGKCGAASHALQVFDEMPHRDHIAWASVLTA 78

Query: 100 YIDNGRPNTALLLFRAM-SRSSVVPNDFTFATAIKACSILSNLRHGEKFHAYVEIFGYGG 159
                     L +F ++ S S + P+DF F+  +KAC+ L ++ HG + H +  +  Y  
Sbjct: 79  LNQANLSGKTLSVFSSVGSSSGLRPDDFVFSALVKACANLGSIDHGRQVHCHFIVSEYAN 138

Query: 160 NIVVCSSLIAMYGKCNDVVKARSVFNSMSCKNIVSWNSMIAAYAQNAHGDDALKVFR--- 219
           + VV SSL+ MY KC  +  A++VF+S+  KN +SW +M++ YA++   ++AL++FR   
Sbjct: 139 DEVVKSSLVDMYAKCGLLNSAKAVFDSIRVKNTISWTAMVSGYAKSGRKEEALELFRILP 198

Query: 220 -------------------------EFSSLSSEHP---NPYMLASVISACASLGRLVSGK 279
                                     F+ +  E     +P +L+S++ ACA+L   ++G+
Sbjct: 199 VKNLYSWTALISGFVQSGKGLEAFSVFTEMRRERVDILDPLVLSSIVGACANLAASIAGR 258

Query: 280 VVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSIIVSTAKYG 339
            VHG  I LG DS   +++ L+DMYAKC  +  +  +F+R+ +  V+ +TS+IV  A++G
Sbjct: 259 QVHGLVIALGFDSCVFISNALIDMYAKCSDVIAAKDIFSRMRHRDVVSWTSLIVGMAQHG 318

Query: 340 FGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGIMPETKHY 399
              K+L L+++MV  G+KPN +TFVG+++ACSH G   +G +   SM + YGI P  +HY
Sbjct: 319 QAEKALALYDDMVSHGVKPNEVTFVGLIYACSHVGFVEKGRELFQSMTKDYGIRPSLQHY 378

Query: 400 TCVVDMLARAGQLDKAYQLAKSMEVASDDKALLWGALLSASRCHGRVDIAAKACQQLVNS 459
           TC++D+L R+G LD+A  L  +M    D+    W ALLSA +  GR  +  +    LV+S
Sbjct: 379 TCLLDLLGRSGLLDEAENLIHTMPFPPDEPT--WAALLSACKRQGRGQMGIRIADHLVSS 438

Query: 460 NR-QVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSYVFYAGDI 519
            + +    Y+ LSN YASA    K    R ++    V+K+PG S +E++  + VFYAG+ 
Sbjct: 439 FKLKDPSTYILLSNIYASASLWGKVSEARRKLGEMEVRKDPGHSSVEVRKETEVFYAGE- 498

Query: 520 TSCPRGDEVLRLLRELDRKMKER-GYVRGSKGLVFVDIEEEAEEEKVWLHSERLALGFGL 579
           TS P  +++ RLL++L+ +M+ R GYV  +  ++  D++E+ +E+ ++ HSER A+ +GL
Sbjct: 499 TSHPLKEDIFRLLKKLEEEMRIRNGYVPDTSWILH-DMDEQEKEKLLFWHSERSAVAYGL 558

Query: 580 ISISKGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTCNDFW 604
           +    G  IRI+KNLR+C DCH   K ISEI ERE +VRD  R+HHFK G C+CNDFW
Sbjct: 559 LKAVPGTPIRIVKNLRVCGDCHVVLKHISEITEREIIVRDATRYHHFKGGKCSCNDFW 612

BLAST of HG10010338 vs. ExPASy Swiss-Prot
Match: O23169 (Pentatricopeptide repeat-containing protein At4g37170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H5 PE=3 SV=1)

HSP 1 Score: 408.7 bits (1049), Expect = 1.2e-112
Identity = 209/542 (38.56%), Postives = 326/542 (60.15%), Query Frame = 0

Query: 63  NHLINCYVRFRNVATAHRLFDEMPNPNVVSWTSLMAGYIDNGRPNTALLLFRAMSR-SSV 122
           N ++N Y     +  A +LFDEM   +  SWT+++ GY+   +P  AL+L+  M R  + 
Sbjct: 155 NVMVNGYAEVGLLEEARKLFDEMTEKDSYSWTAMVTGYVKKDQPEEALVLYSLMQRVPNS 214

Query: 123 VPNDFTFATAIKACSILSNLRHGEKFHAYVEIFGYGGNIVVCSSLIAMYGKCNDVVKARS 182
            PN FT + A+ A + +  +R G++ H ++   G   + V+ SSL+ MYGKC  + +AR+
Sbjct: 215 RPNIFTVSIAVAAAAAVKCIRRGKEIHGHIVRAGLDSDEVLWSSLMDMYGKCGCIDEARN 274

Query: 183 VFNSMSCKNIVSWNSMIAAYAQNAHGDDALKVFREFSSLSSEHPNPYMLASVISACASLG 242
           +F+ +  K++VSW SMI  Y +++   +   +F E    S E PN Y  A V++ACA L 
Sbjct: 275 IFDKIVEKDVVSWTSMIDRYFKSSRWREGFSLFSELVG-SCERPNEYTFAGVLNACADLT 334

Query: 243 RLVSGKVVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSIIV 302
               GK VHG   R+G D     +S LVDMY KCG++E +  V +    P ++ +TS+I 
Sbjct: 335 TEELGKQVHGYMTRVGFDPYSFASSSLVDMYTKCGNIESAKHVVDGCPKPDLVSWTSLIG 394

Query: 303 STAKYGFGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGIM 362
             A+ G   ++L+ F+ +++ G KP+H+TFV VL AC+H+GL  +GL++  S+ E++ + 
Sbjct: 395 GCAQNGQPDEALKYFDLLLKSGTKPDHVTFVNVLSACTHAGLVEKGLEFFYSITEKHRLS 454

Query: 363 PETKHYTCVVDMLARAGQLDKAYQLAKSMEVASDDKALLWGALLSASRCHGRVDIAAKAC 422
             + HYTC+VD+LAR+G+ ++   +   M +       LW ++L     +G +D+A +A 
Sbjct: 455 HTSDHYTCLVDLLARSGRFEQLKSVISEMPM--KPSKFLWASVLGGCSTYGNIDLAEEAA 514

Query: 423 QQLVNSNRQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSYVF 482
           Q+L     +    YVT++N YA+AG  E+  ++R  M+  GV K PG SW EIK   +VF
Sbjct: 515 QELFKIEPENPVTYVTMANIYAAAGKWEEEGKMRKRMQEIGVTKRPGSSWTEIKRKRHVF 574

Query: 483 YAGDITSCPRGDEVLRLLRELDRKMKERGYVRGSKGLVFVDIEEEAEEEKVWLHSERLAL 542
            A D TS P  ++++  LREL +KMKE GYV  +  LV  D+E+E +EE +  HSE+LA+
Sbjct: 575 IAAD-TSHPMYNQIVEFLRELRKKMKEEGYVPAT-SLVLHDVEDEQKEENLVYHSEKLAV 634

Query: 543 GFGLISISKGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTCND 602
            F ++S  +G  I++ KNLR C DCH A K IS I +R+  VRD  RFH F++G C+C D
Sbjct: 635 AFAILSTEEGTAIKVFKNLRSCVDCHGAIKFISNITKRKITVRDSTRFHCFENGQCSCGD 691

Query: 603 FW 604
           +W
Sbjct: 695 YW 691

BLAST of HG10010338 vs. ExPASy Swiss-Prot
Match: Q9STF3 (Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CRR2 PE=2 SV=1)

HSP 1 Score: 404.4 bits (1038), Expect = 2.2e-111
Identity = 223/594 (37.54%), Postives = 342/594 (57.58%), Query Frame = 0

Query: 15  LSRENERLSFQTIQNLSRHLRNCNDLVSSISTHSIALKLGFLNHTVAVNHLINCYVRFRN 74
           LS+E+   S QT + L     + + L  ++  H   L  G          LI  Y    +
Sbjct: 69  LSQESSP-SQQTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGMYSDLGS 128

Query: 75  VATAHRLFDEMPNPNVVSWTSLMAGYIDNGRPNTALLLFRAMSRSSVVPNDFTFATAIKA 134
           V  A ++FD+     +  W +L       G     L L+  M+R  V  + FT+   +KA
Sbjct: 129 VDYARKVFDKTRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTYVLKA 188

Query: 135 C----SILSNLRHGEKFHAYVEIFGYGGNIVVCSSLIAMYGKCNDVVKARSVFNSMSCKN 194
           C      +++L  G++ HA++   GY  ++ + ++L+ MY +   V  A  VF  M  +N
Sbjct: 189 CVASECTVNHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRN 248

Query: 195 IVSWNSMIAAYAQNAHGDDALKVFRE-FSSLSSEHPNPYMLASVISACASLGRLVSGKVV 254
           +VSW++MIA YA+N    +AL+ FRE         PN   + SV+ ACASL  L  GK++
Sbjct: 249 VVSWSAMIACYAKNGKAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLI 308

Query: 255 HGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSIIVSTAKYGFG 314
           HG  +R G DS   V S LV MY +CG LE   +VF+R+ +  V+ + S+I S   +G+G
Sbjct: 309 HGYILRRGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYG 368

Query: 315 RKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGIMPETKHYTC 374
           +K++Q+FEEM+  G  P  +TFV VL ACSH GL  EG     +M+  +GI P+ +HY C
Sbjct: 369 KKAIQIFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYAC 428

Query: 375 VVDMLARAGQLDKAYQLAKSMEVASDDKALLWGALLSASRCHGRVDIAAKACQQLVNSNR 434
           +VD+L RA +LD+A ++ + M      K  +WG+LL + R HG V++A +A ++L     
Sbjct: 429 MVDLLGRANRLDEAAKMVQDMRTEPGPK--VWGSLLGSCRIHGNVELAERASRRLFALEP 488

Query: 435 QVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSYVFYAGDITSC 494
           + AG YV L++ YA A   ++  R++  ++H G+QK PG  W+E++   Y F + D  + 
Sbjct: 489 KNAGNYVLLADIYAEAQMWDEVKRVKKLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFN- 548

Query: 495 PRGDEVLRLLRELDRKMKERGYVRGSKGLVFVDIEEEAEEEKVWLHSERLALGFGLISIS 554
           P  +++   L +L   MKE+GY+  +KG+++ ++E E +E  V  HSE+LAL FGLI+ S
Sbjct: 549 PLMEQIHAFLVKLAEDMKEKGYIPQTKGVLY-ELETEEKERIVLGHSEKLALAFGLINTS 608

Query: 555 KGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTCNDFW 604
           KG  IRI KNLR+C DCH   K IS+ +E+E +VRD+NRFH FK+G C+C D+W
Sbjct: 609 KGEPIRITKNLRLCEDCHLFTKFISKFMEKEILVRDVNRFHRFKNGVCSCGDYW 657

BLAST of HG10010338 vs. ExPASy TrEMBL
Match: A0A0A0KEC7 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G197240 PE=3 SV=1)

HSP 1 Score: 1114.8 bits (2882), Expect = 0.0e+00
Identity = 547/603 (90.71%), Postives = 576/603 (95.52%), Query Frame = 0

Query: 1   MLKRNLHFALTRPCLSRENERLSFQTIQNLSRHLRNCNDLVSSISTHSIALKLGFLNHTV 60
           MLKRNL+FALTRP LSRENER   QTI+NLSRHLRNCNDL+SSISTHSIALKLGFLN+TV
Sbjct: 1   MLKRNLYFALTRPYLSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTV 60

Query: 61  AVNHLINCYVRFRNVATAHRLFDEMPNPNVVSWTSLMAGYIDNGRPNTALLLFRAMSRSS 120
            VNHLINCYVRFR++ATAH+LFDEMPNPNVVSWTSLMAGY+DNG+P+TAL LF  M RS 
Sbjct: 61  NVNHLINCYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSP 120

Query: 121 VVPNDFTFATAIKACSILSNLRHGEKFHAYVEIFGYGGNIVVCSSLIAMYGKCNDVVKAR 180
           VVPNDFTFATAIKACSILSNLRHGE FHA+VEIFGYGGNIVVCSSLI MYGKCNDVVKAR
Sbjct: 121 VVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKAR 180

Query: 181 SVFNSMSCKNIVSWNSMIAAYAQNAHGDDALKVFREFSSLSSEHPNPYMLASVISACASL 240
            VFNSMSCKNIVSW SMIAAYAQNAHGD+ALKVFREF+SLSSEHPNPYMLASVISACASL
Sbjct: 181 GVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASL 240

Query: 241 GRLVSGKVVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSII 300
           GRLVSGKV+HGAAI LGCDSS+VVASVLVDMYAKCGSL+YSDKVFNRISNPSVIPYTS+I
Sbjct: 241 GRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMI 300

Query: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGI 360
           VSTAKYGFGRKSLQLFEEMVRKGLKPNHIT VGVLHACSHSGLPNEGL YLTSMYE+YGI
Sbjct: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGI 360

Query: 361 MPETKHYTCVVDMLARAGQLDKAYQLAKSMEVASDDKALLWGALLSASRCHGRVDIAAKA 420
           MPETKHYTCVVDMLAR G+LDKA+ LAKSM+VA DDKALLWGALLSASRCHGRVDIAA+A
Sbjct: 361 MPETKHYTCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEA 420

Query: 421 CQQLVNSNRQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSYV 480
           CQQLVNSNRQVAGAYVTLSN YASAGDMEKAH+LRVEMK TGV KEPGCSWIEIKDSSY+
Sbjct: 421 CQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYI 480

Query: 481 FYAGDITSCPRGDEVLRLLRELDRKMKERGYVRGSKGLVFVDIEEEAEEEKVWLHSERLA 540
           FYAG+ITSCPRGDEVL LLRELD+KMK+RGYVRG KGLVFVDIEEEAEEEKVWLHSERLA
Sbjct: 481 FYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLA 540

Query: 541 LGFGLISISKGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTCN 600
           LGFGLISI KGLTIRIMKNLRMCSDCHEAFKLISEI+EREFVVRDINRFHHFK+GCCTCN
Sbjct: 541 LGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCN 600

Query: 601 DFW 604
            FW
Sbjct: 601 GFW 603

BLAST of HG10010338 vs. ExPASy TrEMBL
Match: A0A1S3C8I5 (pentatricopeptide repeat-containing protein At4g15720 OS=Cucumis melo OX=3656 GN=LOC103497869 PE=3 SV=1)

HSP 1 Score: 1110.9 bits (2872), Expect = 0.0e+00
Identity = 546/603 (90.55%), Postives = 577/603 (95.69%), Query Frame = 0

Query: 1   MLKRNLHFALTRPCLSRENERLSFQTIQNLSRHLRNCNDLVSSISTHSIALKLGFLNHTV 60
           MLKRNLHFALTRP LSRENER   QTI+NLS HLRNCNDL+SSI THSIALKLGFLN+TV
Sbjct: 1   MLKRNLHFALTRPYLSRENERPFLQTIENLSLHLRNCNDLISSIFTHSIALKLGFLNNTV 60

Query: 61  AVNHLINCYVRFRNVATAHRLFDEMPNPNVVSWTSLMAGYIDNGRPNTALLLFRAMSRSS 120
            VNHLINCYVRFR++ATAHRLFDEMPNPNVVSWTSLMAGY+DNG+P+TALLLF  M RS 
Sbjct: 61  TVNHLINCYVRFRSIATAHRLFDEMPNPNVVSWTSLMAGYVDNGQPSTALLLFGEMLRSP 120

Query: 121 VVPNDFTFATAIKACSILSNLRHGEKFHAYVEIFGYGGNIVVCSSLIAMYGKCNDVVKAR 180
           VVPNDFTFATAIKACSILSNLR+GE+FHA+VEIFGYG NIVVCSSLI MYGKCNDVVKAR
Sbjct: 121 VVPNDFTFATAIKACSILSNLRYGERFHAHVEIFGYGCNIVVCSSLIDMYGKCNDVVKAR 180

Query: 181 SVFNSMSCKNIVSWNSMIAAYAQNAHGDDALKVFREFSSLSSEHPNPYMLASVISACASL 240
           SVFNSMSCKNIVSW SMIAAYAQNAHGD+ALKVFREF+SLSSE PNPYMLASVISACASL
Sbjct: 181 SVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSERPNPYMLASVISACASL 240

Query: 241 GRLVSGKVVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSII 300
           GRLVSGKVVHGAAI LGCDSS+VVASVLVDMYAKCGSL+YSD VFNRISNPSVIPYTS+I
Sbjct: 241 GRLVSGKVVHGAAICLGCDSSEVVASVLVDMYAKCGSLDYSDNVFNRISNPSVIPYTSMI 300

Query: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGI 360
           VSTAKYGFGRKSLQLFEEMVRKGLKPNHIT +GVLHACSHSGLPNEGL YLTSMYE+YGI
Sbjct: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITLLGVLHACSHSGLPNEGLYYLTSMYEKYGI 360

Query: 361 MPETKHYTCVVDMLARAGQLDKAYQLAKSMEVASDDKALLWGALLSASRCHGRVDIAAKA 420
           MPETKHYTCVVDMLAR+G+LDKA+ +AKSM+VA DDKALLWGALLSASRCHGRVDIAA+A
Sbjct: 361 MPETKHYTCVVDMLARSGELDKAFDIAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEA 420

Query: 421 CQQLVNSNRQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSYV 480
           CQQLVNSNRQVAGAYVTLSNAYASAGDMEKAH+LRVEMKHTGV KEPGCSWIEIKDSSY+
Sbjct: 421 CQQLVNSNRQVAGAYVTLSNAYASAGDMEKAHKLRVEMKHTGVHKEPGCSWIEIKDSSYI 480

Query: 481 FYAGDITSCPRGDEVLRLLRELDRKMKERGYVRGSKGLVFVDIEEEAEEEKVWLHSERLA 540
           FYAG+ITSCPRGDEVL LLRELD+KMK+RGYVRG KGLVFVDIEEEAEEEKVWLHSERLA
Sbjct: 481 FYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLA 540

Query: 541 LGFGLISISKGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTCN 600
           LGFGLISI KGLTIRIMKNLRMCSDCHEAFKLISEI+EREFVVRDINRFHHF+SGCCTCN
Sbjct: 541 LGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFQSGCCTCN 600

Query: 601 DFW 604
            FW
Sbjct: 601 GFW 603

BLAST of HG10010338 vs. ExPASy TrEMBL
Match: A0A6J1CB36 (pentatricopeptide repeat-containing protein At4g15720 OS=Momordica charantia OX=3673 GN=LOC111009611 PE=3 SV=1)

HSP 1 Score: 1020.8 bits (2638), Expect = 2.4e-294
Identity = 502/603 (83.25%), Postives = 545/603 (90.38%), Query Frame = 0

Query: 1   MLKRNLHFALTRPCLSRENERLSFQTIQNLSRHLRNCNDLVSSISTHSIALKLGFLNHTV 60
           MLK N+HFALTRP LSRENERLSFQTI N SRHLRNCNDL+S+ S H IALKLGFL  T+
Sbjct: 1   MLKPNIHFALTRPRLSRENERLSFQTIANFSRHLRNCNDLISATSAHPIALKLGFLTQTL 60

Query: 61  AVNHLINCYVRFRNVATAHRLFDEMPNPNVVSWTSLMAGYIDNGRPNTALLLFRAMSRSS 120
            VNHLINCYVR R V+ AH LFDEMP PNVVSWTSLMAGYID  +P+TAL LF  MSRS 
Sbjct: 61  PVNHLINCYVRLRRVSIAHHLFDEMPTPNVVSWTSLMAGYIDACQPSTALSLFGEMSRSP 120

Query: 121 VVPNDFTFATAIKACSILSNLRHGEKFHAYVEIFGYGGNIVVCSSLIAMYGKCNDVVKAR 180
           VVPNDFTFATAIKACSILSNLR G+KFHAYVEI GYGGN+VVCSSLI MYGKCNDVV+AR
Sbjct: 121 VVPNDFTFATAIKACSILSNLREGKKFHAYVEISGYGGNVVVCSSLIDMYGKCNDVVRAR 180

Query: 181 SVFNSMSCKNIVSWNSMIAAYAQNAHGDDALKVFREFSSLSSEHPNPYMLASVISACASL 240
            VF+SM+CKNIVSW SMIA YAQNAHGDDALK+FREFSSL+ EHPN +MLAS ISACASL
Sbjct: 181 RVFDSMACKNIVSWTSMIATYAQNAHGDDALKLFREFSSLNYEHPNHFMLASAISACASL 240

Query: 241 GRLVSGKVVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSII 300
           GRLV G+VVHGA IRLG DS+DV++SVLVDMYAKCGSL  SDKVF+RI NPSVIPYTS+I
Sbjct: 241 GRLVLGRVVHGAVIRLGHDSNDVISSVLVDMYAKCGSLNCSDKVFSRILNPSVIPYTSMI 300

Query: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGI 360
           VS AKYG GR+SLQLFEEMVRKGLKPNH+TFVGVL+ACSHSGL +EGL YLTSMYER+GI
Sbjct: 301 VSRAKYGLGRQSLQLFEEMVRKGLKPNHVTFVGVLYACSHSGLLDEGLSYLTSMYERHGI 360

Query: 361 MPETKHYTCVVDMLARAGQLDKAYQLAKSMEVASDDKALLWGALLSASRCHGRVDIAAKA 420
           MPE+KHYTCVVDMLAR GQLD+AYQLAKSMEV  DD+ALLWGALLSASR HGRVDIA +A
Sbjct: 361 MPESKHYTCVVDMLARVGQLDRAYQLAKSMEVEPDDEALLWGALLSASRYHGRVDIAVEA 420

Query: 421 CQQLVNSNRQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSYV 480
           CQ+LVNSN+QVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTG+ KEPGCSW+EIK+ +YV
Sbjct: 421 CQRLVNSNQQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGINKEPGCSWLEIKNLTYV 480

Query: 481 FYAGDITSCPRGDEVLRLLRELDRKMKERGYVRGSKGLVFVDIEEEAEEEKVWLHSERLA 540
           FYAGD+ SCP   EVL LLREL+RKMK+RGY RGSKGLVFVD+EEEAEEE V LHSERLA
Sbjct: 481 FYAGDVESCPHSKEVLCLLRELERKMKDRGYGRGSKGLVFVDVEEEAEEEAVGLHSERLA 540

Query: 541 LGFGLISISKGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTCN 600
           LGFGLISI KG+TIRIMKNLRMCSDCHEAFKLISEIVER+FVVRD+NRFHHF  GCCTCN
Sbjct: 541 LGFGLISIPKGITIRIMKNLRMCSDCHEAFKLISEIVERDFVVRDVNRFHHFTGGCCTCN 600

Query: 601 DFW 604
           DFW
Sbjct: 601 DFW 603

BLAST of HG10010338 vs. ExPASy TrEMBL
Match: A0A6J1ER03 (pentatricopeptide repeat-containing protein At4g15720 OS=Cucurbita moschata OX=3662 GN=LOC111436901 PE=3 SV=1)

HSP 1 Score: 1014.2 bits (2621), Expect = 2.2e-292
Identity = 503/604 (83.28%), Postives = 548/604 (90.73%), Query Frame = 0

Query: 1   MLKRNLHFALTRPCLSRENERLSFQTIQNLSRHLRNCNDLVSSISTHSIALKLGFLNHTV 60
           MLKR+L FAL        NER   QTI+NLS HLR+C DL+SS S HSIALKLGFLN T+
Sbjct: 1   MLKRSLPFAL--------NERPPLQTIENLSHHLRSCKDLISSTSIHSIALKLGFLNQTL 60

Query: 61  AVNHLINCYVRFRNVATAHRLFDEMPNPNVVSWTSLMAGYIDNGRPNTALLLFRAMSRSS 120
             NHLINCYVRFR VA AH+LFDEMP  NVVSWTSLMAGY+DNG+P+ AL LF AMSR+S
Sbjct: 61  TANHLINCYVRFRRVAIAHQLFDEMPTRNVVSWTSLMAGYVDNGQPSIALSLFGAMSRTS 120

Query: 121 VVPNDFTFATAIKACSILSNLRHGEKFHAYVEIFGYGGNIVVCSSLIAMYGKCNDVVKAR 180
           VVPNDFTFATAIKACSILSNLR GEKFHA +EIFGYGGN+VVCSSLI MYGKCNDV++AR
Sbjct: 121 VVPNDFTFATAIKACSILSNLRDGEKFHACMEIFGYGGNVVVCSSLIDMYGKCNDVIRAR 180

Query: 181 SVFNSMSCKNIVSWNSMIAAYAQNAHGDDALKVFREFSSLSSEHPNPYMLASVISACASL 240
            VF+SMSCKNIVSW SMIAAYAQNAHGDDALK+FREF S S E PN +MLASVISACA L
Sbjct: 181 RVFDSMSCKNIVSWTSMIAAYAQNAHGDDALKLFREFGSSSWERPNHFMLASVISACACL 240

Query: 241 GRLVSGKVVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSII 300
           GRLV+GKVVH AAIRLGCDS+DVVASVLVDMYAKCGSLEYSD+VF RI NP VI YTS+I
Sbjct: 241 GRLVAGKVVHSAAIRLGCDSNDVVASVLVDMYAKCGSLEYSDRVFKRILNPCVISYTSMI 300

Query: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGI 360
           VSTAKYG GR+SL+LFEEMVRKGLKPNH+TFVGVLHACSHSGLP+EGL YLTSMYE++GI
Sbjct: 301 VSTAKYGLGRQSLELFEEMVRKGLKPNHVTFVGVLHACSHSGLPDEGLHYLTSMYEKHGI 360

Query: 361 MPETKHYTCVVDMLARAGQLDKAYQLAKSME-VASDDKALLWGALLSASRCHGRVDIAAK 420
           MPETKHYTCVVDMLARAG+LDKAYQLAKSM+ +  DD+ALLWGALLS SR HGRVDIAA+
Sbjct: 361 MPETKHYTCVVDMLARAGELDKAYQLAKSMKAMPDDDQALLWGALLSTSRLHGRVDIAAE 420

Query: 421 ACQQLVNSNRQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSY 480
           ACQ+LV++NRQVAGAYVTLSNAYASAGDMEKA RLRVEMKH+GV KEPGCSW+EIKDSSY
Sbjct: 421 ACQRLVDANRQVAGAYVTLSNAYASAGDMEKAQRLRVEMKHSGVYKEPGCSWVEIKDSSY 480

Query: 481 VFYAGDITSCPRGDEVLRLLRELDRKMKERGYVRGSKGLVFVDIEEEAEEEKVWLHSERL 540
           VFYAGD+ SCPRGDEVLRLLREL+RKMK+RGY RGSKGLVFVDIEEEAEEEKVWLHSERL
Sbjct: 481 VFYAGDVMSCPRGDEVLRLLRELERKMKDRGYSRGSKGLVFVDIEEEAEEEKVWLHSERL 540

Query: 541 ALGFGLISISKGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTC 600
           ALGF LISI +GLTIR+MKNLRMCSDCHEAFKLISEIVER+FVVRD+NRFHHFK+G CTC
Sbjct: 541 ALGFCLISIPEGLTIRMMKNLRMCSDCHEAFKLISEIVERDFVVRDVNRFHHFKTGYCTC 596

Query: 601 NDFW 604
           NDFW
Sbjct: 601 NDFW 596

BLAST of HG10010338 vs. ExPASy TrEMBL
Match: A0A6J1KGB2 (pentatricopeptide repeat-containing protein At4g15720 OS=Cucurbita maxima OX=3661 GN=LOC111494855 PE=3 SV=1)

HSP 1 Score: 1005.4 bits (2598), Expect = 1.0e-289
Identity = 500/604 (82.78%), Postives = 545/604 (90.23%), Query Frame = 0

Query: 1   MLKRNLHFALTRPCLSRENERLSFQTIQNLSRHLRNCNDLVSSISTHSIALKLGFLNHTV 60
           MLKR+L FAL        NER   QTI+NLS HLR+C DLVSS S HSIALKLGFLN T+
Sbjct: 1   MLKRSLPFAL--------NERPPLQTIENLSHHLRSCKDLVSSTSIHSIALKLGFLNQTI 60

Query: 61  AVNHLINCYVRFRNVATAHRLFDEMPNPNVVSWTSLMAGYIDNGRPNTALLLFRAMSRSS 120
             NHLINCYVRFR VA AH+LFDEMP  NVVSWTSLMAGY+++G+P  AL LF AMSR+S
Sbjct: 61  TANHLINCYVRFRRVAIAHQLFDEMPTRNVVSWTSLMAGYVNDGQPCIALSLFGAMSRTS 120

Query: 121 VVPNDFTFATAIKACSILSNLRHGEKFHAYVEIFGYGGNIVVCSSLIAMYGKCNDVVKAR 180
           VVPNDFTFATAIKACSILSNLR GEKFHA +EIFGYGGN+VVCSSLI MYGKCN+V++AR
Sbjct: 121 VVPNDFTFATAIKACSILSNLRDGEKFHACMEIFGYGGNVVVCSSLIDMYGKCNNVIRAR 180

Query: 181 SVFNSMSCKNIVSWNSMIAAYAQNAHGDDALKVFREFSSLSSEHPNPYMLASVISACASL 240
            VF+SMSCKNI+SW SMIAAYAQNAH DDALK+FREF S S EHPN +MLASVISACA L
Sbjct: 181 RVFDSMSCKNIISWTSMIAAYAQNAHSDDALKLFREFGSSSWEHPNHFMLASVISACACL 240

Query: 241 GRLVSGKVVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSII 300
           GRLVSGKVVH AAIRLGCDS+DVVASVLVDMYAKCGSLEYSD+VF RI NP VI YTS+I
Sbjct: 241 GRLVSGKVVHSAAIRLGCDSNDVVASVLVDMYAKCGSLEYSDRVFKRILNPCVISYTSMI 300

Query: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGI 360
           VSTAKYG GR+SL+LFEEMVRKGLKPNH+TFVGVLHACSHSGLP+EGL YLTSMYE++GI
Sbjct: 301 VSTAKYGLGRQSLELFEEMVRKGLKPNHVTFVGVLHACSHSGLPDEGLHYLTSMYEKHGI 360

Query: 361 MPETKHYTCVVDMLARAGQLDKAYQLAKSME-VASDDKALLWGALLSASRCHGRVDIAAK 420
           MPETKHYTCVVDMLARAG+LDKAYQLAKSM+ +  DD+ALLWGALLS SR HGRVDIAA+
Sbjct: 361 MPETKHYTCVVDMLARAGELDKAYQLAKSMKAMPDDDQALLWGALLSTSRLHGRVDIAAE 420

Query: 421 ACQQLVNSNRQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSY 480
           ACQ+LV++NRQVAGAYVTLSNAYASAGDMEKA RLRVEMKH+GV KEPGCSW+EIKDSSY
Sbjct: 421 ACQRLVDANRQVAGAYVTLSNAYASAGDMEKAQRLRVEMKHSGVYKEPGCSWVEIKDSSY 480

Query: 481 VFYAGDITSCPRGDEVLRLLRELDRKMKERGYVRGSKGLVFVDIEEEAEEEKVWLHSERL 540
           VFYAGD+ SCPRG+EVL LLREL+RKMK RGY RGSKGLVFVDIEEEAEEEKVWLHSERL
Sbjct: 481 VFYAGDVMSCPRGNEVLHLLRELERKMKGRGYSRGSKGLVFVDIEEEAEEEKVWLHSERL 540

Query: 541 ALGFGLISISKGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTC 600
           ALGF LISI +GLTIRIMKNLRMCSDCHEAFKLISEIVER+FVVRD+NRFHHFK+G CTC
Sbjct: 541 ALGFCLISIPEGLTIRIMKNLRMCSDCHEAFKLISEIVERDFVVRDVNRFHHFKTGYCTC 596

Query: 601 NDFW 604
           NDFW
Sbjct: 601 NDFW 596

BLAST of HG10010338 vs. TAIR 10
Match: AT4G15720.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 646.4 bits (1666), Expect = 2.3e-185
Identity = 325/567 (57.32%), Postives = 426/567 (75.13%), Query Frame = 0

Query: 47  HSIALKLGFLNHTVAVNHLINCYVRFRNVATAHRLFDEMPNPNVVSWTSLMAGYIDNGRP 106
           H++ LKLGF + T  VNHL+  YV+ + + TA +LFDEM  PNVVSWTS+++GY D G+P
Sbjct: 52  HTLTLKLGFASDTFTVNHLVISYVKLKEINTARKLFDEMCEPNVVSWTSVISGYNDMGKP 111

Query: 107 NTALLLFRAMSRS-SVVPNDFTFATAIKACSILSNLRHGEKFHAYVEIFGYGGNIVVCSS 166
             AL +F+ M     V PN++TFA+  KACS L+  R G+  HA +EI G   NIVV SS
Sbjct: 112 QNALSMFQKMHEDRPVPPNEYTFASVFKACSALAESRIGKNIHARLEISGLRRNIVVSSS 171

Query: 167 LIAMYGKCNDVVKARSVFNSM--SCKNIVSWNSMIAAYAQNAHGDDALKVFREF-SSLSS 226
           L+ MYGKCNDV  AR VF+SM    +N+VSW SMI AYAQNA G +A+++FR F ++L+S
Sbjct: 172 LVDMYGKCNDVETARRVFDSMIGYGRNVVSWTSMITAYAQNARGHEAIELFRSFNAALTS 231

Query: 227 EHPNPYMLASVISACASLGRLVSGKVVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSD 286
           +  N +MLASVISAC+SLGRL  GKV HG   R G +S+ VVA+ L+DMYAKCGSL  ++
Sbjct: 232 DRANQFMLASVISACSSLGRLQWGKVAHGLVTRGGYESNTVVATSLLDMYAKCGSLSCAE 291

Query: 287 KVFNRISNPSVIPYTSIIVSTAKYGFGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSG 346
           K+F RI   SVI YTS+I++ AK+G G  +++LF+EMV   + PN++T +GVLHACSHSG
Sbjct: 292 KIFLRIRCHSVISYTSMIMAKAKHGLGEAAVKLFDEMVAGRINPNYVTLLGVLHACSHSG 351

Query: 347 LPNEGLDYLTSMYERYGIMPETKHYTCVVDMLARAGQLDKAYQLAKSMEVASDDKALLWG 406
           L NEGL+YL+ M E+YG++P+++HYTCVVDML R G++D+AY+LAK++EV ++  ALLWG
Sbjct: 352 LVNEGLEYLSLMAEKYGVVPDSRHYTCVVDMLGRFGRVDEAYELAKTIEVGAEQGALLWG 411

Query: 407 ALLSASRCHGRVDIAAKACQQLVNSNRQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTG 466
           ALLSA R HGRV+I ++A ++L+ SN+QV  AY+ LSNAYA +G  E +  LR+EMK +G
Sbjct: 412 ALLSAGRLHGRVEIVSEASKRLIQSNQQVTSAYIALSNAYAVSGGWEDSESLRLEMKRSG 471

Query: 467 VQKEPGCSWIEIKDSSYVFYAGDITSCPRGDEVLRLLRELDRKMKERGYVRGSKGL---- 526
             KE  CSWIE KDS YVF+AGD+ SC    E+ R L++L+++MKERG+ RGS  +    
Sbjct: 472 NVKERACSWIENKDSVYVFHAGDL-SCDESGEIERFLKDLEKRMKERGH-RGSSSMITTS 531

Query: 527 --VFVDIEEEAEEEKVWLHSERLALGFGLISISKGLTIRIMKNLRMCSDCHEAFKLISEI 586
             VFVD++EEA++E V LH ERLAL +GL+ +  G TIRIM NLRMC DCHEAFKLISEI
Sbjct: 532 SSVFVDVDEEAKDEMVSLHCERLALAYGLLHLPAGSTIRIMNNLRMCRDCHEAFKLISEI 591

Query: 587 VEREFVVRDINRFHHFKSGCCTCNDFW 604
           VERE VVRD+NRFH FK+G CTC D+W
Sbjct: 592 VEREIVVRDVNRFHCFKNGSCTCRDYW 616

BLAST of HG10010338 vs. TAIR 10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 412.1 bits (1058), Expect = 7.4e-115
Identity = 208/536 (38.81%), Postives = 333/536 (62.13%), Query Frame = 0

Query: 68  CYVRFRNVATAHRLFDEMPNPNVVSWTSLMAGYIDNGRPNTALLLFRAMSRSSVVPNDFT 127
           C + F  + +  R+F+ MP  +VVS+ +++AGY  +G    AL + R M  + + P+ FT
Sbjct: 186 CIMPF-GIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFT 245

Query: 128 FATAIKACSILSNLRHGEKFHAYVEIFGYGGNIVVCSSLIAMYGKCNDVVKARSVFNSMS 187
            ++ +   S   ++  G++ H YV   G   ++ + SSL+ MY K   +  +  VF+ + 
Sbjct: 246 LSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLY 305

Query: 188 CKNIVSWNSMIAAYAQNAHGDDALKVFREFSSLSSEHPNPYMLASVISACASLGRLVSGK 247
           C++ +SWNS++A Y QN   ++AL++FR+  + +   P     +SVI ACA L  L  GK
Sbjct: 306 CRDGISWNSLVAGYVQNGRYNEALRLFRQMVT-AKVKPGAVAFSSVIPACAHLATLHLGK 365

Query: 248 VVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSIIVSTAKYG 307
            +HG  +R G  S+  +AS LVDMY+KCG+++ + K+F+R++    + +T+II+  A +G
Sbjct: 366 QLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHG 425

Query: 308 FGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGIMPETKHY 367
            G +++ LFEEM R+G+KPN + FV VL ACSH GL +E   Y  SM + YG+  E +HY
Sbjct: 426 HGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHY 485

Query: 368 TCVVDMLARAGQLDKAYQLAKSMEVASDDKALLWGALLSASRCHGRVDIAAKACQQLVNS 427
             V D+L RAG+L++AY     M V  +    +W  LLS+   H  +++A K  +++   
Sbjct: 486 AAVADLLGRAGKLEEAYNFISKMCV--EPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTV 545

Query: 428 NRQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSYVFYAGDIT 487
           + +  GAYV + N YAS G  ++  +LR+ M+  G++K+P CSWIE+K+ ++ F +GD  
Sbjct: 546 DSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGD-R 605

Query: 488 SCPRGDEVLRLLRELDRKMKERGYVRGSKGLVFVDIEEEAEEEKVWLHSERLALGFGLIS 547
           S P  D++   L+ +  +M++ GYV  + G V  D++EE + E ++ HSERLA+ FG+I+
Sbjct: 606 SHPSMDKINEFLKAVMEQMEKEGYVADTSG-VLHDVDEEHKRELLFGHSERLAVAFGIIN 665

Query: 548 ISKGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTCNDFW 604
              G TIR+ KN+R+C+DCH A K IS+I ERE +VRD +RFHHF  G C+C D+W
Sbjct: 666 TEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDYW 715

BLAST of HG10010338 vs. TAIR 10
Match: AT4G14050.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 409.1 bits (1050), Expect = 6.3e-114
Identity = 219/598 (36.62%), Postives = 354/598 (59.20%), Query Frame = 0

Query: 40  LVSSISTHSIALKLGFLNHTVAVNHLINCYVRFRNVATAHRLFDEMPNPNVVSWTSLMAG 99
           L ++ + H+  +KLG +      N L+N Y +    + A ++FDEMP+ + ++W S++  
Sbjct: 19  LTTAKALHAHIVKLGIVQCCPLANTLVNVYGKCGAASHALQVFDEMPHRDHIAWASVLTA 78

Query: 100 YIDNGRPNTALLLFRAM-SRSSVVPNDFTFATAIKACSILSNLRHGEKFHAYVEIFGYGG 159
                     L +F ++ S S + P+DF F+  +KAC+ L ++ HG + H +  +  Y  
Sbjct: 79  LNQANLSGKTLSVFSSVGSSSGLRPDDFVFSALVKACANLGSIDHGRQVHCHFIVSEYAN 138

Query: 160 NIVVCSSLIAMYGKCNDVVKARSVFNSMSCKNIVSWNSMIAAYAQNAHGDDALKVFR--- 219
           + VV SSL+ MY KC  +  A++VF+S+  KN +SW +M++ YA++   ++AL++FR   
Sbjct: 139 DEVVKSSLVDMYAKCGLLNSAKAVFDSIRVKNTISWTAMVSGYAKSGRKEEALELFRILP 198

Query: 220 -------------------------EFSSLSSEHP---NPYMLASVISACASLGRLVSGK 279
                                     F+ +  E     +P +L+S++ ACA+L   ++G+
Sbjct: 199 VKNLYSWTALISGFVQSGKGLEAFSVFTEMRRERVDILDPLVLSSIVGACANLAASIAGR 258

Query: 280 VVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSIIVSTAKYG 339
            VHG  I LG DS   +++ L+DMYAKC  +  +  +F+R+ +  V+ +TS+IV  A++G
Sbjct: 259 QVHGLVIALGFDSCVFISNALIDMYAKCSDVIAAKDIFSRMRHRDVVSWTSLIVGMAQHG 318

Query: 340 FGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGIMPETKHY 399
              K+L L+++MV  G+KPN +TFVG+++ACSH G   +G +   SM + YGI P  +HY
Sbjct: 319 QAEKALALYDDMVSHGVKPNEVTFVGLIYACSHVGFVEKGRELFQSMTKDYGIRPSLQHY 378

Query: 400 TCVVDMLARAGQLDKAYQLAKSMEVASDDKALLWGALLSASRCHGRVDIAAKACQQLVNS 459
           TC++D+L R+G LD+A  L  +M    D+    W ALLSA +  GR  +  +    LV+S
Sbjct: 379 TCLLDLLGRSGLLDEAENLIHTMPFPPDEPT--WAALLSACKRQGRGQMGIRIADHLVSS 438

Query: 460 NR-QVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSYVFYAGDI 519
            + +    Y+ LSN YASA    K    R ++    V+K+PG S +E++  + VFYAG+ 
Sbjct: 439 FKLKDPSTYILLSNIYASASLWGKVSEARRKLGEMEVRKDPGHSSVEVRKETEVFYAGE- 498

Query: 520 TSCPRGDEVLRLLRELDRKMKER-GYVRGSKGLVFVDIEEEAEEEKVWLHSERLALGFGL 579
           TS P  +++ RLL++L+ +M+ R GYV  +  ++  D++E+ +E+ ++ HSER A+ +GL
Sbjct: 499 TSHPLKEDIFRLLKKLEEEMRIRNGYVPDTSWILH-DMDEQEKEKLLFWHSERSAVAYGL 558

Query: 580 ISISKGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTCNDFW 604
           +    G  IRI+KNLR+C DCH   K ISEI ERE +VRD  R+HHFK G C+CNDFW
Sbjct: 559 LKAVPGTPIRIVKNLRVCGDCHVVLKHISEITEREIIVRDATRYHHFKGGKCSCNDFW 612

BLAST of HG10010338 vs. TAIR 10
Match: AT4G37170.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 408.7 bits (1049), Expect = 8.2e-114
Identity = 209/542 (38.56%), Postives = 326/542 (60.15%), Query Frame = 0

Query: 63  NHLINCYVRFRNVATAHRLFDEMPNPNVVSWTSLMAGYIDNGRPNTALLLFRAMSR-SSV 122
           N ++N Y     +  A +LFDEM   +  SWT+++ GY+   +P  AL+L+  M R  + 
Sbjct: 155 NVMVNGYAEVGLLEEARKLFDEMTEKDSYSWTAMVTGYVKKDQPEEALVLYSLMQRVPNS 214

Query: 123 VPNDFTFATAIKACSILSNLRHGEKFHAYVEIFGYGGNIVVCSSLIAMYGKCNDVVKARS 182
            PN FT + A+ A + +  +R G++ H ++   G   + V+ SSL+ MYGKC  + +AR+
Sbjct: 215 RPNIFTVSIAVAAAAAVKCIRRGKEIHGHIVRAGLDSDEVLWSSLMDMYGKCGCIDEARN 274

Query: 183 VFNSMSCKNIVSWNSMIAAYAQNAHGDDALKVFREFSSLSSEHPNPYMLASVISACASLG 242
           +F+ +  K++VSW SMI  Y +++   +   +F E    S E PN Y  A V++ACA L 
Sbjct: 275 IFDKIVEKDVVSWTSMIDRYFKSSRWREGFSLFSELVG-SCERPNEYTFAGVLNACADLT 334

Query: 243 RLVSGKVVHGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSIIV 302
               GK VHG   R+G D     +S LVDMY KCG++E +  V +    P ++ +TS+I 
Sbjct: 335 TEELGKQVHGYMTRVGFDPYSFASSSLVDMYTKCGNIESAKHVVDGCPKPDLVSWTSLIG 394

Query: 303 STAKYGFGRKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGIM 362
             A+ G   ++L+ F+ +++ G KP+H+TFV VL AC+H+GL  +GL++  S+ E++ + 
Sbjct: 395 GCAQNGQPDEALKYFDLLLKSGTKPDHVTFVNVLSACTHAGLVEKGLEFFYSITEKHRLS 454

Query: 363 PETKHYTCVVDMLARAGQLDKAYQLAKSMEVASDDKALLWGALLSASRCHGRVDIAAKAC 422
             + HYTC+VD+LAR+G+ ++   +   M +       LW ++L     +G +D+A +A 
Sbjct: 455 HTSDHYTCLVDLLARSGRFEQLKSVISEMPM--KPSKFLWASVLGGCSTYGNIDLAEEAA 514

Query: 423 QQLVNSNRQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSYVF 482
           Q+L     +    YVT++N YA+AG  E+  ++R  M+  GV K PG SW EIK   +VF
Sbjct: 515 QELFKIEPENPVTYVTMANIYAAAGKWEEEGKMRKRMQEIGVTKRPGSSWTEIKRKRHVF 574

Query: 483 YAGDITSCPRGDEVLRLLRELDRKMKERGYVRGSKGLVFVDIEEEAEEEKVWLHSERLAL 542
            A D TS P  ++++  LREL +KMKE GYV  +  LV  D+E+E +EE +  HSE+LA+
Sbjct: 575 IAAD-TSHPMYNQIVEFLRELRKKMKEEGYVPAT-SLVLHDVEDEQKEENLVYHSEKLAV 634

Query: 543 GFGLISISKGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTCND 602
            F ++S  +G  I++ KNLR C DCH A K IS I +R+  VRD  RFH F++G C+C D
Sbjct: 635 AFAILSTEEGTAIKVFKNLRSCVDCHGAIKFISNITKRKITVRDSTRFHCFENGQCSCGD 691

Query: 603 FW 604
           +W
Sbjct: 695 YW 691

BLAST of HG10010338 vs. TAIR 10
Match: AT3G46790.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 404.4 bits (1038), Expect = 1.6e-112
Identity = 223/594 (37.54%), Postives = 342/594 (57.58%), Query Frame = 0

Query: 15  LSRENERLSFQTIQNLSRHLRNCNDLVSSISTHSIALKLGFLNHTVAVNHLINCYVRFRN 74
           LS+E+   S QT + L     + + L  ++  H   L  G          LI  Y    +
Sbjct: 69  LSQESSP-SQQTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGMYSDLGS 128

Query: 75  VATAHRLFDEMPNPNVVSWTSLMAGYIDNGRPNTALLLFRAMSRSSVVPNDFTFATAIKA 134
           V  A ++FD+     +  W +L       G     L L+  M+R  V  + FT+   +KA
Sbjct: 129 VDYARKVFDKTRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTYVLKA 188

Query: 135 C----SILSNLRHGEKFHAYVEIFGYGGNIVVCSSLIAMYGKCNDVVKARSVFNSMSCKN 194
           C      +++L  G++ HA++   GY  ++ + ++L+ MY +   V  A  VF  M  +N
Sbjct: 189 CVASECTVNHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRN 248

Query: 195 IVSWNSMIAAYAQNAHGDDALKVFRE-FSSLSSEHPNPYMLASVISACASLGRLVSGKVV 254
           +VSW++MIA YA+N    +AL+ FRE         PN   + SV+ ACASL  L  GK++
Sbjct: 249 VVSWSAMIACYAKNGKAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLI 308

Query: 255 HGAAIRLGCDSSDVVASVLVDMYAKCGSLEYSDKVFNRISNPSVIPYTSIIVSTAKYGFG 314
           HG  +R G DS   V S LV MY +CG LE   +VF+R+ +  V+ + S+I S   +G+G
Sbjct: 309 HGYILRRGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYG 368

Query: 315 RKSLQLFEEMVRKGLKPNHITFVGVLHACSHSGLPNEGLDYLTSMYERYGIMPETKHYTC 374
           +K++Q+FEEM+  G  P  +TFV VL ACSH GL  EG     +M+  +GI P+ +HY C
Sbjct: 369 KKAIQIFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYAC 428

Query: 375 VVDMLARAGQLDKAYQLAKSMEVASDDKALLWGALLSASRCHGRVDIAAKACQQLVNSNR 434
           +VD+L RA +LD+A ++ + M      K  +WG+LL + R HG V++A +A ++L     
Sbjct: 429 MVDLLGRANRLDEAAKMVQDMRTEPGPK--VWGSLLGSCRIHGNVELAERASRRLFALEP 488

Query: 435 QVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGVQKEPGCSWIEIKDSSYVFYAGDITSC 494
           + AG YV L++ YA A   ++  R++  ++H G+QK PG  W+E++   Y F + D  + 
Sbjct: 489 KNAGNYVLLADIYAEAQMWDEVKRVKKLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFN- 548

Query: 495 PRGDEVLRLLRELDRKMKERGYVRGSKGLVFVDIEEEAEEEKVWLHSERLALGFGLISIS 554
           P  +++   L +L   MKE+GY+  +KG+++ ++E E +E  V  HSE+LAL FGLI+ S
Sbjct: 549 PLMEQIHAFLVKLAEDMKEKGYIPQTKGVLY-ELETEEKERIVLGHSEKLALAFGLINTS 608

Query: 555 KGLTIRIMKNLRMCSDCHEAFKLISEIVEREFVVRDINRFHHFKSGCCTCNDFW 604
           KG  IRI KNLR+C DCH   K IS+ +E+E +VRD+NRFH FK+G C+C D+W
Sbjct: 609 KGEPIRITKNLRLCEDCHLFTKFISKFMEKEILVRDVNRFHRFKNGVCSCGDYW 657

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038875319.10.0e+0091.54pentatricopeptide repeat-containing protein At4g15720 isoform X1 [Benincasa hisp... [more]
XP_004143199.10.0e+0090.71pentatricopeptide repeat-containing protein At4g15720 [Cucumis sativus] >KGN4719... [more]
XP_008458473.10.0e+0090.55PREDICTED: pentatricopeptide repeat-containing protein At4g15720 [Cucumis melo][more]
XP_022138437.14.9e-29483.25pentatricopeptide repeat-containing protein At4g15720 [Momordica charantia][more]
KAG7026300.12.7e-29283.61Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
Match NameE-valueIdentityDescription
Q8VYH03.3e-18457.32Pentatricopeptide repeat-containing protein At4g15720 OS=Arabidopsis thaliana OX... [more]
Q9LW631.0e-11338.81Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th... [more]
O232668.9e-11336.62Pentatricopeptide repeat-containing protein At4g14050, mitochondrial OS=Arabidop... [more]
O231691.2e-11238.56Pentatricopeptide repeat-containing protein At4g37170 OS=Arabidopsis thaliana OX... [more]
Q9STF32.2e-11137.54Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0KEC70.0e+0090.71DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G1972... [more]
A0A1S3C8I50.0e+0090.55pentatricopeptide repeat-containing protein At4g15720 OS=Cucumis melo OX=3656 GN... [more]
A0A6J1CB362.4e-29483.25pentatricopeptide repeat-containing protein At4g15720 OS=Momordica charantia OX=... [more]
A0A6J1ER032.2e-29283.28pentatricopeptide repeat-containing protein At4g15720 OS=Cucurbita moschata OX=3... [more]
A0A6J1KGB21.0e-28982.78pentatricopeptide repeat-containing protein At4g15720 OS=Cucurbita maxima OX=366... [more]
Match NameE-valueIdentityDescription
AT4G15720.12.3e-18557.32Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G23330.17.4e-11538.81Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G14050.16.3e-11436.62Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G37170.18.2e-11438.56Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G46790.11.6e-11237.54Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 62..85
e-value: 0.0016
score: 16.5
coord: 161..192
e-value: 5.5E-5
score: 21.1
coord: 91..124
e-value: 4.0E-6
score: 24.6
coord: 192..220
e-value: 2.0E-4
score: 19.3
coord: 296..327
e-value: 4.9E-4
score: 18.1
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 88..135
e-value: 2.7E-11
score: 43.5
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 291..338
e-value: 0.0012
score: 18.9
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 60..86
e-value: 0.0075
score: 16.4
coord: 366..391
e-value: 0.082
score: 13.2
coord: 435..462
e-value: 0.0064
score: 16.7
coord: 192..217
e-value: 6.3E-5
score: 23.0
coord: 161..191
e-value: 0.0013
score: 18.9
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 292..326
score: 10.062531
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 89..123
score: 11.202506
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 190..224
score: 9.29524
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 8..139
e-value: 2.6E-22
score: 81.1
coord: 140..251
e-value: 6.2E-19
score: 70.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 266..437
e-value: 1.8E-31
score: 111.7
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 25..454
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 467..593
e-value: 2.0E-30
score: 105.3
NoneNo IPR availablePANTHERPTHR47926:SF252OS12G0163600 PROTEINcoord: 28..582
NoneNo IPR availablePANTHERPTHR47926PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 28..582

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10010338.1HG10010338.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding