HG10000673 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10000673
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr09: 7922407 .. 7924509 (-)
RNA-Seq ExpressionHG10000673
SyntenyHG10000673
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACGGAAGCGCTTTCTTTAGCAGCGCTTGTCCAAAAATGCACTTCCGTAGTCTCACTGAAGGCGGCGCGTCAGCTCCACGCTCTGATCCTCACCTCCATAGCCACCGCCTCGTCTCTATCTCCTTACATTTGCAACAACATCTTATCCATGTACGCACGATGCGGTACAATTTGGGAGTCACAGCAAGTGTTCGACAAAATGCCGCAAAGAAATCTCGTTTCGTTTAACGCTCTAATGGCAGCCTATTCTCGGTCTCACGATCACGCTCCATTGGCTTTCAACCTGCTTTCGCAAATGGAGCTTGAATTCCTCAGACCTAATTGTTCCACTCTAACGAGCTTATTGCAGGCAGCTTCGTCCATTGAAGATCGATTTTGGGGCTCTTTGATTCATACGCAAGTAGTAAAACGTGGATTCGTTAATGATGTTCGTGTTCAAACCGCGTTAATTGGGACTTACTCACATTGCTTGGATTTGGAATCTGCTGGAAAAGTTTTCCGGTGGACGATCGGTAAGGATGTAGTGGCTTGGAATTCGATGATCTTTGGAAACTTGAAGCACGATAAATTAAACGAGGCGCTTCGTTTGTTCAATGAAATGCTGGGAATTGGTCTGATTCCTACTCAATTCACCTATGCAATGATTTTGAATATATGTTGTAGAAATGGAGACTACCTATTTGGTCGGCTCGTCCATGGCCGTATCATCACCTCAAACGCCATTATCGATAGAACTCTGCAAAATGTTCTGTTGGATTTGTACTGCATTTGTGGGGATACCCATACAGCATTTTGGATCTTTAACAGAATTGAAAACGCAGATTTGGTTGCTTGGAACACAATCATCTCAGGATGTTCTCAGAATGAAGAAGAAGAGAAGGCCATGAAGCTATTTCAACAATTGCTGAAATCATCACTTCCCAAACCAGATGATTATACATATGCAGCTGTAATTTCCACCATTGACAACCTTTTGAGTGGAATGTCTTTTCTTGCTCAAGTTATAAAGGATGGATTTGAGGGCAGTGTCTTTGTAAGCAGTGTCATTGTGTCTATGTTATTTAGAAATGGTGAATCTCAAGCTGCTGAGAGGGTTTTTGTGTCTGTTGTAGAGAAGGATGTTGTTCTATGGACAGAAATGATCAGTGGGTATTTAAGAATTGGTGAAGGGGAAAAGGCAATCAGATGCTTTCATCAAATGCATCAGAATGGCCATGAACTTGACAGTTTTTCACTCAGTTTGGCTTTGAGTTCATGTGCTGATCTTGCCACTCTAAAACAGGGGGAGATTTTTCATTCTCTGGCCATAAAAACAGGGTGTGAAGCTCAAATCTATGTCTTAGGGAGTCTAATCGACATGTATGCCAAAAATGGCGACTTGGGTTCTGCTATATTGATATTTTCTCAAGTCCCATGTCCAGATTTGAAGTGTTGGAACTCCATGCTTGGTGGGTACAGCCACCATGGGAATATGGAACAGGCATTGAAGCTCTTCTTTGAGCTTCAAAACCATGGTGTCAAGCCAGATCAAGTAACATTTCTTTCCTTACTTTCTGCTTGTAACCACAGCAACTCAGTTGAAATAGGGCAGTTTTTATGGAACTATATGAAAGAAAGTAATATTATACCAAATTCTAAGCACTATTCTTGTATGGTAAGCTTGTTAAGTGGAGCTGGGTTGATGGATGAAGCAGAGGAAATGATTACCAAATCACCATTTGCAAATAATGATCCAGAGCTATGGAGAACTCTGTTAAGTTCATGTGTTGCTAGGAAAAACCTGAGAGTGGGAGTTAATGCAGCAAAACAAGTGTTGAGATTAGATTCAGAAGACAGTGCAGCACATATTTTGCTATCAAATCTGTATGCTGCAGCAGGAAAATGGGATGGTGTTGTAGAAATGAGGAGAAGAATAAGAGAGACAATGGTGGGAAAGGATCCTGGACTGAGTTGGATTGAGGCCAAAACACATATCCAAGCATTTTCTTCTGGTTTACAGTCACATCCAGAGGTTGATGAAGCACTAACTACACTGCTTAGGCTGAGAGGAAACATGAGTGAAGAAATGGATGAATCAAGTGGAATAGGAGGTAAAGACTAA

mRNA sequence

ATGACGGAAGCGCTTTCTTTAGCAGCGCTTGTCCAAAAATGCACTTCCGTAGTCTCACTGAAGGCGGCGCGTCAGCTCCACGCTCTGATCCTCACCTCCATAGCCACCGCCTCGTCTCTATCTCCTTACATTTGCAACAACATCTTATCCATGTACGCACGATGCGGTACAATTTGGGAGTCACAGCAAGTGTTCGACAAAATGCCGCAAAGAAATCTCGTTTCGTTTAACGCTCTAATGGCAGCCTATTCTCGGTCTCACGATCACGCTCCATTGGCTTTCAACCTGCTTTCGCAAATGGAGCTTGAATTCCTCAGACCTAATTGTTCCACTCTAACGAGCTTATTGCAGGCAGCTTCGTCCATTGAAGATCGATTTTGGGGCTCTTTGATTCATACGCAAGTAGTAAAACGTGGATTCGTTAATGATGTTCGTGTTCAAACCGCGTTAATTGGGACTTACTCACATTGCTTGGATTTGGAATCTGCTGGAAAAGTTTTCCGGTGGACGATCGGTAAGGATGTAGTGGCTTGGAATTCGATGATCTTTGGAAACTTGAAGCACGATAAATTAAACGAGGCGCTTCGTTTGTTCAATGAAATGCTGGGAATTGGTCTGATTCCTACTCAATTCACCTATGCAATGATTTTGAATATATGTTGTAGAAATGGAGACTACCTATTTGGTCGGCTCGTCCATGGCCGTATCATCACCTCAAACGCCATTATCGATAGAACTCTGCAAAATGTTCTGTTGGATTTGTACTGCATTTGTGGGGATACCCATACAGCATTTTGGATCTTTAACAGAATTGAAAACGCAGATTTGGTTGCTTGGAACACAATCATCTCAGGATGTTCTCAGAATGAAGAAGAAGAGAAGGCCATGAAGCTATTTCAACAATTGCTGAAATCATCACTTCCCAAACCAGATGATTATACATATGCAGCTGTAATTTCCACCATTGACAACCTTTTGAGTGGAATGTCTTTTCTTGCTCAAGTTATAAAGGATGGATTTGAGGGCAGTGTCTTTGTAAGCAGTGTCATTGTGTCTATGTTATTTAGAAATGGTGAATCTCAAGCTGCTGAGAGGGTTTTTGTGTCTGTTGTAGAGAAGGATGTTGTTCTATGGACAGAAATGATCAGTGGGTATTTAAGAATTGGTGAAGGGGAAAAGGCAATCAGATGCTTTCATCAAATGCATCAGAATGGCCATGAACTTGACAGTTTTTCACTCAGTTTGGCTTTGAGTTCATGTGCTGATCTTGCCACTCTAAAACAGGGGGAGATTTTTCATTCTCTGGCCATAAAAACAGGGTGTGAAGCTCAAATCTATGTCTTAGGGAGTCTAATCGACATGTATGCCAAAAATGGCGACTTGGGTTCTGCTATATTGATATTTTCTCAAGTCCCATGTCCAGATTTGAAGTGTTGGAACTCCATGCTTGGTGGCTTGTTAAGTGGAGCTGGGTTGATGGATGAAGCAGAGGAAATGATTACCAAATCACCATTTGCAAATAATGATCCAGAGCTATGGAGAACTCTGTTAAGTTCATGTGTTGCTAGGAAAAACCTGAGAGTGGGAGTTAATGCAGCAAAACAAGTGTTGAGATTAGATTCAGAAGACAGTGCAGCACATATTTTGCTATCAAATCTGTATGCTGCAGCAGGAAAATGGGATGGTGTTGTAGAAATGAGGAGAAGAATAAGAGAGACAATGGTGGGAAAGGATCCTGGACTGAGTTGGATTGAGGCCAAAACACATATCCAAGCATTTTCTTCTGGTTTACAGTCACATCCAGAGGTTGATGAAGCACTAACTACACTGCTTAGGCTGAGAGGAAACATGAGTGAAGAAATGGATGAATCAAGTGGAATAGGAGGTAAAGACTAA

Coding sequence (CDS)

ATGACGGAAGCGCTTTCTTTAGCAGCGCTTGTCCAAAAATGCACTTCCGTAGTCTCACTGAAGGCGGCGCGTCAGCTCCACGCTCTGATCCTCACCTCCATAGCCACCGCCTCGTCTCTATCTCCTTACATTTGCAACAACATCTTATCCATGTACGCACGATGCGGTACAATTTGGGAGTCACAGCAAGTGTTCGACAAAATGCCGCAAAGAAATCTCGTTTCGTTTAACGCTCTAATGGCAGCCTATTCTCGGTCTCACGATCACGCTCCATTGGCTTTCAACCTGCTTTCGCAAATGGAGCTTGAATTCCTCAGACCTAATTGTTCCACTCTAACGAGCTTATTGCAGGCAGCTTCGTCCATTGAAGATCGATTTTGGGGCTCTTTGATTCATACGCAAGTAGTAAAACGTGGATTCGTTAATGATGTTCGTGTTCAAACCGCGTTAATTGGGACTTACTCACATTGCTTGGATTTGGAATCTGCTGGAAAAGTTTTCCGGTGGACGATCGGTAAGGATGTAGTGGCTTGGAATTCGATGATCTTTGGAAACTTGAAGCACGATAAATTAAACGAGGCGCTTCGTTTGTTCAATGAAATGCTGGGAATTGGTCTGATTCCTACTCAATTCACCTATGCAATGATTTTGAATATATGTTGTAGAAATGGAGACTACCTATTTGGTCGGCTCGTCCATGGCCGTATCATCACCTCAAACGCCATTATCGATAGAACTCTGCAAAATGTTCTGTTGGATTTGTACTGCATTTGTGGGGATACCCATACAGCATTTTGGATCTTTAACAGAATTGAAAACGCAGATTTGGTTGCTTGGAACACAATCATCTCAGGATGTTCTCAGAATGAAGAAGAAGAGAAGGCCATGAAGCTATTTCAACAATTGCTGAAATCATCACTTCCCAAACCAGATGATTATACATATGCAGCTGTAATTTCCACCATTGACAACCTTTTGAGTGGAATGTCTTTTCTTGCTCAAGTTATAAAGGATGGATTTGAGGGCAGTGTCTTTGTAAGCAGTGTCATTGTGTCTATGTTATTTAGAAATGGTGAATCTCAAGCTGCTGAGAGGGTTTTTGTGTCTGTTGTAGAGAAGGATGTTGTTCTATGGACAGAAATGATCAGTGGGTATTTAAGAATTGGTGAAGGGGAAAAGGCAATCAGATGCTTTCATCAAATGCATCAGAATGGCCATGAACTTGACAGTTTTTCACTCAGTTTGGCTTTGAGTTCATGTGCTGATCTTGCCACTCTAAAACAGGGGGAGATTTTTCATTCTCTGGCCATAAAAACAGGGTGTGAAGCTCAAATCTATGTCTTAGGGAGTCTAATCGACATGTATGCCAAAAATGGCGACTTGGGTTCTGCTATATTGATATTTTCTCAAGTCCCATGTCCAGATTTGAAGTGTTGGAACTCCATGCTTGGTGGCTTGTTAAGTGGAGCTGGGTTGATGGATGAAGCAGAGGAAATGATTACCAAATCACCATTTGCAAATAATGATCCAGAGCTATGGAGAACTCTGTTAAGTTCATGTGTTGCTAGGAAAAACCTGAGAGTGGGAGTTAATGCAGCAAAACAAGTGTTGAGATTAGATTCAGAAGACAGTGCAGCACATATTTTGCTATCAAATCTGTATGCTGCAGCAGGAAAATGGGATGGTGTTGTAGAAATGAGGAGAAGAATAAGAGAGACAATGGTGGGAAAGGATCCTGGACTGAGTTGGATTGAGGCCAAAACACATATCCAAGCATTTTCTTCTGGTTTACAGTCACATCCAGAGGTTGATGAAGCACTAACTACACTGCTTAGGCTGAGAGGAAACATGAGTGAAGAAATGGATGAATCAAGTGGAATAGGAGGTAAAGACTAA

Protein sequence

MTEALSLAALVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWESQQVFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAASSIEDRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNSMIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSNAIIDRTLQNVLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEEKAMKLFQQLLKSSLPKPDDYTYAAVISTIDNLLSGMSFLAQVIKDGFEGSVFVSSVIVSMLFRNGESQAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLALSSCADLATLKQGEIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPCPDLKCWNSMLGGLLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLRVGVNAAKQVLRLDSEDSAAHILLSNLYAAAGKWDGVVEMRRRIRETMVGKDPGLSWIEAKTHIQAFSSGLQSHPEVDEALTTLLRLRGNMSEEMDESSGIGGKD
Homology
BLAST of HG10000673 vs. NCBI nr
Match: XP_038901658.1 (pentatricopeptide repeat-containing protein At3g50420 [Benincasa hispida])

HSP 1 Score: 1122.5 bits (2902), Expect = 0.0e+00
Identity = 583/698 (83.52%), Postives = 599/698 (85.82%), Query Frame = 0

Query: 1   MTEALSLAALVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWE 60
           MTEALSLAALVQKCTSV SLKAARQLH LILTSIATASSLSPY+CNNILSMYARCG IWE
Sbjct: 1   MTEALSLAALVQKCTSVASLKAARQLHGLILTSIATASSLSPYVCNNILSMYARCGAIWE 60

Query: 61  SQQVFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAAS 120
           SQQVFDKMPQRNLVSFNAL+AAYSRSHD AP AFNLL++MELEFLRPN STLTSLLQAA 
Sbjct: 61  SQQVFDKMPQRNLVSFNALIAAYSRSHDRAPSAFNLLAKMELEFLRPNGSTLTSLLQAAY 120

Query: 121 SIEDRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNS 180
           SIEDRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHC DLESAGKVFRWTI KDVVAWNS
Sbjct: 121 SIEDRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHCSDLESAGKVFRWTIDKDVVAWNS 180

Query: 181 MIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSN 240
           MIFGNLKHDKLNEAL LFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRL+HGR+ITSN
Sbjct: 181 MIFGNLKHDKLNEALHLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLIHGRMITSN 240

Query: 241 AIIDRTLQNVLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEE-KAMKLF 300
           AI DRTLQNVLLDLYC CGDTHT F IFNRIEN DL+ WNTIISGCSQNEEEE KAMKLF
Sbjct: 241 AITDRTLQNVLLDLYCSCGDTHTGFCIFNRIENPDLITWNTIISGCSQNEEEEDKAMKLF 300

Query: 301 QQLLKSSLPKPDDYTYAAVISTIDNLLSGMSFLAQVIKDGFEGSVFVSSVIVSMLFRNGE 360
           QQL KSSLPKPDDYTYAAVISTIDNLLSGMSFLAQVIKDGFE SVFVSSVIVSM FRNGE
Sbjct: 301 QQLQKSSLPKPDDYTYAAVISTIDNLLSGMSFLAQVIKDGFESSVFVSSVIVSMFFRNGE 360

Query: 361 SQAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLALSS 420
           SQAAERVFVSV EKDVVLWTEMISGY RIGEGE AI+CFHQMHQNGHELDSFS+SLALSS
Sbjct: 361 SQAAERVFVSVAEKDVVLWTEMISGYSRIGEGENAIKCFHQMHQNGHELDSFSISLALSS 420

Query: 421 CADLATLKQGEIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPCPDLKCW 480
           CADLATLKQGEIFHSLAIKTGCEA+IYVLGSLI+MYAKNGDLGSA LIFSQVPCPDLKCW
Sbjct: 421 CADLATLKQGEIFHSLAIKTGCEAEIYVLGSLINMYAKNGDLGSAKLIFSQVPCPDLKCW 480

Query: 481 NSMLGG------------------------------------------------------ 540
           NSMLGG                                                      
Sbjct: 481 NSMLGGYSHHGQMEQALKLFFELQNHGVKPDQITFLSLFSACNHSNSVEIGQFLWNYMKE 540

Query: 541 ---------------LLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLRVGV 600
                          LLSGAGL+DEAEEMITKSPFANNDPELWRTLLSSCV ++NLRVG 
Sbjct: 541 SNIIPNSKHYSCMVSLLSGAGLIDEAEEMITKSPFANNDPELWRTLLSSCVLKRNLRVGA 600

Query: 601 NAAKQVLRLDSEDSAAHILLSNLYAAAGKWDGVVEMRRRIRETMVGKDPGLSWIEAKTHI 629
           NAAKQVLRLDSEDSAAHILLSNLYAA GKWDGVVEMRRRIRET+VGKDPGLSWIEAKTHI
Sbjct: 601 NAAKQVLRLDSEDSAAHILLSNLYAAVGKWDGVVEMRRRIRETIVGKDPGLSWIEAKTHI 660

BLAST of HG10000673 vs. NCBI nr
Match: XP_008457819.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g50420 [Cucumis melo] >KAA0045808.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYJ99474.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1105.5 bits (2858), Expect = 0.0e+00
Identity = 570/696 (81.90%), Postives = 597/696 (85.78%), Query Frame = 0

Query: 1   MTEALSLAALVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWE 60
           MTEALSL+ALVQKCTSV SLKAARQLHALILTSIATASSLSPY+CNNILSMYARCG IWE
Sbjct: 1   MTEALSLSALVQKCTSVASLKAARQLHALILTSIATASSLSPYVCNNILSMYARCGAIWE 60

Query: 61  SQQVFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAAS 120
           SQQVF+KMPQRNLVSFNAL+AAYSRSH HAPLAFNLLSQMELEFLRPN  T+TSLLQAAS
Sbjct: 61  SQQVFEKMPQRNLVSFNALIAAYSRSHGHAPLAFNLLSQMELEFLRPNSFTITSLLQAAS 120

Query: 121 SIEDRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNS 180
           S+ED+FW SLIHTQVVK GFV+DVRVQTALIGTYS+CLDLESAGKVFR TI KDVV WN+
Sbjct: 121 SLEDQFWSSLIHTQVVKCGFVHDVRVQTALIGTYSYCLDLESAGKVFRCTIDKDVVTWNT 180

Query: 181 MIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSN 240
           MIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRL+HGRIITSN
Sbjct: 181 MIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLIHGRIITSN 240

Query: 241 AIIDRTLQNVLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEEKAMKLFQ 300
           AIIDRTLQNVLLDLYC CGD HTAF IFN  EN DLVAWNTIISGCS+NEE EKAMKLFQ
Sbjct: 241 AIIDRTLQNVLLDLYCNCGDIHTAFCIFNSNENPDLVAWNTIISGCSENEEGEKAMKLFQ 300

Query: 301 QLLKSSLPKPDDYTYAAVISTIDNLLSGMSFLAQVIKDGFEGSVFVSSVIVSMLFRNGES 360
           QL KSSL KPDDYTYAAVISTIDNLLSGMSF+AQVIKDGFEGSVF+SSVIVSMLFRNGES
Sbjct: 301 QLKKSSLTKPDDYTYAAVISTIDNLLSGMSFIAQVIKDGFEGSVFISSVIVSMLFRNGES 360

Query: 361 QAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLALSSC 420
           QAA RVFV+V EKDVVLWTEMISGY RIGEGEKAI+CFHQM QNGHELDSFSLSL LSSC
Sbjct: 361 QAAARVFVTVAEKDVVLWTEMISGYSRIGEGEKAIKCFHQMRQNGHELDSFSLSLVLSSC 420

Query: 421 ADLATLKQGEIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPCPDLKCWN 480
           ADLATLKQGEIFHSLA+KTGCEA+IYVLGSLI+MYAKNGDLGSA LIFSQVPCPDLKCWN
Sbjct: 421 ADLATLKQGEIFHSLALKTGCEAEIYVLGSLINMYAKNGDLGSAQLIFSQVPCPDLKCWN 480

Query: 481 SMLGG------------------------------------------------------- 540
           SMLGG                                                       
Sbjct: 481 SMLGGYSHHGNMEQALNLFFNLRNNGVKPDQVTFLSLLSACNHSNSVEIGQFLWNYMKEC 540

Query: 541 --------------LLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLRVGVN 600
                         LLSGAG +DEAEEMITKSPFANNDPELWRTLLSSCV +KNLRVGVN
Sbjct: 541 DIIPNSKHYSCMVSLLSGAGFIDEAEEMITKSPFANNDPELWRTLLSSCVVKKNLRVGVN 600

Query: 601 AAKQVLRLDSEDSAAHILLSNLYAAAGKWDGVVEMRRRIRETMVGKDPGLSWIEAKTHIQ 628
           AAKQVLR+D EDSAAHILLSNLYAAAGKWDGV+EMRRRIRE +VGKDPG+SWIEAKT IQ
Sbjct: 601 AAKQVLRIDPEDSAAHILLSNLYAAAGKWDGVIEMRRRIREKLVGKDPGVSWIEAKTKIQ 660

BLAST of HG10000673 vs. NCBI nr
Match: KGN62097.2 (hypothetical protein Csa_006310 [Cucumis sativus])

HSP 1 Score: 1102.8 bits (2851), Expect = 0.0e+00
Identity = 569/696 (81.75%), Postives = 594/696 (85.34%), Query Frame = 0

Query: 1   MTEALSLAALVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWE 60
           MTEALSL+ALVQKC SV SLKAARQLHALILTSIATASSLSPY+CNNILSMYARCG IWE
Sbjct: 1   MTEALSLSALVQKCISVTSLKAARQLHALILTSIATASSLSPYVCNNILSMYARCGAIWE 60

Query: 61  SQQVFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAAS 120
           SQ+VF+KMPQRNLVSFNAL+AAYSRSH HAPLAFNLLSQMELEFL+PN  T+TSLLQAAS
Sbjct: 61  SQKVFEKMPQRNLVSFNALIAAYSRSHGHAPLAFNLLSQMELEFLKPNSFTITSLLQAAS 120

Query: 121 SIEDRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNS 180
             ED FW SLIH QVVK GFV+DVRVQTALIGTYSHCLDLESAGKVFRWTI KDVV WN+
Sbjct: 121 FTEDPFWSSLIHAQVVKCGFVHDVRVQTALIGTYSHCLDLESAGKVFRWTIDKDVVTWNT 180

Query: 181 MIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSN 240
           MIFGNLKHDKLNEALRLFN+MLGIGLIPTQFTYAMILNICCRNGDYLFGRL+HGRIITSN
Sbjct: 181 MIFGNLKHDKLNEALRLFNQMLGIGLIPTQFTYAMILNICCRNGDYLFGRLIHGRIITSN 240

Query: 241 AIIDRTLQNVLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEEKAMKLFQ 300
           AIIDRTLQNVLLDLYC CGD HTAF IFNR EN DLVAWNTIISGCS+NEE+EKAMKLFQ
Sbjct: 241 AIIDRTLQNVLLDLYCNCGDIHTAFCIFNRNENPDLVAWNTIISGCSENEEDEKAMKLFQ 300

Query: 301 QLLKSSLPKPDDYTYAAVISTIDNLLSGMSFLAQVIKDGFEGSVFVSSVIVSMLFRNGES 360
           QL KSSL KPDDYTYAAVISTIDNLLSGMSF+AQVIKDGFEGSVF+SSVIVSMLFRNGES
Sbjct: 301 QLKKSSLTKPDDYTYAAVISTIDNLLSGMSFIAQVIKDGFEGSVFISSVIVSMLFRNGES 360

Query: 361 QAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLALSSC 420
           QAA RVFV+V  KDVVLWTEMISGY RIGEGEKAI+CFHQMHQNGHELDSFSLSLALSSC
Sbjct: 361 QAAARVFVTVAVKDVVLWTEMISGYSRIGEGEKAIKCFHQMHQNGHELDSFSLSLALSSC 420

Query: 421 ADLATLKQGEIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPCPDLKCWN 480
           ADLATLKQGEIFHSLAIKTG EA+IYVLGSLI+MYAKNGDLGSA LIFSQVPCPDLKCWN
Sbjct: 421 ADLATLKQGEIFHSLAIKTGSEAEIYVLGSLINMYAKNGDLGSAQLIFSQVPCPDLKCWN 480

Query: 481 SMLGG------------------------------------------------------- 540
           SMLGG                                                       
Sbjct: 481 SMLGGYSHHGNMEQALNLFFNLQNNGVKPDQVTFLSLLSACNHSNSVEIGQFLWNYMKEC 540

Query: 541 --------------LLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLRVGVN 600
                         LLSGAG MDEAEEMITKSPFANNDPELWRTLLSSCV +KNLRVGVN
Sbjct: 541 NIIPNSKHYSCMVSLLSGAGFMDEAEEMITKSPFANNDPELWRTLLSSCVVKKNLRVGVN 600

Query: 601 AAKQVLRLDSEDSAAHILLSNLYAAAGKWDGVVEMRRRIRETMVGKDPGLSWIEAKTHIQ 628
           AAKQVLR+D EDSAAHILLSNLYAAAGKWDGVVEMRRRIRE MVGKDPG+SWIEAK+ IQ
Sbjct: 601 AAKQVLRIDPEDSAAHILLSNLYAAAGKWDGVVEMRRRIREKMVGKDPGVSWIEAKSKIQ 660

BLAST of HG10000673 vs. NCBI nr
Match: XP_004148057.2 (pentatricopeptide repeat-containing protein At3g50420 isoform X1 [Cucumis sativus])

HSP 1 Score: 1102.8 bits (2851), Expect = 0.0e+00
Identity = 569/696 (81.75%), Postives = 594/696 (85.34%), Query Frame = 0

Query: 1   MTEALSLAALVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWE 60
           MTEALSL+ALVQKC SV SLKAARQLHALILTSIATASSLSPY+CNNILSMYARCG IWE
Sbjct: 37  MTEALSLSALVQKCISVTSLKAARQLHALILTSIATASSLSPYVCNNILSMYARCGAIWE 96

Query: 61  SQQVFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAAS 120
           SQ+VF+KMPQRNLVSFNAL+AAYSRSH HAPLAFNLLSQMELEFL+PN  T+TSLLQAAS
Sbjct: 97  SQKVFEKMPQRNLVSFNALIAAYSRSHGHAPLAFNLLSQMELEFLKPNSFTITSLLQAAS 156

Query: 121 SIEDRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNS 180
             ED FW SLIH QVVK GFV+DVRVQTALIGTYSHCLDLESAGKVFRWTI KDVV WN+
Sbjct: 157 FTEDPFWSSLIHAQVVKCGFVHDVRVQTALIGTYSHCLDLESAGKVFRWTIDKDVVTWNT 216

Query: 181 MIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSN 240
           MIFGNLKHDKLNEALRLFN+MLGIGLIPTQFTYAMILNICCRNGDYLFGRL+HGRIITSN
Sbjct: 217 MIFGNLKHDKLNEALRLFNQMLGIGLIPTQFTYAMILNICCRNGDYLFGRLIHGRIITSN 276

Query: 241 AIIDRTLQNVLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEEKAMKLFQ 300
           AIIDRTLQNVLLDLYC CGD HTAF IFNR EN DLVAWNTIISGCS+NEE+EKAMKLFQ
Sbjct: 277 AIIDRTLQNVLLDLYCNCGDIHTAFCIFNRNENPDLVAWNTIISGCSENEEDEKAMKLFQ 336

Query: 301 QLLKSSLPKPDDYTYAAVISTIDNLLSGMSFLAQVIKDGFEGSVFVSSVIVSMLFRNGES 360
           QL KSSL KPDDYTYAAVISTIDNLLSGMSF+AQVIKDGFEGSVF+SSVIVSMLFRNGES
Sbjct: 337 QLKKSSLTKPDDYTYAAVISTIDNLLSGMSFIAQVIKDGFEGSVFISSVIVSMLFRNGES 396

Query: 361 QAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLALSSC 420
           QAA RVFV+V  KDVVLWTEMISGY RIGEGEKAI+CFHQMHQNGHELDSFSLSLALSSC
Sbjct: 397 QAAARVFVTVAVKDVVLWTEMISGYSRIGEGEKAIKCFHQMHQNGHELDSFSLSLALSSC 456

Query: 421 ADLATLKQGEIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPCPDLKCWN 480
           ADLATLKQGEIFHSLAIKTG EA+IYVLGSLI+MYAKNGDLGSA LIFSQVPCPDLKCWN
Sbjct: 457 ADLATLKQGEIFHSLAIKTGSEAEIYVLGSLINMYAKNGDLGSAQLIFSQVPCPDLKCWN 516

Query: 481 SMLGG------------------------------------------------------- 540
           SMLGG                                                       
Sbjct: 517 SMLGGYSHHGNMEQALNLFFNLQNNGVKPDQVTFLSLLSACNHSNSVEIGQFLWNYMKEC 576

Query: 541 --------------LLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLRVGVN 600
                         LLSGAG MDEAEEMITKSPFANNDPELWRTLLSSCV +KNLRVGVN
Sbjct: 577 NIIPNSKHYSCMVSLLSGAGFMDEAEEMITKSPFANNDPELWRTLLSSCVVKKNLRVGVN 636

Query: 601 AAKQVLRLDSEDSAAHILLSNLYAAAGKWDGVVEMRRRIRETMVGKDPGLSWIEAKTHIQ 628
           AAKQVLR+D EDSAAHILLSNLYAAAGKWDGVVEMRRRIRE MVGKDPG+SWIEAK+ IQ
Sbjct: 637 AAKQVLRIDPEDSAAHILLSNLYAAAGKWDGVVEMRRRIREKMVGKDPGVSWIEAKSKIQ 696

BLAST of HG10000673 vs. NCBI nr
Match: XP_023512997.1 (pentatricopeptide repeat-containing protein At3g50420 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1074.3 bits (2777), Expect = 4.8e-310
Identity = 553/697 (79.34%), Postives = 581/697 (83.36%), Query Frame = 0

Query: 1   MTEALSLAALVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWE 60
           MTEALSLAALVQKCTSV SLKAARQLHALILTS  TASSLSPYICNNI+SMYARCGTIWE
Sbjct: 1   MTEALSLAALVQKCTSVASLKAARQLHALILTSAPTASSLSPYICNNIVSMYARCGTIWE 60

Query: 61  SQQVFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAAS 120
           SQ+VFD MPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPN STLTSLLQAAS
Sbjct: 61  SQKVFDTMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNASTLTSLLQAAS 120

Query: 121 SIEDRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNS 180
           S+EDRFWGSL+HTQVVKRGFVNDV VQTALIGTYSHCLDLESAGKVFRWTI KD+VAWNS
Sbjct: 121 SMEDRFWGSLVHTQVVKRGFVNDVPVQTALIGTYSHCLDLESAGKVFRWTINKDIVAWNS 180

Query: 181 MIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSN 240
           MIFGNLK+DKLNEALRLFNEMLGIGL+PTQFTY+M+LNICCRNGD+ FGRLVHGRIITSN
Sbjct: 181 MIFGNLKNDKLNEALRLFNEMLGIGLVPTQFTYSMVLNICCRNGDWQFGRLVHGRIITSN 240

Query: 241 AIIDRTLQNVLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEEKAMKLFQ 300
           AIIDRTLQNVLLDLYC CGDTHTAFWIFNRIEN DLV WNTIISGCS+NEEEE AMKLF 
Sbjct: 241 AIIDRTLQNVLLDLYCNCGDTHTAFWIFNRIENPDLVTWNTIISGCSENEEEEMAMKLFL 300

Query: 301 QLLKSSLPKPDDYTYAAVISTIDNLLSGMSFLAQVIKDGFEGSVFVSSVIVSMLFRNGES 360
           QL KSSLPKPDDYTYAAVI+TID+  SGMSF+AQVIKDGFE SVFVSSVIVSMLFR+GES
Sbjct: 301 QLRKSSLPKPDDYTYAAVIATIDSPRSGMSFVAQVIKDGFESSVFVSSVIVSMLFRSGES 360

Query: 361 QAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLALSSC 420
           QAAERVF  +  KDVVLWTEMISGY R+GEGEK I+CFHQMHQ GHE DSFSLSLALS C
Sbjct: 361 QAAERVFGCIAMKDVVLWTEMISGYSRMGEGEKGIKCFHQMHQAGHETDSFSLSLALSLC 420

Query: 421 ADLATLKQGEIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPCPDLKCWN 480
           ADLATLKQGE FHS AIKTGCEA+ YVLGSLIDMYAKNGDL SA  +FSQ+P PDLKCWN
Sbjct: 421 ADLATLKQGETFHSQAIKTGCEAETYVLGSLIDMYAKNGDLSSAKFVFSQIPYPDLKCWN 480

Query: 481 SMLGG------------------------------------------------------- 540
           S+LGG                                                       
Sbjct: 481 SILGGYSHHGNMEEALNLFFELRDHGVEPDQVTFLSLLSACNHSSSVEIGKFLWNYMKES 540

Query: 541 --------------LLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLRVGVN 600
                         LLS AG MDEAEEMI KSPFA+ DPELWRTLLSSCVA+KNLRVGV 
Sbjct: 541 NIMPNCKHYSCMVSLLSRAGFMDEAEEMIIKSPFADGDPELWRTLLSSCVAKKNLRVGVR 600

Query: 601 AAKQVLRLDSEDSAAHILLSNLYAAAGKWDGVVEMRRRIRETMVGKDPGLSWIEAKTHIQ 629
           AAK+VL LDSEDSAAHILLSNLYAAAGKWDGV EMRRRIRE+ VGK+PGLSWIE+K HIQ
Sbjct: 601 AAKRVLGLDSEDSAAHILLSNLYAAAGKWDGVAEMRRRIRESRVGKEPGLSWIESKRHIQ 660

BLAST of HG10000673 vs. ExPASy Swiss-Prot
Match: Q9SCT2 (Pentatricopeptide repeat-containing protein At3g50420 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E85 PE=2 SV=1)

HSP 1 Score: 602.8 bits (1553), Expect = 4.4e-171
Identity = 316/688 (45.93%), Postives = 443/688 (64.39%), Query Frame = 0

Query: 4   ALSLAALVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWESQQ 63
           A S+  L +KC S+  LK ARQ+HAL+LT+ A A++ SPY  NN++SMY RCG++ ++++
Sbjct: 94  ASSVVELTRKCVSITVLKRARQIHALVLTAGAGAATESPYANNNLISMYVRCGSLEQARK 153

Query: 64  VFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAASSIE 123
           VFDKMP RN+VS+NAL +AYSR+ D A  AF L + M  E+++PN ST TSL+Q  + +E
Sbjct: 154 VFDKMPHRNVVSYNALYSAYSRNPDFASYAFPLTTHMAFEYVKPNSSTFTSLVQVCAVLE 213

Query: 124 DRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNSMIF 183
           D   GS +++Q++K G+ ++V VQT+++G YS C DLESA ++F     +D VAWN+MI 
Sbjct: 214 DVLMGSSLNSQIIKLGYSDNVVVQTSVLGMYSSCGDLESARRIFDCVNNRDAVAWNTMIV 273

Query: 184 GNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSNAII 243
           G+LK+DK+ + L  F  ML  G+ PTQFTY+++LN C + G Y  G+L+H RII S+++ 
Sbjct: 274 GSLKNDKIEDGLMFFRNMLMSGVDPTQFTYSIVLNGCSKLGSYSLGKLIHARIIVSDSLA 333

Query: 244 DRTLQNVLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEEKAMKLFQQLL 303
           D  L N LLD+YC CGD   AF++F RI N +LV+WN+IISGCS+N   E+AM ++++LL
Sbjct: 334 DLPLDNALLDMYCSCGDMREAFYVFGRIHNPNLVSWNSIISGCSENGFGEQAMLMYRRLL 393

Query: 304 KSSLPKPDDYTYAAVISTI---DNLLSGMSFLAQVIKDGFEGSVFVSSVIVSMLFRNGES 363
           + S P+PD+YT++A IS     +  + G     QV K G+E SVFV + ++SM F+N E+
Sbjct: 394 RMSTPRPDEYTFSAAISATAEPERFVHGKLLHGQVTKLGYERSVFVGTTLLSMYFKNREA 453

Query: 364 QAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLALSSC 423
           ++A++VF  + E+DVVLWTEMI G+ R+G  E A++ F +M++  +  D FSLS  + +C
Sbjct: 454 ESAQKVFDVMKERDVVLWTEMIVGHSRLGNSELAVQFFIEMYREKNRSDGFSLSSVIGAC 513

Query: 424 ADLATLKQGEIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPCPDLKCWN 483
           +D+A L+QGE+FH LAI+TG +  + V G+L+DMY KNG   +A  IFS    PDLKCWN
Sbjct: 514 SDMAMLRQGEVFHCLAIRTGFDCVMSVCGALVDMYGKNGKYETAETIFSLASNPDLKCWN 573

Query: 484 SMLG-------------------------------------------------------- 543
           SMLG                                                        
Sbjct: 574 SMLGAYSQHGMVEKALSFFEQILENGFMPDAVTYLSLLAACSHRGSTLQGKFLWNQMKEQ 633

Query: 544 -------------GLLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLRVGVN 603
                         L+S AGL+DEA E+I +SP  NN  ELWRTLLS+CV  +NL++G+ 
Sbjct: 634 GIKAGFKHYSCMVNLVSKAGLVDEALELIEQSPPGNNQAELWRTLLSACVNTRNLQIGLY 693

Query: 604 AAKQVLRLDSEDSAAHILLSNLYAAAGKWDGVVEMRRRIRETMVGKDPGLSWIEA-KTHI 618
           AA+Q+L+LD ED+A HILLSNLYA  G+W+ V EMRR+IR     KDPGLSWIE    + 
Sbjct: 694 AAEQILKLDPEDTATHILLSNLYAVNGRWEDVAEMRRKIRGLASSKDPGLSWIEVNNNNT 753

BLAST of HG10000673 vs. ExPASy Swiss-Prot
Match: Q9SVP7 (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 326.6 bits (836), Expect = 6.1e-88
Identity = 200/678 (29.50%), Postives = 333/678 (49.12%), Query Frame = 0

Query: 1   MTEALSLAALVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWE 60
           M    + ++++  C  + SL+   QLH L+L       S   Y+CN ++S+Y   G +  
Sbjct: 285 MPTPYAFSSVLSACKKIESLEIGEQLHGLVL---KLGFSSDTYVCNALVSLYFHLGNLIS 344

Query: 61  SQQVFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAAS 120
           ++ +F  M QR+ V++N L+   S+   +   A  L  +M L+ L P+ +TL SL+ A S
Sbjct: 345 AEHIFSNMSQRDAVTYNTLINGLSQC-GYGEKAMELFKRMHLDGLEPDSNTLASLVVACS 404

Query: 121 SIEDRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNS 180
           +    F G  +H    K GF ++ +++ AL+  Y+ C D+E+A   F  T  ++VV WN 
Sbjct: 405 ADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKCADIETALDYFLETEVENVVLWNV 464

Query: 181 MIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSN 240
           M+      D L  + R+F +M    ++P Q+TY  IL  C R GD   G  +H +II +N
Sbjct: 465 MLVAYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTN 524

Query: 241 AIIDRTLQNVLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEEKAMKLFQ 300
             ++  + +VL+D+Y   G   TA+ I  R    D+V+W T+I+G +Q   ++KA+  F+
Sbjct: 525 FQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQYNFDDKALTTFR 584

Query: 301 QLLKSSLPKPDDYTYAAVIST---IDNLLSGMSFLAQVIKDGFEGSVFVSSVIVSMLFRN 360
           Q+L   + + D+      +S    +  L  G    AQ    GF   +   + +V++  R 
Sbjct: 585 QMLDRGI-RSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLPFQNALVTLYSRC 644

Query: 361 GESQAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLAL 420
           G+ + +   F      D + W  ++SG+ + G  E+A+R F +M++ G + ++F+   A+
Sbjct: 645 GKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAV 704

Query: 421 SSCADLATLKQGEIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPCPDLK 480
            + ++ A +KQG+  H++  KTG +++  V  +LI MYAK G +  A   F +V   +  
Sbjct: 705 KAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYAKCGSISDAEKQFLEVSTKNEV 764

Query: 481 CWNSMLGG---------------------------------------------------- 540
            WN+++                                                      
Sbjct: 765 SWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESM 824

Query: 541 ------------------LLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLR 600
                             +L+ AGL+  A+E I + P    D  +WRTLLS+CV  KN+ 
Sbjct: 825 NSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPI-KPDALVWRTLLSACVVHKNME 884

Query: 601 VGVNAAKQVLRLDSEDSAAHILLSNLYAAAGKWDGVVEMRRRIRETMVGKDPGLSWIEAK 606
           +G  AA  +L L+ EDSA ++LLSNLYA + KWD     R++++E  V K+PG SWIE K
Sbjct: 885 IGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVK 944

BLAST of HG10000673 vs. ExPASy Swiss-Prot
Match: Q9ZUW3 (Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H60 PE=2 SV=1)

HSP 1 Score: 282.3 bits (721), Expect = 1.3e-74
Identity = 166/569 (29.17%), Postives = 311/569 (54.66%), Query Frame = 0

Query: 44  ICNNILSMYARCGTIWESQQVFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELE 103
           + N+++++Y +CG + +++ +FDK   +++V++N++++ Y+ +      A  +   M L 
Sbjct: 231 VSNSLINLYLKCGNVRKARILFDKTEVKSVVTWNSMISGYAANGLDLE-ALGMFYSMRLN 290

Query: 104 FLRPNCSTLTSLLQAASSIEDRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESA 163
           ++R + S+  S+++  +++++  +   +H  VVK GF+ D  ++TAL+  YS C  +  A
Sbjct: 291 YVRLSESSFASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSKCTAMLDA 350

Query: 164 GKVFRWTIG--KDVVAWNSMIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICC 223
            ++F+  IG   +VV+W +MI G L++D   EA+ LF+EM   G+ P +FTY++IL    
Sbjct: 351 LRLFK-EIGCVGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTYSVILTAL- 410

Query: 224 RNGDYLFGRLVHGRIITSNAIIDRTLQNVLLDLYCICGDTHTAFWIFNRIENADLVAWNT 283
                +    VH +++ +N     T+   LLD Y   G    A  +F+ I++ D+VAW+ 
Sbjct: 411 ---PVISPSEVHAQVVKTNYERSSTVGTALLDAYVKLGKVEEAAKVFSGIDDKDIVAWSA 470

Query: 284 IISGCSQNEEEEKAMKLFQQLLKSSLPKPDDYTYAAVI----STIDNLLSGMSFLAQVIK 343
           +++G +Q  E E A+K+F +L K  + KP+++T+++++    +T  ++  G  F    IK
Sbjct: 471 MLAGYAQTGETEAAIKMFGELTKGGI-KPNEFTFSSILNVCAATNASMGQGKQFHGFAIK 530

Query: 344 DGFEGSVFVSSVIVSMLFRNGESQAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRC 403
              + S+ VSS +++M  + G  ++AE VF    EKD+V W  MISGY + G+  KA+  
Sbjct: 531 SRLDSSLCVSSALLTMYAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAMKALDV 590

Query: 404 FHQMHQNGHELDSFSLSLALSSCADLATLKQGEIFHSLAIKTGCEAQIYVLGS-LIDMYA 463
           F +M +   ++D  +     ++C     +++GE +  + ++    A      S ++D+Y+
Sbjct: 591 FKEMKKRKVKMDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCMVDLYS 650

Query: 464 KNGDLGSAILIFSQVPCPDLKCWNSMLGGLLSGAGLMDEAEEMITKSPFANNDPELWRTL 523
           + G L  A+ +   +P P               AG                    +WRT+
Sbjct: 651 RAGQLEKAMKVIENMPNP---------------AG------------------STIWRTI 710

Query: 524 LSSCVARKNLRVGVNAAKQVLRLDSEDSAAHILLSNLYAAAGKWDGVVEMRRRIRETMVG 583
           L++C   K   +G  AA++++ +  EDSAA++LLSN+YA +G W    ++R+ + E  V 
Sbjct: 711 LAACRVHKKTELGRLAAEKIIAMKPEDSAAYVLLSNMYAESGDWQERAKVRKLMNERNVK 759

Query: 584 KDPGLSWIEAKTHIQAFSSGLQSHPEVDE 606
           K+PG SWIE K    +F +G +SHP  D+
Sbjct: 771 KEPGYSWIEVKNKTYSFLAGDRSHPLKDQ 759

BLAST of HG10000673 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 269.2 bits (687), Expect = 1.1e-70
Identity = 171/615 (27.80%), Postives = 309/615 (50.24%), Query Frame = 0

Query: 10  LVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWESQQVFDKMP 69
           L++ C     L+  +++H L++ S     SL  +    + +MYA+C  + E+++VFD+MP
Sbjct: 141 LLKVCGDEAELRVGKEIHGLLVKS---GFSLDLFAMTGLENMYAKCRQVNEARKVFDRMP 200

Query: 70  QRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAASSIEDRFWGS 129
           +R+LVS+N ++A YS+ +  A +A  ++  M  E L+P+  T+ S+L A S++     G 
Sbjct: 201 ERDLVSWNTIVAGYSQ-NGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGK 260

Query: 130 LIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNSMIFGNLKHD 189
            IH   ++ GF + V + TAL+  Y+ C  LE+A ++F   + ++VV+WNSMI   ++++
Sbjct: 261 EIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNE 320

Query: 190 KLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSNAIIDRTLQN 249
              EA+ +F +ML  G+ PT  +    L+ C   GD   GR +H   +      + ++ N
Sbjct: 321 NPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVN 380

Query: 250 VLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEEKAMKLFQQLLKSSLPK 309
            L+ +YC C +  TA  +F ++++  LV+WN +I G +QN     A+  F Q ++S   K
Sbjct: 381 SLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQ-MRSRTVK 440

Query: 310 PDDYTYAAVISTIDNLL---SGMSFLAQVIKDGFEGSVFVSSVIVSMLFRNGESQAAERV 369
           PD +TY +VI+ I  L            V++   + +VFV++ +V M  + G    A  +
Sbjct: 441 PDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLI 500

Query: 370 FVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLALSSCADLATL 429
           F  + E+ V  W  MI GY   G G+ A+  F +M +   + +  +    +S+C+    +
Sbjct: 501 FDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLV 560

Query: 430 KQG-EIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPC-PDLKCWNSMLG 489
           + G + F+ +      E  +   G+++D+  + G L  A     Q+P  P +  + +MLG
Sbjct: 561 EAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLG 620

Query: 490 GLLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLRVGVNAAKQVLRLDSEDS 549
                                             +C   KN+     AA+++  L+ +D 
Sbjct: 621 ----------------------------------ACQIHKNVNFAEKAAERLFELNPDDG 680

Query: 550 AAHILLSNLYAAAGKWDGVVEMRRRIRETMVGKDPGLSWIEAKTHIQAFSSGLQSHPEVD 609
             H+LL+N+Y AA  W+ V ++R  +    + K PG S +E K  + +F SG  +HP+  
Sbjct: 681 GYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSK 716

Query: 610 EALTTLLRLRGNMSE 620
           +    L +L  ++ E
Sbjct: 741 KIYAFLEKLICHIKE 716

BLAST of HG10000673 vs. ExPASy Swiss-Prot
Match: Q9LYV3 (Putative pentatricopeptide repeat-containing protein At5g13230, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H89 PE=3 SV=1)

HSP 1 Score: 267.7 bits (683), Expect = 3.3e-70
Identity = 181/673 (26.89%), Postives = 321/673 (47.70%), Query Frame = 0

Query: 9   ALVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWESQQVFDKM 68
           A++++C       +A+ +H  IL      S L  +  N +L+ Y + G   ++  +FD+M
Sbjct: 54  AMLRRCIQKNDPISAKAIHCDILKK---GSCLDLFATNILLNAYVKAGFDKDALNLFDEM 113

Query: 69  PQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAASSIEDRFWG 128
           P+RN VSF  L   Y+           L S++  E    N    TS L+   S++     
Sbjct: 114 PERNNVSFVTLAQGYACQD-----PIGLYSRLHREGHELNPHVFTSFLKLFVSLDKAEIC 173

Query: 129 SLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNSMIFGNLKH 188
             +H+ +VK G+ ++  V  ALI  YS C  ++SA  VF   + KD+V W  ++   +++
Sbjct: 174 PWLHSPIVKLGYDSNAFVGAALINAYSVCGSVDSARTVFEGILCKDIVVWAGIVSCYVEN 233

Query: 189 DKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSNAIIDRTLQ 248
               ++L+L + M   G +P  +T+   L      G + F + VHG+I+ +  ++D  + 
Sbjct: 234 GYFEDSLKLLSCMRMAGFMPNNYTFDTALKASIGLGAFDFAKGVHGQILKTCYVLDPRVG 293

Query: 249 NVLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEEKAMKLFQQLLKSSLP 308
             LL LY   GD   AF +FN +   D+V W+ +I+   QN    +A+ LF + ++ +  
Sbjct: 294 VGLLQLYTQLGDMSDAFKVFNEMPKNDVVPWSFMIARFCQNGFCNEAVDLFIR-MREAFV 353

Query: 309 KPDDYTYAAVI--------STIDNLLSGMSFLAQVIKDGFEGSVFVSSVIVSMLFRNGES 368
            P+++T ++++        S +   L G+     V+K GF+  ++VS+ ++ +  +  + 
Sbjct: 354 VPNEFTLSSILNGCAIGKCSGLGEQLHGL-----VVKVGFDLDIYVSNALIDVYAKCEKM 413

Query: 369 QAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLALSSC 428
             A ++F  +  K+ V W  +I GY  +GEG KA   F +  +N   +   + S AL +C
Sbjct: 414 DTAVKLFAELSSKNEVSWNTVIVGYENLGEGGKAFSMFREALRNQVSVTEVTFSSALGAC 473

Query: 429 ADLATLKQGEIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPCPDLKCWN 488
           A LA++  G   H LAIKT    ++ V  SLIDMYAK GD+  A  +F+++   D+  WN
Sbjct: 474 ASLASMDLGVQVHGLAIKTNNAKKVAVSNSLIDMYAKCGDIKFAQSVFNEMETIDVASWN 533

Query: 489 SMLGG------------------------------------------------------- 548
           +++ G                                                       
Sbjct: 534 ALISGYSTHGLGRQALRILDIMKDRDCKPNGLTFLGVLSGCSNAGLIDQGQECFESMIRD 593

Query: 549 ---------------LLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLRVGV 604
                          LL  +G +D+A ++I   P+      +WR +LS+ + + N     
Sbjct: 594 HGIEPCLEHYTCMVRLLGRSGQLDKAMKLIEGIPY-EPSVMIWRAMLSASMNQNNEEFAR 653

BLAST of HG10000673 vs. ExPASy TrEMBL
Match: A0A5A7TSH7 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold123G00390 PE=4 SV=1)

HSP 1 Score: 1105.5 bits (2858), Expect = 0.0e+00
Identity = 570/696 (81.90%), Postives = 597/696 (85.78%), Query Frame = 0

Query: 1   MTEALSLAALVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWE 60
           MTEALSL+ALVQKCTSV SLKAARQLHALILTSIATASSLSPY+CNNILSMYARCG IWE
Sbjct: 1   MTEALSLSALVQKCTSVASLKAARQLHALILTSIATASSLSPYVCNNILSMYARCGAIWE 60

Query: 61  SQQVFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAAS 120
           SQQVF+KMPQRNLVSFNAL+AAYSRSH HAPLAFNLLSQMELEFLRPN  T+TSLLQAAS
Sbjct: 61  SQQVFEKMPQRNLVSFNALIAAYSRSHGHAPLAFNLLSQMELEFLRPNSFTITSLLQAAS 120

Query: 121 SIEDRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNS 180
           S+ED+FW SLIHTQVVK GFV+DVRVQTALIGTYS+CLDLESAGKVFR TI KDVV WN+
Sbjct: 121 SLEDQFWSSLIHTQVVKCGFVHDVRVQTALIGTYSYCLDLESAGKVFRCTIDKDVVTWNT 180

Query: 181 MIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSN 240
           MIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRL+HGRIITSN
Sbjct: 181 MIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLIHGRIITSN 240

Query: 241 AIIDRTLQNVLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEEKAMKLFQ 300
           AIIDRTLQNVLLDLYC CGD HTAF IFN  EN DLVAWNTIISGCS+NEE EKAMKLFQ
Sbjct: 241 AIIDRTLQNVLLDLYCNCGDIHTAFCIFNSNENPDLVAWNTIISGCSENEEGEKAMKLFQ 300

Query: 301 QLLKSSLPKPDDYTYAAVISTIDNLLSGMSFLAQVIKDGFEGSVFVSSVIVSMLFRNGES 360
           QL KSSL KPDDYTYAAVISTIDNLLSGMSF+AQVIKDGFEGSVF+SSVIVSMLFRNGES
Sbjct: 301 QLKKSSLTKPDDYTYAAVISTIDNLLSGMSFIAQVIKDGFEGSVFISSVIVSMLFRNGES 360

Query: 361 QAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLALSSC 420
           QAA RVFV+V EKDVVLWTEMISGY RIGEGEKAI+CFHQM QNGHELDSFSLSL LSSC
Sbjct: 361 QAAARVFVTVAEKDVVLWTEMISGYSRIGEGEKAIKCFHQMRQNGHELDSFSLSLVLSSC 420

Query: 421 ADLATLKQGEIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPCPDLKCWN 480
           ADLATLKQGEIFHSLA+KTGCEA+IYVLGSLI+MYAKNGDLGSA LIFSQVPCPDLKCWN
Sbjct: 421 ADLATLKQGEIFHSLALKTGCEAEIYVLGSLINMYAKNGDLGSAQLIFSQVPCPDLKCWN 480

Query: 481 SMLGG------------------------------------------------------- 540
           SMLGG                                                       
Sbjct: 481 SMLGGYSHHGNMEQALNLFFNLRNNGVKPDQVTFLSLLSACNHSNSVEIGQFLWNYMKEC 540

Query: 541 --------------LLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLRVGVN 600
                         LLSGAG +DEAEEMITKSPFANNDPELWRTLLSSCV +KNLRVGVN
Sbjct: 541 DIIPNSKHYSCMVSLLSGAGFIDEAEEMITKSPFANNDPELWRTLLSSCVVKKNLRVGVN 600

Query: 601 AAKQVLRLDSEDSAAHILLSNLYAAAGKWDGVVEMRRRIRETMVGKDPGLSWIEAKTHIQ 628
           AAKQVLR+D EDSAAHILLSNLYAAAGKWDGV+EMRRRIRE +VGKDPG+SWIEAKT IQ
Sbjct: 601 AAKQVLRIDPEDSAAHILLSNLYAAAGKWDGVIEMRRRIREKLVGKDPGVSWIEAKTKIQ 660

BLAST of HG10000673 vs. ExPASy TrEMBL
Match: A0A1S3C6D5 (pentatricopeptide repeat-containing protein At3g50420 OS=Cucumis melo OX=3656 GN=LOC103497412 PE=4 SV=1)

HSP 1 Score: 1105.5 bits (2858), Expect = 0.0e+00
Identity = 570/696 (81.90%), Postives = 597/696 (85.78%), Query Frame = 0

Query: 1   MTEALSLAALVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWE 60
           MTEALSL+ALVQKCTSV SLKAARQLHALILTSIATASSLSPY+CNNILSMYARCG IWE
Sbjct: 1   MTEALSLSALVQKCTSVASLKAARQLHALILTSIATASSLSPYVCNNILSMYARCGAIWE 60

Query: 61  SQQVFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAAS 120
           SQQVF+KMPQRNLVSFNAL+AAYSRSH HAPLAFNLLSQMELEFLRPN  T+TSLLQAAS
Sbjct: 61  SQQVFEKMPQRNLVSFNALIAAYSRSHGHAPLAFNLLSQMELEFLRPNSFTITSLLQAAS 120

Query: 121 SIEDRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNS 180
           S+ED+FW SLIHTQVVK GFV+DVRVQTALIGTYS+CLDLESAGKVFR TI KDVV WN+
Sbjct: 121 SLEDQFWSSLIHTQVVKCGFVHDVRVQTALIGTYSYCLDLESAGKVFRCTIDKDVVTWNT 180

Query: 181 MIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSN 240
           MIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRL+HGRIITSN
Sbjct: 181 MIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLIHGRIITSN 240

Query: 241 AIIDRTLQNVLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEEKAMKLFQ 300
           AIIDRTLQNVLLDLYC CGD HTAF IFN  EN DLVAWNTIISGCS+NEE EKAMKLFQ
Sbjct: 241 AIIDRTLQNVLLDLYCNCGDIHTAFCIFNSNENPDLVAWNTIISGCSENEEGEKAMKLFQ 300

Query: 301 QLLKSSLPKPDDYTYAAVISTIDNLLSGMSFLAQVIKDGFEGSVFVSSVIVSMLFRNGES 360
           QL KSSL KPDDYTYAAVISTIDNLLSGMSF+AQVIKDGFEGSVF+SSVIVSMLFRNGES
Sbjct: 301 QLKKSSLTKPDDYTYAAVISTIDNLLSGMSFIAQVIKDGFEGSVFISSVIVSMLFRNGES 360

Query: 361 QAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLALSSC 420
           QAA RVFV+V EKDVVLWTEMISGY RIGEGEKAI+CFHQM QNGHELDSFSLSL LSSC
Sbjct: 361 QAAARVFVTVAEKDVVLWTEMISGYSRIGEGEKAIKCFHQMRQNGHELDSFSLSLVLSSC 420

Query: 421 ADLATLKQGEIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPCPDLKCWN 480
           ADLATLKQGEIFHSLA+KTGCEA+IYVLGSLI+MYAKNGDLGSA LIFSQVPCPDLKCWN
Sbjct: 421 ADLATLKQGEIFHSLALKTGCEAEIYVLGSLINMYAKNGDLGSAQLIFSQVPCPDLKCWN 480

Query: 481 SMLGG------------------------------------------------------- 540
           SMLGG                                                       
Sbjct: 481 SMLGGYSHHGNMEQALNLFFNLRNNGVKPDQVTFLSLLSACNHSNSVEIGQFLWNYMKEC 540

Query: 541 --------------LLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLRVGVN 600
                         LLSGAG +DEAEEMITKSPFANNDPELWRTLLSSCV +KNLRVGVN
Sbjct: 541 DIIPNSKHYSCMVSLLSGAGFIDEAEEMITKSPFANNDPELWRTLLSSCVVKKNLRVGVN 600

Query: 601 AAKQVLRLDSEDSAAHILLSNLYAAAGKWDGVVEMRRRIRETMVGKDPGLSWIEAKTHIQ 628
           AAKQVLR+D EDSAAHILLSNLYAAAGKWDGV+EMRRRIRE +VGKDPG+SWIEAKT IQ
Sbjct: 601 AAKQVLRIDPEDSAAHILLSNLYAAAGKWDGVIEMRRRIREKLVGKDPGVSWIEAKTKIQ 660

BLAST of HG10000673 vs. ExPASy TrEMBL
Match: A0A0A0LJT0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G286470 PE=4 SV=1)

HSP 1 Score: 1102.8 bits (2851), Expect = 0.0e+00
Identity = 569/696 (81.75%), Postives = 594/696 (85.34%), Query Frame = 0

Query: 1   MTEALSLAALVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWE 60
           MTEALSL+ALVQKC SV SLKAARQLHALILTSIATASSLSPY+CNNILSMYARCG IWE
Sbjct: 37  MTEALSLSALVQKCISVTSLKAARQLHALILTSIATASSLSPYVCNNILSMYARCGAIWE 96

Query: 61  SQQVFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAAS 120
           SQ+VF+KMPQRNLVSFNAL+AAYSRSH HAPLAFNLLSQMELEFL+PN  T+TSLLQAAS
Sbjct: 97  SQKVFEKMPQRNLVSFNALIAAYSRSHGHAPLAFNLLSQMELEFLKPNSFTITSLLQAAS 156

Query: 121 SIEDRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNS 180
             ED FW SLIH QVVK GFV+DVRVQTALIGTYSHCLDLESAGKVFRWTI KDVV WN+
Sbjct: 157 FTEDPFWSSLIHAQVVKCGFVHDVRVQTALIGTYSHCLDLESAGKVFRWTIDKDVVTWNT 216

Query: 181 MIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSN 240
           MIFGNLKHDKLNEALRLFN+MLGIGLIPTQFTYAMILNICCRNGDYLFGRL+HGRIITSN
Sbjct: 217 MIFGNLKHDKLNEALRLFNQMLGIGLIPTQFTYAMILNICCRNGDYLFGRLIHGRIITSN 276

Query: 241 AIIDRTLQNVLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEEKAMKLFQ 300
           AIIDRTLQNVLLDLYC CGD HTAF IFNR EN DLVAWNTIISGCS+NEE+EKAMKLFQ
Sbjct: 277 AIIDRTLQNVLLDLYCNCGDIHTAFCIFNRNENPDLVAWNTIISGCSENEEDEKAMKLFQ 336

Query: 301 QLLKSSLPKPDDYTYAAVISTIDNLLSGMSFLAQVIKDGFEGSVFVSSVIVSMLFRNGES 360
           QL KSSL KPDDYTYAAVISTIDNLLSGMSF+AQVIKDGFEGSVF+SSVIVSMLFRNGES
Sbjct: 337 QLKKSSLTKPDDYTYAAVISTIDNLLSGMSFIAQVIKDGFEGSVFISSVIVSMLFRNGES 396

Query: 361 QAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLALSSC 420
           QAA RVFV+V  KDVVLWTEMISGY RIGEGEKAI+CFHQMHQNGHELDSFSLSLALSSC
Sbjct: 397 QAAARVFVTVAVKDVVLWTEMISGYSRIGEGEKAIKCFHQMHQNGHELDSFSLSLALSSC 456

Query: 421 ADLATLKQGEIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPCPDLKCWN 480
           ADLATLKQGEIFHSLAIKTG EA+IYVLGSLI+MYAKNGDLGSA LIFSQVPCPDLKCWN
Sbjct: 457 ADLATLKQGEIFHSLAIKTGSEAEIYVLGSLINMYAKNGDLGSAQLIFSQVPCPDLKCWN 516

Query: 481 SMLGG------------------------------------------------------- 540
           SMLGG                                                       
Sbjct: 517 SMLGGYSHHGNMEQALNLFFNLQNNGVKPDQVTFLSLLSACNHSNSVEIGQFLWNYMKEC 576

Query: 541 --------------LLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLRVGVN 600
                         LLSGAG MDEAEEMITKSPFANNDPELWRTLLSSCV +KNLRVGVN
Sbjct: 577 NIIPNSKHYSCMVSLLSGAGFMDEAEEMITKSPFANNDPELWRTLLSSCVVKKNLRVGVN 636

Query: 601 AAKQVLRLDSEDSAAHILLSNLYAAAGKWDGVVEMRRRIRETMVGKDPGLSWIEAKTHIQ 628
           AAKQVLR+D EDSAAHILLSNLYAAAGKWDGVVEMRRRIRE MVGKDPG+SWIEAK+ IQ
Sbjct: 637 AAKQVLRIDPEDSAAHILLSNLYAAAGKWDGVVEMRRRIREKMVGKDPGVSWIEAKSKIQ 696

BLAST of HG10000673 vs. ExPASy TrEMBL
Match: A0A6J1EIS4 (pentatricopeptide repeat-containing protein At3g50420 OS=Cucurbita moschata OX=3662 GN=LOC111434939 PE=4 SV=1)

HSP 1 Score: 1068.5 bits (2762), Expect = 1.0e-308
Identity = 551/697 (79.05%), Postives = 579/697 (83.07%), Query Frame = 0

Query: 1   MTEALSLAALVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWE 60
           MTEALSLAALVQKCTSV SLKAARQLHALILTS  TASSLSPYICNNI+SMYARCGTIWE
Sbjct: 1   MTEALSLAALVQKCTSVASLKAARQLHALILTSAPTASSLSPYICNNIVSMYARCGTIWE 60

Query: 61  SQQVFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAAS 120
           SQ+VFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPN STLTSLLQAAS
Sbjct: 61  SQKVFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNASTLTSLLQAAS 120

Query: 121 SIEDRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNS 180
           S+ED  WGSL+HTQVVKRGFVNDV VQTALIGTYSHCLDLESAGKVFRWTI KD+VAWNS
Sbjct: 121 SMEDGLWGSLVHTQVVKRGFVNDVPVQTALIGTYSHCLDLESAGKVFRWTINKDIVAWNS 180

Query: 181 MIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSN 240
           MIFGNLK+DKLNEALRLFNEMLGIGL+PTQFTY+M+LNICCRNGD+ FGRLVHGRIITSN
Sbjct: 181 MIFGNLKNDKLNEALRLFNEMLGIGLVPTQFTYSMVLNICCRNGDWQFGRLVHGRIITSN 240

Query: 241 AIIDRTLQNVLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEEKAMKLFQ 300
           AIIDRTLQNVLLDLYC CGDTHTAFWIFNRIEN DLV WNTIISGCS+NEEEE AMKLF 
Sbjct: 241 AIIDRTLQNVLLDLYCNCGDTHTAFWIFNRIENPDLVTWNTIISGCSENEEEELAMKLFL 300

Query: 301 QLLKSSLPKPDDYTYAAVISTIDNLLSGMSFLAQVIKDGFEGSVFVSSVIVSMLFRNGES 360
           QL KSSLPKPDDYTYAAVISTID+  SGMSF+AQVIKDGFE SVFVSSVIVSMLFR+GES
Sbjct: 301 QLRKSSLPKPDDYTYAAVISTIDSPRSGMSFVAQVIKDGFESSVFVSSVIVSMLFRSGES 360

Query: 361 QAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLALSSC 420
           QAAERVF  +  KDVVLWTEMISGY R+GEGEK I+CFHQMHQ GHE DSFSLSLALS C
Sbjct: 361 QAAERVFGCIAMKDVVLWTEMISGYSRMGEGEKGIKCFHQMHQAGHETDSFSLSLALSLC 420

Query: 421 ADLATLKQGEIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPCPDLKCWN 480
           ADLATLKQGE FHS AIKTGCEA+ YVLGSLIDMYAKNGDL SA  +FSQ+P PDLKCWN
Sbjct: 421 ADLATLKQGETFHSQAIKTGCEAETYVLGSLIDMYAKNGDLSSAKFVFSQIPYPDLKCWN 480

Query: 481 SMLGG------------------------------------------------------- 540
           S+LGG                                                       
Sbjct: 481 SILGGYSHHGNMEEALNLFFELRDHGVEPDQVTFLSLLSACNHSSSVEIGKFLWNYMKES 540

Query: 541 --------------LLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLRVGVN 600
                         LLS AGLMDEAEEMI KSPFA+ DPELWRTLLSSCVA+KNLRVG+ 
Sbjct: 541 NIMPNCKHYSCMVSLLSRAGLMDEAEEMIIKSPFADGDPELWRTLLSSCVAKKNLRVGLR 600

Query: 601 AAKQVLRLDSEDSAAHILLSNLYAAAGKWDGVVEMRRRIRETMVGKDPGLSWIEAKTHIQ 629
           AA +VL LDSEDSAAHILLSNLYAAAGKWDGV E RRRIRE+ VGK+PGLSWIE+K HIQ
Sbjct: 601 AANRVLGLDSEDSAAHILLSNLYAAAGKWDGVAETRRRIRESRVGKEPGLSWIESKKHIQ 660

BLAST of HG10000673 vs. ExPASy TrEMBL
Match: A0A6J1I975 (pentatricopeptide repeat-containing protein At3g50420 OS=Cucurbita maxima OX=3661 GN=LOC111470345 PE=4 SV=1)

HSP 1 Score: 1062.8 bits (2747), Expect = 5.7e-307
Identity = 549/697 (78.77%), Postives = 577/697 (82.78%), Query Frame = 0

Query: 1   MTEALSLAALVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWE 60
           MTEALSLAALVQKCTSV S KAARQLHALILTS  TASSLSPYICNNI+SMYARCGTIWE
Sbjct: 1   MTEALSLAALVQKCTSVASQKAARQLHALILTSAPTASSLSPYICNNIVSMYARCGTIWE 60

Query: 61  SQQVFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAAS 120
           SQ+VFD MPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPN STLTSLLQAAS
Sbjct: 61  SQKVFDTMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNASTLTSLLQAAS 120

Query: 121 SIEDRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNS 180
           S+EDRFWG L+ TQVVKRGFVNDV VQTALIGTYSHCLDLESAGKVFRWTI KD+VAWNS
Sbjct: 121 SMEDRFWGPLVQTQVVKRGFVNDVPVQTALIGTYSHCLDLESAGKVFRWTINKDIVAWNS 180

Query: 181 MIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSN 240
           MIFGNLK+DKLNEALRLFNEMLGIGL+PTQFTY+M+LNICCRNGD+ FGRLVHGRIITSN
Sbjct: 181 MIFGNLKNDKLNEALRLFNEMLGIGLVPTQFTYSMVLNICCRNGDWQFGRLVHGRIITSN 240

Query: 241 AIIDRTLQNVLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEEKAMKLFQ 300
           AIIDRTLQNVLLDLYC CGDTHTAFWIFNRIEN DLV WNTIISGCS+NEEEE AMKLF 
Sbjct: 241 AIIDRTLQNVLLDLYCNCGDTHTAFWIFNRIENPDLVTWNTIISGCSENEEEEMAMKLFL 300

Query: 301 QLLKSSLPKPDDYTYAAVISTIDNLLSGMSFLAQVIKDGFEGSVFVSSVIVSMLFRNGES 360
           QL KSSLPK D YTYAAVISTID+  SGMSF+AQVIKDGFE SVFVSSVIVSMLFR+GES
Sbjct: 301 QLRKSSLPKLDGYTYAAVISTIDSPRSGMSFVAQVIKDGFESSVFVSSVIVSMLFRSGES 360

Query: 361 QAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLALSSC 420
           QAAERVF  +  KDVVLWTEMISGY R+GEGEK I+CFHQMHQ GHE+DSFSLSLALS C
Sbjct: 361 QAAERVFGCIAMKDVVLWTEMISGYSRMGEGEKGIKCFHQMHQAGHEIDSFSLSLALSLC 420

Query: 421 ADLATLKQGEIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPCPDLKCWN 480
           ADLATLKQGE FHS AIKTGCEA+ YVLGSLIDMYAKNGDL SA  +FSQ+P PDLKCWN
Sbjct: 421 ADLATLKQGETFHSQAIKTGCEAETYVLGSLIDMYAKNGDLSSAKFVFSQIPYPDLKCWN 480

Query: 481 SMLGG------------------------------------------------------- 540
           S+LGG                                                       
Sbjct: 481 SILGGYSHRGNMEEALNLFFELRDHGVEPDQVTFLSLLSACNHSSSVEIGKFLWSYMKES 540

Query: 541 --------------LLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLRVGVN 600
                         LLS AG MDEAEEMI KSPFA+ DPELWRTLLSSCVA+KNLRVGV 
Sbjct: 541 NIMPNCKHYSCMVSLLSRAGFMDEAEEMIIKSPFADGDPELWRTLLSSCVAKKNLRVGVR 600

Query: 601 AAKQVLRLDSEDSAAHILLSNLYAAAGKWDGVVEMRRRIRETMVGKDPGLSWIEAKTHIQ 629
           AAK+VL LDSEDSAAHILLSNLYAAAGKWDGV EMRRRIRE+ VGK+PGLSWIE+K HIQ
Sbjct: 601 AAKRVLGLDSEDSAAHILLSNLYAAAGKWDGVAEMRRRIRESRVGKEPGLSWIESKRHIQ 660

BLAST of HG10000673 vs. TAIR 10
Match: AT3G50420.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 602.8 bits (1553), Expect = 3.1e-172
Identity = 316/688 (45.93%), Postives = 443/688 (64.39%), Query Frame = 0

Query: 4   ALSLAALVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWESQQ 63
           A S+  L +KC S+  LK ARQ+HAL+LT+ A A++ SPY  NN++SMY RCG++ ++++
Sbjct: 94  ASSVVELTRKCVSITVLKRARQIHALVLTAGAGAATESPYANNNLISMYVRCGSLEQARK 153

Query: 64  VFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAASSIE 123
           VFDKMP RN+VS+NAL +AYSR+ D A  AF L + M  E+++PN ST TSL+Q  + +E
Sbjct: 154 VFDKMPHRNVVSYNALYSAYSRNPDFASYAFPLTTHMAFEYVKPNSSTFTSLVQVCAVLE 213

Query: 124 DRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNSMIF 183
           D   GS +++Q++K G+ ++V VQT+++G YS C DLESA ++F     +D VAWN+MI 
Sbjct: 214 DVLMGSSLNSQIIKLGYSDNVVVQTSVLGMYSSCGDLESARRIFDCVNNRDAVAWNTMIV 273

Query: 184 GNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSNAII 243
           G+LK+DK+ + L  F  ML  G+ PTQFTY+++LN C + G Y  G+L+H RII S+++ 
Sbjct: 274 GSLKNDKIEDGLMFFRNMLMSGVDPTQFTYSIVLNGCSKLGSYSLGKLIHARIIVSDSLA 333

Query: 244 DRTLQNVLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEEKAMKLFQQLL 303
           D  L N LLD+YC CGD   AF++F RI N +LV+WN+IISGCS+N   E+AM ++++LL
Sbjct: 334 DLPLDNALLDMYCSCGDMREAFYVFGRIHNPNLVSWNSIISGCSENGFGEQAMLMYRRLL 393

Query: 304 KSSLPKPDDYTYAAVISTI---DNLLSGMSFLAQVIKDGFEGSVFVSSVIVSMLFRNGES 363
           + S P+PD+YT++A IS     +  + G     QV K G+E SVFV + ++SM F+N E+
Sbjct: 394 RMSTPRPDEYTFSAAISATAEPERFVHGKLLHGQVTKLGYERSVFVGTTLLSMYFKNREA 453

Query: 364 QAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLALSSC 423
           ++A++VF  + E+DVVLWTEMI G+ R+G  E A++ F +M++  +  D FSLS  + +C
Sbjct: 454 ESAQKVFDVMKERDVVLWTEMIVGHSRLGNSELAVQFFIEMYREKNRSDGFSLSSVIGAC 513

Query: 424 ADLATLKQGEIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPCPDLKCWN 483
           +D+A L+QGE+FH LAI+TG +  + V G+L+DMY KNG   +A  IFS    PDLKCWN
Sbjct: 514 SDMAMLRQGEVFHCLAIRTGFDCVMSVCGALVDMYGKNGKYETAETIFSLASNPDLKCWN 573

Query: 484 SMLG-------------------------------------------------------- 543
           SMLG                                                        
Sbjct: 574 SMLGAYSQHGMVEKALSFFEQILENGFMPDAVTYLSLLAACSHRGSTLQGKFLWNQMKEQ 633

Query: 544 -------------GLLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLRVGVN 603
                         L+S AGL+DEA E+I +SP  NN  ELWRTLLS+CV  +NL++G+ 
Sbjct: 634 GIKAGFKHYSCMVNLVSKAGLVDEALELIEQSPPGNNQAELWRTLLSACVNTRNLQIGLY 693

Query: 604 AAKQVLRLDSEDSAAHILLSNLYAAAGKWDGVVEMRRRIRETMVGKDPGLSWIEA-KTHI 618
           AA+Q+L+LD ED+A HILLSNLYA  G+W+ V EMRR+IR     KDPGLSWIE    + 
Sbjct: 694 AAEQILKLDPEDTATHILLSNLYAVNGRWEDVAEMRRKIRGLASSKDPGLSWIEVNNNNT 753

BLAST of HG10000673 vs. TAIR 10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 326.6 bits (836), Expect = 4.3e-89
Identity = 200/678 (29.50%), Postives = 333/678 (49.12%), Query Frame = 0

Query: 1   MTEALSLAALVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWE 60
           M    + ++++  C  + SL+   QLH L+L       S   Y+CN ++S+Y   G +  
Sbjct: 285 MPTPYAFSSVLSACKKIESLEIGEQLHGLVL---KLGFSSDTYVCNALVSLYFHLGNLIS 344

Query: 61  SQQVFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAAS 120
           ++ +F  M QR+ V++N L+   S+   +   A  L  +M L+ L P+ +TL SL+ A S
Sbjct: 345 AEHIFSNMSQRDAVTYNTLINGLSQC-GYGEKAMELFKRMHLDGLEPDSNTLASLVVACS 404

Query: 121 SIEDRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNS 180
           +    F G  +H    K GF ++ +++ AL+  Y+ C D+E+A   F  T  ++VV WN 
Sbjct: 405 ADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKCADIETALDYFLETEVENVVLWNV 464

Query: 181 MIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSN 240
           M+      D L  + R+F +M    ++P Q+TY  IL  C R GD   G  +H +II +N
Sbjct: 465 MLVAYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTN 524

Query: 241 AIIDRTLQNVLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEEKAMKLFQ 300
             ++  + +VL+D+Y   G   TA+ I  R    D+V+W T+I+G +Q   ++KA+  F+
Sbjct: 525 FQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQYNFDDKALTTFR 584

Query: 301 QLLKSSLPKPDDYTYAAVIST---IDNLLSGMSFLAQVIKDGFEGSVFVSSVIVSMLFRN 360
           Q+L   + + D+      +S    +  L  G    AQ    GF   +   + +V++  R 
Sbjct: 585 QMLDRGI-RSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLPFQNALVTLYSRC 644

Query: 361 GESQAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLAL 420
           G+ + +   F      D + W  ++SG+ + G  E+A+R F +M++ G + ++F+   A+
Sbjct: 645 GKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAV 704

Query: 421 SSCADLATLKQGEIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPCPDLK 480
            + ++ A +KQG+  H++  KTG +++  V  +LI MYAK G +  A   F +V   +  
Sbjct: 705 KAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYAKCGSISDAEKQFLEVSTKNEV 764

Query: 481 CWNSMLGG---------------------------------------------------- 540
            WN+++                                                      
Sbjct: 765 SWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESM 824

Query: 541 ------------------LLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLR 600
                             +L+ AGL+  A+E I + P    D  +WRTLLS+CV  KN+ 
Sbjct: 825 NSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPI-KPDALVWRTLLSACVVHKNME 884

Query: 601 VGVNAAKQVLRLDSEDSAAHILLSNLYAAAGKWDGVVEMRRRIRETMVGKDPGLSWIEAK 606
           +G  AA  +L L+ EDSA ++LLSNLYA + KWD     R++++E  V K+PG SWIE K
Sbjct: 885 IGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVK 944

BLAST of HG10000673 vs. TAIR 10
Match: AT2G27610.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 282.3 bits (721), Expect = 9.3e-76
Identity = 166/569 (29.17%), Postives = 311/569 (54.66%), Query Frame = 0

Query: 44  ICNNILSMYARCGTIWESQQVFDKMPQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELE 103
           + N+++++Y +CG + +++ +FDK   +++V++N++++ Y+ +      A  +   M L 
Sbjct: 231 VSNSLINLYLKCGNVRKARILFDKTEVKSVVTWNSMISGYAANGLDLE-ALGMFYSMRLN 290

Query: 104 FLRPNCSTLTSLLQAASSIEDRFWGSLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESA 163
           ++R + S+  S+++  +++++  +   +H  VVK GF+ D  ++TAL+  YS C  +  A
Sbjct: 291 YVRLSESSFASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSKCTAMLDA 350

Query: 164 GKVFRWTIG--KDVVAWNSMIFGNLKHDKLNEALRLFNEMLGIGLIPTQFTYAMILNICC 223
            ++F+  IG   +VV+W +MI G L++D   EA+ LF+EM   G+ P +FTY++IL    
Sbjct: 351 LRLFK-EIGCVGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTYSVILTAL- 410

Query: 224 RNGDYLFGRLVHGRIITSNAIIDRTLQNVLLDLYCICGDTHTAFWIFNRIENADLVAWNT 283
                +    VH +++ +N     T+   LLD Y   G    A  +F+ I++ D+VAW+ 
Sbjct: 411 ---PVISPSEVHAQVVKTNYERSSTVGTALLDAYVKLGKVEEAAKVFSGIDDKDIVAWSA 470

Query: 284 IISGCSQNEEEEKAMKLFQQLLKSSLPKPDDYTYAAVI----STIDNLLSGMSFLAQVIK 343
           +++G +Q  E E A+K+F +L K  + KP+++T+++++    +T  ++  G  F    IK
Sbjct: 471 MLAGYAQTGETEAAIKMFGELTKGGI-KPNEFTFSSILNVCAATNASMGQGKQFHGFAIK 530

Query: 344 DGFEGSVFVSSVIVSMLFRNGESQAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRC 403
              + S+ VSS +++M  + G  ++AE VF    EKD+V W  MISGY + G+  KA+  
Sbjct: 531 SRLDSSLCVSSALLTMYAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAMKALDV 590

Query: 404 FHQMHQNGHELDSFSLSLALSSCADLATLKQGEIFHSLAIKTGCEAQIYVLGS-LIDMYA 463
           F +M +   ++D  +     ++C     +++GE +  + ++    A      S ++D+Y+
Sbjct: 591 FKEMKKRKVKMDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCMVDLYS 650

Query: 464 KNGDLGSAILIFSQVPCPDLKCWNSMLGGLLSGAGLMDEAEEMITKSPFANNDPELWRTL 523
           + G L  A+ +   +P P               AG                    +WRT+
Sbjct: 651 RAGQLEKAMKVIENMPNP---------------AG------------------STIWRTI 710

Query: 524 LSSCVARKNLRVGVNAAKQVLRLDSEDSAAHILLSNLYAAAGKWDGVVEMRRRIRETMVG 583
           L++C   K   +G  AA++++ +  EDSAA++LLSN+YA +G W    ++R+ + E  V 
Sbjct: 711 LAACRVHKKTELGRLAAEKIIAMKPEDSAAYVLLSNMYAESGDWQERAKVRKLMNERNVK 759

Query: 584 KDPGLSWIEAKTHIQAFSSGLQSHPEVDE 606
           K+PG SWIE K    +F +G +SHP  D+
Sbjct: 771 KEPGYSWIEVKNKTYSFLAGDRSHPLKDQ 759

BLAST of HG10000673 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 269.2 bits (687), Expect = 8.2e-72
Identity = 171/615 (27.80%), Postives = 309/615 (50.24%), Query Frame = 0

Query: 10  LVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWESQQVFDKMP 69
           L++ C     L+  +++H L++ S     SL  +    + +MYA+C  + E+++VFD+MP
Sbjct: 141 LLKVCGDEAELRVGKEIHGLLVKS---GFSLDLFAMTGLENMYAKCRQVNEARKVFDRMP 200

Query: 70  QRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAASSIEDRFWGS 129
           +R+LVS+N ++A YS+ +  A +A  ++  M  E L+P+  T+ S+L A S++     G 
Sbjct: 201 ERDLVSWNTIVAGYSQ-NGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGK 260

Query: 130 LIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNSMIFGNLKHD 189
            IH   ++ GF + V + TAL+  Y+ C  LE+A ++F   + ++VV+WNSMI   ++++
Sbjct: 261 EIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNE 320

Query: 190 KLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSNAIIDRTLQN 249
              EA+ +F +ML  G+ PT  +    L+ C   GD   GR +H   +      + ++ N
Sbjct: 321 NPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVN 380

Query: 250 VLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEEKAMKLFQQLLKSSLPK 309
            L+ +YC C +  TA  +F ++++  LV+WN +I G +QN     A+  F Q ++S   K
Sbjct: 381 SLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQ-MRSRTVK 440

Query: 310 PDDYTYAAVISTIDNLL---SGMSFLAQVIKDGFEGSVFVSSVIVSMLFRNGESQAAERV 369
           PD +TY +VI+ I  L            V++   + +VFV++ +V M  + G    A  +
Sbjct: 441 PDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLI 500

Query: 370 FVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLALSSCADLATL 429
           F  + E+ V  W  MI GY   G G+ A+  F +M +   + +  +    +S+C+    +
Sbjct: 501 FDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLV 560

Query: 430 KQG-EIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPC-PDLKCWNSMLG 489
           + G + F+ +      E  +   G+++D+  + G L  A     Q+P  P +  + +MLG
Sbjct: 561 EAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLG 620

Query: 490 GLLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLRVGVNAAKQVLRLDSEDS 549
                                             +C   KN+     AA+++  L+ +D 
Sbjct: 621 ----------------------------------ACQIHKNVNFAEKAAERLFELNPDDG 680

Query: 550 AAHILLSNLYAAAGKWDGVVEMRRRIRETMVGKDPGLSWIEAKTHIQAFSSGLQSHPEVD 609
             H+LL+N+Y AA  W+ V ++R  +    + K PG S +E K  + +F SG  +HP+  
Sbjct: 681 GYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSK 716

Query: 610 EALTTLLRLRGNMSE 620
           +    L +L  ++ E
Sbjct: 741 KIYAFLEKLICHIKE 716

BLAST of HG10000673 vs. TAIR 10
Match: AT5G13230.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 267.7 bits (683), Expect = 2.4e-71
Identity = 181/673 (26.89%), Postives = 321/673 (47.70%), Query Frame = 0

Query: 9   ALVQKCTSVVSLKAARQLHALILTSIATASSLSPYICNNILSMYARCGTIWESQQVFDKM 68
           A++++C       +A+ +H  IL      S L  +  N +L+ Y + G   ++  +FD+M
Sbjct: 54  AMLRRCIQKNDPISAKAIHCDILKK---GSCLDLFATNILLNAYVKAGFDKDALNLFDEM 113

Query: 69  PQRNLVSFNALMAAYSRSHDHAPLAFNLLSQMELEFLRPNCSTLTSLLQAASSIEDRFWG 128
           P+RN VSF  L   Y+           L S++  E    N    TS L+   S++     
Sbjct: 114 PERNNVSFVTLAQGYACQD-----PIGLYSRLHREGHELNPHVFTSFLKLFVSLDKAEIC 173

Query: 129 SLIHTQVVKRGFVNDVRVQTALIGTYSHCLDLESAGKVFRWTIGKDVVAWNSMIFGNLKH 188
             +H+ +VK G+ ++  V  ALI  YS C  ++SA  VF   + KD+V W  ++   +++
Sbjct: 174 PWLHSPIVKLGYDSNAFVGAALINAYSVCGSVDSARTVFEGILCKDIVVWAGIVSCYVEN 233

Query: 189 DKLNEALRLFNEMLGIGLIPTQFTYAMILNICCRNGDYLFGRLVHGRIITSNAIIDRTLQ 248
               ++L+L + M   G +P  +T+   L      G + F + VHG+I+ +  ++D  + 
Sbjct: 234 GYFEDSLKLLSCMRMAGFMPNNYTFDTALKASIGLGAFDFAKGVHGQILKTCYVLDPRVG 293

Query: 249 NVLLDLYCICGDTHTAFWIFNRIENADLVAWNTIISGCSQNEEEEKAMKLFQQLLKSSLP 308
             LL LY   GD   AF +FN +   D+V W+ +I+   QN    +A+ LF + ++ +  
Sbjct: 294 VGLLQLYTQLGDMSDAFKVFNEMPKNDVVPWSFMIARFCQNGFCNEAVDLFIR-MREAFV 353

Query: 309 KPDDYTYAAVI--------STIDNLLSGMSFLAQVIKDGFEGSVFVSSVIVSMLFRNGES 368
            P+++T ++++        S +   L G+     V+K GF+  ++VS+ ++ +  +  + 
Sbjct: 354 VPNEFTLSSILNGCAIGKCSGLGEQLHGL-----VVKVGFDLDIYVSNALIDVYAKCEKM 413

Query: 369 QAAERVFVSVVEKDVVLWTEMISGYLRIGEGEKAIRCFHQMHQNGHELDSFSLSLALSSC 428
             A ++F  +  K+ V W  +I GY  +GEG KA   F +  +N   +   + S AL +C
Sbjct: 414 DTAVKLFAELSSKNEVSWNTVIVGYENLGEGGKAFSMFREALRNQVSVTEVTFSSALGAC 473

Query: 429 ADLATLKQGEIFHSLAIKTGCEAQIYVLGSLIDMYAKNGDLGSAILIFSQVPCPDLKCWN 488
           A LA++  G   H LAIKT    ++ V  SLIDMYAK GD+  A  +F+++   D+  WN
Sbjct: 474 ASLASMDLGVQVHGLAIKTNNAKKVAVSNSLIDMYAKCGDIKFAQSVFNEMETIDVASWN 533

Query: 489 SMLGG------------------------------------------------------- 548
           +++ G                                                       
Sbjct: 534 ALISGYSTHGLGRQALRILDIMKDRDCKPNGLTFLGVLSGCSNAGLIDQGQECFESMIRD 593

Query: 549 ---------------LLSGAGLMDEAEEMITKSPFANNDPELWRTLLSSCVARKNLRVGV 604
                          LL  +G +D+A ++I   P+      +WR +LS+ + + N     
Sbjct: 594 HGIEPCLEHYTCMVRLLGRSGQLDKAMKLIEGIPY-EPSVMIWRAMLSASMNQNNEEFAR 653

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038901658.10.0e+0083.52pentatricopeptide repeat-containing protein At3g50420 [Benincasa hispida][more]
XP_008457819.10.0e+0081.90PREDICTED: pentatricopeptide repeat-containing protein At3g50420 [Cucumis melo] ... [more]
KGN62097.20.0e+0081.75hypothetical protein Csa_006310 [Cucumis sativus][more]
XP_004148057.20.0e+0081.75pentatricopeptide repeat-containing protein At3g50420 isoform X1 [Cucumis sativu... [more]
XP_023512997.14.8e-31079.34pentatricopeptide repeat-containing protein At3g50420 [Cucurbita pepo subsp. pep... [more]
Match NameE-valueIdentityDescription
Q9SCT24.4e-17145.93Pentatricopeptide repeat-containing protein At3g50420 OS=Arabidopsis thaliana OX... [more]
Q9SVP76.1e-8829.50Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX... [more]
Q9ZUW31.3e-7429.17Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX... [more]
Q3E6Q11.1e-7027.80Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
Q9LYV33.3e-7026.89Putative pentatricopeptide repeat-containing protein At5g13230, mitochondrial OS... [more]
Match NameE-valueIdentityDescription
A0A5A7TSH70.0e+0081.90Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3C6D50.0e+0081.90pentatricopeptide repeat-containing protein At3g50420 OS=Cucumis melo OX=3656 GN... [more]
A0A0A0LJT00.0e+0081.75Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G286470 PE=4 SV=1[more]
A0A6J1EIS41.0e-30879.05pentatricopeptide repeat-containing protein At3g50420 OS=Cucurbita moschata OX=3... [more]
A0A6J1I9755.7e-30778.77pentatricopeptide repeat-containing protein At3g50420 OS=Cucurbita maxima OX=366... [more]
Match NameE-valueIdentityDescription
AT3G50420.13.1e-17245.93Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G13650.14.3e-8929.50Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G27610.19.3e-7629.17Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G11290.18.2e-7227.80Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G13230.12.4e-7126.89Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 377..409
e-value: 4.2E-7
score: 27.7
coord: 277..308
e-value: 2.0E-5
score: 22.5
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 449..469
e-value: 0.22
score: 11.8
coord: 249..273
e-value: 0.18
score: 12.1
coord: 44..72
e-value: 0.085
score: 13.1
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 275..320
e-value: 2.0E-9
score: 37.5
coord: 373..421
e-value: 1.2E-8
score: 35.0
coord: 173..222
e-value: 3.6E-12
score: 46.3
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 41..75
score: 9.119859
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 374..408
score: 10.698286
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 174..208
score: 11.279235
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 275..309
score: 10.358486
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 428..619
e-value: 1.1E-11
score: 46.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 2..126
e-value: 7.8E-16
score: 59.9
coord: 129..230
e-value: 4.1E-18
score: 67.4
coord: 240..324
e-value: 5.9E-15
score: 57.1
coord: 325..427
e-value: 2.8E-16
score: 61.4
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 374..577
NoneNo IPR availablePANTHERPTHR47925:SF15BNAC07G31780D PROTEINcoord: 87..486
NoneNo IPR availablePANTHERPTHR47925OS01G0913400 PROTEIN-RELATEDcoord: 87..486
NoneNo IPR availablePANTHERPTHR47925OS01G0913400 PROTEIN-RELATEDcoord: 7..88
coord: 486..619
NoneNo IPR availablePANTHERPTHR47925:SF15BNAC07G31780D PROTEINcoord: 7..88
coord: 486..619

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10000673.1HG10000673.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:1901962 S-adenosyl-L-methionine transmembrane transport
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005743 mitochondrial inner membrane
molecular_function GO:0005515 protein binding
molecular_function GO:0000095 S-adenosyl-L-methionine transmembrane transporter activity