HG10018388 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10018388
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr04: 3670346 .. 3673504 (-)
RNA-Seq Expression: HG10018388
Synteny: HG10018388
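
The Location value in the overview packs a chromosome name, two coordinates, and a strand into one string. The snippet below is a minimal sketch of how it could be unpacked, assuming the `chromosome: start .. end (strand)` layout shown above and assuming the coordinates are 1-based and inclusive (the page does not state this). The function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'Chr: start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "chrom": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Chr04: 3670346 .. 3673504 (-)")
print(loc)
```

Under the inclusive-coordinate assumption the gene spans 3159 bases on the minus strand of Chr04.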
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTGGTACACATACGTGGTTGGCCACTGAAGGTGGTAGTCCGCAGAGTTGGACACTGGAGGTGGTCGCTGTAGATGATTATCAAAGGTGGTTGTTAAAGGTGGTCAACAAAGCATGTCGCTAGACTGAGTCGGCAATAGTCATCAATGGTGGTTGTTGCTAGAGTTGGGCGTCGTAGGTAGTCGTCGAAGTTAACCACTAAAAGTGGTCGTTGGAGTATGTCACTAGCGATGATTGTTGTGCATAGTTGGTCATCAACGATGGTCGGCGAGTAGAGTCATTGGAAATATTAGAGGGAGGGAGAGAGAAAGAAGAGAATATGGAGTTTCTCGATAAATGAGTTCTCGAGAAAGGTACAAAATTCAATAACAAATACTGTGAAGTTGGTGAAGTTGAGTTATTTTGAGGGTCAATATGGATTGACTAGAGAAAAAAAAATTTTAAAAAATGTTTTTTAAAAAATTTACTTTTACTTTAAACTCTTTTGATAAAATCTAGTTAAAATACACTTCAAACACTATTTTAATAGTTATCAAATACTTCAACTTTTTACCAAATGACTTATTTTCAAAATTAAATAGTTGAAAAGTTAATCCGAACTTATCTTAAATCAATTTTTTTTCTTTTTAAATTTTCTTGAATCCAAAAAAGAGGGTTAATCCGCACGGAATTTTAGTGGGAAATGGATGGGGATACGGATATCTCAAGTGATGAACTGGAAACCAGAAGGAACCACTAATTTCAAGCCGGAACTGGAAGCACTTGCAGTTGCAGTAACGGGGCTACTTGCTAAAGCTCCACTATAAACTATTATGGGTTTCTCCATCAAATTTGCCGTCTCCCAGCCTGTTCAATCCATTGTCTTCCCTTCACGAACGCCCAATTATCAGGCCAGTCTCTCATTCCCTATTTAGAATTTTCGTAATGGCTGAAATTTTTGGCAGATAAGGAATTCAGCTTTAACAATAACACTTTCTGCTACCCAAAAAGCTATTGCAACCTCCGGTATCAAATTTCCCAATTCGGTTACAGTCCATAAAACTGATTCCCATCTTGAAATTCAGCCGCTTGTTGATCTTCTGCGTGATTGTGTGGATGCAACGTTTCTGAAACAAGCCAAAACTGTTCATGGGTTTTTGTTAAAATCAAAATTTTCAAACCACGATTCTCTGGTCTTGCTTAATCATGTTGCTCACGCTTATTCGAAATGCTCCGATATCGATGCTGCCTGTCGCCTGTTTGATCAAATGTCCCAGAGAAACATATTTTCATGGACTGTCATAATTGTTGGATTGGCTGAGAATGGTTTATTCCTCGATGGGTTTGAGCTCTTCTGTGAAATGCAGAGTCAGGGAATTTTCCCTGATCAGTTTGCTTATTCTGGTATATTGCAGATATGTATTGGTCTGGATTCCATTGAATTGGGAAAGATGGTACACGCCCAGATTGTTATTAGAGGCTTTGCATCTCATACTTTTGTATCTACTGCTCTTCTTAATATGTATGCAAAGTTACAAAAGATTGAGGATTCATACAATGTGTTTAACACCATGACTGAAGTTAATGTAGTCTCATGGAATGCTATGATCTCAGGGTTTACATCAAACGGTCTTTACTTAGATGCTTTTGATATTTTTCTCAGAATGAACGGGGAAGGAGTAACACCCGACGCACAAACATTTATTGGAGTTGCAAAAGCGATCGGTATGTTACGAGATGTGAACAAGGCAAAAGAAGTTAGCGGTTATGCTTCGGAGTTGGGTATGGACTCTAATACTCTTGTGGGAACTGCTCTCATTGATATGCATTCTAAATGTGGATCTTTGCAGGAGGCAGGATCTATTTTTGACTCACATTTCACAAATTGTCGGGTTAATGCACCGTGGAATGCAATGATTTCGGGGTATTTACAGAGTGAGTTTAATGAAAAGGCCTTGGAATTATTTGCCAAAATGTGTCAAAATGACATACACTTGGACCGTTACACTTACTGTAGTGT
ATTTAATGCTATAGCTGCTTTGAAGTGTTTGTCCTCGGGAAAGAAGGTTCATGCCAGGGCTATAAAATCAGGACTGGAAGTGAATTATATAAGTATTTCCAATGCAGTGGCTAATGCGTATGCTAAATGTGGATCGCTGGAGGATGTAAGGAAGGTCTTTTACAGGATGGAAGATAGAGATTTGGTATCTTGGACCAGCCTAGTGACTGCTTATTCTCAGTGTTCTGAATGGGATAAGGCAATAGAGATCTTCTCAAATATGAGAGAAGAAGGTTATGCACCCAATCAGTTCGCCTTTTCTAGCGTGCTTGTTTCATGTGCTAGCCTTTGCTTACTTGAGTATGGTCAGCAAGTCCACGGGTTCATCTGCAAGGTTGGCTTGGATATGGACAAATGCATTGAAAGTGCTCTGATTGACATGTATGCCAAATGTGGTTGTCTGGCTGAGGCGAAGAAGGTTTTCGACAGAATCTCTAATGCTGATACAGTGTCGTGGACTGCAATAATAGCTGGTCATGCTCAACACGGTATTGTCGATGATGCCCTTCAACTCTTTAGAAGGATGGAGCAGTTAGGTGTGGAGCCCAATGCTGTTACTTTTGTGTGTGTTCTATTTGCATGTAGCCACGGCGGTCTGGTAGAGGAAGGCCTACAGTACTTCAAGCTAATGAAGGAAAACTATGGTTTGGTGCCAGAGATGGAACATTATTCCTGTATCGTGGATCTCTTAAGTCGTGTGGGTCATCTAAATGATGCAATGGAATTTATTAGTAGGATGCCCGTAGAGCCCAATGAAATGGTTTGGCAGACCTTGCTGGGGGCTTGCAGGGTCCATGGTAATGTTGAATTGGGAGAGCTTGCTGCTCAGAAGATACTTTCTTTCAAAGCAGAAAACTCTGCCACCTATGTTCTTTTATCCAACACCTATATCGAATCAGGAAGTTACAAAGATGGACTTAGTTTGCGGCATGTGATGAAAGAGCAGGGGGTAAAAAAGGAACCAGGATGTAGTTGGATCTCTGTGAATGGTACATTGCACAAGTTTTATGCAGGTGACCAACAACATCCAGAAAAAGATAAAATTTATGCAAAGCTAGGAGAGTTGAGGTTGAAAGCCATTTCTCTGGATGATGTACCAGATTTGAGTTACGAGCTGTAA

mRNA sequence

ATGGTGGTACACATACGTGGTTGGCCACTGAAGGTGATAAGGAATTCAGCTTTAACAATAACACTTTCTGCTACCCAAAAAGCTATTGCAACCTCCGGTATCAAATTTCCCAATTCGGTTACAGTCCATAAAACTGATTCCCATCTTGAAATTCAGCCGCTTGTTGATCTTCTGCGTGATTGTGTGGATGCAACGTTTCTGAAACAAGCCAAAACTGTTCATGGGTTTTTGTTAAAATCAAAATTTTCAAACCACGATTCTCTGGTCTTGCTTAATCATGTTGCTCACGCTTATTCGAAATGCTCCGATATCGATGCTGCCTGTCGCCTGTTTGATCAAATGTCCCAGAGAAACATATTTTCATGGACTGTCATAATTGTTGGATTGGCTGAGAATGGTTTATTCCTCGATGGGTTTGAGCTCTTCTGTGAAATGCAGAGTCAGGGAATTTTCCCTGATCAGTTTGCTTATTCTGGTATATTGCAGATATGTATTGGTCTGGATTCCATTGAATTGGGAAAGATGGTACACGCCCAGATTGTTATTAGAGGCTTTGCATCTCATACTTTTGTATCTACTGCTCTTCTTAATATGTATGCAAAGTTACAAAAGATTGAGGATTCATACAATGTGTTTAACACCATGACTGAAGTTAATGTAGTCTCATGGAATGCTATGATCTCAGGGTTTACATCAAACGGTCTTTACTTAGATGCTTTTGATATTTTTCTCAGAATGAACGGGGAAGGAGTAACACCCGACGCACAAACATTTATTGGAGTTGCAAAAGCGATCGGTATGTTACGAGATGTGAACAAGGCAAAAGAAGTTAGCGGTTATGCTTCGGAGTTGGGATCTATTTTTGACTCACATTTCACAAATTCTGCTTTGAAGTGTTTGTCCTCGGGAAAGAAGGTTCATGCCAGGGCTATAAAATCAGGACTGGAAGTGAATTATATAAGTATTTCCAATGCAGTGGCTAATGCGTATGCTAAATGTGGATCGCTGGAGGATGTAAGGAAGGTCTTTTACAGGATGGAAGATAGAGATTTGGTATCTTGGACCAGCCTAGTGACTGCTTATTCTCAGTGTTCTGAATGGGATAAGGCAATAGAGATCTTCTCAAATATGAGAGAAGAAGGTTATGCACCCAATCAGTTCGCCTTTTCTAGCGTGCTTGTTTCATGTGCTAGCCTTTGCTTACTTGAGTATGGTCAGCAAGTCCACGGGTTCATCTGCAAGGTTGGCTTGGATATGGACAAATGCATTGAAAGTGCTCTGATTGACATGTATGCCAAATGTGGTTGTCTGGCTGAGGCGAAGAAGGTTTTCGACAGAATCTCTAATGCTGATACAGTGTCGTGGACTGCAATAATAGCTGGTCATGCTCAACACGGTATTGTCGATGATGCCCTTCAACTCTTTAGAAGGATGGAGCAGTTAGGTGTGGAGCCCAATGCTGTTACTTTTGTGTGTGTTCTATTTGCATGTAGCCACGGCGGTCTGGTAGAGGAAGGCCTACAGTACTTCAAGCTAATGAAGGAAAACTATGGTTTGGTGCCAGAGATGGAACATTATTCCTGTATCGTGGATCTCTTAAGTCGTGTGGGTCATCTAAATGATGCAATGGAATTTATTAGTAGGATGCCCGTAGAGCCCAATGAAATGGTTTGGCAGACCTTGCTGGGGGCTTGCAGGGTCCATGGTAATGTTGAATTGGGAGAGCTTGCTGCTCAGAAGATACTTTCTTTCAAAGCAGAAAACTCTGCCACCTATGTTCTTTTATCCAACACCTATATCGAATCAGGAAGTTACAAAGATGGACTTAGTTTGCGGCATGTGATGAAAGAGCAGGGGGTAAAAAAGGAACCAGGATGTAGTTGGATCTCTGTGAATGGTACATTGCACAAGTTTTATGCAGGTGACCAACAACATCCAGAAAAAGATAAAATTTATGCAAAGCTAGGAGAGTTGAGGTTGAAAGCCATTTCTCTGGATGA
TGTACCAGATTTGAGTTACGAGCTGTAA

Coding sequence (CDS)

ATGGTGGTACACATACGTGGTTGGCCACTGAAGGTGATAAGGAATTCAGCTTTAACAATAACACTTTCTGCTACCCAAAAAGCTATTGCAACCTCCGGTATCAAATTTCCCAATTCGGTTACAGTCCATAAAACTGATTCCCATCTTGAAATTCAGCCGCTTGTTGATCTTCTGCGTGATTGTGTGGATGCAACGTTTCTGAAACAAGCCAAAACTGTTCATGGGTTTTTGTTAAAATCAAAATTTTCAAACCACGATTCTCTGGTCTTGCTTAATCATGTTGCTCACGCTTATTCGAAATGCTCCGATATCGATGCTGCCTGTCGCCTGTTTGATCAAATGTCCCAGAGAAACATATTTTCATGGACTGTCATAATTGTTGGATTGGCTGAGAATGGTTTATTCCTCGATGGGTTTGAGCTCTTCTGTGAAATGCAGAGTCAGGGAATTTTCCCTGATCAGTTTGCTTATTCTGGTATATTGCAGATATGTATTGGTCTGGATTCCATTGAATTGGGAAAGATGGTACACGCCCAGATTGTTATTAGAGGCTTTGCATCTCATACTTTTGTATCTACTGCTCTTCTTAATATGTATGCAAAGTTACAAAAGATTGAGGATTCATACAATGTGTTTAACACCATGACTGAAGTTAATGTAGTCTCATGGAATGCTATGATCTCAGGGTTTACATCAAACGGTCTTTACTTAGATGCTTTTGATATTTTTCTCAGAATGAACGGGGAAGGAGTAACACCCGACGCACAAACATTTATTGGAGTTGCAAAAGCGATCGGTATGTTACGAGATGTGAACAAGGCAAAAGAAGTTAGCGGTTATGCTTCGGAGTTGGGATCTATTTTTGACTCACATTTCACAAATTCTGCTTTGAAGTGTTTGTCCTCGGGAAAGAAGGTTCATGCCAGGGCTATAAAATCAGGACTGGAAGTGAATTATATAAGTATTTCCAATGCAGTGGCTAATGCGTATGCTAAATGTGGATCGCTGGAGGATGTAAGGAAGGTCTTTTACAGGATGGAAGATAGAGATTTGGTATCTTGGACCAGCCTAGTGACTGCTTATTCTCAGTGTTCTGAATGGGATAAGGCAATAGAGATCTTCTCAAATATGAGAGAAGAAGGTTATGCACCCAATCAGTTCGCCTTTTCTAGCGTGCTTGTTTCATGTGCTAGCCTTTGCTTACTTGAGTATGGTCAGCAAGTCCACGGGTTCATCTGCAAGGTTGGCTTGGATATGGACAAATGCATTGAAAGTGCTCTGATTGACATGTATGCCAAATGTGGTTGTCTGGCTGAGGCGAAGAAGGTTTTCGACAGAATCTCTAATGCTGATACAGTGTCGTGGACTGCAATAATAGCTGGTCATGCTCAACACGGTATTGTCGATGATGCCCTTCAACTCTTTAGAAGGATGGAGCAGTTAGGTGTGGAGCCCAATGCTGTTACTTTTGTGTGTGTTCTATTTGCATGTAGCCACGGCGGTCTGGTAGAGGAAGGCCTACAGTACTTCAAGCTAATGAAGGAAAACTATGGTTTGGTGCCAGAGATGGAACATTATTCCTGTATCGTGGATCTCTTAAGTCGTGTGGGTCATCTAAATGATGCAATGGAATTTATTAGTAGGATGCCCGTAGAGCCCAATGAAATGGTTTGGCAGACCTTGCTGGGGGCTTGCAGGGTCCATGGTAATGTTGAATTGGGAGAGCTTGCTGCTCAGAAGATACTTTCTTTCAAAGCAGAAAACTCTGCCACCTATGTTCTTTTATCCAACACCTATATCGAATCAGGAAGTTACAAAGATGGACTTAGTTTGCGGCATGTGATGAAAGAGCAGGGGGTAAAAAAGGAACCAGGATGTAGTTGGATCTCTGTGAATGGTACATTGCACAAGTTTTATGCAGGTGACCAACAACATCCAGAAAAAGATAAAATTTATGCAAAGCTAGGAGAGTTGAGGTTGAAAGCCATTTCTCTGGATGA
TGTACCAGATTTGAGTTACGAGCTGTAA

Protein sequence

MVVHIRGWPLKVIRNSALTITLSATQKAIATSGIKFPNSVTVHKTDSHLEIQPLVDLLRDCVDATFLKQAKTVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNIFSWTVIIVGLAENGLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIELGKMVHAQIVIRGFASHTFVSTALLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFTSNGLYLDAFDIFLRMNGEGVTPDAQTFIGVAKAIGMLRDVNKAKEVSGYASELGSIFDSHFTNSALKCLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWTSLVTAYSQCSEWDKAIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQVHGFICKVGLDMDKCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFRRMEQLGVEPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLSRVGHLNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNTYIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLGELRLKAISLDDVPDLSYEL
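
The protein sequence above should correspond to a translation of the coding sequence. The sketch below shows one way to check that, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene. The helper name `translate` is hypothetical. Only short fragments copied from the start and end of the CDS are used here, to keep the example readable.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First 42 bases of the CDS listed above.
cds_start = "ATGGTGGTACACATACGTGGTTGGCCACTGAAGGTGATAAGG"
# Last 12 bases of the CDS listed above, including the stop codon.
cds_end = "TACGAGCTGTAA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first fourteen and the last three residues of the listed protein, which is consistent with the CDS and protein entries describing the same product.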
Homology
BLAST of HG10018388 vs. NCBI nr
Match: XP_038884632.1 (pentatricopeptide repeat-containing protein At3g16610-like [Benincasa hispida] >XP_038884633.1 pentatricopeptide repeat-containing protein At3g16610-like [Benincasa hispida])

HSP 1 Score: 1205.7 bits (3118), Expect = 0.0e+00
Identity = 621/737 (84.26%), Positives = 636/737 (86.30%), Query Frame = 0

Query: 13  IRNSALTITLSATQKAIATSGIKFPNSVTVHKTDSHLEIQPLVDLLRDCVDATFLKQAKT 72
           IRNSALTIT SATQKAIA S IK P+SVTVHKTDSHLEIQ LVDLLRDCVDA FLKQAKT
Sbjct: 27  IRNSALTITHSATQKAIANSAIKIPDSVTVHKTDSHLEIQQLVDLLRDCVDARFLKQAKT 86

Query: 73  VHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNIFSWTVIIVGLAEN 132
           VHGFLLKS+FSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNIFSWTVIIVGLAEN
Sbjct: 87  VHGFLLKSEFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNIFSWTVIIVGLAEN 146

Query: 133 GLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIELGKMVHAQIVIRGFASHTFVS 192
           GLFLDGFE FCEMQS GIFPDQFAYSGILQICIGLDS+ELGKMVHAQI IRGFASHTFVS
Sbjct: 147 GLFLDGFEFFCEMQSHGIFPDQFAYSGILQICIGLDSLELGKMVHAQIFIRGFASHTFVS 206

Query: 193 TALLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFTSNGLYLDAFDIFLRMNGEGVT 252
           TALLNMYAKLQ+IE+SY VFNTMTEVNVVSWNAMISGFTSNGLYLDAFD+FLRM  EGVT
Sbjct: 207 TALLNMYAKLQQIENSYKVFNTMTEVNVVSWNAMISGFTSNGLYLDAFDLFLRMKREGVT 266

Query: 253 PDAQTFIGVAKAIGMLRDVNKAKEVSGYASELG-------------------------SI 312
            DAQTFIGVAKAIGMLRDVNKAKEVS  ASELG                         SI
Sbjct: 267 LDAQTFIGVAKAIGMLRDVNKAKEVSCSASELGVDSNTFVGTALIDMHSKCGSLREARSI 326

Query: 313 FDSHFTN-------------------------------------------------SALK 372
           FDSHFTN                                                 +ALK
Sbjct: 327 FDSHFTNCRVNAPWNAMISGYLQSEFNEKALELFAKMCLNDIHLDHYTYCSVFNAIAALK 386

Query: 373 CLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWTSLV 432
           CLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCG LEDVRKVFYRMEDRDLVSWT+LV
Sbjct: 387 CLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGLLEDVRKVFYRMEDRDLVSWTTLV 446

Query: 433 TAYSQCSEWDKAIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQVHGFICKVGLD 492
           TAYSQCSEWDKAIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQVHGFI KVGLD
Sbjct: 447 TAYSQCSEWDKAIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQVHGFIYKVGLD 506

Query: 493 MDKCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFRRM 552
           MD CIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFRRM
Sbjct: 507 MDTCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFRRM 566

Query: 553 EQLGVEPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLSRVGH 612
            Q GVEPNAVTF+CVLFACSHGGLVEEGLQYFKLMKE Y LVPEMEHYSCIVD+LSRVGH
Sbjct: 567 VQSGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYSLVPEMEHYSCIVDILSRVGH 626

Query: 613 LNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNT 672
           LNDAMEFISRMPVEPNEMVWQTLLGACR+HGN+ELGELAAQKILS KAENSAT+VLLSNT
Sbjct: 627 LNDAMEFISRMPVEPNEMVWQTLLGACRIHGNIELGELAAQKILSSKAENSATFVLLSNT 686

Query: 673 YIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLGEL 676
           YIESGSYKDGLSLR VMKEQGVKKEPG SWIS+NGTLHKFYAGDQQHPEKDKIYAKL EL
Sbjct: 687 YIESGSYKDGLSLRLVMKEQGVKKEPGFSWISMNGTLHKFYAGDQQHPEKDKIYAKLEEL 746
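
Each HSP summary reports identity and positive counts as a fraction of the alignment length, followed by a percentage. The sketch below recomputes those percentages from the counts, assuming the summary line keeps the `Identity = a/b (x%), Positives = c/d (y%)` layout used in this report. The function name `parse_hsp_stats` is hypothetical, and the pattern also tolerates the misspelling "Postives", which can appear in exported reports.

```python
import re

HSP_PATTERN = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos(?:i)?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Extract counts from an HSP summary line and recompute percentages."""
    match = HSP_PATTERN.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len = int(match.group(1)), int(match.group(2))
    pos, pos_len = int(match.group(4)), int(match.group(5))
    return {
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
        "reported_identity_pct": float(match.group(3)),
        "reported_positives_pct": float(match.group(6)),
    }

stats = parse_hsp_stats(
    "Identity = 621/737 (84.26%), Positives = 636/737 (86.30%), Query Frame = 0"
)
print(stats)
```

For the Benincasa hispida hit, the recomputed values agree with the reported 84.26% identity and 86.30% positives.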

BLAST of HG10018388 vs. NCBI nr
Match: XP_011656423.1 (putative pentatricopeptide repeat-containing protein At3g25970 [Cucumis sativus])

HSP 1 Score: 1204.5 bits (3115), Expect = 0.0e+00
Identity = 614/737 (83.31%), Positives = 632/737 (85.75%), Query Frame = 0

Query: 13  IRNSALTITLSATQKAIATSGIKFPNSVTVHKTDSHLEIQPLVDLLRDCVDATFLKQAKT 72
           IRNSALTIT SA QK  ATSGIK PNSV V KTDSHL+IQPLVDLLRDCVDA FLKQAKT
Sbjct: 27  IRNSALTITHSAIQKPFATSGIKTPNSVKVDKTDSHLQIQPLVDLLRDCVDARFLKQAKT 86

Query: 73  VHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNIFSWTVIIVGLAEN 132
           VHGFLLKSKFSNH SLVLLNHVAHAYSKCSDIDAACRLFDQMSQRN FSWTV+I GLAEN
Sbjct: 87  VHGFLLKSKFSNHHSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNTFSWTVLIAGLAEN 146

Query: 133 GLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIELGKMVHAQIVIRGFASHTFVS 192
           GLFLDGFE FCEMQSQGIFPDQFAYSGILQICIGLDSIELG MVHAQIVIRGF SHTFVS
Sbjct: 147 GLFLDGFEFFCEMQSQGIFPDQFAYSGILQICIGLDSIELGNMVHAQIVIRGFTSHTFVS 206

Query: 193 TALLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFTSNGLYLDAFDIFLRMNGEGVT 252
           TALLNMYAKLQ+IEDSY VFNTMTEVNVVSWNAMI+GFTSN LYLDAFD+FLRM GEGVT
Sbjct: 207 TALLNMYAKLQEIEDSYKVFNTMTEVNVVSWNAMITGFTSNDLYLDAFDLFLRMMGEGVT 266

Query: 253 PDAQTFIGVAKAIGMLRDVNKAKEVSGYASELG-------------------------SI 312
           PDAQTFIGVAKAIGMLRDVNKAKEVSGYA ELG                         SI
Sbjct: 267 PDAQTFIGVAKAIGMLRDVNKAKEVSGYALELGVDSNTLVGTALIDMNSKCGSLQEARSI 326

Query: 313 FDSHFTN-------------------------------------------------SALK 372
           F+SHF                                                   +ALK
Sbjct: 327 FNSHFITCRFNAPWNAMISGYLRSGFNEKALELFAKMCQNDIYLDHYTYCSVFNAIAALK 386

Query: 373 CLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWTSLV 432
           CLS GKKVHARAIKSGLEVNY+SISNAVANAYAKCGSLEDVRKVF RMEDRDL+SWTSLV
Sbjct: 387 CLSLGKKVHARAIKSGLEVNYVSISNAVANAYAKCGSLEDVRKVFNRMEDRDLISWTSLV 446

Query: 433 TAYSQCSEWDKAIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQVHGFICKVGLD 492
           TAYSQCSEWDKAIEIFSNMR EG APNQF FSSVLVSCA+LCLLEYGQQVHG ICKVGLD
Sbjct: 447 TAYSQCSEWDKAIEIFSNMRAEGIAPNQFTFSSVLVSCANLCLLEYGQQVHGIICKVGLD 506

Query: 493 MDKCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFRRM 552
           MDKCIESAL+DMYAKCGCL +AKKVF+RISNADTVSWTAIIAGHAQHGIVDDALQLFRRM
Sbjct: 507 MDKCIESALVDMYAKCGCLGDAKKVFNRISNADTVSWTAIIAGHAQHGIVDDALQLFRRM 566

Query: 553 EQLGVEPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLSRVGH 612
            QLGVEPNAVTF+CVLFACSHGGLVEEGLQYFKLMK+ YGLVPEMEHY+CIVDLLSRVGH
Sbjct: 567 VQLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKKTYGLVPEMEHYACIVDLLSRVGH 626

Query: 613 LNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNT 672
           LNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNT
Sbjct: 627 LNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNT 686

Query: 673 YIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLGEL 676
           YIESGSYKDGLSLRH+MKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKL EL
Sbjct: 687 YIESGSYKDGLSLRHLMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEEL 746

BLAST of HG10018388 vs. NCBI nr
Match: KAA0052031.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1194.1 bits (3088), Expect = 0.0e+00
Identity = 611/739 (82.68%), Positives = 631/739 (85.39%), Query Frame = 0

Query: 11  KVIRNSALTITLSATQKAIATSGIKFPNSVTVHKTDSHLEIQPLVDLLRDCVDATFLKQA 70
           + IR SALTIT SATQKAIATSGIK PNSV V KTDSHLEIQPLVDLLR CVDA FLKQA
Sbjct: 25  RTIRTSALTITHSATQKAIATSGIKIPNSVKVDKTDSHLEIQPLVDLLRGCVDARFLKQA 84

Query: 71  KTVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNIFSWTVIIVGLA 130
           KTVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACR+FDQMSQRNIFSWT II GLA
Sbjct: 85  KTVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRVFDQMSQRNIFSWTAIIAGLA 144

Query: 131 ENGLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIELGKMVHAQIVIRGFASHTF 190
           ENGLFLDGFE FCEMQSQGIFPD FAYSGILQICIGLDS+ELGKMVHAQIVIRGF SHTF
Sbjct: 145 ENGLFLDGFEFFCEMQSQGIFPDHFAYSGILQICIGLDSVELGKMVHAQIVIRGFTSHTF 204

Query: 191 VSTALLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFTSNGLYLDAFDIFLRMNGEG 250
           VSTALLNMYAKLQ+IEDS  VFNTMTEVNVVSWNAMI+GFTSNG YLDAFD+FLRM GEG
Sbjct: 205 VSTALLNMYAKLQEIEDSCKVFNTMTEVNVVSWNAMITGFTSNGFYLDAFDLFLRMKGEG 264

Query: 251 VTPDAQTFIGVAKAIGMLRDVNKAKEVSGYASELG------------------------- 310
           VTPDAQTFIGVAKAIGMLRDVNKAKEVSGYA ELG                         
Sbjct: 265 VTPDAQTFIGVAKAIGMLRDVNKAKEVSGYALELGVDSNTLVGTALIDMHSKCGSLQEAR 324

Query: 311 SIFDSHFTN-------------------------------------------------SA 370
           SIF+SHF                                                   ++
Sbjct: 325 SIFNSHFVTCRFNAPWNAMISGYLQSGLNEKALELFAKMCQSDIHLDRYTYCSVFNAIAS 384

Query: 371 LKCLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWTS 430
           LKCL SGKKVHARAIKSGLEVN +SISNAVANAYAKCGSLEDVRKVF RMEDRDL+SWTS
Sbjct: 385 LKCLLSGKKVHARAIKSGLEVNCVSISNAVANAYAKCGSLEDVRKVFNRMEDRDLISWTS 444

Query: 431 LVTAYSQCSEWDKAIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQVHGFICKVG 490
           LVTAYSQCSEWDKAIEIFSNMR EGYAPNQFAFSSVLVSCA+LCLLEYGQQVHG ICKVG
Sbjct: 445 LVTAYSQCSEWDKAIEIFSNMRAEGYAPNQFAFSSVLVSCANLCLLEYGQQVHGIICKVG 504

Query: 491 LDMDKCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFR 550
           LDMDKCIESAL+DMYAKCGCLA+AKKVF+RISNADTVSWTAIIAGHAQHGIVDDALQLFR
Sbjct: 505 LDMDKCIESALVDMYAKCGCLADAKKVFNRISNADTVSWTAIIAGHAQHGIVDDALQLFR 564

Query: 551 RMEQLGVEPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLSRV 610
           RM  LGVEPNAVTF+CVLFACSHGGLVEEGLQYFKLMK+ YGLVPEMEHY+CIVDLLSRV
Sbjct: 565 RMVLLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKKTYGLVPEMEHYACIVDLLSRV 624

Query: 611 GHLNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLS 670
           G LNDAM FIS+MPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLS
Sbjct: 625 GRLNDAMGFISKMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLS 684

Query: 671 NTYIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLG 676
           NTYIESGSYKDGLSLRHVMKEQGVKKEPG SWISVNGTLHKFYAGDQQHPEKDKIYAKL 
Sbjct: 685 NTYIESGSYKDGLSLRHVMKEQGVKKEPGFSWISVNGTLHKFYAGDQQHPEKDKIYAKLE 744

BLAST of HG10018388 vs. NCBI nr
Match: TYK04529.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1193.7 bits (3087), Expect = 0.0e+00
Identity = 611/738 (82.79%), Positives = 631/738 (85.50%), Query Frame = 0

Query: 12  VIRNSALTITLSATQKAIATSGIKFPNSVTVHKTDSHLEIQPLVDLLRDCVDATFLKQAK 71
           +IR SALTIT SATQKAIATSGIK PNSV V KTDSHLEIQPLVDLLR CVDA FLKQAK
Sbjct: 43  LIRTSALTITHSATQKAIATSGIKIPNSVKVDKTDSHLEIQPLVDLLRGCVDARFLKQAK 102

Query: 72  TVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNIFSWTVIIVGLAE 131
           TVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACR+FDQMSQRNIFSWT II GLAE
Sbjct: 103 TVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRVFDQMSQRNIFSWTAIIAGLAE 162

Query: 132 NGLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIELGKMVHAQIVIRGFASHTFV 191
           NGLFLDGFE FCEMQSQGIFPD FAYSGILQICIGLDS+ELGKMVHAQIVIRGF SHTFV
Sbjct: 163 NGLFLDGFEFFCEMQSQGIFPDHFAYSGILQICIGLDSVELGKMVHAQIVIRGFTSHTFV 222

Query: 192 STALLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFTSNGLYLDAFDIFLRMNGEGV 251
           STALLNMYAKLQ+IEDS  VFNTMTEVNVVSWNAMI+GFTSNG YLDAFD+FLRM GEGV
Sbjct: 223 STALLNMYAKLQEIEDSCKVFNTMTEVNVVSWNAMITGFTSNGFYLDAFDLFLRMKGEGV 282

Query: 252 TPDAQTFIGVAKAIGMLRDVNKAKEVSGYASELG-------------------------S 311
           TPDAQTFIGVAKAIGMLRDVNKAKEVSGYA ELG                         S
Sbjct: 283 TPDAQTFIGVAKAIGMLRDVNKAKEVSGYALELGVDSNTLVGTALIDMHSKCGSLQEARS 342

Query: 312 IFDSHFTN-------------------------------------------------SAL 371
           IF+SHF                                                   ++L
Sbjct: 343 IFNSHFVTCRFNAPWNAMISGYLQSGLNEKALELFAKMCQSDIHLDRYTYCSVFNAIASL 402

Query: 372 KCLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWTSL 431
           KCL SGKKVHARAIKSGLEVN +SISNAVANAYAKCGSLEDVRKVF RMEDRDL+SWTSL
Sbjct: 403 KCLLSGKKVHARAIKSGLEVNCVSISNAVANAYAKCGSLEDVRKVFNRMEDRDLISWTSL 462

Query: 432 VTAYSQCSEWDKAIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQVHGFICKVGL 491
           VTAYSQCSEWDKAIEIFSNMR EGYAPNQFAFSSVLVSCA+LCLLEYGQQVHG ICKVGL
Sbjct: 463 VTAYSQCSEWDKAIEIFSNMRAEGYAPNQFAFSSVLVSCANLCLLEYGQQVHGIICKVGL 522

Query: 492 DMDKCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFRR 551
           DMDKCIESAL+DMYAKCGCLA+AKKVF+RISNADTVSWTAIIAGHAQHGIVDDALQLFRR
Sbjct: 523 DMDKCIESALVDMYAKCGCLADAKKVFNRISNADTVSWTAIIAGHAQHGIVDDALQLFRR 582

Query: 552 MEQLGVEPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLSRVG 611
           M  LGVEPNAVTF+CVLFACSHGGLVEEGLQYFKLMK+ YGLVPEMEHY+CIVDLLSRVG
Sbjct: 583 MVLLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKKTYGLVPEMEHYACIVDLLSRVG 642

Query: 612 HLNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSN 671
            LNDAM FIS+MPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSN
Sbjct: 643 RLNDAMGFISKMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSN 702

Query: 672 TYIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLGE 676
           TYIESGSYKDGLSLRHVMKEQGVKKEPG SWISVNGTLHKFYAGDQQHPEKDKIYAKL E
Sbjct: 703 TYIESGSYKDGLSLRHVMKEQGVKKEPGFSWISVNGTLHKFYAGDQQHPEKDKIYAKLEE 762

BLAST of HG10018388 vs. NCBI nr
Match: XP_016901974.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g27610-like [Cucumis melo])

HSP 1 Score: 1193.7 bits (3087), Expect = 0.0e+00
Identity = 611/737 (82.90%), Positives = 630/737 (85.48%), Query Frame = 0

Query: 13  IRNSALTITLSATQKAIATSGIKFPNSVTVHKTDSHLEIQPLVDLLRDCVDATFLKQAKT 72
           IR SALTIT SATQKAIATSGIK PNSV V KTDSHLEIQPLVDLLR CVDA FLKQAKT
Sbjct: 27  IRTSALTITHSATQKAIATSGIKIPNSVKVDKTDSHLEIQPLVDLLRGCVDARFLKQAKT 86

Query: 73  VHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNIFSWTVIIVGLAEN 132
           VHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACR+FDQMSQRNIFSWT II GLAEN
Sbjct: 87  VHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRVFDQMSQRNIFSWTAIIAGLAEN 146

Query: 133 GLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIELGKMVHAQIVIRGFASHTFVS 192
           GLFLDGFE FCEMQSQGIFPD FAYSGILQICIGLDS+ELGKMVHAQIVIRGF SHTFVS
Sbjct: 147 GLFLDGFEFFCEMQSQGIFPDHFAYSGILQICIGLDSVELGKMVHAQIVIRGFTSHTFVS 206

Query: 193 TALLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFTSNGLYLDAFDIFLRMNGEGVT 252
           TALLNMYAKLQ+IEDS  VFNTMTEVNVVSWNAMI+GFTSNG YLDAFD+FLRM GEGVT
Sbjct: 207 TALLNMYAKLQEIEDSCKVFNTMTEVNVVSWNAMITGFTSNGFYLDAFDLFLRMKGEGVT 266

Query: 253 PDAQTFIGVAKAIGMLRDVNKAKEVSGYASELG-------------------------SI 312
           PDAQTFIGVAKAIGMLRDVNKAKEVSGYA ELG                         SI
Sbjct: 267 PDAQTFIGVAKAIGMLRDVNKAKEVSGYALELGVDSNTLVGTALIDMHSKCGSLQEARSI 326

Query: 313 FDSHFTN-------------------------------------------------SALK 372
           F+SHF                                                   ++LK
Sbjct: 327 FNSHFVTCRFNAPWNAMISGYLQSGLNEKALELFAKMCQSDIHLDRYTYCSVFNAIASLK 386

Query: 373 CLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWTSLV 432
           CL SGKKVHARAIKSGLEVN +SISNAVANAYAKCGSLEDVRKVF RMEDRDL+SWTSLV
Sbjct: 387 CLLSGKKVHARAIKSGLEVNCVSISNAVANAYAKCGSLEDVRKVFNRMEDRDLISWTSLV 446

Query: 433 TAYSQCSEWDKAIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQVHGFICKVGLD 492
           TAYSQCSEWDKAIEIFSNMR EGYAPNQFAFSSVLVSCA+LCLLEYGQQVHG ICKVGLD
Sbjct: 447 TAYSQCSEWDKAIEIFSNMRAEGYAPNQFAFSSVLVSCANLCLLEYGQQVHGIICKVGLD 506

Query: 493 MDKCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFRRM 552
           MDKCIESAL+DMYAKCGCLA+AKKVF+RISNADTVSWTAIIAGHAQHGIVDDALQLFRRM
Sbjct: 507 MDKCIESALVDMYAKCGCLADAKKVFNRISNADTVSWTAIIAGHAQHGIVDDALQLFRRM 566

Query: 553 EQLGVEPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLSRVGH 612
             LGVEPNAVTF+CVLFACSHGGLVEEGLQYFKLMK+ YGLVPEMEHY+CIVDLLSRVG 
Sbjct: 567 VLLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKKTYGLVPEMEHYACIVDLLSRVGR 626

Query: 613 LNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNT 672
           LNDAM FIS+MPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNT
Sbjct: 627 LNDAMGFISKMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNT 686

Query: 673 YIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLGEL 676
           YIESGSYKDGLSLRHVMKEQGVKKEPG SWISVNGTLHKFYAGDQQHPEKDKIYAKL EL
Sbjct: 687 YIESGSYKDGLSLRHVMKEQGVKKEPGFSWISVNGTLHKFYAGDQQHPEKDKIYAKLEEL 746

BLAST of HG10018388 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 424.1 bits (1089), Expect = 3.0e-117
Identity = 222/652 (34.05%), Positives = 353/652 (54.14%), Query Frame = 0

Query: 47  SHLEIQPLVDLLRDCVDATFLK-QAKTVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDID 106
           S  +  P   LL  C+ +       + VH  ++KS FSN   + + N +  AYSKC  ++
Sbjct: 15  SFTDSSPFAKLLDSCIKSKLSAIYVRYVHASVIKSGFSN--EIFIQNRLIDAYSKCGSLE 74

Query: 107 AACRLFDQMSQRNIFSWTVIIVGLAENGLFLDGFELF----------------------- 166
              ++FD+M QRNI++W  ++ GL + G   +   LF                       
Sbjct: 75  DGRQVFDKMPQRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDR 134

Query: 167 CE--------MQSQGIFPDQFAYSGILQICIGLDSIELGKMVHAQIVIRGFASHTFVSTA 226
           CE        M  +G   ++++++ +L  C GL+ +  G  VH+ I    F S  ++ +A
Sbjct: 135 CEEALCYFAMMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSA 194

Query: 227 LLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFTSNGLYLDAFDIFLRMNGEGVTPD 286
           L++MY+K   + D+  VF+ M + NVVSWN++I+ F  NG  ++A D+F  M    V PD
Sbjct: 195 LVDMYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPD 254

Query: 287 AQTFIGVAKAIGMLRDVNKAKEVSGYASELGS-----IFDSHFTNSALKCLSSGKKVHAR 346
             T   V  A   L  +   +EV G   +        I  + F +   KC    +   AR
Sbjct: 255 EVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKC---SRIKEAR 314

Query: 347 AIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWTSLVTAYSQCSEWDK 406
            I   + +  +    ++ + YA   S +  R +F +M +R++VSW +L+  Y+Q  E ++
Sbjct: 315 FIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEE 374

Query: 407 AIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQV------HGFICKVGLDMDKCI 466
           A+ +F  ++ E   P  ++F+++L +CA L  L  G Q       HGF  + G + D  +
Sbjct: 375 ALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFV 434

Query: 467 ESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFRRMEQLGV 526
            ++LIDMY KCGC+ E   VF ++   D VSW A+I G AQ+G  ++AL+LFR M + G 
Sbjct: 435 GNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGE 494

Query: 527 EPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLSRVGHLNDAM 586
           +P+ +T + VL AC H G VEEG  YF  M  ++G+ P  +HY+C+VDLL R G L +A 
Sbjct: 495 KPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAK 554

Query: 587 EFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNTYIESG 646
             I  MP++P+ ++W +LL AC+VH N+ LG+  A+K+L  +  NS  YVLLSN Y E G
Sbjct: 555 SMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELG 614

Query: 647 SYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKL 656
            ++D +++R  M+++GV K+PGCSWI + G  H F   D+ HP K +I++ L
Sbjct: 615 KWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQIHSLL 661

BLAST of HG10018388 vs. ExPASy Swiss-Prot
Match: Q9SVP7 (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 419.1 bits (1076), Expect = 9.6e-116
Identity = 224/619 (36.19%), Positives = 343/619 (55.41%), Query Frame = 0

Query: 54  LVDLLRDCVDATFLKQAKTVHGFLLKSKFSNHDSL--VLLNHVAHAYSKCSDIDAACRLF 113
           L  L+  C     L + + +H +  K  F++++ +   LLN     Y+KC+DI+ A   F
Sbjct: 392 LASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALLN----LYAKCADIETALDYF 451

Query: 114 DQMSQRNIFSWTVIIVGLAENGLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIE 173
            +    N+  W V++V         + F +F +MQ + I P+Q+ Y  IL+ CI L  +E
Sbjct: 452 LETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLE 511

Query: 174 LGKMVHAQIVIRGFASHTFVSTALLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFT 233
           LG+ +H+QI+   F  + +V + L++MYAKL K++ ++++       +VVSW  MI+G+T
Sbjct: 512 LGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYT 571

Query: 234 SNGLYLDAFDIFLRMNGEGVTPDAQTFIGVAKAIGMLRDVNKAKEVSGYASELGSIFDSH 293
                  A   F +M   G+  D    +G+  A+                          
Sbjct: 572 QYNFDDKALTTFRQMLDRGIRSDE---VGLTNAVSAC----------------------- 631

Query: 294 FTNSALKCLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDL 353
              + L+ L  G+++HA+A  SG   + +   NA+   Y++CG +E+    F + E  D 
Sbjct: 632 ---AGLQALKEGQQIHAQACVSGFSSD-LPFQNALVTLYSRCGKIEESYLAFEQTEAGDN 691

Query: 354 VSWTSLVTAYSQCSEWDKAIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQVHGF 413
           ++W +LV+ + Q    ++A+ +F  M  EG   N F F S + + +    ++ G+QVH  
Sbjct: 692 IAWNALVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAV 751

Query: 414 ICKVGLDMDKCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDA 473
           I K G D +  + +ALI MYAKCG +++A+K F  +S  + VSW AII  +++HG   +A
Sbjct: 752 ITKTGYDSETEVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEA 811

Query: 474 LQLFRRMEQLGVEPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVD 533
           L  F +M    V PN VT V VL ACSH GLV++G+ YF+ M   YGL P+ EHY C+VD
Sbjct: 812 LDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVD 871

Query: 534 LLSRVGHLNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSAT 593
           +L+R G L+ A EFI  MP++P+ +VW+TLL AC VH N+E+GE AA  +L  + E+SAT
Sbjct: 872 MLTRAGLLSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSAT 931

Query: 594 YVLLSNTYIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKI 653
           YVLLSN Y  S  +      R  MKE+GVKKEPG SWI V  ++H FY GDQ HP  D+I
Sbjct: 932 YVLLSNLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEI 976

Query: 654 YAKLGELRLKAISLDDVPD 671
           +    +L  +A  +  V D
Sbjct: 992 HEYFQDLTKRASEIGYVQD 976

BLAST of HG10018388 vs. ExPASy Swiss-Prot
Match: Q9ZUW3 (Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H60 PE=2 SV=1)

HSP 1 Score: 413.7 bits (1062), Expect = 4.0e-114
Identity = 222/605 (36.69%), Positives = 341/605 (56.36%), Query Frame = 0

Query: 73  VHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNIFSWTVIIVGLAEN 132
           VH  ++K+      ++ + N + + Y KC ++  A  LFD+   +++ +W  +I G A N
Sbjct: 216 VHTVVVKNGLDK--TIPVSNSLINLYLKCGNVRKARILFDKTEVKSVVTWNSMISGYAAN 275

Query: 133 GLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIELGKMVHAQIVIRGFASHTFVS 192
           GL L+   +F  M+   +   + +++ ++++C  L  +   + +H  +V  GF     + 
Sbjct: 276 GLDLEALGMFYSMRLNYVRLSESSFASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIR 335

Query: 193 TALLNMYAKLQKIEDSYNVFNTMTEV-NVVSWNAMISGFTSNGLYLDAFDIFLRMNGEGV 252
           TAL+  Y+K   + D+  +F  +  V NVVSW AMISGF  N    +A D+F  M  +GV
Sbjct: 336 TALMVAYSKCTAMLDALRLFKEIGCVGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGV 395

Query: 253 TPDAQTFIGVAKAIGMLRDVNKAKEVSGYASELGSIFDSHFTNSALKCLSSGKKVHARAI 312
            P+  T+  +  A+ ++                                 S  +VHA+ +
Sbjct: 396 RPNEFTYSVILTALPVI---------------------------------SPSEVHAQVV 455

Query: 313 KSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWTSLVTAYSQCSEWDKAI 372
           K+  E    ++  A+ +AY K G +E+  KVF  ++D+D+V+W++++  Y+Q  E + AI
Sbjct: 456 KTNYE-RSSTVGTALLDAYVKLGKVEEAAKVFSGIDDKDIVAWSAMLAGYAQTGETEAAI 515

Query: 373 EIFSNMREEGYAPNQFAFSSVLVSCASL-CLLEYGQQVHGFICKVGLDMDKCIESALIDM 432
           ++F  + + G  PN+F FSS+L  CA+    +  G+Q HGF  K  LD   C+ SAL+ M
Sbjct: 516 KMFGELTKGGIKPNEFTFSSILNVCAATNASMGQGKQFHGFAIKSRLDSSLCVSSALLTM 575

Query: 433 YAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFRRMEQLGVEPNAVTF 492
           YAK G +  A++VF R    D VSW ++I+G+AQHG    AL +F+ M++  V+ + VTF
Sbjct: 576 YAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAMKALDVFKEMKKRKVKMDGVTF 635

Query: 493 VCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLSRVGHLNDAMEFISRMP 552
           + V  AC+H GLVEEG +YF +M  +  + P  EH SC+VDL SR G L  AM+ I  MP
Sbjct: 636 IGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCMVDLYSRAGQLEKAMKVIENMP 695

Query: 553 VEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNTYIESGSYKDGLS 612
                 +W+T+L ACRVH   ELG LAA+KI++ K E+SA YVLLSN Y ESG +++   
Sbjct: 696 NPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDSAAYVLLSNMYAESGDWQERAK 755

Query: 613 LRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLGELRLKAISLDDVPD 672
           +R +M E+ VKKEPG SWI V    + F AGD+ HP KD+IY KL +L  +   L   PD
Sbjct: 756 VRKLMNERNVKKEPGYSWIEVKNKTYSFLAGDRSHPLKDQIYMKLEDLSTRLKDLGYEPD 784

Query: 673 LSYEL 676
            SY L
Sbjct: 816 TSYVL 784

BLAST of HG10018388 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 406.0 bits (1042), Expect = 8.4e-112
Identity = 214/621 (34.46%), Positives = 345/621 (55.56%), Query Frame = 0

Query: 61  CVDATF-----LKQAKTVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMS 120
           CV  +F     +   + +HGF+LKS F   +S+   N +   Y K   +D+A ++FD+M+
Sbjct: 200 CVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVG--NSLVAFYLKNQRVDSARKVFDEMT 259

Query: 121 QRNIFSWTVIIVGLAENGLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIELGKM 180
           +R++ SW  II G   NGL   G  +F +M   GI  D      +   C     I LG+ 
Sbjct: 260 ERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRA 319

Query: 181 VHAQIVIRGFASHTFVSTALLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFTSNGL 240
           VH+  V   F+        LL+MY+K   ++ +  VF  M++ +VVS+ +MI+G+   GL
Sbjct: 320 VHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGL 379

Query: 241 YLDAFDIFLRMNGEGVTPDAQTFIGVAKAIGMLRDVNKAKEVSGYASELGSIFDSHFTNS 300
             +A  +F  M  EG++PD  T   V       R +++ K V  +  E    FD      
Sbjct: 380 AGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFD------ 439

Query: 301 ALKCLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWT 360
                                   I +SNA+ + YAKCGS+++   VF  M  +D++SW 
Sbjct: 440 ------------------------IFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWN 499

Query: 361 SLVTAYSQCSEWDKAIEIFSNMREE-GYAPNQFAFSSVLVSCASLCLLEYGQQVHGFICK 420
           +++  YS+    ++A+ +F+ + EE  ++P++   + VL +CASL   + G+++HG+I +
Sbjct: 500 TIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMR 559

Query: 421 VGLDMDKCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQL 480
            G   D+ + ++L+DMYAKCG L  A  +FD I++ D VSWT +IAG+  HG   +A+ L
Sbjct: 560 NGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIAL 619

Query: 481 FRRMEQLGVEPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLS 540
           F +M Q G+E + ++FV +L+ACSH GLV+EG ++F +M+    + P +EHY+CIVD+L+
Sbjct: 620 FNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLA 679

Query: 541 RVGHLNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVL 600
           R G L  A  FI  MP+ P+  +W  LL  CR+H +V+L E  A+K+   + EN+  YVL
Sbjct: 680 RTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVL 739

Query: 601 LSNTYIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAK 660
           ++N Y E+  ++    LR  + ++G++K PGCSWI + G ++ F AGD  +PE + I A 
Sbjct: 740 MANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAF 788

Query: 661 LGELRLKAISLDDVPDLSYEL 676
           L ++R + I     P   Y L
Sbjct: 800 LRKVRARMIEEGYSPLTKYAL 788

BLAST of HG10018388 vs. ExPASy Swiss-Prot
Match: Q9STE1 (Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E36 PE=3 SV=1)

HSP 1 Score: 400.2 bits (1027), Expect = 4.6e-110
Identity = 215/610 (35.25%), Positives = 331/610 (54.26%), Query Frame = 0

Query: 57  LLRDCVDATFLKQAKTVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMSQ 116
           +L  C     +     +HG ++ S      S  + N +   YSKC   D A +LF  MS+
Sbjct: 245 VLSVCASKLLIDLGVQLHGLVVVSGVDFEGS--IKNSLLSMYSKCGRFDDASKLFRMMSR 304

Query: 117 RNIFSWTVIIVGLAENGLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIELGKMV 176
            +  +W  +I G  ++GL  +    F EM S G+ PD   +S +L      +++E  K +
Sbjct: 305 ADTVTWNCMISGYVQSGLMEESLTFFYEMISSGVLPDAITFSSLLPSVSKFENLEYCKQI 364

Query: 177 HAQIVIRGFASHTFVSTALLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFTSNGLY 236
           H  I+    +   F+++AL++ Y K + +  + N+F+    V+VV + AMISG+  NGLY
Sbjct: 365 HCYIMRHSISLDIFLTSALIDAYFKCRGVSMAQNIFSQCNSVDVVVFTAMISGYLHNGLY 424

Query: 237 LDAFDIFLRMNGEGVTPDAQTFIGVAKAIGMLRDVNKAKEVSGYASELGSIFDSHFTNSA 296
           +D+ ++F  +    ++P+  T + +   IG+                             
Sbjct: 425 IDSLEMFRWLVKVKISPNEITLVSILPVIGI----------------------------- 484

Query: 297 LKCLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWTS 356
           L  L  G+++H   IK G + N  +I  AV + YAKCG +    ++F R+  RD+VSW S
Sbjct: 485 LLALKLGRELHGFIIKKGFD-NRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNS 544

Query: 357 LVTAYSQCSEWDKAIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQVHGFICKVG 416
           ++T  +Q      AI+IF  M   G   +  + S+ L +CA+L    +G+ +HGF+ K  
Sbjct: 545 MITRCAQSDNPSAAIDIFRQMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKHS 604

Query: 417 LDMDKCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFR 476
           L  D   ES LIDMYAKCG L  A  VF  +   + VSW +IIA    HG + D+L LF 
Sbjct: 605 LASDVYSESTLIDMYAKCGNLKAAMNVFKTMKEKNIVSWNSIIAACGNHGKLKDSLCLFH 664

Query: 477 RM-EQLGVEPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLSR 536
            M E+ G+ P+ +TF+ ++ +C H G V+EG+++F+ M E+YG+ P+ EHY+C+VDL  R
Sbjct: 665 EMVEKSGIRPDQITFLEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLFGR 724

Query: 537 VGHLNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLL 596
            G L +A E +  MP  P+  VW TLLGACR+H NVEL E+A+ K++     NS  YVL+
Sbjct: 725 AGRLTEAYETVKSMPFPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYYVLI 784

Query: 597 SNTYIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIY--- 656
           SN +  +  ++    +R +MKE+ V+K PG SWI +N   H F +GD  HPE   IY   
Sbjct: 785 SNAHANAREWESVTKVRSLMKEREVQKIPGYSWIEINKRTHLFVSGDVNHPESSHIYSLL 822

Query: 657 -AKLGELRLK 662
            + LGELRL+
Sbjct: 845 NSLLGELRLE 822

BLAST of HG10018388 vs. ExPASy TrEMBL
Match: A0A0A0KBQ4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G013890 PE=4 SV=1)

HSP 1 Score: 1204.5 bits (3115), Expect = 0.0e+00
Identity = 614/737 (83.31%), Positives = 632/737 (85.75%), Query Frame = 0

Query: 13  IRNSALTITLSATQKAIATSGIKFPNSVTVHKTDSHLEIQPLVDLLRDCVDATFLKQAKT 72
           IRNSALTIT SA QK  ATSGIK PNSV V KTDSHL+IQPLVDLLRDCVDA FLKQAKT
Sbjct: 17  IRNSALTITHSAIQKPFATSGIKTPNSVKVDKTDSHLQIQPLVDLLRDCVDARFLKQAKT 76

Query: 73  VHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNIFSWTVIIVGLAEN 132
           VHGFLLKSKFSNH SLVLLNHVAHAYSKCSDIDAACRLFDQMSQRN FSWTV+I GLAEN
Sbjct: 77  VHGFLLKSKFSNHHSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNTFSWTVLIAGLAEN 136

Query: 133 GLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIELGKMVHAQIVIRGFASHTFVS 192
           GLFLDGFE FCEMQSQGIFPDQFAYSGILQICIGLDSIELG MVHAQIVIRGF SHTFVS
Sbjct: 137 GLFLDGFEFFCEMQSQGIFPDQFAYSGILQICIGLDSIELGNMVHAQIVIRGFTSHTFVS 196

Query: 193 TALLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFTSNGLYLDAFDIFLRMNGEGVT 252
           TALLNMYAKLQ+IEDSY VFNTMTEVNVVSWNAMI+GFTSN LYLDAFD+FLRM GEGVT
Sbjct: 197 TALLNMYAKLQEIEDSYKVFNTMTEVNVVSWNAMITGFTSNDLYLDAFDLFLRMMGEGVT 256

Query: 253 PDAQTFIGVAKAIGMLRDVNKAKEVSGYASELG-------------------------SI 312
           PDAQTFIGVAKAIGMLRDVNKAKEVSGYA ELG                         SI
Sbjct: 257 PDAQTFIGVAKAIGMLRDVNKAKEVSGYALELGVDSNTLVGTALIDMNSKCGSLQEARSI 316

Query: 313 FDSHFTN-------------------------------------------------SALK 372
           F+SHF                                                   +ALK
Sbjct: 317 FNSHFITCRFNAPWNAMISGYLRSGFNEKALELFAKMCQNDIYLDHYTYCSVFNAIAALK 376

Query: 373 CLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWTSLV 432
           CLS GKKVHARAIKSGLEVNY+SISNAVANAYAKCGSLEDVRKVF RMEDRDL+SWTSLV
Sbjct: 377 CLSLGKKVHARAIKSGLEVNYVSISNAVANAYAKCGSLEDVRKVFNRMEDRDLISWTSLV 436

Query: 433 TAYSQCSEWDKAIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQVHGFICKVGLD 492
           TAYSQCSEWDKAIEIFSNMR EG APNQF FSSVLVSCA+LCLLEYGQQVHG ICKVGLD
Sbjct: 437 TAYSQCSEWDKAIEIFSNMRAEGIAPNQFTFSSVLVSCANLCLLEYGQQVHGIICKVGLD 496

Query: 493 MDKCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFRRM 552
           MDKCIESAL+DMYAKCGCL +AKKVF+RISNADTVSWTAIIAGHAQHGIVDDALQLFRRM
Sbjct: 497 MDKCIESALVDMYAKCGCLGDAKKVFNRISNADTVSWTAIIAGHAQHGIVDDALQLFRRM 556

Query: 553 EQLGVEPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLSRVGH 612
            QLGVEPNAVTF+CVLFACSHGGLVEEGLQYFKLMK+ YGLVPEMEHY+CIVDLLSRVGH
Sbjct: 557 VQLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKKTYGLVPEMEHYACIVDLLSRVGH 616

Query: 613 LNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNT 672
           LNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNT
Sbjct: 617 LNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNT 676

Query: 673 YIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLGEL 676
           YIESGSYKDGLSLRH+MKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKL EL
Sbjct: 677 YIESGSYKDGLSLRHLMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEEL 736

BLAST of HG10018388 vs. ExPASy TrEMBL
Match: A0A5A7UEQ9 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold60G005280 PE=4 SV=1)

HSP 1 Score: 1194.1 bits (3088), Expect = 0.0e+00
Identity = 611/739 (82.68%), Positives = 631/739 (85.39%), Query Frame = 0

Query: 11  KVIRNSALTITLSATQKAIATSGIKFPNSVTVHKTDSHLEIQPLVDLLRDCVDATFLKQA 70
           + IR SALTIT SATQKAIATSGIK PNSV V KTDSHLEIQPLVDLLR CVDA FLKQA
Sbjct: 25  RTIRTSALTITHSATQKAIATSGIKIPNSVKVDKTDSHLEIQPLVDLLRGCVDARFLKQA 84

Query: 71  KTVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNIFSWTVIIVGLA 130
           KTVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACR+FDQMSQRNIFSWT II GLA
Sbjct: 85  KTVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRVFDQMSQRNIFSWTAIIAGLA 144

Query: 131 ENGLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIELGKMVHAQIVIRGFASHTF 190
           ENGLFLDGFE FCEMQSQGIFPD FAYSGILQICIGLDS+ELGKMVHAQIVIRGF SHTF
Sbjct: 145 ENGLFLDGFEFFCEMQSQGIFPDHFAYSGILQICIGLDSVELGKMVHAQIVIRGFTSHTF 204

Query: 191 VSTALLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFTSNGLYLDAFDIFLRMNGEG 250
           VSTALLNMYAKLQ+IEDS  VFNTMTEVNVVSWNAMI+GFTSNG YLDAFD+FLRM GEG
Sbjct: 205 VSTALLNMYAKLQEIEDSCKVFNTMTEVNVVSWNAMITGFTSNGFYLDAFDLFLRMKGEG 264

Query: 251 VTPDAQTFIGVAKAIGMLRDVNKAKEVSGYASELG------------------------- 310
           VTPDAQTFIGVAKAIGMLRDVNKAKEVSGYA ELG                         
Sbjct: 265 VTPDAQTFIGVAKAIGMLRDVNKAKEVSGYALELGVDSNTLVGTALIDMHSKCGSLQEAR 324

Query: 311 SIFDSHFTN-------------------------------------------------SA 370
           SIF+SHF                                                   ++
Sbjct: 325 SIFNSHFVTCRFNAPWNAMISGYLQSGLNEKALELFAKMCQSDIHLDRYTYCSVFNAIAS 384

Query: 371 LKCLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWTS 430
           LKCL SGKKVHARAIKSGLEVN +SISNAVANAYAKCGSLEDVRKVF RMEDRDL+SWTS
Sbjct: 385 LKCLLSGKKVHARAIKSGLEVNCVSISNAVANAYAKCGSLEDVRKVFNRMEDRDLISWTS 444

Query: 431 LVTAYSQCSEWDKAIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQVHGFICKVG 490
           LVTAYSQCSEWDKAIEIFSNMR EGYAPNQFAFSSVLVSCA+LCLLEYGQQVHG ICKVG
Sbjct: 445 LVTAYSQCSEWDKAIEIFSNMRAEGYAPNQFAFSSVLVSCANLCLLEYGQQVHGIICKVG 504

Query: 491 LDMDKCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFR 550
           LDMDKCIESAL+DMYAKCGCLA+AKKVF+RISNADTVSWTAIIAGHAQHGIVDDALQLFR
Sbjct: 505 LDMDKCIESALVDMYAKCGCLADAKKVFNRISNADTVSWTAIIAGHAQHGIVDDALQLFR 564

Query: 551 RMEQLGVEPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLSRV 610
           RM  LGVEPNAVTF+CVLFACSHGGLVEEGLQYFKLMK+ YGLVPEMEHY+CIVDLLSRV
Sbjct: 565 RMVLLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKKTYGLVPEMEHYACIVDLLSRV 624

Query: 611 GHLNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLS 670
           G LNDAM FIS+MPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLS
Sbjct: 625 GRLNDAMGFISKMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLS 684

Query: 671 NTYIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLG 676
           NTYIESGSYKDGLSLRHVMKEQGVKKEPG SWISVNGTLHKFYAGDQQHPEKDKIYAKL 
Sbjct: 685 NTYIESGSYKDGLSLRHVMKEQGVKKEPGFSWISVNGTLHKFYAGDQQHPEKDKIYAKLE 744

BLAST of HG10018388 vs. ExPASy TrEMBL
Match: A0A5D3C2B1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold409G001490 PE=4 SV=1)

HSP 1 Score: 1193.7 bits (3087), Expect = 0.0e+00
Identity = 611/738 (82.79%), Positives = 631/738 (85.50%), Query Frame = 0

Query: 12  VIRNSALTITLSATQKAIATSGIKFPNSVTVHKTDSHLEIQPLVDLLRDCVDATFLKQAK 71
           +IR SALTIT SATQKAIATSGIK PNSV V KTDSHLEIQPLVDLLR CVDA FLKQAK
Sbjct: 43  LIRTSALTITHSATQKAIATSGIKIPNSVKVDKTDSHLEIQPLVDLLRGCVDARFLKQAK 102

Query: 72  TVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNIFSWTVIIVGLAE 131
           TVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACR+FDQMSQRNIFSWT II GLAE
Sbjct: 103 TVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRVFDQMSQRNIFSWTAIIAGLAE 162

Query: 132 NGLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIELGKMVHAQIVIRGFASHTFV 191
           NGLFLDGFE FCEMQSQGIFPD FAYSGILQICIGLDS+ELGKMVHAQIVIRGF SHTFV
Sbjct: 163 NGLFLDGFEFFCEMQSQGIFPDHFAYSGILQICIGLDSVELGKMVHAQIVIRGFTSHTFV 222

Query: 192 STALLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFTSNGLYLDAFDIFLRMNGEGV 251
           STALLNMYAKLQ+IEDS  VFNTMTEVNVVSWNAMI+GFTSNG YLDAFD+FLRM GEGV
Sbjct: 223 STALLNMYAKLQEIEDSCKVFNTMTEVNVVSWNAMITGFTSNGFYLDAFDLFLRMKGEGV 282

Query: 252 TPDAQTFIGVAKAIGMLRDVNKAKEVSGYASELG-------------------------S 311
           TPDAQTFIGVAKAIGMLRDVNKAKEVSGYA ELG                         S
Sbjct: 283 TPDAQTFIGVAKAIGMLRDVNKAKEVSGYALELGVDSNTLVGTALIDMHSKCGSLQEARS 342

Query: 312 IFDSHFTN-------------------------------------------------SAL 371
           IF+SHF                                                   ++L
Sbjct: 343 IFNSHFVTCRFNAPWNAMISGYLQSGLNEKALELFAKMCQSDIHLDRYTYCSVFNAIASL 402

Query: 372 KCLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWTSL 431
           KCL SGKKVHARAIKSGLEVN +SISNAVANAYAKCGSLEDVRKVF RMEDRDL+SWTSL
Sbjct: 403 KCLLSGKKVHARAIKSGLEVNCVSISNAVANAYAKCGSLEDVRKVFNRMEDRDLISWTSL 462

Query: 432 VTAYSQCSEWDKAIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQVHGFICKVGL 491
           VTAYSQCSEWDKAIEIFSNMR EGYAPNQFAFSSVLVSCA+LCLLEYGQQVHG ICKVGL
Sbjct: 463 VTAYSQCSEWDKAIEIFSNMRAEGYAPNQFAFSSVLVSCANLCLLEYGQQVHGIICKVGL 522

Query: 492 DMDKCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFRR 551
           DMDKCIESAL+DMYAKCGCLA+AKKVF+RISNADTVSWTAIIAGHAQHGIVDDALQLFRR
Sbjct: 523 DMDKCIESALVDMYAKCGCLADAKKVFNRISNADTVSWTAIIAGHAQHGIVDDALQLFRR 582

Query: 552 MEQLGVEPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLSRVG 611
           M  LGVEPNAVTF+CVLFACSHGGLVEEGLQYFKLMK+ YGLVPEMEHY+CIVDLLSRVG
Sbjct: 583 MVLLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKKTYGLVPEMEHYACIVDLLSRVG 642

Query: 612 HLNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSN 671
            LNDAM FIS+MPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSN
Sbjct: 643 RLNDAMGFISKMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSN 702

Query: 672 TYIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLGE 676
           TYIESGSYKDGLSLRHVMKEQGVKKEPG SWISVNGTLHKFYAGDQQHPEKDKIYAKL E
Sbjct: 703 TYIESGSYKDGLSLRHVMKEQGVKKEPGFSWISVNGTLHKFYAGDQQHPEKDKIYAKLEE 762

BLAST of HG10018388 vs. ExPASy TrEMBL
Match: A0A1S4E171 (pentatricopeptide repeat-containing protein At2g27610-like OS=Cucumis melo OX=3656 GN=LOC103496600 PE=4 SV=1)

HSP 1 Score: 1193.7 bits (3087), Expect = 0.0e+00
Identity = 611/737 (82.90%), Positives = 630/737 (85.48%), Query Frame = 0

Query: 13  IRNSALTITLSATQKAIATSGIKFPNSVTVHKTDSHLEIQPLVDLLRDCVDATFLKQAKT 72
           IR SALTIT SATQKAIATSGIK PNSV V KTDSHLEIQPLVDLLR CVDA FLKQAKT
Sbjct: 27  IRTSALTITHSATQKAIATSGIKIPNSVKVDKTDSHLEIQPLVDLLRGCVDARFLKQAKT 86

Query: 73  VHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNIFSWTVIIVGLAEN 132
           VHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACR+FDQMSQRNIFSWT II GLAEN
Sbjct: 87  VHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRVFDQMSQRNIFSWTAIIAGLAEN 146

Query: 133 GLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIELGKMVHAQIVIRGFASHTFVS 192
           GLFLDGFE FCEMQSQGIFPD FAYSGILQICIGLDS+ELGKMVHAQIVIRGF SHTFVS
Sbjct: 147 GLFLDGFEFFCEMQSQGIFPDHFAYSGILQICIGLDSVELGKMVHAQIVIRGFTSHTFVS 206

Query: 193 TALLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFTSNGLYLDAFDIFLRMNGEGVT 252
           TALLNMYAKLQ+IEDS  VFNTMTEVNVVSWNAMI+GFTSNG YLDAFD+FLRM GEGVT
Sbjct: 207 TALLNMYAKLQEIEDSCKVFNTMTEVNVVSWNAMITGFTSNGFYLDAFDLFLRMKGEGVT 266

Query: 253 PDAQTFIGVAKAIGMLRDVNKAKEVSGYASELG-------------------------SI 312
           PDAQTFIGVAKAIGMLRDVNKAKEVSGYA ELG                         SI
Sbjct: 267 PDAQTFIGVAKAIGMLRDVNKAKEVSGYALELGVDSNTLVGTALIDMHSKCGSLQEARSI 326

Query: 313 FDSHFTN-------------------------------------------------SALK 372
           F+SHF                                                   ++LK
Sbjct: 327 FNSHFVTCRFNAPWNAMISGYLQSGLNEKALELFAKMCQSDIHLDRYTYCSVFNAIASLK 386

Query: 373 CLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWTSLV 432
           CL SGKKVHARAIKSGLEVN +SISNAVANAYAKCGSLEDVRKVF RMEDRDL+SWTSLV
Sbjct: 387 CLLSGKKVHARAIKSGLEVNCVSISNAVANAYAKCGSLEDVRKVFNRMEDRDLISWTSLV 446

Query: 433 TAYSQCSEWDKAIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQVHGFICKVGLD 492
           TAYSQCSEWDKAIEIFSNMR EGYAPNQFAFSSVLVSCA+LCLLEYGQQVHG ICKVGLD
Sbjct: 447 TAYSQCSEWDKAIEIFSNMRAEGYAPNQFAFSSVLVSCANLCLLEYGQQVHGIICKVGLD 506

Query: 493 MDKCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFRRM 552
           MDKCIESAL+DMYAKCGCLA+AKKVF+RISNADTVSWTAIIAGHAQHGIVDDALQLFRRM
Sbjct: 507 MDKCIESALVDMYAKCGCLADAKKVFNRISNADTVSWTAIIAGHAQHGIVDDALQLFRRM 566

Query: 553 EQLGVEPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLSRVGH 612
             LGVEPNAVTF+CVLFACSHGGLVEEGLQYFKLMK+ YGLVPEMEHY+CIVDLLSRVG 
Sbjct: 567 VLLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKKTYGLVPEMEHYACIVDLLSRVGR 626

Query: 613 LNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNT 672
           LNDAM FIS+MPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNT
Sbjct: 627 LNDAMGFISKMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNT 686

Query: 673 YIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLGEL 676
           YIESGSYKDGLSLRHVMKEQGVKKEPG SWISVNGTLHKFYAGDQQHPEKDKIYAKL EL
Sbjct: 687 YIESGSYKDGLSLRHVMKEQGVKKEPGFSWISVNGTLHKFYAGDQQHPEKDKIYAKLEEL 746

BLAST of HG10018388 vs. ExPASy TrEMBL
Match: A0A6J1FGJ6 (pentatricopeptide repeat-containing protein At3g16610-like OS=Cucurbita moschata OX=3662 GN=LOC111445205 PE=4 SV=1)

HSP 1 Score: 1183.3 bits (3060), Expect = 0.0e+00
Identity = 605/737 (82.09%), Positives = 631/737 (85.62%), Query Frame = 0

Query: 13  IRNSALTITLSATQKAIATSGIKFPNSVTVHKTDSHLEIQPLVDLLRDCVDATFLKQAKT 72
           IRNS+LT T SATQKAI TSGIK PNSV+V K++S LEIQPLVDLLR CVDA FLKQAKT
Sbjct: 27  IRNSSLTTTHSATQKAITTSGIKIPNSVSVDKSNSRLEIQPLVDLLRGCVDARFLKQAKT 86

Query: 73  VHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNIFSWTVIIVGLAEN 132
           VHGFLLKSKFSNHDSLVLLNHVA AYSKCSDIDAACRLFD+MSQRNIFSWTVII GLA+N
Sbjct: 87  VHGFLLKSKFSNHDSLVLLNHVADAYSKCSDIDAACRLFDKMSQRNIFSWTVIIAGLAKN 146

Query: 133 GLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIELGKMVHAQIVIRGFASHTFVS 192
           GLF DGFE FCEMQSQ IFPDQFAYSG+LQICIGL+SIELGKMVHAQIVIRGFASHTFVS
Sbjct: 147 GLFHDGFEFFCEMQSQDIFPDQFAYSGVLQICIGLESIELGKMVHAQIVIRGFASHTFVS 206

Query: 193 TALLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFTSNGLYLDAFDIFLRMNGEGVT 252
           TALLNMYAKLQKI+DSY VFNTMTEVNVVSWNAMISGFTSNGLY DAFD FLRM GEGVT
Sbjct: 207 TALLNMYAKLQKIDDSYEVFNTMTEVNVVSWNAMISGFTSNGLYSDAFDHFLRMKGEGVT 266

Query: 253 PDAQTFIGVAKAIGMLRDVNKAKEVSGYASELG-------------------------SI 312
           PDAQTFI +AKAIGMLRDVNKAKE+S YASELG                         SI
Sbjct: 267 PDAQTFISIAKAIGMLRDVNKAKEISRYASELGMDSNPLVGTALIDMHSKCGSLQEARSI 326

Query: 313 FDSHFTN-------------------------------------------------SALK 372
           FDSHFTN                                                 +ALK
Sbjct: 327 FDSHFTNCRVNGPWNAMISGYLQSEFNEKALELFAKMCLNNVHLDRYTYCSVFNAIAALK 386

Query: 373 CLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWTSLV 432
           CLS GKKVHARAIKSGLEVN ISISNAVANAYAKCGSLED+RKVFY ME+RDLVSWT+LV
Sbjct: 387 CLSLGKKVHARAIKSGLEVNNISISNAVANAYAKCGSLEDLRKVFYSMEERDLVSWTTLV 446

Query: 433 TAYSQCSEWDKAIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQVHGFICKVGLD 492
           TAYSQCSEWDKAIEIFSNMREEG+APNQFAFSSVL+SCASLCLLEYGQQVHGFI KVGLD
Sbjct: 447 TAYSQCSEWDKAIEIFSNMREEGFAPNQFAFSSVLISCASLCLLEYGQQVHGFIGKVGLD 506

Query: 493 MDKCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFRRM 552
           MDKCI+SALIDMYAKCG LAEAKKVFD+IS+ADT+SWTAIIAGHAQHG+VDDALQLFRRM
Sbjct: 507 MDKCIQSALIDMYAKCGSLAEAKKVFDKISDADTISWTAIIAGHAQHGMVDDALQLFRRM 566

Query: 553 EQLGVEPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLSRVGH 612
           EQLGVEPNAVTF+CVLFACSHGGLVEEGLQYFKLMKE YGLVP MEHYSCIVDLLSRVG 
Sbjct: 567 EQLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKETYGLVPGMEHYSCIVDLLSRVGR 626

Query: 613 LNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNT 672
           LNDAMEFIS+MP+EPNEMVWQTLLGACRVHGNVELGELAA+KI SFKAENSATYVLLSNT
Sbjct: 627 LNDAMEFISKMPIEPNEMVWQTLLGACRVHGNVELGELAARKIRSFKAENSATYVLLSNT 686

Query: 673 YIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLGEL 676
           YIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKL EL
Sbjct: 687 YIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEEL 746

BLAST of HG10018388 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 424.1 bits (1089), Expect = 2.1e-118
Identity = 222/652 (34.05%), Positives = 353/652 (54.14%), Query Frame = 0

Query: 47  SHLEIQPLVDLLRDCVDATFLK-QAKTVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDID 106
           S  +  P   LL  C+ +       + VH  ++KS FSN   + + N +  AYSKC  ++
Sbjct: 15  SFTDSSPFAKLLDSCIKSKLSAIYVRYVHASVIKSGFSN--EIFIQNRLIDAYSKCGSLE 74

Query: 107 AACRLFDQMSQRNIFSWTVIIVGLAENGLFLDGFELF----------------------- 166
              ++FD+M QRNI++W  ++ GL + G   +   LF                       
Sbjct: 75  DGRQVFDKMPQRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDR 134

Query: 167 CE--------MQSQGIFPDQFAYSGILQICIGLDSIELGKMVHAQIVIRGFASHTFVSTA 226
           CE        M  +G   ++++++ +L  C GL+ +  G  VH+ I    F S  ++ +A
Sbjct: 135 CEEALCYFAMMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSA 194

Query: 227 LLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFTSNGLYLDAFDIFLRMNGEGVTPD 286
           L++MY+K   + D+  VF+ M + NVVSWN++I+ F  NG  ++A D+F  M    V PD
Sbjct: 195 LVDMYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPD 254

Query: 287 AQTFIGVAKAIGMLRDVNKAKEVSGYASELGS-----IFDSHFTNSALKCLSSGKKVHAR 346
             T   V  A   L  +   +EV G   +        I  + F +   KC    +   AR
Sbjct: 255 EVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKC---SRIKEAR 314

Query: 347 AIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWTSLVTAYSQCSEWDK 406
            I   + +  +    ++ + YA   S +  R +F +M +R++VSW +L+  Y+Q  E ++
Sbjct: 315 FIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEE 374

Query: 407 AIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQV------HGFICKVGLDMDKCI 466
           A+ +F  ++ E   P  ++F+++L +CA L  L  G Q       HGF  + G + D  +
Sbjct: 375 ALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFV 434

Query: 467 ESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFRRMEQLGV 526
            ++LIDMY KCGC+ E   VF ++   D VSW A+I G AQ+G  ++AL+LFR M + G 
Sbjct: 435 GNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGE 494

Query: 527 EPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLSRVGHLNDAM 586
           +P+ +T + VL AC H G VEEG  YF  M  ++G+ P  +HY+C+VDLL R G L +A 
Sbjct: 495 KPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAK 554

Query: 587 EFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNTYIESG 646
             I  MP++P+ ++W +LL AC+VH N+ LG+  A+K+L  +  NS  YVLLSN Y E G
Sbjct: 555 SMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELG 614

Query: 647 SYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKL 656
            ++D +++R  M+++GV K+PGCSWI + G  H F   D+ HP K +I++ L
Sbjct: 615 KWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQIHSLL 661

BLAST of HG10018388 vs. TAIR 10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 419.1 bits (1076), Expect = 6.8e-117
Identity = 224/619 (36.19%), Positives = 343/619 (55.41%), Query Frame = 0

Query: 54  LVDLLRDCVDATFLKQAKTVHGFLLKSKFSNHDSL--VLLNHVAHAYSKCSDIDAACRLF 113
           L  L+  C     L + + +H +  K  F++++ +   LLN     Y+KC+DI+ A   F
Sbjct: 392 LASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALLN----LYAKCADIETALDYF 451

Query: 114 DQMSQRNIFSWTVIIVGLAENGLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIE 173
            +    N+  W V++V         + F +F +MQ + I P+Q+ Y  IL+ CI L  +E
Sbjct: 452 LETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLE 511

Query: 174 LGKMVHAQIVIRGFASHTFVSTALLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFT 233
           LG+ +H+QI+   F  + +V + L++MYAKL K++ ++++       +VVSW  MI+G+T
Sbjct: 512 LGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYT 571

Query: 234 SNGLYLDAFDIFLRMNGEGVTPDAQTFIGVAKAIGMLRDVNKAKEVSGYASELGSIFDSH 293
                  A   F +M   G+  D    +G+  A+                          
Sbjct: 572 QYNFDDKALTTFRQMLDRGIRSDE---VGLTNAVSAC----------------------- 631

Query: 294 FTNSALKCLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDL 353
              + L+ L  G+++HA+A  SG   + +   NA+   Y++CG +E+    F + E  D 
Sbjct: 632 ---AGLQALKEGQQIHAQACVSGFSSD-LPFQNALVTLYSRCGKIEESYLAFEQTEAGDN 691

Query: 354 VSWTSLVTAYSQCSEWDKAIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQVHGF 413
           ++W +LV+ + Q    ++A+ +F  M  EG   N F F S + + +    ++ G+QVH  
Sbjct: 692 IAWNALVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAV 751

Query: 414 ICKVGLDMDKCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDA 473
           I K G D +  + +ALI MYAKCG +++A+K F  +S  + VSW AII  +++HG   +A
Sbjct: 752 ITKTGYDSETEVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEA 811

Query: 474 LQLFRRMEQLGVEPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVD 533
           L  F +M    V PN VT V VL ACSH GLV++G+ YF+ M   YGL P+ EHY C+VD
Sbjct: 812 LDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVD 871

Query: 534 LLSRVGHLNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSAT 593
           +L+R G L+ A EFI  MP++P+ +VW+TLL AC VH N+E+GE AA  +L  + E+SAT
Sbjct: 872 MLTRAGLLSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSAT 931

Query: 594 YVLLSNTYIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKI 653
           YVLLSN Y  S  +      R  MKE+GVKKEPG SWI V  ++H FY GDQ HP  D+I
Sbjct: 932 YVLLSNLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEI 976

Query: 654 YAKLGELRLKAISLDDVPD 671
           +    +L  +A  +  V D
Sbjct: 992 HEYFQDLTKRASEIGYVQD 976

BLAST of HG10018388 vs. TAIR 10
Match: AT2G27610.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 413.7 bits (1062), Expect = 2.9e-115
Identity = 222/605 (36.69%), Positives = 341/605 (56.36%), Query Frame = 0

Query: 73  VHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMSQRNIFSWTVIIVGLAEN 132
           VH  ++K+      ++ + N + + Y KC ++  A  LFD+   +++ +W  +I G A N
Sbjct: 216 VHTVVVKNGLDK--TIPVSNSLINLYLKCGNVRKARILFDKTEVKSVVTWNSMISGYAAN 275

Query: 133 GLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIELGKMVHAQIVIRGFASHTFVS 192
           GL L+   +F  M+   +   + +++ ++++C  L  +   + +H  +V  GF     + 
Sbjct: 276 GLDLEALGMFYSMRLNYVRLSESSFASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIR 335

Query: 193 TALLNMYAKLQKIEDSYNVFNTMTEV-NVVSWNAMISGFTSNGLYLDAFDIFLRMNGEGV 252
           TAL+  Y+K   + D+  +F  +  V NVVSW AMISGF  N    +A D+F  M  +GV
Sbjct: 336 TALMVAYSKCTAMLDALRLFKEIGCVGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGV 395

Query: 253 TPDAQTFIGVAKAIGMLRDVNKAKEVSGYASELGSIFDSHFTNSALKCLSSGKKVHARAI 312
            P+  T+  +  A+ ++                                 S  +VHA+ +
Sbjct: 396 RPNEFTYSVILTALPVI---------------------------------SPSEVHAQVV 455

Query: 313 KSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWTSLVTAYSQCSEWDKAI 372
           K+  E    ++  A+ +AY K G +E+  KVF  ++D+D+V+W++++  Y+Q  E + AI
Sbjct: 456 KTNYE-RSSTVGTALLDAYVKLGKVEEAAKVFSGIDDKDIVAWSAMLAGYAQTGETEAAI 515

Query: 373 EIFSNMREEGYAPNQFAFSSVLVSCASL-CLLEYGQQVHGFICKVGLDMDKCIESALIDM 432
           ++F  + + G  PN+F FSS+L  CA+    +  G+Q HGF  K  LD   C+ SAL+ M
Sbjct: 516 KMFGELTKGGIKPNEFTFSSILNVCAATNASMGQGKQFHGFAIKSRLDSSLCVSSALLTM 575

Query: 433 YAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFRRMEQLGVEPNAVTF 492
           YAK G +  A++VF R    D VSW ++I+G+AQHG    AL +F+ M++  V+ + VTF
Sbjct: 576 YAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAMKALDVFKEMKKRKVKMDGVTF 635

Query: 493 VCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLSRVGHLNDAMEFISRMP 552
           + V  AC+H GLVEEG +YF +M  +  + P  EH SC+VDL SR G L  AM+ I  MP
Sbjct: 636 IGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCMVDLYSRAGQLEKAMKVIENMP 695

Query: 553 VEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNTYIESGSYKDGLS 612
                 +W+T+L ACRVH   ELG LAA+KI++ K E+SA YVLLSN Y ESG +++   
Sbjct: 696 NPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDSAAYVLLSNMYAESGDWQERAK 755

Query: 613 LRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLGELRLKAISLDDVPD 672
           +R +M E+ VKKEPG SWI V    + F AGD+ HP KD+IY KL +L  +   L   PD
Sbjct: 756 VRKLMNERNVKKEPGYSWIEVKNKTYSFLAGDRSHPLKDQIYMKLEDLSTRLKDLGYEPD 784

Query: 673 LSYEL 676
            SY L
Sbjct: 816 TSYVL 784

BLAST of HG10018388 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 406.0 bits (1042), Expect = 6.0e-113
Identity = 214/621 (34.46%), Positives = 345/621 (55.56%), Query Frame = 0

Query: 61  CVDATF-----LKQAKTVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMS 120
           CV  +F     +   + +HGF+LKS F   +S+   N +   Y K   +D+A ++FD+M+
Sbjct: 200 CVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVG--NSLVAFYLKNQRVDSARKVFDEMT 259

Query: 121 QRNIFSWTVIIVGLAENGLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIELGKM 180
           +R++ SW  II G   NGL   G  +F +M   GI  D      +   C     I LG+ 
Sbjct: 260 ERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRA 319

Query: 181 VHAQIVIRGFASHTFVSTALLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFTSNGL 240
           VH+  V   F+        LL+MY+K   ++ +  VF  M++ +VVS+ +MI+G+   GL
Sbjct: 320 VHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGL 379

Query: 241 YLDAFDIFLRMNGEGVTPDAQTFIGVAKAIGMLRDVNKAKEVSGYASELGSIFDSHFTNS 300
             +A  +F  M  EG++PD  T   V       R +++ K V  +  E    FD      
Sbjct: 380 AGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFD------ 439

Query: 301 ALKCLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWT 360
                                   I +SNA+ + YAKCGS+++   VF  M  +D++SW 
Sbjct: 440 ------------------------IFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWN 499

Query: 361 SLVTAYSQCSEWDKAIEIFSNMREE-GYAPNQFAFSSVLVSCASLCLLEYGQQVHGFICK 420
           +++  YS+    ++A+ +F+ + EE  ++P++   + VL +CASL   + G+++HG+I +
Sbjct: 500 TIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMR 559

Query: 421 VGLDMDKCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQL 480
            G   D+ + ++L+DMYAKCG L  A  +FD I++ D VSWT +IAG+  HG   +A+ L
Sbjct: 560 NGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIAL 619

Query: 481 FRRMEQLGVEPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLS 540
           F +M Q G+E + ++FV +L+ACSH GLV+EG ++F +M+    + P +EHY+CIVD+L+
Sbjct: 620 FNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLA 679

Query: 541 RVGHLNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVL 600
           R G L  A  FI  MP+ P+  +W  LL  CR+H +V+L E  A+K+   + EN+  YVL
Sbjct: 680 RTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVL 739

Query: 601 LSNTYIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAK 660
           ++N Y E+  ++    LR  + ++G++K PGCSWI + G ++ F AGD  +PE + I A 
Sbjct: 740 MANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAF 788

Query: 661 LGELRLKAISLDDVPDLSYEL 676
           L ++R + I     P   Y L
Sbjct: 800 LRKVRARMIEEGYSPLTKYAL 788

BLAST of HG10018388 vs. TAIR 10
Match: AT4G21300.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 400.2 bits (1027), Expect = 3.3e-111
Identity = 215/610 (35.25%), Positives = 331/610 (54.26%), Query Frame = 0

Query: 57  LLRDCVDATFLKQAKTVHGFLLKSKFSNHDSLVLLNHVAHAYSKCSDIDAACRLFDQMSQ 116
           +L  C     +     +HG ++ S      S  + N +   YSKC   D A +LF  MS+
Sbjct: 245 VLSVCASKLLIDLGVQLHGLVVVSGVDFEGS--IKNSLLSMYSKCGRFDDASKLFRMMSR 304

Query: 117 RNIFSWTVIIVGLAENGLFLDGFELFCEMQSQGIFPDQFAYSGILQICIGLDSIELGKMV 176
            +  +W  +I G  ++GL  +    F EM S G+ PD   +S +L      +++E  K +
Sbjct: 305 ADTVTWNCMISGYVQSGLMEESLTFFYEMISSGVLPDAITFSSLLPSVSKFENLEYCKQI 364

Query: 177 HAQIVIRGFASHTFVSTALLNMYAKLQKIEDSYNVFNTMTEVNVVSWNAMISGFTSNGLY 236
           H  I+    +   F+++AL++ Y K + +  + N+F+    V+VV + AMISG+  NGLY
Sbjct: 365 HCYIMRHSISLDIFLTSALIDAYFKCRGVSMAQNIFSQCNSVDVVVFTAMISGYLHNGLY 424

Query: 237 LDAFDIFLRMNGEGVTPDAQTFIGVAKAIGMLRDVNKAKEVSGYASELGSIFDSHFTNSA 296
           +D+ ++F  +    ++P+  T + +   IG+                             
Sbjct: 425 IDSLEMFRWLVKVKISPNEITLVSILPVIGI----------------------------- 484

Query: 297 LKCLSSGKKVHARAIKSGLEVNYISISNAVANAYAKCGSLEDVRKVFYRMEDRDLVSWTS 356
           L  L  G+++H   IK G + N  +I  AV + YAKCG +    ++F R+  RD+VSW S
Sbjct: 485 LLALKLGRELHGFIIKKGFD-NRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNS 544

Query: 357 LVTAYSQCSEWDKAIEIFSNMREEGYAPNQFAFSSVLVSCASLCLLEYGQQVHGFICKVG 416
           ++T  +Q      AI+IF  M   G   +  + S+ L +CA+L    +G+ +HGF+ K  
Sbjct: 545 MITRCAQSDNPSAAIDIFRQMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKHS 604

Query: 417 LDMDKCIESALIDMYAKCGCLAEAKKVFDRISNADTVSWTAIIAGHAQHGIVDDALQLFR 476
           L  D   ES LIDMYAKCG L  A  VF  +   + VSW +IIA    HG + D+L LF 
Sbjct: 605 LASDVYSESTLIDMYAKCGNLKAAMNVFKTMKEKNIVSWNSIIAACGNHGKLKDSLCLFH 664

Query: 477 RM-EQLGVEPNAVTFVCVLFACSHGGLVEEGLQYFKLMKENYGLVPEMEHYSCIVDLLSR 536
            M E+ G+ P+ +TF+ ++ +C H G V+EG+++F+ M E+YG+ P+ EHY+C+VDL  R
Sbjct: 665 EMVEKSGIRPDQITFLEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLFGR 724

Query: 537 VGHLNDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLL 596
            G L +A E +  MP  P+  VW TLLGACR+H NVEL E+A+ K++     NS  YVL+
Sbjct: 725 AGRLTEAYETVKSMPFPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYYVLI 784

Query: 597 SNTYIESGSYKDGLSLRHVMKEQGVKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIY--- 656
           SN +  +  ++    +R +MKE+ V+K PG SWI +N   H F +GD  HPE   IY   
Sbjct: 785 SNAHANAREWESVTKVRSLMKEREVQKIPGYSWIEINKRTHLFVSGDVNHPESSHIYSLL 822

Query: 657 -AKLGELRLK 662
            + LGELRL+
Sbjct: 845 NSLLGELRLE 822
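
The identity and positives percentages in the HSP header are the match counts divided by the alignment length (610 columns). A minimal sketch of that arithmetic, assuming two-decimal rounding as displayed on this page; `hsp_percent` is a hypothetical helper name and is not part of any BLAST tool:

```python
def hsp_percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the HSP header above: 215 identical and 331 positive
# columns out of an alignment length of 610.
identity = hsp_percent(215, 610)
positives = hsp_percent(331, 610)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The same helper can be applied to any other HSP header that reports its counts in the `matches/length` form.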

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038884632.1 | 0.0e+00 | 84.26 | pentatricopeptide repeat-containing protein At3g16610-like [Benincasa hispida] >...
XP_011656423.1 | 0.0e+00 | 83.31 | putative pentatricopeptide repeat-containing protein At3g25970 [Cucumis sativus]
KAA0052031.1 | 0.0e+00 | 82.68 | pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]
TYK04529.1 | 0.0e+00 | 82.79 | pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]
XP_016901974.1 | 0.0e+00 | 82.90 | PREDICTED: pentatricopeptide repeat-containing protein At2g27610-like [Cucumis m...

Match Name | E-value | Identity (%) | Description
Q9SIT7 | 3.0e-117 | 34.05 | Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...
Q9SVP7 | 9.6e-116 | 36.19 | Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX...
Q9ZUW3 | 4.0e-114 | 36.69 | Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX...
Q9SN39 | 8.4e-112 | 34.46 | Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
Q9STE1 | 4.6e-110 | 35.25 | Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX...

Match Name | E-value | Identity (%) | Description
A0A0A0KBQ4 | 0.0e+00 | 83.31 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G013890 PE=4 SV=1
A0A5A7UEQ9 | 0.0e+00 | 82.68 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A5D3C2B1 | 0.0e+00 | 82.79 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S4E171 | 0.0e+00 | 82.90 | pentatricopeptide repeat-containing protein At2g27610-like OS=Cucumis melo OX=36...
A0A6J1FGJ6 | 0.0e+00 | 82.09 | pentatricopeptide repeat-containing protein At3g16610-like OS=Cucurbita moschata...

Match Name | E-value | Identity (%) | Description
AT2G13600.1 | 2.1e-118 | 34.05 | Pentatricopeptide repeat (PPR) superfamily protein
AT4G13650.1 | 6.8e-117 | 36.19 | Pentatricopeptide repeat (PPR) superfamily protein
AT2G27610.1 | 2.9e-115 | 36.69 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G18750.1 | 6.0e-113 | 34.46 | Pentatricopeptide repeat (PPR) superfamily protein
AT4G21300.1 | 3.3e-111 | 35.25 | Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
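IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 425..650; e-value: 5.5E-42; score: 146.2
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 169..286; e-value: 7.3E-18; score: 67.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 298..407; e-value: 4.5E-21; score: 77.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 38..168; e-value: 1.5E-18; score: 68.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 352..385; e-value: 4.3E-7; score: 27.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 324..352; e-value: 0.0013; score: 16.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 120..153; e-value: 7.4E-7; score: 27.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 92..119; e-value: 0.0017; score: 16.4
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 453..487; e-value: 1.7E-7; score: 29.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 221..255; e-value: 3.1E-6; score: 25.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 488..521; e-value: 6.5E-4; score: 17.7
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 425..447; e-value: 0.01; score: 16.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 525..550; e-value: 1.0; score: 9.8
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 451..497; e-value: 7.6E-11; score: 42.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 350..397; e-value: 1.3E-7; score: 31.8
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 219..259; e-value: 3.0E-9; score: 37.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 118..162; e-value: 1.0E-7; score: 32.0
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 588..622; score: 8.604678
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 451..485; score: 12.780933
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 350..384; score: 12.682281
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 219..253; score: 11.597113
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 118..152; score: 10.917512
None | No IPR available | PANTHER | PTHR24015:SF1875 | PENTATRICOPEPTIDE (PPR) REPEAT PROTEIN | coord: 13..218
None | No IPR available | PANTHER | PTHR24015:SF1875 | PENTATRICOPEPTIDE (PPR) REPEAT PROTEIN | coord: 221..281
None | No IPR available | PANTHER | PTHR24015:SF1875 | PENTATRICOPEPTIDE (PPR) REPEAT PROTEIN | coord: 285..658
None | No IPR available | PANTHER | PTHR24015 | OS07G0578800 PROTEIN-RELATED | coord: 221..281
None | No IPR available | PANTHER | PTHR24015 | OS07G0578800 PROTEIN-RELATED | coord: 13..218
None | No IPR available | PANTHER | PTHR24015 | OS07G0578800 PROTEIN-RELATED | coord: 285..658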

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10018388.1 | HG10018388.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding