HG10019137 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10019137
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr04: 17395332 .. 17399142 (-)
RNA-Seq Expression: HG10019137
Synteny: HG10019137
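The Location field can be unpacked into a sequence name, coordinates, and strand. The sketch below is a minimal illustration, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split a 'Chr: start .. end (strand)' string into its parts.

    Assumes 1-based, inclusive coordinates; this is an assumption,
    not something the page itself states.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Inclusive span, so both end points are counted.
        "length": end - start + 1,
    }

loc = parse_location("Chr04: 17395332 .. 17399142 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 3811 bp on the minus strand of Chr04.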
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGAGAAACTACGGGCTGTGGAGCTTTATATTTTAAGTTGCAAGGTTTCAGCCATTGGAACGACTCAGGCGAGGCCGAAGAACGGGTCGTCGAGGAGAAAGAAAGGGACCCTTTTCGATCAAGTCGCGTTTATGGTTTTGATATGATTAAATACCAATCGTTGTTGCCGAATTCAGGTATACCTTAAAACCAACTCTATGAAACCTTTGTGGCTTTTTATGGGAATGCCCTATAGTGGACTGGAGATATGTTATAGCTCTATGTAGATTTCAACTCGATTTTAAGGTCCCGTTTGATAACCGTGGTTACACTACTTCAACTTATGAGTTTATTTGTTTTGTTACATAGTGTTTCTAATCCAAGCCAAATTTTGAAAACTAAAAAAAAAAAAAAAGTGGTTTTAAAATGTAGTAGCTTCTGTTTTTCAGCAGCTGAAGGTGGCTTGTAGGTCGAAGTTGTTCGAAAGATGATCTCTTCATCTTTCTTTTTGGTTGTATTTGTTGATCGTGTCTTTTCTTTTTCTCTTTAGTGAGAGTTTGTATCTCTTAAACGTTTTAATTCTTTTCATTATATCAATAAGAAGTTGTTTCTTGTTTCAAAAAAAATTACAATTTTCAAAAACAAAAAGCTTTGAACCAAATGGTTATCAAACGTGCATTAACTTTCTTCTAAACAACATTCTTAAAAATGGGTGATTGGATGTGCATCAGATTTTTTTGTCTTAGATGTTTATTAGGTCTGTTTTTGCCTTTAGGATACAAATTTGGTGGACAAGACCCCACTACATAAAGAACTATCAGTCTCACTATGAGGTATCCAACTCTCTATATGTGTGCATGCTCTTTTGCTTATTTAACTTCTATGTTGCGAATTCCTAGTTTCAATTCGACATGTTTAAGCATGGAGAAACCTTTTGTTGACATTGATTGTGAAAATTAATAAGGATGTGTTTTGTTTTTCAACATGTTATATTGTAGTTCCTGTTATCTCTTATATTTCAGTGCATTGACCTCAACCTTTTCTCTCTGCCTTTGTTCATCACATGATGATTATTGATTAATGGATTTTAAAAACCAAAATTGGGAGGGTAAGAAACTGTCCAGACAAAAGAGTGATAGGAATGCTATTAGGAATGTTATTGGGGCATTAAAATTAAGATTACATTTATCAAGGTGTCTATTTTTATGGAATCTATGAATTATCTTTACTTCTTGATCCTCAACGTTGTTTACAACAGTTTCTTGTTTGTTTGTTTTGTTTGTATGTTTTTTCCTTTCCTGCTTTGAGTCAAGATAGTTGCCTGTCAATTACTGAAAGTTATTGACAAAAATATATCTTTCCCTTCTATTGTTCTCAGACTTCTGCCATGTAAATGGAGACGGGTTTCCTTGTTCAGGCCTTCATTCCAAGCTTGTTGTTCTCTGTATTCAGCAACTACAACTGCTCCCAAGTATTACTTGGATGGAGTTGAAAATGAGAAAAGGGAGATTGATTTCAACCGACTATTCCTTTTCTGCACAAAAGTACACCTTGCTAAGCGACTCCATGCACTACTTGTGGTGTCTGGGAAGGCCCAGAACATCTTTTTTTCTGCTAAACTCATCAATCTCTATGCTTTTCTTGGTGATGTATCATTCGCTCGCCATACTTTTAACCAAATTCAGACAAAAGATGTCTACACATGGAATTCCATGATATCTGCATATACTCGAATTGGTCGCTTCCATGATGCTGTAGATTGTTTCTATGAATTTTTGTCAACTTCTATCCTTCAGCCTGATCATTACACATTTCCACCTGTTATAAGGGCATGTGGAAATCTAGATGATGGGAAGAAGATACATTGCTTGGTTCTAAAATTGGGTTTTGAATGTGATGTATTTATTGCGGCTTCTTTTATCCATTTTTATTCTCGGTTTGGCTTTGTCAATTTAGCTCGTAACTTGTTTGATAACATGATGATTCGAGATATTGGTACTTGGAATGCAATGATCTCAG
GGTTTTGTCTTAATGGAAAAGTTGTAGAAGCATTGGAAGTCTTTGATGAGATGCGATTCAAGAGTGTAACTATGGATTCTGTAACAGTCTCAAGTTTACTTCCTATTTGTGCGCAGTTGGATGATATAATAAGCGGCATCCTAATTCATGTCTATGCCGTCAAGCTTGGGTTGGAATTTGACTTGTTTGTATGTAATGCATTGATAAACATGTATGCCAAATTTGGTGAACTGAGAAGTGCAGAAACCATTTTCAATCAAATGGAAGTTAGGGATATTGTATCTTGGAACTCTCTGATTGCAGCATTCGAGCAGAATAAAGAGCCAGTCGTAGCTCTTGGGGTGTATAATAAGATGCATTCTATTGGGGTTGTACCCGACTTATTGACACTTGTGAGTTTAGCTTCTGTTGCTGCTGAACTTGGCAATTTTTTAAGTAGTAGGTCTATTCATGGATTTGTTACAAGGAGAGGTTGGTTTTTACATGATGTTGTCATTGGTAATGCAATTATAGACATGTATGCAAAGTTGGGTTTTATAGATTCAGCACGAAAAGTTTTCGAAGGACTTCCTGTCAAAGACGTCGTCTCATGGAATACTTTGATAACAGGTTATTCTCAAAATGGTTTAGCAAACGAGGCAATTGATGTGTCTCATTTGATGAAAGATTATAGCGATGCAGTTCCTAACCAAGGCACTTGGGTGAGTATTCTCACAGCACACTCCCAGATAGGAGCCTTGAAACAAGGGATGAAAACACATGGTCAGCTGATAAAAAAATTTCTATACTTTGACATCTTTGTGGGTACTTGTCTTGTTGATATGTATGGAAAATGTGGAAGATTAGCTGATGCACTGTCTTTATTTTATGAAGTACCACATAAAAGTTCGGTTTCTTGGAACACCATCATATCATGTCACGGACTCCATGGATACGGTTTAAAAGCTGTCGAGTTATTTAGGGAAATGCAAAGTGAAGGAGTAAAGCCTGACCATATCACTTTTGTATCTCTGTTATCTGCTTGTAGCCATTCAGGTTTGGTTGACGAGGGTCAGTGGTGCTTCCAATTGATGGAAGAGACTTATGGGATAAGGCCTGGCTTGAAGCATTATGGCTGCATGGTAGATTTGTTCGGTAGGGCTGGCCATCTCGAAAAAGCTTATAATTTTGTAAAAAATATGCCGATACGGCCTGATGCTTCCGTGTGGGGTGCACTTCTTGGTGCTTGTAGGATACATGAGAATGTAGAGTTGGTCAGAACTATCTCAGATCACTTGTTGGAGGCTGAATCAGAAAATGTTGGCTACTATGTTTTGTTATCGAATATTTATGCTAAACTCGGACAGTGGGAAGGAGTTGATGAAGTGCGATCATTAGCTCGAGACAGGGGATTGAAGAAGACTCCTGGGTGGAGTTCAATTGAAGTAGACAAGAAAATTGATGTCTTTTACACTGGCAACCAAACACATCCAAAATGTGAGGAGATATACAATGAACTAAGGAATCTAACTGTTAAAATGAAGAGTCTTGGTTATGTTCCAGACTATAACTTTGTATTGCAGGATGTGGAGGATGATGAAAAGGAAAACATTCTTACGAGCCATAGCGAGCGATTGGCGATGGCATTTGGGATTATCAGCACACCACCGAAAACAACTCTTCAGATCTTCAAGAACTTGCGGATTTGTGGAGACTGTCATAACGCTACTAAGTTCATATCTAAAATTACTGAAAGAGAGATCATCGTAAGAGATTCGAACCGATTCCATCATTTCAAAGATGGAGTTTGTTCTTGTGGTGATTATTGGTGA

mRNA sequence

ATGCGAGAAACTACGGGCTGTGGAGCTTTATATTTTAAGTTGCAAGGTTTCAGCCATTGGAACGACTCAGGCGAGGCCGAAGAACGGGTCGTCGAGGAGAAAGAAAGGGACCCTTTTCGATCAAGTCGCGTTTATGGTTTTGATATGATTAAATACCAATCGTTGTTGCCGAATTCAGCAACTACAACTGCTCCCAAGTATTACTTGGATGGAGTTGAAAATGAGAAAAGGGAGATTGATTTCAACCGACTATTCCTTTTCTGCACAAAAGTACACCTTGCTAAGCGACTCCATGCACTACTTGTGGTGTCTGGGAAGGCCCAGAACATCTTTTTTTCTGCTAAACTCATCAATCTCTATGCTTTTCTTGGTGATGTATCATTCGCTCGCCATACTTTTAACCAAATTCAGACAAAAGATGTCTACACATGGAATTCCATGATATCTGCATATACTCGAATTGGTCGCTTCCATGATGCTGTAGATTGTTTCTATGAATTTTTGTCAACTTCTATCCTTCAGCCTGATCATTACACATTTCCACCTGTTATAAGGGCATGTGGAAATCTAGATGATGGGAAGAAGATACATTGCTTGGTTCTAAAATTGGGTTTTGAATGTGATGTATTTATTGCGGCTTCTTTTATCCATTTTTATTCTCGGTTTGGCTTTGTCAATTTAGCTCGTAACTTGTTTGATAACATGATGATTCGAGATATTGGTACTTGGAATGCAATGATCTCAGGGTTTTGTCTTAATGGAAAAGTTGTAGAAGCATTGGAAGTCTTTGATGAGATGCGATTCAAGAGTGTAACTATGGATTCTGTAACAGTCTCAAGTTTACTTCCTATTTGTGCGCAGTTGGATGATATAATAAGCGGCATCCTAATTCATGTCTATGCCGTCAAGCTTGGGTTGGAATTTGACTTGTTTGTATGTAATGCATTGATAAACATGTATGCCAAATTTGGTGAACTGAGAAGTGCAGAAACCATTTTCAATCAAATGGAAGTTAGGGATATTGTATCTTGGAACTCTCTGATTGCAGCATTCGAGCAGAATAAAGAGCCAGTCGTAGCTCTTGGGGTGTATAATAAGATGCATTCTATTGGGGTTGTACCCGACTTATTGACACTTGTGAGTTTAGCTTCTGTTGCTGCTGAACTTGGCAATTTTTTAAGTAGTAGGTCTATTCATGGATTTGTTACAAGGAGAGGTTGGTTTTTACATGATGTTGTCATTGGTAATGCAATTATAGACATGTATGCAAAGTTGGGTTTTATAGATTCAGCACGAAAAGTTTTCGAAGGACTTCCTGTCAAAGACGTCGTCTCATGGAATACTTTGATAACAGGTTATTCTCAAAATGGTTTAGCAAACGAGGCAATTGATGTGTCTCATTTGATGAAAGATTATAGCGATGCAGTTCCTAACCAAGGCACTTGGGATGTGGAGGATGATGAAAAGGAAAACATTCTTACGAGCCATAGCGAGCGATTGGCGATGGCATTTGGGATTATCAGCACACCACCGAAAACAACTCTTCAGATCTTCAAGAACTTGCGGATTTGTGGAGACTGTCATAACGCTACTAAGTTCATATCTAAAATTACTGAAAGAGAGATCATCGTAAGAGATTCGAACCGATTCCATCATTTCAAAGATGGAGTTTGTTCTTGTGGTGATTATTGGTGA

Coding sequence (CDS)

ATGCGAGAAACTACGGGCTGTGGAGCTTTATATTTTAAGTTGCAAGGTTTCAGCCATTGGAACGACTCAGGCGAGGCCGAAGAACGGGTCGTCGAGGAGAAAGAAAGGGACCCTTTTCGATCAAGTCGCGTTTATGGTTTTGATATGATTAAATACCAATCGTTGTTGCCGAATTCAGCAACTACAACTGCTCCCAAGTATTACTTGGATGGAGTTGAAAATGAGAAAAGGGAGATTGATTTCAACCGACTATTCCTTTTCTGCACAAAAGTACACCTTGCTAAGCGACTCCATGCACTACTTGTGGTGTCTGGGAAGGCCCAGAACATCTTTTTTTCTGCTAAACTCATCAATCTCTATGCTTTTCTTGGTGATGTATCATTCGCTCGCCATACTTTTAACCAAATTCAGACAAAAGATGTCTACACATGGAATTCCATGATATCTGCATATACTCGAATTGGTCGCTTCCATGATGCTGTAGATTGTTTCTATGAATTTTTGTCAACTTCTATCCTTCAGCCTGATCATTACACATTTCCACCTGTTATAAGGGCATGTGGAAATCTAGATGATGGGAAGAAGATACATTGCTTGGTTCTAAAATTGGGTTTTGAATGTGATGTATTTATTGCGGCTTCTTTTATCCATTTTTATTCTCGGTTTGGCTTTGTCAATTTAGCTCGTAACTTGTTTGATAACATGATGATTCGAGATATTGGTACTTGGAATGCAATGATCTCAGGGTTTTGTCTTAATGGAAAAGTTGTAGAAGCATTGGAAGTCTTTGATGAGATGCGATTCAAGAGTGTAACTATGGATTCTGTAACAGTCTCAAGTTTACTTCCTATTTGTGCGCAGTTGGATGATATAATAAGCGGCATCCTAATTCATGTCTATGCCGTCAAGCTTGGGTTGGAATTTGACTTGTTTGTATGTAATGCATTGATAAACATGTATGCCAAATTTGGTGAACTGAGAAGTGCAGAAACCATTTTCAATCAAATGGAAGTTAGGGATATTGTATCTTGGAACTCTCTGATTGCAGCATTCGAGCAGAATAAAGAGCCAGTCGTAGCTCTTGGGGTGTATAATAAGATGCATTCTATTGGGGTTGTACCCGACTTATTGACACTTGTGAGTTTAGCTTCTGTTGCTGCTGAACTTGGCAATTTTTTAAGTAGTAGGTCTATTCATGGATTTGTTACAAGGAGAGGTTGGTTTTTACATGATGTTGTCATTGGTAATGCAATTATAGACATGTATGCAAAGTTGGGTTTTATAGATTCAGCACGAAAAGTTTTCGAAGGACTTCCTGTCAAAGACGTCGTCTCATGGAATACTTTGATAACAGGTTATTCTCAAAATGGTTTAGCAAACGAGGCAATTGATGTGTCTCATTTGATGAAAGATTATAGCGATGCAGTTCCTAACCAAGGCACTTGGGATGTGGAGGATGATGAAAAGGAAAACATTCTTACGAGCCATAGCGAGCGATTGGCGATGGCATTTGGGATTATCAGCACACCACCGAAAACAACTCTTCAGATCTTCAAGAACTTGCGGATTTGTGGAGACTGTCATAACGCTACTAAGTTCATATCTAAAATTACTGAAAGAGAGATCATCGTAAGAGATTCGAACCGATTCCATCATTTCAAAGATGGAGTTTGTTCTTGTGGTGATTATTGGTGA

Protein sequence

MRETTGCGALYFKLQGFSHWNDSGEAEERVVEEKERDPFRSSRVYGFDMIKYQSLLPNSATTTAPKYYLDGVENEKREIDFNRLFLFCTKVHLAKRLHALLVVSGKAQNIFFSAKLINLYAFLGDVSFARHTFNQIQTKDVYTWNSMISAYTRIGRFHDAVDCFYEFLSTSILQPDHYTFPPVIRACGNLDDGKKIHCLVLKLGFECDVFIAASFIHFYSRFGFVNLARNLFDNMMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVTVSSLLPICAQLDDIISGILIHVYAVKLGLEFDLFVCNALINMYAKFGELRSAETIFNQMEVRDIVSWNSLIAAFEQNKEPVVALGVYNKMHSIGVVPDLLTLVSLASVAAELGNFLSSRSIHGFVTRRGWFLHDVVIGNAIIDMYAKLGFIDSARKVFEGLPVKDVVSWNTLITGYSQNGLANEAIDVSHLMKDYSDAVPNQGTWDVEDDEKENILTSHSERLAMAFGIISTPPKTTLQIFKNLRICGDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW
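The protein sequence corresponds to the CDS read codon by codon. As a quick consistency check, the sketch below translates the first and last nine bases of the CDS with the standard genetic code; it assumes the standard nuclear code applies to this gene, and the helper name `translate` is illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First nine codons and last three codons of the CDS listed above.
first = translate("ATGCGAGAAACTACGGGCTGTGGAGCT")
last = translate("TATTGGTGA")
print(first, last)
```

The first fragment gives MRETTGCGA and the last gives YW followed by a stop, matching the start and end of the protein sequence above.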
Homology
BLAST of HG10019137 vs. NCBI nr
Match: KAG7014884.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 897.1 bits (2317), Expect = 7.7e-257
Identity = 505/887 (56.93%), Positives = 530/887 (59.75%), Query Frame = 0

Query: 3    ETTGCGALYFKLQGFSHWNDSGEAEERVVEEKERDPFRSSRVYGFDMIKYQSLLPN---- 62
            E+TGCGAL  KLQ F +WN+SGEAEE+VVEEKER PFRSSR+YGFDMIK++  LPN    
Sbjct: 634  ESTGCGALKLKLQDFCYWNNSGEAEEQVVEEKERSPFRSSRIYGFDMIKHRFSLPNSVKL 693

Query: 63   --------------------------------SATTTA--PKYYLDGVENEKREIDFNRL 122
                                            SATTTA  PKYYLD VE EK+EIDFNRL
Sbjct: 694  LSESGYEFKCLFSEDEHMVKKLPSFQACCPLYSATTTAPTPKYYLDEVEIEKKEIDFNRL 753

Query: 123  FLFCTKVHLAKRLHALLVVSGKAQNIFFSAKLINLYAFLGDVSFARHTFNQIQTKDVYTW 182
            FL C KVHLAKRLHALLVVSGK Q+IF SAKLINLYAFLGDVSFAR TF+QIQ KDVYTW
Sbjct: 754  FLVCKKVHLAKRLHALLVVSGKIQSIFLSAKLINLYAFLGDVSFARRTFDQIQAKDVYTW 813

Query: 183  NSMISAYTRIGRFHDAVDCFYEFLSTSILQPDHYTFPPVIRACGNLDDGKKIHCLVLKLG 242
            NSMISAY RIG FH+AVDCF+EF+STSILQPD+YTFPPVIRACGNLDDGKKIHCL LKLG
Sbjct: 814  NSMISAYARIGHFHEAVDCFHEFMSTSILQPDYYTFPPVIRACGNLDDGKKIHCLALKLG 873

Query: 243  FECDVFIAASFIHFYSRFGFVNLARNLFDNMMIRDIGTWNAMISGFCLNGKVVEALEVFD 302
            FECDVFIAAS IHFYSRFGFVNLARNLFD++MIRDIGTWNAMISGFCLNGKVVEALEVFD
Sbjct: 874  FECDVFIAASLIHFYSRFGFVNLARNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVFD 933

Query: 303  EMRFKSVTMDSVTVSSLLPICAQLDDIISGILIHVYAVKLGLEFDLFVCNALINMYAKFG 362
            EMRFKSV MDSVT SSLLPICAQLDDIISG+LIHVYA+KLGLEFDLFVCNALINMYAKFG
Sbjct: 934  EMRFKSVNMDSVTFSSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFG 993

Query: 363  ELRSAETIFNQMEVRDIVSWNSLIAAFEQNKEPVVALGVYNKMHSIGVVPDLLTLVSLAS 422
            EL SAETIFNQ+E +DIVSWNSLIAAFEQNKEPVVALG+Y KMH+ G VPDLLTLVSLAS
Sbjct: 994  ELGSAETIFNQIEAKDIVSWNSLIAAFEQNKEPVVALGLYKKMHATGAVPDLLTLVSLAS 1053

Query: 423  VAAELGNFLSSRSIHGFVTRRGWFLHDVVIGNAIIDMYAKLGFIDSARKVFEGLPVKDVV 482
            VAAELGNFLSSRSIHGFVTR+GWFL DVVIGNAIIDMYAKLG+IDSARKVFE LPVKD V
Sbjct: 1054 VAAELGNFLSSRSIHGFVTRKGWFLQDVVIGNAIIDMYAKLGYIDSARKVFEELPVKDGV 1113

Query: 483  SWNTLITGYSQNGLANEAIDVSHLMKDYSDAVPNQGTW---------------------- 542
            SWNTLITGYSQNGLANEAIDV HLM DYSDAVPNQGTW                      
Sbjct: 1114 SWNTLITGYSQNGLANEAIDVYHLMNDYSDAVPNQGTWVSILTAYSQIGALKQGMKTHGL 1173

Query: 543  ------------------------------------------------------------ 565
                                                                        
Sbjct: 1174 LIKNFLYFDIFVGTCLIDLYGKCGRLADALSLFYEIPHKSSVSWNAIISCHGLHGYGLKA 1233
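The percentages in each HSP summary are the identical and positive-scoring column counts divided by the alignment length, which includes gap columns. The sketch below parses one summary line and recomputes both values; the regular expression and the function name `parse_hsp_stats` are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Tolerates the "Postives" spelling that some exports of this report use.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return counts and recomputed percentages from an HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, aln_len, _, pos, pos_len, _ = m.groups()
    ident, aln_len, pos, pos_len = map(int, (ident, aln_len, pos, pos_len))
    return {
        "identities": ident,
        "positives": pos,
        "alignment_length": aln_len,
        # Percentages are recomputed from the counts, not copied from the text.
        "identity_pct": round(100 * ident / aln_len, 2),
        "positive_pct": round(100 * pos / pos_len, 2),
    }

stats = parse_hsp_stats(
    "Identity = 505/887 (56.93%), Positives = 530/887 (59.75%), Query Frame = 0"
)
print(stats)
```

For the first hit this reproduces 56.93% identity and 59.75% positives over an 887-column alignment.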

BLAST of HG10019137 vs. NCBI nr
Match: XP_038903939.1 (pentatricopeptide repeat-containing protein At4g33990 [Benincasa hispida] >XP_038903940.1 pentatricopeptide repeat-containing protein At4g33990 [Benincasa hispida] >XP_038903941.1 pentatricopeptide repeat-containing protein At4g33990 [Benincasa hispida])

HSP 1 Score: 868.6 bits (2243), Expect = 2.9e-248
Identity = 481/793 (60.66%), Positives = 496/793 (62.55%), Query Frame = 0

Query: 59  SATTTAPKYYLDGVENEKREIDFNRLFLFCTKVHLAKRLHALLVVSGKAQNIFFSAKLIN 118
           SATTTAPKYYLDGVENEKREIDFNRLFLFCTKVH A+RLHALLVVSGKAQNIF SAKL+N
Sbjct: 26  SATTTAPKYYLDGVENEKREIDFNRLFLFCTKVHHAQRLHALLVVSGKAQNIFLSAKLVN 85

Query: 119 LYAFLGDVSFARHTFNQIQTKDVYTWNSMISAYTRIGRFHDAVDCFYEFLSTSILQPDHY 178
           LYAFLGD+SFA HTF+QIQTKDVYTWNSMISAY RIGRFH+AVDCFY+FLSTSILQPD+Y
Sbjct: 86  LYAFLGDISFACHTFDQIQTKDVYTWNSMISAYARIGRFHEAVDCFYKFLSTSILQPDYY 145

Query: 179 TFPPVIRACGNLDDGKKIHCLVLKLGFECDVFIAASFIHFYSRFGFVNLARNLFDNMMIR 238
           TFPPVIRACG LDDGKKIHCLVLKLGFECDVFIAAS +HFYSRFGFV+LARNLFDNMMIR
Sbjct: 146 TFPPVIRACGKLDDGKKIHCLVLKLGFECDVFIAASLVHFYSRFGFVSLARNLFDNMMIR 205

Query: 239 DIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVTVSSLLPICAQLDDIISGILIH 298
           DIGTWNAMISGF LNGKVVEALEVFDEMRFKSVTMDSVT+S+LLPICAQLDDIISG+LIH
Sbjct: 206 DIGTWNAMISGFYLNGKVVEALEVFDEMRFKSVTMDSVTISTLLPICAQLDDIISGVLIH 265

Query: 299 VYAVKLGLEFDLFVCNALINMYAKFGELRSAETIFNQMEVRDIVSWNSLIAAFEQNKEPV 358
           VYA+KLGLEFDLFV NALINMYAKFGELRSAETIFNQMEVRDIVSWNSLIAAFEQNKEPV
Sbjct: 266 VYAIKLGLEFDLFVRNALINMYAKFGELRSAETIFNQMEVRDIVSWNSLIAAFEQNKEPV 325

Query: 359 VALGVYNKMHSIGVVPDLLTLVSLASVAAELGNFLSSRSIHGFVTRRGWFLHDVVIGNAI 418
           VALGVYNKMHSIGVVPDLLTLVSLASVAAELGNFLSSRSIHGFVTRRGWFLHDVVIGNAI
Sbjct: 326 VALGVYNKMHSIGVVPDLLTLVSLASVAAELGNFLSSRSIHGFVTRRGWFLHDVVIGNAI 385

Query: 419 IDMYAKLGFIDSARKVFEGLPVKDVVSWNTLITGYSQNGLANEAIDVSHLMKDYSDAVPN 478
           IDMYAKLGFIDSARKVFEGLPVKDVVSWNTLITGYSQNGLAN+AIDV H MKDYSDAVPN
Sbjct: 386 IDMYAKLGFIDSARKVFEGLPVKDVVSWNTLITGYSQNGLANKAIDVYHSMKDYSDAVPN 445

Query: 479 QGTW-------------------------------------------------------- 538
           QGTW                                                        
Sbjct: 446 QGTWVSILTAHSQIGALKQGMKTHGQLIKFFLYFDIFVGTCLIDMYGKCGRLADALSLFY 505

Query: 539 ------------------------------------------------------------ 565
                                                                       
Sbjct: 506 EVPHKSSVSWNAIISCHGLHGYGLKAVELFREMQSEGVKPDHITFVSLLSACSHSGLVDE 565

BLAST of HG10019137 vs. NCBI nr
Match: XP_022923150.1 (pentatricopeptide repeat-containing protein At4g33990 [Cucurbita moschata])

HSP 1 Score: 832.0 bits (2148), Expect = 3.0e-237
Identity = 465/795 (58.49%), Positives = 483/795 (60.75%), Query Frame = 0

Query: 59  SATTTA--PKYYLDGVENEKREIDFNRLFLFCTKVHLAKRLHALLVVSGKAQNIFFSAKL 118
           SATTTA  PKYYLD VE EK+EIDFNRLFL C KVHLAKRLHALLVVSGK Q+IF SAKL
Sbjct: 26  SATTTAPTPKYYLDEVEIEKKEIDFNRLFLVCKKVHLAKRLHALLVVSGKVQSIFLSAKL 85

Query: 119 INLYAFLGDVSFARHTFNQIQTKDVYTWNSMISAYTRIGRFHDAVDCFYEFLSTSILQPD 178
           INLYAFLGDVSFAR TF+QIQ KDVYTWNSMISAY RIG FH+AVDCF+EF+STSILQPD
Sbjct: 86  INLYAFLGDVSFARRTFDQIQAKDVYTWNSMISAYARIGHFHEAVDCFHEFMSTSILQPD 145

Query: 179 HYTFPPVIRACGNLDDGKKIHCLVLKLGFECDVFIAASFIHFYSRFGFVNLARNLFDNMM 238
           +YTFPPVIRACGNLDDGKKIHCL LKLGFECDVFIAAS IHFYSRFGFVNLARNLFD++M
Sbjct: 146 YYTFPPVIRACGNLDDGKKIHCLALKLGFECDVFIAASLIHFYSRFGFVNLARNLFDSLM 205

Query: 239 IRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVTVSSLLPICAQLDDIISGIL 298
           IRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVT SSLLPICAQLDDIISG+L
Sbjct: 206 IRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVTFSSLLPICAQLDDIISGVL 265

Query: 299 IHVYAVKLGLEFDLFVCNALINMYAKFGELRSAETIFNQMEVRDIVSWNSLIAAFEQNKE 358
           IHVYA+KLGLEFDLFVCNALINMYAKFGEL SAETIFNQ+E +DIVSWNSLIAAFEQNKE
Sbjct: 266 IHVYAIKLGLEFDLFVCNALINMYAKFGELGSAETIFNQIEAKDIVSWNSLIAAFEQNKE 325

Query: 359 PVVALGVYNKMHSIGVVPDLLTLVSLASVAAELGNFLSSRSIHGFVTRRGWFLHDVVIGN 418
           PVVALG+Y KMH+ G VPDLLTLVSLASVAAELGNFLSSRSIHGFVTR+GWFL DVVIGN
Sbjct: 326 PVVALGLYKKMHATGAVPDLLTLVSLASVAAELGNFLSSRSIHGFVTRKGWFLQDVVIGN 385

Query: 419 AIIDMYAKLGFIDSARKVFEGLPVKDVVSWNTLITGYSQNGLANEAIDVSHLMKDYSDAV 478
           AIIDMYAKLG+IDSARKVFE LPVKDVVSWNTLITGYSQNGLANEAIDV HLM DYSDAV
Sbjct: 386 AIIDMYAKLGYIDSARKVFEELPVKDVVSWNTLITGYSQNGLANEAIDVYHLMNDYSDAV 445

Query: 479 PNQGTW------------------------------------------------------ 538
           PNQGTW                                                      
Sbjct: 446 PNQGTWVSILTAYSQIGALKQGMKTHGLLIKNFLYFDIFVGTCLIDMYGKCGRLADALSL 505

Query: 539 ------------------------------------------------------------ 565
                                                                       
Sbjct: 506 FYEIPHKSSVSWNAIISCHGLHGYGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLV 565

BLAST of HG10019137 vs. NCBI nr
Match: KAG6576861.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 828.6 bits (2139), Expect = 3.3e-236
Identity = 460/793 (58.01%), Positives = 480/793 (60.53%), Query Frame = 0

Query: 59  SATTTAPKYYLDGVENEKREIDFNRLFLFCTKVHLAKRLHALLVVSGKAQNIFFSAKLIN 118
           SATTT+PKYY D VE EK+EIDFNRLF  C KVHLAKRLHALL+VSGK Q+IF SAKLIN
Sbjct: 30  SATTTSPKYYFDEVEIEKKEIDFNRLFPVCKKVHLAKRLHALLLVSGKVQSIFLSAKLIN 89

Query: 119 LYAFLGDVSFARHTFNQIQTKDVYTWNSMISAYTRIGRFHDAVDCFYEFLSTSILQPDHY 178
           LYAFLGDVSFAR TF+QIQ KDVYTWNSMISAY RIG FH+AVDCF+EF+STSILQPD+Y
Sbjct: 90  LYAFLGDVSFARRTFDQIQAKDVYTWNSMISAYARIGHFHEAVDCFHEFMSTSILQPDYY 149

Query: 179 TFPPVIRACGNLDDGKKIHCLVLKLGFECDVFIAASFIHFYSRFGFVNLARNLFDNMMIR 238
           TFPPVIRACGNLDDGKKIHCL LKLGFECDVFIAAS IHFYSRFGFVNLARNLFD++MIR
Sbjct: 150 TFPPVIRACGNLDDGKKIHCLALKLGFECDVFIAASLIHFYSRFGFVNLARNLFDSLMIR 209

Query: 239 DIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVTVSSLLPICAQLDDIISGILIH 298
           DIGTWNAMISGFCLNGKVVEALEVFDEMRFKSV MDSVT SSLLPICAQLDDIISG+LIH
Sbjct: 210 DIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVNMDSVTFSSLLPICAQLDDIISGVLIH 269

Query: 299 VYAVKLGLEFDLFVCNALINMYAKFGELRSAETIFNQMEVRDIVSWNSLIAAFEQNKEPV 358
           VYA+KLGLEFDLFVCNALINMYAKFGEL SAETIFNQ+E +DIVSWNSLIAAFEQNKEPV
Sbjct: 270 VYAIKLGLEFDLFVCNALINMYAKFGELGSAETIFNQIEAKDIVSWNSLIAAFEQNKEPV 329

Query: 359 VALGVYNKMHSIGVVPDLLTLVSLASVAAELGNFLSSRSIHGFVTRRGWFLHDVVIGNAI 418
           VALG+Y KMH+ G VPDLLTLVSLASVAAELGNFLSSRSIHGFVTR+GWFL DVVIGNAI
Sbjct: 330 VALGLYKKMHATGAVPDLLTLVSLASVAAELGNFLSSRSIHGFVTRKGWFLQDVVIGNAI 389

Query: 419 IDMYAKLGFIDSARKVFEGLPVKDVVSWNTLITGYSQNGLANEAIDVSHLMKDYSDAVPN 478
           IDMYAKLG+IDSARKVFE LPVKDVVSWNTLITGYSQNGLANEAIDV HLM DYSDAVPN
Sbjct: 390 IDMYAKLGYIDSARKVFEELPVKDVVSWNTLITGYSQNGLANEAIDVYHLMNDYSDAVPN 449

Query: 479 QGTW-------------------------------------------------------- 538
           QGTW                                                        
Sbjct: 450 QGTWVSILTAYSQIGALKQGMKTHGLLIKNFLYFDIFVGTCLIDLYGKCGRLADALSLFY 509

Query: 539 ------------------------------------------------------------ 565
                                                                       
Sbjct: 510 EIPHKSSVSWNAIISCHGLHGYGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDE 569

BLAST of HG10019137 vs. NCBI nr
Match: KAA0066593.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 828.6 bits (2139), Expect = 3.3e-236
Identity = 459/800 (57.38%), Positives = 484/800 (60.50%), Query Frame = 0

Query: 52  YQSLLPNSATTTAPKYYLDGVENEKREIDFNRLFLFCTKVHLAKRLHALLVVSGKAQNIF 111
           +Q+     + TTAPKYYLDGVENEKREIDFNRLFLFCTKVHLAK+LH LLVVSGK Q+IF
Sbjct: 9   FQACCSLYSATTAPKYYLDGVENEKREIDFNRLFLFCTKVHLAKQLHGLLVVSGKTQSIF 68

Query: 112 FSAKLINLYAFLGDVSFARHTFNQIQTKDVYTWNSMISAYTRIGRFHDAVDCFYEFLSTS 171
            SAKLIN YAFLGD+S AR TF+QIQTKDVYTWNSMISAY RIG FH A+DCF EFLSTS
Sbjct: 69  LSAKLINRYAFLGDISHARLTFDQIQTKDVYTWNSMISAYARIGHFHAAIDCFNEFLSTS 128

Query: 172 ILQPDHYTFPPVIRACGNLDDGKKIHCLVLKLGFECDVFIAASFIHFYSRFGFVNLARNL 231
           ILQ DHYTFPPVIRACGNLDDG+KIHCLVLKLGFECDV+IAASFIHFYSRFGFV+LA NL
Sbjct: 129 ILQSDHYTFPPVIRACGNLDDGRKIHCLVLKLGFECDVYIAASFIHFYSRFGFVSLACNL 188

Query: 232 FDNMMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVTVSSLLPICAQLDDI 291
           FDNMMIRDIGTWNAMISGFCLN KV EALEVFDEMR KSVTMDSVT+SSLLPICAQLDDI
Sbjct: 189 FDNMMIRDIGTWNAMISGFCLNDKVAEALEVFDEMRLKSVTMDSVTISSLLPICAQLDDI 248

Query: 292 ISGILIHVYAVKLGLEFDLFVCNALINMYAKFGELRSAETIFNQMEVRDIVSWNSLIAAF 351
           I G+LIHVYA+KLGLEFDLFVCNALINMYAKFGELRSAETIFNQM+VRDIVSWNSL+AAF
Sbjct: 249 IWGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELRSAETIFNQMKVRDIVSWNSLLAAF 308

Query: 352 EQNKEPVVALGVYNKMHSIGVVPDLLTLVSLASVAAELGNFLSSRSIHGFVTRRGWFLHD 411
           EQNK+PV+ALGVYNKMHSIGVVPDLLTLVSLASV AELGNFLSSRSIHGFVTRR WFLHD
Sbjct: 309 EQNKKPVIALGVYNKMHSIGVVPDLLTLVSLASVIAELGNFLSSRSIHGFVTRRCWFLHD 368

Query: 412 VVIGNAIIDMYAKLGFIDSARKVFEGLPVKDVVSWNTLITGYSQNGLANEAIDVSHLMKD 471
           + +GNAIIDMYAKLGFIDSARKVFEGLPVKDV+SWN+LITGYSQNGLANEAIDV   M+D
Sbjct: 369 IALGNAIIDMYAKLGFIDSARKVFEGLPVKDVISWNSLITGYSQNGLANEAIDVYCSMRD 428

Query: 472 YSDAVPNQGTW------------------------------------------------- 531
           YS+AVPNQGTW                                                 
Sbjct: 429 YSNAVPNQGTWVSILTALSQLGALKQGMKTHGQLIKNFLYFDIFVSTCLIDMYGKCGRLA 488

Query: 532 ------------------------------------------------------------ 565
                                                                       
Sbjct: 489 DALSLFYEVPHKSSVSWNAIISCHGLHGYGLKAVKLFKEMQSEGVKPDHITFVSLLSACS 548

BLAST of HG10019137 vs. ExPASy Swiss-Prot
Match: O81767 (Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX=3702 GN=EMB2758 PE=3 SV=2)

HSP 1 Score: 479.6 bits (1233), Expect = 5.0e-134
Identity = 282/779 (36.20%), Positives = 370/779 (47.50%), Query Frame = 0

Query: 74  NEKREI-DFNRLFLFCTKVHLAKRLHALLVVSGKAQNIFFSAKLINLYAFLGDVSFARHT 133
           NE +EI D + LF +CT +  AK LHA LVVS + QN+  SAKL+NLY +LG+V+ ARHT
Sbjct: 49  NESKEIDDVHTLFRYCTNLQSAKCLHARLVVSKQIQNVCISAKLVNLYCYLGNVALARHT 108

Query: 134 FNQIQTKDVYTWNSMISAYTRIGRFHDAVDCFYEFLSTSILQPDHYTFPPVIRACGNLDD 193
           F+ IQ +DVY WN MIS Y R G   + + CF  F+ +S L PD+ TFP V++AC  + D
Sbjct: 109 FDHIQNRDVYAWNLMISGYGRAGNSSEVIRCFSLFMLSSGLTPDYRTFPSVLKACRTVID 168

Query: 194 GKKIHCLVLKLGFECDVFIAASFIHFYSRFGFVNLARNLFDNMMIRDIGTWNAMISGFCL 253
           G KIHCL LK GF  DV++AAS IH YSR+  V  AR LFD M +RD+G+WNAMISG+C 
Sbjct: 169 GNKIHCLALKFGFMWDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGYCQ 228

Query: 254 NGKVVEALEVFDEMRFKSVTMDSVTVSSLLPICAQLDDIISGILIHVYAVKLGLEFDLFV 313
           +G   EAL + + +R     MDSVTV SLL  C +  D   G+ IH Y++K GLE +LFV
Sbjct: 229 SGNAKEALTLSNGLR----AMDSVTVVSLLSACTEAGDFNRGVTIHSYSIKHGLESELFV 288

Query: 314 CNALINMYAKFGELRSAETIFNQMEVRDIVSWNSLIAAFEQNKEPVVALGVYNKMHSIGV 373
            N LI++YA+FG LR  + +F++M VRD++SWNS+I A+E N++P+ A+ ++ +M    +
Sbjct: 289 SNKLIDLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEMRLSRI 348

Query: 374 VPDLLTLVSLASVAAELGNFLSSRSIHGFVTRRGWFLHDVVIGNAIIDMYAKLGFIDSAR 433
            PD LTL+SLAS+ ++LG+  + RS+ GF  R+GWFL D+ IGNA++ MYAKLG +DSAR
Sbjct: 349 QPDCLTLISLASILSQLGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMYAKLGLVDSAR 408

Query: 434 KVFEGLPVKDVVSWNTLITGYSQNGLANEAIDVSHLMKDYSDAVPNQGTW---------- 493
            VF  LP  DV+SWNT+I+GY+QNG A+EAI++ ++M++  +   NQGTW          
Sbjct: 409 AVFNWLPNTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTWVSVLPACSQA 468

Query: 494 ------------------------------------------------------------ 553
                                                                       
Sbjct: 469 GALRQGMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPRVNSVPWNTLI 528

Query: 554 ------------------------------------------------------------ 565
                                                                       
Sbjct: 529 ACHGFHGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWCFEMMQTDYGI 588

BLAST of HG10019137 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 289.7 bits (740), Expect = 7.3e-77
Identity = 189/659 (28.68%), Positives = 298/659 (45.22%), Query Frame = 0

Query: 95  KRLHALLVVSGKAQNIFFSAKLINLYAFLGDVSFARHTFNQIQTKDVYTWNSMISAYTRI 154
           K++HA L+V G   + F   KLI+  +  GD++FAR  F+ +    ++ WN++I  Y+R 
Sbjct: 38  KQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSRN 97

Query: 155 GRFHDAVDCFYEFLSTSILQPDHYTFPPVIRACGNLDD---GKKIHCLVLKLGFECDVFI 214
             F DA+   Y  +  + + PD +TFP +++AC  L     G+ +H  V +LGF+ DVF+
Sbjct: 98  NHFQDAL-LMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFV 157

Query: 215 AASFIHFYSRFGFVNLARNLFDNMMI--RDIGTWNAMISGFCLNGKVVEALEVFDEMRFK 274
               I  Y++   +  AR +F+ + +  R I +W A++S +  NG+ +EALE+F +MR  
Sbjct: 158 QNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKM 217

Query: 275 SVTMDSVTVSSLLPICAQLDDIISGILIHVYAVKLGLEFDLFVCNALINMYAKFGELRSA 334
            V  D V + S+L     L D+  G  IH   VK+GLE +  +  +L  MYAK G++ +A
Sbjct: 218 DVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATA 277

Query: 335 ETIFNQMEVRDIVSWNSLIAAFEQNKEPVVALGVYNKMHSIGVVPDLLTLVSLASVAAEL 394
           + +F++M+  +++ WN++I+ + +N     A+ ++++M +  V PD +++ S  S  A++
Sbjct: 278 KILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTISITSAISACAQV 337

Query: 395 GNFLSSRSIHGFVTRRGWFLHDVVIGNAIIDMYAKLGFIDSARKVFEGLPVKDVVSWNTL 454
           G+   +RS++ +V R   +  DV I +A+IDM+AK G ++ AR VF+    +DVV W+ +
Sbjct: 338 GSLEQARSMYEYVGRSD-YRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAM 397

Query: 455 ITGYSQNGLANEAIDV-------------------------------------------- 514
           I GY  +G A EAI +                                            
Sbjct: 398 IVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRMADHKI 457

Query: 515 -----------------SHLMKDYS----------------------------------- 565
                             HL + Y                                    
Sbjct: 458 NPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQ 517

BLAST of HG10019137 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 273.5 bits (698), Expect = 5.4e-72
Identity = 188/675 (27.85%), Positives = 299/675 (44.30%), Query Frame = 0

Query: 80  DFNRLFLFC---TKVHLAKRLHALLVVSGKAQNIFFSAKLINLYAFLGDVSFARHTFNQI 139
           +F  L   C    ++ + K +H LLV SG + ++F    L N+YA    V+ AR  F+++
Sbjct: 137 NFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRM 196

Query: 140 QTKDVYTWNSMISAYTRIGRFHDAVDCFYEFLSTSILQPDHYTFP---PVIRACGNLDDG 199
             +D+ +WN++++ Y++ G    A++   + +    L+P   T     P + A   +  G
Sbjct: 197 PERDLVSWNTIVAGYSQNGMARMALE-MVKSMCEENLKPSFITIVSVLPAVSALRLISVG 256

Query: 200 KKIHCLVLKLGFECDVFIAASFIHFYSRFGFVNLARNLFDNMMIRDIGTWNAMISGFCLN 259
           K+IH   ++ GF+  V I+ + +  Y++ G +  AR LFD M+ R++ +WN+MI  +  N
Sbjct: 257 KEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQN 316

Query: 260 GKVVEALEVFDEMRFKSVTMDSVTVSSLLPICAQLDDIISGILIHVYAVKLGLEFDLFVC 319
               EA+ +F +M  + V    V+V   L  CA L D+  G  IH  +V+LGL+ ++ V 
Sbjct: 317 ENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVV 376

Query: 320 NALINMYAKFGELRSAETIFNQMEVRDIVSWNSLIAAFEQNKEPVVALGVYNKMHSIGVV 379
           N+LI+MY K  E+ +A ++F +++ R +VSWN++I  F QN  P+ AL  +++M S  V 
Sbjct: 377 NSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVK 436

Query: 380 PDLLTLVSLASVAAELGNFLSSRSIHGFVTRRGWFLHDVVIGNAIIDMYAKLGFIDSARK 439
           PD  T VS+ +  AEL     ++ IHG V  R     +V +  A++DMYAK G I  AR 
Sbjct: 437 PDTFTYVSVITAIAELSITHHAKWIHG-VVMRSCLDKNVFVTTALVDMYAKCGAIMIARL 496

Query: 440 VFEGLPVKDVVSWNTLITGYSQNGLANEAIDV-----------------------SH--- 499
           +F+ +  + V +WN +I GY  +G    A+++                       SH   
Sbjct: 497 IFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGL 556

Query: 500 ----------LMKDYS------------DAVPNQG----TWD------------------ 559
                     + ++YS            D +   G     WD                  
Sbjct: 557 VEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAML 616

Query: 560 ------------------------------------------------------------ 565
                                                                       
Sbjct: 617 GACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRK 676

BLAST of HG10019137 vs. ExPASy Swiss-Prot
Match: Q9STF3 (Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CRR2 PE=2 SV=1)

HSP 1 Score: 270.8 bits (691), Expect = 3.5e-71
Identity = 180/587 (30.66%), Positives = 293/587 (49.91%), Query Frame = 0

Query: 73  ENEKREIDFNRLFLFC---TKVHLAKRLHALLVVSGKAQNIFFSAKLINLYAFLGDVSFA 132
           E+   +  +  L L C   + +  A R+H  ++ +G  Q+ F + KLI +Y+ LG V +A
Sbjct: 72  ESSPSQQTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGMYSDLGSVDYA 131

Query: 133 RHTFNQIQTKDVYTWNSMISAYTRIGRFHDAVDCFYEFLSTSILQPDHYTFPPVIRAC-- 192
           R  F++ + + +Y WN++  A T  G   + +  +++     + + D +T+  V++AC  
Sbjct: 132 RKVFDKTRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGV-ESDRFTYTYVLKACVA 191

Query: 193 -----GNLDDGKKIHCLVLKLGFECDVFIAASFIHFYSRFGFVNLARNLFDNMMIRDIGT 252
                 +L  GK+IH  + + G+   V+I  + +  Y+RFG V+ A  +F  M +R++ +
Sbjct: 192 SECTVNHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVS 251

Query: 253 WNAMISGFCLNGKVVEALEVFDEM--RFKSVTMDSVTVSSLLPICAQLDDIISGILIHVY 312
           W+AMI+ +  NGK  EAL  F EM    K  + +SVT+ S+L  CA L  +  G LIH Y
Sbjct: 252 WSAMIACYAKNGKAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLIHGY 311

Query: 313 AVKLGLEFDLFVCNALINMYAKFGELRSAETIFNQMEVRDIVSWNSLIAAFEQNKEPVVA 372
            ++ GL+  L V +AL+ MY + G+L   + +F++M  RD+VSWNSLI+++  +     A
Sbjct: 312 ILRRGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYGKKA 371

Query: 373 LGVYNKMHSIGVVPDLLTLVSLASV----------------------------------- 432
           + ++ +M + G  P  +T VS+                                      
Sbjct: 372 IQIFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVD 431

Query: 433 ----------AAEL-------------GNFLSSRSIHGFV------TRRGWFLHDVVIGN 492
                     AA++             G+ L S  IHG V      +RR + L     GN
Sbjct: 432 LLGRANRLDEAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGN 491

Query: 493 AII--DMYAKLGFIDSARKV--------FEGLPVKDVVSWNTLITGYSQNGLANEAIDVS 552
            ++  D+YA+    D  ++V         + LP +  +     +  +      N  ++  
Sbjct: 492 YVLLADIYAEAQMWDEVKRVKKLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFNPLMEQI 551

Query: 553 HL--------MKDYSDAVPNQGT-WDVEDDEKENILTSHSERLAMAFGIISTPPKTTLQI 565
           H         MK+       +G  +++E +EKE I+  HSE+LA+AFG+I+T     ++I
Sbjct: 552 HAFLVKLAEDMKEKGYIPQTKGVLYELETEEKERIVLGHSEKLALAFGLINTSKGEPIRI 611

BLAST of HG10019137 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 269.2 bits (687), Expect = 1.0e-70
Identity = 190/621 (30.60%), Positives = 275/621 (44.28%), Query Frame = 0

Query: 95  KRLHALLVVSGKAQNIFFSAKLINLYAFLGDVSFARHTFN-------------------- 154
           K++H+ +V  G   N+  S  L+N+YA  GD   A+  F+                    
Sbjct: 166 KKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQV 225

Query: 155 -----------QIQTKDVYTWNSMISAYTRIGRFHDAVDCFYEFLSTSILQPDHYTFPPV 214
                      Q+  +D+ TWNSMIS + + G    A+D F + L  S+L PD +T   V
Sbjct: 226 GQMDLAMAQFEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASV 285

Query: 215 IRACGNLDD---GKKIHCLVLKLGFECDVFIAASFIHFYSRFGFV--------------- 274
           + AC NL+    GK+IH  ++  GF+    +  + I  YSR G V               
Sbjct: 286 LSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDL 345

Query: 275 ------------------NLARNLFDNMMIRDIGTWNAMISGFCLNGKVVEALEVFDEMR 334
                             N A+N+F ++  RD+  W AMI G+  +G   EA+ +F  M 
Sbjct: 346 KIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMV 405

Query: 335 FKSVTMDSVTVSSLLPICAQLDDIISGILIHVYAVKLGLEFDLFVCNALINMYAKFGELR 394
                 +S T++++L + + L  +  G  IH  AVK G  + + V NALI MYAK G + 
Sbjct: 406 GGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNIT 465

Query: 395 SAETIFNQMEV-RDIVSWNSLIAAFEQNKEPVVALGVYNKMHSIGVVPDLLTLVSLASVA 454
           SA   F+ +   RD VSW S+I A  Q+     AL ++  M   G+ PD +T V + S  
Sbjct: 466 SASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSAC 525

Query: 455 AELGNFLSSRSIHGFVTRRGWFLHDVVIGNAIIDMYAKLGFIDSARKVFEGLPVK-DVVS 514
              G     R     +      +  +     ++D++ + G +  A++  E +P++ DVV+
Sbjct: 526 THAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVT 585

Query: 515 WNTLITG----------------------------------YSQNGLANEA--------- 565
           W +L++                                   YS  G   EA         
Sbjct: 586 WGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKD 645

BLAST of HG10019137 vs. ExPASy TrEMBL
Match: A0A6J1EAW2 (pentatricopeptide repeat-containing protein At4g33990 OS=Cucurbita moschata OX=3662 GN=LOC111430902 PE=3 SV=1)

HSP 1 Score: 832.0 bits (2148), Expect = 1.5e-237
Identity = 465/795 (58.49%), Positives = 483/795 (60.75%), Query Frame = 0

Query: 59  SATTTA--PKYYLDGVENEKREIDFNRLFLFCTKVHLAKRLHALLVVSGKAQNIFFSAKL 118
           SATTTA  PKYYLD VE EK+EIDFNRLFL C KVHLAKRLHALLVVSGK Q+IF SAKL
Sbjct: 26  SATTTAPTPKYYLDEVEIEKKEIDFNRLFLVCKKVHLAKRLHALLVVSGKVQSIFLSAKL 85

Query: 119 INLYAFLGDVSFARHTFNQIQTKDVYTWNSMISAYTRIGRFHDAVDCFYEFLSTSILQPD 178
           INLYAFLGDVSFAR TF+QIQ KDVYTWNSMISAY RIG FH+AVDCF+EF+STSILQPD
Sbjct: 86  INLYAFLGDVSFARRTFDQIQAKDVYTWNSMISAYARIGHFHEAVDCFHEFMSTSILQPD 145

Query: 179 HYTFPPVIRACGNLDDGKKIHCLVLKLGFECDVFIAASFIHFYSRFGFVNLARNLFDNMM 238
           +YTFPPVIRACGNLDDGKKIHCL LKLGFECDVFIAAS IHFYSRFGFVNLARNLFD++M
Sbjct: 146 YYTFPPVIRACGNLDDGKKIHCLALKLGFECDVFIAASLIHFYSRFGFVNLARNLFDSLM 205

Query: 239 IRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVTVSSLLPICAQLDDIISGIL 298
           IRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVT SSLLPICAQLDDIISG+L
Sbjct: 206 IRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVTFSSLLPICAQLDDIISGVL 265

Query: 299 IHVYAVKLGLEFDLFVCNALINMYAKFGELRSAETIFNQMEVRDIVSWNSLIAAFEQNKE 358
           IHVYA+KLGLEFDLFVCNALINMYAKFGEL SAETIFNQ+E +DIVSWNSLIAAFEQNKE
Sbjct: 266 IHVYAIKLGLEFDLFVCNALINMYAKFGELGSAETIFNQIEAKDIVSWNSLIAAFEQNKE 325

Query: 359 PVVALGVYNKMHSIGVVPDLLTLVSLASVAAELGNFLSSRSIHGFVTRRGWFLHDVVIGN 418
           PVVALG+Y KMH+ G VPDLLTLVSLASVAAELGNFLSSRSIHGFVTR+GWFL DVVIGN
Sbjct: 326 PVVALGLYKKMHATGAVPDLLTLVSLASVAAELGNFLSSRSIHGFVTRKGWFLQDVVIGN 385

Query: 419 AIIDMYAKLGFIDSARKVFEGLPVKDVVSWNTLITGYSQNGLANEAIDVSHLMKDYSDAV 478
           AIIDMYAKLG+IDSARKVFE LPVKDVVSWNTLITGYSQNGLANEAIDV HLM DYSDAV
Sbjct: 386 AIIDMYAKLGYIDSARKVFEELPVKDVVSWNTLITGYSQNGLANEAIDVYHLMNDYSDAV 445

Query: 479 PNQGTW------------------------------------------------------ 538
           PNQGTW                                                      
Sbjct: 446 PNQGTWVSILTAYSQIGALKQGMKTHGLLIKNFLYFDIFVGTCLIDMYGKCGRLADALSL 505

Query: 539 ------------------------------------------------------------ 565
                                                                       
Sbjct: 506 FYEIPHKSSVSWNAIISCHGLHGYGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLV 565

BLAST of HG10019137 vs. ExPASy TrEMBL
Match: A0A5A7VEX7 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold25G001700 PE=3 SV=1)

HSP 1 Score: 828.6 bits (2139), Expect = 1.6e-236
Identity = 459/800 (57.38%), Positives = 484/800 (60.50%), Query Frame = 0

Query: 52  YQSLLPNSATTTAPKYYLDGVENEKREIDFNRLFLFCTKVHLAKRLHALLVVSGKAQNIF 111
           +Q+     + TTAPKYYLDGVENEKREIDFNRLFLFCTKVHLAK+LH LLVVSGK Q+IF
Sbjct: 9   FQACCSLYSATTAPKYYLDGVENEKREIDFNRLFLFCTKVHLAKQLHGLLVVSGKTQSIF 68

Query: 112 FSAKLINLYAFLGDVSFARHTFNQIQTKDVYTWNSMISAYTRIGRFHDAVDCFYEFLSTS 171
            SAKLIN YAFLGD+S AR TF+QIQTKDVYTWNSMISAY RIG FH A+DCF EFLSTS
Sbjct: 69  LSAKLINRYAFLGDISHARLTFDQIQTKDVYTWNSMISAYARIGHFHAAIDCFNEFLSTS 128

Query: 172 ILQPDHYTFPPVIRACGNLDDGKKIHCLVLKLGFECDVFIAASFIHFYSRFGFVNLARNL 231
           ILQ DHYTFPPVIRACGNLDDG+KIHCLVLKLGFECDV+IAASFIHFYSRFGFV+LA NL
Sbjct: 129 ILQSDHYTFPPVIRACGNLDDGRKIHCLVLKLGFECDVYIAASFIHFYSRFGFVSLACNL 188

Query: 232 FDNMMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVTVSSLLPICAQLDDI 291
           FDNMMIRDIGTWNAMISGFCLN KV EALEVFDEMR KSVTMDSVT+SSLLPICAQLDDI
Sbjct: 189 FDNMMIRDIGTWNAMISGFCLNDKVAEALEVFDEMRLKSVTMDSVTISSLLPICAQLDDI 248

Query: 292 ISGILIHVYAVKLGLEFDLFVCNALINMYAKFGELRSAETIFNQMEVRDIVSWNSLIAAF 351
           I G+LIHVYA+KLGLEFDLFVCNALINMYAKFGELRSAETIFNQM+VRDIVSWNSL+AAF
Sbjct: 249 IWGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELRSAETIFNQMKVRDIVSWNSLLAAF 308

Query: 352 EQNKEPVVALGVYNKMHSIGVVPDLLTLVSLASVAAELGNFLSSRSIHGFVTRRGWFLHD 411
           EQNK+PV+ALGVYNKMHSIGVVPDLLTLVSLASV AELGNFLSSRSIHGFVTRR WFLHD
Sbjct: 309 EQNKKPVIALGVYNKMHSIGVVPDLLTLVSLASVIAELGNFLSSRSIHGFVTRRCWFLHD 368

Query: 412 VVIGNAIIDMYAKLGFIDSARKVFEGLPVKDVVSWNTLITGYSQNGLANEAIDVSHLMKD 471
           + +GNAIIDMYAKLGFIDSARKVFEGLPVKDV+SWN+LITGYSQNGLANEAIDV   M+D
Sbjct: 369 IALGNAIIDMYAKLGFIDSARKVFEGLPVKDVISWNSLITGYSQNGLANEAIDVYCSMRD 428

Query: 472 YSDAVPNQGTW------------------------------------------------- 531
           YS+AVPNQGTW                                                 
Sbjct: 429 YSNAVPNQGTWVSILTALSQLGALKQGMKTHGQLIKNFLYFDIFVSTCLIDMYGKCGRLA 488

Query: 532 ------------------------------------------------------------ 565
                                                                       
Sbjct: 489 DALSLFYEVPHKSSVSWNAIISCHGLHGYGLKAVKLFKEMQSEGVKPDHITFVSLLSACS 548

BLAST of HG10019137 vs. ExPASy TrEMBL
Match: A0A5D3CYS9 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1706G00400 PE=3 SV=1)

HSP 1 Score: 828.2 bits (2138), Expect = 2.1e-236
Identity = 458/800 (57.25%), Positives = 484/800 (60.50%), Query Frame = 0

Query: 52  YQSLLPNSATTTAPKYYLDGVENEKREIDFNRLFLFCTKVHLAKRLHALLVVSGKAQNIF 111
           +Q+     + TTAPKYYLDGVENEKREIDFNRLFLFCTKVHLAK+LH LLVVSGK Q+IF
Sbjct: 9   FQACCSLYSATTAPKYYLDGVENEKREIDFNRLFLFCTKVHLAKQLHGLLVVSGKTQSIF 68

Query: 112 FSAKLINLYAFLGDVSFARHTFNQIQTKDVYTWNSMISAYTRIGRFHDAVDCFYEFLSTS 171
            SAKLIN YAFLGD+S AR TF+QIQTKDVYTWNSMISAY RIG FH A+DCF EFLSTS
Sbjct: 69  LSAKLINRYAFLGDISHARLTFDQIQTKDVYTWNSMISAYARIGHFHAAIDCFNEFLSTS 128

Query: 172 ILQPDHYTFPPVIRACGNLDDGKKIHCLVLKLGFECDVFIAASFIHFYSRFGFVNLARNL 231
           ILQ DHYTFPPVIRACGNLDDG+KIHCLVLKLGFECDV+IAASFIHFYSRFGFV+LA NL
Sbjct: 129 ILQSDHYTFPPVIRACGNLDDGRKIHCLVLKLGFECDVYIAASFIHFYSRFGFVSLACNL 188

Query: 232 FDNMMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVTVSSLLPICAQLDDI 291
           FDNMMIRDIGTWNAMISGFCLN KV EALEVFDEMR KSVTMDSVT+SSLLPICAQLDDI
Sbjct: 189 FDNMMIRDIGTWNAMISGFCLNDKVAEALEVFDEMRLKSVTMDSVTISSLLPICAQLDDI 248

Query: 292 ISGILIHVYAVKLGLEFDLFVCNALINMYAKFGELRSAETIFNQMEVRDIVSWNSLIAAF 351
           I G+LIHVYA+KLGLEFDLFVCNALINMYAKFGELRSAETIFNQM+VRDIVSWNSL+AAF
Sbjct: 249 IWGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELRSAETIFNQMKVRDIVSWNSLLAAF 308

Query: 352 EQNKEPVVALGVYNKMHSIGVVPDLLTLVSLASVAAELGNFLSSRSIHGFVTRRGWFLHD 411
           EQNK+PV+ALGVYNKMHSIG+VPDLLTLVSLASV AELGNFLSSRSIHGFVTRR WFLHD
Sbjct: 309 EQNKKPVIALGVYNKMHSIGIVPDLLTLVSLASVIAELGNFLSSRSIHGFVTRRCWFLHD 368

Query: 412 VVIGNAIIDMYAKLGFIDSARKVFEGLPVKDVVSWNTLITGYSQNGLANEAIDVSHLMKD 471
           + +GNAIIDMYAKLGFIDSARKVFEGLPVKDV+SWN+LITGYSQNGLANEAIDV   M+D
Sbjct: 369 IALGNAIIDMYAKLGFIDSARKVFEGLPVKDVISWNSLITGYSQNGLANEAIDVYCSMRD 428

Query: 472 YSDAVPNQGTW------------------------------------------------- 531
           YS+AVPNQGTW                                                 
Sbjct: 429 YSNAVPNQGTWVSILTALSQLGALKQGMKTHGQLIKNFLYFDIFVSTCLIDMYGKCGRLA 488

Query: 532 ------------------------------------------------------------ 565
                                                                       
Sbjct: 489 DALSLFYEVPHKSSVSWNAIISCHGLHGYGLKAVKLFKEMQSEGVKPDHITFVSLLSACS 548

BLAST of HG10019137 vs. ExPASy TrEMBL
Match: A0A1S3C233 (pentatricopeptide repeat-containing protein At4g33990 OS=Cucumis melo OX=3656 GN=LOC103495766 PE=3 SV=1)

HSP 1 Score: 828.2 bits (2138), Expect = 2.1e-236
Identity = 458/800 (57.25%), Positives = 484/800 (60.50%), Query Frame = 0

Query: 52  YQSLLPNSATTTAPKYYLDGVENEKREIDFNRLFLFCTKVHLAKRLHALLVVSGKAQNIF 111
           +Q+     + TTAPKYYLDGVENEKREIDFNRLFLFCTKVHLAK+LH LLVVSGK Q+IF
Sbjct: 18  FQACCSLYSATTAPKYYLDGVENEKREIDFNRLFLFCTKVHLAKQLHGLLVVSGKTQSIF 77

Query: 112 FSAKLINLYAFLGDVSFARHTFNQIQTKDVYTWNSMISAYTRIGRFHDAVDCFYEFLSTS 171
            SAKLIN YAFLGD+S AR TF+QIQTKDVYTWNSMISAY RIG FH A+DCF EFLSTS
Sbjct: 78  LSAKLINRYAFLGDISHARLTFDQIQTKDVYTWNSMISAYARIGHFHAAIDCFNEFLSTS 137

Query: 172 ILQPDHYTFPPVIRACGNLDDGKKIHCLVLKLGFECDVFIAASFIHFYSRFGFVNLARNL 231
           ILQ DHYTFPPVIRACGNLDDG+KIHCLVLKLGFECDV+IAASFIHFYSRFGFV+LA NL
Sbjct: 138 ILQSDHYTFPPVIRACGNLDDGRKIHCLVLKLGFECDVYIAASFIHFYSRFGFVSLACNL 197

Query: 232 FDNMMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVTVSSLLPICAQLDDI 291
           FDNMMIRDIGTWNAMISGFCLN KV EALEVFDEMR KSVTMDSVT+SSLLPICAQLDDI
Sbjct: 198 FDNMMIRDIGTWNAMISGFCLNDKVAEALEVFDEMRLKSVTMDSVTISSLLPICAQLDDI 257

Query: 292 ISGILIHVYAVKLGLEFDLFVCNALINMYAKFGELRSAETIFNQMEVRDIVSWNSLIAAF 351
           I G+LIHVYA+KLGLEFDLFVCNALINMYAKFGELRSAETIFNQM+VRDIVSWNSL+AAF
Sbjct: 258 IWGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELRSAETIFNQMKVRDIVSWNSLLAAF 317

Query: 352 EQNKEPVVALGVYNKMHSIGVVPDLLTLVSLASVAAELGNFLSSRSIHGFVTRRGWFLHD 411
           EQNK+PV+ALGVYNKMHSIG+VPDLLTLVSLASV AELGNFLSSRSIHGFVTRR WFLHD
Sbjct: 318 EQNKKPVIALGVYNKMHSIGIVPDLLTLVSLASVIAELGNFLSSRSIHGFVTRRCWFLHD 377

Query: 412 VVIGNAIIDMYAKLGFIDSARKVFEGLPVKDVVSWNTLITGYSQNGLANEAIDVSHLMKD 471
           + +GNAIIDMYAKLGFIDSARKVFEGLPVKDV+SWN+LITGYSQNGLANEAIDV   M+D
Sbjct: 378 IALGNAIIDMYAKLGFIDSARKVFEGLPVKDVISWNSLITGYSQNGLANEAIDVYCSMRD 437

Query: 472 YSDAVPNQGTW------------------------------------------------- 531
           YS+AVPNQGTW                                                 
Sbjct: 438 YSNAVPNQGTWVSILTALSQLGALKQGMKTHGQLIKNFLYFDIFVSTCLIDMYGKCGRLA 497

Query: 532 ------------------------------------------------------------ 565
                                                                       
Sbjct: 498 DALSLFYEVPHKSSVSWNAIISCHGLHGYGLKAVKLFKEMQSEGVKPDHITFVSLLSACS 557

BLAST of HG10019137 vs. ExPASy TrEMBL
Match: A0A6J1J817 (pentatricopeptide repeat-containing protein At4g33990 OS=Cucurbita maxima OX=3661 GN=LOC111482594 PE=3 SV=1)

HSP 1 Score: 827.0 bits (2135), Expect = 4.7e-236
Identity = 460/793 (58.01%), Positives = 478/793 (60.28%), Query Frame = 0

Query: 59  SATTTAPKYYLDGVENEKREIDFNRLFLFCTKVHLAKRLHALLVVSGKAQNIFFSAKLIN 118
           SAT  APKYY D VE EK+EIDFNRLFL C KVHLAKRLHALLVVSGK Q+IF SAKLIN
Sbjct: 26  SATAPAPKYYFDEVEIEKKEIDFNRLFLVCKKVHLAKRLHALLVVSGKVQSIFLSAKLIN 85

Query: 119 LYAFLGDVSFARHTFNQIQTKDVYTWNSMISAYTRIGRFHDAVDCFYEFLSTSILQPDHY 178
           LYAFLGDVSFAR TF+QIQ KDVYTWNSMISAY RIG FH+AVDCF+EF+STSILQPD+Y
Sbjct: 86  LYAFLGDVSFARRTFDQIQAKDVYTWNSMISAYARIGHFHEAVDCFHEFMSTSILQPDYY 145

Query: 179 TFPPVIRACGNLDDGKKIHCLVLKLGFECDVFIAASFIHFYSRFGFVNLARNLFDNMMIR 238
           TFPPVIRACGNLDDGKKIHCL LKLGFECDVFIAAS IHFYSRFGFVNLARNLFD++MIR
Sbjct: 146 TFPPVIRACGNLDDGKKIHCLALKLGFECDVFIAASLIHFYSRFGFVNLARNLFDSLMIR 205

Query: 239 DIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVTVSSLLPICAQLDDIISGILIH 298
           DIGTWNAMISGFCLNGKVVEALEVFDEMRFKSV MDSVT SSLLPICAQLDDIISG+LIH
Sbjct: 206 DIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVNMDSVTFSSLLPICAQLDDIISGVLIH 265

Query: 299 VYAVKLGLEFDLFVCNALINMYAKFGELRSAETIFNQMEVRDIVSWNSLIAAFEQNKEPV 358
           VYA+KLGLEFDLFVCNALINMYAKFGEL SAETIFNQ+E +DIVSWNSLIAAFEQNKEPV
Sbjct: 266 VYAIKLGLEFDLFVCNALINMYAKFGELGSAETIFNQIEAKDIVSWNSLIAAFEQNKEPV 325

Query: 359 VALGVYNKMHSIGVVPDLLTLVSLASVAAELGNFLSSRSIHGFVTRRGWFLHDVVIGNAI 418
           VALG+Y KMH+ G VPDLLTLVSLASVAAELGNFLSSRSIHGFVTR+GWFL DVVIGNAI
Sbjct: 326 VALGLYKKMHATGAVPDLLTLVSLASVAAELGNFLSSRSIHGFVTRKGWFLQDVVIGNAI 385

Query: 419 IDMYAKLGFIDSARKVFEGLPVKDVVSWNTLITGYSQNGLANEAIDVSHLMKDYSDAVPN 478
           IDMYAKLG+IDSARKVFE LPVKDVVSWNTLITGYSQNGLANEAIDV H M DYSDAVPN
Sbjct: 386 IDMYAKLGYIDSARKVFEELPVKDVVSWNTLITGYSQNGLANEAIDVYHSMNDYSDAVPN 445

Query: 479 QGTW-------------------------------------------------------- 538
           QGTW                                                        
Sbjct: 446 QGTWVSILTAYSQIGALKQGMKTHGLLIKNFLYFDIFVGTCLIDMYGKCGRLADALSLFY 505

Query: 539 ------------------------------------------------------------ 565
                                                                       
Sbjct: 506 EIPHKSSVSWNSIISCHGVHGYGLRAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDE 565

BLAST of HG10019137 vs. TAIR 10
Match: AT4G33990.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 479.6 bits (1233), Expect = 3.6e-135
Identity = 282/779 (36.20%), Positives = 370/779 (47.50%), Query Frame = 0

Query: 74  NEKREI-DFNRLFLFCTKVHLAKRLHALLVVSGKAQNIFFSAKLINLYAFLGDVSFARHT 133
           NE +EI D + LF +CT +  AK LHA LVVS + QN+  SAKL+NLY +LG+V+ ARHT
Sbjct: 49  NESKEIDDVHTLFRYCTNLQSAKCLHARLVVSKQIQNVCISAKLVNLYCYLGNVALARHT 108

Query: 134 FNQIQTKDVYTWNSMISAYTRIGRFHDAVDCFYEFLSTSILQPDHYTFPPVIRACGNLDD 193
           F+ IQ +DVY WN MIS Y R G   + + CF  F+ +S L PD+ TFP V++AC  + D
Sbjct: 109 FDHIQNRDVYAWNLMISGYGRAGNSSEVIRCFSLFMLSSGLTPDYRTFPSVLKACRTVID 168

Query: 194 GKKIHCLVLKLGFECDVFIAASFIHFYSRFGFVNLARNLFDNMMIRDIGTWNAMISGFCL 253
           G KIHCL LK GF  DV++AAS IH YSR+  V  AR LFD M +RD+G+WNAMISG+C 
Sbjct: 169 GNKIHCLALKFGFMWDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGYCQ 228

Query: 254 NGKVVEALEVFDEMRFKSVTMDSVTVSSLLPICAQLDDIISGILIHVYAVKLGLEFDLFV 313
           +G   EAL + + +R     MDSVTV SLL  C +  D   G+ IH Y++K GLE +LFV
Sbjct: 229 SGNAKEALTLSNGLR----AMDSVTVVSLLSACTEAGDFNRGVTIHSYSIKHGLESELFV 288

Query: 314 CNALINMYAKFGELRSAETIFNQMEVRDIVSWNSLIAAFEQNKEPVVALGVYNKMHSIGV 373
            N LI++YA+FG LR  + +F++M VRD++SWNS+I A+E N++P+ A+ ++ +M    +
Sbjct: 289 SNKLIDLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEMRLSRI 348

Query: 374 VPDLLTLVSLASVAAELGNFLSSRSIHGFVTRRGWFLHDVVIGNAIIDMYAKLGFIDSAR 433
            PD LTL+SLAS+ ++LG+  + RS+ GF  R+GWFL D+ IGNA++ MYAKLG +DSAR
Sbjct: 349 QPDCLTLISLASILSQLGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMYAKLGLVDSAR 408

Query: 434 KVFEGLPVKDVVSWNTLITGYSQNGLANEAIDVSHLMKDYSDAVPNQGTW---------- 493
            VF  LP  DV+SWNT+I+GY+QNG A+EAI++ ++M++  +   NQGTW          
Sbjct: 409 AVFNWLPNTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTWVSVLPACSQA 468

Query: 494 ------------------------------------------------------------ 553
                                                                       
Sbjct: 469 GALRQGMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPRVNSVPWNTLI 528

Query: 554 ------------------------------------------------------------ 565
                                                                       
Sbjct: 529 ACHGFHGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWCFEMMQTDYGI 588

BLAST of HG10019137 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22 )

HSP 1 Score: 289.7 bits (740), Expect = 5.2e-78
Identity = 189/659 (28.68%), Positives = 298/659 (45.22%), Query Frame = 0

Query: 95  KRLHALLVVSGKAQNIFFSAKLINLYAFLGDVSFARHTFNQIQTKDVYTWNSMISAYTRI 154
           K++HA L+V G   + F   KLI+  +  GD++FAR  F+ +    ++ WN++I  Y+R 
Sbjct: 38  KQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSRN 97

Query: 155 GRFHDAVDCFYEFLSTSILQPDHYTFPPVIRACGNLDD---GKKIHCLVLKLGFECDVFI 214
             F DA+   Y  +  + + PD +TFP +++AC  L     G+ +H  V +LGF+ DVF+
Sbjct: 98  NHFQDAL-LMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFV 157

Query: 215 AASFIHFYSRFGFVNLARNLFDNMMI--RDIGTWNAMISGFCLNGKVVEALEVFDEMRFK 274
               I  Y++   +  AR +F+ + +  R I +W A++S +  NG+ +EALE+F +MR  
Sbjct: 158 QNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKM 217

Query: 275 SVTMDSVTVSSLLPICAQLDDIISGILIHVYAVKLGLEFDLFVCNALINMYAKFGELRSA 334
            V  D V + S+L     L D+  G  IH   VK+GLE +  +  +L  MYAK G++ +A
Sbjct: 218 DVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATA 277

Query: 335 ETIFNQMEVRDIVSWNSLIAAFEQNKEPVVALGVYNKMHSIGVVPDLLTLVSLASVAAEL 394
           + +F++M+  +++ WN++I+ + +N     A+ ++++M +  V PD +++ S  S  A++
Sbjct: 278 KILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTISITSAISACAQV 337

Query: 395 GNFLSSRSIHGFVTRRGWFLHDVVIGNAIIDMYAKLGFIDSARKVFEGLPVKDVVSWNTL 454
           G+   +RS++ +V R   +  DV I +A+IDM+AK G ++ AR VF+    +DVV W+ +
Sbjct: 338 GSLEQARSMYEYVGRSD-YRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAM 397

Query: 455 ITGYSQNGLANEAIDV-------------------------------------------- 514
           I GY  +G A EAI +                                            
Sbjct: 398 IVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRMADHKI 457

Query: 515 -----------------SHLMKDYS----------------------------------- 565
                             HL + Y                                    
Sbjct: 458 NPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQ 517

BLAST of HG10019137 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 273.5 bits (698), Expect = 3.9e-73
Identity = 188/675 (27.85%), Positives = 299/675 (44.30%), Query Frame = 0

Query: 80  DFNRLFLFC---TKVHLAKRLHALLVVSGKAQNIFFSAKLINLYAFLGDVSFARHTFNQI 139
           +F  L   C    ++ + K +H LLV SG + ++F    L N+YA    V+ AR  F+++
Sbjct: 137 NFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRM 196

Query: 140 QTKDVYTWNSMISAYTRIGRFHDAVDCFYEFLSTSILQPDHYTFP---PVIRACGNLDDG 199
             +D+ +WN++++ Y++ G    A++   + +    L+P   T     P + A   +  G
Sbjct: 197 PERDLVSWNTIVAGYSQNGMARMALE-MVKSMCEENLKPSFITIVSVLPAVSALRLISVG 256

Query: 200 KKIHCLVLKLGFECDVFIAASFIHFYSRFGFVNLARNLFDNMMIRDIGTWNAMISGFCLN 259
           K+IH   ++ GF+  V I+ + +  Y++ G +  AR LFD M+ R++ +WN+MI  +  N
Sbjct: 257 KEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQN 316

Query: 260 GKVVEALEVFDEMRFKSVTMDSVTVSSLLPICAQLDDIISGILIHVYAVKLGLEFDLFVC 319
               EA+ +F +M  + V    V+V   L  CA L D+  G  IH  +V+LGL+ ++ V 
Sbjct: 317 ENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVV 376

Query: 320 NALINMYAKFGELRSAETIFNQMEVRDIVSWNSLIAAFEQNKEPVVALGVYNKMHSIGVV 379
           N+LI+MY K  E+ +A ++F +++ R +VSWN++I  F QN  P+ AL  +++M S  V 
Sbjct: 377 NSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVK 436

Query: 380 PDLLTLVSLASVAAELGNFLSSRSIHGFVTRRGWFLHDVVIGNAIIDMYAKLGFIDSARK 439
           PD  T VS+ +  AEL     ++ IHG V  R     +V +  A++DMYAK G I  AR 
Sbjct: 437 PDTFTYVSVITAIAELSITHHAKWIHG-VVMRSCLDKNVFVTTALVDMYAKCGAIMIARL 496

Query: 440 VFEGLPVKDVVSWNTLITGYSQNGLANEAIDV-----------------------SH--- 499
           +F+ +  + V +WN +I GY  +G    A+++                       SH   
Sbjct: 497 IFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGL 556

Query: 500 ----------LMKDYS------------DAVPNQG----TWD------------------ 559
                     + ++YS            D +   G     WD                  
Sbjct: 557 VEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAML 616

Query: 560 ------------------------------------------------------------ 565
                                                                       
Sbjct: 617 GACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRK 676

BLAST of HG10019137 vs. TAIR 10
Match: AT3G46790.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 270.8 bits (691), Expect = 2.5e-72
Identity = 180/587 (30.66%), Positives = 293/587 (49.91%), Query Frame = 0

Query: 73  ENEKREIDFNRLFLFC---TKVHLAKRLHALLVVSGKAQNIFFSAKLINLYAFLGDVSFA 132
           E+   +  +  L L C   + +  A R+H  ++ +G  Q+ F + KLI +Y+ LG V +A
Sbjct: 72  ESSPSQQTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGMYSDLGSVDYA 131

Query: 133 RHTFNQIQTKDVYTWNSMISAYTRIGRFHDAVDCFYEFLSTSILQPDHYTFPPVIRAC-- 192
           R  F++ + + +Y WN++  A T  G   + +  +++     + + D +T+  V++AC  
Sbjct: 132 RKVFDKTRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGV-ESDRFTYTYVLKACVA 191

Query: 193 -----GNLDDGKKIHCLVLKLGFECDVFIAASFIHFYSRFGFVNLARNLFDNMMIRDIGT 252
                 +L  GK+IH  + + G+   V+I  + +  Y+RFG V+ A  +F  M +R++ +
Sbjct: 192 SECTVNHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVS 251

Query: 253 WNAMISGFCLNGKVVEALEVFDEM--RFKSVTMDSVTVSSLLPICAQLDDIISGILIHVY 312
           W+AMI+ +  NGK  EAL  F EM    K  + +SVT+ S+L  CA L  +  G LIH Y
Sbjct: 252 WSAMIACYAKNGKAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLIHGY 311

Query: 313 AVKLGLEFDLFVCNALINMYAKFGELRSAETIFNQMEVRDIVSWNSLIAAFEQNKEPVVA 372
            ++ GL+  L V +AL+ MY + G+L   + +F++M  RD+VSWNSLI+++  +     A
Sbjct: 312 ILRRGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYGKKA 371

Query: 373 LGVYNKMHSIGVVPDLLTLVSLASV----------------------------------- 432
           + ++ +M + G  P  +T VS+                                      
Sbjct: 372 IQIFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVD 431

Query: 433 ----------AAEL-------------GNFLSSRSIHGFV------TRRGWFLHDVVIGN 492
                     AA++             G+ L S  IHG V      +RR + L     GN
Sbjct: 432 LLGRANRLDEAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGN 491

Query: 493 AII--DMYAKLGFIDSARKV--------FEGLPVKDVVSWNTLITGYSQNGLANEAIDVS 552
            ++  D+YA+    D  ++V         + LP +  +     +  +      N  ++  
Sbjct: 492 YVLLADIYAEAQMWDEVKRVKKLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFNPLMEQI 551

Query: 553 HL--------MKDYSDAVPNQGT-WDVEDDEKENILTSHSERLAMAFGIISTPPKTTLQI 565
           H         MK+       +G  +++E +EKE I+  HSE+LA+AFG+I+T     ++I
Sbjct: 552 HAFLVKLAEDMKEKGYIPQTKGVLYELETEEKERIVLGHSEKLALAFGLINTSKGEPIRI 611

BLAST of HG10019137 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 269.2 bits (687), Expect = 7.3e-72
Identity = 190/621 (30.60%), Positives = 275/621 (44.28%), Query Frame = 0

Query: 95  KRLHALLVVSGKAQNIFFSAKLINLYAFLGDVSFARHTFN-------------------- 154
           K++H+ +V  G   N+  S  L+N+YA  GD   A+  F+                    
Sbjct: 166 KKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQV 225

Query: 155 -----------QIQTKDVYTWNSMISAYTRIGRFHDAVDCFYEFLSTSILQPDHYTFPPV 214
                      Q+  +D+ TWNSMIS + + G    A+D F + L  S+L PD +T   V
Sbjct: 226 GQMDLAMAQFEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASV 285

Query: 215 IRACGNLDD---GKKIHCLVLKLGFECDVFIAASFIHFYSRFGFV--------------- 274
           + AC NL+    GK+IH  ++  GF+    +  + I  YSR G V               
Sbjct: 286 LSACANLEKLCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDL 345

Query: 275 ------------------NLARNLFDNMMIRDIGTWNAMISGFCLNGKVVEALEVFDEMR 334
                             N A+N+F ++  RD+  W AMI G+  +G   EA+ +F  M 
Sbjct: 346 KIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMV 405

Query: 335 FKSVTMDSVTVSSLLPICAQLDDIISGILIHVYAVKLGLEFDLFVCNALINMYAKFGELR 394
                 +S T++++L + + L  +  G  IH  AVK G  + + V NALI MYAK G + 
Sbjct: 406 GGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNIT 465

Query: 395 SAETIFNQMEV-RDIVSWNSLIAAFEQNKEPVVALGVYNKMHSIGVVPDLLTLVSLASVA 454
           SA   F+ +   RD VSW S+I A  Q+     AL ++  M   G+ PD +T V + S  
Sbjct: 466 SASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSAC 525

Query: 455 AELGNFLSSRSIHGFVTRRGWFLHDVVIGNAIIDMYAKLGFIDSARKVFEGLPVK-DVVS 514
              G     R     +      +  +     ++D++ + G +  A++  E +P++ DVV+
Sbjct: 526 THAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVT 585

Query: 515 WNTLITG----------------------------------YSQNGLANEA--------- 565
           W +L++                                   YS  G   EA         
Sbjct: 586 WGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKD 645

The following BLAST results are available for this feature:
Match Name      E-value    Identity  Description
KAG7014884.1    7.7e-257   56.93     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_038903939.1  2.9e-248   60.66     pentatricopeptide repeat-containing protein At4g33990 [Benincasa hispida] >XP_03...
XP_022923150.1  3.0e-237   58.49     pentatricopeptide repeat-containing protein At4g33990 [Cucurbita moschata]
KAG6576861.1    3.3e-236   58.01     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
KAA0066593.1    3.3e-236   57.38     pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]

Match Name      E-value    Identity  Description
O81767          5.0e-134   36.20     Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX...
Q9LTV8          7.3e-77    28.68     Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX...
Q3E6Q1          5.4e-72    27.85     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
Q9STF3          3.5e-71    30.66     Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidop...
Q9SHZ8          1.0e-70    30.60     Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX...

Match Name      E-value    Identity  Description
A0A6J1EAW2      1.5e-237   58.49     pentatricopeptide repeat-containing protein At4g33990 OS=Cucurbita moschata OX=3...
A0A5A7VEX7      1.6e-236   57.38     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A5D3CYS9      2.1e-236   57.25     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3C233      2.1e-236   57.25     pentatricopeptide repeat-containing protein At4g33990 OS=Cucumis melo OX=3656 GN...
A0A6J1J817      4.7e-236   58.01     pentatricopeptide repeat-containing protein At4g33990 OS=Cucurbita maxima OX=366...

Match Name      E-value    Identity  Description
AT4G33990.1     3.6e-135   36.20     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G12770.1     5.2e-78    28.68     mitochondrial editing factor 22
AT1G11290.1     3.9e-73    27.85     Pentatricopeptide repeat (PPR) superfamily protein
AT3G46790.1     2.5e-72    30.66     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G22070.1     7.3e-72    30.60     pentatricopeptide (PPR) repeat-containing protein
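
The Identity column in the summary tables matches the percentage printed in each HSP header in the alignment listings above. As a rough illustration only, the Python sketch below recomputes those percentages from the two header lines of one HSP. The function name `parse_hsp` and the returned keys are hypothetical and not part of this database; the regular expressions assume the header wording shown on this page and tolerate either spelling of the positives label.

```python
import re

# Patterns for the two HSP header lines as they appear in this record.
HSP_SCORE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
HSP_IDENT = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*Posi?tives\s*=\s*(\d+)/(\d+)"
)


def parse_hsp(score_line, identity_line):
    """Extract scores and recompute identity/positives percentages (hypothetical helper)."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("unrecognised HSP header")
    identical, aligned = int(i.group(1)), int(i.group(2))
    positives = int(i.group(4))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        # Percentages are taken over the alignment length, gaps included.
        "identity_pct": round(100 * identical / aligned, 2),
        "positives_pct": round(100 * positives / aligned, 2),
        "reported_identity_pct": float(i.group(3)),
    }


# Header lines of the Cucurbita moschata (A0A6J1EAW2) hit from this record.
hsp = parse_hsp(
    "HSP 1 Score: 832.0 bits (2148), Expect = 1.5e-237",
    "Identity = 465/795 (58.49%), Positives = 483/795 (60.75%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positives_pct"], hsp["evalue"])
```

Because the denominator is the full alignment length, long gapped regions at the C-terminal end of an alignment lower the reported identity even when the aligned core is nearly identical.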
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                    Source   Source Term      Source Description  Alignment
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041          PPR_2
    coord: 239..283, e-value: 8.2E-13, score: 48.3
    coord: 139..187, e-value: 1.7E-7, score: 31.4
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756        TIGR00756
    coord: 342..375, e-value: 1.8E-4, score: 19.5
    coord: 444..472, e-value: 6.2E-5, score: 20.9
    coord: 142..172, e-value: 2.4E-6, score: 25.3
    coord: 312..342, e-value: 2.1E-5, score: 22.4
    coord: 242..274, e-value: 2.9E-9, score: 34.5
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535          PPR
    coord: 342..372, e-value: 3.2E-4, score: 20.7
    coord: 312..340, e-value: 6.4E-5, score: 22.9
    coord: 416..437, e-value: 0.063, score: 13.6
    coord: 444..470, e-value: 6.8E-5, score: 22.8
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375          PPR
    coord: 340..374, score: 10.314641
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375          PPR
    coord: 442..472, score: 9.328124
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375          PPR
    coord: 239..273, score: 12.210946
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375          PPR
    coord: 140..174, score: 10.292718
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375          PPR
    coord: 309..339, score: 9.141782
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   1.25.40.10       Tetratricopeptide repeat domain
    coord: 395..485, e-value: 3.9E-13, score: 51.5
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   1.25.40.10       Tetratricopeptide repeat domain
    coord: 193..295, e-value: 1.9E-19, score: 71.7
    coord: 76..192, e-value: 4.3E-14, score: 54.2
    coord: 296..394, e-value: 1.3E-18, score: 68.9
IPR032867  DYW domain                                         PFAM     PF14432          DYW_deaminase
    coord: 481..554, e-value: 1.1E-28, score: 99.6
None       No IPR available                                   PANTHER  PTHR47926        PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 216..385
None       No IPR available                                   PANTHER  PTHR47926        PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 59..211
None       No IPR available                                   PANTHER  PTHR47926        PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 322..424
None       No IPR available                                   PANTHER  PTHR47926:SF212  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 486..550
    coord: 59..211
    coord: 423..482
None       No IPR available                                   PANTHER  PTHR47926        PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 486..550
    coord: 423..482
None       No IPR available                                   PANTHER  PTHR47926:SF212  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 216..385
None       No IPR available                                   PANTHER  PTHR47926:SF212  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 322..424
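
Each alignment entry above gives the position of a domain hit on the protein as `coord: start..end`. As a minimal sketch, assuming these coordinates are 1-based and inclusive (an assumption; this page does not state the convention), the Python snippet below sorts the five PROSITE PS51375 hits along the protein and reports their lengths. The helper names `parse_coords` and `span_lengths` are hypothetical.

```python
import re

# Matches entries such as "coord: 239..283".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return (start, end) pairs for every 'coord: a..b' in text, sorted by start."""
    return sorted((int(a), int(b)) for a, b in COORD.findall(text))


def span_lengths(coords):
    """Length of each hit in residues, assuming inclusive coordinates."""
    return [end - start + 1 for start, end in coords]


# The five PROSITE PS51375 (PPR) hits listed for this gene.
prosite_ppr = """
coord: 340..374
coord: 442..472
coord: 239..273
coord: 140..174
coord: 309..339
"""

hits = parse_coords(prosite_ppr)
for (start, end), length in zip(hits, span_lengths(hits)):
    print(f"{start}..{end}  {length} residues")
```

Sorting by start position makes it easier to see that the PPR hits lie upstream of the DYW domain hit at 481..554.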

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10019137.1  HG10019137.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding
molecular_function  GO:0008270      zinc ion binding