Moc01g21750 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc01g21750
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Pentatricopeptide repeat-containing protein
Location: chr1: 15127056 .. 15127670 (-)
RNA-Seq Expression: Moc01g21750
Synteny: Moc01g21750
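
As a quick consistency check on the Location field, the span length can be derived from the coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the page does not state its convention), and the helper names `parse_location` and `span_length` are hypothetical, introduced only for illustration.

```python
import re

def parse_location(text):
    """Parse a string such as 'chr1: 15127056 .. 15127670 (-)'.

    Returns (seqid, start, end, strand). The layout is inferred from this
    page only; other records may be formatted differently.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates.
    return end - start + 1

seqid, start, end, strand = parse_location("chr1: 15127056 .. 15127670 (-)")
length = span_length(start, end)
# If the whole span is coding, one codon is the stop codon.
residues = length // 3 - 1
print(seqid, strand, length, residues)  # chr1 - 615 204
```

Under that assumption the span is 615 bp, i.e. 204 codons plus a stop codon, which agrees with the 204-residue alignment length reported for the self-hit in the Homology section.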
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGAAGCAGCAGCAAGTCAAAGCCGATGCGATCACTTTCCTCGGTGTCCTGTCCTCTTGTAGGCATGCTGGGCTTGTGGAAGAGGGCAGATGTTACTTCAATCTAATGCTCGAGCTCGGTTTTAAACCAGACTTGGATCATTATTCGTGTGTTATTGATCTGTTTGGTCGAGCGGGGCTGCTGACAGAGGCTCAGAACTTCATTGAGAACATGCCCATATCTCCGAATTCAATCATTTGGGGATCGCTTCTCTCTGCTTGCAGGCTTCATGGGAATGTCTGGATAGGAATCCAGGCTGCAGAGAGTAGATTGTTGCTGCAACCCGACTGCGCGTCGACGCACTTGCAGTTGGCTAATCTGTATGCAAGAGCAGGGCAGTTGGATGATGCTGCAAGATTGAGGAAGATGATGAAAGAGAGAGGGCTGAAGACTGCTCCTGGTTATAGCTGGATTGAGATTCAGAATAGAGTTTATAGATTTAAAGCAGAAGATAAGTCAAACCCTCTAATGGTTGAAATTTTTGGTCTTATGGATGGCATAGTGAAACACATGAGATGTGTAGGCTGTCCTCCTGAAGAGGACGACTTTGATGTTTTACATGAAACATTTTGA

mRNA sequence

ATGGGGAAGCAGCAGCAAGTCAAAGCCGATGCGATCACTTTCCTCGGTGTCCTGTCCTCTTGTAGGCATGCTGGGCTTGTGGAAGAGGGCAGATGTTACTTCAATCTAATGCTCGAGCTCGGTTTTAAACCAGACTTGGATCATTATTCGTGTGTTATTGATCTGTTTGGTCGAGCGGGGCTGCTGACAGAGGCTCAGAACTTCATTGAGAACATGCCCATATCTCCGAATTCAATCATTTGGGGATCGCTTCTCTCTGCTTGCAGGCTTCATGGGAATGTCTGGATAGGAATCCAGGCTGCAGAGAGTAGATTGTTGCTGCAACCCGACTGCGCGTCGACGCACTTGCAGTTGGCTAATCTGTATGCAAGAGCAGGGCAGTTGGATGATGCTGCAAGATTGAGGAAGATGATGAAAGAGAGAGGGCTGAAGACTGCTCCTGGTTATAGCTGGATTGAGATTCAGAATAGAGTTTATAGATTTAAAGCAGAAGATAAGTCAAACCCTCTAATGGTTGAAATTTTTGGTCTTATGGATGGCATAGTGAAACACATGAGATGTGTAGGCTGTCCTCCTGAAGAGGACGACTTTGATGTTTTACATGAAACATTTTGA

Coding sequence (CDS)

ATGGGGAAGCAGCAGCAAGTCAAAGCCGATGCGATCACTTTCCTCGGTGTCCTGTCCTCTTGTAGGCATGCTGGGCTTGTGGAAGAGGGCAGATGTTACTTCAATCTAATGCTCGAGCTCGGTTTTAAACCAGACTTGGATCATTATTCGTGTGTTATTGATCTGTTTGGTCGAGCGGGGCTGCTGACAGAGGCTCAGAACTTCATTGAGAACATGCCCATATCTCCGAATTCAATCATTTGGGGATCGCTTCTCTCTGCTTGCAGGCTTCATGGGAATGTCTGGATAGGAATCCAGGCTGCAGAGAGTAGATTGTTGCTGCAACCCGACTGCGCGTCGACGCACTTGCAGTTGGCTAATCTGTATGCAAGAGCAGGGCAGTTGGATGATGCTGCAAGATTGAGGAAGATGATGAAAGAGAGAGGGCTGAAGACTGCTCCTGGTTATAGCTGGATTGAGATTCAGAATAGAGTTTATAGATTTAAAGCAGAAGATAAGTCAAACCCTCTAATGGTTGAAATTTTTGGTCTTATGGATGGCATAGTGAAACACATGAGATGTGTAGGCTGTCCTCCTGAAGAGGACGACTTTGATGTTTTACATGAAACATTTTGA

Protein sequence

MGKQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPDLDHYSCVIDLFGRAGLLTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLANLYARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDGIVKHMRCVGCPPEEDDFDVLHETF
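
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not the pipeline used by the database; the function name `translate` is hypothetical, and only the first ten and last five codons of the CDS shown above are used as test input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS listed above.
print(translate("ATGGGGAAGCAGCAGCAAGTCAAAGCCGAT"))  # MGKQQQVKAD
# Last five codons of the CDS, ending in the TGA stop codon.
print(translate("CATGAAACATTTTGA"))  # HETF
```

Both fragments match the start (MGKQQQVKAD) and end (HETF) of the protein sequence above. Because the listed sequence already begins with ATG, it appears to be given in coding orientation even though the gene lies on the minus strand.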
Homology
BLAST of Moc01g21750 vs. NCBI nr
Match: XP_022143730.1 (pentatricopeptide repeat-containing protein At2g37320 [Momordica charantia])

HSP 1 Score: 429.1 bits (1102), Expect = 2.1e-116
Identity = 204/204 (100.00%), Positives = 204/204 (100.00%), Query Frame = 0

Query: 1   MGKQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPDLDHYSCVIDLFGRAG 60
           MGKQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPDLDHYSCVIDLFGRAG
Sbjct: 324 MGKQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPDLDHYSCVIDLFGRAG 383

Query: 61  LLTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLAN 120
           LLTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLAN
Sbjct: 384 LLTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLAN 443

Query: 121 LYARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDG 180
           LYARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDG
Sbjct: 444 LYARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDG 503

Query: 181 IVKHMRCVGCPPEEDDFDVLHETF 204
           IVKHMRCVGCPPEEDDFDVLHETF
Sbjct: 504 IVKHMRCVGCPPEEDDFDVLHETF 527
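
The percentages on each HSP statistics line follow directly from the match counts (matches divided by alignment length). The sketch below recomputes them from such a line; the function name `hsp_percentages` is hypothetical, and the pattern deliberately accepts both "Positives" and the "Postives" spelling that appears in this export.

```python
import re

# Matches e.g. "Identity = 170/196" and "Positives = 184/196" (or "Postives").
STAT = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")

def hsp_percentages(line):
    """Return identity/positives percentages recomputed from the raw counts."""
    result = {}
    for label, matched, aligned in STAT.findall(line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(aligned), 2)
    return result

line = "Identity = 170/196 (86.73%), Postives = 184/196 (93.88%), Query Frame = 0"
print(hsp_percentages(line))  # {'identity': 86.73, 'positives': 93.88}
```

Recomputing from the counts is a cheap way to confirm that the fractions and the printed percentages of a report agree.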

BLAST of Moc01g21750 vs. NCBI nr
Match: KAG7033868.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 363.6 bits (932), Expect = 1.1e-96
Identity = 170/196 (86.73%), Positives = 184/196 (93.88%), Query Frame = 0

Query: 1   MGKQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPDLDHYSCVIDLFGRAG 60
           M KQQQV+AD ITFLGVLSSCRH GLVEEGR YFNLM+ELG KP+LDHYSCVIDL GRAG
Sbjct: 317 MRKQQQVEADGITFLGVLSSCRHGGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAG 376

Query: 61  LLTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLAN 120
           LL EAQNFIENMPISPNSI+WGSLLSACRLHGNVWIG++AAESRLLLQPDCASTHLQLAN
Sbjct: 377 LLKEAQNFIENMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLAN 436

Query: 121 LYARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDG 180
           LYARAG LD+AARLRKMMK++GLKTAPGYSWIEIQN+VYRFKAEDKSNP+M+EIFG+MDG
Sbjct: 437 LYARAGYLDNAARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMIEIFGVMDG 496

Query: 181 IVKHMRCVGCPPEEDD 196
           +V HMR VGC PE DD
Sbjct: 497 MVNHMRSVGCVPEVDD 512

BLAST of Moc01g21750 vs. NCBI nr
Match: KAG6603691.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 363.6 bits (932), Expect = 1.1e-96
Identity = 170/196 (86.73%), Positives = 184/196 (93.88%), Query Frame = 0

Query: 1   MGKQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPDLDHYSCVIDLFGRAG 60
           M KQQQV+AD ITFLGVLSSCRH GLVEEGR YFNLM+ELG KP+LDHYSCVIDL GRAG
Sbjct: 324 MRKQQQVEADGITFLGVLSSCRHGGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAG 383

Query: 61  LLTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLAN 120
           LL EAQNFIENMPISPNSI+WGSLLSACRLHGNVWIG++AAESRLLLQPDCASTHLQLAN
Sbjct: 384 LLKEAQNFIENMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLAN 443

Query: 121 LYARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDG 180
           LYARAG LD+AARLRKMMK++GLKTAPGYSWIEIQN+VYRFKAEDKSNP+M+EIFG+MDG
Sbjct: 444 LYARAGYLDNAARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMIEIFGVMDG 503

Query: 181 IVKHMRCVGCPPEEDD 196
           +V HMR VGC PE DD
Sbjct: 504 MVNHMRSVGCVPEVDD 519

BLAST of Moc01g21750 vs. NCBI nr
Match: XP_022949850.1 (pentatricopeptide repeat-containing protein At2g37320 [Cucurbita moschata])

HSP 1 Score: 361.3 bits (926), Expect = 5.5e-96
Identity = 169/196 (86.22%), Positives = 183/196 (93.37%), Query Frame = 0

Query: 1   MGKQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPDLDHYSCVIDLFGRAG 60
           M KQQQV+AD ITFLGVLSSCRH GLVEEGR YFNLM+ELG KP+LDHYSCVIDL GRAG
Sbjct: 324 MRKQQQVEADGITFLGVLSSCRHGGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAG 383

Query: 61  LLTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLAN 120
           LL EAQN IENMPISPNSI+WGSLLSACRLHGNVWIG++AAESRLLLQPDCASTHLQLAN
Sbjct: 384 LLKEAQNLIENMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLAN 443

Query: 121 LYARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDG 180
           LYARAG LDDAARLRKMMK++GLKTAPGYSWIEIQN+VYRFKAEDKSNP+M+EIFG+MDG
Sbjct: 444 LYARAGYLDDAARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMIEIFGVMDG 503

Query: 181 IVKHMRCVGCPPEEDD 196
           +V HMR VGC PE D+
Sbjct: 504 MVNHMRSVGCVPEVDN 519

BLAST of Moc01g21750 vs. NCBI nr
Match: XP_023544680.1 (pentatricopeptide repeat-containing protein At2g37320 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 359.8 bits (922), Expect = 1.6e-95
Identity = 168/196 (85.71%), Positives = 182/196 (92.86%), Query Frame = 0

Query: 1   MGKQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPDLDHYSCVIDLFGRAG 60
           M KQQQV+AD ITFLGVLSSCRH GLVEEGR YFNLM+EL  KP+LDHYSCVIDL GRAG
Sbjct: 324 MRKQQQVEADGITFLGVLSSCRHGGLVEEGRYYFNLMVELSLKPELDHYSCVIDLLGRAG 383

Query: 61  LLTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLAN 120
           LL EAQNFIE MPISPNSI+WGSLLSACRLHGNVWIG++AAESRLLLQPDCASTHLQLAN
Sbjct: 384 LLKEAQNFIEKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLAN 443

Query: 121 LYARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDG 180
           LYARAG LDDAARLRKMMK++GLKT+PGYSWIEIQN+VYRFKAEDKSNP+M+EIFG+MDG
Sbjct: 444 LYARAGYLDDAARLRKMMKDKGLKTSPGYSWIEIQNKVYRFKAEDKSNPVMIEIFGVMDG 503

Query: 181 IVKHMRCVGCPPEEDD 196
           +V HMR VGC PE DD
Sbjct: 504 MVNHMRSVGCVPEVDD 519

BLAST of Moc01g21750 vs. ExPASy Swiss-Prot
Match: Q9ZUT4 (Pentatricopeptide repeat-containing protein At2g37320 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E50 PE=2 SV=1)

HSP 1 Score: 253.1 bits (645), Expect = 2.8e-66
Identity = 116/178 (65.17%), Positives = 144/178 (80.90%), Query Frame = 0

Query: 8   KADAITFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPDLDHYSCVIDLFGRAGLLTEAQN 67
           K DAIT+LGVLSSCRHAGLV+EGR +FNLM E G KP+L+HYSC++DL GR GLL EA  
Sbjct: 320 KPDAITYLGVLSSCRHAGLVKEGRKFFNLMAEHGLKPELNHYSCLVDLLGRFGLLQEALE 379

Query: 68  FIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLANLYARAGQ 127
            IENMP+ PNS+IWGSLL +CR+HG+VW GI+AAE RL+L+PDCA+TH+QLANLYA  G 
Sbjct: 380 LIENMPMKPNSVIWGSLLFSCRVHGDVWTGIRAAEERLMLEPDCAATHVQLANLYASVGY 439

Query: 128 LDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDGIVKHM 185
             +AA +RK+MK++GLKT PG SWIEI N V+ FKAED SN  M+EI  ++  ++ HM
Sbjct: 440 WKEAATVRKLMKDKGLKTNPGCSWIEINNYVFMFKAEDGSNCRMLEIVHVLHCLIDHM 497

BLAST of Moc01g21750 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 192.2 bits (487), Expect = 5.8e-48
Identity = 93/197 (47.21%), Positives = 136/197 (69.04%), Query Frame = 0

Query: 7   VKADAITFLGVLSSCRHAGLVEEGRCYFNLMLELG-FKPDLDHYSCVIDLFGRAGLLTEA 66
           ++ D IT++GV S+C HAGLV +GR YF++M ++    P L HY+C++DLFGRAGLL EA
Sbjct: 511 LRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEA 570

Query: 67  QNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLANLYARA 126
           Q FIE MPI P+ + WGSLLSACR+H N+ +G  AAE  LLL+P+ +  +  LANLY+  
Sbjct: 571 QEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSAC 630

Query: 127 GQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDGIVKHM 186
           G+ ++AA++RK MK+  +K   G+SWIE++++V+ F  ED ++P   EI+  M  I   +
Sbjct: 631 GKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEI 690

Query: 187 RCVGCPPEEDDFDVLHE 203
           + +G  P  D   VLH+
Sbjct: 691 KKMGYVP--DTASVLHD 705

BLAST of Moc01g21750 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 183.3 bits (464), Expect = 2.7e-45
Identity = 86/193 (44.56%), Positives = 127/193 (65.80%), Query Frame = 0

Query: 8   KADAITFLGVLSSCRHAGLVEEGRCYFNLML-ELGFKPDLDHYSCVIDLFGRAGLLTEAQ 67
           K D IT +GVLS+C HAG VEEGR YF+ M  + G  P  DHY+C++DL GRAG L EA+
Sbjct: 490 KPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAK 549

Query: 68  NFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLANLYARAG 127
           + IE MP+ P+S+IWGSLL+AC++H N+ +G   AE  L ++P  +  ++ L+N+YA  G
Sbjct: 550 SMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELG 609

Query: 128 QLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDGIVKHMR 187
           + +D   +RK M++ G+   PG SWI+IQ   + F  +DKS+P   +I  L+D ++  MR
Sbjct: 610 KWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQIHSLLDILIAEMR 669

Query: 188 CVGCPPEEDDFDV 200
                PE+D  ++
Sbjct: 670 -----PEQDHTEI 677

BLAST of Moc01g21750 vs. ExPASy Swiss-Prot
Match: Q9ZUW3 (Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H60 PE=2 SV=1)

HSP 1 Score: 180.3 bits (456), Expect = 2.3e-44
Identity = 79/192 (41.15%), Positives = 130/192 (67.71%), Query Frame = 0

Query: 3   KQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLML-ELGFKPDLDHYSCVIDLFGRAGL 62
           K+++VK D +TF+GV ++C HAGLVEEG  YF++M+ +    P  +H SC++DL+ RAG 
Sbjct: 588 KKRKVKMDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCMVDLYSRAGQ 647

Query: 63  LTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLANL 122
           L +A   IENMP    S IW ++L+ACR+H    +G  AAE  + ++P+ ++ ++ L+N+
Sbjct: 648 LEKAMKVIENMPNPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDSAAYVLLSNM 707

Query: 123 YARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDGI 182
           YA +G   + A++RK+M ER +K  PGYSWIE++N+ Y F A D+S+PL  +I+  ++ +
Sbjct: 708 YAESGDWQERAKVRKLMNERNVKKEPGYSWIEVKNKTYSFLAGDRSHPLKDQIYMKLEDL 767

Query: 183 VKHMRCVGCPPE 194
              ++ +G  P+
Sbjct: 768 STRLKDLGYEPD 779

BLAST of Moc01g21750 vs. ExPASy Swiss-Prot
Match: Q9SY02 (Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H24 PE=3 SV=1)

HSP 1 Score: 176.0 bits (445), Expect = 4.3e-43
Identity = 79/188 (42.02%), Positives = 122/188 (64.89%), Query Frame = 0

Query: 3   KQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLML-ELGFKPDLDHYSCVIDLFGRAGL 62
           K++ +K D  T + VLS+C H GLV++GR YF  M  + G  P+  HY+C++DL GRAGL
Sbjct: 502 KREGLKPDDATMVAVLSACSHTGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDLLGRAGL 561

Query: 63  LTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLANL 122
           L +A N ++NMP  P++ IWG+LL A R+HGN  +   AA+    ++P+ +  ++ L+NL
Sbjct: 562 LEDAHNLMKNMPFEPDAAIWGTLLGASRVHGNTELAETAADKIFAMEPENSGMYVLLSNL 621

Query: 123 YARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDGI 182
           YA +G+  D  +LR  M+++G+K  PGYSWIEIQN+ + F   D+ +P   EIF  ++ +
Sbjct: 622 YASSGRWGDVGKLRVRMRDKGVKKVPGYSWIEIQNKTHTFSVGDEFHPEKDEIFAFLEEL 681

Query: 183 VKHMRCVG 190
              M+  G
Sbjct: 682 DLRMKKAG 689

BLAST of Moc01g21750 vs. ExPASy TrEMBL
Match: A0A6J1CPL2 (pentatricopeptide repeat-containing protein At2g37320 OS=Momordica charantia OX=3673 GN=LOC111013570 PE=4 SV=1)

HSP 1 Score: 429.1 bits (1102), Expect = 1.0e-116
Identity = 204/204 (100.00%), Positives = 204/204 (100.00%), Query Frame = 0

Query: 1   MGKQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPDLDHYSCVIDLFGRAG 60
           MGKQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPDLDHYSCVIDLFGRAG
Sbjct: 324 MGKQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPDLDHYSCVIDLFGRAG 383

Query: 61  LLTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLAN 120
           LLTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLAN
Sbjct: 384 LLTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLAN 443

Query: 121 LYARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDG 180
           LYARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDG
Sbjct: 444 LYARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDG 503

Query: 181 IVKHMRCVGCPPEEDDFDVLHETF 204
           IVKHMRCVGCPPEEDDFDVLHETF
Sbjct: 504 IVKHMRCVGCPPEEDDFDVLHETF 527

BLAST of Moc01g21750 vs. ExPASy TrEMBL
Match: A0A6J1GDY6 (pentatricopeptide repeat-containing protein At2g37320 OS=Cucurbita moschata OX=3662 GN=LOC111453122 PE=4 SV=1)

HSP 1 Score: 361.3 bits (926), Expect = 2.6e-96
Identity = 169/196 (86.22%), Positives = 183/196 (93.37%), Query Frame = 0

Query: 1   MGKQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPDLDHYSCVIDLFGRAG 60
           M KQQQV+AD ITFLGVLSSCRH GLVEEGR YFNLM+ELG KP+LDHYSCVIDL GRAG
Sbjct: 324 MRKQQQVEADGITFLGVLSSCRHGGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAG 383

Query: 61  LLTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLAN 120
           LL EAQN IENMPISPNSI+WGSLLSACRLHGNVWIG++AAESRLLLQPDCASTHLQLAN
Sbjct: 384 LLKEAQNLIENMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLAN 443

Query: 121 LYARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDG 180
           LYARAG LDDAARLRKMMK++GLKTAPGYSWIEIQN+VYRFKAEDKSNP+M+EIFG+MDG
Sbjct: 444 LYARAGYLDDAARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMIEIFGVMDG 503

Query: 181 IVKHMRCVGCPPEEDD 196
           +V HMR VGC PE D+
Sbjct: 504 MVNHMRSVGCVPEVDN 519

BLAST of Moc01g21750 vs. ExPASy TrEMBL
Match: A0A6J1IKW6 (pentatricopeptide repeat-containing protein At2g37320 OS=Cucurbita maxima OX=3661 GN=LOC111477972 PE=4 SV=1)

HSP 1 Score: 356.7 bits (914), Expect = 6.5e-95
Identity = 167/196 (85.20%), Positives = 181/196 (92.35%), Query Frame = 0

Query: 1   MGKQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPDLDHYSCVIDLFGRAG 60
           M KQQQV+AD ITFLGVLSSCRH GLVEEGR YFNLM+EL  KP+LDHYSCVIDL GRAG
Sbjct: 324 MRKQQQVEADGITFLGVLSSCRHGGLVEEGRYYFNLMVELALKPELDHYSCVIDLLGRAG 383

Query: 61  LLTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLAN 120
           LL EAQNFIE MPISPNSI+WGSLLSACRLHGNVWIG++AAESRLLLQPDCASTHLQLAN
Sbjct: 384 LLKEAQNFIEKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLAN 443

Query: 121 LYARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDG 180
           LYARAG L+DAARLRKMMK++GLKTAPGYSWIEIQN+VYRFKAEDKSNP+M+EIFG+MDG
Sbjct: 444 LYARAGYLEDAARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMIEIFGVMDG 503

Query: 181 IVKHMRCVGCPPEEDD 196
           +V HMR V C PE DD
Sbjct: 504 MVNHMRSVDCVPEVDD 519

BLAST of Moc01g21750 vs. ExPASy TrEMBL
Match: A0A0A0KX36 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G293100 PE=4 SV=1)

HSP 1 Score: 353.2 bits (905), Expect = 7.2e-94
Identity = 166/196 (84.69%), Positives = 182/196 (92.86%), Query Frame = 0

Query: 1   MGKQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPDLDHYSCVIDLFGRAG 60
           M KQ+QV+ADAITFLGVLSSCRHAG VEEGR YFNLM+ELG KP+LDHYSCVIDL GRAG
Sbjct: 323 MRKQKQVEADAITFLGVLSSCRHAGFVEEGRHYFNLMVELGLKPELDHYSCVIDLLGRAG 382

Query: 61  LLTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLAN 120
           LL EAQNFIE MPI+PNSI+WGSLLSACRLHGNVWIG++AAESRLLLQPDCASTHLQL N
Sbjct: 383 LLKEAQNFIEKMPITPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLTN 442

Query: 121 LYARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDG 180
           LYA+AG LDDAARLRK+MK++GLKTAPGYSWIEIQN+VYRFKAEDKSNPLMVEIFGL+DG
Sbjct: 443 LYAKAGYLDDAARLRKIMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPLMVEIFGLIDG 502

Query: 181 IVKHMRCVGCPPEEDD 196
           +V HMR VGC  E +D
Sbjct: 503 MVNHMRFVGCAHELED 518

BLAST of Moc01g21750 vs. ExPASy TrEMBL
Match: A0A5A7U1F7 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold908G001370 PE=4 SV=1)

HSP 1 Score: 345.5 bits (885), Expect = 1.5e-91
Identity = 167/199 (83.92%), Positives = 181/199 (90.95%), Query Frame = 0

Query: 1   MGKQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPDLDHYSCVIDLFGRAG 60
           M KQ+QV+ADAITFLGVLSSCRHAG VEEGR YFNLM+ELG KP+LDHYSCVIDL GRAG
Sbjct: 316 MRKQKQVEADAITFLGVLSSCRHAGFVEEGRHYFNLMVELGLKPELDHYSCVIDLLGRAG 375

Query: 61  LLTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLAN 120
           LL EAQNFIE MP+SPNSIIWGSLLSACRLHGNVWIG++AAESRLLLQPDCASTHLQL  
Sbjct: 376 LLKEAQNFIEKMPMSPNSIIWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLTK 435

Query: 121 LYARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDG 180
           LYA+AG LDDAARLRK+MK++GLKTAPGYSWIEIQN+VYRFKAEDKSNPLMVEIFGLMD 
Sbjct: 436 LYAKAGYLDDAARLRKIMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPLMVEIFGLMDC 495

Query: 181 IVKHMRCVGCPPE-EDDFD 199
           +V HMR VG   E ED+ D
Sbjct: 496 MVNHMRFVGFDHELEDEVD 514

BLAST of Moc01g21750 vs. TAIR 10
Match: AT2G37320.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 253.1 bits (645), Expect = 2.0e-67
Identity = 116/178 (65.17%), Positives = 144/178 (80.90%), Query Frame = 0

Query: 8   KADAITFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPDLDHYSCVIDLFGRAGLLTEAQN 67
           K DAIT+LGVLSSCRHAGLV+EGR +FNLM E G KP+L+HYSC++DL GR GLL EA  
Sbjct: 320 KPDAITYLGVLSSCRHAGLVKEGRKFFNLMAEHGLKPELNHYSCLVDLLGRFGLLQEALE 379

Query: 68  FIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLANLYARAGQ 127
            IENMP+ PNS+IWGSLL +CR+HG+VW GI+AAE RL+L+PDCA+TH+QLANLYA  G 
Sbjct: 380 LIENMPMKPNSVIWGSLLFSCRVHGDVWTGIRAAEERLMLEPDCAATHVQLANLYASVGY 439

Query: 128 LDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDGIVKHM 185
             +AA +RK+MK++GLKT PG SWIEI N V+ FKAED SN  M+EI  ++  ++ HM
Sbjct: 440 WKEAATVRKLMKDKGLKTNPGCSWIEINNYVFMFKAEDGSNCRMLEIVHVLHCLIDHM 497

BLAST of Moc01g21750 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 192.2 bits (487), Expect = 4.1e-49
Identity = 93/197 (47.21%), Positives = 136/197 (69.04%), Query Frame = 0

Query: 7   VKADAITFLGVLSSCRHAGLVEEGRCYFNLMLELG-FKPDLDHYSCVIDLFGRAGLLTEA 66
           ++ D IT++GV S+C HAGLV +GR YF++M ++    P L HY+C++DLFGRAGLL EA
Sbjct: 511 LRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEA 570

Query: 67  QNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLANLYARA 126
           Q FIE MPI P+ + WGSLLSACR+H N+ +G  AAE  LLL+P+ +  +  LANLY+  
Sbjct: 571 QEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSAC 630

Query: 127 GQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDGIVKHM 186
           G+ ++AA++RK MK+  +K   G+SWIE++++V+ F  ED ++P   EI+  M  I   +
Sbjct: 631 GKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEI 690

Query: 187 RCVGCPPEEDDFDVLHE 203
           + +G  P  D   VLH+
Sbjct: 691 KKMGYVP--DTASVLHD 705

BLAST of Moc01g21750 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 183.3 bits (464), Expect = 1.9e-46
Identity = 86/193 (44.56%), Positives = 127/193 (65.80%), Query Frame = 0

Query: 8   KADAITFLGVLSSCRHAGLVEEGRCYFNLML-ELGFKPDLDHYSCVIDLFGRAGLLTEAQ 67
           K D IT +GVLS+C HAG VEEGR YF+ M  + G  P  DHY+C++DL GRAG L EA+
Sbjct: 490 KPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAK 549

Query: 68  NFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLANLYARAG 127
           + IE MP+ P+S+IWGSLL+AC++H N+ +G   AE  L ++P  +  ++ L+N+YA  G
Sbjct: 550 SMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELG 609

Query: 128 QLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDGIVKHMR 187
           + +D   +RK M++ G+   PG SWI+IQ   + F  +DKS+P   +I  L+D ++  MR
Sbjct: 610 KWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQIHSLLDILIAEMR 669

Query: 188 CVGCPPEEDDFDV 200
                PE+D  ++
Sbjct: 670 -----PEQDHTEI 677

BLAST of Moc01g21750 vs. TAIR 10
Match: AT2G27610.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 180.3 bits (456), Expect = 1.6e-45
Identity = 79/192 (41.15%), Positives = 130/192 (67.71%), Query Frame = 0

Query: 3   KQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLML-ELGFKPDLDHYSCVIDLFGRAGL 62
           K+++VK D +TF+GV ++C HAGLVEEG  YF++M+ +    P  +H SC++DL+ RAG 
Sbjct: 588 KKRKVKMDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCMVDLYSRAGQ 647

Query: 63  LTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLANL 122
           L +A   IENMP    S IW ++L+ACR+H    +G  AAE  + ++P+ ++ ++ L+N+
Sbjct: 648 LEKAMKVIENMPNPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDSAAYVLLSNM 707

Query: 123 YARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDGI 182
           YA +G   + A++RK+M ER +K  PGYSWIE++N+ Y F A D+S+PL  +I+  ++ +
Sbjct: 708 YAESGDWQERAKVRKLMNERNVKKEPGYSWIEVKNKTYSFLAGDRSHPLKDQIYMKLEDL 767

Query: 183 VKHMRCVGCPPE 194
              ++ +G  P+
Sbjct: 768 STRLKDLGYEPD 779

BLAST of Moc01g21750 vs. TAIR 10
Match: AT4G02750.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 176.0 bits (445), Expect = 3.0e-44
Identity = 79/188 (42.02%), Positives = 122/188 (64.89%), Query Frame = 0

Query: 3   KQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLML-ELGFKPDLDHYSCVIDLFGRAGL 62
           K++ +K D  T + VLS+C H GLV++GR YF  M  + G  P+  HY+C++DL GRAGL
Sbjct: 502 KREGLKPDDATMVAVLSACSHTGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDLLGRAGL 561

Query: 63  LTEAQNFIENMPISPNSIIWGSLLSACRLHGNVWIGIQAAESRLLLQPDCASTHLQLANL 122
           L +A N ++NMP  P++ IWG+LL A R+HGN  +   AA+    ++P+ +  ++ L+NL
Sbjct: 562 LEDAHNLMKNMPFEPDAAIWGTLLGASRVHGNTELAETAADKIFAMEPENSGMYVLLSNL 621

Query: 123 YARAGQLDDAARLRKMMKERGLKTAPGYSWIEIQNRVYRFKAEDKSNPLMVEIFGLMDGI 182
           YA +G+  D  +LR  M+++G+K  PGYSWIEIQN+ + F   D+ +P   EIF  ++ +
Sbjct: 622 YASSGRWGDVGKLRVRMRDKGVKKVPGYSWIEIQNKTHTFSVGDEFHPEKDEIFAFLEEL 681

Query: 183 VKHMRCVG 190
              M+  G
Sbjct: 682 DLRMKKAG 689

The following BLAST results are available for this feature:
Match Name (NCBI nr) | E-value | Identity (%) | Description
XP_022143730.1 | 2.1e-116 | 100.00 | pentatricopeptide repeat-containing protein At2g37320 [Momordica charantia]
KAG7033868.1 | 1.1e-96 | 86.73 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
KAG6603691.1 | 1.1e-96 | 86.73 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_022949850.1 | 5.5e-96 | 86.22 | pentatricopeptide repeat-containing protein At2g37320 [Cucurbita moschata]
XP_023544680.1 | 1.6e-95 | 85.71 | pentatricopeptide repeat-containing protein At2g37320 [Cucurbita pepo subsp. pep...
Match Name (ExPASy Swiss-Prot) | E-value | Identity (%) | Description
Q9ZUT4 | 2.8e-66 | 65.17 | Pentatricopeptide repeat-containing protein At2g37320 OS=Arabidopsis thaliana OX...
Q9SHZ8 | 5.8e-48 | 47.21 | Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX...
Q9SIT7 | 2.7e-45 | 44.56 | Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...
Q9ZUW3 | 2.3e-44 | 41.15 | Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX...
Q9SY02 | 4.3e-43 | 42.02 | Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX...
Match Name (ExPASy TrEMBL) | E-value | Identity (%) | Description
A0A6J1CPL2 | 1.0e-116 | 100.00 | pentatricopeptide repeat-containing protein At2g37320 OS=Momordica charantia OX=...
A0A6J1GDY6 | 2.6e-96 | 86.22 | pentatricopeptide repeat-containing protein At2g37320 OS=Cucurbita moschata OX=3...
A0A6J1IKW6 | 6.5e-95 | 85.20 | pentatricopeptide repeat-containing protein At2g37320 OS=Cucurbita maxima OX=366...
A0A0A0KX36 | 7.2e-94 | 84.69 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G293100 PE=4 SV=1
A0A5A7U1F7 | 1.5e-91 | 83.92 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
Match Name (TAIR 10) | E-value | Identity (%) | Description
AT2G37320.1 | 2.0e-67 | 65.17 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G22070.1 | 4.1e-49 | 47.21 | pentatricopeptide (PPR) repeat-containing protein
AT2G13600.1 | 1.9e-46 | 44.56 | Pentatricopeptide repeat (PPR) superfamily protein
AT2G27610.1 | 1.6e-45 | 41.15 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G02750.1 | 3.0e-44 | 42.02 | Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 118..144; e-value: 0.001; score: 17.1
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 13..45; e-value: 8.6E-6; score: 23.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 13..42; e-value: 0.068; score: 13.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 48..72; e-value: 0.069; score: 13.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 118..143; e-value: 0.0018; score: 18.4
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 111..145; score: 8.911594
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 10..44; score: 9.350046
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 1..117; e-value: 5.0E-23; score: 84.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 80..158
None | No IPR available | PANTHER | PTHR47929 | FAMILY NOT NAMED | coord: 3..198
None | No IPR available | PANTHER | PTHR47929:SF5 | SUBFAMILY NOT NAMED | coord: 3..198
None | No IPR available | PROSITE | PS51257 | PROKAR_LIPOPROTEIN | coord: 1..21; score: 5.0
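
The coord values in the Alignment column give residue ranges on the protein. A minimal sketch of extracting the residues for one hit follows, assuming the ranges are 1-based and inclusive (not stated on the page); `parse_coord` and `slice_protein` are hypothetical helper names, and only the first 60 residues of the protein are embedded, so the example is limited to hits that fall inside that window.

```python
import re

def parse_coord(text):
    """Extract (start, end) from a string such as 'coord: 13..45'."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coord field in: {text!r}")
    return int(m.group(1)), int(m.group(2))

def slice_protein(protein, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the sequence")
    return protein[start - 1:end]

# Residues 1..60 of the protein sequence listed on this page.
N_TERM = "MGKQQQVKADAITFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPDLDHYSCVIDLFGRAG"

start, end = parse_coord("coord: 13..45; e-value: 8.6E-6; score: 23.6")
segment = slice_protein(N_TERM, start, end)
print(len(segment), segment)  # 33 TFLGVLSSCRHAGLVEEGRCYFNLMLELGFKPD
```

The 13..45 TIGR00756 hit therefore spans 33 residues under this coordinate assumption.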

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Moc01g21750.1 | Moc01g21750.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0009451 | RNA modification
cellular_component | GO:0043231 | intracellular membrane-bounded organelle
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0003723 | RNA binding