Moc09g16440 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc09g16440
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Location: chr9: 13304484 .. 13306736 (+)
RNA-Seq Expression: Moc09g16440
Synteny: Moc09g16440
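The Location value can be read as a sequence name, a start, an end, and a strand. A minimal Python sketch of splitting it apart follows. The helper name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

def parse_location(text):
    """Split a string like 'chr9: 13304484 .. 13306736 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("chr9: 13304484 .. 13306736 (+)")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
```

Under that assumption the gene spans 2253 bases on the forward strand of chr9.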
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATTTACTTCACGGGCGAGGTCCCTGGTTCAAGTCCAGGATGGCCCAGCTACGCCAAGAAAAAGAATAAAAGAATAGAAGAAGCATCTGACTCCTTCATGCAGGCCCCACTTGGCTCGGGGGGATATAGCTCAGTTGGTAGAGCTCCGCTCTTGCAATTGGGTCGTTGCGATTACGGGTTGGATGTCTAATTGTCCAGGCGGTAATGAAAGTATCTTGTACCTGAACCGGTGGCTCACTTTTTCTAAGTAATGGGGAAGAGGACCGAAACATGCCACTGAAAGACTCTACTGAGACAAAGATGGGCTGTCAAGAACGTAGAGGAGGTAGGATGGGCTGTTGGTCAGATCTAGTATGGATCGTACATGGACGGTAGTTGGAGTCGGCGGCTCTCTTAGGGTTCCCTCATCTGGGATCCCTGGGGAAGAGGATCAAGTTGGCCCTTGCGAACAGCTTGATGCACGAGATCCCTTCAACCCTTTGAGCGAAATGCGGCAAAAGGAAGGAAAATCCATGGACCGACCCCATCGTCTCCACCCCGTAGTACGAGATCACCCCAAGGACGCCTTCGGTATCCAGGGGTCGCGGACCGACCATAGAACCCTGTTCAATAAGTGGAACGCATTAGTTGTCCGCTCTCGGGTTGGGCAGTAAGGGTCGGAGAAGGGCAATCACTCATTCTTAAAACCAGCATTCTTAAGACCAAAGAGGCGGGCGGAAAAGGGGAAAGCTCTCCGTTCCTGGTTCTCCTGTAGCTGGATCCTCCGGAACCACAAGAATCCTTAGTTAGAATGGGATTCCAACTCAGTACCTTTTGAGATTTTGAGAAGAGTTGCTCTTTGGAGAGCACAGTACGATGAAAGTTGTAAGCTGTGTTCGGGGGGGAGTTATTGTCTATCGTTGGCCTCTATGGTAGAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATACAAAGCTATATGATAGCACTCAATTTTTCCGATTCGGCAGTTCGATCTATGATTTCTCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATAGCCTTAAAGTTAAGGGCGAGGTTCAAACGAGGAAAGGCTTACGGTGGATACCTAGACACCCAGAGACGAGGAAGGGCGTAGTAAGCGACGAAATGCTTCGGGGAGTTGAAAATAAGCGTAGATCCGGAGATTCCCGAATAGGTCAACCTTTCAAACAGCTACGGAATCCATGGGCAGGCAAGAGACAACCTGGCGAACTGAAACATCTTAGTAGCCAGAGGAAAAGAAAGCAAAAGCGATTCCCGTAGTAGCGGCGAGCGAAATGGGAGCAGCCTAAACCGTGAAAACGGGGTTGTGGGAGAGCAATACAAGCGTCGTGCTGCTAGGCGAAGCGGTGGAGTGCTGCACCCTAGATGGCGAGAGTCCAGTAGCCGAAAGCATCACTAGCTTACGCTCTGACCCGAGTAGCATGGGGCACGTGGAATCCCGTGTGAATCAGCAAGGACCACCTTGCAAGGCTAAATACTCCTGGGTGACCGATAGTGAAGTAGTACCGTGAGGGAAGGGTGAAAAGAACCCCATCGGGGAGTGAAATAGAACATGAAACCGTAAGCTTCCAAGCAGTGGGAGGAGACCAGGACTCTGACCGCGTGCCTGTTGAAGAATGAGCCGGCGACTCATAGGCAGTGGCTTGGTTAAGGGAACCCACCGGAGCCGTAGCGAAAGCGAGTCTTCATGGGGCAATTGTCACTGCTTATGGACCCGAACCTGGGTGATCTATCCATGACCAGGATGAAGCTTGGGTGAAACTAAGTGGAGGTCCGAACCGACTGATGTTGAAGAATCAGCGGATGAGTTGTGGTTAGGGGTGAAATGCCACTCGAACCCAGAGCTAGCTGGTTCTCCCCGAAATGCGTTGAGGCGCAGCAGTTGACTGGA
CATCTAGGGGTAAAGCACTGTTTCGGTGCGGGCCGCGAGAGCGGTACCAAATCGAGGCAAACTCTGAATACTAGATATGATCTCAAAATAACAGGGGTCAAGGTCGGCCAGTGAGACGATGGGGGATAAGCTTCATCGTCGAGAGGGAAACAGCCCGGATCACCTGCTAAGGCCCCTAAATGACCGCTCAGTGATAAAGGAGGTAGGGGTGCAGAGACAGCCAGGAGGTTTGCCTAGAAGCAGCCACCCTTGA

mRNA sequence

ATGATTTACTTCACGGGCGAGGTCCCTGGTTCAAGTCCAGGATGGCCCAGCTACGCCAAGAAAAAGAATAAAAGAATAGAAGAAGCATCTGACTCCTTCATGCAGGCCCCACTTGGCTCGGGGGGATATAGCTCAGTTGGTAGAGCTCCGCTCTTGCAATTGGGTCGTTGCGATTACGGATCTAGTATGGATCGTACATGGACGGTAGTTGGAGTCGGCGGCTCTCTTAGGGTTCCCTCATCTGGGATCCCTGGGGAAGAGGATCAAGTTGGCCCTTGCGAACAGCTTGATGCACGAGATCCCTTCAACCCTTTGAGCGAAATGCGGCAAAAGGAAGGAAAATCCATGGACCGACCCCATCGTCTCCACCCCGTAGTACGAGATCACCCCAAGGACGCCTTCGGTATCCAGGGGTCGCGGACCGACCATAGAACCCTGTTCAATAAGTGGAACGCATTAGTTGTCCGCTCTCGGTTAGAATGGGATTCCAACTCAGTACCTTTTGAGATTTTGAGAAGAGTTGCTCTTTGGAGAGCACAGTACGATGAAAGTTGTAAGCTGTGTTCGGGGGGGAGTTATTGTCTATCGTTGGCCTCTATGGTAGAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATACAAAGCTATATGATAGCACTCAATTTTTCCGATTCGGCAGTTCGATCTATGATTTCTCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATAGCCTTAAAGTTAAGGGCGAGGTTCAAACGAGGAAAGGCTTACGGTGGATACCTAGACACCCAGAGACGAGGAAGGGCGTAGTAAGCGACGAAATGCTTCGGGGAGTTGAAAATAAGCGTAGATCCGGAGATTCCCGAATAGGCGAAGCGGTGGAGTGCTGCACCCTAGATGGCGAGAGTCCAGTAGCCGAAAGCATCACTAGCTTACGCTCTGACCCGAGTAGCATGGGGCACGGGTCAAGGTCGGCCAGTGAGACGATGGGGGATAAGCTTCATCGTCGAGAGGGAAACAGCCCGGATCACCTGCTAAGGCCCCTAAATGACCGCTCAGTGATAAAGGAGGTAGGGGTGCAGAGACAGCCAGGAGGTTTGCCTAGAAGCAGCCACCCTTGA

Coding sequence (CDS)

ATGATTTACTTCACGGGCGAGGTCCCTGGTTCAAGTCCAGGATGGCCCAGCTACGCCAAGAAAAAGAATAAAAGAATAGAAGAAGCATCTGACTCCTTCATGCAGGCCCCACTTGGCTCGGGGGGATATAGCTCAGTTGGTAGAGCTCCGCTCTTGCAATTGGGTCGTTGCGATTACGGATCTAGTATGGATCGTACATGGACGGTAGTTGGAGTCGGCGGCTCTCTTAGGGTTCCCTCATCTGGGATCCCTGGGGAAGAGGATCAAGTTGGCCCTTGCGAACAGCTTGATGCACGAGATCCCTTCAACCCTTTGAGCGAAATGCGGCAAAAGGAAGGAAAATCCATGGACCGACCCCATCGTCTCCACCCCGTAGTACGAGATCACCCCAAGGACGCCTTCGGTATCCAGGGGTCGCGGACCGACCATAGAACCCTGTTCAATAAGTGGAACGCATTAGTTGTCCGCTCTCGGTTAGAATGGGATTCCAACTCAGTACCTTTTGAGATTTTGAGAAGAGTTGCTCTTTGGAGAGCACAGTACGATGAAAGTTGTAAGCTGTGTTCGGGGGGGAGTTATTGTCTATCGTTGGCCTCTATGGTAGAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATACAAAGCTATATGATAGCACTCAATTTTTCCGATTCGGCAGTTCGATCTATGATTTCTCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATAGCCTTAAAGTTAAGGGCGAGGTTCAAACGAGGAAAGGCTTACGGTGGATACCTAGACACCCAGAGACGAGGAAGGGCGTAGTAAGCGACGAAATGCTTCGGGGAGTTGAAAATAAGCGTAGATCCGGAGATTCCCGAATAGGCGAAGCGGTGGAGTGCTGCACCCTAGATGGCGAGAGTCCAGTAGCCGAAAGCATCACTAGCTTACGCTCTGACCCGAGTAGCATGGGGCACGGGTCAAGGTCGGCCAGTGAGACGATGGGGGATAAGCTTCATCGTCGAGAGGGAAACAGCCCGGATCACCTGCTAAGGCCCCTAAATGACCGCTCAGTGATAAAGGAGGTAGGGGTGCAGAGACAGCCAGGAGGTTTGCCTAGAAGCAGCCACCCTTGA

Protein sequence

MIYFTGEVPGSSPGWPSYAKKKNKRIEEASDSFMQAPLGSGGYSSVGRAPLLQLGRCDYGSSMDRTWTVVGVGGSLRVPSSGIPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHGSRSASETMGDKLHRREGNSPDHLLRPLNDRSVIKEVGVQRQPGGLPRSSHP
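
The protein sequence above should correspond to a translation of the CDS. The following Python sketch shows one way to check this on short fragments. It assumes the standard genetic code, which the record does not specify, and the names `CODON_TABLE` and `translate` are illustrative, not part of any database tooling.

```python
from itertools import product

# Standard genetic code in TCAG order (assumed; the record does not name a table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 24 and last 21 bases of the CDS listed above.
head = translate("ATGATTTACTTCACGGGCGAGGTC")
tail = translate("CCTAGAAGCAGCCACCCTTGA")
```

Here `head` comes out as MIYFTGEV and `tail` as PRSSHP, which agrees with the first and last residues of the listed protein sequence.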
Homology
BLAST of Moc09g16440 vs. NCBI nr
Match: OVA05688.1 (hypothetical protein BVC80_4285g1 [Macleaya cordata])

HSP 1 Score: 438.0 bits (1125), Expect = 9.0e-119
Identity = 245/347 (70.61%), Positives = 249/347 (71.76%), Query Frame = 0

Query: 24  KRIEEASDSFMQAPLGSGGYSSVGRAPLLQLGRCDYGSSMDRTWTVVGVGGSLRVPSSGI 83
           KRIEEASDSFM APLGSGGYSSVGRAPLLQL              VVGVGGS RVPSSGI
Sbjct: 227 KRIEEASDSFMHAPLGSGGYSSVGRAPLLQL--------------VVGVGGSPRVPSSGI 286

Query: 84  PGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTDH 143
           PGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHRLHPV              R  H
Sbjct: 287 PGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQ------GRLRH 346

Query: 144 RTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVES 203
             L    ++   +  LEWDSNS PFEILRRVALWRAQ                      S
Sbjct: 347 PGL----DSPEPQESLEWDSNSAPFEILRRVALWRAQI---------------------S 406

Query: 204 VGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFS 263
            GGLRGGGLPCGGCQRFESAYLQLVNLADTK+YDSTQFFRFGSSIYD SFMDVDKIL FS
Sbjct: 407 RGGLRGGGLPCGGCQRFESAYLQLVNLADTKVYDSTQFFRFGSSIYDLSFMDVDKILLFS 466

Query: 264 STLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG------ 323
           STLGWHSLKV GEVQTRKGLRWIPRHPETRKGV SDEMLRGVENK RSGDSRIG      
Sbjct: 467 STLGWHSLKVNGEVQTRKGLRWIPRHPETRKGVASDEMLRGVENKHRSGDSRIGQPFELL 526

Query: 324 -----------------EAVECCTLDGESPVAESITSLRSDPSSMGH 348
                            EAVEC TLDGESPVAESITSLRSDPSSMGH
Sbjct: 527 LNPWAGKRQPGELKHLSEAVECRTLDGESPVAESITSLRSDPSSMGH 528
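
The percentages on an HSP summary line are the reported counts divided by the alignment length. A small Python sketch of that arithmetic, using the figures from the hit above, is given here; the function name `percent` is illustrative.

```python
def percent(matches, alignment_length):
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / alignment_length, 2)

# Counts taken from the Macleaya cordata hit: 245 identical and
# 249 positive-scoring positions over an alignment of 347 columns.
identity = percent(245, 347)
positives = percent(249, 347)
```

The alignment length (347) includes gap columns, which is why identity is lower than a count over ungapped positions would suggest.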

BLAST of Moc09g16440 vs. NCBI nr
Match: CAD5336145.1 (unnamed protein product [Arabidopsis thaliana])

HSP 1 Score: 421.8 bits (1083), Expect = 6.7e-114
Identity = 230/303 (75.91%), Positives = 234/303 (77.23%), Query Frame = 0

Query: 83  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTD 142
           IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHRLHPV         G   S   
Sbjct: 132 IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPV---------GTTRS--- 191

Query: 143 HRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVE 202
                        + RL     S PFEILRRVALWRAQYDESCKLCSGGS CLSLASMVE
Sbjct: 192 ------------PQGRL----RSSPFEILRRVALWRAQYDESCKLCSGGSSCLSLASMVE 251

Query: 203 SVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPF 262
           SV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKI PF
Sbjct: 252 SVRGLIGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKIFPF 311

Query: 263 SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVEC 322
           SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG AV  
Sbjct: 312 SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGAAV-- 371

Query: 323 CTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHLLRPLNDRSVI 382
                             D +S G    RSASET+GDKLHRREGNSPDH LRPLNDRSVI
Sbjct: 372 ------------------DWTSRGKALFRSASETVGDKLHRREGNSPDHQLRPLNDRSVI 386

Query: 383 KEV 385
           KE+
Sbjct: 432 KEM 386

BLAST of Moc09g16440 vs. NCBI nr
Match: CAD5336141.1 (unnamed protein product [Arabidopsis thaliana])

HSP 1 Score: 421.8 bits (1083), Expect = 6.7e-114
Identity = 230/303 (75.91%), Positives = 234/303 (77.23%), Query Frame = 0

Query: 83  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTD 142
           IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHRLHPV         G   S   
Sbjct: 46  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPV---------GTTRS--- 105

Query: 143 HRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVE 202
                        + RL     S PFEILRRVALWRAQYDESCKLCSGGS CLSLASMVE
Sbjct: 106 ------------PQGRL----RSSPFEILRRVALWRAQYDESCKLCSGGSSCLSLASMVE 165

Query: 203 SVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPF 262
           SV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKI PF
Sbjct: 166 SVRGLIGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKIFPF 225

Query: 263 SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVEC 322
           SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG AV  
Sbjct: 226 SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGAAV-- 285

Query: 323 CTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHLLRPLNDRSVI 382
                             D +S G    RSASET+GDKLHRREGNSPDH LRPLNDRSVI
Sbjct: 286 ------------------DWTSRGKALFRSASETVGDKLHRREGNSPDHQLRPLNDRSVI 300

Query: 383 KEV 385
           KE+
Sbjct: 346 KEM 300

BLAST of Moc09g16440 vs. NCBI nr
Match: CAD5336140.1 (unnamed protein product [Arabidopsis thaliana] >CAD5336144.1 unnamed protein product [Arabidopsis thaliana])

HSP 1 Score: 421.8 bits (1083), Expect = 6.7e-114
Identity = 230/303 (75.91%), Positives = 234/303 (77.23%), Query Frame = 0

Query: 83  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTD 142
           IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHRLHPV         G   S   
Sbjct: 46  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPV---------GTTRS--- 105

Query: 143 HRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVE 202
                        + RL     S PFEILRRVALWRAQYDESCKLCSGGS CLSLASMVE
Sbjct: 106 ------------PQGRL----RSSPFEILRRVALWRAQYDESCKLCSGGSSCLSLASMVE 165

Query: 203 SVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPF 262
           SV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKI PF
Sbjct: 166 SVRGLIGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKIFPF 225

Query: 263 SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVEC 322
           SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG AV  
Sbjct: 226 SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGAAV-- 285

Query: 323 CTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHLLRPLNDRSVI 382
                             D +S G    RSASET+GDKLHRREGNSPDH LRPLNDRSVI
Sbjct: 286 ------------------DWTSRGKALFRSASETVGDKLHRREGNSPDHQLRPLNDRSVI 300

Query: 383 KEV 385
           KE+
Sbjct: 346 KEM 300

BLAST of Moc09g16440 vs. NCBI nr
Match: KAG7528872.1 (hypothetical protein ISN44_Un153g000040 [Arabidopsis suecica] >KAG7528886.1 hypothetical protein ISN44_Un153g000180 [Arabidopsis suecica] >KAG7529053.1 hypothetical protein ISN45_Un101g000060 [Arabidopsis thaliana x Arabidopsis arenosa] >KAG7529068.1 hypothetical protein ISN45_Un101g000220 [Arabidopsis thaliana x Arabidopsis arenosa])

HSP 1 Score: 404.1 bits (1037), Expect = 1.4e-108
Identity = 214/265 (80.75%), Positives = 215/265 (81.13%), Query Frame = 0

Query: 83  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTD 142
           IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHRLHPV         G   S   
Sbjct: 46  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPV---------GTTRS--- 105

Query: 143 HRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVE 202
                        + RL             RVALWRAQYDESCKLCSGGS CLSLASMVE
Sbjct: 106 ------------PQGRL-------------RVALWRAQYDESCKLCSGGSSCLSLASMVE 165

Query: 203 SVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPF 262
           SV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKI PF
Sbjct: 166 SVRGLIGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKIFPF 225

Query: 263 SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVEC 322
           SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVEC
Sbjct: 226 SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVEC 273

Query: 323 CTLDGESPVAESITSLRSDPSSMGH 348
            TLDGESPVAESITSL SDPSSMGH
Sbjct: 286 RTLDGESPVAESITSLCSDPSSMGH 273

BLAST of Moc09g16440 vs. ExPASy TrEMBL
Match: A0A200Q5G5 (Uncharacterized protein ycf68 OS=Macleaya cordata OX=56857 GN=BVC80_4285g1 PE=3 SV=1)

HSP 1 Score: 438.0 bits (1125), Expect = 4.4e-119
Identity = 245/347 (70.61%), Positives = 249/347 (71.76%), Query Frame = 0

Query: 24  KRIEEASDSFMQAPLGSGGYSSVGRAPLLQLGRCDYGSSMDRTWTVVGVGGSLRVPSSGI 83
           KRIEEASDSFM APLGSGGYSSVGRAPLLQL              VVGVGGS RVPSSGI
Sbjct: 227 KRIEEASDSFMHAPLGSGGYSSVGRAPLLQL--------------VVGVGGSPRVPSSGI 286

Query: 84  PGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTDH 143
           PGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHRLHPV              R  H
Sbjct: 287 PGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQ------GRLRH 346

Query: 144 RTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVES 203
             L    ++   +  LEWDSNS PFEILRRVALWRAQ                      S
Sbjct: 347 PGL----DSPEPQESLEWDSNSAPFEILRRVALWRAQI---------------------S 406

Query: 204 VGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFS 263
            GGLRGGGLPCGGCQRFESAYLQLVNLADTK+YDSTQFFRFGSSIYD SFMDVDKIL FS
Sbjct: 407 RGGLRGGGLPCGGCQRFESAYLQLVNLADTKVYDSTQFFRFGSSIYDLSFMDVDKILLFS 466

Query: 264 STLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG------ 323
           STLGWHSLKV GEVQTRKGLRWIPRHPETRKGV SDEMLRGVENK RSGDSRIG      
Sbjct: 467 STLGWHSLKVNGEVQTRKGLRWIPRHPETRKGVASDEMLRGVENKHRSGDSRIGQPFELL 526

Query: 324 -----------------EAVECCTLDGESPVAESITSLRSDPSSMGH 348
                            EAVEC TLDGESPVAESITSLRSDPSSMGH
Sbjct: 527 LNPWAGKRQPGELKHLSEAVECRTLDGESPVAESITSLRSDPSSMGH 528

BLAST of Moc09g16440 vs. ExPASy TrEMBL
Match: A0A7G2FKR6 (Uncharacterized protein ycf68 OS=Arabidopsis thaliana OX=3702 GN=AT9943_LOCUS23352 PE=3 SV=1)

HSP 1 Score: 421.8 bits (1083), Expect = 3.2e-114
Identity = 230/303 (75.91%), Positives = 234/303 (77.23%), Query Frame = 0

Query: 83  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTD 142
           IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHRLHPV         G   S   
Sbjct: 46  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPV---------GTTRS--- 105

Query: 143 HRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVE 202
                        + RL     S PFEILRRVALWRAQYDESCKLCSGGS CLSLASMVE
Sbjct: 106 ------------PQGRL----RSSPFEILRRVALWRAQYDESCKLCSGGSSCLSLASMVE 165

Query: 203 SVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPF 262
           SV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKI PF
Sbjct: 166 SVRGLIGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKIFPF 225

Query: 263 SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVEC 322
           SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG AV  
Sbjct: 226 SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGAAV-- 285

Query: 323 CTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHLLRPLNDRSVI 382
                             D +S G    RSASET+GDKLHRREGNSPDH LRPLNDRSVI
Sbjct: 286 ------------------DWTSRGKALFRSASETVGDKLHRREGNSPDHQLRPLNDRSVI 300

Query: 383 KEV 385
           KE+
Sbjct: 346 KEM 300

BLAST of Moc09g16440 vs. ExPASy TrEMBL
Match: A0A7G2FMH4 (Uncharacterized protein ycf68 OS=Arabidopsis thaliana OX=3702 GN=AT9943_LOCUS23357 PE=3 SV=1)

HSP 1 Score: 421.8 bits (1083), Expect = 3.2e-114
Identity = 230/303 (75.91%), Positives = 234/303 (77.23%), Query Frame = 0

Query: 83  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTD 142
           IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHRLHPV         G   S   
Sbjct: 132 IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPV---------GTTRS--- 191

Query: 143 HRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVE 202
                        + RL     S PFEILRRVALWRAQYDESCKLCSGGS CLSLASMVE
Sbjct: 192 ------------PQGRL----RSSPFEILRRVALWRAQYDESCKLCSGGSSCLSLASMVE 251

Query: 203 SVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPF 262
           SV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKI PF
Sbjct: 252 SVRGLIGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKIFPF 311

Query: 263 SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVEC 322
           SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG AV  
Sbjct: 312 SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGAAV-- 371

Query: 323 CTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHLLRPLNDRSVI 382
                             D +S G    RSASET+GDKLHRREGNSPDH LRPLNDRSVI
Sbjct: 372 ------------------DWTSRGKALFRSASETVGDKLHRREGNSPDHQLRPLNDRSVI 386

Query: 383 KEV 385
           KE+
Sbjct: 432 KEM 386

BLAST of Moc09g16440 vs. ExPASy TrEMBL
Match: A0A7G2FJL3 (Uncharacterized protein ycf68 OS=Arabidopsis thaliana OX=3702 GN=AT9943_LOCUS23353 PE=3 SV=1)

HSP 1 Score: 421.8 bits (1083), Expect = 3.2e-114
Identity = 230/303 (75.91%), Positives = 234/303 (77.23%), Query Frame = 0

Query: 83  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTD 142
           IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHRLHPV         G   S   
Sbjct: 46  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPV---------GTTRS--- 105

Query: 143 HRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVE 202
                        + RL     S PFEILRRVALWRAQYDESCKLCSGGS CLSLASMVE
Sbjct: 106 ------------PQGRL----RSSPFEILRRVALWRAQYDESCKLCSGGSSCLSLASMVE 165

Query: 203 SVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPF 262
           SV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKI PF
Sbjct: 166 SVRGLIGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKIFPF 225

Query: 263 SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVEC 322
           SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG AV  
Sbjct: 226 SSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGAAV-- 285

Query: 323 CTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHLLRPLNDRSVI 382
                             D +S G    RSASET+GDKLHRREGNSPDH LRPLNDRSVI
Sbjct: 286 ------------------DWTSRGKALFRSASETVGDKLHRREGNSPDHQLRPLNDRSVI 300

Query: 383 KEV 385
           KE+
Sbjct: 346 KEM 300

BLAST of Moc09g16440 vs. ExPASy TrEMBL
Match: A0A2N9I678 (Uncharacterized protein ycf68 OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS49399 PE=3 SV=1)

HSP 1 Score: 393.7 bits (1010), Expect = 9.4e-106
Identity = 208/283 (73.50%), Positives = 212/283 (74.91%), Query Frame = 0

Query: 169 EILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLV 228
           +++  VALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLV
Sbjct: 361 QLVGPVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLV 420

Query: 229 NLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKGLRWIPR 288
           NLADTKLYDSTQFFRFGSSIYD SFMDVDKILPFSSTLGWHSLKVKGEVQTRKGLRWIPR
Sbjct: 421 NLADTKLYDSTQFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTRKGLRWIPR 480

Query: 289 HPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH- 348
           HPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH 
Sbjct: 481 HPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHV 540

Query: 349 ------------------------------------------------------------ 385
                                                                       
Sbjct: 541 ESRVNQQGPPLAWLREPTGAVAKASLHRAIVTAYGPEPGGEMPLEPRASWFSPKCVEAQQ 600

BLAST of Moc09g16440 vs. TAIR 10
Match: AT2G07706.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:ATMG00470.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 54.7 bits (130), Expect = 2.0e-07
Identity = 34/62 (54.84%), Positives = 36/62 (58.06%), Query Frame = 0

Query: 353 SETMGDKLHRR----------EGNSPDHLLRPLNDRSVIKEVGVQRQPGGL------PRS 399
           SET G ++ R           EGNSPDH LRP N RSVIKEVGVQRQP  L      PRS
Sbjct: 36  SETRGTRVKRTLVTRELLSLGEGNSPDHQLRPPNGRSVIKEVGVQRQPRSLLAAPMPPRS 95

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
OVA05688.1 | 9.0e-119 | 70.61 | hypothetical protein BVC80_4285g1 [Macleaya cordata]
CAD5336145.1 | 6.7e-114 | 75.91 | unnamed protein product [Arabidopsis thaliana]
CAD5336141.1 | 6.7e-114 | 75.91 | unnamed protein product [Arabidopsis thaliana]
CAD5336140.1 | 6.7e-114 | 75.91 | unnamed protein product [Arabidopsis thaliana] >CAD5336144.1 unnamed protein pro...
KAG7528872.1 | 1.4e-108 | 80.75 | hypothetical protein ISN44_Un153g000040 [Arabidopsis suecica] >KAG7528886.1 hypo...

Match Name | E-value | Identity | Description
A0A200Q5G5 | 4.4e-119 | 70.61 | Uncharacterized protein ycf68 OS=Macleaya cordata OX=56857 GN=BVC80_4285g1 PE=3 ...
A0A7G2FKR6 | 3.2e-114 | 75.91 | Uncharacterized protein ycf68 OS=Arabidopsis thaliana OX=3702 GN=AT9943_LOCUS233...
A0A7G2FMH4 | 3.2e-114 | 75.91 | Uncharacterized protein ycf68 OS=Arabidopsis thaliana OX=3702 GN=AT9943_LOCUS233...
A0A7G2FJL3 | 3.2e-114 | 75.91 | Uncharacterized protein ycf68 OS=Arabidopsis thaliana OX=3702 GN=AT9943_LOCUS233...
A0A2N9I678 | 9.4e-106 | 73.50 | Uncharacterized protein ycf68 OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS49399 PE=3...

Match Name | E-value | Identity | Description
AT2G07706.1 | 2.0e-07 | 54.84 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 331..399
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 331..351
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 79..101
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 354..377
None | No IPR available | PANTHER | PTHR34890:SF13 | YCF68 PROTEIN | coord: 341..380; coord: 198..294
IPR022546 | Uncharacterised protein family Ycf68 | PANTHER | PTHR34890 | FAMILY NOT NAMED | coord: 341..380; coord: 198..294

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Moc09g16440.1 | Moc09g16440.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0009536 | plastid