MC03g0677 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC03g0677
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Cotton fiber protein
Location: MC03: 13680358 .. 13680882 (+)
RNA-Seq Expression: MC03g0677
Synteny: MC03g0677
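
As a quick sanity check on the Location field, the length of the feature can be computed from its coordinates. The sketch below assumes the coordinates are 1-based and inclusive (a common convention for genome annotations, but not stated on this page); `parse_location` is a hypothetical helper name used only for illustration.

```python
import re

def parse_location(text):
    """Split a location such as 'MC03: 13680358 .. 13680882 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("MC03: 13680358 .. 13680882 (+)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span, span // 3)  # prints: MC03 + 525 175
```

Under that assumption the span is 525 bp, or 175 codons, which agrees with the 175-residue protein sequence listed for this feature.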
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GCAAAGGGAGCAGCCTTCTTCTCTTCCCTAGCCTCCCTAATTCTTCTCTCCAGTATCTTCACCTTCAAAAACCTCTCTTTTTCATCTCATCTCTTCAACAGCACCAACTTCTGGTTCTTCATTTCCAACGCCCTCATAATTCTCATCATAGCCGCTGATTATAATCGCATTTTCTCTCCCACCCAAAACAAAAGACTCGATTTTTATGAAGATTACGTATCCTCCAATCAAAACCAAACTCAGATACTTCAAAGTTCTTCACTGGCAGTAATAGTTGATGAAGAGAGAGAAACCCCAGAAGAGAAACTGCAAATCGTGGTTGGAAGACGCGATTTTGGGTCTTGCCAGCGAAGCAAGTCGGAGAAACCCAAGAGAAAAACCATGGGGAGGAGGTCGGAAAGTGTAAAATACGAAGCAAAGGGAGAAGGAAATGAGTTCTCAAAGATGACAGATGAAGAATTGAATAGAAGGGTGGAGGAATTTATTCAAAGAATTAATAGACAAATGAGACTTCAAAGTTCAAAC

mRNA sequence

GCAAAGGGAGCAGCCTTCTTCTCTTCCCTAGCCTCCCTAATTCTTCTCTCCAGTATCTTCACCTTCAAAAACCTCTCTTTTTCATCTCATCTCTTCAACAGCACCAACTTCTGGTTCTTCATTTCCAACGCCCTCATAATTCTCATCATAGCCGCTGATTATAATCGCATTTTCTCTCCCACCCAAAACAAAAGACTCGATTTTTATGAAGATTACGTATCCTCCAATCAAAACCAAACTCAGATACTTCAAAGTTCTTCACTGGCAGTAATAGTTGATGAAGAGAGAGAAACCCCAGAAGAGAAACTGCAAATCGTGGTTGGAAGACGCGATTTTGGGTCTTGCCAGCGAAGCAAGTCGGAGAAACCCAAGAGAAAAACCATGGGGAGGAGGTCGGAAAGTGTAAAATACGAAGCAAAGGGAGAAGGAAATGAGTTCTCAAAGATGACAGATGAAGAATTGAATAGAAGGGTGGAGGAATTTATTCAAAGAATTAATAGACAAATGAGACTTCAAAGTTCAAAC

Coding sequence (CDS)

GCAAAGGGAGCAGCCTTCTTCTCTTCCCTAGCCTCCCTAATTCTTCTCTCCAGTATCTTCACCTTCAAAAACCTCTCTTTTTCATCTCATCTCTTCAACAGCACCAACTTCTGGTTCTTCATTTCCAACGCCCTCATAATTCTCATCATAGCCGCTGATTATAATCGCATTTTCTCTCCCACCCAAAACAAAAGACTCGATTTTTATGAAGATTACGTATCCTCCAATCAAAACCAAACTCAGATACTTCAAAGTTCTTCACTGGCAGTAATAGTTGATGAAGAGAGAGAAACCCCAGAAGAGAAACTGCAAATCGTGGTTGGAAGACGCGATTTTGGGTCTTGCCAGCGAAGCAAGTCGGAGAAACCCAAGAGAAAAACCATGGGGAGGAGGTCGGAAAGTGTAAAATACGAAGCAAAGGGAGAAGGAAATGAGTTCTCAAAGATGACAGATGAAGAATTGAATAGAAGGGTGGAGGAATTTATTCAAAGAATTAATAGACAAATGAGACTTCAAAGTTCAAAC

Protein sequence

AKGAAFFSSLASLILLSSIFTFKNLSFSSHLFNSTNFWFFISNALIILIIAADYNRIFSPTQNKRLDFYEDYVSSNQNQTQILQSSSLAVIVDEERETPEEKLQIVVGRRDFGSCQRSKSEKPKRKTMGRRSESVKYEAKGEGNEFSKMTDEELNRRVEEFIQRINRQMRLQSSN
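
The protein sequence above appears to be a frame-0 translation of the CDS under the standard genetic code. The following is a minimal sketch that checks this on the first 33 nt of the CDS only; the codon table is built from the standard code in TCAG order, and `translate` is an illustrative helper rather than part of any database tooling.

```python
# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a nucleotide string in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

# First 33 nt of the CDS listed above.
cds_prefix = "GCAAAGGGAGCAGCCTTCTTCTCTTCCCTAGCC"
print(translate(cds_prefix))  # prints: AKGAAFFSSLA
```

The output matches the first 11 residues of the protein sequence. Note that the listed CDS begins with GCA (alanine) rather than an ATG start codon.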
Homology
BLAST of MC03g0677 vs. NCBI nr
Match: XP_038877172.1 (uncharacterized protein LOC120069472 [Benincasa hispida])

HSP 1 Score: 142 bits (358), Expect = 7.96e-39
Identity = 106/188 (56.38%), Positives = 119/188 (63.30%), Query Frame = 0

Query: 5   AFFSSLASLILLSSIFT--FKNLSFSSHLFNSTNFWFFISNALIILIIAADYNRIFSPTQ 64
           AFFSSL  L+ +  IF   FKNLS  S LFNST FWFFISN LI  IIA DY  +FS  Q
Sbjct: 40  AFFSSLLFLLSIFIIFISIFKNLSLPS-LFNSTIFWFFISNTLIF-IIAVDYG-VFSLPQ 99

Query: 65  NKRLDFYEDYVSSNQNQTQILQSSSLAVIVDEERETPEEKLQIVVGRRDFGS-------- 124
           +K    YED+  SN         SS  V+ DE++ET  EKL++VV  R   S        
Sbjct: 100 HKSFHLYEDFSPSNPKLNHFRLPSSSLVVFDEKKETTREKLELVVQGRQLDSPWKMTPAR 159

Query: 125 -CQRSKSEKPK-------RKTMGRRSESVKYEAKG-EGNEFSKMTDEELNRRVEEFIQRI 173
            C+R KSEKPK        K MG+RSESVKYEAK  E NEF KMTDEELNRRVEEFIQR 
Sbjct: 160 TCRRRKSEKPKGIVSKESNKMMGKRSESVKYEAKELEENEFEKMTDEELNRRVEEFIQRF 219

BLAST of MC03g0677 vs. NCBI nr
Match: XP_022927149.1 (uncharacterized protein LOC111434084 [Cucurbita moschata])

HSP 1 Score: 134 bits (336), Expect = 1.89e-35
Identity = 105/187 (56.15%), Positives = 117/187 (62.57%), Query Frame = 0

Query: 5   AFFSSLASLILLSSIFTFKNLSFSSHLFNSTNFWFFISNALIILIIAADYNRIFSPTQNK 64
           AFFSSL  L+LLS    FKNLSFSS LFNST FWFFISNALI  IIAADY   FS +Q K
Sbjct: 36  AFFSSL--LLLLSFCIIFKNLSFSS-LFNSTMFWFFISNALIF-IIAADY-AYFSLSQYK 95

Query: 65  RLDFYEDYVSSNQNQTQILQSSSLAVIVDEERETPEEKLQIVVGRR----------DFGS 124
           R   YE Y   N   T     +S  V+ DE+RE P+  LQ  V  R             +
Sbjct: 96  RSHLYEHYSPPNPKMTHFQHQTSSFVVFDEKREYPDGNLQNFVPERCDSLPPLKTTPVRT 155

Query: 125 CQRSKSEKPKR-------KTMGRRSESVKYEAKG-EGNEFSKMTDEELNRRVEEFIQRIN 173
            +RSKSEKPKR       K M +RSES KYE K  E NE+SKMTDEELNRRVEEFIQR N
Sbjct: 156 YRRSKSEKPKRSVSKESKKKMAKRSESGKYEEKELEENEYSKMTDEELNRRVEEFIQRFN 215

BLAST of MC03g0677 vs. NCBI nr
Match: XP_023001467.1 (uncharacterized protein LOC111495593 [Cucurbita maxima])

HSP 1 Score: 133 bits (335), Expect = 2.67e-35
Identity = 105/187 (56.15%), Positives = 117/187 (62.57%), Query Frame = 0

Query: 5   AFFSSLASLILLSSIFTFKNLSFSSHLFNSTNFWFFISNALIILIIAADYNRIFSPTQNK 64
           AFFSSL  L LLS    FKNLSFSS LFNST FWFFISN LI  IIAADY   FS +Q K
Sbjct: 36  AFFSSL--LFLLSFCIIFKNLSFSS-LFNSTIFWFFISNTLIF-IIAADY-AYFSLSQYK 95

Query: 65  RLDFYEDYVSSNQNQTQILQSSSLAVIVDEERETPEEKLQIVVGRR----------DFGS 124
           R  FYE Y   N   T     +S  V+ +E+RE P+  LQ +V  R             +
Sbjct: 96  RSHFYEHYSPPNPKITHFQHQTSSFVVFEEKREYPDGNLQNIVQDRCDSLPPLKTTPVRT 155

Query: 125 CQRSKSEKPKR-------KTMGRRSESVKYEAKG-EGNEFSKMTDEELNRRVEEFIQRIN 173
            QRSKSEKPKR       K M +RSES KYE K  E NE+SKMTDEELNRRVEEFIQR N
Sbjct: 156 YQRSKSEKPKRTVSKESKKKMAKRSESGKYEGKELEENEYSKMTDEELNRRVEEFIQRFN 215

BLAST of MC03g0677 vs. NCBI nr
Match: KAE8653186.1 (hypothetical protein Csa_020019 [Cucumis sativus])

HSP 1 Score: 132 bits (331), Expect = 1.15e-34
Identity = 110/196 (56.12%), Positives = 125/196 (63.78%), Query Frame = 0

Query: 5   AFFSSLASLILLSSIFT--FKNLSFSSHLFNSTNFWFFISNALIILIIAADYNRIFSPTQ 64
           AFFSSL  L+ +  IF   FKNLSFSS LFNST FWFFISN LI  IIA DY  +FS +Q
Sbjct: 42  AFFSSLLFLLSIFIIFISIFKNLSFSS-LFNSTVFWFFISNTLIF-IIAVDYG-LFSLSQ 101

Query: 65  NKRLDFYEDYVSSNQNQTQI---LQSSSLAVIVDEERETPEEKLQIVVGRRDFGSCQ--- 124
           +K    YEDY  S  N       LQ+SSL V+ DE+RETP+EKL+ VV  +   S     
Sbjct: 102 HKSFHLYEDYYYSPPNPKLTHFQLQTSSL-VVFDEKRETPDEKLETVVQSQRLDSPSKNT 161

Query: 125 --------RSKSEKPKR--------KTMG-RRSESVKYEAKG--EGNEFSKMTDEELNRR 173
                   R KSEKPKR        K MG RRSESVK E K   + NEF+KMTDEELNRR
Sbjct: 162 TTPVRTYLRRKSEKPKRIVSMEISKKMMGKRRSESVKNEGKELEDENEFAKMTDEELNRR 221

BLAST of MC03g0677 vs. NCBI nr
Match: XP_023520225.1 (uncharacterized protein LOC111783530 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 127 bits (318), Expect = 3.54e-33
Identity = 101/187 (54.01%), Positives = 114/187 (60.96%), Query Frame = 0

Query: 5   AFFSSLASLILLSSIFTFKNLSFSSHLFNSTNFWFFISNALIILIIAADYNRIFSPTQNK 64
           AFFSSL  L+    IF  KNLSFSS LFNST FWFFISNALI  IIAADY   FS TQ K
Sbjct: 3   AFFSSLLFLLFFCIIF--KNLSFSS-LFNSTMFWFFISNALIF-IIAADY-AYFSLTQYK 62

Query: 65  RLDFYEDYVSSNQNQTQILQSSSLAVIVDEERETPEEKLQIVVGRR----------DFGS 124
           R   YE Y   N   T   + +S  V+ DE+RE P+  L+  V  R             +
Sbjct: 63  RSHLYEHYSPPNPKMTHFQRQTSSFVVFDEKREYPDGNLRNFVQERCDSMPLLKTTPVRT 122

Query: 125 CQRSKSEKPKR-------KTMGRRSESVKYEAKG-EGNEFSKMTDEELNRRVEEFIQRIN 173
            +RSKSEKPKR       K M +R ES  YE K  E NE+SKMTDEELNRRVEEFIQR N
Sbjct: 123 YERSKSEKPKRSVSKESKKKMAKRLESGMYEEKELEENEYSKMTDEELNRRVEEFIQRFN 182

BLAST of MC03g0677 vs. ExPASy TrEMBL
Match: A0A6J1EH73 (uncharacterized protein LOC111434084 OS=Cucurbita moschata OX=3662 GN=LOC111434084 PE=4 SV=1)

HSP 1 Score: 134 bits (336), Expect = 9.16e-36
Identity = 105/187 (56.15%), Positives = 117/187 (62.57%), Query Frame = 0

Query: 5   AFFSSLASLILLSSIFTFKNLSFSSHLFNSTNFWFFISNALIILIIAADYNRIFSPTQNK 64
           AFFSSL  L+LLS    FKNLSFSS LFNST FWFFISNALI  IIAADY   FS +Q K
Sbjct: 36  AFFSSL--LLLLSFCIIFKNLSFSS-LFNSTMFWFFISNALIF-IIAADY-AYFSLSQYK 95

Query: 65  RLDFYEDYVSSNQNQTQILQSSSLAVIVDEERETPEEKLQIVVGRR----------DFGS 124
           R   YE Y   N   T     +S  V+ DE+RE P+  LQ  V  R             +
Sbjct: 96  RSHLYEHYSPPNPKMTHFQHQTSSFVVFDEKREYPDGNLQNFVPERCDSLPPLKTTPVRT 155

Query: 125 CQRSKSEKPKR-------KTMGRRSESVKYEAKG-EGNEFSKMTDEELNRRVEEFIQRIN 173
            +RSKSEKPKR       K M +RSES KYE K  E NE+SKMTDEELNRRVEEFIQR N
Sbjct: 156 YRRSKSEKPKRSVSKESKKKMAKRSESGKYEEKELEENEYSKMTDEELNRRVEEFIQRFN 215

BLAST of MC03g0677 vs. ExPASy TrEMBL
Match: A0A6J1KGM0 (uncharacterized protein LOC111495593 OS=Cucurbita maxima OX=3661 GN=LOC111495593 PE=4 SV=1)

HSP 1 Score: 133 bits (335), Expect = 1.29e-35
Identity = 105/187 (56.15%), Positives = 117/187 (62.57%), Query Frame = 0

Query: 5   AFFSSLASLILLSSIFTFKNLSFSSHLFNSTNFWFFISNALIILIIAADYNRIFSPTQNK 64
           AFFSSL  L LLS    FKNLSFSS LFNST FWFFISN LI  IIAADY   FS +Q K
Sbjct: 36  AFFSSL--LFLLSFCIIFKNLSFSS-LFNSTIFWFFISNTLIF-IIAADY-AYFSLSQYK 95

Query: 65  RLDFYEDYVSSNQNQTQILQSSSLAVIVDEERETPEEKLQIVVGRR----------DFGS 124
           R  FYE Y   N   T     +S  V+ +E+RE P+  LQ +V  R             +
Sbjct: 96  RSHFYEHYSPPNPKITHFQHQTSSFVVFEEKREYPDGNLQNIVQDRCDSLPPLKTTPVRT 155

Query: 125 CQRSKSEKPKR-------KTMGRRSESVKYEAKG-EGNEFSKMTDEELNRRVEEFIQRIN 173
            QRSKSEKPKR       K M +RSES KYE K  E NE+SKMTDEELNRRVEEFIQR N
Sbjct: 156 YQRSKSEKPKRTVSKESKKKMAKRSESGKYEGKELEENEYSKMTDEELNRRVEEFIQRFN 215

BLAST of MC03g0677 vs. ExPASy TrEMBL
Match: A0A0A0M033 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G435810 PE=4 SV=1)

HSP 1 Score: 111 bits (278), Expect = 2.11e-27
Identity = 89/164 (54.27%), Positives = 102/164 (62.20%), Query Frame = 0

Query: 35  TNFWFFISNALIILIIAADYNRIFSPTQNKRLDFYEDYVSSNQNQTQI---LQSSSLAVI 94
           T FWFFISN LI  IIA DY  +FS +Q+K    YEDY  S  N       LQ+SSL V+
Sbjct: 41  TVFWFFISNTLIF-IIAVDYG-LFSLSQHKSFHLYEDYYYSPPNPKLTHFQLQTSSL-VV 100

Query: 95  VDEERETPEEKLQIVVGRRDFGSCQ-----------RSKSEKPKR--------KTMG-RR 154
            DE+RETP+EKL+ VV  +   S             R KSEKPKR        K MG RR
Sbjct: 101 FDEKRETPDEKLETVVQSQRLDSPSKNTTTPVRTYLRRKSEKPKRIVSMEISKKMMGKRR 160

Query: 155 SESVKYEAKG--EGNEFSKMTDEELNRRVEEFIQRINRQMRLQS 173
           SESVK E K   + NEF+KMTDEELNRRVEEFIQR N+QMRLQ+
Sbjct: 161 SESVKNEGKELEDENEFAKMTDEELNRRVEEFIQRFNKQMRLQT 201

BLAST of MC03g0677 vs. ExPASy TrEMBL
Match: A0A2N9GT34 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS30393 PE=4 SV=1)

HSP 1 Score: 95.9 bits (237), Expect = 3.10e-21
Identity = 84/188 (44.68%), Positives = 111/188 (59.04%), Query Frame = 0

Query: 1   AKGAAFFSSLASLILLSSIFTFKNLSFSSHLFNSTNFWFFISNALIILIIAADYNRIFSP 60
           A G +F++ L S+ +  SI    NLS  S LFN+T FWFF+SN LI LIIA DY   +S 
Sbjct: 27  ANGMSFYAFLFSIFIYISILYIFNLS-PSTLFNTTKFWFFLSNTLI-LIIAVDYGA-YSS 86

Query: 61  TQNKRLDFYEDYVSSNQ--NQTQILQS--SSLAVIVDEERETPEEKLQIVVGRRDFGSCQ 120
           ++ K+ DFY++YV   Q  N    +     ++  +V+  +E  +EK+Q    RR      
Sbjct: 87  SKEKQ-DFYQEYVMDTQVKNVPSFVSQHPETIKEVVEVLQEPEKEKIQAKPYRR------ 146

Query: 121 RSKSEKPKRKTMG-------RRSESVKYEAKG--EGNEFSKMTDEELNRRVEEFIQRINR 175
            SKSEK KR  +        R SE+ K+E     E NEFS M+DEELNRRVEEFIQR NR
Sbjct: 147 -SKSEKTKRVLIDESKNIVLRMSETAKHERTPSLEENEFSSMSDEELNRRVEEFIQRFNR 203

BLAST of MC03g0677 vs. ExPASy TrEMBL
Match: A0A7N2LQF9 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 91.7 bits (226), Expect = 2.03e-19
Identity = 77/212 (36.32%), Positives = 108/212 (50.94%), Query Frame = 0

Query: 1   AKGAAFFSSLASLILLSSIFTFKNLSFSSHLFNSTNFWFFISNALIILIIAADYNRIFSP 60
           AKG +F++SL S+ +  S+    NLS  S LF +T FWFF+SN LI LIIA DY    S 
Sbjct: 26  AKGLSFYASLFSIFIYISVLYIFNLS-PSALFYNTKFWFFLSNTLI-LIIAVDYGAYSSS 85

Query: 61  TQNKRLDFYEDYVSSNQ-----------------------------------NQTQILQS 120
             N++ D Y++YV   Q                                    + Q+   
Sbjct: 86  --NEKQDLYQEYVKRTQVKNVPSFVPQYQKIVKQSTPKQKVDSFQEKREVIVQEVQVFPE 145

Query: 121 SSLAVIVDEERETPEEKLQIVVGRRDFGSCQRSKSEKPKRKTMGRRSESVKYEA---KGE 174
            +L V++  + E P E L+  +  + +   +RSKSE+ KR  +      +   +   K E
Sbjct: 146 RNLQVVIKSDSEKPSEDLREKIKAKTY---RRSKSEQAKRVVIDESKNIITRSSETEKNE 205

BLAST of MC03g0677 vs. TAIR 10
Match: AT1G30190.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G34610.1); Has 56 Blast hits to 56 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 44; Viruses - 0; Other Eukaryotes - 12 (source: NCBI BLink). )

HSP 1 Score: 49.3 bits (116), Expect = 3.7e-06
Identity = 74/239 (30.96%), Positives = 104/239 (43.51%), Query Frame = 0

Query: 19  IFTFKNLSFSSHLFNSTNFWFFISNALIILIIAADYNRIFSPTQNKRLDFYEDY-VSSNQ 78
           IF    +S SS +F  T   FFISN L ILIIAADY   FS  +++  DFY +Y V++  
Sbjct: 39  IFHVFEVSLSS-VFKDTKVLFFISNTL-ILIIAADYGS-FSDKESQ--DFYGEYTVAAAT 98

Query: 79  NQTQILQSSSLAVIV---------------------DEERE---------TPEEKLQIVV 138
            + +    S + V+                      +EE E         +P EK+  VV
Sbjct: 99  MRNRADNYSPIPVLTYRENTKDGEIKNPKDVEFRNPEEEDEPMVKDIICVSPPEKIVRVV 158

Query: 139 G---RRDFGSCQ-----------------------------RSKSEKPKRKTMGRRSESV 174
               +RD  + +                             RSKS+KP+RK +   +E+ 
Sbjct: 159 SEKKQRDDVAMEEYKPVTEQTLASEEACNTRNHVNPNKPYGRSKSDKPRRKRLSVDTETT 218

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038877172.1 | 7.96e-39 | 56.38 | uncharacterized protein LOC120069472 [Benincasa hispida]
XP_022927149.1 | 1.89e-35 | 56.15 | uncharacterized protein LOC111434084 [Cucurbita moschata]
XP_023001467.1 | 2.67e-35 | 56.15 | uncharacterized protein LOC111495593 [Cucurbita maxima]
KAE8653186.1 | 1.15e-34 | 56.12 | hypothetical protein Csa_020019 [Cucumis sativus]
XP_023520225.1 | 3.54e-33 | 54.01 | uncharacterized protein LOC111783530 [Cucurbita pepo subsp. pepo]

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1EH73 | 9.16e-36 | 56.15 | uncharacterized protein LOC111434084 OS=Cucurbita moschata OX=3662 GN=LOC1114340...
A0A6J1KGM0 | 1.29e-35 | 56.15 | uncharacterized protein LOC111495593 OS=Cucurbita maxima OX=3661 GN=LOC111495593...
A0A0A0M033 | 2.11e-27 | 54.27 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G435810 PE=4 SV=1
A0A2N9GT34 | 3.10e-21 | 44.68 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS30393 PE=4 SV=1
A0A7N2LQF9 | 2.03e-19 | 36.32 | Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1

TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G30190.1 | 3.7e-06 | 30.96 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

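
The Identity and Positives percentages in each HSP header are ratios of matching (or conservatively substituted) columns to the alignment length. As a hedged illustration, the sketch below parses one header line from the first NCBI nr hit and recomputes both percentages; `hsp_percentages` is a hypothetical helper, and the regular expression accepts either spelling of "Positives" that may appear in the report.

```python
import re

# Matches e.g. "Identity = 106/188 (56.38%), Positives = 119/188 (63.30%)".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return recomputed and reported identity/positives percentages for one HSP line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_rep, pos, pos_len, pos_rep = m.groups()
    return {
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "identity_reported": float(ident_rep),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "positives_reported": float(pos_rep),
    }

line = "Identity = 106/188 (56.38%), Positives = 119/188 (63.30%), Query Frame = 0"
stats = hsp_percentages(line)
print(stats["identity_pct"], stats["positives_pct"])
```

For this hit the recomputed values agree with the reported ones to two decimal places.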
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR008480 | Protein of unknown function DUF761, plant | PFAM | PF05553 | DUF761 | coord: 151..173; e-value: 1.2E-6; score: 27.9
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 114..145
None | No IPR available | PANTHER | PTHR35997 | COTTON FIBER PROTEIN-RELATED | coord: 114..174
None | No IPR available | PANTHER | PTHR35997:SF6 | COTTON FIBER PROTEIN | coord: 4..97
None | No IPR available | PANTHER | PTHR35997:SF6 | COTTON FIBER PROTEIN | coord: 114..174
None | No IPR available | PANTHER | PTHR35997 | COTTON FIBER PROTEIN-RELATED | coord: 4..97
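
The coordinates in the Alignment column can be used to pull the annotated regions out of the protein sequence. The sketch below assumes they are 1-based, inclusive residue positions (not stated on this page); `region` is a hypothetical helper, and only the Pfam DUF761 hit and the mobidb-lite disorder prediction are shown.

```python
# Protein sequence of MC03g0677 as listed under "Protein sequence".
PROTEIN = (
    "AKGAAFFSSLASLILLSSIFTFKNLSFSSHLFNSTNFWFFISNALIILII"
    "AADYNRIFSPTQNKRLDFYEDYVSSNQNQTQILQSSSLAVIVDEERETPE"
    "EKLQIVVGRRDFGSCQRSKSEKPKRKTMGRRSESVKYEAKGEGNEFSKMT"
    "DEELNRRVEEFIQRINRQMRLQSSN"
)

def region(seq, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates outside the sequence")
    return seq[start - 1:end]

duf761 = region(PROTEIN, 151, 173)    # PFAM PF05553
disorder = region(PROTEIN, 114, 145)  # mobidb-lite disorder_prediction

print(len(PROTEIN))  # prints: 175
print(duf761)        # prints: DEELNRRVEEFIQRINRQMRLQS
print(disorder)      # prints: SCQRSKSEKPKRKTMGRRSESVKYEAKGEGNE
```

Under that coordinate assumption, the DUF761 hit covers the conserved C-terminal stretch that also aligns well in the BLAST hits above (MTDEELNRRVEEFIQR...).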

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
MC03g0677.1 | MC03g0677.1 | mRNA