ClCG06G001410 (gene) Watermelon (Charleston Gray)

Name: ClCG06G001410
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Unknown protein
Location: CG_Chr06 : 1472248 .. 1475980 (-)
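
As a quick check on the Location line, the length of the feature can be computed from its coordinates. The sketch below assumes the coordinates are 1-based and inclusive (common for GFF-style annotations, but not stated on this page); parse_location is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'CG_Chr06 : 1472248 .. 1475980 (-)' into (seqid, start, end, strand)."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("CG_Chr06 : 1472248 .. 1475980 (-)")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

The (-) suffix marks the reverse strand; under the stated assumption the gene spans 3733 bases.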
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGTATAAGGGAGTGAAATTAAAGACAGCTTATTTTAGTGGGGAAGCGATTGAAGGAGCAAGCAAAGCTGAAAATGCTGGCGAAAAGGTTCGTTTCCTTGTTCAAGCGCTCTCCAAATCCACAGTCTTCCAGGTGCGCCCTTCTTGTAATCCCCATTTTCGCTTTTTTTTTTCTTTTTTTTTTTTCTGATTTTTATTCACGCTTCCATTTATCAGAATTTTACACCAACAGTTACTAATTCACCCACCAACTTCAGTTGAACATGGAATCAATCGATTGCCCGCATTTTTTGCAAGAGATATATATCTGGGTTTTTCAATTTTTGCATGTTTTATGAAAAAGGAAACGTTGGACTTGATTTTTACAATTTTTTTTCATTGATTTTTAATGGATAGAATGGAACTTGGTGATCTCTTATTAAGCGTTGCTTGCACTTGCAGCAGTTCCATAAAGCCATCGGAGGATGGGGTGAATAGATCCTGGGGTCGGAAAGCAGTCTCTTTTGTACTTATTACTGTTACTGGTGGTGTAGCTTTGAGTGCTTTAGATGACCTTGCCATTTATCGTAGCTGTAGCAGGTATGTGTGACTTGTTACAGATTTTCAAATTTGGATGAATTATATCGTTAAAGCAAATGAAAATCTGAATTTGGCGAAAATAAACTCATGTATCTTGGGATTTCTCTTAATGCAAGTGTTTGCTGCCCTGTTTCACTTATTAGTGTTTGAAGTTCAAGGAAATTAGTGTAGTAAATTGGGGTATCTTGGTTAAATTCTTCATTAATTAGGGTTAGGGTTAGTTTATTTTTCATTCTCTATAAACATTTTATTGATTCTGGGAGAGAATTCTCCTTTTTAGTCACTTTGGCTACATCTGAAACTCTCACCATTGCTATGTTGTTTTCAACCTTTGAAATTGGCTATGATTTTACATTACTCAGGTCATTGGATCGAATAATATCTCATATTAGTATTCAATGATGTTCACATTATTGTTACTGAAAGAGCTCAGTTGTCAACCTTGGGGATACGAGAATGCAAGAATGCATTGTCTTTGGGTTTCTTTTACCTATTCAATTATTTGCTCTCTCTGTTTGGGGGCTTTGGATCTTTGTTATTTCCCTAATTACTCAGGTATATACTTTTTGGGATGAAATCGTGCAAATGGATGATAGATCTTCTAGGGCTTTACTTGGATGCATTTTTTTTTTTGTGTGTGTGTGTGTGTGTGTGTATTTGTTTTTGGCTTGGTGTATCTATATTTGACTGAGATCACCATTTTCAAGTTAGCCACCTTGTTACAATTAAGACGAGACCCTTTAGGATCTTATCATTATTTTTAGAAATATATAGGAATATGACTTAAGAACAAACTGAACTAGGACCAATTTTCCTTATTGAGAAATTGAAAGCTTCCCCAATTTTTTGGCTCTTGTCTAGATTGTCTTTGAACGTGGATTCCAAACAATTTTCTTCTTGGGGTGTACCTTCAAAAGGAAGACCAAGGTGTTTGAGATGCCCTAATAACACAAGTGAAATATTCTGCAATAGAATCCAGTTCTAATCTGATATGAGTTGTAAAAAATTGGATCAATTATTTTAACCTACAATTGATTCCAACTACAAAAATGTGTTACGGAGGTTGACTTTTTAAAGTTAACGTCTCTTTGATTTTAGCCTTAAGGAACAAGCCATTAGCAAACTGTTAGTGGTTGCTGGCCCTATTCCTAATACACACTGAAAACCTACCATGCCATTTTTAACCAAAGATTTAGTCAACAATCTACTTTGGTAGTCTACTTCTAACAAAAATGAAAAAGAAGAGACTTCATCTTGACGTAGTTGCCTAGAAGCAAGAATATGCCTGTTGGCTTATGATTCAATACTCAAAAGCATTGAGAAGTTTAGAGAAGAGAGACAGACTTTGATCCACACTCTATTTCCTACAAAATATTTTGAGCTTCATTTATAGAGTCCCAATTCACAAAATTATTGCCCTTC
CTTTTAGTGGATAACCGAGCTTGTATAATGAGAGAATGCAAGAGGGCACACAAAAAGACAACCCAACAAATGGAGCCAAACTATAACCCGAGTAACTATAAAACAAAGGGCTCCGTCAATGCAAAAGAACCCATTGATAATTCAATTTCATATGTGCGTGTATATATAGATAGATAGATTTAATATTTTGTCTCACGTTATGCTTGCATCTTGCATTGTGCCTTAAATATATTTGTTAAGGACCAGTGCTTTGTAATACTGCTCACAAAGGAATTTAGTCTGATAATTCTCTTTAATAAAATAAGGAACCAAACTTTGGTTCTCATGGACCTTTTCGACACATATGTGAAAGTTCTTAGGAATGGGATGCTAGGGTTCTCATGATTTTCCTTTGCAGCTGCTCGCTGGCGTAGACTTATTTTATTTTGTTTTGTTACATTTCTCAGCAAAGCCATAGAGAAAGCCAGAAACAATCAAGCAGTTATAGATGCTATTGGAGAACCCATTGCCAAAGGTCCATGGTACAATGCATCACTTGCAGTAGCTCATAAGAGACATTCTCTATCCTGCACGTTTCCAGTATCAGGACCACAAGGCACAGGGATCCTCCAACTGAAGGCAGTTCGTAATGGAGGTTGATATGCTTTTCCTGTCATCTGTTGCACATTTTGTTTTGTTGATTCAAGTTTTAGTTTCAATAGAAAATTAGAAATGGATAGACAAGTTTTTCTTTTTTCTATCATGTTTTGTTTGATCTTTGATATCTTCTATCTGATTCATCTCATTGCATTATATAAAAAATGATATTGTGCATGGAAACTCCATTGGAAAAAAAAAAAAATATACGGAGAAAACCTCTTGGGGGTAAATCTTGTTTCTAAAGAAGCCATCCAACTTAGCTGGCGTAACTTCAACCTGAATTATTTGGAAAAGGCGTTACTTTGAAAGTTTTTAGGTAAAACTGGAAGATTCTGAAGGTATTTGGTAGGTATAGAAATTTCTTCTATAGGTATTTAAGTTTTCAAACCGTGGTTGGCAGATTTAATTTCTTATGCTTTTTCTGCTTAAGATTAGTCTAGGCTGATCCTTACATTCATGTATATCAATTCTTTAGGTTTTACTTTTGCTTCAATATATTCAAAATGGAATTTTGTTGTATTTTTTTCTTTTTTTTACATGAATTATCGTCCTGTTTCAACTTTTTATTATCTTGTTATATATTACTCATGAGAGGAGAGAAAAGTTGACTTGACTTTTCTTCTTTTTATAAATATTTTGAATGGTAATACTAAGTTGGAATATCTCAACGGCAGATTCCTGGATTCCTTTTCTCCGGCCTCGAGACTGGGACATTCTGATCATGGATGCTCTCCTCCATGTTCCTGAAAACGAAGGTAAGCAGAAAACATTGCGTATTAATCTCACTGAGAAGTTTGCCTCCGCTGCTTGTGTCTCATGCACTGATTGTCAGCCTCCAGAGAAGAGATGAAGTCAGCTTATAAGTTATTGAGAAACTAATGTCAAAAGTTAGCAGTTTCTATTAGTTTTCTGTCCCGACATAATTTTGAGCTAACCATCAAAGCGTTCGTTATCCCTACCAAAATAATATTTTCTTTGAGGTTGAGTTTAAACCATGGTGGAACCAGTCTAGAAGATTTATTATTATACAACTTTGCTGACATCAAATGTGGCAAAAGTTATATGATTCTCCTGTAAAACCCATGTTATAAAAG

mRNA sequence

ATGTATAAGGGAGTGAAATTAAAGACAGCTTATTTTAGTGGGGAAGCGATTGAAGGAGCAAGCAAAGCTGAAAATGCTGGCGAAAAGGTTCGTTTCCTTGTTCAAGCGCTCTCCAAATCCACAGTCTTCCAGTTACTAATTCACCCACCAACTTCAGTTGAACATGGAATCAATCGATTGCCCGCATTTTTTGCAAGAGATATATATCTGGGTTTTTCAATTTTTGCATGTTTTATGAAAAAGGAAACCAGTTCCATAAAGCCATCGGAGGATGGGGTGAATAGATCCTGGGGTCGGAAAGCAGTCTCTTTTGTACTTATTACTGTTACTGGTGGTGTAGCTTTGAGTGCTTTAGATGACCTTGCCATTTATCGTAGCTGTAGCAGCAAAGCCATAGAGAAAGCCAGAAACAATCAAGCAGTTATAGATGCTATTGGAGAACCCATTGCCAAAGGTCCATGGTACAATGCATCACTTGCAGTAGCTCATAAGAGACATTCTCTATCCTGCACGTTTCCAGTATCAGGACCACAAGGCACAGGGATCCTCCAACTGAAGGCAGTTCGTAATGGAGATTCCTGGATTCCTTTTCTCCGGCCTCGAGACTGGGACATTCTGATCATGGATGCTCTCCTCCATGTTCCTGAAAACGAAGGTAAGCAGAAAACATTGCGTATTAATCTCACTGAGAAGTTTGCCTCCGCTGCTTGTGTCTCATGCACTGATTGTCAGCCTCCAGAGAAGAGATGAAGTCAGCTTATAAGTTATTGAGAAACTAATGTCAAAAGTTAGCAGTTTCTATTAGTTTTCTGTCCCGACATAATTTTGAGCTAACCATCAAAGCGTTCGTTATCCCTACCAAAATAATATTTTCTTTGAGGTTGAGTTTAAACCATGGTGGAACCAGTCTAGAAGATTTATTATTATACAACTTTGCTGACATCAAATGTGGCAAAAGTTATATGATTCTCCTGTAAAACCCATGTTATAAAAG

Coding sequence (CDS)

ATGTATAAGGGAGTGAAATTAAAGACAGCTTATTTTAGTGGGGAAGCGATTGAAGGAGCAAGCAAAGCTGAAAATGCTGGCGAAAAGGTTCGTTTCCTTGTTCAAGCGCTCTCCAAATCCACAGTCTTCCAGTTACTAATTCACCCACCAACTTCAGTTGAACATGGAATCAATCGATTGCCCGCATTTTTTGCAAGAGATATATATCTGGGTTTTTCAATTTTTGCATGTTTTATGAAAAAGGAAACCAGTTCCATAAAGCCATCGGAGGATGGGGTGAATAGATCCTGGGGTCGGAAAGCAGTCTCTTTTGTACTTATTACTGTTACTGGTGGTGTAGCTTTGAGTGCTTTAGATGACCTTGCCATTTATCGTAGCTGTAGCAGCAAAGCCATAGAGAAAGCCAGAAACAATCAAGCAGTTATAGATGCTATTGGAGAACCCATTGCCAAAGGTCCATGGTACAATGCATCACTTGCAGTAGCTCATAAGAGACATTCTCTATCCTGCACGTTTCCAGTATCAGGACCACAAGGCACAGGGATCCTCCAACTGAAGGCAGTTCGTAATGGAGATTCCTGGATTCCTTTTCTCCGGCCTCGAGACTGGGACATTCTGATCATGGATGCTCTCCTCCATGTTCCTGAAAACGAAGGTAAGCAGAAAACATTGCGTATTAATCTCACTGAGAAGTTTGCCTCCGCTGCTTGTGTCTCATGCACTGATTGTCAGCCTCCAGAGAAGAGATGA

Protein sequence

MYKGVKLKTAYFSGEAIEGASKAENAGEKVRFLVQALSKSTVFQLLIHPPTSVEHGINRLPAFFARDIYLGFSIFACFMKKETSSIKPSEDGVNRSWGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNGDSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFASAACVSCTDCQPPEKR
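
The protein sequence above is the conceptual translation of the CDS. A minimal sketch, assuming the standard genetic code (the page does not name the translation table), translates the first 14 codons and the final 12 nucleotides of the CDS so they can be compared with the start and end of the protein. The helper translate is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGTATAAGGGAGTGAAATTAAAGACAGCTTATTTTAGTGGG"  # first 14 codons of the CDS
cds_end = "GAGAAGAGATGA"                                 # last 4 codons, including the stop

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MYKGVKLKTAYFSG and EKR, which agree with the first and last residues of the listed protein.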
BLAST of ClCG06G001410 vs. TrEMBL
Match: A0A0A0KF26_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G087990 PE=4 SV=1)

HSP 1 Score: 306.6 bits (784), Expect = 2.8e-80
Identity = 151/166 (90.96%), Positives = 159/166 (95.78%), Query Frame = 1

Query: 83  TSSIKPSEDGVNRSWGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVI 142
           ++SIKP+E+ VN+SWGRKAVSFVLITVTGGVALSALDDLAIY SCSSKAIEK RNNQAVI
Sbjct: 20  SNSIKPTENEVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKVRNNQAVI 79

Query: 143 DAIGEPIAKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNG-DSWIPFLRPR 202
           DAIGEPI KGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNG DSWI FLRPR
Sbjct: 80  DAIGEPIDKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPR 139

Query: 203 DWDILIMDALLHVPENEGKQKTLRINLTEKFASAACVSCTDCQPPE 248
           DWDIL+MDALL+VPENEGKQKTLRINL+EKFA AACVSCTDCQPPE
Sbjct: 140 DWDILMMDALLYVPENEGKQKTLRINLSEKFAPAACVSCTDCQPPE 185
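
The percentages reported on the Identity line of each HSP are the match counts divided by the alignment length. A minimal sketch of that arithmetic, using the counts from the HSP above (the function name percent is hypothetical):

```python
def percent(matches, alignment_length):
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    return round(100.0 * matches / alignment_length, 2)

identity = percent(151, 166)   # identical residues / aligned positions
positives = percent(159, 166)  # identical or conservatively substituted residues
print(identity, positives)
```

This reproduces the 90.96% and 95.78% shown for the Cucumis sativus hit.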

BLAST of ClCG06G001410 vs. TrEMBL
Match: A0A061DNP0_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_003906 PE=4 SV=1)

HSP 1 Score: 249.2 bits (635), Expect = 5.2e-63
Identity = 123/180 (68.33%), Positives = 148/180 (82.22%), Query Frame = 1

Query: 73  SIFACFMKKETSSIKPSEDGVN----RSWGRKAVSFVLITVTGGVALSALDDLAIYRSCS 132
           ++ + F    +S +  + + VN    +S+ RKAVSFVLITVTGGVALSALDDLAIY  CS
Sbjct: 5   TLVSFFKNSPSSKVSSTGNSVNEEKSKSFVRKAVSFVLITVTGGVALSALDDLAIYHGCS 64

Query: 133 SKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAV 192
           SKA+EKA  NQA+IDAIGEPI KGPWYNASLAVAHKRHS+SCTFPVSGPQGTG+LQLKAV
Sbjct: 65  SKAMEKASKNQAIIDAIGEPIEKGPWYNASLAVAHKRHSVSCTFPVSGPQGTGVLQLKAV 124

Query: 193 RNG-DSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFASAACVSCTDCQPPE 248
           RNG D+W  ++ PRDW+ILIM+ALLHVP NE KQ+TLRI+L EK  S AC++CT+C+P +
Sbjct: 125 RNGDDNWYSYILPRDWEILIMEALLHVPGNEEKQQTLRISLLEKTPSPACIACTECRPQQ 184

BLAST of ClCG06G001410 vs. TrEMBL
Match: V4TYY0_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10022455mg PE=4 SV=1)

HSP 1 Score: 245.7 bits (626), Expect = 5.8e-62
Identity = 120/165 (72.73%), Positives = 139/165 (84.24%), Query Frame = 1

Query: 81  KETSSIKPSEDGVNRSWGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQA 140
           K +S+ K  ++  ++S+GRKAVSFVLITVTGGVALSALDDLAIY SCSSKA+EKA  NQA
Sbjct: 17  KASSAAKSVDEEKSKSFGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAMEKASKNQA 76

Query: 141 VIDAIGEPIAKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNGD-SWIPFLR 200
           VIDAIGEPI KGPWYNASLAV H+RHS+SCTFPVSGPQG GI QLKAVRNGD  W  FL 
Sbjct: 77  VIDAIGEPIKKGPWYNASLAVTHQRHSVSCTFPVSGPQGNGIFQLKAVRNGDPGWFAFLG 136

Query: 201 PRDWDILIMDALLHVPENEGKQKTLRINLTEKFASAACVSCTDCQ 245
           PRDW+ILIMDA  HVP NEG+Q+TL+INL + F+ + C +CTDC+
Sbjct: 137 PRDWEILIMDARFHVPGNEGQQQTLKINLLDPFSPSDCKACTDCK 181

BLAST of ClCG06G001410 vs. TrEMBL
Match: W9RB84_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_015872 PE=4 SV=1)

HSP 1 Score: 242.3 bits (617), Expect = 6.4e-61
Identity = 117/171 (68.42%), Positives = 141/171 (82.46%), Query Frame = 1

Query: 81  KETSSIKPSEDGVNRSWGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQA 140
           K +SS  P ++ +++S+GRKAVSF+LITVTGGVALSALDDLA+Y SCS KA+EK  NNQ 
Sbjct: 17  KLSSSKNPVDEEIDKSFGRKAVSFILITVTGGVALSALDDLALYHSCSRKALEKIGNNQQ 76

Query: 141 VIDAIGEPIAKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNG-DSWIPFLR 200
           + DA+GEPI KGPWYNASLAVAHKR S+SCTFPVSGPQGTG+LQLKAVRNG D+W  FLR
Sbjct: 77  IKDALGEPIVKGPWYNASLAVAHKRQSVSCTFPVSGPQGTGVLQLKAVRNGEDTWFSFLR 136

Query: 201 PRDWDILIMDALLHVPENEGKQKTLRINLTEKF--ASAACVSCTDCQPPEK 249
           PRDWDI+IMDALLHVP N+ K +T RI++++ F     AC +CTDC  P +
Sbjct: 137 PRDWDIIIMDALLHVPGNDEKHQTFRISVSDYFPPPPQACTACTDCSKPRE 187

BLAST of ClCG06G001410 vs. TrEMBL
Match: A0A059B7U8_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_H04779 PE=4 SV=1)

HSP 1 Score: 241.5 bits (615), Expect = 1.1e-60
Identity = 117/168 (69.64%), Positives = 141/168 (83.93%), Query Frame = 1

Query: 83  TSSIKPSEDGVNRSWGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVI 142
           +SS +PS +G +  + RKAVSFVLITVTGGVALSALDD AIY +CSSKA++KA  N+A+I
Sbjct: 19  SSSAQPSNEGKS-GYARKAVSFVLITVTGGVALSALDDFAIYNACSSKAVDKASKNKAII 78

Query: 143 DAIGEPIAKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNG-DSWIPFLRPR 202
           DAIGEPI KGPWYNASLAVAH+RHS+SCTFPVSGPQG+GI QLKAVRNG D+W+ FLRPR
Sbjct: 79  DAIGEPIKKGPWYNASLAVAHQRHSVSCTFPVSGPQGSGIFQLKAVRNGDDTWLSFLRPR 138

Query: 203 DWDILIMDALLHVPENEGKQKTLRINLTEKFASAACVSCTDCQPPEKR 250
           DW+ILIM+ALLHVPENE KQ+T RI+L++      C +CT C  P ++
Sbjct: 139 DWEILIMEALLHVPENEEKQRTFRISLSDDLPPPDCNACTSCSTPGQK 185

BLAST of ClCG06G001410 vs. TAIR10
Match: AT2G20390.2 (AT2G20390.2 unknown protein)

HSP 1 Score: 167.9 bits (424), Expect = 7.7e-42
Identity = 83/117 (70.94%), Positives = 94/117 (80.34%), Query Frame = 1

Query: 75  FACFMKKETSSIKPSEDGVNRSWGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEK 134
           F  F K  ++S      G   S+GRKAVSFVLITVTGGVALSALDDL+IYR CSSKA+EK
Sbjct: 6   FTSFFKGSSTSSPDKTAGTLGSFGRKAVSFVLITVTGGVALSALDDLSIYRGCSSKAMEK 65

Query: 135 ARNNQAVIDAIGEPIAKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNG 192
             N++ +I+AIGEPI KGPWYNASLAV+H+RHS+SC+FPV GPQGTGIL LKAVRNG
Sbjct: 66  VMNSKVMIEAIGEPIEKGPWYNASLAVSHQRHSVSCSFPVIGPQGTGILHLKAVRNG 122


HSP 2 Score: 62.4 bits (150), Expect = 4.6e-10
Identity = 28/39 (71.79%), Positives = 31/39 (79.49%), Query Frame = 1

Query: 192 DSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTE 231
           DS   FL+ RDWDILIMDAL+HVP NEG Q+TLRIN+T+
Sbjct: 160 DSMFGFLQQRDWDILIMDALVHVPSNEGPQQTLRINVTD 198

BLAST of ClCG06G001410 vs. NCBI nr
Match: gi|449445634|ref|XP_004140577.1| (PREDICTED: uncharacterized protein LOC101206927 [Cucumis sativus])

HSP 1 Score: 306.6 bits (784), Expect = 4.0e-80
Identity = 151/166 (90.96%), Positives = 159/166 (95.78%), Query Frame = 1

Query: 83  TSSIKPSEDGVNRSWGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVI 142
           ++SIKP+E+ VN+SWGRKAVSFVLITVTGGVALSALDDLAIY SCSSKAIEK RNNQAVI
Sbjct: 20  SNSIKPTENEVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKVRNNQAVI 79

Query: 143 DAIGEPIAKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNG-DSWIPFLRPR 202
           DAIGEPI KGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNG DSWI FLRPR
Sbjct: 80  DAIGEPIDKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPR 139

Query: 203 DWDILIMDALLHVPENEGKQKTLRINLTEKFASAACVSCTDCQPPE 248
           DWDIL+MDALL+VPENEGKQKTLRINL+EKFA AACVSCTDCQPPE
Sbjct: 140 DWDILMMDALLYVPENEGKQKTLRINLSEKFAPAACVSCTDCQPPE 185

BLAST of ClCG06G001410 vs. NCBI nr
Match: gi|659120019|ref|XP_008459968.1| (PREDICTED: uncharacterized protein LOC103498925 isoform X1 [Cucumis melo])

HSP 1 Score: 305.8 bits (782), Expect = 6.7e-80
Identity = 152/166 (91.57%), Positives = 158/166 (95.18%), Query Frame = 1

Query: 83  TSSIKPSEDGVNRSWGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVI 142
           ++SIKPSE+ VN+SWGRKAVSFVLITVTGGVALSALDDLAIY SCSSKAIEKARNNQAV 
Sbjct: 39  SNSIKPSENEVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVK 98

Query: 143 DAIGEPIAKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNG-DSWIPFLRPR 202
           DAIGEPIAKGPWYNASLAVAHKRHSLSCTFPVSGPQG GILQLKAVRNG DSWI FLRPR
Sbjct: 99  DAIGEPIAKGPWYNASLAVAHKRHSLSCTFPVSGPQGAGILQLKAVRNGEDSWISFLRPR 158

Query: 203 DWDILIMDALLHVPENEGKQKTLRINLTEKFASAACVSCTDCQPPE 248
           DWDIL+MDALL+VPENEGKQKTLRINLTEKFA AACVSCT CQPPE
Sbjct: 159 DWDILMMDALLYVPENEGKQKTLRINLTEKFAPAACVSCTGCQPPE 204

BLAST of ClCG06G001410 vs. NCBI nr
Match: gi|659120021|ref|XP_008459969.1| (PREDICTED: uncharacterized protein LOC103498925 isoform X2 [Cucumis melo])

HSP 1 Score: 305.4 bits (781), Expect = 8.8e-80
Identity = 152/165 (92.12%), Positives = 157/165 (95.15%), Query Frame = 1

Query: 84  SSIKPSEDGVNRSWGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAIEKARNNQAVID 143
           +SIKPSE+ VN+SWGRKAVSFVLITVTGGVALSALDDLAIY SCSSKAIEKARNNQAV D
Sbjct: 9   NSIKPSENEVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVKD 68

Query: 144 AIGEPIAKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNG-DSWIPFLRPRD 203
           AIGEPIAKGPWYNASLAVAHKRHSLSCTFPVSGPQG GILQLKAVRNG DSWI FLRPRD
Sbjct: 69  AIGEPIAKGPWYNASLAVAHKRHSLSCTFPVSGPQGAGILQLKAVRNGEDSWISFLRPRD 128

Query: 204 WDILIMDALLHVPENEGKQKTLRINLTEKFASAACVSCTDCQPPE 248
           WDIL+MDALL+VPENEGKQKTLRINLTEKFA AACVSCT CQPPE
Sbjct: 129 WDILMMDALLYVPENEGKQKTLRINLTEKFAPAACVSCTGCQPPE 173

BLAST of ClCG06G001410 vs. NCBI nr
Match: gi|590715325|ref|XP_007050163.1| (Uncharacterized protein TCM_003906 [Theobroma cacao])

HSP 1 Score: 249.2 bits (635), Expect = 7.5e-63
Identity = 123/180 (68.33%), Positives = 148/180 (82.22%), Query Frame = 1

Query: 73  SIFACFMKKETSSIKPSEDGVN----RSWGRKAVSFVLITVTGGVALSALDDLAIYRSCS 132
           ++ + F    +S +  + + VN    +S+ RKAVSFVLITVTGGVALSALDDLAIY  CS
Sbjct: 5   TLVSFFKNSPSSKVSSTGNSVNEEKSKSFVRKAVSFVLITVTGGVALSALDDLAIYHGCS 64

Query: 133 SKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAV 192
           SKA+EKA  NQA+IDAIGEPI KGPWYNASLAVAHKRHS+SCTFPVSGPQGTG+LQLKAV
Sbjct: 65  SKAMEKASKNQAIIDAIGEPIEKGPWYNASLAVAHKRHSVSCTFPVSGPQGTGVLQLKAV 124

Query: 193 RNG-DSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFASAACVSCTDCQPPE 248
           RNG D+W  ++ PRDW+ILIM+ALLHVP NE KQ+TLRI+L EK  S AC++CT+C+P +
Sbjct: 125 RNGDDNWYSYILPRDWEILIMEALLHVPGNEEKQQTLRISLLEKTPSPACIACTECRPQQ 184

BLAST of ClCG06G001410 vs. NCBI nr
Match: gi|1009165641|ref|XP_015901150.1| (PREDICTED: uncharacterized protein LOC107434225 [Ziziphus jujuba])

HSP 1 Score: 247.7 bits (631), Expect = 2.2e-62
Identity = 119/177 (67.23%), Positives = 144/177 (81.36%), Query Frame = 1

Query: 73  SIFACFMKKETSSIKPSEDGVNRSWGRKAVSFVLITVTGGVALSALDDLAIYRSCSSKAI 132
           S F    K + SS    ++G N+S+GRKAVSFVLITVTGG+ALSALDDLAIY  CSSKA+
Sbjct: 8   SFFKRSPKLQVSSSASPDEGNNKSFGRKAVSFVLITVTGGIALSALDDLAIYHGCSSKAM 67

Query: 133 EKARNNQAVIDAIGEPIAKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNG- 192
           EKA  NQA+ DAIGEPI KGPWYNASLAVAHKR+S+SC+FPVSGP GTG+LQLKA+RNG 
Sbjct: 68  EKASENQAIKDAIGEPIMKGPWYNASLAVAHKRNSVSCSFPVSGPHGTGVLQLKAIRNGE 127

Query: 193 DSWIPFLRPRDWDILIMDALLHVPENEGKQKTLRINLTEKFASAACVSCTDCQPPEK 249
           D+W  F RPRDWDI+IMDALLH+P NE KQ+T+R+++++ F   AC +CT C+  EK
Sbjct: 128 DTWFSFFRPRDWDIIIMDALLHIPGNEEKQQTMRVSVSDYFPPPACTACTGCEVGEK 184

The following BLAST results are available for this feature:
Match Name        E-value  Identity (%)  Description
A0A0A0KF26_CUCSA  2.8e-80  90.96         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G087990 PE=4 SV=1
A0A061DNP0_THECC  5.2e-63  68.33         Uncharacterized protein OS=Theobroma cacao GN=TCM_003906 PE=4 SV=1
V4TYY0_9ROSI      5.8e-62  72.73         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10022455mg PE=4 SV=1
W9RB84_9ROSA      6.4e-61  68.42         Uncharacterized protein OS=Morus notabilis GN=L484_015872 PE=4 SV=1
A0A059B7U8_EUCGR  1.1e-60  69.64         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_H04779 PE=4 SV=1

Match Name   E-value  Identity (%)  Description
AT2G20390.2  7.7e-42  70.94         unknown protein

Match Name                         E-value  Identity (%)  Description
gi|449445634|ref|XP_004140577.1|   4.0e-80  90.96         PREDICTED: uncharacterized protein LOC101206927 [Cucumis sativus]
gi|659120019|ref|XP_008459968.1|   6.7e-80  91.57         PREDICTED: uncharacterized protein LOC103498925 isoform X1 [Cucumis melo]
gi|659120021|ref|XP_008459969.1|   8.8e-80  92.12         PREDICTED: uncharacterized protein LOC103498925 isoform X2 [Cucumis melo]
gi|590715325|ref|XP_007050163.1|   7.5e-63  68.33         Uncharacterized protein TCM_003906 [Theobroma cacao]
gi|1009165641|ref|XP_015901150.1|  2.2e-62  67.23         PREDICTED: uncharacterized protein LOC107434225 [Ziziphus jujuba]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR014807  Cyt_oxidase_assembly-1

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG06G001410.1  ClCG06G001410.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description                        Source   Source Term    Source Description   Alignment
IPR014807  Cytochrome oxidase assembly protein 1  PFAM     PF08695        Coa1                 coord: 116..193; score: 1.
None       No IPR available                       PANTHER  PTHR35114      FAMILY NOT NAMED     coord: 84..249; score: 1.3
None       No IPR available                       PANTHER  PTHR35114:SF1  SUBFAMILY NOT NAMED  coord: 84..249; score: 1.3