CmoCh17G003920 (gene) Cucurbita moschata (Rifu)

Name: CmoCh17G003920
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Pigment defective 320 protein
Location: Cmo_Chr17 : 2452902 .. 2455885 (-)
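The location field can be read mechanically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for such records, but not stated on this page); the function name `parse_location` is hypothetical and used only for illustration.

```python
import re

# Location value copied from the record above.
LOCATION = "Cmo_Chr17 : 2452902 .. 2455885 (-)"

def parse_location(text):
    """Split a 'chrom : start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location(LOCATION)

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, strand, span)
```

Under that assumption the feature spans 2984 positions on the minus strand of Cmo_Chr17.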
The following sequences are available for this feature:

Gene sequence (with intron)

GAAATGAGTAGAGGAGGAGGAGGATAAGGTGTGGCGGAATTGCTTCAGAATCAGAACCTGTAAGCGTTCCGATCGGTGGAGATTTGGATTTGAATTTGAATTTTGAGTTTTAGGGTTTAGGGTTGTGTTCTTTGTGAAGGTGTTCAGGATGATGAAGAAGCTAAGATGGACTATGGAGGGCCAAAGCTTTTGGGATTTGGATGTTTCAACGCCTAGAACACTCGATGGCTCGGCCTCCCCTGTTCCTACTGATTTGCAGCTACTTCCCTTGGGATTGTCCAGAGGTGTTCGGCTTTCCAGGGCCAAGCAGATCGACTTCATGCAGCAGTTCATGGCTGCTCCTTTTGTTCCTTCTTTCACCCCTTCCCATGGCTTCTCTCTCCAGCGCGTCTTCTCCATCCCCTTTTCGGACTCTGGGTAACTCCATCCTCTCTATTTTGTTCTTGGATTTTTCTCATTTCCTATGTTCATGCACTCATTTGTATTGATTCTCCATCAAATCTGGCTTAAAAGCACTGCAGGTCCGCTACTCTTTTAGGTCAGTTCAATGTGCAGAAATTCGTGTCCTCTCTTAAGAAATCTGGTTTTGGAGAGATGGGTCAGTCGATTTCCTCATTGCTGCAAGGCATTGGAAGGCACCTCCGCGACCGATCTTTGTATGCGTTTGGTATTTCTTCTGATATCTTGTTAACTCCTGATGATGCCCTGTTGATCAGCTTTGATGGATATGGTGACAGTGACATACTTAGAACAAAAGCAGTACTCCACCACAAGGCATGTGTATCCCTGGCCTCTAATTGAATTTGTTAATTCTGTTAGCTGCTTGTTCATTGTTTGATGTAGATTCTTATTCATCCCGTTGCAGTTCTTACATCATGATCTAACAATGGAGGCGCTTTCTCCAGGGCTTTTTGTAGACAAATCTGGTAAATATTGGGATGTGCCTTCTTCGTTAGTCATTGATCTAGGGTCTGCGGATACCGACTCGGGTCTGAGTTATCACTTGTCTATGCACCACAATGCTGGGTCTCCCTCACAATCTGGAAGTGAACAAACCTGTATGGCTCCTTTCTGTTTACTTCCTGGTTTATCAGCCAAGGCTGCGTTTGCCTTAAAGAAGAACTTGGAAATTTGGAGAAGCAACGCCAAGAAGTTGAAGAGGGTGCAACCATATGACATTTTCCTAGCAAATCCTCACGTTTCATTGTCAGGGATCATTGGTATGAATCCAAACTGGTTTCCTTTAATGAAACTTCTGATTATGGATTTATCCCCCAAGTTTTTTGCCATGTTTTAATTCTGTGTCAAATTTTATTGAAGTCTTAATTCAACTCCAAGGCTAGGAAATCGAATAAACAAATAAATGAAATCTCACCAGCCTCGTGGTTTAAGGTTTAATCTCTTTGATTTAGGTGTTGATGTTCATATACAATACTTCGAATGTGATGAGACCATATATTTAGTTGATTTCAATCACGTTAATGAGTAGGGACTTCTTGAAGTTCATCAATATATTGAGGATAAATTGGAACTTTTATAATATTAGGAGCAAAATGTTGTGAAATTTAAGGGACCATATTGTAGCTGCGACCTCATTTTAACTTGCAATTGCCAACTTGGCCATTGAAATTCAAGGATTTGACAAACTCAGAAACCACCATAGTTGGACCCTTAATTCATGTTCAAAATTCTTCATTTTTGCACCAAGGAAACGAAGGAATTTAGTTTAGGATACTTCTGTTGGTATGCAGTTTAGCAGATTGTATGAAACTGCAAGCAATCCCTTGGAAAATTATTATTGTGCAATTTCTACTAATTTTTGCATACCATTTCTATTGATCTTAAAAGGTGCTGTTGCTACTAGCTACTTTGGAGACATTTCGGCTGGATCGGCAGCAGAAGGCAGTCTTCAGGAGTTTAGAGGACTTTACATGCAGACTTCTCGAATAAGATCTACTGTTTTTGCAGATGTATTTGCTTCCATTTCTTTTTCAGCTCAG
TATGGGATGTTTCAAAGGAACTTTCTGGATCTTACCCGTTTTTCTGGACGTTTCGATTTCCATTCTGGCTCCAAATTCCTTTCTGGAGCCATGCTCTTGATAGAGGATCTTTCCAATTCCCAGCACCCAAGAACCGAATCCGTGAAAGCGACCTTGCCGAATGCGAGATTTTCCTTTCAGCAGCAGGTACGTATGTTCGTTCTTCATTCAGCTGCTGAGTCTGGTCCAAATCAACAATAAAAAACTTGAAAAGAAAATAAAACAAATCTTGCTAGCACATGTACTTCCTCGTATGATCATCTCTTCTGCTTTTTGGATCTGTGAAGTTCTGGTTTCATGAGTTTCGGAAGTTATTTATGAAATTGTCTCCTGAGTCGAACCCGTAATAAATAATAAAGGATCTTCGCACTTGATTGAACTTTTCCATGATTTTGGAGAAGTTTGAAAGGAAAATGATGATAAATGTTTCGATCGAATTCAGTGGCAAAAACTCGTGGAAAAACCTTCTTTACAACTTTGTATAAGGCCACTGATTCGGGGGATGTTTTCGTGTTTTCGTGAAACATTGGTTGCAGATTGCTGGACCTGTTAGCTTTAGAGCAGATTCAGGAGTTGCTATAGATTTGAGTAAAGCAGGATGGGGGAGTTTACAAGTGGAGGAGCCTACATTTGCGTTGGAATATGCGTTGTACGCCCTTGGTTCAGCTAAAGCCATCGCTTGGTATTCACCAAAGCAAAGAGAATTTATGGTAGAGCTCCGTTTCTATGAGAACTGAAAGAGCCCATGTTTCGATTTCGTTTTCGTTTTCGTTTTCGTTTTCGTGTCCTGTTAGACCAATGTTGCCACTTCTATTCTTTATTTTATTTACTCTGAGTCGTGATAGTTCGAACTTGAGAGTATATGTCTTAATCGGATAACCGAGTATCCTCGGTTGACTGTTGGAAATGGCAATATGAACACATTACATCAAGCAAAACTGGCTC

mRNA sequence

GAAATGAGTAGAGGAGGAGGAGGATAAGGTGTGGCGGAATTGCTTCAGAATCAGAACCTGTGTTCAGGATGATGAAGAAGCTAAGATGGACTATGGAGGGCCAAAGCTTTTGGGATTTGGATGTTTCAACGCCTAGAACACTCGATGGCTCGGCCTCCCCTGTTCCTACTGATTTGCAGCTACTTCCCTTGGGATTGTCCAGAGGTGTTCGGCTTTCCAGGGCCAAGCAGATCGACTTCATGCAGCAGTTCATGGCTGCTCCTTTTGTTCCTTCTTTCACCCCTTCCCATGGCTTCTCTCTCCAGCGCGTCTTCTCCATCCCCTTTTCGGACTCTGGGTCCGCTACTCTTTTAGGTCAGTTCAATGTGCAGAAATTCGTGTCCTCTCTTAAGAAATCTGGTTTTGGAGAGATGGGTCAGTCGATTTCCTCATTGCTGCAAGGCATTGGAAGGCACCTCCGCGACCGATCTTTGTATGCGTTTGGTATTTCTTCTGATATCTTGTTAACTCCTGATGATGCCCTGTTGATCAGCTTTGATGGATATGGTGACAGTGACATACTTAGAACAAAAGCAGTACTCCACCACAAGTTCTTACATCATGATCTAACAATGGAGGCGCTTTCTCCAGGGCTTTTTGTAGACAAATCTGGTAAATATTGGGATGTGCCTTCTTCGTTAGTCATTGATCTAGGGTCTGCGGATACCGACTCGGGTCTGAGTTATCACTTGTCTATGCACCACAATGCTGGGTCTCCCTCACAATCTGGAAGTGAACAAACCTGTATGGCTCCTTTCTGTTTACTTCCTGGTTTATCAGCCAAGGCTGCGTTTGCCTTAAAGAAGAACTTGGAAATTTGGAGAAGCAACGCCAAGAAGTTGAAGAGGGTGCAACCATATGACATTTTCCTAGCAAATCCTCACGTTTCATTGTCAGGGATCATTGGTGCTGTTGCTACTAGCTACTTTGGAGACATTTCGGCTGGATCGGCAGCAGAAGGCAGTCTTCAGGAGTTTAGAGGACTTTACATGCAGACTTCTCGAATAAGATCTACTGTTTTTGCAGATGTATTTGCTTCCATTTCTTTTTCAGCTCAGTATGGGATGTTTCAAAGGAACTTTCTGGATCTTACCCGTTTTTCTGGACGTTTCGATTTCCATTCTGGCTCCAAATTCCTTTCTGGAGCCATGCTCTTGATAGAGGATCTTTCCAATTCCCAGCACCCAAGAACCGAATCCGTGAAAGCGACCTTGCCGAATGCGAGATTTTCCTTTCAGCAGCAGATTGCTGGACCTGTTAGCTTTAGAGCAGATTCAGGAGTTGCTATAGATTTGAGTAAAGCAGGATGGGGGAGTTTACAAGTGGAGGAGCCTACATTTGCGTTGGAATATGCGTTGTACGCCCTTGGTTCAGCTAAAGCCATCGCTTGGTATTCACCAAAGCAAAGAGAATTTATGGTAGAGCTCCGTTTCTATGAGAACTGAAAGAGCCCATGTTTCGATTTCGTTTTCGTTTTCGTTTTCGTTTTCGTGTCCTGTTAGACCAATGTTGCCACTTCTATTCTTTATTTTATTTACTCTGAGTCGTGATAGTTCGAACTTGAGAGTATATGTCTTAATCGGATAACCGAGTATCCTCGGTTGACTGTTGGAAATGGCAATATGAACACATTACATCAAGCAAAACTGGCTC

Coding sequence (CDS)

ATGATGAAGAAGCTAAGATGGACTATGGAGGGCCAAAGCTTTTGGGATTTGGATGTTTCAACGCCTAGAACACTCGATGGCTCGGCCTCCCCTGTTCCTACTGATTTGCAGCTACTTCCCTTGGGATTGTCCAGAGGTGTTCGGCTTTCCAGGGCCAAGCAGATCGACTTCATGCAGCAGTTCATGGCTGCTCCTTTTGTTCCTTCTTTCACCCCTTCCCATGGCTTCTCTCTCCAGCGCGTCTTCTCCATCCCCTTTTCGGACTCTGGGTCCGCTACTCTTTTAGGTCAGTTCAATGTGCAGAAATTCGTGTCCTCTCTTAAGAAATCTGGTTTTGGAGAGATGGGTCAGTCGATTTCCTCATTGCTGCAAGGCATTGGAAGGCACCTCCGCGACCGATCTTTGTATGCGTTTGGTATTTCTTCTGATATCTTGTTAACTCCTGATGATGCCCTGTTGATCAGCTTTGATGGATATGGTGACAGTGACATACTTAGAACAAAAGCAGTACTCCACCACAAGTTCTTACATCATGATCTAACAATGGAGGCGCTTTCTCCAGGGCTTTTTGTAGACAAATCTGGTAAATATTGGGATGTGCCTTCTTCGTTAGTCATTGATCTAGGGTCTGCGGATACCGACTCGGGTCTGAGTTATCACTTGTCTATGCACCACAATGCTGGGTCTCCCTCACAATCTGGAAGTGAACAAACCTGTATGGCTCCTTTCTGTTTACTTCCTGGTTTATCAGCCAAGGCTGCGTTTGCCTTAAAGAAGAACTTGGAAATTTGGAGAAGCAACGCCAAGAAGTTGAAGAGGGTGCAACCATATGACATTTTCCTAGCAAATCCTCACGTTTCATTGTCAGGGATCATTGGTGCTGTTGCTACTAGCTACTTTGGAGACATTTCGGCTGGATCGGCAGCAGAAGGCAGTCTTCAGGAGTTTAGAGGACTTTACATGCAGACTTCTCGAATAAGATCTACTGTTTTTGCAGATGTATTTGCTTCCATTTCTTTTTCAGCTCAGTATGGGATGTTTCAAAGGAACTTTCTGGATCTTACCCGTTTTTCTGGACGTTTCGATTTCCATTCTGGCTCCAAATTCCTTTCTGGAGCCATGCTCTTGATAGAGGATCTTTCCAATTCCCAGCACCCAAGAACCGAATCCGTGAAAGCGACCTTGCCGAATGCGAGATTTTCCTTTCAGCAGCAGATTGCTGGACCTGTTAGCTTTAGAGCAGATTCAGGAGTTGCTATAGATTTGAGTAAAGCAGGATGGGGGAGTTTACAAGTGGAGGAGCCTACATTTGCGTTGGAATATGCGTTGTACGCCCTTGGTTCAGCTAAAGCCATCGCTTGGTATTCACCAAAGCAAAGAGAATTTATGGTAGAGCTCCGTTTCTATGAGAACTGA
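As a quick consistency check between the coding sequence and the BLAST query lines, the start of the CDS can be translated with the standard genetic code. This is a minimal sketch: only the first 36 nucleotides of the CDS are used, the standard (nuclear) code is an assumption, and `translate` is a hypothetical helper rather than part of any tool that produced this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate a DNA string codon by codon, ignoring a trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 36 nucleotides of the coding sequence shown above.
CDS_START = "ATGATGAAGAAGCTAAGATGGACTATGGAGGGCCAA"

protein_start = translate(CDS_START)
print(protein_start)  # MMKKLRWTMEGQ
```

The translated prefix, read from its second residue onward, matches the query sequence in the alignments (`MKKLRWTMEGQ...`), which begin at query position 2.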
BLAST of CmoCh17G003920 vs. Swiss-Prot
Match: TGD4_ARATH (Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic OS=Arabidopsis thaliana GN=TGD4 PE=1 SV=1)

HSP 1 Score: 442.2 bits (1136), Expect = 7.1e-123
Identity = 239/482 (49.59%), Positives = 320/482 (66.39%), Query Frame = 1

Query: 2   MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQF 61
           M ++RW  EG   WDLD+STP TL+G+A  VP D   LPLGLSRG RLSR KQ++F  +F
Sbjct: 1   MNRMRWVGEGD-IWDLDMSTPVTLEGTARAVPDDP--LPLGLSRGTRLSRPKQVEFFHRF 60

Query: 62  MAAPFVPSFTPSH---------GFSLQRVFSIPFSDSGSATLLGQFNVQKFVSSLKKS-G 121
           MA+P +PSF+P           GFSLQRV ++PFS++   +LLGQF+VQ+FV+ + K+  
Sbjct: 61  MASPLIPSFSPIRPNTGDGGGGGFSLQRVLTLPFSNNWLVSLLGQFDVQRFVTEIDKTKA 120

Query: 122 FGEMGQS-ISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGY-GDSDIL-RTK 181
           FG    S ++S L  IG+HL+D+SLYA G  S+ LL+PDD LL+S+D Y GD D   R K
Sbjct: 121 FGRGSSSTVASRLNTIGKHLKDKSLYALGFCSEFLLSPDDTLLLSYDAYKGDLDKNPRAK 180

Query: 182 AVLHHKFLHHDLTMEALSPGLFVDKSGKYWDVPSSLVIDLGSADTDSGLSYHLSMHHNAG 241
           A+ +H+F  H+LT EA+ PGLFVDK G+YWDVP S+ IDL S   +SG SYHL +HHN+G
Sbjct: 181 AIFNHEFPLHNLTAEAVWPGLFVDKHGEYWDVPLSMAIDLASLPAESGPSYHLCLHHNSG 240

Query: 242 SPSQSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLANPHVSL 301
           SP +  S+   + P  LLPGLS K+A + + N+++WR    KL+  +PYD+FL++PHV++
Sbjct: 241 SPKKLHSDTMEVPPPSLLPGLSLKSAVSYRTNMDLWRGTTPKLETCKPYDVFLSSPHVAV 300

Query: 302 SGIIGAVATSYFGDISAGSAAEGSLQEFRGLYMQTSRIRSTVFADVFASISFSAQYGMFQ 361
           SGIIG+V T+ FG+ S  S  E   +   G  +    + S   AD     S +AQYG FQ
Sbjct: 301 SGIIGSVMTAAFGENSIRSKFENDSEGVGGFSLHFPSVNSGFMADALGRASLTAQYGNFQ 360

Query: 362 RNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFSFQQQIAG 421
           + F DLTRF  R DF  G +FL+GA  + +DL NS+ P  E+ +   P    S QQQI G
Sbjct: 361 KFFFDLTRFHARLDFPHGLRFLTGATSVAQDLLNSRQPSLEAFQKICPEVLVSLQQQIVG 420

Query: 422 PVSFRADSGVAIDLSKAGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRF 471
           P SF+ +SG+ IDL + G   + V++  FA+EYAL  L SAKA+  YSPKQ EFMVELRF
Sbjct: 421 PFSFKVESGIEIDL-RNGANPVTVDKTVFAIEYALQVLLSAKAVVSYSPKQNEFMVELRF 478
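The percentages on each HSP statistics line are the matched counts divided by the alignment length. The sketch below recomputes them for the Swiss-Prot hit above; the function name `parse_hsp_stats` is hypothetical, and the pattern accepts both the "Positives" and "Postives" spellings because the latter appears in this output.

```python
import re

# Statistics line for the TGD4_ARATH hit, as printed in the record.
STATS = "Identity = 239/482 (49.59%), Postives = 320/482 (66.39%), Query Frame = 1"

def parse_hsp_stats(line):
    """Return {'identity': (matched, total, percent), 'positives': (...)}."""
    out = {}
    pattern = r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)"
    for label, matched, total in re.findall(pattern, line):
        key = "identity" if label == "Identity" else "positives"
        matched, total = int(matched), int(total)
        out[key] = (matched, total, round(100 * matched / total, 2))
    return out

stats = parse_hsp_stats(STATS)
print(stats["identity"])   # (239, 482, 49.59)
print(stats["positives"])  # (320, 482, 66.39)
```

The recomputed values agree with the percentages reported in parentheses on the statistics line.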

BLAST of CmoCh17G003920 vs. TrEMBL
Match: A0A0A0K824_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G001740 PE=4 SV=1)

HSP 1 Score: 769.6 bits (1986), Expect = 2.2e-219
Identity = 386/469 (82.30%), Positives = 423/469 (90.19%), Query Frame = 1

Query: 2   MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQF 61
           MKKLRW M+GQ FWDLDVST RTLDGSASPVP+ L LLPLGLSRGVRLSRAKQIDFMQ+F
Sbjct: 1   MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRF 60

Query: 62  MAAPFVPSFTPSHGFSLQRVFSIPFSDSGSATLLGQFNVQKFVSSLKKSGFGEMGQSISS 121
           MAAPFVPS++PSHGFSLQRVFS+PFSDSGS TLLGQFN+QKF+SSL K+G GEM QS SS
Sbjct: 61  MAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQKFMSSLMKTGSGEMCQSYSS 120

Query: 122 LLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLT 181
           LLQ IGRHL  RSLYA GIS+DILL PDD+L+ISFDGYGDSDI+RTKAV H KFLHHDLT
Sbjct: 121 LLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLT 180

Query: 182 MEALSPGLFVDKSGKYWDVPSSLVIDLGSADTDSGLSYHLSMHHNAGSPSQSGSEQTCMA 241
           +EALSPGLF++K G+YWDVPSSLV+DLGS  +DSGLSYHLSMH NAG PSQ GSE T  A
Sbjct: 181 VEALSPGLFMEKCGRYWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSA 240

Query: 242 PFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLANPHVSLSGIIGAVATSYFG 301
           PFCLLPGLSAKAAFA KKN EIWRSNAKKLK VQPYDIFL+ PHVSLS IIGAVATSYFG
Sbjct: 241 PFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG 300

Query: 302 DISAGSAAEGSLQEFRGLYMQTSRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRF 361
           D  A SAA+ SL++F+G YM++SRIRSTVFAD+F SISFSAQYGMFQ+ +LDLTRFS   
Sbjct: 301 DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACM 360

Query: 362 DFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFSFQQQIAGPVSFRADSGVAID 421
           DFHSGSKFLSG+MLLI+DLSNS+HP+TESVKATLPNARFS QQQIAGPVSFRAD+GVAID
Sbjct: 361 DFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFSIQQQIAGPVSFRADTGVAID 420

Query: 422 LSKAGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE 471
           L+KAGW  L+VEEPTFALEYAL+ LGSAKAIAWYSPK REFMVELRFYE
Sbjct: 421 LNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYE 469

BLAST of CmoCh17G003920 vs. TrEMBL
Match: W9SJS3_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_014683 PE=4 SV=1)

HSP 1 Score: 543.9 bits (1400), Expect = 1.9e-151
Identity = 274/469 (58.42%), Positives = 339/469 (72.28%), Query Frame = 1

Query: 2   MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQF 61
           MKKLRW M+G+ FW++D STPRT+DG A PVP D   LPLGLSRG +LSR KQIDF Q+F
Sbjct: 1   MKKLRWVMDGEGFWEVDASTPRTVDGLARPVPGDT--LPLGLSRGPKLSRPKQIDFFQRF 60

Query: 62  MAAPFVPSFTPSHGFSLQRVFSIPFSDSGSATLLGQFNVQKFVSSLKKSGFGEMGQSISS 121
           MAAPFVPS+   HGF+LQRV +IPF+ +    ++GQFNVQKFVSS+K  G G+   S SS
Sbjct: 61  MAAPFVPSYAGDHGFALQRVLTIPFAHNWFTAVVGQFNVQKFVSSMK--GSGDTQHSSSS 120

Query: 122 LLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLT 181
            LQ +  HL+D+SLYA G  S++LLTPDD LL+S D Y + D+ R KAV HHKF +HDL 
Sbjct: 121 WLQKVRSHLKDKSLYAVGFCSELLLTPDDTLLVSLDRYAEKDLSRKKAVFHHKFPYHDLM 180

Query: 182 MEALSPGLFVDKSGKYWDVPSSLVIDLGSADTDSGLSYHLSMHHNAGSPSQSGSEQTCMA 241
           +EA+ PGLF+DK+  YWDVP  + +DL S  + SG SYHLS  HN+GSP +  S+     
Sbjct: 181 VEAVWPGLFIDKANNYWDVPFLVAVDLASVASPSGSSYHLSALHNSGSPERFHSDGNDRV 240

Query: 242 PFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLANPHVSLSGIIGAVATSYFG 301
           P CL PGLS + A A KK++E WRSNA KLK VQP+D+FL+NP VS SGIIGA AT++FG
Sbjct: 241 PTCLRPGLSLRGALAFKKDIEFWRSNAPKLKMVQPFDMFLSNPQVSASGIIGAAATAFFG 300

Query: 302 DISAGSAAEGSLQEFRGLYMQTSRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRF 361
           D +     E + Q FRG       ++S    D+F S+SF+AQ+G FQR FLDLTRF  R 
Sbjct: 301 DNTERPRTEEAFQGFRGFNFDYGAVKSAFVGDMFGSVSFTAQHGNFQRLFLDLTRFQARL 360

Query: 362 DFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFSFQQQIAGPVSFRADSGVAID 421
           DF SGSKFLSGA  L +D  NSQ P  ++++A  PN   SFQQQIAGP SFR +SG+ ID
Sbjct: 361 DFPSGSKFLSGAARLAQDFVNSQQPSFDALQAVCPNIHLSFQQQIAGPFSFRVESGITID 420

Query: 422 LSKAGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE 471
           L    W ++ +EEP FA+E+AL  LGSAKAIAWYSPKQ+EFMVELRF+E
Sbjct: 421 LRNRDW-NICIEEPIFAIEHALQVLGSAKAIAWYSPKQQEFMVELRFFE 464

BLAST of CmoCh17G003920 vs. TrEMBL
Match: A5BHS8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_020812 PE=4 SV=1)

HSP 1 Score: 537.7 bits (1384), Expect = 1.4e-149
Identity = 281/470 (59.79%), Positives = 339/470 (72.13%), Query Frame = 1

Query: 2    MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQF 61
            MKKLRW M+G  FW+LD+ST  TLDG A  VP D   LPLGLSRG RLSR  QIDF Q+F
Sbjct: 586  MKKLRWAMDG-GFWELDISTATTLDGVARAVPDDP--LPLGLSRGTRLSRPMQIDFFQRF 645

Query: 62   MAAPFVPSFTPS-HGFSLQRVFSIPFSDSGSATLLGQFNVQKFVSSLKKSGFGEMGQSIS 121
            M+ PFVPS + S HGFSLQR F+ PF+++  A+LLGQFN QKFVSS+K+     +  S S
Sbjct: 646  MSMPFVPSSSISTHGFSLQRGFTFPFTENWFASLLGQFNFQKFVSSVKEGRL--LQPSES 705

Query: 122  SLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDL 181
            S LQGIGR   D+SLYA G+SS++L+TPDD LL+S + YGD  + R KAV  HKF +H+L
Sbjct: 706  SWLQGIGRRFSDKSLYALGLSSELLITPDDTLLVSLEAYGDKKVPRKKAVFLHKFPNHNL 765

Query: 182  TMEALSPGLFVDKSGKYWDVPSSLVIDLGSADTDSGLSYHLSMHHNAGSPSQSGSEQTCM 241
             +EA+ PGLFVDK G YWDVP S+ IDL S  +DSG SYHLS+HHN G+P Q    QT  
Sbjct: 766  MVEAVWPGLFVDKFGTYWDVPLSMAIDLASVASDSGASYHLSVHHNTGTPKQFDGNQTHE 825

Query: 242  APFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLANPHVSLSGIIGAVATSYF 301
             P  LLPGL AK AFALKKN+++WRS A+KLK VQP+DIFL+NPH+S SGIIGA  T+  
Sbjct: 826  VPATLLPGLCAKGAFALKKNIDLWRSKAQKLKMVQPFDIFLSNPHISFSGIIGAAGTACL 885

Query: 302  GDISAGSAAEGSLQEFRGLYMQTSRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGR 361
            GD S     E     F+GL +   R++S + AD+FAS++F+AQ+G FQR FLDLTRF  R
Sbjct: 886  GDNSVRVQVEDESHGFKGLKLHLPRVKSALLADIFASVAFTAQHGNFQRLFLDLTRFYAR 945

Query: 362  FDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFSFQQQIAGPVSFRADSGVAI 421
             DF SGSKFL+G   L +DL NSQ P  E+ +A  P A  S QQQI GP SFR DSGVA+
Sbjct: 946  LDFPSGSKFLAGTTRLTQDLYNSQQPSLEAFQAICPTATLSLQQQIVGPFSFRIDSGVAV 1005

Query: 422  DLSKAGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE 471
            +L    W  + V+EP FA+EYAL  LGSAKAIAWYSPK  EFMVELRF+E
Sbjct: 1006 NLKNREW-HIDVDEPVFAIEYALQVLGSAKAIAWYSPKHEEFMVELRFFE 1049

BLAST of CmoCh17G003920 vs. TrEMBL
Match: D7T697_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0020g00870 PE=4 SV=1)

HSP 1 Score: 535.0 bits (1377), Expect = 9.0e-149
Identity = 280/470 (59.57%), Positives = 338/470 (71.91%), Query Frame = 1

Query: 2   MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQF 61
           MKKLRW M+G  FW+LD+ST  TLDG A  VP D   LPLGLSRG RLSR  QIDF Q+F
Sbjct: 1   MKKLRWAMDG-GFWELDMSTATTLDGVARAVPDDP--LPLGLSRGTRLSRPMQIDFFQRF 60

Query: 62  MAAPFVPSFTPS-HGFSLQRVFSIPFSDSGSATLLGQFNVQKFVSSLKKSGFGEMGQSIS 121
           M+ PFVPS + S HGFSLQR F+ PF+++  A+LLGQFN QKFVSS+K+     +  S S
Sbjct: 61  MSMPFVPSSSISTHGFSLQRGFTFPFTENWFASLLGQFNFQKFVSSVKEGRL--LQPSES 120

Query: 122 SLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDL 181
           S LQGIGR   D+SLYA G+SS++L+TPDD LL+S + YGD  + R KAV  HKF +H+L
Sbjct: 121 SWLQGIGRRFSDKSLYALGLSSELLITPDDTLLVSLEAYGDKKVPRKKAVFLHKFPNHNL 180

Query: 182 TMEALSPGLFVDKSGKYWDVPSSLVIDLGSADTDSGLSYHLSMHHNAGSPSQSGSEQTCM 241
            +EA+ PGLFVDK G YWDVP S+ IDL S  +DSG SYHLS+HHN G+P Q    QT  
Sbjct: 181 MVEAVWPGLFVDKFGTYWDVPLSMAIDLASVASDSGASYHLSVHHNTGTPKQFDGNQTHE 240

Query: 242 APFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLANPHVSLSGIIGAVATSYF 301
            P  LLPGL AK AFALKKN+++WRS A+KLK VQP+DIFL+NPH+S SGIIGA  T+  
Sbjct: 241 VPATLLPGLCAKGAFALKKNIDLWRSKAQKLKMVQPFDIFLSNPHISFSGIIGAAGTACL 300

Query: 302 GDISAGSAAEGSLQEFRGLYMQTSRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGR 361
           GD S     E     F+G  +   R++S + AD+FAS++F+AQ+G FQR FLDLTRF  R
Sbjct: 301 GDNSVRVQVEDESHGFKGFKLHLPRVKSALVADIFASVAFTAQHGNFQRLFLDLTRFYAR 360

Query: 362 FDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFSFQQQIAGPVSFRADSGVAI 421
            DF SGSKFL+G   L +DL NSQ P  E+ +A  P A  S QQQI GP SFR DSGVA+
Sbjct: 361 LDFPSGSKFLAGTTRLTQDLYNSQQPSLEAFQAICPTATLSLQQQIVGPFSFRIDSGVAV 420

Query: 422 DLSKAGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE 471
           +L    W  + V+EP FA+EYAL  LGSAKAIAWYSPK  EFMVELRF+E
Sbjct: 421 NLKNREW-HIDVDEPVFAIEYALQVLGSAKAIAWYSPKHEEFMVELRFFE 464

BLAST of CmoCh17G003920 vs. TrEMBL
Match: A0A067K6W8_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_19095 PE=4 SV=1)

HSP 1 Score: 534.6 bits (1376), Expect = 1.2e-148
Identity = 282/474 (59.49%), Positives = 343/474 (72.36%), Query Frame = 1

Query: 2   MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQF 61
           MKKLRW M+G  FW+LDVSTP TL+G A  VP D   LPLGL RG +LSR KQI F Q+F
Sbjct: 1   MKKLRWAMDG-GFWELDVSTPLTLEGEARAVPGDP--LPLGLCRGTKLSRTKQIHFFQRF 60

Query: 62  MAAPFVPSFTPS---HGFSLQRVFSIP-FSDSGSATLLGQFNVQKFVSSLKKSGFGEMGQ 121
           MA+PFVPS++PS   HGFS Q V  IP FS +   TLLGQFN+QKF+SSL +S    +  
Sbjct: 61  MASPFVPSYSPSTRGHGFSFQSVLPIPTFSQNWFGTLLGQFNLQKFLSSLNESS--NLQS 120

Query: 122 SISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLH 181
           S+SS LQ I R LRD+SLYA G  S++ +TPDD LL SFD YGD+   R KAVLHHKF +
Sbjct: 121 SLSSRLQTICRRLRDKSLYAVGFCSELYVTPDDTLLFSFDTYGDNRSTRKKAVLHHKFPN 180

Query: 182 HDLTMEALSPGLFVDKSGKYWDVPSSLVIDLGSA-DTDSGLSYHLSMHHNAGSPSQSGSE 241
           H+LT+EA+SPGLF D SG YWDVP S+ IDL S   +DSG S+ L +HHN+GSP    S+
Sbjct: 181 HNLTLEAVSPGLFADNSGNYWDVPFSMAIDLASVTSSDSGPSFRLCVHHNSGSPKPFQSD 240

Query: 242 QTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLANPHVSLSGIIGAVA 301
           QT   P  LLPG S K+AF+ KK+++IWRSNA+KLK VQPYD+FL+NPH+S SGIIGA  
Sbjct: 241 QTSAVPAALLPGSSVKSAFSFKKSIDIWRSNAQKLKMVQPYDLFLSNPHISASGIIGATV 300

Query: 302 TSYFGDISAGSAAEGSLQEFRGLYMQTSRIRSTVFADVFASISFSAQYGMFQRNFLDLTR 361
           T+Y GD S  S       +F+GL ++   ++S +  D+F+S++F+AQYG FQR FLDLTR
Sbjct: 301 TTYIGDNSVRSQEVDDSLDFKGLCIRAPEVKSALLGDIFSSVAFTAQYGNFQRPFLDLTR 360

Query: 362 FSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFSFQQQIAGPVSFRADS 421
           F    DF SGSKFLSGA  + +D  NSQ P   +VKA  PNA  SFQQQIAGP S R DS
Sbjct: 361 FHVCLDFPSGSKFLSGAAKVAQDFFNSQQPSMGTVKAICPNALVSFQQQIAGPFSLRVDS 420

Query: 422 GVAIDLSKAGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE 471
           GV ID  K  W  ++V +P FA+EYAL  LGSAKAIAW+SPKQ+EFMVELRF+E
Sbjct: 421 GVVIDWKKTDW-HMRVHDPVFAVEYALQVLGSAKAIAWFSPKQKEFMVELRFFE 468

BLAST of CmoCh17G003920 vs. TAIR10
Match: AT3G06960.1 (AT3G06960.1 pigment defective 320)

HSP 1 Score: 442.2 bits (1136), Expect = 4.0e-124
Identity = 239/482 (49.59%), Positives = 320/482 (66.39%), Query Frame = 1

Query: 2   MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQF 61
           M ++RW  EG   WDLD+STP TL+G+A  VP D   LPLGLSRG RLSR KQ++F  +F
Sbjct: 1   MNRMRWVGEGD-IWDLDMSTPVTLEGTARAVPDDP--LPLGLSRGTRLSRPKQVEFFHRF 60

Query: 62  MAAPFVPSFTPSH---------GFSLQRVFSIPFSDSGSATLLGQFNVQKFVSSLKKS-G 121
           MA+P +PSF+P           GFSLQRV ++PFS++   +LLGQF+VQ+FV+ + K+  
Sbjct: 61  MASPLIPSFSPIRPNTGDGGGGGFSLQRVLTLPFSNNWLVSLLGQFDVQRFVTEIDKTKA 120

Query: 122 FGEMGQS-ISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGY-GDSDIL-RTK 181
           FG    S ++S L  IG+HL+D+SLYA G  S+ LL+PDD LL+S+D Y GD D   R K
Sbjct: 121 FGRGSSSTVASRLNTIGKHLKDKSLYALGFCSEFLLSPDDTLLLSYDAYKGDLDKNPRAK 180

Query: 182 AVLHHKFLHHDLTMEALSPGLFVDKSGKYWDVPSSLVIDLGSADTDSGLSYHLSMHHNAG 241
           A+ +H+F  H+LT EA+ PGLFVDK G+YWDVP S+ IDL S   +SG SYHL +HHN+G
Sbjct: 181 AIFNHEFPLHNLTAEAVWPGLFVDKHGEYWDVPLSMAIDLASLPAESGPSYHLCLHHNSG 240

Query: 242 SPSQSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLANPHVSL 301
           SP +  S+   + P  LLPGLS K+A + + N+++WR    KL+  +PYD+FL++PHV++
Sbjct: 241 SPKKLHSDTMEVPPPSLLPGLSLKSAVSYRTNMDLWRGTTPKLETCKPYDVFLSSPHVAV 300

Query: 302 SGIIGAVATSYFGDISAGSAAEGSLQEFRGLYMQTSRIRSTVFADVFASISFSAQYGMFQ 361
           SGIIG+V T+ FG+ S  S  E   +   G  +    + S   AD     S +AQYG FQ
Sbjct: 301 SGIIGSVMTAAFGENSIRSKFENDSEGVGGFSLHFPSVNSGFMADALGRASLTAQYGNFQ 360

Query: 362 RNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFSFQQQIAG 421
           + F DLTRF  R DF  G +FL+GA  + +DL NS+ P  E+ +   P    S QQQI G
Sbjct: 361 KFFFDLTRFHARLDFPHGLRFLTGATSVAQDLLNSRQPSLEAFQKICPEVLVSLQQQIVG 420

Query: 422 PVSFRADSGVAIDLSKAGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRF 471
           P SF+ +SG+ IDL + G   + V++  FA+EYAL  L SAKA+  YSPKQ EFMVELRF
Sbjct: 421 PFSFKVESGIEIDL-RNGANPVTVDKTVFAIEYALQVLLSAKAVVSYSPKQNEFMVELRF 478

BLAST of CmoCh17G003920 vs. TAIR10
Match: AT2G44640.1 (AT2G44640.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 244.2 bits (622), Expect = 1.6e-64
Identity = 155/472 (32.84%), Positives = 248/472 (52.54%), Query Frame = 1

Query: 14  FWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSFTPS 73
           FWD +VS+P+TL+G+A  VP +    PL  +R  R  R +Q+  +++      +PS  P+
Sbjct: 12  FWDQNVSSPQTLEGTARSVPGEP--FPLDGARASRSHRIQQLSLLREGFPLGIIPSLAPA 71

Query: 74  H-----GFSLQRVFSIPFSDSGSATLLGQFNVQKFVSSLKKSGFGEMGQSISSLLQGIGR 133
                  FSL  +   P S++    L+GQF  +K  + +K +      +    +++   +
Sbjct: 72  SDKRLGSFSLNSLLLSPSSNNWWLGLVGQFKPKKLFADIK-ADISNAEEWDLQVVKDTAK 131

Query: 134 HLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPG 193
           H+ D+SLY+ G+ + I L    +LL+S +  GD + LR K +L H    HDLT+EA  P 
Sbjct: 132 HIVDKSLYSIGLWTQIALGTSSSLLLSTERLGDKNGLRNKLMLVHPLEKHDLTVEAAWPD 191

Query: 194 LFVDKSGKYWDVPSSLVIDLGSADTDSGLSYHLSMHHNAGSP---SQSGSEQTCMAPFCL 253
           LF+D  G++WDVP SL +D+ S   +SG+ Y   +H + G+P   + +G E    AP  L
Sbjct: 192 LFLDNKGRFWDVPESLNVDVSSLVPESGVRYRFGLHKSRGNPQPVNAAGVESGSDAPTSL 251

Query: 254 LPGLSAKAAFALKKNLEIWRSNAKKLKRVQ-------PYDIFLANPHVSLSGIIGAVATS 313
           +PGL AKAA + K N ++WR   K+    +       PYD+ L  PH ++SGI+G+   +
Sbjct: 252 MPGLCAKAAVSYKVNRDLWRPQEKEGNTEEEDKPVFLPYDLRLKEPHAAISGIVGSSLAA 311

Query: 314 YFGDISAGSAAEGSLQEFRGLYMQTSRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFS 373
           +                 RG+ +   + RS + ADVF S  ++ Q G F + + DLTR  
Sbjct: 312 WITG--------------RGMLVNGKK-RSPISADVFGSACYTFQKGRFSKLYGDLTRVD 371

Query: 374 GRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFSFQQQIAGPVSFRADSGV 433
            R D       L  A  L + L ++    ++    + P     FQQQ+AGP+ F+ DS  
Sbjct: 372 ARVD-------LPSAFALAKKLFHASSNNSDDTLWS-PRLNLIFQQQVAGPIVFKVDSQF 431

Query: 434 AIDLSKAGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE 471
            +       G+ ++E+  ++L Y+L  L S K +AWYSPK++E M+ELR +E
Sbjct: 432 QV-------GAARMEDVIYSLNYSLRLLESGKIVAWYSPKRKEGMIELRVFE 450

BLAST of CmoCh17G003920 vs. NCBI nr
Match: gi|778709018|ref|XP_004138517.2| (PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucumis sativus])

HSP 1 Score: 769.6 bits (1986), Expect = 3.1e-219
Identity = 386/469 (82.30%), Positives = 423/469 (90.19%), Query Frame = 1

Query: 2   MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQF 61
           MKKLRW M+GQ FWDLDVST RTLDGSASPVP+ L LLPLGLSRGVRLSRAKQIDFMQ+F
Sbjct: 1   MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRF 60

Query: 62  MAAPFVPSFTPSHGFSLQRVFSIPFSDSGSATLLGQFNVQKFVSSLKKSGFGEMGQSISS 121
           MAAPFVPS++PSHGFSLQRVFS+PFSDSGS TLLGQFN+QKF+SSL K+G GEM QS SS
Sbjct: 61  MAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQKFMSSLMKTGSGEMCQSYSS 120

Query: 122 LLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLT 181
           LLQ IGRHL  RSLYA GIS+DILL PDD+L+ISFDGYGDSDI+RTKAV H KFLHHDLT
Sbjct: 121 LLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLT 180

Query: 182 MEALSPGLFVDKSGKYWDVPSSLVIDLGSADTDSGLSYHLSMHHNAGSPSQSGSEQTCMA 241
           +EALSPGLF++K G+YWDVPSSLV+DLGS  +DSGLSYHLSMH NAG PSQ GSE T  A
Sbjct: 181 VEALSPGLFMEKCGRYWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSA 240

Query: 242 PFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLANPHVSLSGIIGAVATSYFG 301
           PFCLLPGLSAKAAFA KKN EIWRSNAKKLK VQPYDIFL+ PHVSLS IIGAVATSYFG
Sbjct: 241 PFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG 300

Query: 302 DISAGSAAEGSLQEFRGLYMQTSRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRF 361
           D  A SAA+ SL++F+G YM++SRIRSTVFAD+F SISFSAQYGMFQ+ +LDLTRFS   
Sbjct: 301 DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACM 360

Query: 362 DFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFSFQQQIAGPVSFRADSGVAID 421
           DFHSGSKFLSG+MLLI+DLSNS+HP+TESVKATLPNARFS QQQIAGPVSFRAD+GVAID
Sbjct: 361 DFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFSIQQQIAGPVSFRADTGVAID 420

Query: 422 LSKAGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE 471
           L+KAGW  L+VEEPTFALEYAL+ LGSAKAIAWYSPK REFMVELRFYE
Sbjct: 421 LNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYE 469

BLAST of CmoCh17G003920 vs. NCBI nr
Match: gi|659116830|ref|XP_008458281.1| (PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucumis melo])

HSP 1 Score: 764.2 bits (1972), Expect = 1.3e-217
Identity = 385/469 (82.09%), Positives = 416/469 (88.70%), Query Frame = 1

Query: 2   MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQF 61
           MKKLRW M+G  FWDLDVST RTLDGSASPVP+   LLPLGLSRGVRLSRAKQIDFMQ F
Sbjct: 1   MKKLRWAMDG--FWDLDVSTSRTLDGSASPVPSPFHLLPLGLSRGVRLSRAKQIDFMQSF 60

Query: 62  MAAPFVPSFTPSHGFSLQRVFSIPFSDSGSATLLGQFNVQKFVSSLKKSGFGEMGQSISS 121
           M APFVPS++PSHGFSLQRVFSIPFSDSGS TLLGQFN+QKF+SSL K+G GEMGQS SS
Sbjct: 61  MVAPFVPSYSPSHGFSLQRVFSIPFSDSGSITLLGQFNLQKFMSSLMKTGSGEMGQSFSS 120

Query: 122 LLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLT 181
            +Q IGRHL  RSLYA GIS+DILL PDD+L+ISFDGYGDSDI+RTKAV H KFLHHDLT
Sbjct: 121 FIQCIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLT 180

Query: 182 MEALSPGLFVDKSGKYWDVPSSLVIDLGSADTDSGLSYHLSMHHNAGSPSQSGSEQTCMA 241
           MEALSPGLF+DKSG+YWDVPSSLV+DLGSA +DSGLSYHLSMH N G PS  GSE T  A
Sbjct: 181 MEALSPGLFMDKSGRYWDVPSSLVVDLGSAASDSGLSYHLSMHQNTGFPSPLGSEPTHSA 240

Query: 242 PFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLANPHVSLSGIIGAVATSYFG 301
           PFCL PGLSAKAAFA KKN EIWRSNAKKLK VQPYDIFL+ PHVSLS IIGAVATSYFG
Sbjct: 241 PFCLFPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG 300

Query: 302 DISAGSAAEGSLQEFRGLYMQTSRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRF 361
           D    SAA+GSL EF+G YMQTSRIRST+FAD+F SISFSAQYGMFQ+ +LDLTRFS   
Sbjct: 301 DDLVRSAAQGSLAEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQKKYLDLTRFSACM 360

Query: 362 DFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFSFQQQIAGPVSFRADSGVAID 421
           DFHSGSKFLSG+MLLI+DLSNS+HP+TE+VKATLPNARFS QQQIAGPVSFRADSGVAID
Sbjct: 361 DFHSGSKFLSGSMLLIDDLSNSRHPKTEAVKATLPNARFSIQQQIAGPVSFRADSGVAID 420

Query: 422 LSKAGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE 471
           L+KAGW  L+V+EPTFALEYAL  LGSAKAIAWYSPK REFMVELRFYE
Sbjct: 421 LNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYE 467

BLAST of CmoCh17G003920 vs. NCBI nr
Match: gi|645233790|ref|XP_008223512.1| (PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Prunus mume])

HSP 1 Score: 557.0 bits (1434), Expect = 3.2e-155
Identity = 285/470 (60.64%), Positives = 346/470 (73.62%), Query Frame = 1

Query: 2   MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQF 61
           MKKLRW M+G SFWDLD+STPRT++G   PVP D   LPLGL+RG RLSR KQIDFMQ+F
Sbjct: 3   MKKLRWAMDG-SFWDLDMSTPRTIEGLGRPVPGDP--LPLGLTRGARLSRPKQIDFMQRF 62

Query: 62  MAAPFVPSFTPSHGFSLQRVFSIPFSDSGSATLLGQFNVQKFVSSLKKSGFGEMGQSISS 121
           M APFVPS++ ++GF+LQRV +IP SD+   TLLGQFN+Q+FVSS+KKSG      S SS
Sbjct: 63  MTAPFVPSYSAANGFNLQRVLTIPISDNWFGTLLGQFNLQRFVSSVKKSGKTTPPDSASS 122

Query: 122 LLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLT 181
            +Q IG HLRD+SLYA    SD LLTPDD L+ S + YGD    R KA+  HKF HH+LT
Sbjct: 123 WMQSIGTHLRDKSLYALSFCSDFLLTPDDTLVFSVEAYGDDKKARKKALFQHKFPHHNLT 182

Query: 182 MEALSPGLFVDKSGKYWDVPSSLVIDLGSADTDSGLSYHLSMHHNAGSPSQSGSEQTCMA 241
           +EA+ PGLFVDK G YWDVP S+ +DL S  +DSG SYH+  HHN+G P +  S Q+   
Sbjct: 183 VEAVWPGLFVDKPGNYWDVPFSMSLDLASVASDSGASYHICAHHNSGEPERLDSGQSDGV 242

Query: 242 PFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLANPHVSLSGIIGAVATSYFG 301
           P  LLPGLS  +AF+ KKN+E+WRSNA+KL+ VQPYDIFL+NPHVS SGIIGA  T+ FG
Sbjct: 243 PASLLPGLSVTSAFSFKKNIELWRSNAQKLRMVQPYDIFLSNPHVSASGIIGAAMTASFG 302

Query: 302 DISAGS-AAEGSLQEFRGLYMQTSRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGR 361
           D S  S  A+   + FRG  ++   ++S   AD+FAS SF+AQ+G FQR FLDLTRF  R
Sbjct: 303 DSSVRSQIADDDPEGFRGFSIRAPEVKSAFLADIFASASFTAQHGNFQRLFLDLTRFHAR 362

Query: 362 FDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFSFQQQIAGPVSFRADSGVAI 421
            DF SGSKFLSGA  L +D  NSQ P  E+++   PNA  S QQQIAGP SFR DSGVA+
Sbjct: 363 LDFPSGSKFLSGATHLAQDFFNSQQPNLEAIRDICPNATLSLQQQIAGPFSFRVDSGVAV 422

Query: 422 DLSKAGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE 471
           +L    W +++V+EP FALEYAL  LGSAKA+AWYSPK +E M+ELRFYE
Sbjct: 423 ELKNQDW-NIRVDEPVFALEYALQVLGSAKAVAWYSPKHQECMIELRFYE 468

BLAST of CmoCh17G003920 vs. NCBI nr
Match: gi|703161607|ref|XP_010112836.1| (hypothetical protein L484_014683 [Morus notabilis])

HSP 1 Score: 543.9 bits (1400), Expect = 2.8e-151
Identity = 274/469 (58.42%), Positives = 339/469 (72.28%), Query Frame = 1

Query: 2   MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQF 61
           MKKLRW M+G+ FW++D STPRT+DG A PVP D   LPLGLSRG +LSR KQIDF Q+F
Sbjct: 1   MKKLRWVMDGEGFWEVDASTPRTVDGLARPVPGDT--LPLGLSRGPKLSRPKQIDFFQRF 60

Query: 62  MAAPFVPSFTPSHGFSLQRVFSIPFSDSGSATLLGQFNVQKFVSSLKKSGFGEMGQSISS 121
           MAAPFVPS+   HGF+LQRV +IPF+ +    ++GQFNVQKFVSS+K  G G+   S SS
Sbjct: 61  MAAPFVPSYAGDHGFALQRVLTIPFAHNWFTAVVGQFNVQKFVSSMK--GSGDTQHSSSS 120

Query: 122 LLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLT 181
            LQ +  HL+D+SLYA G  S++LLTPDD LL+S D Y + D+ R KAV HHKF +HDL 
Sbjct: 121 WLQKVRSHLKDKSLYAVGFCSELLLTPDDTLLVSLDRYAEKDLSRKKAVFHHKFPYHDLM 180

Query: 182 MEALSPGLFVDKSGKYWDVPSSLVIDLGSADTDSGLSYHLSMHHNAGSPSQSGSEQTCMA 241
           +EA+ PGLF+DK+  YWDVP  + +DL S  + SG SYHLS  HN+GSP +  S+     
Sbjct: 181 VEAVWPGLFIDKANNYWDVPFLVAVDLASVASPSGSSYHLSALHNSGSPERFHSDGNDRV 240

Query: 242 PFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLANPHVSLSGIIGAVATSYFG 301
           P CL PGLS + A A KK++E WRSNA KLK VQP+D+FL+NP VS SGIIGA AT++FG
Sbjct: 241 PTCLRPGLSLRGALAFKKDIEFWRSNAPKLKMVQPFDMFLSNPQVSASGIIGAAATAFFG 300

Query: 302 DISAGSAAEGSLQEFRGLYMQTSRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRF 361
           D +     E + Q FRG       ++S    D+F S+SF+AQ+G FQR FLDLTRF  R 
Sbjct: 301 DNTERPRTEEAFQGFRGFNFDYGAVKSAFVGDMFGSVSFTAQHGNFQRLFLDLTRFQARL 360

Query: 362 DFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFSFQQQIAGPVSFRADSGVAID 421
           DF SGSKFLSGA  L +D  NSQ P  ++++A  PN   SFQQQIAGP SFR +SG+ ID
Sbjct: 361 DFPSGSKFLSGAARLAQDFVNSQQPSFDALQAVCPNIHLSFQQQIAGPFSFRVESGITID 420

Query: 422 LSKAGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE 471
           L    W ++ +EEP FA+E+AL  LGSAKAIAWYSPKQ+EFMVELRF+E
Sbjct: 421 LRNRDW-NICIEEPIFAIEHALQVLGSAKAIAWYSPKQQEFMVELRFFE 464

BLAST of CmoCh17G003920 vs. NCBI nr
Match: gi|743910942|ref|XP_010999321.1| (PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic-like [Populus euphratica])

HSP 1 Score: 538.1 bits (1385), Expect = 1.5e-149
Identity = 277/472 (58.69%), Positives = 342/472 (72.46%), Query Frame = 1

Query: 2   MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQF 61
           M KLRW M+G  FWDLD STPRTL+G    VP +   LPLG+SRG RLSR KQIDF Q+F
Sbjct: 1   MNKLRWAMDG-GFWDLDRSTPRTLEGEGRAVPGEP--LPLGVSRGTRLSRPKQIDFFQRF 60

Query: 62  MAAPFVPSFTPS-HGFSLQRVFSIPFSDSGSATLLGQFNVQKFVSSLKKSGFGEMGQSIS 121
           M APF+PS++ S HGFSLQRV ++PF+ +  ATLL QFN+QKF SS KK+G  +     S
Sbjct: 61  MFAPFIPSYSASSHGFSLQRVLALPFTQNWFATLLAQFNLQKFASSFKKNGALQ-----S 120

Query: 122 SLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDIL--RTKAVLHHKFLHH 181
           S L+ I +HL D+SLYA G  S++LL+P D LL+S D YGD D    R KA+ HHKF +H
Sbjct: 121 SRLENIKKHLEDKSLYALGFCSELLLSPCDTLLLSLDFYGDDDNKKPRKKAIFHHKFPNH 180

Query: 182 DLTMEALSPGLFVDKSGKYWDVPSSLVIDLGSADTDSGLSYHLSMHHNAGSPSQSGSEQT 241
           DL +EA+ PGL++DK+G YWDVP S+ IDL S  +DSG SYH  MHH+AG P Q G ++T
Sbjct: 181 DLNVEAVWPGLYIDKAGNYWDVPFSMAIDLASLSSDSGASYHFCMHHSAGQPMQLGGDET 240

Query: 242 CMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLANPHVSLSGIIGAVATS 301
              P  LLPG+S K+AF+LKKN+EIWRSNA+KLK VQP+DIFL+NPH+S SG+IGA   +
Sbjct: 241 VEVPATLLPGISLKSAFSLKKNVEIWRSNAQKLKMVQPFDIFLSNPHISASGVIGAAVMA 300

Query: 302 YFGDISAGSAAEGSLQEFRGLYMQTSRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFS 361
            FGD S         Q+F GL ++   ++ST+  D F+S+SF+AQ+G FQR  LDLTRF 
Sbjct: 301 CFGDNSVRPQVVHESQQFEGLCLRAPAVKSTLLVDAFSSVSFTAQHGNFQRLLLDLTRFH 360

Query: 362 GRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFSFQQQIAGPVSFRADSGV 421
            R DF SGSKFLSGA  L +D  NSQ P  E+V+A  P A  SFQQQIAGP SFR DSGV
Sbjct: 361 ARLDFPSGSKFLSGAARLAQDFCNSQQPTMETVQAICPKATVSFQQQIAGPFSFRVDSGV 420

Query: 422 AIDLSKAGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE 471
            ID     W  + V++P FA+EYAL+ LGSAKA+AWYSPKQ+EFMVELRF+E
Sbjct: 421 EIDWKNKDW-HMCVDDPVFAIEYALHVLGSAKAVAWYSPKQQEFMVELRFFE 463

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
TGD4_ARATH  7.1e-123  49.59     Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic OS=Arabidopsis thaliana GN=...

Match Name        E-value   Identity  Description
A0A0A0K824_CUCSA  2.2e-219  82.30     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G001740 PE=4 SV=1
W9SJS3_9ROSA      1.9e-151  58.42     Uncharacterized protein OS=Morus notabilis GN=L484_014683 PE=4 SV=1
A5BHS8_VITVI      1.4e-149  59.79     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_020812 PE=4 SV=1
D7T697_VITVI      9.0e-149  59.57     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0020g00870 PE=4 SV=...
A0A067K6W8_JATCU  1.2e-148  59.49     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_19095 PE=4 SV=1

Match Name   E-value   Identity  Description
AT3G06960.1  4.0e-124  49.59     pigment defective 320
AT2G44640.1  1.6e-64   32.84     FUNCTIONS IN: molecular_function unknown

Match Name                        E-value   Identity  Description
gi|778709018|ref|XP_004138517.2|  3.1e-219  82.30     PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucumis sativus...
gi|659116830|ref|XP_008458281.1|  1.3e-217  82.09     PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucumis melo]
gi|645233790|ref|XP_008223512.1|  3.2e-155  60.64     PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Prunus mume]
gi|703161607|ref|XP_010112836.1|  2.8e-151  58.42     hypothetical protein L484_014683 [Morus notabilis]
gi|743910942|ref|XP_010999321.1|  1.5e-149  58.69     PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic-like [Populus eu...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0016482      cytosolic transport
biological_process  GO:0006869      lipid transport
biological_process  GO:1902582      single-organism intracellular transport
biological_process  GO:0006470      protein dephosphorylation
biological_process  GO:0008150      biological_process
cellular_component  GO:0009941      chloroplast envelope
cellular_component  GO:0016020      membrane
cellular_component  GO:0009507      chloroplast
cellular_component  GO:0009570      chloroplast stroma
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005488      binding
molecular_function  GO:0008253      5'-nucleotidase activity
molecular_function  GO:0046872      metal ion binding
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh17G003920.1  CmoCh17G003920.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term  IPR Description   Source   Source Term    Source Description                                    Alignment
None      No IPR available  PANTHER  PTHR34954      FAMILY NOT NAMED                                      coord: 3..471, score: 2.8E
None      No IPR available  PANTHER  PTHR34954:SF1  PROTEIN TRIGALACTOSYLDIACYLGLYCEROL 4, CHLOROPLASTIC  coord: 3..471, score: 2.8E