Cla97C01G004820 (gene) Watermelon (97103) v2

NameCla97C01G004820
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionFantom protein
LocationCla97Chr01 : 4605550 .. 4608321 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTTTGTTCGGTACGGGCCGGCAAGGCTGGTCCCAATTGGCTTGACCGCCTACGTTCCAACAAGGGTTTTCCAATCGTTGATAATCTTGAACTTGATCACTTCCTTACTGACCAAACACTCGATAATCCCTCCTCGTCTTCGCTTCTAGATTCTAAGCCCCATTCCACTCAGGCTGACCCCCACTCGGACTCTGATCCCAATTCTCAATGCCGGGACAACTCCTCCTCCTCCAACTCTCCTGTTGAAAATGAAAACCCATCTTCTTATGGAATCATTACTAACATCCTCTCTGACCTCTTCAACATGACGGGTTCCTCTCGTAATTCCAAATGTTCCGGCAAAAAGTACCCTAGGAAACAGTCCAACCCCAAGATTTGCTCTCTTCCTTCTGGTACTAGTGCGGACTATGCGGATGCGAAGAATATGTGTTGTCTGCAGAAAGAAGATAACATCCTCTCATCAAACTCTGATAATAGCTCAAAAGGTTGCTTCGATGTTGGGTCGGATGTAGCACAAAATGTGTGCCTTAAGGTTGTAGAGGAAGAGATGGGAGATGAGAAGTGCGAGAAGGAACTTAAAGGATACTCGAAAAGCGAGGTCACGGTCATAGATACTAGCGATGACGTCTGGAAGTCCGACAAACTGATTTTCAGAAGGAAGAATGTATGGAAGGTCAAGGACAAAAAGGGTAAGTTGAGGAGCTATGGAAGGAAGAAGAGGAAGCAGTCTTTTGAAATGAATGACCTTCCCGATAAGATTGCTTCCACAAGTAAGAAAACCAAAGTCTGGGGTTCAGAGGAGCGCTTTCATTTCAATGAACAGAAAAACCGTGGAAAGGAATCTCTCAAACCATTGAATAAAGTAAGATTAATTATTTTGATACTGTCTTGATTGTACAAGTGCAATTTCTAGCTAGTCTTTTGTTGCTGTTTTTTTTTTAATTGTGAAATGATGGTTGAGTATTGGTCTTTATATAGATGTAATTCGCACCCGTGTAGATCCTTCTGTTACTATTTTCTGCGTATATTTCAGGTGCTTTCTTCCCCTTTATCACTTTTGATGTGCATTCTTAAGAAATGTTGATCTCTTTAAATGGGTCTCTAAAATAATTGCTACTTGACGTTTCCATGTCAACTACCTTGTTTATACCAAACAAAGATGTAAATGTCGAGATGCAGATGTAGATGAAGTCATTGAAGTTTCAATTCATACATATGCTGATAGTCATAGACTATCTTGTAGATATATGGAAAATTAATCAAAGTCATGGAACTCCTAATGATTATTTGAGTGATTTGGGTTAATAAATAAGCAATTCCTGTTTGTTGAAGAAGTTGTAAATTCTTTTATTAGTATTTATGTTTATATTGGTTATGTTTTTGTTGACATTTTTATAGTAGATTTTGTTTCCTCCTATCCATGTTTGAATACTTTAATTTGTGGAAATATTGAAATGTGGATTGAGATTTTATTTCTGTACCTTAGTATGAAGAATTTTGGTATGTATTATTTCACTTCACAGTTGTGAATGAAACAATAAATTTTGTTCTAGTCCCTACTTGTCATATTATCATCTCAATTCAAACTGCTTAGTGTTTGTTTGTGGTCAAGTAGTCTCATGACCGTTATCTATGTGTACGATTCTCATGATGCTATTTAATTAATTACATAATTGTATGACATAATGTTGGAGCTGAATCTCTTCTTAGTCATGATTACTATCATTGATGTCATCTTTTAGCTGAGATTCTCGAAGTGGAATAATTGTTGGATTGTGGCAGCCACTATAATTTTATGATGTTCTTAAGGCATTATTTGAAAGAAATATTGTTGCGAGGCAGGAGGAGGCGTATTATTGAGTCAGTTATTTTAAATGCTATCCTAGTTGATTTAAAACACAATGGAGGTGAAATCTAATTTTATATGCACTGTTTTTGCCACTATTGATAAACAAAACTTTTAGAATATAGTGTAAAATACACTTTTGGTCATTCAGGTTTGGAGTAAATGTCTATTTGATCCTTGAAGTTCCAAAATAGGGACTAGAATAGGCACTTCTAGTCCCTGAGATTTGAATAAAAGTTTAAATGGCTCATGAGGTCAGTTGACTGCTAGTTGACTAGCAGAATAGTGAAATGGTCATTACTTGCTGACAAGATTATGATTAGATGGCAAACAATTAATAATTTTATATATTGTTTAATGTGATTGGGACTATAATGTGGCAAAATGGATATCTAAAATTTAAAGCATCTTAAGTCTTAATTAGTTGAGGTATGCTCTTATTGGCATTTTTGAAAAGAAAAAAAAAACTTTGTGCTATGTATTTGTTTTTGTTTAATTCTTTGCCCCTTAAAAGAAAAAAGAATTCAATTGAGAAACAAAAGGGTTCCCTGTAAACAATCTGAGCTCCACACATGCATTTCTTTCTCCTGTTTTTAACTTTCCTCCAGGAATATTATTTAGCATTTTTTCCCTTACCGTTCTAGATTTTGAAACTTCTTGCAGGGACATAATCCTCAGCATTGCTATGGTCCTGAAATTAGAGTAAATGCACCGGATAGTAGTAATGAAAAGAAGGAAAATGTTTACACGCTCTCCCAGAAAAATGTCAGCCATGACCCCAAAGAAAATGGTAAGTTTTCTTGGGATTTGCTGTTTTTCAGTTCTACTCAGTCTTGGAATTCTATAGCCGACCTTGAGACAATGCAATGAACTTATATGGATGTTTTGTAATTTTCCCGTACAGATCACCCAAGGGCTTGGTAA

mRNA sequence

ATGCTTTGTTCGGTACGGGCCGGCAAGGCTGGTCCCAATTGGCTTGACCGCCTACGTTCCAACAAGGGTTTTCCAATCGTTGATAATCTTGAACTTGATCACTTCCTTACTGACCAAACACTCGATAATCCCTCCTCGTCTTCGCTTCTAGATTCTAAGCCCCATTCCACTCAGGCTGACCCCCACTCGGACTCTGATCCCAATTCTCAATGCCGGGACAACTCCTCCTCCTCCAACTCTCCTGTTGAAAATGAAAACCCATCTTCTTATGGAATCATTACTAACATCCTCTCTGACCTCTTCAACATGACGGGTTCCTCTCGTAATTCCAAATGTTCCGGCAAAAAGTACCCTAGGAAACAGTCCAACCCCAAGATTTGCTCTCTTCCTTCTGGTACTAGTGCGGACTATGCGGATGCGAAGAATATGTGTTGTCTGCAGAAAGAAGATAACATCCTCTCATCAAACTCTGATAATAGCTCAAAAGGTTGCTTCGATGTTGGGTCGGATGTAGCACAAAATGTGTGCCTTAAGGTTGTAGAGGAAGAGATGGGAGATGAGAAGTGCGAGAAGGAACTTAAAGGATACTCGAAAAGCGAGGTCACGGTCATAGATACTAGCGATGACGTCTGGAAGTCCGACAAACTGATTTTCAGAAGGAAGAATGTATGGAAGGTCAAGGACAAAAAGGGTAAGTTGAGGAGCTATGGAAGGAAGAAGAGGAAGCAGTCTTTTGAAATGAATGACCTTCCCGATAAGATTGCTTCCACAAGTAAGAAAACCAAAGTCTGGGGTTCAGAGGAGCGCTTTCATTTCAATGAACAGAAAAACCGTGGAAAGGAATCTCTCAAACCATTGAATAAAGGACATAATCCTCAGCATTGCTATGGTCCTGAAATTAGAGTAAATGCACCGGATAGTAGTAATGAAAAGAAGGAAAATGTTTACACGCTCTCCCAGAAAAATGTCAGCCATGACCCCAAAGAAAATGATCACCCAAGGGCTTGGTAA

Coding sequence (CDS)

ATGCTTTGTTCGGTACGGGCCGGCAAGGCTGGTCCCAATTGGCTTGACCGCCTACGTTCCAACAAGGGTTTTCCAATCGTTGATAATCTTGAACTTGATCACTTCCTTACTGACCAAACACTCGATAATCCCTCCTCGTCTTCGCTTCTAGATTCTAAGCCCCATTCCACTCAGGCTGACCCCCACTCGGACTCTGATCCCAATTCTCAATGCCGGGACAACTCCTCCTCCTCCAACTCTCCTGTTGAAAATGAAAACCCATCTTCTTATGGAATCATTACTAACATCCTCTCTGACCTCTTCAACATGACGGGTTCCTCTCGTAATTCCAAATGTTCCGGCAAAAAGTACCCTAGGAAACAGTCCAACCCCAAGATTTGCTCTCTTCCTTCTGGTACTAGTGCGGACTATGCGGATGCGAAGAATATGTGTTGTCTGCAGAAAGAAGATAACATCCTCTCATCAAACTCTGATAATAGCTCAAAAGGTTGCTTCGATGTTGGGTCGGATGTAGCACAAAATGTGTGCCTTAAGGTTGTAGAGGAAGAGATGGGAGATGAGAAGTGCGAGAAGGAACTTAAAGGATACTCGAAAAGCGAGGTCACGGTCATAGATACTAGCGATGACGTCTGGAAGTCCGACAAACTGATTTTCAGAAGGAAGAATGTATGGAAGGTCAAGGACAAAAAGGGTAAGTTGAGGAGCTATGGAAGGAAGAAGAGGAAGCAGTCTTTTGAAATGAATGACCTTCCCGATAAGATTGCTTCCACAAGTAAGAAAACCAAAGTCTGGGGTTCAGAGGAGCGCTTTCATTTCAATGAACAGAAAAACCGTGGAAAGGAATCTCTCAAACCATTGAATAAAGGACATAATCCTCAGCATTGCTATGGTCCTGAAATTAGAGTAAATGCACCGGATAGTAGTAATGAAAAGAAGGAAAATGTTTACACGCTCTCCCAGAAAAATGTCAGCCATGACCCCAAAGAAAATGATCACCCAAGGGCTTGGTAA

Protein sequence

MLCSVRAGKAGPNWLDRLRSNKGFPIVDNLELDHFLTDQTLDNPSSSSLLDSKPHSTQADPHSDSDPNSQCRDNSSSSNSPVENENPSSYGIITNILSDLFNMTGSSRNSKCSGKKYPRKQSNPKICSLPSGTSADYADAKNMCCLQKEDNILSSNSDNSSKGCFDVGSDVAQNVCLKVVEEEMGDEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRKNVWKVKDKKGKLRSYGRKKRKQSFEMNDLPDKIASTSKKTKVWGSEERFHFNEQKNRGKESLKPLNKGHNPQHCYGPEIRVNAPDSSNEKKENVYTLSQKNVSHDPKENDHPRAW
BLAST of Cla97C01G004820 vs. NCBI nr
Match: XP_016898859.1 (PREDICTED: uncharacterized protein LOC103482569 [Cucumis melo] >XP_016898860.1 PREDICTED: uncharacterized protein LOC103482569 [Cucumis melo] >XP_016898861.1 PREDICTED: uncharacterized protein LOC103482569 [Cucumis melo])

HSP 1 Score: 519.2 bits (1336), Expect = 1.0e-143
Identity = 268/328 (81.71%), Postives = 285/328 (86.89%), Query Frame = 0

Query: 1   MLCSVRAGKAGPNWLDRLRSNKGFPIVDNLELDHFLTDQTLDNPSSSSLLDSKPHSTQAD 60
           MLCSVRAGKAGPNWLDRLRSNKGFPI DNLELDHFLTDQ LDNP   SL DS PHST+AD
Sbjct: 1   MLCSVRAGKAGPNWLDRLRSNKGFPITDNLELDHFLTDQNLDNP--CSLSDSNPHSTRAD 60

Query: 61  PHSDSXXXXXCXXXXXSSNSPVENENPSSYGIITNILSDLFNMTGSSRNSKCSGKKYPRK 120
           PHSD+           SSNSP+EN NPSS+ IIT+ILSDLFNM G+SRNSKC  KKYPRK
Sbjct: 61  PHSDANLNSQ-HQDNSSSNSPIENGNPSSFEIITDILSDLFNMGGASRNSKCCSKKYPRK 120

Query: 121 QSNPKICSLPSGTSADYADAKNMCCLQKEDNILSSNSDNSSKGCFDVGSDVAQNVCLKVV 180
           QSNPKICS+PS  + DYADAKN+CCLQKEDNILSSNSDNSSKGC D GSD+AQNVCLKVV
Sbjct: 121 QSNPKICSIPSIANVDYADAKNLCCLQKEDNILSSNSDNSSKGCTDSGSDMAQNVCLKVV 180

Query: 181 EEEMGDEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRKNVWKVKDKKGKLRSYGRKK 240
           EEE+ DEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRK+VWKVKDKK KLRSYGRKK
Sbjct: 181 EEEVWDEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRKSVWKVKDKKCKLRSYGRKK 240

Query: 241 RKQSFEMNDLPDKIASTSKKTKVWGSEERFHFNEQKNRGKESLKPLNKGHNPQHCYGPEI 300
           RKQS EMNDLPD+I S SKKTKVWGSEERFH N+Q+  GKESLKPLNK HN QHCYGPEI
Sbjct: 241 RKQSSEMNDLPDRIVSASKKTKVWGSEERFHLNKQQIHGKESLKPLNKVHNLQHCYGPEI 300

Query: 301 RVNAPDSSNEKKENVYTLSQKNVSHDPK 329
           R+ APDSSNEKKEN  TLSQKN  +DPK
Sbjct: 301 RLTAPDSSNEKKENGCTLSQKNGGYDPK 325

BLAST of Cla97C01G004820 vs. NCBI nr
Match: XP_011654849.1 (PREDICTED: uncharacterized protein LOC105435450 [Cucumis sativus] >XP_011654850.1 PREDICTED: uncharacterized protein LOC105435450 [Cucumis sativus] >KGN50314.1 hypothetical protein Csa_5G167060 [Cucumis sativus])

HSP 1 Score: 512.3 bits (1318), Expect = 1.2e-141
Identity = 266/328 (81.10%), Postives = 282/328 (85.98%), Query Frame = 0

Query: 1   MLCSVRAGKAGPNWLDRLRSNKGFPIVDNLELDHFLTDQTLDNPSSSSLLDSKPHSTQAD 60
           MLCSVRAGKAGPNWLDRLRSNKGFPI DNLELDHFLTDQ LDNP   SL DS PHST+AD
Sbjct: 1   MLCSVRAGKAGPNWLDRLRSNKGFPITDNLELDHFLTDQNLDNP--CSLSDSNPHSTRAD 60

Query: 61  PHSDSXXXXXCXXXXXSSNSPVENENPSSYGIITNILSDLFNMTGSSRNSKCSGKKYPRK 120
           P SD+           SSNSP+EN NPSS+GIIT+ILSDLFNM G+SRNSKC  KKYPRK
Sbjct: 61  PRSDANLNSH-HQDNSSSNSPIENGNPSSFGIITDILSDLFNMGGASRNSKCFSKKYPRK 120

Query: 121 QSNPKICSLPSGTSADYADAKNMCCLQKEDNILSSNSDNSSKGCFDVGSDVAQNVCLKVV 180
           QSNPKI S+PS T+ DYADAKN+CCLQKEDNILSSNSDNSSKGC D GSD+AQNVCLKVV
Sbjct: 121 QSNPKIYSIPSVTNGDYADAKNLCCLQKEDNILSSNSDNSSKGCIDSGSDMAQNVCLKVV 180

Query: 181 EEEMGDEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRKNVWKVKDKKGKLRSYGRKK 240
           EEE+ DEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRK+VWKVKDKK KLRSYGRKK
Sbjct: 181 EEEVWDEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRKSVWKVKDKKCKLRSYGRKK 240

Query: 241 RKQSFEMNDLPDKIASTSKKTKVWGSEERFHFNEQKNRGKESLKPLNKGHNPQHCYGPEI 300
           RKQS E NDLPD+I S SKKTKVWGSEERFH N Q+  GKESLKPLNK HN QHCYGPE 
Sbjct: 241 RKQSSETNDLPDRIVSASKKTKVWGSEERFHLNRQQIHGKESLKPLNKVHNFQHCYGPES 300

Query: 301 RVNAPDSSNEKKENVYTLSQKNVSHDPK 329
           R+ APDSSNEKKEN  TLSQKN  +DPK
Sbjct: 301 RLTAPDSSNEKKENGSTLSQKNGGYDPK 325

BLAST of Cla97C01G004820 vs. NCBI nr
Match: XP_023551713.1 (uncharacterized protein LOC111809606 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 443.7 bits (1140), Expect = 5.4e-121
Identity = 246/326 (75.46%), Postives = 265/326 (81.29%), Query Frame = 0

Query: 1   MLCSVRAGKAGPNWLDRLRSNKGFPIVDNLELDHFLTDQTLDNPSSSSLLDSKPHSTQAD 60
           MLCSVRAGKA PNWLDRLRSNKGFPI DNL+LDHFLT+Q LDNPS SS            
Sbjct: 1   MLCSVRAGKAAPNWLDRLRSNKGFPIADNLDLDHFLTNQNLDNPSPSSXXXXXXXXXXXX 60

Query: 61  PHSDSXXXXXCXXXXXSSNSPVENENPSSYGIITNILSDLFNMTGSSRNSKCSGKKYPRK 120
                XXXXX XXXXX    P+EN NPSSYGIIT ILSDLFNMTG+SR+SKCSGKK PRK
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXPIENGNPSSYGIITGILSDLFNMTGASRSSKCSGKKLPRK 120

Query: 121 QSNPKICSLPSGTSADYADAKNMCCLQKEDNILSSNSDNSSKGCFDVGSDVAQNVCLKVV 180
           QSNPKICS+PS T+ADYAD KN+CC QKEDNILSSNSDNSSKG  + GSD AQN    V 
Sbjct: 121 QSNPKICSVPSLTNADYADDKNLCCAQKEDNILSSNSDNSSKGGANAGSDKAQNARSMVE 180

Query: 181 EEEMGDEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRKNVWKVKDKKGKLRSYGRKK 240
           EEE+ DEK EKEL+GYSKSEVTVIDTS DVWKSDKLIFRRKNVWKVKDKKGKLRSYGRKK
Sbjct: 181 EEEVEDEKGEKELQGYSKSEVTVIDTSSDVWKSDKLIFRRKNVWKVKDKKGKLRSYGRKK 240

Query: 241 RKQSFEMNDLPDKIASTSKKTKVWGSEERFHFNEQKNRGKESLKPLNKGHNPQHCYGPEI 300
           RKQ  EMN L D  AS SKKTK WGSEERFHFN  + RGKE+LK LNKG+N QHC GPEI
Sbjct: 241 RKQCSEMNGLSDMTASASKKTKFWGSEERFHFNAPQIRGKEALKQLNKGYNSQHCSGPEI 300

Query: 301 RVNAPDSSNEKKENVYTLSQKNVSHD 327
            V+APDSSN+KKENV +LSQ+N S+D
Sbjct: 301 SVSAPDSSNDKKENVCSLSQENGSYD 326

BLAST of Cla97C01G004820 vs. NCBI nr
Match: XP_022984364.1 (uncharacterized protein LOC111482689 [Cucurbita maxima] >XP_022984365.1 uncharacterized protein LOC111482689 [Cucurbita maxima])

HSP 1 Score: 438.7 bits (1127), Expect = 1.7e-119
Identity = 243/326 (74.54%), Postives = 263/326 (80.67%), Query Frame = 0

Query: 1   MLCSVRAGKAGPNWLDRLRSNKGFPIVDNLELDHFLTDQTLDNPSSSSLLDSKPHSTQAD 60
           MLCSVRAGKA PNWLDRLRSNKGFPI DNL+LDHFLT+Q LDNPS SS            
Sbjct: 1   MLCSVRAGKAAPNWLDRLRSNKGFPIADNLDLDHFLTNQNLDNPSPSSXXXXXXXXXXXX 60

Query: 61  PHSDSXXXXXCXXXXXSSNSPVENENPSSYGIITNILSDLFNMTGSSRNSKCSGKKYPRK 120
                XXXXX XXXXX    P+EN NPSSYGIIT ILSDLF+MTG+SR+SKCSGKK PRK
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXPIENGNPSSYGIITGILSDLFHMTGASRSSKCSGKKLPRK 120

Query: 121 QSNPKICSLPSGTSADYADAKNMCCLQKEDNILSSNSDNSSKGCFDVGSDVAQNVCLKVV 180
           QSNPKICS+PS T+ADYAD KN+CC QKEDNILSSNSDNSSKG  + GSD AQN    V 
Sbjct: 121 QSNPKICSVPSLTNADYADDKNLCCAQKEDNILSSNSDNSSKGGANAGSDKAQNARSMVE 180

Query: 181 EEEMGDEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRKNVWKVKDKKGKLRSYGRKK 240
           EEE+ DEK EKEL+GYSKSEVTVIDTS DVWKSDKLIFRRKNVWKVKDKKGKLRSYGRKK
Sbjct: 181 EEEVEDEKGEKELQGYSKSEVTVIDTSSDVWKSDKLIFRRKNVWKVKDKKGKLRSYGRKK 240

Query: 241 RKQSFEMNDLPDKIASTSKKTKVWGSEERFHFNEQKNRGKESLKPLNKGHNPQHCYGPEI 300
           RKQ  EMN L D  AS SKK K WGSEERFHFN  + RGKE+LK LNKG+N  HC GPEI
Sbjct: 241 RKQCSEMNGLSDMTASASKKAKFWGSEERFHFNAPQIRGKEALKQLNKGYNSHHCSGPEI 300

Query: 301 RVNAPDSSNEKKENVYTLSQKNVSHD 327
            V+APDSSN+KK NVY+LSQ+N S+D
Sbjct: 301 SVSAPDSSNDKKGNVYSLSQENGSYD 326

BLAST of Cla97C01G004820 vs. NCBI nr
Match: XP_022922604.1 (uncharacterized protein LOC111430562 isoform X1 [Cucurbita moschata] >XP_022922605.1 uncharacterized protein LOC111430562 isoform X1 [Cucurbita moschata])

HSP 1 Score: 438.0 bits (1125), Expect = 3.0e-119
Identity = 243/326 (74.54%), Postives = 264/326 (80.98%), Query Frame = 0

Query: 1   MLCSVRAGKAGPNWLDRLRSNKGFPIVDNLELDHFLTDQTLDNPSSSSLLDSKPHSTQAD 60
           MLCSVRAGKA PNWLDRLRSNKGFPI DNL+LDHFLT+Q LDNPS SS            
Sbjct: 1   MLCSVRAGKAAPNWLDRLRSNKGFPIADNLDLDHFLTNQNLDNPSPSSXXXXXXXXXXXX 60

Query: 61  PHSDSXXXXXCXXXXXSSNSPVENENPSSYGIITNILSDLFNMTGSSRNSKCSGKKYPRK 120
                XXXXX XXXXX    P+EN NPSSYGIIT ILSDLFN+TG+SR+SKCSGKK PRK
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXPIENGNPSSYGIITGILSDLFNITGASRSSKCSGKKLPRK 120

Query: 121 QSNPKICSLPSGTSADYADAKNMCCLQKEDNILSSNSDNSSKGCFDVGSDVAQNVCLKVV 180
           QSNPKICS+PS T+ADYAD KN+CC QKEDNILSSNSDNSSKG  + GSD AQN    V 
Sbjct: 121 QSNPKICSVPSLTNADYADDKNLCCAQKEDNILSSNSDNSSKGGANAGSDKAQNARSMVE 180

Query: 181 EEEMGDEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRKNVWKVKDKKGKLRSYGRKK 240
           EEE+ DEK EKEL+GYSKSEVTVIDTS DVWKSDKLIFRRKNVWKVKDKKGKLRSYGRKK
Sbjct: 181 EEEVEDEKGEKELQGYSKSEVTVIDTSSDVWKSDKLIFRRKNVWKVKDKKGKLRSYGRKK 240

Query: 241 RKQSFEMNDLPDKIASTSKKTKVWGSEERFHFNEQKNRGKESLKPLNKGHNPQHCYGPEI 300
           RKQ  EMN L D  AS SK TK WGSEERFHFN  + RGKE+LK LNKG+N QHC  PEI
Sbjct: 241 RKQCSEMNGLSDMTASASKITKFWGSEERFHFNAPQIRGKEALKQLNKGYNSQHCSDPEI 300

Query: 301 RVNAPDSSNEKKENVYTLSQKNVSHD 327
            V+APDSSN+KKE+VY+LSQ+N S+D
Sbjct: 301 SVSAPDSSNDKKESVYSLSQENGSYD 326

BLAST of Cla97C01G004820 vs. TrEMBL
Match: tr|A0A1S4DS96|A0A1S4DS96_CUCME (uncharacterized protein LOC103482569 OS=Cucumis melo OX=3656 GN=LOC103482569 PE=4 SV=1)

HSP 1 Score: 519.2 bits (1336), Expect = 6.7e-144
Identity = 268/328 (81.71%), Postives = 285/328 (86.89%), Query Frame = 0

Query: 1   MLCSVRAGKAGPNWLDRLRSNKGFPIVDNLELDHFLTDQTLDNPSSSSLLDSKPHSTQAD 60
           MLCSVRAGKAGPNWLDRLRSNKGFPI DNLELDHFLTDQ LDNP   SL DS PHST+AD
Sbjct: 1   MLCSVRAGKAGPNWLDRLRSNKGFPITDNLELDHFLTDQNLDNP--CSLSDSNPHSTRAD 60

Query: 61  PHSDSXXXXXCXXXXXSSNSPVENENPSSYGIITNILSDLFNMTGSSRNSKCSGKKYPRK 120
           PHSD+           SSNSP+EN NPSS+ IIT+ILSDLFNM G+SRNSKC  KKYPRK
Sbjct: 61  PHSDANLNSQ-HQDNSSSNSPIENGNPSSFEIITDILSDLFNMGGASRNSKCCSKKYPRK 120

Query: 121 QSNPKICSLPSGTSADYADAKNMCCLQKEDNILSSNSDNSSKGCFDVGSDVAQNVCLKVV 180
           QSNPKICS+PS  + DYADAKN+CCLQKEDNILSSNSDNSSKGC D GSD+AQNVCLKVV
Sbjct: 121 QSNPKICSIPSIANVDYADAKNLCCLQKEDNILSSNSDNSSKGCTDSGSDMAQNVCLKVV 180

Query: 181 EEEMGDEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRKNVWKVKDKKGKLRSYGRKK 240
           EEE+ DEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRK+VWKVKDKK KLRSYGRKK
Sbjct: 181 EEEVWDEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRKSVWKVKDKKCKLRSYGRKK 240

Query: 241 RKQSFEMNDLPDKIASTSKKTKVWGSEERFHFNEQKNRGKESLKPLNKGHNPQHCYGPEI 300
           RKQS EMNDLPD+I S SKKTKVWGSEERFH N+Q+  GKESLKPLNK HN QHCYGPEI
Sbjct: 241 RKQSSEMNDLPDRIVSASKKTKVWGSEERFHLNKQQIHGKESLKPLNKVHNLQHCYGPEI 300

Query: 301 RVNAPDSSNEKKENVYTLSQKNVSHDPK 329
           R+ APDSSNEKKEN  TLSQKN  +DPK
Sbjct: 301 RLTAPDSSNEKKENGCTLSQKNGGYDPK 325

BLAST of Cla97C01G004820 vs. TrEMBL
Match: tr|A0A0A0KN29|A0A0A0KN29_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G167060 PE=4 SV=1)

HSP 1 Score: 512.3 bits (1318), Expect = 8.2e-142
Identity = 266/328 (81.10%), Postives = 282/328 (85.98%), Query Frame = 0

Query: 1   MLCSVRAGKAGPNWLDRLRSNKGFPIVDNLELDHFLTDQTLDNPSSSSLLDSKPHSTQAD 60
           MLCSVRAGKAGPNWLDRLRSNKGFPI DNLELDHFLTDQ LDNP   SL DS PHST+AD
Sbjct: 1   MLCSVRAGKAGPNWLDRLRSNKGFPITDNLELDHFLTDQNLDNP--CSLSDSNPHSTRAD 60

Query: 61  PHSDSXXXXXCXXXXXSSNSPVENENPSSYGIITNILSDLFNMTGSSRNSKCSGKKYPRK 120
           P SD+           SSNSP+EN NPSS+GIIT+ILSDLFNM G+SRNSKC  KKYPRK
Sbjct: 61  PRSDANLNSH-HQDNSSSNSPIENGNPSSFGIITDILSDLFNMGGASRNSKCFSKKYPRK 120

Query: 121 QSNPKICSLPSGTSADYADAKNMCCLQKEDNILSSNSDNSSKGCFDVGSDVAQNVCLKVV 180
           QSNPKI S+PS T+ DYADAKN+CCLQKEDNILSSNSDNSSKGC D GSD+AQNVCLKVV
Sbjct: 121 QSNPKIYSIPSVTNGDYADAKNLCCLQKEDNILSSNSDNSSKGCIDSGSDMAQNVCLKVV 180

Query: 181 EEEMGDEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRKNVWKVKDKKGKLRSYGRKK 240
           EEE+ DEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRK+VWKVKDKK KLRSYGRKK
Sbjct: 181 EEEVWDEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRKSVWKVKDKKCKLRSYGRKK 240

Query: 241 RKQSFEMNDLPDKIASTSKKTKVWGSEERFHFNEQKNRGKESLKPLNKGHNPQHCYGPEI 300
           RKQS E NDLPD+I S SKKTKVWGSEERFH N Q+  GKESLKPLNK HN QHCYGPE 
Sbjct: 241 RKQSSETNDLPDRIVSASKKTKVWGSEERFHLNRQQIHGKESLKPLNKVHNFQHCYGPES 300

Query: 301 RVNAPDSSNEKKENVYTLSQKNVSHDPK 329
           R+ APDSSNEKKEN  TLSQKN  +DPK
Sbjct: 301 RLTAPDSSNEKKENGSTLSQKNGGYDPK 325

BLAST of Cla97C01G004820 vs. TrEMBL
Match: tr|B9SEK5|B9SEK5_RICCO (Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_0969780 PE=4 SV=1)

HSP 1 Score: 166.8 bits (421), Expect = 8.5e-38
Identity = 119/289 (41.18%), Postives = 163/289 (56.40%), Query Frame = 0

Query: 1   MLCSVRAG-KAGPNWLDRLRSNKGFPIVDNLELDHFLTDQTLDNP--SSSSLLDSKPHST 60
           MLCSV AG K+G NWLDRLRS KGFP  +NL+LD+FL++ +L NP  S S+L  +K  ++
Sbjct: 1   MLCSVSAGTKSGSNWLDRLRSTKGFPATENLDLDNFLSNSSLLNPSISESTLSHNKRVTS 60

Query: 61  QADPHSDSXXXXXCXXXXXSSNSPVENENPSSYGIITNILSDLFNMTGS-SRNSKCSGKK 120
                 D+                 EN     +G++TN+L DLFNM  S  +NS+ SG K
Sbjct: 61  DQTQFPDTSS---------------ENGEKEWFGLVTNVLCDLFNMGDSQDKNSRLSGTK 120

Query: 121 YPRKQSNPKICSLPSGTSADYADAKNMCCLQKEDNILSSNSDNSSKGCFDVGSDVAQNVC 180
             RKQ+NPK   + S    +          + ++N   SN    +  CF    D      
Sbjct: 121 SSRKQTNPKFFDIESVRKEECVQVATPASFRSDNN---SNVVGMNADCFSNDDD------ 180

Query: 181 LKVVEEEMGDEKC--EKELKGYSKSEVTVIDTSDDVWKSDKLIFRRKNVWKVKDKKGKLR 240
              V+EE   EKC  +KELKGYSKSEVTVIDTS ++WK DKL+FRRKN+WKV+DKKGK  
Sbjct: 181 -NNVDEE--KEKCSSDKELKGYSKSEVTVIDTSFEMWKFDKLVFRRKNIWKVRDKKGKSW 240

Query: 241 SYGRKKRKQSFEMNDLPDKIASTSKKTKVWGSEERFHFNEQKNRGKESL 284
           S+  KKRK +   + + +      KK K+  S+ +F  +++ N G  +L
Sbjct: 241 SFSSKKRKGNQLESAIGNGNVGCKKKAKM-SSDSQFASSKESNGGDFAL 261

BLAST of Cla97C01G004820 vs. TrEMBL
Match: tr|B9RQ51|B9RQ51_RICCO (Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_0955310 PE=4 SV=1)

HSP 1 Score: 166.4 bits (420), Expect = 1.1e-37
Identity = 116/282 (41.13%), Postives = 159/282 (56.38%), Query Frame = 0

Query: 4   SVRAG-KAGPNWLDRLRSNKGFPIVDNLELDHFLTDQTLDNPSSSSLLDSKPHSTQADPH 63
           SV AG K+G NWLDRLRS KGFP  +NL+LD+FL+D +L N  S+  L+ +  S Q +  
Sbjct: 4   SVFAGNKSGSNWLDRLRSTKGFPATENLDLDNFLSDPSLPNSESTQSLNRRVTSDQTE-- 63

Query: 64  SDSXXXXXCXXXXXSSNSPVENENPSSYGIITNILSDLFNMTGS-SRNSKCSGKKYPRKQ 123
                           ++  EN     +G++TN+L DLFNM  S  +NS+ SGKK  RKQ
Sbjct: 64  --------------IPDTLRENGEREWFGVVTNVLCDLFNMGDSQDKNSRISGKKSSRKQ 123

Query: 124 SNPKICSLPSGTSADYADAKNMCCLQKEDNILSSNSDNSSKGCFDVGSDVAQNVCLKVVE 183
           +NPK     S    +Y  A        ++N   SN    +  CF V  D   N  L   +
Sbjct: 124 TNPKFFDADSVRKEEYVQAATTASFHSDNN---SNVVGMNADCF-VDDDDEYNGKL---D 183

Query: 184 EEMGDEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRKNVWKVKDKKGKLRSYGRKKR 243
           E+      +KELKGYSKSEVTVIDTS +VWK DKL+FRRK++WKV+DKKGK  ++  KKR
Sbjct: 184 EKKEKSSSDKELKGYSKSEVTVIDTSFEVWKFDKLVFRRKSIWKVRDKKGKSWNFASKKR 243

Query: 244 KQSFEMNDLPDKIASTSKKTKVWGSEERFHFNEQKNRGKESL 284
           K +   +   +   S+ KK K+  S+  F  +++ N G  +L
Sbjct: 244 KGNHLESATNNGNVSSKKKAKM--SDSEFASSKESNGGDFAL 260

BLAST of Cla97C01G004820 vs. TrEMBL
Match: tr|A0A2P5DF97|A0A2P5DF97_PARAD (Fantom protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_068110 PE=4 SV=1)

HSP 1 Score: 163.3 bits (412), Expect = 9.4e-37
Identity = 128/336 (38.10%), Postives = 175/336 (52.08%), Query Frame = 0

Query: 1   MLCSVRAGKAGPNWLDRLRSNKGFPI-VDNLELDHFLTDQTLDNPSSSSLLDSKPHSTQA 60
           MLCSV AGK+G +WL+RLRSNKGFP   D+L+LDHFL+     NP+SSS     PH    
Sbjct: 1   MLCSVPAGKSGSSWLNRLRSNKGFPTGDDDLDLDHFLS----QNPNSSS--SDSPHL--- 60

Query: 61  DPHSDSXXXXXCXXXXXSSNSPVENENPSSYGIITNILSDLFNMTGSSRNSKCSGKKYPR 120
             +S+              ++    +     G+++N+LS+LF M GS  +S+ SGKK+PR
Sbjct: 61  --NSEESRTIPTRPEPQRVSNRSGGQEREWVGVMSNVLSELFFMGGSGESSRLSGKKFPR 120

Query: 121 KQSNPKICSLPSGTSADYADAKNMCCLQKEDNIL--SSNSDNSS------KGCFDVGSDV 180
           KQ+NP+IC + S  S      +     +  D  +  S NSD +S      +G    G   
Sbjct: 121 KQTNPRICVVSSDNSNSSVVGERK---KSSDGAVTASFNSDGNSPMMRTKEGNVGFGGFE 180

Query: 181 AQNVCLKVVEEEMGDEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRKNVWKVKDKKG 240
                               ELKGYS+SEVTVIDTS   WKS+KL+FRRKNVWKV++KKG
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXELKGYSRSEVTVIDTSFGCWKSEKLVFRRKNVWKVREKKG 240

Query: 241 KLRSYGRKKRKQSFEMNDLPDKIASTSKKTKVWGSEERFHFNEQKNRGKESLK-PLNKGH 300
           KLRS+GRKKRK     +D         KK KV  S E          G +S+K   ++G 
Sbjct: 241 KLRSFGRKKRKGG---SDSTWGAVGLEKKAKVLASSEA--------NGDQSVKISSDEGQ 300

Query: 301 NPQHCYGPEIRVNAPDSSNEKKENVYTLSQKNVSHD 327
           N ++    E+     ++ N  +E + T S   +S D
Sbjct: 301 NLKNDSMEEVCKATAENLNTTREEISTESPDKLSQD 311

BLAST of Cla97C01G004820 vs. TAIR10
Match: AT5G24500.1 (unknown protein)

HSP 1 Score: 114.4 bits (285), Expect = 1.4e-25
Identity = 95/252 (37.70%), Postives = 136/252 (53.97%), Query Frame = 0

Query: 1   MLCSVRAGK-AGPNWLDRLRSNKGFPIVDN------LELDHFL-TDQTLDNPSSSSLLDS 60
           ML S+   K A   WL+RLR N+G    D+      L LD FL  +   +  ++SS  DS
Sbjct: 1   MLSSIIDDKPASSTWLNRLRLNRGLTTDDDDASGNPLTLDDFLRRNHHTEIAATSSASDS 60

Query: 61  KPHS-TQADPHSDSXXXXXCXXXXXSSNSPVENENPSS-YGIITNILSDLFNMTGSSRNS 120
            P +   +DP                + SP E   P   YG+++++L +LFN +GSS++S
Sbjct: 61  PPSAPIPSDPE--------------LAESPSEEPVPGEWYGVMSDVLFELFNFSGSSKSS 120

Query: 121 KCSG-KKYPRKQSNPKICSLPSG-------TSADYADAKNMCCLQKEDNILSSNSDNSSK 180
              G KK PRKQSNP+ CSL +         +    DA  +  +++     S +S N   
Sbjct: 121 TIPGKKKLPRKQSNPRHCSLETPEDVVVPLVNQKSDDANCLPSVREFATSSSRSSYNKKP 180

Query: 181 GCFDVGSDVAQNVCLKVVEEEMGDEKCEKELKGYSKSEVTVIDTSDDVWKSDKLIFRRKN 235
              ++       V    V+EE  +EK EK+L G+S+SEVTVIDTS  +WKS+KL+FRR+N
Sbjct: 181 PAPEIRERRRSVVEGDGVDEE--EEKGEKDLVGFSRSEVTVIDTSFKIWKSEKLVFRRRN 236

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_016898859.11.0e-14381.71PREDICTED: uncharacterized protein LOC103482569 [Cucumis melo] >XP_016898860.1 P... [more]
XP_011654849.11.2e-14181.10PREDICTED: uncharacterized protein LOC105435450 [Cucumis sativus] >XP_011654850.... [more]
XP_023551713.15.4e-12175.46uncharacterized protein LOC111809606 isoform X1 [Cucurbita pepo subsp. pepo][more]
XP_022984364.11.7e-11974.54uncharacterized protein LOC111482689 [Cucurbita maxima] >XP_022984365.1 uncharac... [more]
XP_022922604.13.0e-11974.54uncharacterized protein LOC111430562 isoform X1 [Cucurbita moschata] >XP_0229226... [more]
Match NameE-valueIdentityDescription
tr|A0A1S4DS96|A0A1S4DS96_CUCME6.7e-14481.71uncharacterized protein LOC103482569 OS=Cucumis melo OX=3656 GN=LOC103482569 PE=... [more]
tr|A0A0A0KN29|A0A0A0KN29_CUCSA8.2e-14281.10Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G167060 PE=4 SV=1[more]
tr|B9SEK5|B9SEK5_RICCO8.5e-3841.18Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_0969780 PE=4 SV=1[more]
tr|B9RQ51|B9RQ51_RICCO1.1e-3741.13Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_0955310 PE=4 SV=1[more]
tr|A0A2P5DF97|A0A2P5DF97_PARAD9.4e-3738.10Fantom protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_068110 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
AT5G24500.11.4e-2537.70unknown protein[more]
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C01G004820.1Cla97C01G004820.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 309..323
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 272..336
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 272..287
NoneNo IPR availablePANTHERPTHR37258FAMILY NOT NAMEDcoord: 1..282

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla97C01G004820Silver-seed gourdcarwmbB0741
Cla97C01G004820Silver-seed gourdcarwmbB1093
Cla97C01G004820Cucurbita maxima (Rimu)cmawmbB285
Cla97C01G004820Cucurbita maxima (Rimu)cmawmbB601
Cla97C01G004820Cucurbita moschata (Rifu)cmowmbB267
Cla97C01G004820Cucurbita moschata (Rifu)cmowmbB576
Cla97C01G004820Wax gourdwgowmbB059