ClCG03G010620 (gene) Watermelon (Charleston Gray)

Name: ClCG03G010620
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Retrotransposon gag protein
Location: CG_Chr03 : 19239644 .. 19241628 (+)
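
The location line can be turned into a span length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this page); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("CG_Chr03 : 19239644 .. 19241628 (+)")
# Inclusive coordinates are assumed, hence the +1.
span_length = end - start + 1
print(seqid, strand, span_length)
```

Under the inclusive-coordinate assumption the feature spans 1985 bp on the plus strand of CG_Chr03.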
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTCTCAATGTCTTAAGCTCGCCGAGATGGTAGCGTGAGTTCTACTGTGAAGTGGTCAGGCGGAGGGAAGACTCTTCTCTTTCAAGTTATTTGTCTCCGATTGTCGTCTGGAGATAACTTTGCTGTCACTTAGGAATTCTTCCAAGAGATCCTACAGCATTTTCGTCTTTAGGTCTGCCCTTTTATAGATGTTGCGTTGCAATCGCTGGCAAATCTGCATGATTGGAGGCTGAATGATGCCTTTCGACTAAGGTTGTTTCCTTGCAGGATAGAGCAAAAGATTGGTTAGAGTCAATTGCTCCAGAGAGCATAGCCACGTGGGATGCACTGGTGCAAATTTTTCTAAACAAGTTCTTCCTACCCTTTAAAACAAATAAGTTGAGAACATTGATAGACACTTTTAGACAAAATGACGGTGAGCAACTATTTGAGGCTTAAGAGCACTATTAGCTGCTACAGAAGTGCCTTCAACATGGCTACCCTAATTGGTTGCAAATTTAGCTATTTTACAATGGGTTATCTAGCACCACCAAAGAAATACTTGATGTGGTAGCAGGAGGTTTAATGTTTTTGAAAATTAAAGAGGACGATCGAACTTTGTTGGACAACAAGGCAACCACAAGTTATAACTAGCCAACAAAACGGGCCAATCCTAGACCAAGACTCTATGAAGTTGATGAGGTAAATTCTTTAAAAGCACATATGGCTTCTCTTATTAATGTTCTAAATCAATTGTCTATAAGAAATGCACCATCAATAGCGCACATGGTTGCCTGCAGCACTTACTTAACATTAGATAATCTAGATCTAGAGCAGGCAAACTATGTGCATAGGACAAATCAGTTTCATGGGCAATACCAAATTCCACAGTTTTTGTTACTCATGCTGAGAGGAAGCCGTCATTGAGAAACTATTAGGAGATTTTATCGAAGAATTAAGGAGTATGATAAATCAATTGGAAAGTACGATGACAAGCCAAGGAAAGGCGATTCAGAGTATTGAAGTCCAAATCAGCCAGATGGCCACAACCATGAATGCAATGCAAAAAGAAAAATTCACCAATTGTCCTAAGAGGAACCCAAAAAAGGATTGTAAGGTGATAACTCGCAGGAGTGGAAAGAATGTCGTTGGACTTATTGTTTTTTATGAGGATGAGTAGGTTGAAGGTGAGCCTTCTAGACCTAAGAAGGAAAAGGAAATTGAAGTTGAGAGGCACATTAAGGAGAAAGTGGAAAATGTTCTTGTATCCTCAGTATCAATGAGTAATTCTAACCTTATTCCTATAATTCTGAACAACATCCCCTATCGACAATATTTTAGGAAAAAGAAGTTTGATCAACAATTTTCAAAATTTTTAGAAATTTTCAAGAAATTAAATATTAACATACCTTTTGCATATGCGCTGGAAAAGATGCCGAACTACATCAAGTTCATGAAGGATATGTTGTCAAAAAAGAAGAAATTCAAAAAGTATGAGACTGTGAGCTTAACTGAAGAGTGTAGCGTTATCTTTCAGAAAAATTGTCTCAAAAGTTGAAGGACCCAAAGACTTCACCATCCTAGGCACCATACTGAACATGACGGTAAAATGTGCTTTTTATGATTTAGGACGTTTATTAATTTAATGTCATTGTCTGTGTACAGAAAATTGCTCTTAGGAGGAGGTGCGACCAATAAATATCTCTCTTCAATTGGTGGATCGTTCTCTCACTTACCCATGAGGGATAGTGGAGGATATTCTAGTGAAGGTAGATAAATTTATTTTTCCTGCAGACTTCATAGTGTTGGATATGGAGGATGACTCAGAGGTGTCAATCATCTTGGGGCACCCGTTTTTTACCATAGGTAGGGCTCTTATTAATGTCCAACAACGTAAGCTCACCCTATGTGTGAATGAGGAAGAGGTAATATTTAATATCTATCGCTCTATGAACTATCCTGATGGGGTAAATACATGTTGTAGGGTAGATACTATGGATGA

mRNA sequence

ATGGCTTCTCAATGTCTTAAGCTCGCCGAGATGGCGGAGGGAAGACTCTTCTCTTTCAAGTTATTTGTCTCCGATTGTCGTCTGGAGATAACTTTGCTGTCACTTAGGAATTCTTCCAAGAGATCCTACAGCATTTTCGATAGAGCAAAAGATTGGTTAGAGTCAATTGCTCCAGAGAGCATAGCCACGTGGGATGCACTGGAAGCCGTCATTGAGAAACTATTAGGAGATTTTATCGAAGAATTAAGGAGTATGATAAATCAATTGGAAAGTACGATGACAAGCCAAGGAAAGGCGATTCAGAGTATTGAAGTCCAAATCAGCCAGATGGCCACAACCATGAATGCAATGCAAAAAGAAAAATTCACCAATTGTCCTAAGAGGAACCCAAAAAAGGATTGTAAGGTGATAACTCGCAGGAGTGAGCCTTCTAGACCTAAGAAGGAAAAGGAAATTGAAGTTGAGAGGCACATTAAGGAGAAAGTGGAAAATGTTCTTGTATCCTCAGTATCAATGAGTAATTCTAACCTTATTCCTATAATTCTGAACAACATCCCCTATCGACAATATTTTAGGAAAAAGAAGTTTGATCAACAATTTTCAAAATTTTTAGAAATTTTCAAGAAATTAAATATTAACATACCTTTTGCATATGCGCTGGAAAAGATGCCGAACTACATCAAGTTCATGAAGGATATGTTGTCAAAAAAGAAGAAATTCAAAAAAAAAATTGTCTCAAAAGTTGAAGGACCCAAAGACTTCACCATCCTAGGCACCATACTGAACATGACGGATATTCTAGTGAAGGTAGATAAATTTATTTTTCCTGCAGACTTCATAGTGTTGGATATGGAGGATGACTCAGAGGTGTCAATCATCTTGGGGCACCCGTTTTTTACCATAGGTAGGGCTCTTATTAATGTCCAACAACGTAAGCTCACCCTATGTGTGAATGAGGAAGAGGGTAGATACTATGGATGA

Coding sequence (CDS)

ATGGCTTCTCAATGTCTTAAGCTCGCCGAGATGGCGGAGGGAAGACTCTTCTCTTTCAAGTTATTTGTCTCCGATTGTCGTCTGGAGATAACTTTGCTGTCACTTAGGAATTCTTCCAAGAGATCCTACAGCATTTTCGATAGAGCAAAAGATTGGTTAGAGTCAATTGCTCCAGAGAGCATAGCCACGTGGGATGCACTGGAAGCCGTCATTGAGAAACTATTAGGAGATTTTATCGAAGAATTAAGGAGTATGATAAATCAATTGGAAAGTACGATGACAAGCCAAGGAAAGGCGATTCAGAGTATTGAAGTCCAAATCAGCCAGATGGCCACAACCATGAATGCAATGCAAAAAGAAAAATTCACCAATTGTCCTAAGAGGAACCCAAAAAAGGATTGTAAGGTGATAACTCGCAGGAGTGAGCCTTCTAGACCTAAGAAGGAAAAGGAAATTGAAGTTGAGAGGCACATTAAGGAGAAAGTGGAAAATGTTCTTGTATCCTCAGTATCAATGAGTAATTCTAACCTTATTCCTATAATTCTGAACAACATCCCCTATCGACAATATTTTAGGAAAAAGAAGTTTGATCAACAATTTTCAAAATTTTTAGAAATTTTCAAGAAATTAAATATTAACATACCTTTTGCATATGCGCTGGAAAAGATGCCGAACTACATCAAGTTCATGAAGGATATGTTGTCAAAAAAGAAGAAATTCAAAAAAAAAATTGTCTCAAAAGTTGAAGGACCCAAAGACTTCACCATCCTAGGCACCATACTGAACATGACGGATATTCTAGTGAAGGTAGATAAATTTATTTTTCCTGCAGACTTCATAGTGTTGGATATGGAGGATGACTCAGAGGTGTCAATCATCTTGGGGCACCCGTTTTTTACCATAGGTAGGGCTCTTATTAATGTCCAACAACGTAAGCTCACCCTATGTGTGAATGAGGAAGAGGGTAGATACTATGGATGA

Protein sequence

MASQCLKLAEMAEGRLFSFKLFVSDCRLEITLLSLRNSSKRSYSIFDRAKDWLESIAPESIATWDALEAVIEKLLGDFIEELRSMINQLESTMTSQGKAIQSIEVQISQMATTMNAMQKEKFTNCPKRNPKKDCKVITRRSEPSRPKKEKEIEVERHIKEKVENVLVSSVSMSNSNLIPIILNNIPYRQYFRKKKFDQQFSKFLEIFKKLNINIPFAYALEKMPNYIKFMKDMLSKKKKFKKKIVSKVEGPKDFTILGTILNMTDILVKVDKFIFPADFIVLDMEDDSEVSIILGHPFFTIGRALINVQQRKLTLCVNEEEGRYYG
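
The relationship between the CDS and the protein sequence above can be checked by translating the CDS with the standard genetic code. The sketch below assumes the standard nuclear code (NCBI translation table 1) applies to this gene; `translate` is an illustrative helper, and only the first 45 and last 18 bases of the CDS are used to keep the example short.

```python
# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGGCTTCTCAATGTCTTAAGCTCGCCGAGATGGCGGAGGGAAGA"  # first 45 bases of the CDS
cds_end = "GGTAGATACTATGGATGA"                               # last 18 bases of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first 15 residues (MASQCLKLAEMAEGR) and the last 5 residues (GRYYG) of the protein sequence listed above, with the final TGA read as the stop codon.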
BLAST of ClCG03G010620 vs. TrEMBL
Match: U5CWW5_AMBTC (Uncharacterized protein OS=Amborella trichopoda GN=AMTR_s03373p00000010 PE=4 SV=1)

HSP 1 Score: 124.8 bits (312), Expect = 1.9e-25
Identity = 73/184 (39.67%), Positives = 105/184 (57.07%), Query Frame = 1

Query: 186 PYRQYFRKKKFDQQFSKFLEIFKKLNINIPFAYALEKMPNYIKFMKDMLSKKKKF----- 245
           P+ Q F+K++ D QF +FL++ K+L+INIP   ALE+MP Y+KF+KD+L+KK++      
Sbjct: 35  PFPQRFKKQQDDGQFRRFLDVLKQLHINIPLVEALEQMPTYVKFLKDILTKKRRLGEFET 94

Query: 246 -----------KKKIVSKVEGPKDFTI---------LG-------TIL------------ 305
                      K KI  K++ P  FTI         LG       T+             
Sbjct: 95  VALTEGCSAMLKSKIPPKLKDPGSFTIPISIGGRDKLGIGEARPTTVTLQLADRSMAHPE 154

Query: 306 -NMTDILVKVDKFIFPADFIVLDMEDDSEVSIILGHPFFTIGRALINVQQRKLTLCVNEE 325
             + D+LV+VDKFIFPADFI+LD E+D EV IILG PF   GR LI+V++ +LT+   +E
Sbjct: 155 GKIEDVLVQVDKFIFPADFIILDYEEDREVPIILGRPFLATGRTLIDVEKGELTMRAQDE 214
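
The percentages in an HSP header are simply the matched columns divided by the alignment length. A small sketch of how the summary line for this first hit could be parsed and re-derived follows; `hsp_fractions` is a hypothetical helper, and the regular expression is an assumption about the line layout shown here, not a general BLAST parser.

```python
import re

def hsp_fractions(line):
    """Return {label: (matches, length, percent)} for each 'Label = n/m' field."""
    stats = {}
    for label, n, m in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(n), int(m)
        stats[label] = (matches, length, round(100 * matches / length, 2))
    return stats

line = "Identity = 73/184 (39.67%), Positives = 105/184 (57.07%), Query Frame = 1"
stats = hsp_fractions(line)
for label, (matches, length, percent) in stats.items():
    print(f"{label}: {matches}/{length} = {percent}%")
```

For this hit, 73 identical columns out of 184 give 39.67%, and 105 conservative or identical columns give 57.07%, matching the reported values.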

BLAST of ClCG03G010620 vs. TrEMBL
Match: G0Y6U4_ARAHY (Retrotransposon gag protein OS=Arachis hypogaea GN=303L13_14 PE=4 SV=1)

HSP 1 Score: 107.5 bits (267), Expect = 3.2e-20
Identity = 96/326 (29.45%), Positives = 140/326 (42.94%), Query Frame = 1

Query: 79  IEELRSMINQLESTMTSQGKA-IQSIEVQISQMATTMNAMQKEKFTNCPKRNPKKDCKVI 138
           I  +   +   E+ +  + KA I+++EV + Q++  +       F      NP +DCK I
Sbjct: 502 ITVITEQVASTEAQVIQETKASIRNLEVLVGQLSKQILERSVSTFQEDTVVNPGEDCKAI 561

Query: 139 TRRS---------------EPSRPKKEKEIEVERHIKEKVENVLVSSVSMSNS-NLIPII 198
             RS               E   P ++KE EVE    ++ +N    S+ +  +    P  
Sbjct: 562 QLRSGKVADSETKVNEDVVEKEAPDEKKE-EVEHAPPKRADNPFPDSLDIYPTLPKAPEY 621

Query: 199 LNNIPYRQYFRKKKFDQQFSKFLEIFKKLNINIPFAYALEKMPNYIKFMKDMLSKKKKFK 258
              +PY Q  +K+   +QFSKFLEIF+KL INIPFA  LE+MP Y+KFMK++LSKKK+ K
Sbjct: 622 KPKMPYPQRLQKETKKKQFSKFLEIFRKLQINIPFAEVLEQMPIYVKFMKELLSKKKRLK 681

Query: 259 ----------------KKIVSKVEGPKDFTI---------------LGTILNMTDI---- 318
                             +  K+  P  F I               LG  +N+  +    
Sbjct: 682 GDETVVLTKECSAVIQNNLPRKMPDPGSFQIPCTIGSTTFEKSLCDLGASINLMPLSVMK 741

Query: 319 -------------LVKVDKFIFPADFIV------------------LDMEDDSEVSIILG 322
                        L   DK + PA  +V                  LD  +D   SIILG
Sbjct: 742 KLHIQEAQPTKIALQMADKSMKPAYGLVENILVKVGKFFLPADFVILDTGEDENASIILG 801

BLAST of ClCG03G010620 vs. TrEMBL
Match: G0Y6U4_ARAHY (Retrotransposon gag protein OS=Arachis hypogaea GN=303L13_14 PE=4 SV=1)

HSP 1 Score: 29.6 bits (65), Expect = 8.5e+03
Identity = 15/36 (41.67%), Positives = 22/36 (61.11%), Query Frame = 1

Query: 43  YSIFDRAKDWLESIAPESIATWDALEAVIEKLLGDF 79
           +++ D+AK WL+S   ES+ TW   E V+ K L  F
Sbjct: 204 FALRDKAKLWLDSQPKESLNTW---EKVVTKFLTKF 236


HSP 2 Score: 100.9 bits (250), Expect = 3.0e-18
Identity = 70/219 (31.96%), Positives = 112/219 (51.14%), Query Frame = 1

Query: 77  DFIEELRSMINQLESTMTSQGK----AIQSIEVQISQMATTMNAMQKEKFTNCPKRNPKK 136
           D + ++   + Q     T+  K    +I+++EVQI Q+A  +   Q + F+   + NPK+
Sbjct: 315 DRLSKMEDALTQFMQVSTTNQKNTEASIRNLEVQIGQLAKQLADQQSKNFSANTQVNPKE 374

Query: 137 DCKVITRRSEPSRPKKEKEI--EVERHIKEKVEN---------VLVSSVSMSNSNLIPII 196
            C  +   SE     KE E+  E E++ K K+E+         V   S       + P  
Sbjct: 375 HCYEVELESE-----KEVELNKEAEKNEKNKIESEKNESGDGDVKEESKKKGKEVVRPPP 434

Query: 197 LNNIPYRQYFRKKKFDQQFSKFLEIFKKLNINIPFAYALEKMPNYIKFMKDMLSKKKKF- 256
           + N+PY     KK  ++QF++FL+I K+L INIPFA ALE+MP+Y +FMK++L+KK+KF 
Sbjct: 435 VKNLPYPHAPSKKDKERQFARFLDIIKRLQINIPFAEALEQMPSYARFMKELLTKKRKFS 494

Query: 257 ---------------KKKIVSKVEGPKDFTILGTILNMT 265
                          +K +  K   P  FT+  TI N++
Sbjct: 495 EDGTVELEAGCSAIIQKSLPQKSRDPGSFTLPVTIGNVS 528

BLAST of ClCG03G010620 vs. TrEMBL
Match: A0A151T9N8_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_018335 PE=4 SV=1)

HSP 1 Score: 68.6 bits (166), Expect = 1.7e-08
Identity = 31/57 (54.39%), Positives = 42/57 (73.68%), Query Frame = 1

Query: 265 DILVKVDKFIFPADFIVLDMEDDSEVSIILGHPFFTIGRALINVQQRKLTLCVNEEE 322
           D+LVKVDKF FP DF+V+DME+DSEV +ILG PF    + +I+V   KL + V ++E
Sbjct: 579 DLLVKVDKFWFPVDFVVMDMEEDSEVPLILGRPFMKTAKVIIDVDDGKLKVRVQDDE 635


HSP 2 Score: 100.5 bits (249), Expect = 3.9e-18
Identity = 65/190 (34.21%), Positives = 105/190 (55.26%), Query Frame = 1

Query: 73  KLLGDFIEELRSMINQLESTMTSQGKAIQSIEVQISQMATTMNAMQKEKFTNCPKRNPKK 132
           KL G   + ++  I+  +ST  S    I+++E+Q+ Q+A  +       F+     NPK+
Sbjct: 406 KLEGTLNQFMKVSISNHKSTEAS----IKNLEIQVGQLAKQLAENSGRNFSANTHTNPKE 465

Query: 133 DCKVITRRS-------EPSRPKKEKEIEV-ERHIKEKVENVLVSSVSMSNSNLIPI---- 192
           +C  IT R        E    ++EKE E+ E   KE  E V V       +  + +    
Sbjct: 466 NCSAITTRGGKRVGVLEDEEEQQEKEAEMKEGDGKELAEEVAVKKSKSQLARELKMKSKV 525

Query: 193 -ILNNIPYRQYFRKKKFDQQFSKFLEIFKKLNINIPFAYALEKMPNYIKFMKDMLSKKKK 250
               ++PY Q   KK  ++QF++F+E+FKKL+INIPF+ +LE+MP Y KFMKD+L+K KK
Sbjct: 526 SSTKDMPYPQAPSKKDKEKQFARFMELFKKLHINIPFSESLEQMPTYAKFMKDLLTKNKK 585

BLAST of ClCG03G010620 vs. TrEMBL
Match: A0A151R7N2_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_040121 PE=4 SV=1)

HSP 1 Score: 63.9 bits (154), Expect = 4.1e-07
Identity = 33/63 (52.38%), Positives = 43/63 (68.25%), Query Frame = 1

Query: 258 GTILNMTDILVKVDKFIFPADFIVLDMEDDSEVSIILGHPFFTIGRALINVQQRKLTLCV 317
           G I NM   LVKVDKF F ADF++LDME+DS++ IILG PF    RA+I++   +  L V
Sbjct: 669 GVIENM---LVKVDKFTFLADFVILDMEEDSDIPIILGRPFMKTARAIIDIGDGEFKLRV 728

Query: 318 NEE 321
            +E
Sbjct: 729 QDE 728


HSP 2 Score: 100.1 bits (248), Expect = 5.1e-18
Identity = 70/224 (31.25%), Positives = 113/224 (50.45%), Query Frame = 1

Query: 79  IEELRSMINQLESTMTSQGKA-IQSIEVQISQMATTMNAMQKEKFTNCPKRNPKKDCK-V 138
           +EE  +   Q+ +T     +A I+++EVQ+ Q+A  +   Q + F+   + NPK+ C+ +
Sbjct: 60  MEEALTQFMQVSTTNQKNTEASIRNLEVQVGQLAKQLADQQNKNFSANTQVNPKEQCQSI 119

Query: 139 ITRRSEPSRPKK-------EKEIEVERHIKEKVENVLVSSVSMSNSN------------- 198
            TRR      KK       EKE+E+ +  ++K  N   S  +                  
Sbjct: 120 TTRRGTVIEEKKSVVELESEKEVELNKEEEKKENNKKESEKNERRDGDVREERKKKGKEV 179

Query: 199 LIPIILNNIPYRQYFRKKKFDQQFSKFLEIFKKLNINIPFAYALEKMPNYIKFMKDMLSK 258
           + P  + N+PY     KK  ++QF++FL+I K+L INIPFA ALE+MP+Y +FMKD+L+K
Sbjct: 180 IRPPPVKNLPYPHAPSKKDKERQFARFLDIIKRLQINIPFAEALEQMPSYARFMKDLLTK 239

Query: 259 KKK----------------FKKKIVSKVEGPKDFTILGTILNMT 265
           K+K                 +K +  K   P  FT+  TI N++
Sbjct: 240 KRKLSEDGTVELEAGCSAIIQKSLPQKSRDPGSFTLPVTIGNVS 283

BLAST of ClCG03G010620 vs. NCBI nr
Match: gi|1021498264|ref|XP_016192294.1| (PREDICTED: uncharacterized protein LOC107633171 [Arachis ipaensis])

HSP 1 Score: 150.2 bits (378), Expect = 6.2e-33
Identity = 91/262 (34.73%), Positives = 147/262 (56.11%), Query Frame = 1

Query: 74  LLGDFIEELRSMIN---QLESTMTSQGKAIQSIEVQISQMATTMNAMQKEKFTNCPKRNP 133
           +LG+  +E + M +   ++ S M +Q  AI+ +EVQI  ++  + +       N  K N 
Sbjct: 1   MLGELCKESKDMQDFKEEVRSNMQNQDAAIKKLEVQIGYLSKQVPSHNPY---NATKTNS 60

Query: 134 KKDCKVITRRSEPS---RPKKEKEIEVERHIKEKVENVLVSSVSMSNSNLIPIILNNIPY 193
           +++CK IT RS        +K +  EV+  + +K E    +S S     ++   +    Y
Sbjct: 61  REECKAITLRSRKELKETSRKTQGREVDESLSDKEEAQTPASNSPEEKEVLRPYVPKASY 120

Query: 194 RQYFRKKKFDQQFSKFLEIFKKLNINIPFAYALEKMPNYIKFMKDMLSKKKKFKK----- 253
            Q  +K + D QFS+FLE+F +L INIPFA  LE+MP Y KF+K++++KK+ ++      
Sbjct: 121 PQRLKKNEKDNQFSRFLEVFNRLQINIPFAKVLEQMPLYAKFLKELMTKKRSWRNDKTVV 180

Query: 254 -----------KIVSKVEGPKDFTILGTILNMTDILVKVDKFIFPADFIVLDMEDDSEVS 313
                      K+  K++ P+ F IL     + D+LVKV  FIFPADF+VLDM+++++ S
Sbjct: 181 LTKECSAIIQHKLPQKLKDPRSFQIL-----LEDLLVKVGDFIFPADFVVLDMKEETKAS 240

BLAST of ClCG03G010620 vs. NCBI nr
Match: gi|848881274|ref|XP_012841003.1| (PREDICTED: uncharacterized protein LOC105961316 [Erythranthe guttata])

HSP 1 Score: 144.4 bits (363), Expect = 3.4e-31
Identity = 98/272 (36.03%), Positives = 142/272 (52.21%), Query Frame = 1

Query: 100 IQSIEVQISQMATTMNAMQKEKFTNCPKRNPKKDCKVITRRS-----EPSRPKKE----- 159
           ++++E QI Q+A +M+ M K  F +  + NPK+ C+ IT RS     +P  P  E     
Sbjct: 8   MKNMEKQIGQIAQSMSTMAKGGFPSNTEVNPKESCQAITTRSGLQMTDPPYPTDESPRPP 67

Query: 160 -KEIEVERHIKEKVENVLVSSV--SMSNSNLIPIILNNIPYRQYFRKKKFDQQFSKFLEI 219
            +   VE  I   + N   +S   ++S  +  P+++  IP+ +  +KKKF  Q  KF+E 
Sbjct: 68  VQPTPVEPEITISMSNTKEASKPNNISFPDNPPLMITPIPFPERQKKKKFKDQLKKFIEK 127

Query: 220 FKKLNINIPFAYALEKMPNYIKFMKDMLSKKKKFKKKI----------------VSKVEG 279
            K++ INIPFA ALE MPNY KFMK++LSKK + ++ I                  K++ 
Sbjct: 128 IKQIRINIPFAEALEVMPNYTKFMKEVLSKKIRIEEDIPVTLTATCSAILQSNLPPKMKD 187

Query: 280 PKDFTILGTILNMT----------------------DILVKVDKFIFPADFIVLDMEDDS 321
           P  +TI   I N T                      D+LVKVDKFI P DF+VL+M +D 
Sbjct: 188 PGSYTIPCIIGNSTFDKALCDLGADRSLKYPDGIVEDVLVKVDKFILPVDFVVLEMPEDD 247

BLAST of ClCG03G010620 vs. NCBI nr
Match: gi|672195269|ref|XP_008776509.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103696607 [Phoenix dactylifera])

HSP 1 Score: 142.9 bits (359), Expect = 9.9e-31
Identity = 99/294 (33.67%), Positives = 154/294 (52.38%), Query Frame = 1

Query: 58  PESIATWDALEAVIEKLLG---DFIEELRSMINQLESTMTSQGKAIQSIEVQISQMATTM 117
           PES  +W   E  IEKL     +  E L + ++QL S+        +++EVQ+ Q+A  +
Sbjct: 339 PESKQSW---EIAIEKLANASSERFERLEAKVDQLASSN-------RNVEVQLGQLANFI 398

Query: 118 NAMQKEKFTNCPKRNPKKDCKVITRRSEPSRPKKEKEI---------EVERHIKEKVENV 177
           N+  +    +  + NPK+ CK +T RS     +   E          EV + + E+VE++
Sbjct: 399 NSRGQGNLPSKTEVNPKEHCKAVTLRSGKQLGQVSGETIVGDEVDYEEVSKKVSEEVEDL 458

Query: 178 LVSSVSMSNSNLIPI--ILNNIPYRQYFRKKKFDQQFSKFLEIFKKLNINIPFAYALEKM 237
                + + S L P+   +  IP+ Q  +K K DQQF KFL++F++L+INIPFA AL ++
Sbjct: 459 -----AKTTSPLPPVEPYVPPIPFPQRLKKNKIDQQFEKFLKVFRQLHINIPFADALAQI 518

Query: 238 PNYIKFMKDMLSKKKK----------------FKKKIVSKVEGPKDFTILGTILNMTD-- 297
           P Y KF+K+++SKK+K                 + K+  K  G K+       L + D  
Sbjct: 519 PAYTKFLKEIMSKKRKLEDFETIALTEECSAIIQNKLPPKKLGLKELKPTTISLQLADRS 578

Query: 298 ----------ILVKVDKFIFPADFIVLDMEDDSEVSIILGHPFFTIGRALINVQ 310
                     +L+KV KFI P DFIVL+ME+D+E+ IILG PF     A+I+V+
Sbjct: 579 VKYSLGVLENVLIKVKKFIIPVDFIVLEMEEDTEIPIILGRPFLATAGAIIDVK 617

BLAST of ClCG03G010620 vs. NCBI nr
Match: gi|672195269|ref|XP_008776509.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103696607 [Phoenix dactylifera])

HSP 1 Score: 34.3 bits (77), Expect = 5.0e+02
Identity = 14/25 (56.00%), Positives = 18/25 (72.00%), Query Frame = 1

Query: 43  YSIFDRAKDWLESIAPESIATWDAL 68
           +S+ D+AK WL S AP S  TW+AL
Sbjct: 92  FSLKDKAKAWLNSKAPNSFTTWNAL 116


HSP 2 Score: 142.5 bits (358), Expect = 1.3e-30
Identity = 91/267 (34.08%), Positives = 148/267 (55.43%), Query Frame = 1

Query: 67  LEAVIEKLLGDFIEELRSMINQLESTMTSQGKAIQSIEVQISQMATTMNAMQKEKFTNCP 126
           +EA++  L  +F ++ +    ++ S + +Q  A Q +E QI  ++         K  N  
Sbjct: 4   MEAMLSNLCKEF-KDTKKFHEEVTSNLQNQDAATQKLEAQIGYLSKQAPG---HKLGNAT 63

Query: 127 KRNPKKDCKVITRR-----SEPSRPKKEKEIEVERHIKEKVENVLVSSVSMSNSNLIPII 186
           +   ++ CK IT R      E SR  +E++ E   + KE+ +    +        ++   
Sbjct: 64  RTTSREKCKAITLRRGKELKETSRETQEEKAEGSPNAKEEAQAPTPNP--SKEKEVLRSY 123

Query: 187 LNNIPYRQYFRKKKFDQQFSKFLEIFKKLNINIPFAYALEKMPNYIKFMKDMLSKKKKFK 246
           +   PY Q+  K   + QFSKFLEIFKKL INI FA ALE+MP Y KF+K++++KK+ ++
Sbjct: 124 VPKAPYPQHLMKNAKNNQFSKFLEIFKKLQINISFAEALEQMPLYTKFLKELMTKKRSWR 183

Query: 247 --KKIVSKVEGPK-DFTILGTILN-----MTDILVKVDKFIFPADFIVLDMEDDSEVSII 306
             + ++ + +  +    + G         + D+LVKV  FIFPADF+VLDME++ + SII
Sbjct: 184 NDETVIEEAKPTRMALQLAGRSFKYPHGIVEDLLVKVGDFIFPADFVVLDMEEEVKASII 243

Query: 307 LGHPFFTIGRALINVQQRKLTLCVNEE 321
           LG PF     A+I+VQ+ +LTL ++EE
Sbjct: 244 LGRPFLATAGAIIDVQKGELTLRLHEE 264

BLAST of ClCG03G010620 vs. NCBI nr
Match: gi|720089288|ref|XP_010244714.1| (PREDICTED: uncharacterized protein LOC104588472 [Nelumbo nucifera])

HSP 1 Score: 140.6 bits (353), Expect = 4.9e-30
Identity = 86/250 (34.40%), Positives = 137/250 (54.80%), Query Frame = 1

Query: 80  EELRSMINQLESTMTSQGKAIQSIEVQISQMATTMNAMQKEKFTNCPKRNPKKDCKVITR 139
           E +   IN  E+   S   +I++++VQ+ Q+ +T++  ++ +  N  ++NP++    I  
Sbjct: 18  EMISKFINTAEAKFQSHETSIKNLKVQVGQIISTLSERKEGRLPNNTEKNPREHVNAIAL 77

Query: 140 RSEPS---RPKKEKEIEVERHIKEKVENVLVSSVSMSNSNLIPIILNNIPYRQYFRKKKF 199
           RS  +     + +KE E+    KE          S    NL  I L  IPY +   + K 
Sbjct: 78  RSGKTVGEAQEDDKEAELADLQKENER-------STPKLNLDEIKLP-IPYTRRVFRDKL 137

Query: 200 DQQFSKFLEIFKKLNINIPFAYALEKMPNYIKFMKDMLSKKKKFKKKIVSKVEGPKDFTI 259
           D+QF KFLE+FKK++IN+P    L +MP Y KF+K+++S K+K++   +          +
Sbjct: 138 DKQFEKFLEVFKKIHINLPLLDVLSQMPKYAKFLKEVMSNKRKWEDCEMELTSTTITLQL 197

Query: 260 LGTILN-----MTDILVKVDKFIFPADFIVLDMEDDSEVSIILGHPFFTIGRALINVQQR 319
               +      + D+LVKV  FI P DFI+LDME+D  + +ILG PF   G ALI+VQ+ 
Sbjct: 198 ANRSIKYPRGIVEDVLVKVGNFIIPTDFIMLDMEEDRSMPLILGRPFLATGNALIDVQKG 257

Query: 320 KLTLCVNEEE 322
           +LTL +N EE
Sbjct: 258 QLTLRINGEE 259

The following BLAST results are available for this feature:
Match Name        E-value  Identity (%)  Description
U5CWW5_AMBTC      1.9e-25  39.67         Uncharacterized protein OS=Amborella trichopoda GN=AMTR_s03373p00000010 PE=4 SV=...
G0Y6U4_ARAHY      3.2e-20  29.45         Retrotransposon gag protein OS=Arachis hypogaea GN=303L13_14 PE=4 SV=1
G0Y6U4_ARAHY      8.5e+03  41.67         Retrotransposon gag protein OS=Arachis hypogaea GN=303L13_14 PE=4 SV=1
A0A151T9N8_CAJCA  1.7e-08  54.39         Uncharacterized protein OS=Cajanus cajan GN=KK1_018335 PE=4 SV=1
A0A151R7N2_CAJCA  4.1e-07  52.38         Uncharacterized protein OS=Cajanus cajan GN=KK1_040121 PE=4 SV=1
Match Name                         E-value  Identity (%)  Description
gi|1021498264|ref|XP_016192294.1|  6.2e-33  34.73         PREDICTED: uncharacterized protein LOC107633171 [Arachis ipaensis]
gi|848881274|ref|XP_012841003.1|   3.4e-31  36.03         PREDICTED: uncharacterized protein LOC105961316 [Erythranthe guttata]
gi|672195269|ref|XP_008776509.1|   9.9e-31  33.67         PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103696607 [Phoenix da...
gi|672195269|ref|XP_008776509.1|   5.0e+02  56.00         PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103696607 [Phoenix da...
gi|720089288|ref|XP_010244714.1|   4.9e-30  34.40         PREDICTED: uncharacterized protein LOC104588472 [Nelumbo nucifera]
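
Two of the summary rows carry very large E-values (8.5e+03 and 5.0e+02) and are unlikely to be meaningful on their own. The sketch below filters and ranks the ten summary rows by E-value; the cutoff of 1e-5 is an illustrative assumption rather than a threshold used by the database, and `HITS` simply restates the match names and E-values from the two tables above.

```python
# (match name, E-value) pairs copied from the TrEMBL and NCBI nr summaries.
HITS = [
    ("U5CWW5_AMBTC", "1.9e-25"),
    ("G0Y6U4_ARAHY", "3.2e-20"),
    ("G0Y6U4_ARAHY", "8.5e+03"),
    ("A0A151T9N8_CAJCA", "1.7e-08"),
    ("A0A151R7N2_CAJCA", "4.1e-07"),
    ("gi|1021498264|ref|XP_016192294.1|", "6.2e-33"),
    ("gi|848881274|ref|XP_012841003.1|", "3.4e-31"),
    ("gi|672195269|ref|XP_008776509.1|", "9.9e-31"),
    ("gi|672195269|ref|XP_008776509.1|", "5.0e+02"),
    ("gi|720089288|ref|XP_010244714.1|", "4.9e-30"),
]

E_VALUE_CUTOFF = 1e-5  # assumed cutoff, chosen only for illustration

# Keep rows at or below the cutoff and rank them from best to worst.
significant = sorted(
    ((name, float(evalue)) for name, evalue in HITS if float(evalue) <= E_VALUE_CUTOFF),
    key=lambda hit: hit[1],
)
for name, evalue in significant:
    print(f"{evalue:.1e}  {name}")
```

With this cutoff, eight rows remain and the Arachis ipaensis entry (E-value 6.2e-33) ranks first.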
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG03G010620.1  ClCG03G010620.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term  IPR Description   Source   Source Term  Source Description  Alignment
None      No IPR available  unknown  Coil         Coil                coord: 100..120
scor

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None