Csa3G124790 (gene) Cucumber (Chinese Long) v2

Name: Csa3G124790
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Unknown protein
Location: Chr3 : 7468448 .. 7470213 (+)
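
As a rough illustration of how the location field can be read programmatically, the sketch below parses the coordinate string and computes the span of the gene. The helper names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Parse a 'ChrN : start .. end (strand)' string into its parts."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr3 : 7468448 .. 7470213 (+)")
print(chrom, strand, span_length(start, end))  # Chr3 + 1766
```

Under that assumption the feature spans 1766 bases on the plus strand of Chr3.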
The following sequences are available for this feature:

Gene sequence (with intron)

CTCAACATTGCTTTCTTCACTCAGCCATGGCTGAAACTCACTGACACTCTTCACAATAACCCTTAATTAATTCTTCATAAACTTTCAACTCCATCTTCGAAATTGCCTCTCACTTTCCCTCTTCCATCCATTTATAAACATCCCCATCACTCCCACTCAATTCACTCACTTCAATTCATTTCTCCTCTGTTCTTCTTCAAGCTGTTTCAATGGCGGCGCCCACTCTCTCATTTCCCCCTTTTCCTCTTAATCGAGAAGCACCCACCAGAATGCTCAAGGATTTTCTTCACGAGACCAATCCAAATGGACTCGCATCTCCTAAACCAAAACCCACGTCGTTTAAAGCTCTCGCTTTCCACGCCGTCGTCGCCGCCGTTAAGAGGATCTCACTTCCGTCTGTGAAATCGCCCAGGATCTTCCCCAGAAGCCTCTCGCGGAGGCTCTTGAAGAAAACAGAGAGAGATGAGAGAGAAACCGGAGGCGATTTTGTTGTTAAGATTAAGGACATTATTCGATGGAAATCGTTTAGAGACTTGATCGACGAGACGACGGCGGCGGCTCCACCGCTTGATTTCGCCGAATCGCCGGATCGTTACACTTACACGGCAGCCGCGACGACTACGACTACGACGACGACGACGACGACGACGAGTAGTAGTAAGAGTTCGAGCTGGTGTGAGAGCGATTTCACGGCGGAGGATTTGGCGTCGCCGTCGTGGAGAGATTGGTCCGACGACGGTACAATGGGGAAAATGTACTTCCCATGTGTCGGTGAAGATTCGAACGAAACGACAGCCGCATACGCACAAAACGACGAAGAGGTGAGCAATAATTCAAAGTACTTCACACCTATAATGGAATTTGTCATCTTCTATATTTGGTCAATCTTTTCCTACCCTACGATCCGTTTTATTTTCGTTGAGATTTAGAACGAGCGATTGTGAACCGTTAGATTAAATCAATTTCTCTTATGACAATACTAGATTCGCCCGTGTAATAAGAAACGACACCGTATGTTTTAAAAAAAATTAGGGTTTAATTAAAATAATAATAATAATAATAATAAAGTAGATTTTAAACTAAACTTAGTAGTATATTTGACTTGTTTTCTAGATTTTTATCTTTTGATATCATCATCATTATTTTTTACATGTTATTGTTCATATTTTTCTATCTAAATCCTTAAATTTTGTATTATTTATTATCCGTATTAAGTGAAATAAAAAAAATAATTGAAACTATTTACTAAATATAACAAAATATATTGAAATCTTTGCGGCTCATTTAAGTTTTTTACTTGTTTTACAGGTAAATGCATTATTGATACGAGAGGATAATGAAGAACAAGAGGTACTAGATGAAAGTACACGACGACTTTTAGAGCAAGTTAAAGGAGCAATTTCATTATCAAAAAGTTGTAGATTAGTAGAGCGTTGTGGGTTGGATTGGTTGATTCGAGAATTGTTTCGACGTGAACTTGCGGATGTTCAGGACGTTGACGAACGAGTGAGAAACGATGATCGAAGAATTAGGGTGAAGAATGGCAAAGATGATGAATATGTGTGTGATTGGTTTTTGTCTCACAAAGGGAAGGAAAGTTATGTGAGAGAAATGGAGAGGGAAGGGAAATGGGAGATTTTTGGTGTTGATGAGAAAATTGAATTAGGGTTAGAAATTGAAGGAGAGATTTTGGGATGTTTGGTTGATGAAATTCTACTTGACATTTTTTCATTATGAATAGAAAATATTTCTTTCGTTTTTTGTTA

mRNA sequence

ATGGCGGCGCCCACTCTCTCATTTCCCCCTTTTCCTCTTAATCGAGAAGCACCCACCAGAATGCTCAAGGATTTTCTTCACGAGACCAATCCAAATGGACTCGCATCTCCTAAACCAAAACCCACGTCGTTTAAAGCTCTCGCTTTCCACGCCGTCGTCGCCGCCGTTAAGAGGATCTCACTTCCGTCTGTGAAATCGCCCAGGATCTTCCCCAGAAGCCTCTCGCGGAGGCTCTTGAAGAAAACAGAGAGAGATGAGAGAGAAACCGGAGGCGATTTTGTTGTTAAGATTAAGGACATTATTCGATGGAAATCGTTTAGAGACTTGATCGACGAGACGACGGCGGCGGCTCCACCGCTTGATTTCGCCGAATCGCCGGATCGTTACACTTACACGGCAGCCGCGACGACTACGACTACGACGACGACGACGACGACGACGAGTAGTAGTAAGAGTTCGAGCTGGTGTGAGAGCGATTTCACGGCGGAGGATTTGGCGTCGCCGTCGTGGAGAGATTGGTCCGACGACGGTACAATGGGGAAAATGTACTTCCCATGTGTCGGTGAAGATTCGAACGAAACGACAGCCGCATACGCACAAAACGACGAAGAGGTAAATGCATTATTGATACGAGAGGATAATGAAGAACAAGAGGTACTAGATGAAAGTACACGACGACTTTTAGAGCAAGTTAAAGGAGCAATTTCATTATCAAAAAGTTGTAGATTAGTAGAGCGTTGTGGGTTGGATTGGTTGATTCGAGAATTGTTTCGACGTGAACTTGCGGATGTTCAGGACGTTGACGAACGAGTGAGAAACGATGATCGAAGAATTAGGGTGAAGAATGGCAAAGATGATGAATATGTGTGTGATTGGTTTTTGTCTCACAAAGGGAAGGAAAGTTATGTGAGAGAAATGGAGAGGGAAGGGAAATGGGAGATTTTTGGTGTTGATGAGAAAATTGAATTAGGGTTAGAAATTGAAGGAGAGATTTTGGGATGTTTGGTTGATGAAATTCTACTTGACATTTTTTCATTATGA

Coding sequence (CDS)

ATGGCGGCGCCCACTCTCTCATTTCCCCCTTTTCCTCTTAATCGAGAAGCACCCACCAGAATGCTCAAGGATTTTCTTCACGAGACCAATCCAAATGGACTCGCATCTCCTAAACCAAAACCCACGTCGTTTAAAGCTCTCGCTTTCCACGCCGTCGTCGCCGCCGTTAAGAGGATCTCACTTCCGTCTGTGAAATCGCCCAGGATCTTCCCCAGAAGCCTCTCGCGGAGGCTCTTGAAGAAAACAGAGAGAGATGAGAGAGAAACCGGAGGCGATTTTGTTGTTAAGATTAAGGACATTATTCGATGGAAATCGTTTAGAGACTTGATCGACGAGACGACGGCGGCGGCTCCACCGCTTGATTTCGCCGAATCGCCGGATCGTTACACTTACACGGCAGCCGCGACGACTACGACTACGACGACGACGACGACGACGACGAGTAGTAGTAAGAGTTCGAGCTGGTGTGAGAGCGATTTCACGGCGGAGGATTTGGCGTCGCCGTCGTGGAGAGATTGGTCCGACGACGGTACAATGGGGAAAATGTACTTCCCATGTGTCGGTGAAGATTCGAACGAAACGACAGCCGCATACGCACAAAACGACGAAGAGGTAAATGCATTATTGATACGAGAGGATAATGAAGAACAAGAGGTACTAGATGAAAGTACACGACGACTTTTAGAGCAAGTTAAAGGAGCAATTTCATTATCAAAAAGTTGTAGATTAGTAGAGCGTTGTGGGTTGGATTGGTTGATTCGAGAATTGTTTCGACGTGAACTTGCGGATGTTCAGGACGTTGACGAACGAGTGAGAAACGATGATCGAAGAATTAGGGTGAAGAATGGCAAAGATGATGAATATGTGTGTGATTGGTTTTTGTCTCACAAAGGGAAGGAAAGTTATGTGAGAGAAATGGAGAGGGAAGGGAAATGGGAGATTTTTGGTGTTGATGAGAAAATTGAATTAGGGTTAGAAATTGAAGGAGAGATTTTGGGATGTTTGGTTGATGAAATTCTACTTGACATTTTTTCATTATGA

Protein sequence

MAAPTLSFPPFPLNREAPTRMLKDFLHETNPNGLASPKPKPTSFKALAFHAVVAAVKRISLPSVKSPRIFPRSLSRRLLKKTERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAAPPLDFAESPDRYTYTAAATTTTTTTTTTTTSSSKSSSWCESDFTAEDLASPSWRDWSDDGTMGKMYFPCVGEDSNETTAAYAQNDEEVNALLIREDNEEQEVLDESTRRLLEQVKGAISLSKSCRLVERCGLDWLIRELFRRELADVQDVDERVRNDDRRIRVKNGKDDEYVCDWFLSHKGKESYVREMEREGKWEIFGVDEKIELGLEIEGEILGCLVDEILLDIFSL*
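
The protein sequence above should correspond to a codon-by-codon translation of the CDS. The following is a minimal sketch of that check, assuming the standard genetic code; the `translate` helper is hypothetical and is not the tool used to build this record. It handles only the unambiguous bases A, C, G and T.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First nine codons of the CDS listed above.
print(translate("ATGGCGGCGCCCACTCTCTCATTTCCC"))  # MAAPTLSFP
```

Passing the full CDS string to `translate` and comparing the result with the listed protein (including the trailing `*`) would confirm that the two agree.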
BLAST of Csa3G124790 vs. TrEMBL
Match: A0A0A0L6C4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G124790 PE=4 SV=1)

HSP 1 Score: 698.0 bits (1800), Expect = 5.9e-198
Identity = 346/346 (100.00%), Positives = 346/346 (100.00%), Query Frame = 1

Query: 1   MAAPTLSFPPFPLNREAPTRMLKDFLHETNPNGLASPKPKPTSFKALAFHAVVAAVKRIS 60
           MAAPTLSFPPFPLNREAPTRMLKDFLHETNPNGLASPKPKPTSFKALAFHAVVAAVKRIS
Sbjct: 1   MAAPTLSFPPFPLNREAPTRMLKDFLHETNPNGLASPKPKPTSFKALAFHAVVAAVKRIS 60

Query: 61  LPSVKSPRIFPRSLSRRLLKKTERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAAPPL 120
           LPSVKSPRIFPRSLSRRLLKKTERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAAPPL
Sbjct: 61  LPSVKSPRIFPRSLSRRLLKKTERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAAPPL 120

Query: 121 DFAESPDRYTYTAAATTTTTTTTTTTTSSSKSSSWCESDFTAEDLASPSWRDWSDDGTMG 180
           DFAESPDRYTYTAAATTTTTTTTTTTTSSSKSSSWCESDFTAEDLASPSWRDWSDDGTMG
Sbjct: 121 DFAESPDRYTYTAAATTTTTTTTTTTTSSSKSSSWCESDFTAEDLASPSWRDWSDDGTMG 180

Query: 181 KMYFPCVGEDSNETTAAYAQNDEEVNALLIREDNEEQEVLDESTRRLLEQVKGAISLSKS 240
           KMYFPCVGEDSNETTAAYAQNDEEVNALLIREDNEEQEVLDESTRRLLEQVKGAISLSKS
Sbjct: 181 KMYFPCVGEDSNETTAAYAQNDEEVNALLIREDNEEQEVLDESTRRLLEQVKGAISLSKS 240

Query: 241 CRLVERCGLDWLIRELFRRELADVQDVDERVRNDDRRIRVKNGKDDEYVCDWFLSHKGKE 300
           CRLVERCGLDWLIRELFRRELADVQDVDERVRNDDRRIRVKNGKDDEYVCDWFLSHKGKE
Sbjct: 241 CRLVERCGLDWLIRELFRRELADVQDVDERVRNDDRRIRVKNGKDDEYVCDWFLSHKGKE 300

Query: 301 SYVREMEREGKWEIFGVDEKIELGLEIEGEILGCLVDEILLDIFSL 347
           SYVREMEREGKWEIFGVDEKIELGLEIEGEILGCLVDEILLDIFSL
Sbjct: 301 SYVREMEREGKWEIFGVDEKIELGLEIEGEILGCLVDEILLDIFSL 346
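
The percentages in each HSP header follow from the counts shown next to them. The sketch below parses one such statistics line and recomputes both values; `parse_hsp_stats` is a hypothetical helper, and treating the denominator as the alignment length including gaps is an assumption based on usual BLAST output. The pattern accepts either spelling of "Positives" that may appear in these headers.

```python
import re

# Matches "Identity = a/b (p%), Positives = c/d (q%)"; also tolerates "Postives".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\)"
)

def parse_hsp_stats(line):
    """Recompute identity and positives percentages from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return {
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
        # Assumption: the denominator is the aligned length, gaps included.
        "alignment_length": ident_len,
    }

stats = parse_hsp_stats(
    "Identity = 124/381 (32.55%), Positives = 178/381 (46.72%), Query Frame = 1"
)
print(stats)
```

For the Gossypium raimondii hit, 124/381 and 178/381 reproduce the reported 32.55% and 46.72%.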

BLAST of Csa3G124790 vs. TrEMBL
Match: A0A0D2QYB6_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_004G156200 PE=4 SV=1)

HSP 1 Score: 159.5 bits (402), Expect = 7.6e-36
Identity = 124/381 (32.55%), Positives = 178/381 (46.72%), Query Frame = 1

Query: 16  EAPTRMLKDFLHET-------------------------NPNG-LASPKPKPTSFKALAF 75
           E   RMLKDF+H+                          NPN  L   + K  S    AF
Sbjct: 4   ERRPRMLKDFIHDDPNSCSSNGFKSFPRKSTQNSIIFRENPNQKLQRSRSKAASATISAF 63

Query: 76  HAVVAAVKRISLPSVKSPRIFPRSLSRRLLKKTERDERETGGDFVVKIKDIIRWKSFRDL 135
            A++  +K I   S     + PR+LSR+  K+     +E      V +KDIIRWKSFRDL
Sbjct: 64  QAMINVIKSIHFASSSPSILLPRTLSRKPSKRKISQNKEAEIKMTVTVKDIIRWKSFRDL 123

Query: 136 IDETTAAAPPLDFAESPDRYTYTAAATTTTTTTTT--TTTSSSKSSSWCESDFTAEDLAS 195
           ++E    + PLDFA S     +    TTTTT + T  + T+SS  SSWC+SDFT+E L S
Sbjct: 124 LEE---KSQPLDFAPSSASPHHHCTTTTTTTGSNTPCSCTTSSNGSSWCDSDFTSEYLPS 183

Query: 196 PSWRDWSDDGTMGKMYFPCVGEDSNETTAAYAQN----------DEEVNALLIR------ 255
             + +   D  +GK + PCVG+D+ ETT   A N          +EE     +       
Sbjct: 184 DEYGENEVDNMVGKKFSPCVGKDTMETTTRTAANTDMGPKHASVEEEPQHSPLSVLDFEY 243

Query: 256 ----EDNEEQEVLDESTRRLLEQVKGAISLSKSCRLVERCGLDWLIRELFRRELADVQDV 315
               ED EE   ++E    LL  VK    L++         +D L+ +LFR E+    D 
Sbjct: 244 GGDDEDGEEANEIEEKAWELLNGVKETSPLTRYKN--NNICIDKLLLDLFREEMETKWDQ 303

Query: 316 DERVRNDDRR-IRVKNGKDDEYVCDWFLSHKG----KESYVREMEREGKWEIFGVDEKIE 344
              +   +R  +RV       ++C+     +G    +E  V +MEREGKW     +E+ E
Sbjct: 304 TRNIEELEREMVRVAKA----WICEEQNEKRGVGDKREECVGDMEREGKWRDRFHEEQEE 363

BLAST of Csa3G124790 vs. TrEMBL
Match: M5XHR6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa026717mg PE=4 SV=1)

HSP 1 Score: 140.2 bits (352), Expect = 4.8e-30
Identity = 122/386 (31.61%), Positives = 181/386 (46.89%), Query Frame = 1

Query: 12  PLNREAPTRMLKDFLHE-TNPNGLASP---------KPKPTSFKALAFHAVVAAVKRISL 71
           P N  +P + L +F  + T+ N   +P         + K  S    AF +++ AVK I  
Sbjct: 45  PFNPRSPQKNLIEFDPKPTSSNSKNNPTITSKLLRSRSKAASTTISAFQSLMNAVKNIPF 104

Query: 72  PSVKSPRIFPRSLSRRLLKK----TERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAA 131
            +VKSP   PRSLSRRLL+     + R + +      V++KDIIRW SFRD   E     
Sbjct: 105 TTVKSPSFLPRSLSRRLLQSKFQSSSRKQSQNQVQITVRVKDIIRWTSFRD---EMLPPP 164

Query: 132 PPLDFAESPDRYTYTAAATTTTTTTTTTTTSSSKSSSWCESDFTAEDLAS--PSWRDWSD 191
           PPLD+A SP   T +   T +TTTT TT +SSS  SSWC+SDFTAE L S   S  D   
Sbjct: 165 PPLDYASSPLHCTTSTTITGSTTTTCTTCSSSSNGSSWCDSDFTAEFLPSLVGSNSDPHA 224

Query: 192 DGTMGKMYFPCVGEDSNE----TTAAYAQN---DEEVNALLIREDNEEQEVL-------- 251
           D  +GK Y PCVG+D  E    TT   + N     +V  LL  ED +   V         
Sbjct: 225 DEEVGKKYLPCVGKDFMEEEASTTGTGSCNIALGPQVEILLGDEDEQHSPVSVLDCQFGE 284

Query: 252 --DESTRRLLEQVKGAISLSKSCRLVERCGLDWLIRELFRRELADVQDVDERVRNDDRRI 311
             D+S     +Q    +        +E   L+++       +  +   +++++  D  R 
Sbjct: 285 DEDDSFTSTFDQSLANVGDEDEEMAMEL--LNYVKATSSSPDSCEEGHLEDKLLLDFFRE 344

Query: 312 RV---KNGKDDEYVCDWFLSHKG----------------KESYVREMEREGKWEIFGVDE 346
            +   +N  DD +  +     K                 KE+ VR+M + G+W  F  ++
Sbjct: 345 EMSVQRNQTDDGFQWEMVSKAKAWVSGEHNELEWGLEHKKEACVRDMHKGGRWNKFEYEQ 404

BLAST of Csa3G124790 vs. TrEMBL
Match: A0A061E1G2_THECC (Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_005434 PE=4 SV=1)

HSP 1 Score: 140.2 bits (352), Expect = 4.8e-30
Identity = 82/168 (48.81%), Positives = 102/168 (60.71%), Query Frame = 1

Query: 34  LASPKPKPTSFKALAFHAVVAAVKRISLPSVKSPRIFPRSLSRRLLKKTERDERETGGDF 93
           L   + K  S     F A++ AV+ I   SVKSP I PRSLSR+L KK  + E ET    
Sbjct: 70  LQRSRSKAASTTISTFQAMIKAVRNIHFTSVKSPSILPRSLSRKLSKKNSQKETETRT-- 129

Query: 94  VVKIKDIIRWKSFRDLIDETTAAAPPLDFAESPDRYTYTAAATTTTTTTTTTTTSSSKSS 153
            V++KDIIRWKS RDL++E     PP DFA SP   T T + TTTTTT + +T  SS SS
Sbjct: 130 TVRVKDIIRWKSSRDLVEEKF---PPADFASSPHHCT-TRSTTTTTTTGSKSTPCSSNSS 189

Query: 154 SWCESDFTAEDLASPSWRDWSDDGTMGKMYFPCVGEDSNETTAAYAQN 202
           SWC+SDFT+E L S  + +   D  +GK + PCVG+D  ETT   A N
Sbjct: 190 SWCDSDFTSEYLPSEEYHESEVD--VGKKFLPCVGKDPMETTTGLAAN 229

BLAST of Csa3G124790 vs. TrEMBL
Match: A0A061E1G2_THECC (Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_005434 PE=4 SV=1)

HSP 1 Score: 70.1 bits (170), Expect = 6.1e-09
Identity = 53/143 (37.06%), Positives = 78/143 (54.55%), Query Frame = 1

Query: 212 EDNEEQEVLDESTR-------RLLEQVKGAISLSKSCRLVERCGLDWLIRELFRRELADV 271
           ED E+ +V +E T        +LL  VK   SL KS R +    +D L+ +LFR ELA  
Sbjct: 311 EDGEDDDVEEEKTNEVEEKAWQLLNHVKET-SLLKSYRYIS---IDKLLLDLFREELATK 370

Query: 272 QDVDERVRNDDRRIRVK----NGKDDEYVCDWFLSHKGKESYVREMEREGKWEIFGVDEK 331
            +   +   +   IR      NG+ +E    W +  K +E+YVR+M+REGKW  F  +E+
Sbjct: 371 WNETRKEEVEHDMIRQAEAWINGEQNE-TAKWRVWEK-REAYVRDMDREGKWRKF-EEEQ 430

Query: 332 IELGLEIEGEILGCLVDEILLDI 344
            EL LE+E  ++  LVDE+L D+
Sbjct: 431 EELALEVESRVMNILVDELLFDL 446



BLAST of Csa3G124790 vs. TAIR10
Match: AT4G00770.1 (AT4G00770.1 unknown protein)

HSP 1 Score: 80.9 bits (198), Expect = 1.7e-15
Identity = 94/335 (28.06%), Postives = 142/335 (42.39%), Query Frame = 1

Query: 16  EAPTRMLKDFLHETN----PNGLAS--------PKPK-PTSFKALAFHAVVAAVKRISLP 75
           E  +RMLKD L E +     NG  S        P P  P   ++ A  AV+ A+K +   
Sbjct: 2   ELRSRMLKDCLLEDSNSCSSNGFKSIPRRHPLNPFPMIPKRKQSNALQAVINAIKNLHSN 61

Query: 76  SVKSPR--IFPRSLSRRLLKKTERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAAPPL 135
           ++KS    I PRSLSRRL  K + + + +    V+++KDI+RW S +DL ++ +   P  
Sbjct: 62  TIKSAPSGILPRSLSRRLATKNKAENQAS--ITVIRVKDIVRWHSSKDLHEDISHFEP-- 121

Query: 136 DFAESPDRYTYTAAATTTTTTTTTTTTSSSKSSSWCESDFTAEDLASPSWR---DWSDDG 195
                   + YT     TTTTT ++TTS +  SSW + DFT+E L S SW    +   + 
Sbjct: 122 --------HQYTTK--NTTTTTGSSTTSGTSCSSWSDLDFTSEFLPS-SWGSNVEECGEK 181

Query: 196 TMGKMYFPCVGEDS-------------NETTAAYAQNDEEVNALLIREDNEEQEVLDEST 255
              K    CVGEDS              E      +++  V+   I+ + E  E  D S 
Sbjct: 182 QSVKNNLHCVGEDSCTAVILADTEVGPEENLQCEKEHNSPVSVFEIQHE-EYDETSDSSF 241

Query: 256 RRLLEQVK-------GAISLSKSCRLVERCGLDWLIRELFRRELADVQDVDERVRNDDRR 313
            + L+ V+         I   +S   +    LD          +   Q+ D +  +D+  
Sbjct: 242 SQCLDNVERTKQKLMQTIQRFESLANISPFNLDEWGSMDEASCMEGGQETDTKYDDDENC 301

BLAST of Csa3G124790 vs. NCBI nr
Match: gi|449432203|ref|XP_004133889.1| (PREDICTED: uncharacterized protein LOC101208043 [Cucumis sativus])

HSP 1 Score: 698.0 bits (1800), Expect = 8.5e-198
Identity = 346/346 (100.00%), Positives = 346/346 (100.00%), Query Frame = 1

Query: 1   MAAPTLSFPPFPLNREAPTRMLKDFLHETNPNGLASPKPKPTSFKALAFHAVVAAVKRIS 60
           MAAPTLSFPPFPLNREAPTRMLKDFLHETNPNGLASPKPKPTSFKALAFHAVVAAVKRIS
Sbjct: 1   MAAPTLSFPPFPLNREAPTRMLKDFLHETNPNGLASPKPKPTSFKALAFHAVVAAVKRIS 60

Query: 61  LPSVKSPRIFPRSLSRRLLKKTERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAAPPL 120
           LPSVKSPRIFPRSLSRRLLKKTERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAAPPL
Sbjct: 61  LPSVKSPRIFPRSLSRRLLKKTERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAAPPL 120

Query: 121 DFAESPDRYTYTAAATTTTTTTTTTTTSSSKSSSWCESDFTAEDLASPSWRDWSDDGTMG 180
           DFAESPDRYTYTAAATTTTTTTTTTTTSSSKSSSWCESDFTAEDLASPSWRDWSDDGTMG
Sbjct: 121 DFAESPDRYTYTAAATTTTTTTTTTTTSSSKSSSWCESDFTAEDLASPSWRDWSDDGTMG 180

Query: 181 KMYFPCVGEDSNETTAAYAQNDEEVNALLIREDNEEQEVLDESTRRLLEQVKGAISLSKS 240
           KMYFPCVGEDSNETTAAYAQNDEEVNALLIREDNEEQEVLDESTRRLLEQVKGAISLSKS
Sbjct: 181 KMYFPCVGEDSNETTAAYAQNDEEVNALLIREDNEEQEVLDESTRRLLEQVKGAISLSKS 240

Query: 241 CRLVERCGLDWLIRELFRRELADVQDVDERVRNDDRRIRVKNGKDDEYVCDWFLSHKGKE 300
           CRLVERCGLDWLIRELFRRELADVQDVDERVRNDDRRIRVKNGKDDEYVCDWFLSHKGKE
Sbjct: 241 CRLVERCGLDWLIRELFRRELADVQDVDERVRNDDRRIRVKNGKDDEYVCDWFLSHKGKE 300

Query: 301 SYVREMEREGKWEIFGVDEKIELGLEIEGEILGCLVDEILLDIFSL 347
           SYVREMEREGKWEIFGVDEKIELGLEIEGEILGCLVDEILLDIFSL
Sbjct: 301 SYVREMEREGKWEIFGVDEKIELGLEIEGEILGCLVDEILLDIFSL 346

BLAST of Csa3G124790 vs. NCBI nr
Match: gi|659075370|ref|XP_008438109.1| (PREDICTED: uncharacterized protein LOC103483313 isoform X2 [Cucumis melo])

HSP 1 Score: 604.4 bits (1557), Expect = 1.3e-169
Identity = 304/344 (88.37%), Positives = 317/344 (92.15%), Query Frame = 1

Query: 1   MAAPTLSFPPFPLNREAPTRMLKDFLHETNPNGLASPKPKPTSFKALAFHAVVAAVKRIS 60
           MAAP+LSFPPFPLNRE PTRMLKDFLHETNPNG+AS KPKPTSFKALAFHAVVAAVKRIS
Sbjct: 1   MAAPSLSFPPFPLNREGPTRMLKDFLHETNPNGIASSKPKPTSFKALAFHAVVAAVKRIS 60

Query: 61  LPSVKSPRIFPRSLSRRLLKKTERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAAPPL 120
            PSVKSPRIFPRSLSRRLL+KTERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAAPPL
Sbjct: 61  FPSVKSPRIFPRSLSRRLLRKTERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAAPPL 120

Query: 121 DFAESPDRYTYTAAATTTTTTTTTTTTSSSKSSSWCESDFTAEDLASPSWRDWSDDGTMG 180
           DFAESPDRYTYTAAA  TTTTTTTTTTSSSKSSSWCESDFTAEDL SPSWRDWSDDGT+G
Sbjct: 121 DFAESPDRYTYTAAA--TTTTTTTTTTSSSKSSSWCESDFTAEDLPSPSWRDWSDDGTIG 180

Query: 181 KMYFPCVGEDSNETTAAYAQNDEEVNALLIREDNEEQEVLDESTRRLLEQVKGAISLSKS 240
           KMYF CVGEDS ETTAA+A+ND+EVNAL  RED EEQEVLDESTRRLLEQVKG ISLS+S
Sbjct: 181 KMYFRCVGEDSTETTAAHAKNDKEVNALSRREDKEEQEVLDESTRRLLEQVKGVISLSES 240

Query: 241 CRLVERCGLDWLIRELFRRELADVQDVDERVRNDDRRIRVKNGKDDEYVCDWFLSHKGKE 300
           CRL E CGLD L+RELFRR+LA  QD       DD RIR+KNGKD EYV DWFLSHKGKE
Sbjct: 241 CRLAEHCGLDGLLRELFRRDLASFQD-------DDDRIRMKNGKDGEYVYDWFLSHKGKE 300

Query: 301 SYVREMEREGKWEIFGVDEKIELGLEIEGEILGCLVDEILLDIF 345
           SYVREMEREGKWEIFGVDEKI+LGLEIEGE+LGCLVDEILLDIF
Sbjct: 301 SYVREMEREGKWEIFGVDEKIDLGLEIEGEVLGCLVDEILLDIF 335

BLAST of Csa3G124790 vs. NCBI nr
Match: gi|659075368|ref|XP_008438108.1| (PREDICTED: uncharacterized protein LOC103483313 isoform X1 [Cucumis melo])

HSP 1 Score: 599.4 bits (1544), Expect = 4.1e-168
Identity = 304/346 (87.86%), Positives = 317/346 (91.62%), Query Frame = 1

Query: 1   MAAPTLSFPPFPLNREAPTRMLKDFLHETNPNGLASPKPKPTSFKALAFHAVVAAVKRIS 60
           MAAP+LSFPPFPLNRE PTRMLKDFLHETNPNG+AS KPKPTSFKALAFHAVVAAVKRIS
Sbjct: 1   MAAPSLSFPPFPLNREGPTRMLKDFLHETNPNGIASSKPKPTSFKALAFHAVVAAVKRIS 60

Query: 61  LPSVKSPRIFPRSLSRRLLKKTERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAAPPL 120
            PSVKSPRIFPRSLSRRLL+KTERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAAPPL
Sbjct: 61  FPSVKSPRIFPRSLSRRLLRKTERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAAPPL 120

Query: 121 DFAESPDRYTYTAAATTTTTTTTTTTTSSSKSSSWCESDFTAEDLASPSWRDWSDDGTMG 180
           DFAESPDRYTYTAAA  TTTTTTTTTTSSSKSSSWCESDFTAEDL SPSWRDWSDDGT+G
Sbjct: 121 DFAESPDRYTYTAAA--TTTTTTTTTTSSSKSSSWCESDFTAEDLPSPSWRDWSDDGTIG 180

Query: 181 KMYFPCVGEDSNETTAAYAQNDEEV--NALLIREDNEEQEVLDESTRRLLEQVKGAISLS 240
           KMYF CVGEDS ETTAA+A+ND+EV  NAL  RED EEQEVLDESTRRLLEQVKG ISLS
Sbjct: 181 KMYFRCVGEDSTETTAAHAKNDKEVGINALSRREDKEEQEVLDESTRRLLEQVKGVISLS 240

Query: 241 KSCRLVERCGLDWLIRELFRRELADVQDVDERVRNDDRRIRVKNGKDDEYVCDWFLSHKG 300
           +SCRL E CGLD L+RELFRR+LA  QD       DD RIR+KNGKD EYV DWFLSHKG
Sbjct: 241 ESCRLAEHCGLDGLLRELFRRDLASFQD-------DDDRIRMKNGKDGEYVYDWFLSHKG 300

Query: 301 KESYVREMEREGKWEIFGVDEKIELGLEIEGEILGCLVDEILLDIF 345
           KESYVREMEREGKWEIFGVDEKI+LGLEIEGE+LGCLVDEILLDIF
Sbjct: 301 KESYVREMEREGKWEIFGVDEKIDLGLEIEGEVLGCLVDEILLDIF 337

BLAST of Csa3G124790 vs. NCBI nr
Match: gi|823150658|ref|XP_012475152.1| (PREDICTED: uncharacterized protein LOC105791579 [Gossypium raimondii])

HSP 1 Score: 159.5 bits (402), Expect = 1.1e-35
Identity = 124/381 (32.55%), Positives = 178/381 (46.72%), Query Frame = 1

Query: 16  EAPTRMLKDFLHET-------------------------NPNG-LASPKPKPTSFKALAF 75
           E   RMLKDF+H+                          NPN  L   + K  S    AF
Sbjct: 4   ERRPRMLKDFIHDDPNSCSSNGFKSFPRKSTQNSIIFRENPNQKLQRSRSKAASATISAF 63

Query: 76  HAVVAAVKRISLPSVKSPRIFPRSLSRRLLKKTERDERETGGDFVVKIKDIIRWKSFRDL 135
            A++  +K I   S     + PR+LSR+  K+     +E      V +KDIIRWKSFRDL
Sbjct: 64  QAMINVIKSIHFASSSPSILLPRTLSRKPSKRKISQNKEAEIKMTVTVKDIIRWKSFRDL 123

Query: 136 IDETTAAAPPLDFAESPDRYTYTAAATTTTTTTTT--TTTSSSKSSSWCESDFTAEDLAS 195
           ++E    + PLDFA S     +    TTTTT + T  + T+SS  SSWC+SDFT+E L S
Sbjct: 124 LEE---KSQPLDFAPSSASPHHHCTTTTTTTGSNTPCSCTTSSNGSSWCDSDFTSEYLPS 183

Query: 196 PSWRDWSDDGTMGKMYFPCVGEDSNETTAAYAQN----------DEEVNALLIR------ 255
             + +   D  +GK + PCVG+D+ ETT   A N          +EE     +       
Sbjct: 184 DEYGENEVDNMVGKKFSPCVGKDTMETTTRTAANTDMGPKHASVEEEPQHSPLSVLDFEY 243

Query: 256 ----EDNEEQEVLDESTRRLLEQVKGAISLSKSCRLVERCGLDWLIRELFRRELADVQDV 315
               ED EE   ++E    LL  VK    L++         +D L+ +LFR E+    D 
Sbjct: 244 GGDDEDGEEANEIEEKAWELLNGVKETSPLTRYKN--NNICIDKLLLDLFREEMETKWDQ 303

Query: 316 DERVRNDDRR-IRVKNGKDDEYVCDWFLSHKG----KESYVREMEREGKWEIFGVDEKIE 344
              +   +R  +RV       ++C+     +G    +E  V +MEREGKW     +E+ E
Sbjct: 304 TRNIEELEREMVRVAKA----WICEEQNEKRGVGDKREECVGDMEREGKWRDRFHEEQEE 363

BLAST of Csa3G124790 vs. NCBI nr
Match: gi|764625929|ref|XP_004306853.2| (PREDICTED: uncharacterized protein LOC101297873 [Fragaria vesca subsp. vesca])

HSP 1 Score: 147.5 bits (371), Expect = 4.3e-32
Identity = 137/435 (31.49%), Positives = 189/435 (43.45%), Query Frame = 1

Query: 10  PFPLNREAPTRMLKDFLHE-----------------------TNPNGLAS-------PKP 69
           PFP+ R  PT MLKDFL+E                       +NPN  A+        + 
Sbjct: 18  PFPIERR-PT-MLKDFLNENSNSCSSSGFKSFPRKPELDCKASNPNPTATITSKLQRSRS 77

Query: 70  KPTSFKALAFHAVVAAVKRISLPSVKSPRIFPRSLSRRLLKKTE---RDERETGGDFVVK 129
           K  S    AF +++ AVK I   +VK+P + PRSLSRRL K++    +   +T     VK
Sbjct: 78  KAASTTISAFQSIMNAVKNIQFTAVKTPSLLPRSLSRRLSKRSSWSRKQSLQTQVQISVK 137

Query: 130 IKDIIRWKSFRDLIDETTAAAPPLDFAESPDRYTYTAAATTTTTTTTTTTTSSSKSSSWC 189
           +KDI+RW SFRD   E    + P DFA SP   T TA   T TTTTTTT ++SS  SSWC
Sbjct: 138 VKDILRWTSFRD---ERLPQSLPWDFASSPHHCT-TATTVTDTTTTTTTCSNSSNGSSWC 197

Query: 190 ESDFTAEDLASPSWRDWSDDGTMGKMYFPCVGEDSNETTAAYA---QNDEEVNALLIRED 249
           +SDFTAE L SP       +  MGK Y PCVG  S E TA  A   + D +V  L   ED
Sbjct: 198 DSDFTAEFLQSPC----DGENEMGKKYSPCVGRVSMEATAGPARCSELDPKVEVLSCDED 257

Query: 250 NEEQEVL----------DESTRRLLEQVKGAISLSKSC---RLVERCGL-----DWLIRE 309
            +   V           +E+     +Q    +  +K     RL +  GL      WL  E
Sbjct: 258 EQHSPVSVLNFQFGEDEEETFSTTFDQSLANVERTKVMLMQRLKQFEGLANLDNSWLSPE 317

Query: 310 --LFRRELADVQDVDER-------VRNDDRRIRVKNGKDDEYVCDWFLSHKGKESYVRE- 344
             L+  E A   +++ER       V+        ++  +D+ + D+F    G +   +  
Sbjct: 318 EGLYYEEEAGEHEIEERAMEMLNHVKETSLTPLYEDSMEDKLLLDFFREEMGAQRNDQTE 377

The following BLAST results are available for this feature:

vs. TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0L6C4_CUCSA  5.9e-198  100.00        Uncharacterized protein OS=Cucumis sativus GN=Csa_3G124790 PE=4 SV=1
A0A0D2QYB6_GOSRA  7.6e-36   32.55         Uncharacterized protein OS=Gossypium raimondii GN=B456_004G156200 PE=4 SV=1
M5XHR6_PRUPE      4.8e-30   31.61         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa026717mg PE=4 SV=1
A0A061E1G2_THECC  4.8e-30   48.81         Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_005434 PE=4 SV=1
A0A061E1G2_THECC  6.1e-09   37.06         Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_005434 PE=4 SV=1

vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT4G00770.1  1.7e-15  28.06         unknown protein

vs. NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|449432203|ref|XP_004133889.1|  8.5e-198  100.00        PREDICTED: uncharacterized protein LOC101208043 [Cucumis sativus]
gi|659075370|ref|XP_008438109.1|  1.3e-169  88.37         PREDICTED: uncharacterized protein LOC103483313 isoform X2 [Cucumis melo]
gi|659075368|ref|XP_008438108.1|  4.1e-168  87.86         PREDICTED: uncharacterized protein LOC103483313 isoform X1 [Cucumis melo]
gi|823150658|ref|XP_012475152.1|  1.1e-35   32.55         PREDICTED: uncharacterized protein LOC105791579 [Gossypium raimondii]
gi|764625929|ref|XP_004306853.2|  4.3e-32   31.49         PREDICTED: uncharacterized protein LOC101297873 [Fragaria vesca subsp. vesca]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                        Sequence type in Unigene
CU098348      cucumber EST collection version 3.0  transcribed_cluster
CU121795      cucumber EST collection version 3.0  transcribed_cluster
CU132291      cucumber EST collection version 3.0  transcribed_cluster
CU132894      cucumber EST collection version 3.0  transcribed_cluster
CU140050      cucumber EST collection version 3.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa3G124790.1  Csa3G124790.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
CU132894      CU132894     transcribed_cluster
CU098348      CU098348     transcribed_cluster
CU121795      CU121795     transcribed_cluster
CU140050      CU140050     transcribed_cluster
CU132291      CU132291     transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term  IPR Description   Source   Source Term    Source Description   Alignment
None      No IPR available  PANTHER  PTHR33623      FAMILY NOT NAMED     coord: 48..346; score: 3.5
None      No IPR available  PANTHER  PTHR33623:SF4  SUBFAMILY NOT NAMED  coord: 48..346; score: 3.5

The following gene(s) are paralogous to this gene:

None