Cp4.1LG01g04080 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g04080
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Chaperone protein DnaJ
Location: Cp4.1LG01 : 1401191 .. 1404121 (-)
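The location field can be unpacked into a sequence ID, coordinates, and strand. The sketch below is illustrative only: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual convention for this kind of genome-browser record, but not stated on this page).

```python
import re

# Location string as shown in the record above.
LOCATION = "Cp4.1LG01 : 1401191 .. 1404121 (-)"


def parse_location(text: str) -> dict:
    """Split 'seqid : start .. end (strand)' into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    feature length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


info = parse_location(LOCATION)
print(info["seqid"], info["strand"], info["length"])
```

Under the inclusive-coordinate assumption, the gene spans 2931 bases on the minus strand of Cp4.1LG01.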
The following sequences are available for this feature:

Gene sequence (with intron)

GATTTCTGCTCAGCACCGTAGATTTTAATTGTTTTGCTGACTTTTCCAACGTCAAAATCCCCAATCAATAGTCTTCCTTCACACGGATTATGCATCAAAATATCTCACCTCCTCCTCTTATCTTGTCATTAAATATCTTCCTACCAAGTAGCACCTATTTCTCTCTTCTCTTCTCTCTTATCCTTCTGAAAACTAGGAAGATGACTGCCGCATGGCTTCCTCTGTACACGCCCGTTGTTGCAACAAAAATACAGAATCCAACTCGAAGAAAGTTGGGATCGTACAACTTTTCAACTTCCAAGATGCTTTATGGCAATACTTTGGCATGCAGGGCAGGTTCTTCAATAACAGACTTTGATCTTTATGATCTTCTTGGCATCGACAACACCTCCGATTCATCGCGGATTAAGGCAGCGTACCGTGCGCTCCAAAAGCAGTGCCACCCCGACATCGCCGGTCCTGCTGGCCATGACATGGCTATCATTCTCAATGAAGTGTATTCAGTTCTTTCGGATCCTAATTCCAGGTTGGCTTACGATAAGGTCAGAACTGAAAACTCTATTCTAATTTGTGCAGCTCTCTATAGAATTTGGAAACAAGATTGTTTGTGACAAAAATGATGTTCGCAGGAGCAAGCAAAAATGGCAGAACTTCGAGGTTACACAGGGAAGTCCGTTTATTCGGTATGGTTGGGATCGGAAAGCGAACGAAGGGCGGTTTTTGTAGATGAAGTGAAGTGCATTGGCTGCTTGAAATGTGCTTTATTTGCTGGGAAAACTTTTGCTGTCGAATCTGTGTATGGGAGAGCTAGAGTTGTGGCGCAGTGGGCTGATCCTGAATATAAAGTGATGGAGGCCATTGAAGCTTGCCCAGTTGACTGCATTTCGTGAGTGTTTAATCAACCAGTTATAACTTCTTGTCTCATTCTTCTGATTTGCTAATAAGTTCTTTTCTTGTTCTTGGCAAATTAAACCCAGGATGGTGGAAAGGTCAGATTTGGCAGCCCTAGAGTTCTTAATGTCAAAGCAACCACGTGGAAACGTAAGAGTTGGTATGGGCAACGCAGCTGGTGAAAGAGTGTCAAACATTTTCACAGACGTGAAGAAGTTCCAAACCAGATTCAAGGAAGCTATGGAGAAGACTTCAAAACAGCATGTCAAGGTTAGACACAAAATATTCTTCGACATCACCATTAACTTAAAATCGTTCAAAACTTCATCCAGTTGTATGATTGTTTATCAGGGGACAACCTTCGAGAACAAAGCACAGTTGTCTGCAATTCAAGCCATCAGATCAATCTCAAATTGCTTATTTTGGCAGACAGCCACCGGATCCAAACACAGTCAAAACCTGACGCTCGTCGTCAACACATCCACACCACAAATCAACAAACTTCAAGCTGCTATTACTGCAAGGAAACAACTCAAGGCAAAAACCGAAGACAAAAATGGAACTACAACAAAATACATAAAGGACGATTACTGGGTTCCGACCACCCTTGCACTTCCAGTCTCAACTCAAACTCCAATAAGCACAATCTCAAATCCAAGGGTAGAGACCAAACGTAGCAAAGAATCAAAAGACCTTGATTTTGAAGTGGGTGGTGGTGATCATACTAGCCCCATGAGATTGGCATTACCAGTCTTGATATCGATAGTTGCAACCACTGTAATCCAGCAAATGGTGACAGATGATGGTGCCAGCGGCTTGAAGGAACACATCGGAGGTTCTGTGGCACTGGAGTTAGTTAATAGTCATTGGATGCAGCTTATACAAAGAGGAGTCACTTGGTATGTCATTGGAATAGCAGTGATAGGAATGGTCGAAAAGATTGCAAGAAAGATTAGGCATTGAAGAATTACAGAGAGCAAAATTTTCCAGAAAATAGAAGCTGCATATATGGCTATTAGTTATACCAGAAGAAAAATGGATATGTGATCACAAATGACAGAATTATGAAACGAAGTGTATTAGAAGTTCAGTGTTCATACCCTTTTAGT
TCATAAACCAGGCAGTTCAGTGTTCATTTTCATATCAACTATTGTAATCAAACATCATCAATAGATCGGAAAACAACTTATTCAACCAAAACATACCTGATTAAAGACTTGTTACCATAAACAGTTATATTCCCTCATCAGAGGTCCAACAGTGATCAAATGTTTTGAGAAGTCATGGCACACAATGACATCCAAAATAAACCATATCCTTTTCACGAGCGAATAATGTCATAGAGTACACATTTTCGAGACGCGATCTCTAAAATTTAAGATCAATAAGAACTTGCAGAAGCAGCAGAACGTCTTAGTAGCGATATTTGGTTGAATAGTCCTTAGGTGCAATGACTAAACCACGTCCTTTCCTTGCTCCACGATAGACTGTTTCAATGATATTGATGAACTCTTGCTTATCCTTGAGTGCCCAGTTTATCTTATTGTTGTTACCAGTACCAAGATCAATCATTATGTGCTTGTTCCTAAAGAAGAACATGACAGTGGAAGGATCATACAGCTCGTACATTGTGTTGAAATCAGGAACCTCAGTGATATCCACAAGGTATATTACTGCGAAGTTCTTGATCGTCTCAGCAACTGAGGCCAACACCTCATCCATCTAGATCAAGAACATACCAATCAGATATAAACATCTCAAAATACTATAAAGATGATCATGTAGAGCAAACCAAACAAATTATGTGAAAAAATCTTCATGACTTTAAACATTCACGATGGAAGGACTGTGGCTCCGCACAACAAAGACAAGGGGAAAAAGACTCTAAACAAATCATACACCAAAAACGAGTCTAGATGGCTCATGAACTCAAACTTTGATCAATCCGCTAGCATTGAAAACCTCGTGCATACAGCAACAACAGTCAAAGACTGGATTATTAAAGATCAATAAGCAAGTCGTTTAATAACAATTGCAATG

mRNA sequence

GATTTCTGCTCAGCACCGTAGATTTTAATTGTTTTGCTGACTTTTCCAACGTCAAAATCCCCAATCAATAGTCTTCCTTCACACGGATTATGCATCAAAATATCTCACCTCCTCCTCTTATCTTGTCATTAAATATCTTCCTACCAAGTAGCACCTATTTCTCTCTTCTCTTCTCTCTTATCCTTCTGAAAACTAGGAAGATGACTGCCGCATGGCTTCCTCTGTACACGCCCGTTGTTGCAACAAAAATACAGAATCCAACTCGAAGAAAGTTGGGATCGTACAACTTTTCAACTTCCAAGATGCTTTATGGCAATACTTTGGCATGCAGGGCAGGTTCTTCAATAACAGACTTTGATCTTTATGATCTTCTTGGCATCGACAACACCTCCGATTCATCGCGGATTAAGGCAGCGTACCGTGCGCTCCAAAAGCAGTGCCACCCCGACATCGCCGGTCCTGCTGGCCATGACATGGCTATCATTCTCAATGAAGTGTATTCAGTTCTTTCGGATCCTAATTCCAGGTTGGCTTACGATAAGGAGCAAGCAAAAATGGCAGAACTTCGAGGTTACACAGGGAAGTCCGTTTATTCGGTATGGTTGGGATCGGAAAGCGAACGAAGGGCGGTTTTTGTAGATGAAGTGAAGTGCATTGGCTGCTTGAAATGTGCTTTATTTGCTGGGAAAACTTTTGCTGTCGAATCTGTGTATGGGAGAGCTAGAGTTGTGGCGCAGTGGGCTGATCCTGAATATAAAGTGATGGAGGCCATTGAAGCTTGCCCAGTTGACTGCATTTCGATGGTGGAAAGGTCAGATTTGGCAGCCCTAGAGTTCTTAATGTCAAAGCAACCACGTGGAAACGTAAGAGTTGGTATGGGCAACGCAGCTGGTGAAAGAGTGTCAAACATTTTCACAGACGTGAAGAAGTTCCAAACCAGATTCAAGGAAGCTATGGAGAAGACTTCAAAACAGCATGTCAAGGGGACAACCTTCGAGAACAAAGCACAGTTGTCTGCAATTCAAGCCATCAGATCAATCTCAAATTGCTTATTTTGGCAGACAGCCACCGGATCCAAACACAGTCAAAACCTGACGCTCGTCGTCAACACATCCACACCACAAATCAACAAACTTCAAGCTGCTATTACTGCAAGGAAACAACTCAAGGCAAAAACCGAAGACAAAAATGGAACTACAACAAAATACATAAAGGACGATTACTGGGTTCCGACCACCCTTGCACTTCCAGTCTCAACTCAAACTCCAATAAGCACAATCTCAAATCCAAGGGTAGAGACCAAACGTAGCAAAGAATCAAAAGACCTTGATTTTGAAGTGGGTGGTGGTGATCATACTAGCCCCATGAGATTGGCATTACCAGTCTTGATATCGATAGTTGCAACCACTGTAATCCAGCAAATGGTGACAGATGATGGTGCCAGCGGCTTGAAGGAACACATCGGAGGTTCTGTGGCACTGGAGTTAGTTAATAGTCATTGGATGCAGCTTATACAAAGAGGAGTCACTTGGTATGTCATTGGAATAGCAGTGATAGGAATGGTCGAAAAGATTGCAAGAAAGATTAGGCATTGAAGAATTACAGAGAGCAAAATTTTCCAGAAAATAGAAGCTGCATATATGGCTATTAGTTATACCAGAAGAAAAATGGATATGTGATCACAAATGACAGAATTATGAAACGAAGTGTATTAGAAGTTCAGTGTTCATACCCTTTTAGTTCATAAACCAGGCAGTTCAGTGTTCATTTTCATATCAACTATTGTAATCAAACATCATCAATAGATCGGAAAACAACTTATTCAACCAAAACATACCTGATTAAAGACTTGTTACCATAAACAGTTATATTCCCTCATCAGAGGTCCAACAGTGATCAAATGTTTTGAGAAGTCATGGCACACAATGACATCCAAAATAAACCATATCCTTTTCACGAGCGAATAATGTCATAGAGTACACATTTTCGAGACGCGATCT
CTAAAATTTAAGATCAATAAGAACTTGCAGAAGCAGCAGAACGTCTTAGTAGCGATATTTGGTTGAATAGTCCTTAGGTGCAATGACTAAACCACGTCCTTTCCTTGCTCCACGATAGACTGTTTCAATGATATTGATGAACTCTTGCTTATCCTTGAGTGCCCAGTTTATCTTATTGTTGTTACCAGTACCAAGATCAATCATTATGTGCTTGTTCCTAAAGAAGAACATGACAGTGGAAGGATCATACAGCTCGTACATTGTGTTGAAATCAGGAACCTCAGTGATATCCACAAGGTATATTACTGCGAAGTTCTTGATCGTCTCAGCAACTGAGGCCAACACCTCATCCATCTAGATCAAGAACATACCAATCAGATATAAACATCTCAAAATACTATAAAGATGATCATGTAGAGCAAACCAAACAAATTATGTGAAAAAATCTTCATGACTTTAAACATTCACGATGGAAGGACTGTGGCTCCGCACAACAAAGACAAGGGGAAAAAGACTCTAAACAAATCATACACCAAAAACGAGTCTAGATGGCTCATGAACTCAAACTTTGATCAATCCGCTAGCATTGAAAACCTCGTGCATACAGCAACAACAGTCAAAGACTGGATTATTAAAGATCAATAAGCAAGTCGTTTAATAACAATTGCAATG

Coding sequence (CDS)

ATGCATCAAAATATCTCACCTCCTCCTCTTATCTTGTCATTAAATATCTTCCTACCAAGTAGCACCTATTTCTCTCTTCTCTTCTCTCTTATCCTTCTGAAAACTAGGAAGATGACTGCCGCATGGCTTCCTCTGTACACGCCCGTTGTTGCAACAAAAATACAGAATCCAACTCGAAGAAAGTTGGGATCGTACAACTTTTCAACTTCCAAGATGCTTTATGGCAATACTTTGGCATGCAGGGCAGGTTCTTCAATAACAGACTTTGATCTTTATGATCTTCTTGGCATCGACAACACCTCCGATTCATCGCGGATTAAGGCAGCGTACCGTGCGCTCCAAAAGCAGTGCCACCCCGACATCGCCGGTCCTGCTGGCCATGACATGGCTATCATTCTCAATGAAGTGTATTCAGTTCTTTCGGATCCTAATTCCAGGTTGGCTTACGATAAGGAGCAAGCAAAAATGGCAGAACTTCGAGGTTACACAGGGAAGTCCGTTTATTCGGTATGGTTGGGATCGGAAAGCGAACGAAGGGCGGTTTTTGTAGATGAAGTGAAGTGCATTGGCTGCTTGAAATGTGCTTTATTTGCTGGGAAAACTTTTGCTGTCGAATCTGTGTATGGGAGAGCTAGAGTTGTGGCGCAGTGGGCTGATCCTGAATATAAAGTGATGGAGGCCATTGAAGCTTGCCCAGTTGACTGCATTTCGATGGTGGAAAGGTCAGATTTGGCAGCCCTAGAGTTCTTAATGTCAAAGCAACCACGTGGAAACGTAAGAGTTGGTATGGGCAACGCAGCTGGTGAAAGAGTGTCAAACATTTTCACAGACGTGAAGAAGTTCCAAACCAGATTCAAGGAAGCTATGGAGAAGACTTCAAAACAGCATGTCAAGGGGACAACCTTCGAGAACAAAGCACAGTTGTCTGCAATTCAAGCCATCAGATCAATCTCAAATTGCTTATTTTGGCAGACAGCCACCGGATCCAAACACAGTCAAAACCTGACGCTCGTCGTCAACACATCCACACCACAAATCAACAAACTTCAAGCTGCTATTACTGCAAGGAAACAACTCAAGGCAAAAACCGAAGACAAAAATGGAACTACAACAAAATACATAAAGGACGATTACTGGGTTCCGACCACCCTTGCACTTCCAGTCTCAACTCAAACTCCAATAAGCACAATCTCAAATCCAAGGGTAGAGACCAAACGTAGCAAAGAATCAAAAGACCTTGATTTTGAAGTGGGTGGTGGTGATCATACTAGCCCCATGAGATTGGCATTACCAGTCTTGATATCGATAGTTGCAACCACTGTAATCCAGCAAATGGTGACAGATGATGGTGCCAGCGGCTTGAAGGAACACATCGGAGGTTCTGTGGCACTGGAGTTAGTTAATAGTCATTGGATGCAGCTTATACAAAGAGGAGTCACTTGGTATGTCATTGGAATAGCAGTGATAGGAATGGTCGAAAAGATTGCAAGAAAGATTAGGCATTGA

Protein sequence

MHQNISPPPLILSLNIFLPSSTYFSLLFSLILLKTRKMTAAWLPLYTPVVATKIQNPTRRKLGSYNFSTSKMLYGNTLACRAGSSITDFDLYDLLGIDNTSDSSRIKAAYRALQKQCHPDIAGPAGHDMAIILNEVYSVLSDPNSRLAYDKEQAKMAELRGYTGKSVYSVWLGSESERRAVFVDEVKCIGCLKCALFAGKTFAVESVYGRARVVAQWADPEYKVMEAIEACPVDCISMVERSDLAALEFLMSKQPRGNVRVGMGNAAGERVSNIFTDVKKFQTRFKEAMEKTSKQHVKGTTFENKAQLSAIQAIRSISNCLFWQTATGSKHSQNLTLVVNTSTPQINKLQAAITARKQLKAKTEDKNGTTTKYIKDDYWVPTTLALPVSTQTPISTISNPRVETKRSKESKDLDFEVGGGDHTSPMRLALPVLISIVATTVIQQMVTDDGASGLKEHIGGSVALELVNSHWMQLIQRGVTWYVIGIAVIGMVEKIARKIRH
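The protein sequence corresponds to a translation of the coding sequence. A minimal sketch of that step follows, assuming the standard genetic code (the page does not say which translation table was used); the `translate` helper is hypothetical and only the first eight codons of the CDS are used as input.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Codons containing characters outside A/C/G/T are reported as 'X'.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 nucleotides of the CDS listed above.
cds_prefix = "ATGCATCAAAATATCTCACCTCCT"
print(translate(cds_prefix))  # MHQNISPP
```

The output matches the first eight residues of the protein sequence above; passing the full CDS to the same function should reproduce the whole protein under the stated assumption.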
BLAST of Cp4.1LG01g04080 vs. Swiss-Prot
Match: DNAJ_BIFLO (Chaperone protein DnaJ OS=Bifidobacterium longum (strain NCC 2705) GN=dnaJ PE=3 SV=1)

HSP 1 Score: 60.1 bits (144), Expect = 8.1e-08
Identity = 27/61 (44.26%), Positives = 36/61 (59.02%), Query Frame = 1

Query: 90  DLYDLLGIDNTSDSSRIKAAYRALQKQCHPDIAGPAGHDMAIILNEVYSVLSDPNSRLAY 149
           D Y+ LG++  +    IK AYR L ++ HPDIAGP   D    +N  Y VLS+P+ R  Y
Sbjct: 3   DYYETLGVERGASDDEIKKAYRKLSRKYHPDIAGPEFEDKFKEVNNAYDVLSNPDKRRMY 62

Query: 150 D 151
           D
Sbjct: 63  D 63
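The percentages in each HSP summary are the match counts divided by the alignment length. The sketch below shows that arithmetic for the hit above; the helper name `percent` is hypothetical, and rounding to two decimals is an assumption based on how the values are displayed.

```python
def percent(count: int, alignment_length: int) -> float:
    """Return count / alignment_length as a percentage, rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)


# Counts from the DNAJ_BIFLO hit: 27 identical and 36 positive-scoring
# positions over an alignment of 61 columns.
identity = percent(27, 61)
positives = percent(36, 61)
print(identity, positives)  # 44.26 59.02
```

The same calculation applies to every hit in this report; only the counts and the alignment length change.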

BLAST of Cp4.1LG01g04080 vs. Swiss-Prot
Match: DNAJ_DICT6 (Chaperone protein DnaJ OS=Dictyoglomus thermophilum (strain ATCC 35947 / DSM 3960 / H-6-12) GN=dnaJ PE=3 SV=1)

HSP 1 Score: 59.3 bits (142), Expect = 1.4e-07
Identity = 28/66 (42.42%), Positives = 38/66 (57.58%), Query Frame = 1

Query: 87  TDFDLYDLLGIDNTSDSSRIKAAYRALQKQCHPDI-AGPAGHDMAIILNEVYSVLSDPNS 146
           T  D Y++LG+   +    IK AYR L +Q HPD+   P+ H+    +NE Y VLSDP  
Sbjct: 3   TKKDYYEILGVPRNATQDEIKQAYRRLVRQYHPDLNKDPSAHEKFKEINEAYEVLSDPQK 62

Query: 147 RLAYDK 152
           R  YD+
Sbjct: 63  RAQYDQ 68

BLAST of Cp4.1LG01g04080 vs. Swiss-Prot
Match: DNAJ_MARMM (Chaperone protein DnaJ OS=Maricaulis maris (strain MCS10) GN=dnaJ PE=3 SV=1)

HSP 1 Score: 58.5 bits (140), Expect = 2.4e-07
Identity = 29/64 (45.31%), Positives = 40/64 (62.50%), Query Frame = 1

Query: 90  DLYDLLGIDNTSDSSRIKAAYRALQKQCHPD-IAGPAGHDMAI-ILNEVYSVLSDPNSRL 149
           D Y++LG+D T+D   +K+AYR    + HPD   G A  +    ++ E YSVLSDPN R 
Sbjct: 5   DFYEVLGVDKTADEKTLKSAYRKQAMKYHPDRNPGDAEAEAQFKVVGEAYSVLSDPNKRA 64

Query: 150 AYDK 152
           AYD+
Sbjct: 65  AYDR 68

BLAST of Cp4.1LG01g04080 vs. Swiss-Prot
Match: DNAJ_PROMM (Chaperone protein DnaJ OS=Prochlorococcus marinus (strain MIT 9313) GN=dnaJ PE=3 SV=1)

HSP 1 Score: 57.4 bits (137), Expect = 5.3e-07
Identity = 28/63 (44.44%), Positives = 35/63 (55.56%), Query Frame = 1

Query: 90  DLYDLLGIDNTSDSSRIKAAYRALQKQCHPDI-AGPAGHDMAIILNEVYSVLSDPNSRLA 149
           D YDLLG+   +D   +K AYR L +Q HPDI   P   D    +   Y VLSDP +R  
Sbjct: 3   DYYDLLGVSKDADGDTLKRAYRRLARQYHPDINKDPGAEDRFKEIGRAYEVLSDPQTRGR 62

Query: 150 YDK 152
           YD+
Sbjct: 63  YDQ 65

BLAST of Cp4.1LG01g04080 vs. Swiss-Prot
Match: DNAJ_CLOTH (Chaperone protein DnaJ OS=Clostridium thermocellum (strain ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) GN=dnaJ PE=3 SV=1)

HSP 1 Score: 57.4 bits (137), Expect = 5.3e-07
Identity = 30/77 (38.96%), Positives = 40/77 (51.95%), Query Frame = 1

Query: 90  DLYDLLGIDNTSDSSRIKAAYRALQKQCHPDI--AGPAGHDMAIILNEVYSVLSDPNSRL 149
           D Y++LG+D  +  + IK AYR L KQ HPD+     A       +NE Y VLSDP  R 
Sbjct: 6   DYYEILGVDRGASDAEIKKAYRKLAKQYHPDMNPGDKAAEAKFKEINEAYEVLSDPQKRA 65

Query: 150 AYDKEQAKMAELRGYTG 165
            YD+      +  G+ G
Sbjct: 66  RYDQFGHSAFDPNGFGG 82

BLAST of Cp4.1LG01g04080 vs. TrEMBL
Match: A0A0A0KPY4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G175710 PE=4 SV=1)

HSP 1 Score: 706.8 bits (1823), Expect = 1.8e-200
Identity = 363/468 (77.56%), Positives = 401/468 (85.68%), Query Frame = 1

Query: 38  MTAAWLPLYTPVVATKIQNPTRRKLGSYNFSTSKMLYGNTLACRAGSSITDFDLYDLLGI 97
           M+A+WLPLYTP      Q P +RKLGSYN STS+ L+GNTL+C+A SSITDFDLYDLLGI
Sbjct: 1   MSASWLPLYTPTA----QYPIQRKLGSYNPSTSRKLHGNTLSCKAASSITDFDLYDLLGI 60

Query: 98  DNTSDSSRIKAAYRALQKQCHPDIAGPAGHDMAIILNEVYSVLSDPNSRLAYDKEQAKMA 157
           DNTS  SRIKAAYRALQK CHPDIAGPAGHDMAIILNE YSVLSDP+SRLAYDKEQAKMA
Sbjct: 61  DNTSHPSRIKAAYRALQKHCHPDIAGPAGHDMAIILNEAYSVLSDPSSRLAYDKEQAKMA 120

Query: 158 ELRGYTGKSVYSVWLGSESERRAVFVDEVKCIGCLKCALFAGKTFAVESVYGRARVVAQW 217
           ELRGYTGK VYSVWLGSESE+RAVFVDEVKCIGCLKCALFAGKTFAVESVYGRARVVAQW
Sbjct: 121 ELRGYTGKPVYSVWLGSESEQRAVFVDEVKCIGCLKCALFAGKTFAVESVYGRARVVAQW 180

Query: 218 ADPEYKVMEAIEACPVDCISMVERSDLAALEFLMSKQPRGNVRVGMGNAAGERVSNIFTD 277
           ADPEYKVMEAIEACPVDCISMVER+DLAALEFLMSKQPRGNVRVGMGN AGERVSNIFTD
Sbjct: 181 ADPEYKVMEAIEACPVDCISMVERTDLAALEFLMSKQPRGNVRVGMGNTAGERVSNIFTD 240

Query: 278 VKKFQTRFKEAMEKTSKQHVKGTTFENKAQLSAIQAIRSISNCLFWQTAT----GSKHSQ 337
           VKKFQ +F EAMEK  K+  KG TFE++ QL+AIQAIRSISN LFWQTAT    GSK SQ
Sbjct: 241 VKKFQIKFNEAMEKAMKEQSKGATFESEGQLAAIQAIRSISNWLFWQTATPVGPGSKQSQ 300

Query: 338 NLTLVVNTSTPQINKLQAAITARKQLKAKTEDKNGTTTKYI-KDDYWVPTTLALPVSTQT 397
           +L    +  TP+INKLQAA TARKQ++ K ED+N TTTKY+ +DDYWVPTT ALP STQ+
Sbjct: 301 SLARSASKFTPEINKLQAAATARKQIREKAEDRNRTTTKYLYRDDYWVPTTFALPASTQS 360

Query: 398 PISTISNPRVETKRSKESKDLDFEVGGGDHTSPMRLALPVLISIVATTVIQQMVTDDGAS 457
           P + IS P VETK +K+S+ L  +V  G H SPMRL LPV ISI+AT +IQQMV +DGAS
Sbjct: 361 PNNPISKPSVETKPTKQSRGLGSDVSRGGHVSPMRLVLPVSISIIATAIIQQMVRNDGAS 420

Query: 458 GLKEHIGGSVALELVNSHWMQLIQRGVTWYVIGIAVIGMVEKIARKIR 501
            LKEH  GS+ALELVNSHWMQ+I  GVTWY+IG+AV+GM+E IARK R
Sbjct: 421 ELKEHAAGSMALELVNSHWMQVILTGVTWYIIGMAVMGMLEMIARKFR 464

BLAST of Cp4.1LG01g04080 vs. TrEMBL
Match: B9HGK8_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0007s07670g PE=4 SV=2)

HSP 1 Score: 497.3 bits (1279), Expect = 2.2e-137
Identity = 270/479 (56.37%), Positives = 339/479 (70.77%), Query Frame = 1

Query: 38  MTAAWLP-LYTPVVATKIQNPTRRKLGSYNFSTSKMLYGNTLACRAGSS---------IT 97
           M A  LP LYTP  +   +  T +   S+  ++S+     +L CRA SS         IT
Sbjct: 1   MPAGCLPSLYTPFASMTTRILTPKTFTSFPPTSSRKTCNYSLTCRATSSSSSSSSYSSIT 60

Query: 98  DFDLYDLLGIDNTSDSSRIKAAYRALQKQCHPDIAGPAGHDMAIILNEVYSVLSDPNSRL 157
           DFDLYDLLGID++SD S+IK AYR LQK+CHPDIAGPAGHDMAIILNE YS+LSDPNSRL
Sbjct: 61  DFDLYDLLGIDSSSDHSQIKTAYRTLQKRCHPDIAGPAGHDMAIILNEAYSLLSDPNSRL 120

Query: 158 AYDKEQAKMAELRGYTGKSVYSVWLGSESERRAVFVDEVKCIGCLKCALFAGKTFAVESV 217
           AYDKEQAKMAELRGY+GK +YSVW GSESE+RAVFVDEVKC+GCLKCAL A KTFA+ES+
Sbjct: 121 AYDKEQAKMAELRGYSGKPIYSVWFGSESEQRAVFVDEVKCVGCLKCALIAEKTFAIESL 180

Query: 218 YGRARVVAQWADPEYKVMEAIEACPVDCISMVERSDLAALEFLMSKQPRGNVRVGMGNAA 277
           YGRARVVAQWADPE+K+  AI+ACPVDCIS VERSDLAALEFLMSKQPRG+VRVG GN A
Sbjct: 181 YGRARVVAQWADPEHKIQAAIDACPVDCISTVERSDLAALEFLMSKQPRGSVRVGGGNTA 240

Query: 278 GERVSNIFTDVKKFQTRFKEAMEKTSKQHVKGTTFENKAQLSAIQAIRSISNCLFWQTAT 337
           G RVSNIF DVKKFQ RF +AM K + Q+   +  + +A++SA QAIRSISN L+WQ+  
Sbjct: 241 GGRVSNIFIDVKKFQNRFVDAMNKANPQNSMESDLQREARISAFQAIRSISNWLYWQSPK 300

Query: 338 GSKHS----QNLTLVVNTS-TPQINKLQAAITARKQLKAKTEDKNGT-TTKYIKDDYWVP 397
           G   S    Q L  +V  S  P INK++ A  ARK+ +  T     T ++    D+YW P
Sbjct: 301 GRADSPESCQKLARIVRKSPQPNINKIREAAAARKKARENTRPFRQTPSSSLYYDEYWTP 360

Query: 398 TTLALPVSTQTPISTISNPRVETKRSKESKDLDFEVGGGD--HTSPMRLALPVLISIVAT 457
           +T  LP S     S+ S+   ET  +KE K L+ +  G +   T+P+R  +P++ +I+A 
Sbjct: 361 STQFLPASVN---SSSSSATPETSHAKEPKKLEKDNRGEEKRQTNPIRWEIPMVPAIIAA 420

Query: 458 TVIQQMVTDDGASGLKEHIGGSVALELVNSHWMQLIQRGVTWYVIGIAVIGMVEKIARK 499
            +I   V +     L EH+GGS ALE+VNS W+Q+   G+TWY+IG+++IG+VE I ++
Sbjct: 421 VIIHLQVGEGTVGRLNEHVGGSFALEIVNSSWLQVTLAGITWYLIGLSIIGVVEAIRKR 476

BLAST of Cp4.1LG01g04080 vs. TrEMBL
Match: A0A061E9Z1_THECC (DNAJ heat shock N-terminal domain-containing protein, putative OS=Theobroma cacao GN=TCM_007718 PE=4 SV=1)

HSP 1 Score: 496.1 bits (1276), Expect = 4.9e-137
Identity = 275/477 (57.65%), Positives = 347/477 (72.75%), Query Frame = 1

Query: 38  MTAAWLPLYTPV--VATKIQNPTRRKLGSYNFSTSKML--YGNTLACRAG----SSITDF 97
           MTA  LPLYTP   + TK   P   K  +++ STSK L  Y ++  C+A     SSI DF
Sbjct: 1   MTATCLPLYTPATSIITKSSTP---KPYTFSTSTSKKLPIYHHSFTCKASASPSSSIMDF 60

Query: 98  DLYDLLGIDNTSDSSRIKAAYRALQKQCHPDIAGPAGHDMAIILNEVYSVLSDPNSRLAY 157
           DLYDLLGID++S+ S+IK AYRALQK+CHPDIAGPAGHDMAIILNE YSVLSDP SRLAY
Sbjct: 61  DLYDLLGIDSSSNHSQIKTAYRALQKRCHPDIAGPAGHDMAIILNEAYSVLSDPGSRLAY 120

Query: 158 DKEQAKMAELRGYTGKSVYSVWLGSESERRAVFVDEVKCIGCLKCALFAGKTFAVESVYG 217
           DKEQAKMAELRGYTGK +YSVW GSESE+RAVFVDEVKC+GCLKCALFA KTFA+ES+YG
Sbjct: 121 DKEQAKMAELRGYTGKPLYSVWRGSESEQRAVFVDEVKCVGCLKCALFAEKTFAIESLYG 180

Query: 218 RARVVAQWADPEYKVMEAIEACPVDCISMVERSDLAALEFLMSKQPRGNVRVGMGNAAGE 277
           RARVVAQWAD E+K++EAIEACPVDCIS+VERSDLAALEFLMSKQPRGNVRVG+GN  G 
Sbjct: 181 RARVVAQWADSEHKILEAIEACPVDCISIVERSDLAALEFLMSKQPRGNVRVGVGNTVGA 240

Query: 278 RVSNIFTDVKKFQTRFKEAMEKTSKQHVKGTTFENKAQLSAIQAIRSISNCLFWQT---A 337
           RVSNIF DVKKFQTRF +AM+K + +  K      +A++SAI AI+SISN  +WQ+    
Sbjct: 241 RVSNIFVDVKKFQTRFVDAMDKAATKESKEADLRREARMSAIHAIKSISNWWYWQSPNAG 300

Query: 338 TGSKHSQ-NLTLV-VNTSTPQINKLQAAITARKQLKAKTED-KNGTTTKYI-KDDYWVPT 397
           T  + SQ +LT V   +S P INKL+ A  ARKQ +  +      T + Y+  D+YW+P+
Sbjct: 301 TPVEESQLSLTHVPPKSSAPNINKLRDAAAARKQARESSRTIGTRTPSSYLHHDEYWMPS 360

Query: 398 TLALPVSTQTPIST-ISNPRVETKRSKESKDLDFEVGGGDHTSPMRLALPVLISIVATTV 457
             +LP S     S+ +S+  ++T   KE+ D  FE       + +  A+P++ +I+A  +
Sbjct: 361 RQSLPASIHNSSSSKVSSKPLQTNERKETDDKIFEKDWRKR-NQVDWAIPMVAAIIAAVI 420

Query: 458 IQQMVTDDGASGLKEHIGGSVALELVNSHWMQLIQRGVTWYVIGIAVIGMVEKIARK 499
           ++Q V D     + EHIGGS+AL +VNS W+Q+I  G+TWY+IG A++ ++E I  +
Sbjct: 421 VRQQVGDRVVGEITEHIGGSLALTMVNSSWLQVILAGITWYLIGSAMVEVIETIRNR 473

BLAST of Cp4.1LG01g04080 vs. TrEMBL
Match: D7TEX4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0042g00960 PE=4 SV=1)

HSP 1 Score: 496.1 bits (1276), Expect = 4.9e-137
Identity = 271/481 (56.34%), Positives = 338/481 (70.27%), Query Frame = 1

Query: 38  MTAAWLPLYTPV--VATKIQNPTRRKLGSYNFSTSKMLYGN-TLACRAGSS--------I 97
           M +  LPLYTP   + T+I  P    L  ++ +T + L+GN ++ C+A SS        +
Sbjct: 1   MPSTCLPLYTPTSSIITRISTPRSAGLTFHHATTLRKLHGNNSITCKASSSSSSSSLSSL 60

Query: 98  TDFDLYDLLGIDNTSDSSRIKAAYRALQKQCHPDIAGPAGHDMAIILNEVYSVLSDPNSR 157
            DFDLYDLLGI+++SD  +IK AYR LQK+CHPDIAGPAGHDMAIILNEVYSVLSDPN R
Sbjct: 61  VDFDLYDLLGIESSSDQWQIKMAYRKLQKRCHPDIAGPAGHDMAIILNEVYSVLSDPNLR 120

Query: 158 LAYDKEQAKMAELRGYTGKSVYSVWLGSESERRAVFVDEVKCIGCLKCALFAGKTFAVES 217
           LAYDKEQAK+A LRGYTGK +YSVW GSESE RAVFVDEVKC+GCLKCALFA KTFA+ES
Sbjct: 121 LAYDKEQAKIARLRGYTGKPLYSVWYGSESEERAVFVDEVKCVGCLKCALFAEKTFAIES 180

Query: 218 VYGRARVVAQWADPEYKVMEAIEACPVDCISMVERSDLAALEFLMSKQPRGNVRVGMGNA 277
           VYGRARVVAQWADPEYK+ +AI+ACPVDCISMVERS+LAALEFLMSKQPRG+VR+  GNA
Sbjct: 181 VYGRARVVAQWADPEYKIQQAIDACPVDCISMVERSNLAALEFLMSKQPRGSVRMSAGNA 240

Query: 278 AGERVSNIFTDVKKFQTRFKEAMEKTSKQHVKGTTFENKAQLSAIQAIRSISNCLFWQTA 337
            G  VSNIF DVKKFQTRF +AM+K S    K    + +A++SAIQ IRSI+N L+WQ  
Sbjct: 241 VGACVSNIFVDVKKFQTRFHDAMDKASTHGSKEKDDQREARISAIQTIRSITNWLYWQAP 300

Query: 338 TGSKHS-QNLTLVV-NTSTPQINKLQAAITARKQLKAKTEDKNGTTTKYIKD-DYWVPTT 397
           TG   S Q+LT V    S P  NKL+ A  ARKQ +  TE ++ T   YI D +YWVP+T
Sbjct: 301 TGGSDSGQSLTRVAGRLSGPNFNKLRDAAAARKQARESTEPRSRTMPSYIYDAEYWVPST 360

Query: 398 LALPVSTQT------PISTISNPRVETKRSKESKDLDFEVGGGDHTSPMRLALPVLISIV 457
           LALP + Q         S+ S+P  +  + K  K  D  V   +  S     +P+  + +
Sbjct: 361 LALPATNQNNDLASKAASSESSPPSKQWKGKSKK--DHGVSKNNRRSSTIWQIPLATATI 420

Query: 458 ATTVIQQMVTDDGASGLKEHIGGSVALELVNSHWMQLIQRGVTWYVIGIAVIGMVEKIAR 499
           A  V++  + +     LKEHIGGS+AL +VNS W+Q++  GVTWY+IG  ++ ++E I  
Sbjct: 421 AAVVVRFQLGEGAVGELKEHIGGSLALYIVNSSWLQVVLAGVTWYLIGTYMVELLEVIRN 479

BLAST of Cp4.1LG01g04080 vs. TrEMBL
Match: V4VEG5_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031498mg PE=4 SV=1)

HSP 1 Score: 485.7 bits (1249), Expect = 6.7e-134
Identity = 262/460 (56.96%), Positives = 326/460 (70.87%), Query Frame = 1

Query: 41  AWLPLYTPVVATKIQNPTRRKLGSYNFSTSKMLYGNTLACRAGSSITDFDLYDLLGIDNT 100
           AWLPL+TP ++T  +N +  K      ++ K+   N++ C   S   DFDLYDLLGID++
Sbjct: 7   AWLPLFTPSISTITKNNSIPK------TSRKLSNSNSVTCCKASLNMDFDLYDLLGIDSS 66

Query: 101 SDSSRIKAAYRALQKQCHPDIAGPAGHDMAIILNEVYSVLSDPNSRLAYDKEQAKMAELR 160
           SD S+IK AYR LQK+CHPDIAG AGHDMAIILNE YSVLSDPNSRLAYDKEQAK A LR
Sbjct: 67  SDQSQIKTAYRMLQKRCHPDIAGSAGHDMAIILNEAYSVLSDPNSRLAYDKEQAKTAGLR 126

Query: 161 GYTGKSVYSVWLGSESERRAVFVDEVKCIGCLKCALFAGKTFAVESVYGRARVVAQWADP 220
           GYTGK +YSVW GSESE+RAVFVDEVKC+GCLKCALFAGKTFA+ES YGRARVVAQWADP
Sbjct: 127 GYTGKPIYSVWFGSESEQRAVFVDEVKCVGCLKCALFAGKTFAIESAYGRARVVAQWADP 186

Query: 221 EYKVMEAIEACPVDCISMVERSDLAALEFLMSKQPRGNVRVGMGNAAGERVSNIFTDVKK 280
           E+K++EAIE CPVDCIS+VERSDLAALE+LM+KQPRG VRVG GN AG RVSNIF DVKK
Sbjct: 187 EHKILEAIETCPVDCISIVERSDLAALEYLMAKQPRGTVRVGAGNTAGARVSNIFVDVKK 246

Query: 281 FQTRFKEAMEKTSKQHVKGTTFENKAQLSAIQAIRSISNCLFWQTATGSKHSQNLT-LVV 340
           FQT+++ AM+K +    K T    +A+LSAIQAIRSISN L WQ      + QNLT    
Sbjct: 247 FQTQYEGAMKKAAG---KETDTNWEARLSAIQAIRSISNWLHWQLPNAESY-QNLTRSKQ 306

Query: 341 NTSTPQINKLQAAITARKQLKAKTEDKNGTTTKYIKDDYWVPTTLALPVSTQTPIS-TIS 400
               P I KL  A  ARKQ    T  K+        D+YW P+T ALP +TQ+  S   +
Sbjct: 307 KLKEPNIKKLLDAAAARKQASQST--KSIPPNCMYHDEYWSPSTHALPDTTQSNRSFKAA 366

Query: 401 NPRVETKRSKESKDLDFEVGGGDHTSPMRLALPVLISIVATTVIQQMVTDDGASGLKEHI 460
           +     K  K+  D ++ V   +  SP+ L +P++ + +A  +++  V    + GLKEHI
Sbjct: 367 SKSPYNKEWKKPNDRNYSVREENRRSPIALGIPIVTAAIAAAMVRMQVDQGVSDGLKEHI 426

Query: 461 GGSVALELVNSHWMQLIQRGVTWYVIGIAVIGMVEKIARK 499
           GGS+AL ++NS W+Q++  G+TWY IG AV+ ++E I  +
Sbjct: 427 GGSLALIIINSSWLQVMLAGITWYFIGAAVVELIEVIGNR 454

BLAST of Cp4.1LG01g04080 vs. TAIR10
Match: AT5G23240.1 (AT5G23240.1 DNAJ heat shock N-terminal domain-containing protein)

HSP 1 Score: 442.6 bits (1137), Expect = 3.3e-124
Identity = 239/432 (55.32%), Positives = 301/432 (69.68%), Query Frame = 1

Query: 80  CRA---GSSITDFDLYDLLGIDNTSDSSRIKAAYRALQKQCHPDIAGPAGHDMAIILNEV 139
           CRA    SSITDFDLYDLLGID +SD S+IK+AYRALQK+CHPDIAG  GHDMAIILNE 
Sbjct: 37  CRATSSSSSITDFDLYDLLGIDRSSDKSQIKSAYRALQKRCHPDIAGDPGHDMAIILNEA 96

Query: 140 YSVLSDPNSRLAYDKEQAKMAELRGYTGKSVYSVWLGSESERRAVFVDEVKCIGCLKCAL 199
           Y +LSDP SR AYDKEQAK+ ELRGYTGK +YSVW G E+E+RA FVDEVKC+GCLKCAL
Sbjct: 97  YQLLSDPISRQAYDKEQAKLEELRGYTGKPIYSVWCGPETEQRAAFVDEVKCVGCLKCAL 156

Query: 200 FAGKTFAVESVYGRARVVAQWADPEYKVMEAIEACPVDCISMVERSDLAALEFLMSKQPR 259
            A KTFA+E+ YGRARVVAQWADPE K+ EAIEACPVDCISMVERSDLA LEFLMSKQPR
Sbjct: 157 CAEKTFAIETAYGRARVVAQWADPESKIKEAIEACPVDCISMVERSDLAPLEFLMSKQPR 216

Query: 260 GNVRVGMGNAAGERVSNIFTDVKKFQTRFKEAMEKTSKQHVKGTTFENKAQLSAIQAIRS 319
           GNVR+G+GN  GERVSN+F DVKKFQ R+ +AM +T+K+     T + + Q+SA++AIRS
Sbjct: 217 GNVRIGVGNTVGERVSNVFVDVKKFQERYAKAMSRTTKE-----TSQREVQISAVEAIRS 276

Query: 320 ISNCLFWQTATGSK---HSQNLTLVV----NTSTPQINKLQAAITARKQLKAKTEDKNGT 379
           ISN L+W+++  +K      N++L          P I KLQ  + A KQ       K   
Sbjct: 277 ISNWLYWRSSPYTKPLSPESNMSLTFTKRKKAVDPDIRKLQDVVAAMKQADQSGRTKEKG 336

Query: 380 TTKYIKDDYWVPTTLALPVS-TQTPISTISNPRVETKRSKESKDLDFEVGGGDHTSPMRL 439
           +   + +DYW P+  ALP S         SNP+V T+++  S++        ++    R+
Sbjct: 337 SAYLLGEDYWSPSNAALPSSGNNNGSKASSNPQV-TRKTFPSEEK--PTSRRENRRQFRI 396

Query: 440 -ALPVLISIVATTVIQQMVTDDGASGLKEHIGGSVALELVNSHWMQLIQRGVTWYVIGIA 499
              P+  +IVA  ++Q   +   AS L +HIGGS+AL +VNS W Q++  GVTWY IG  
Sbjct: 397 KKFPIGTAIVAVFLVQYQASYRAASELNDHIGGSLALSIVNSPWQQILLAGVTWYFIGAM 456

BLAST of Cp4.1LG01g04080 vs. TAIR10
Match: AT2G42750.1 (AT2G42750.1 DNAJ heat shock N-terminal domain-containing protein)

HSP 1 Score: 120.9 bits (302), Expect = 2.2e-27
Identity = 70/183 (38.25%), Positives = 99/183 (54.10%), Query Frame = 1

Query: 90  DLYDLLGIDNTSDSSRIKAAYRALQKQCHPDIAG--PAGHDMAIILNEVYSVLSDPNSRL 149
           D Y +LG+   +    IK AY    K CHPD++G  P   +  + +N++Y +LSDP  R+
Sbjct: 76  DYYAVLGLLPDATQEEIKKAYYNCMKSCHPDLSGNDPETTNFCMFINDIYEILSDPVQRM 135

Query: 150 AYDKEQAKMAELRGYTGKSVYSVWLGSESERRAVFVDEVKCIGCLKCALFAGKTFAVESV 209
            YD       E+ GYT  ++ + +L   + R  VFVDE  CIGC  CA  A   F +E  
Sbjct: 136 VYD-------EIHGYTVTAI-NPFLDDSTPRDHVFVDEFACIGCKNCANVAPDIFQIEED 195

Query: 210 YGRARVVAQWADPEYKVMEAIEACPVDCISMVERSDLAALEFLMSKQPRGNVRV---GMG 268
           +GRAR   Q  +P+  V +A+E CPVDCI     + L+ LE  M +  R NV +   GMG
Sbjct: 196 FGRARACNQRGNPDL-VQQAVETCPVDCIHQTSAAQLSLLEDEMRRVERVNVALMLSGMG 249

BLAST of Cp4.1LG01g04080 vs. TAIR10
Match: AT3G05345.1 (AT3G05345.1 Chaperone DnaJ-domain superfamily protein)

HSP 1 Score: 65.1 bits (157), Expect = 1.4e-10
Identity = 52/190 (27.37%), Positives = 85/190 (44.74%), Query Frame = 1

Query: 67  FSTSKMLYGNTLACRAGSSITDFDLYDLLGIDNTSDSSRIKAAYRALQKQCHPDIAGPAG 126
           +STS+    +         ++    Y +LG++ +  SS +KAA+RA  KQ HPD+     
Sbjct: 21  YSTSQRFIPSCRGKNREDPLSSSSPYSILGVEPSCSSSELKAAFRAKVKQYHPDVNKDGS 80

Query: 127 HDMAII--LNEVYSVLSDPNSRLAYDKEQAKMAELRGYTGKSVYSVWLGSESERRAVFVD 186
           +   +I  + + Y +L++      Y + +    E            +   E E   VFV+
Sbjct: 81  NSDIMIRRIIQAYEMLTN------YSRSEIIEGE--------CLDPFDHPECEALDVFVN 140

Query: 187 EVKCIG---CLKCALFAGKTFAVESVYGRARVVAQWADPEYKVMEAIEACPVDCISMVER 246
           EV C+G      C   A   F+ +S  G AR ++Q    +Y+V  A+  CP +CI  V  
Sbjct: 141 EVLCVGKRCSYPCFETASHVFSCDS-SGTARAMSQGHGEDYRVQSAVNQCPRNCIHYVTP 195

Query: 247 SDLAALEFLM 252
           S    LE L+
Sbjct: 201 SQRIILEELL 195

BLAST of Cp4.1LG01g04080 vs. TAIR10
Match: AT4G13830.2 (AT4G13830.2 DNAJ-like 20)

HSP 1 Score: 54.7 bits (130), Expect = 1.9e-07
Identity = 33/107 (30.84%), Positives = 51/107 (47.66%), Query Frame = 1

Query: 50  VATKIQNPTRRKLGSYNFSTSKMLYGNTLACRAGSSITDFDLYDLLGIDNTSDSSRIKAA 109
           + T I  PTR +  S     S++ + + +         D   YDLLG+  +     IK A
Sbjct: 32  IPTTISYPTRTRFSSTRIQ-SRLTHDDPV-----KQSEDLSFYDLLGVTESVTLPEIKQA 91

Query: 110 YRALQKQCHPDIAGP----AGHDMAIILNEVYSVLSDPNSRLAYDKE 153
           Y+ L ++ HPD++ P       D  I + E Y  LSDP  R+ YD++
Sbjct: 92  YKQLARKYHPDVSPPDRVEEYTDRFIRVQEAYETLSDPRRRVLYDRD 132

BLAST of Cp4.1LG01g04080 vs. TAIR10
Match: AT1G21080.3 (AT1G21080.3 DNAJ heat shock N-terminal domain-containing protein)

HSP 1 Score: 53.1 bits (126), Expect = 5.6e-07
Identity = 31/94 (32.98%), Positives = 46/94 (48.94%), Query Frame = 1

Query: 86  ITDFDLYDLLGIDNTSDSSRIKAAYRALQKQCHPDIAGP----AGHDMAIILNEVYSVLS 145
           + + + YD+LG+  T+  + IK AY    +Q HPD   P    A H+   +L E Y VLS
Sbjct: 2   VKETEFYDVLGVSPTATEAEIKKAYYIKARQVHPD-KNPNDPQAAHNFQ-VLGEAYQVLS 61

Query: 146 DPNSRLAYDKEQAKMAELRGYTGKSVYSVWLGSE 176
           DP  R AYD               +++++  GSE
Sbjct: 62  DPGQRQAYDTSGKSGISTEIIDPAAIFAMLFGSE 93

BLAST of Cp4.1LG01g04080 vs. NCBI nr
Match: gi|778700708|ref|XP_011654903.1| (PREDICTED: uncharacterized protein LOC101205271 [Cucumis sativus])

HSP 1 Score: 706.8 bits (1823), Expect = 2.6e-200
Identity = 363/468 (77.56%), Positives = 401/468 (85.68%), Query Frame = 1

Query: 38  MTAAWLPLYTPVVATKIQNPTRRKLGSYNFSTSKMLYGNTLACRAGSSITDFDLYDLLGI 97
           M+A+WLPLYTP      Q P +RKLGSYN STS+ L+GNTL+C+A SSITDFDLYDLLGI
Sbjct: 1   MSASWLPLYTPTA----QYPIQRKLGSYNPSTSRKLHGNTLSCKAASSITDFDLYDLLGI 60

Query: 98  DNTSDSSRIKAAYRALQKQCHPDIAGPAGHDMAIILNEVYSVLSDPNSRLAYDKEQAKMA 157
           DNTS  SRIKAAYRALQK CHPDIAGPAGHDMAIILNE YSVLSDP+SRLAYDKEQAKMA
Sbjct: 61  DNTSHPSRIKAAYRALQKHCHPDIAGPAGHDMAIILNEAYSVLSDPSSRLAYDKEQAKMA 120

Query: 158 ELRGYTGKSVYSVWLGSESERRAVFVDEVKCIGCLKCALFAGKTFAVESVYGRARVVAQW 217
           ELRGYTGK VYSVWLGSESE+RAVFVDEVKCIGCLKCALFAGKTFAVESVYGRARVVAQW
Sbjct: 121 ELRGYTGKPVYSVWLGSESEQRAVFVDEVKCIGCLKCALFAGKTFAVESVYGRARVVAQW 180

Query: 218 ADPEYKVMEAIEACPVDCISMVERSDLAALEFLMSKQPRGNVRVGMGNAAGERVSNIFTD 277
           ADPEYKVMEAIEACPVDCISMVER+DLAALEFLMSKQPRGNVRVGMGN AGERVSNIFTD
Sbjct: 181 ADPEYKVMEAIEACPVDCISMVERTDLAALEFLMSKQPRGNVRVGMGNTAGERVSNIFTD 240

Query: 278 VKKFQTRFKEAMEKTSKQHVKGTTFENKAQLSAIQAIRSISNCLFWQTAT----GSKHSQ 337
           VKKFQ +F EAMEK  K+  KG TFE++ QL+AIQAIRSISN LFWQTAT    GSK SQ
Sbjct: 241 VKKFQIKFNEAMEKAMKEQSKGATFESEGQLAAIQAIRSISNWLFWQTATPVGPGSKQSQ 300

Query: 338 NLTLVVNTSTPQINKLQAAITARKQLKAKTEDKNGTTTKYI-KDDYWVPTTLALPVSTQT 397
           +L    +  TP+INKLQAA TARKQ++ K ED+N TTTKY+ +DDYWVPTT ALP STQ+
Sbjct: 301 SLARSASKFTPEINKLQAAATARKQIREKAEDRNRTTTKYLYRDDYWVPTTFALPASTQS 360

Query: 398 PISTISNPRVETKRSKESKDLDFEVGGGDHTSPMRLALPVLISIVATTVIQQMVTDDGAS 457
           P + IS P VETK +K+S+ L  +V  G H SPMRL LPV ISI+AT +IQQMV +DGAS
Sbjct: 361 PNNPISKPSVETKPTKQSRGLGSDVSRGGHVSPMRLVLPVSISIIATAIIQQMVRNDGAS 420

Query: 458 GLKEHIGGSVALELVNSHWMQLIQRGVTWYVIGIAVIGMVEKIARKIR 501
            LKEH  GS+ALELVNSHWMQ+I  GVTWY+IG+AV+GM+E IARK R
Sbjct: 421 ELKEHAAGSMALELVNSHWMQVILTGVTWYIIGMAVMGMLEMIARKFR 464

BLAST of Cp4.1LG01g04080 vs. NCBI nr
Match: gi|659090183|ref|XP_008445880.1| (PREDICTED: uncharacterized protein LOC103488766 [Cucumis melo])

HSP 1 Score: 704.1 bits (1816), Expect = 1.7e-199
Identity = 364/468 (77.78%), Positives = 402/468 (85.90%), Query Frame = 1

Query: 38  MTAAWLPLYTPVVATKIQNPTRRKLGSYNFSTSKMLYGNTLACRAGSSITDFDLYDLLGI 97
           M+A+WLPLYTP   TK Q P +RKLGSYN STS+ L+ NTL+C+A SSITDFDLYDLLGI
Sbjct: 1   MSASWLPLYTP---TK-QYPIQRKLGSYNPSTSRKLHANTLSCKAASSITDFDLYDLLGI 60

Query: 98  DNTSDSSRIKAAYRALQKQCHPDIAGPAGHDMAIILNEVYSVLSDPNSRLAYDKEQAKMA 157
           DNTS+ SRIKAAYRALQK CHPDIAGPAGHDMAIILNE YSVLSDP+SRLAYDKEQAKMA
Sbjct: 61  DNTSNPSRIKAAYRALQKHCHPDIAGPAGHDMAIILNEAYSVLSDPSSRLAYDKEQAKMA 120

Query: 158 ELRGYTGKSVYSVWLGSESERRAVFVDEVKCIGCLKCALFAGKTFAVESVYGRARVVAQW 217
           ELRGYTGK VYSVWLGSESE+RAVFVDEVKCIGCLKCALFAGKTFAVESVYGRARVVAQW
Sbjct: 121 ELRGYTGKPVYSVWLGSESEQRAVFVDEVKCIGCLKCALFAGKTFAVESVYGRARVVAQW 180

Query: 218 ADPEYKVMEAIEACPVDCISMVERSDLAALEFLMSKQPRGNVRVGMGNAAGERVSNIFTD 277
           ADPEYKVMEAIEACPVDCISMVER+DLAALEFLMSKQPRGNVRVGMGN AGERVSNIFTD
Sbjct: 181 ADPEYKVMEAIEACPVDCISMVERTDLAALEFLMSKQPRGNVRVGMGNTAGERVSNIFTD 240

Query: 278 VKKFQTRFKEAMEKTSKQHVKGTTFENKAQLSAIQAIRSISNCLFWQTAT----GSKHSQ 337
           VKKFQ RF EAMEK +K+  KG TF+++ QL+AIQAIRSISN LFWQTAT    GSK SQ
Sbjct: 241 VKKFQIRFNEAMEKATKEQSKGATFDSEGQLAAIQAIRSISNWLFWQTATPVGPGSKQSQ 300

Query: 338 NLTLVVNTSTPQINKLQAAITARKQLKAKTEDKNGTTTKYI-KDDYWVPTTLALPVSTQT 397
           +LT   +  TP+INKLQAA+TARKQ++ K E +N  TTKY+ +DDYWVPTT ALP STQ 
Sbjct: 301 SLTRSASKFTPEINKLQAAVTARKQIREKAEGRNRITTKYLSRDDYWVPTTFALPASTQC 360

Query: 398 PISTISNPRVETKRSKESKDLDFEVGGGDHTSPMRLALPVLISIVATTVIQQMVTDDGAS 457
           P + IS P VETK +K+S+DL  +V  G H SPMRL LPV ISI+AT +IQQMV +DGA 
Sbjct: 361 PNNPISKPSVETKPTKQSRDLGSDVSSGVHVSPMRLILPVSISIIATAIIQQMVRNDGAD 420

Query: 458 GLKEHIGGSVALELVNSHWMQLIQRGVTWYVIGIAVIGMVEKIARKIR 501
            LKEH  GS+ALELVNSHWMQ+I  GVTWY+IG+AV GMVE IARK R
Sbjct: 421 ELKEHAAGSMALELVNSHWMQVILTGVTWYIIGMAVTGMVEMIARKFR 464

BLAST of Cp4.1LG01g04080 vs. NCBI nr
Match: gi|566180117|ref|XP_002310648.2| (hypothetical protein POPTR_0007s07670g [Populus trichocarpa])

HSP 1 Score: 497.3 bits (1279), Expect = 3.2e-137
Identity = 270/479 (56.37%), Positives = 339/479 (70.77%), Query Frame = 1

Query: 38  MTAAWLP-LYTPVVATKIQNPTRRKLGSYNFSTSKMLYGNTLACRAGSS---------IT 97
           M A  LP LYTP  +   +  T +   S+  ++S+     +L CRA SS         IT
Sbjct: 1   MPAGCLPSLYTPFASMTTRILTPKTFTSFPPTSSRKTCNYSLTCRATSSSSSSSSYSSIT 60

Query: 98  DFDLYDLLGIDNTSDSSRIKAAYRALQKQCHPDIAGPAGHDMAIILNEVYSVLSDPNSRL 157
           DFDLYDLLGID++SD S+IK AYR LQK+CHPDIAGPAGHDMAIILNE YS+LSDPNSRL
Sbjct: 61  DFDLYDLLGIDSSSDHSQIKTAYRTLQKRCHPDIAGPAGHDMAIILNEAYSLLSDPNSRL 120

Query: 158 AYDKEQAKMAELRGYTGKSVYSVWLGSESERRAVFVDEVKCIGCLKCALFAGKTFAVESV 217
           AYDKEQAKMAELRGY+GK +YSVW GSESE+RAVFVDEVKC+GCLKCAL A KTFA+ES+
Sbjct: 121 AYDKEQAKMAELRGYSGKPIYSVWFGSESEQRAVFVDEVKCVGCLKCALIAEKTFAIESL 180

Query: 218 YGRARVVAQWADPEYKVMEAIEACPVDCISMVERSDLAALEFLMSKQPRGNVRVGMGNAA 277
           YGRARVVAQWADPE+K+  AI+ACPVDCIS VERSDLAALEFLMSKQPRG+VRVG GN A
Sbjct: 181 YGRARVVAQWADPEHKIQAAIDACPVDCISTVERSDLAALEFLMSKQPRGSVRVGGGNTA 240

Query: 278 GERVSNIFTDVKKFQTRFKEAMEKTSKQHVKGTTFENKAQLSAIQAIRSISNCLFWQTAT 337
           G RVSNIF DVKKFQ RF +AM K + Q+   +  + +A++SA QAIRSISN L+WQ+  
Sbjct: 241 GGRVSNIFIDVKKFQNRFVDAMNKANPQNSMESDLQREARISAFQAIRSISNWLYWQSPK 300

Query: 338 GSKHS----QNLTLVVNTS-TPQINKLQAAITARKQLKAKTEDKNGT-TTKYIKDDYWVP 397
           G   S    Q L  +V  S  P INK++ A  ARK+ +  T     T ++    D+YW P
Sbjct: 301 GRADSPESCQKLARIVRKSPQPNINKIREAAAARKKARENTRPFRQTPSSSLYYDEYWTP 360

Query: 398 TTLALPVSTQTPISTISNPRVETKRSKESKDLDFEVGGGD--HTSPMRLALPVLISIVAT 457
           +T  LP S     S+ S+   ET  +KE K L+ +  G +   T+P+R  +P++ +I+A 
Sbjct: 361 STQFLPASVN---SSSSSATPETSHAKEPKKLEKDNRGEEKRQTNPIRWEIPMVPAIIAA 420

Query: 458 TVIQQMVTDDGASGLKEHIGGSVALELVNSHWMQLIQRGVTWYVIGIAVIGMVEKIARK 499
            +I   V +     L EH+GGS ALE+VNS W+Q+   G+TWY+IG+++IG+VE I ++
Sbjct: 421 VIIHLQVGEGTVGRLNEHVGGSFALEIVNSSWLQVTLAGITWYLIGLSIIGVVEAIRKR 476

BLAST of Cp4.1LG01g04080 vs. NCBI nr
Match: gi|296085315|emb|CBI29047.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 496.1 bits (1276), Expect = 7.1e-137
Identity = 271/481 (56.34%), Positives = 338/481 (70.27%), Query Frame = 1

Query: 38  MTAAWLPLYTPV--VATKIQNPTRRKLGSYNFSTSKMLYGN-TLACRAGSS--------I 97
           M +  LPLYTP   + T+I  P    L  ++ +T + L+GN ++ C+A SS        +
Sbjct: 1   MPSTCLPLYTPTSSIITRISTPRSAGLTFHHATTLRKLHGNNSITCKASSSSSSSSLSSL 60

Query: 98  TDFDLYDLLGIDNTSDSSRIKAAYRALQKQCHPDIAGPAGHDMAIILNEVYSVLSDPNSR 157
            DFDLYDLLGI+++SD  +IK AYR LQK+CHPDIAGPAGHDMAIILNEVYSVLSDPN R
Sbjct: 61  VDFDLYDLLGIESSSDQWQIKMAYRKLQKRCHPDIAGPAGHDMAIILNEVYSVLSDPNLR 120

Query: 158 LAYDKEQAKMAELRGYTGKSVYSVWLGSESERRAVFVDEVKCIGCLKCALFAGKTFAVES 217
           LAYDKEQAK+A LRGYTGK +YSVW GSESE RAVFVDEVKC+GCLKCALFA KTFA+ES
Sbjct: 121 LAYDKEQAKIARLRGYTGKPLYSVWYGSESEERAVFVDEVKCVGCLKCALFAEKTFAIES 180

Query: 218 VYGRARVVAQWADPEYKVMEAIEACPVDCISMVERSDLAALEFLMSKQPRGNVRVGMGNA 277
           VYGRARVVAQWADPEYK+ +AI+ACPVDCISMVERS+LAALEFLMSKQPRG+VR+  GNA
Sbjct: 181 VYGRARVVAQWADPEYKIQQAIDACPVDCISMVERSNLAALEFLMSKQPRGSVRMSAGNA 240

Query: 278 AGERVSNIFTDVKKFQTRFKEAMEKTSKQHVKGTTFENKAQLSAIQAIRSISNCLFWQTA 337
            G  VSNIF DVKKFQTRF +AM+K S    K    + +A++SAIQ IRSI+N L+WQ  
Sbjct: 241 VGACVSNIFVDVKKFQTRFHDAMDKASTHGSKEKDDQREARISAIQTIRSITNWLYWQAP 300

Query: 338 TGSKHS-QNLTLVV-NTSTPQINKLQAAITARKQLKAKTEDKNGTTTKYIKD-DYWVPTT 397
           TG   S Q+LT V    S P  NKL+ A  ARKQ +  TE ++ T   YI D +YWVP+T
Sbjct: 301 TGGSDSGQSLTRVAGRLSGPNFNKLRDAAAARKQARESTEPRSRTMPSYIYDAEYWVPST 360

Query: 398 LALPVSTQT------PISTISNPRVETKRSKESKDLDFEVGGGDHTSPMRLALPVLISIV 457
           LALP + Q         S+ S+P  +  + K  K  D  V   +  S     +P+  + +
Sbjct: 361 LALPATNQNNDLASKAASSESSPPSKQWKGKSKK--DHGVSKNNRRSSTIWQIPLATATI 420

Query: 458 ATTVIQQMVTDDGASGLKEHIGGSVALELVNSHWMQLIQRGVTWYVIGIAVIGMVEKIAR 499
           A  V++  + +     LKEHIGGS+AL +VNS W+Q++  GVTWY+IG  ++ ++E I  
Sbjct: 421 AAVVVRFQLGEGAVGELKEHIGGSLALYIVNSSWLQVVLAGVTWYLIGTYMVELLEVIRN 479

BLAST of Cp4.1LG01g04080 vs. NCBI nr
Match: gi|590689599|ref|XP_007043272.1| (DNAJ heat shock N-terminal domain-containing protein, putative [Theobroma cacao])

HSP 1 Score: 496.1 bits (1276), Expect = 7.1e-137
Identity = 275/477 (57.65%), Positives = 347/477 (72.75%), Query Frame = 1

Query: 38  MTAAWLPLYTPV--VATKIQNPTRRKLGSYNFSTSKML--YGNTLACRAG----SSITDF 97
           MTA  LPLYTP   + TK   P   K  +++ STSK L  Y ++  C+A     SSI DF
Sbjct: 1   MTATCLPLYTPATSIITKSSTP---KPYTFSTSTSKKLPIYHHSFTCKASASPSSSIMDF 60

Query: 98  DLYDLLGIDNTSDSSRIKAAYRALQKQCHPDIAGPAGHDMAIILNEVYSVLSDPNSRLAY 157
           DLYDLLGID++S+ S+IK AYRALQK+CHPDIAGPAGHDMAIILNE YSVLSDP SRLAY
Sbjct: 61  DLYDLLGIDSSSNHSQIKTAYRALQKRCHPDIAGPAGHDMAIILNEAYSVLSDPGSRLAY 120

Query: 158 DKEQAKMAELRGYTGKSVYSVWLGSESERRAVFVDEVKCIGCLKCALFAGKTFAVESVYG 217
           DKEQAKMAELRGYTGK +YSVW GSESE+RAVFVDEVKC+GCLKCALFA KTFA+ES+YG
Sbjct: 121 DKEQAKMAELRGYTGKPLYSVWRGSESEQRAVFVDEVKCVGCLKCALFAEKTFAIESLYG 180

Query: 218 RARVVAQWADPEYKVMEAIEACPVDCISMVERSDLAALEFLMSKQPRGNVRVGMGNAAGE 277
           RARVVAQWAD E+K++EAIEACPVDCIS+VERSDLAALEFLMSKQPRGNVRVG+GN  G 
Sbjct: 181 RARVVAQWADSEHKILEAIEACPVDCISIVERSDLAALEFLMSKQPRGNVRVGVGNTVGA 240

Query: 278 RVSNIFTDVKKFQTRFKEAMEKTSKQHVKGTTFENKAQLSAIQAIRSISNCLFWQT---A 337
           RVSNIF DVKKFQTRF +AM+K + +  K      +A++SAI AI+SISN  +WQ+    
Sbjct: 241 RVSNIFVDVKKFQTRFVDAMDKAATKESKEADLRREARMSAIHAIKSISNWWYWQSPNAG 300

Query: 338 TGSKHSQ-NLTLV-VNTSTPQINKLQAAITARKQLKAKTED-KNGTTTKYI-KDDYWVPT 397
           T  + SQ +LT V   +S P INKL+ A  ARKQ +  +      T + Y+  D+YW+P+
Sbjct: 301 TPVEESQLSLTHVPPKSSAPNINKLRDAAAARKQARESSRTIGTRTPSSYLHHDEYWMPS 360

Query: 398 TLALPVSTQTPIST-ISNPRVETKRSKESKDLDFEVGGGDHTSPMRLALPVLISIVATTV 457
             +LP S     S+ +S+  ++T   KE+ D  FE       + +  A+P++ +I+A  +
Sbjct: 361 RQSLPASIHNSSSSKVSSKPLQTNERKETDDKIFEKDWRKR-NQVDWAIPMVAAIIAAVI 420

Query: 458 IQQMVTDDGASGLKEHIGGSVALELVNSHWMQLIQRGVTWYVIGIAVIGMVEKIARK 499
           ++Q V D     + EHIGGS+AL +VNS W+Q+I  G+TWY+IG A++ ++E I  +
Sbjct: 421 VRQQVGDRVVGEITEHIGGSLALTMVNSSWLQVILAGITWYLIGSAMVEVIETIRNR 473

The following BLAST results are available for this feature:
Match Name  E-value  Identity  Description
DNAJ_BIFLO  8.1e-08  44.26     Chaperone protein DnaJ OS=Bifidobacterium longum (strain NCC 2705) GN=dnaJ PE=3 ...
DNAJ_DICT6  1.4e-07  42.42     Chaperone protein DnaJ OS=Dictyoglomus thermophilum (strain ATCC 35947 / DSM 396...
DNAJ_MARMM  2.4e-07  45.31     Chaperone protein DnaJ OS=Maricaulis maris (strain MCS10) GN=dnaJ PE=3 SV=1
DNAJ_PROMM  5.3e-07  44.44     Chaperone protein DnaJ OS=Prochlorococcus marinus (strain MIT 9313) GN=dnaJ PE=3...
DNAJ_CLOTH  5.3e-07  38.96     Chaperone protein DnaJ OS=Clostridium thermocellum (strain ATCC 27405 / DSM 1237...
Match Name        E-value   Identity  Description
A0A0A0KPY4_CUCSA  1.8e-200  77.56     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G175710 PE=4 SV=1
B9HGK8_POPTR      2.2e-137  56.37     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0007s07670g PE=4 SV=2
A0A061E9Z1_THECC  4.9e-137  57.65     DNAJ heat shock N-terminal domain-containing protein, putative OS=Theobroma caca...
D7TEX4_VITVI      4.9e-137  56.34     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0042g00960 PE=4 SV=...
V4VEG5_9ROSI      6.7e-134  56.96     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031498mg PE=4 SV=1
Match Name   E-value   Identity  Description
AT5G23240.1  3.3e-124  55.32     DNAJ heat shock N-terminal domain-containing protein
AT2G42750.1  2.2e-27   38.25     DNAJ heat shock N-terminal domain-containing protein
AT3G05345.1  1.4e-10   27.37     Chaperone DnaJ-domain superfamily protein
AT4G13830.2  1.9e-07   30.84     DNAJ-like 20
AT1G21080.3  5.6e-07   32.98     DNAJ heat shock N-terminal domain-containing protein
Match Name                        E-value   Identity  Description
gi|778700708|ref|XP_011654903.1|  2.6e-200  77.56     PREDICTED: uncharacterized protein LOC101205271 [Cucumis sativus]
gi|659090183|ref|XP_008445880.1|  1.7e-199  77.78     PREDICTED: uncharacterized protein LOC103488766 [Cucumis melo]
gi|566180117|ref|XP_002310648.2|  3.2e-137  56.37     hypothetical protein POPTR_0007s07670g [Populus trichocarpa]
gi|296085315|emb|CBI29047.3|      7.1e-137  56.34     unnamed protein product [Vitis vinifera]
gi|590689599|ref|XP_007043272.1|  7.1e-137  57.65     DNAJ heat shock N-terminal domain-containing protein, putative [Theobroma cacao]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001623  DnaJ_domain
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0009507      chloroplast
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g04080.1  Cp4.1LG01g04080.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description   Source   Source Term         Source Description               Alignment
IPR001623  DnaJ domain       GENE3D   G3DSA:1.10.287.110  -                                coord: 87..151, score: 9.6
IPR001623  DnaJ domain       PFAM     PF00226             DnaJ                             coord: 90..150, score: 1.0
IPR001623  DnaJ domain       SMART    SM00271             dnaj_3                           coord: 89..145, score: 4.2
IPR001623  DnaJ domain       PROFILE  PS50076             DNAJ_2                           coord: 90..153, score: 14
IPR001623  DnaJ domain       unknown  SSF46565            Chaperone J-domain               coord: 87..157, score: 4.84
None       No IPR available  GENE3D   G3DSA:3.30.70.20    -                                coord: 181..240, score: 3.8
None       No IPR available  PANTHER  PTHR24078           DNAJ HOMOLOG SUBFAMILY C MEMBER  coord: 453..461, score: 1.2E-13; coord: 88..153, score: 1.2
None       No IPR available  PFAM     PF13370             Fer4_13                          coord: 183..237, score: 8.2
None       No IPR available  unknown  SSF54862            4Fe-4S ferredoxins               coord: 180..240, score: 1.6

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG01g04080  Cp4.1LG01g10130  Cucurbita pepo (Zucchini)  cpecpeB374
The following block(s) are covering this gene:
Gene             Organism                    Block
Cp4.1LG01g04080  Cucurbita maxima (Rimu)     cmacpeB724
Cp4.1LG01g04080  Cucurbita moschata (Rifu)   cmocpeB676
Cp4.1LG01g04080  Wild cucumber (PI 183967)   cpecpiB445
Cp4.1LG01g04080  Cucumber (Chinese Long) v2  cpecuB444
Cp4.1LG01g04080  Watermelon (97103) v1       cpewmB379
Cp4.1LG01g04080  Cucumber (Chinese Long) v3  cpecucB0508
Cp4.1LG01g04080  Cucumber (Chinese Long) v3  cpecucB0550