Cp4.1LG13g11760 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG13g11760
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Late embryogenesis abundant protein (LEA) family protein
Location: Cp4.1LG13 : 9351777 .. 9353810 (+)
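The location field packs a sequence ID, a start, an end and a strand into one string. A minimal sketch of pulling those apart and deriving the gene span follows; the helper name `parse_location` is hypothetical, and the code assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this record does not state.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Cp4.1LG13 : 9351777 .. 9353810 (+)")

# Assumption: both endpoints are included, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)  # Cp4.1LG13 + 2034
```

Under that assumption the gene, including its intron, spans 2034 bp on the forward strand.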
The following sequences are available for this feature:

Gene sequence (with intron)

AGTTCTTTGGCCCATTTTCCATTCTGAGGAGAGAATCGATAAGCCACAGAGCTTGAAAGGAGGGAGAAGATGGCTTTCTTTCCCCTTCCATTTTATGGAAAGGTAAGAGAAATAAAATTAGTGGGACACGTGACAGGCAGCAGAAACCAAAGCAACCAAGTTTTGTTCTTACTTCAATTGCACACGTTGAGATACATTGTCGAAACATCTGTCCTTCTATATGTACTACGTGTCACTTCTGGCTGTGCACTATCTTCCTGGTTGCTTAATAAATAAATGTCTGCTACTTCTTTTCGACACCTTCAACTAAATCATCATGCCCAACCTATTCGCTTTGTGTCTCGTCATTACTTCTTTAACTGCGGCGGGACTCTGGTCTCCTTCCCCGGCGTCCCGGCAAGATCACGAGCAGGATGTTATTGTTAAAGAAGGTCACCGAGTGGTTGTGGTCGAGTATGGCGACCAAGGTCAACACAATACTAAGGTTTCCATCTCTTCCGAACCCACCAAAGATGCCTCCCCTAGCAACCCACTACATGATAGTTTAAATATTGGGATCCCCAACGAAGACTCCGAAAGGCACCGCACCAGAGATCTTATTTGCGATGCCTTGGGCAAATGTAAGCATAAGATAGCCAGTGCTGTGGGGAAGGCTAAAGTAATGGTTTCGGAGACGGCGCAGGAGGCCCACGACGTTGGAGAGGCTGTTGCTGGTGCTTTCGATGAAGCCAAAGAGACAGTTTCAGACAAATCTCACCACGTGGGAACGTCGTTCTCAGAGAAAGGGCATCGATTGAGGGAGTCGGTTGAGAAAGCCAGAGAGGATGCCGACGAGTTCCTGGAGAAAACGAAAGAGACGGTTTTGGAGAAAGCACGTGACTTGAAAGAGGGTGCAAAGGATGTATTGAAGGAAGGCAAAGCACGTGACTTGAAAGAGGGTGCAAAGGATGTATTGAAGGAAGGCAAAGCACGTGACTTGAAAGAGGGTGCAATGGAGAAAGGGAGAGAAGCAAGACAAACTGCGGAGAAGATTAAAACTGGTGGAAACAAGGTCAAGGAGAATCTGATGGGTATTCCTGATGGAGGATTAAAGCTAGTAAATGATTCCTTTAGGTACTTGAGGTCGTTAGAGTCGTGGAAAGCGGCGATGGATGTGTTGAGTCTGTTGGGATTTGGTATGGCTTTGGGAATGGGCGTGTGGACTACCTTCATCTCTAGCTATGTGCTGGCGAGTGCGTTGCCAAGGCAGCAGTTGGCAGTGGTACAGAGCAAGATATATCCCCTGTATTTTAGGGCCATGGCTTCCAGCATTGGGATGGCCCTATTCGGGCATCTATTCAGCCGCACAAAATGGATGTTCCCAATTCCGAAAAATGCTGAAGTGGTCCAAGGATATGTACTTGTGGCTGCACTTTTGATGATTTTTGCCAATTCTCTCTACATGGAGCCCCAAGCCACCAAGGTATCTTCTGACCGCTGCTCCACCTTGAGTTTTTTTAGTTTTGTACTAGTTCCCTGTTATTGTGTATTTATTCCTGTGCAGGTAATGTTTGAGAGATTAAAAGTGGAGAAGGAAGAAGGAAAAGGAATTGAAGACATAGCCGCTGAACCTCGGGACGCCAATGATAATCCCCCAGCAGTCACAACCAGCACAGCCACACAAGTCGTAGAACGAGAGGCCGTGAAGTCCAGAATCGTGGGGTTGAATAAGAGGCTGAAGAAGCTGAATTCGTATTCATCCTTGTTAAACCTGCTCACTCTGATGGCTCTCACCTGGCATCTTGTGTACCTGAGCCAGCGTCTGTGCATCCCCTGCTAATATAACAATATTTTTCTTTTCCTGGTTTTGTTTTTGATGTGTGCATATTTTACCTGAAGTAGACCGTGTTTAGGCGGTTGGTACCAAAGTTGGTTAGTTTCAACTTATTGTCGGTGCTAAGTGGTTATATAATGTATTTTAGATTGTTTATTTCCTGTCTAATGAACTATTTTAGATTGTT
TATTTCCTATTTAATGATTAGGTGCTAAGTTCCC

mRNA sequence

AGTTCTTTGGCCCATTTTCCATTCTGAGGAGAGAATCGATAAGCCACAGAGCTTGAAAGGAGGGAGAAGATGGCTTTCTTTCCCCTTCCATTTTATGGAAAGGTAAGAGAAATAAAATTAGTGGGACACGTGACAGGCAGCAGAAACCAAAGCAACCAAGTTTTGTTCTTACTTCAATTGCACACGTTGAGATACATTGTCGAAACATCTGTCCTTCTATATGTACTACGTGTCACTTCTGGCTGTGCACTATCTTCCTGGTTGCTTAATAAATAAATGTCTGCTACTTCTTTTCGACACCTTCAACTAAATCATCATGCCCAACCTATTCGCTTTGTGTCTCGTCATTACTTCTTTAACTGCGGCGGGACTCTGGTCTCCTTCCCCGGCGTCCCGGCAAGATCACGAGCAGGATGTTATTGTTAAAGAAGGTCACCGAGTGGTTGTGGTCGAGTATGGCGACCAAGGTCAACACAATACTAAGGTTTCCATCTCTTCCGAACCCACCAAAGATGCCTCCCCTAGCAACCCACTACATGATAGTTTAAATATTGGGATCCCCAACGAAGACTCCGAAAGGCACCGCACCAGAGATCTTATTTGCGATGCCTTGGGCAAATGTAAGCATAAGATAGCCAGTGCTGTGGGGAAGGCTAAAGTAATGGTTTCGGAGACGGCGCAGGAGGCCCACGACGTTGGAGAGGCTGTTGCTGGTGCTTTCGATGAAGCCAAAGAGACAGTTTCAGACAAATCTCACCACGTGGGAACGTCGTTCTCAGAGAAAGGGCATCGATTGAGGGAGTCGGTTGAGAAAGCCAGAGAGGATGCCGACGAGTTCCTGGAGAAAACGAAAGAGACGGTTTTGGAGAAAGCACGTGACTTGAAAGAGGGTGCAAAGGATGTATTGAAGGAAGGCAAAGCACGTGACTTGAAAGAGGGTGCAAAGGATGTATTGAAGGAAGGCAAAGCACGTGACTTGAAAGAGGGTGCAATGGAGAAAGGGAGAGAAGCAAGACAAACTGCGGAGAAGATTAAAACTGGTGGAAACAAGGTCAAGGAGAATCTGATGGGTATTCCTGATGGAGGATTAAAGCTAGTAAATGATTCCTTTAGGTACTTGAGGTCGTTAGAGTCGTGGAAAGCGGCGATGGATGTGTTGAGTCTGTTGGGATTTGGTATGGCTTTGGGAATGGGCGTGTGGACTACCTTCATCTCTAGCTATGTGCTGGCGAGTGCGTTGCCAAGGCAGCAGTTGGCAGTGGTACAGAGCAAGATATATCCCCTGTATTTTAGGGCCATGGCTTCCAGCATTGGGATGGCCCTATTCGGGCATCTATTCAGCCGCACAAAATGGATGTTCCCAATTCCGAAAAATGCTGAAGTGGTCCAAGGATATGTACTTGTGGCTGCACTTTTGATGATTTTTGCCAATTCTCTCTACATGGAGCCCCAAGCCACCAAGGTAATGTTTGAGAGATTAAAAGTGGAGAAGGAAGAAGGAAAAGGAATTGAAGACATAGCCGCTGAACCTCGGGACGCCAATGATAATCCCCCAGCAGTCACAACCAGCACAGCCACACAAGTCGTAGAACGAGAGGCCGTGAAGTCCAGAATCGTGGGGTTGAATAAGAGGCTGAAGAAGCTGAATTCGTATTCATCCTTGTTAAACCTGCTCACTCTGATGGCTCTCACCTGGCATCTTGTGTACCTGAGCCAGCGTCTGTGCATCCCCTGCTAATATAACAATATTTTTCTTTTCCTGGTTTTGTTTTTGATGTGTGCATATTTTACCTGAAGTAGACCGTGTTTAGGCGGTTGGTACCAAAGTTGGTTAGTTTCAACTTATTGTCGGTGCTAAGTGGTTATATAATGTATTTTAGATTGTTTATTTCCTGTCTAATGAACTATTTTAGATTGTTTATTTCCTATTTAATGATTAGGTGCTAAGTTCCC

Coding sequence (CDS)

ATGCCCAACCTATTCGCTTTGTGTCTCGTCATTACTTCTTTAACTGCGGCGGGACTCTGGTCTCCTTCCCCGGCGTCCCGGCAAGATCACGAGCAGGATGTTATTGTTAAAGAAGGTCACCGAGTGGTTGTGGTCGAGTATGGCGACCAAGGTCAACACAATACTAAGGTTTCCATCTCTTCCGAACCCACCAAAGATGCCTCCCCTAGCAACCCACTACATGATAGTTTAAATATTGGGATCCCCAACGAAGACTCCGAAAGGCACCGCACCAGAGATCTTATTTGCGATGCCTTGGGCAAATGTAAGCATAAGATAGCCAGTGCTGTGGGGAAGGCTAAAGTAATGGTTTCGGAGACGGCGCAGGAGGCCCACGACGTTGGAGAGGCTGTTGCTGGTGCTTTCGATGAAGCCAAAGAGACAGTTTCAGACAAATCTCACCACGTGGGAACGTCGTTCTCAGAGAAAGGGCATCGATTGAGGGAGTCGGTTGAGAAAGCCAGAGAGGATGCCGACGAGTTCCTGGAGAAAACGAAAGAGACGGTTTTGGAGAAAGCACGTGACTTGAAAGAGGGTGCAAAGGATGTATTGAAGGAAGGCAAAGCACGTGACTTGAAAGAGGGTGCAAAGGATGTATTGAAGGAAGGCAAAGCACGTGACTTGAAAGAGGGTGCAATGGAGAAAGGGAGAGAAGCAAGACAAACTGCGGAGAAGATTAAAACTGGTGGAAACAAGGTCAAGGAGAATCTGATGGGTATTCCTGATGGAGGATTAAAGCTAGTAAATGATTCCTTTAGGTACTTGAGGTCGTTAGAGTCGTGGAAAGCGGCGATGGATGTGTTGAGTCTGTTGGGATTTGGTATGGCTTTGGGAATGGGCGTGTGGACTACCTTCATCTCTAGCTATGTGCTGGCGAGTGCGTTGCCAAGGCAGCAGTTGGCAGTGGTACAGAGCAAGATATATCCCCTGTATTTTAGGGCCATGGCTTCCAGCATTGGGATGGCCCTATTCGGGCATCTATTCAGCCGCACAAAATGGATGTTCCCAATTCCGAAAAATGCTGAAGTGGTCCAAGGATATGTACTTGTGGCTGCACTTTTGATGATTTTTGCCAATTCTCTCTACATGGAGCCCCAAGCCACCAAGGTAATGTTTGAGAGATTAAAAGTGGAGAAGGAAGAAGGAAAAGGAATTGAAGACATAGCCGCTGAACCTCGGGACGCCAATGATAATCCCCCAGCAGTCACAACCAGCACAGCCACACAAGTCGTAGAACGAGAGGCCGTGAAGTCCAGAATCGTGGGGTTGAATAAGAGGCTGAAGAAGCTGAATTCGTATTCATCCTTGTTAAACCTGCTCACTCTGATGGCTCTCACCTGGCATCTTGTGTACCTGAGCCAGCGTCTGTGCATCCCCTGCTAA

Protein sequence

MPNLFALCLVITSLTAAGLWSPSPASRQDHEQDVIVKEGHRVVVVEYGDQGQHNTKVSISSEPTKDASPSNPLHDSLNIGIPNEDSERHRTRDLICDALGKCKHKIASAVGKAKVMVSETAQEAHDVGEAVAGAFDEAKETVSDKSHHVGTSFSEKGHRLRESVEKAREDADEFLEKTKETVLEKARDLKEGAKDVLKEGKARDLKEGAKDVLKEGKARDLKEGAMEKGREARQTAEKIKTGGNKVKENLMGIPDGGLKLVNDSFRYLRSLESWKAAMDVLSLLGFGMALGMGVWTTFISSYVLASALPRQQLAVVQSKIYPLYFRAMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVLVAALLMIFANSLYMEPQATKVMFERLKVEKEEGKGIEDIAAEPRDANDNPPAVTTSTATQVVEREAVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRLCIPC
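The protein sequence is the conceptual translation of the coding sequence. As a rough sketch of that relationship, the snippet below translates the first and last codons of the CDS with the standard genetic code; the function name `translate` is hypothetical, and using the standard nuclear code is an assumption, since the record does not name a translation table.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino = CODON_TABLE[cds[i:i + 3].upper()]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)

# First nine codons and last four codons of the CDS listed above.
print(translate("ATGCCCAACCTATTCGCTTTGTGTCTC"))  # MPNLFALCL
print(translate("ATCCCCTGCTAA"))                 # IPC
```

Both fragments agree with the start (MPNLFALCL) and end (IPC) of the protein sequence shown above.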
BLAST of Cp4.1LG13g11760 vs. TrEMBL
Match: A0A0A0KSX1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G650480 PE=4 SV=1)

HSP 1 Score: 362.1 bits (928), Expect = 1.1e-96
Identity = 223/368 (60.60%), Positives = 267/368 (72.55%), Query Frame = 1

Query: 111 GKAKVMVSETAQEAHDVGEAVAGAFDE-AKETVSDKSHH-VGTSFSEKGHRLRESVEKAR 170
           G   V+V    Q  H+   +++   D+ AK +   ++   +   + +  H++  +VEKA+
Sbjct: 28  GHRMVVVEYDDQGQHNTKVSISSEPDQDAKNSERHRTKDLICDVYGKCKHKVASAVEKAK 87

Query: 171 EDADEFLEKTKETVLEKARDLKEGAKDVLKEGKARDLKEGAKDVLKEGKARDLKEGAMEK 230
               E  ++  + V E   D  +GAKD         LKEGAK+ L+  K+R+ K   + K
Sbjct: 88  VMVTETAQEAHD-VGESVTDAFDGAKD--------KLKEGAKETLEMAKSREEK---VVK 147

Query: 231 GRE--ARQTAEKIKTGGNKVKENLMGIPDGGLKLVNDSFRYLRSLESWKAAMDVLSLLGF 290
           G E  A++T EKIKTG NK+KENLMG+ D G K+++  FR+L         MD L LLGF
Sbjct: 148 GAERVAKETGEKIKTGENKLKENLMGLVDRGFKVIDYLFRHLGF------GMDALGLLGF 207

Query: 291 GMALGMGVWTTFISSYVLASALPRQQLAVVQSKIYPLYFRAMASSIGMALFGHLFSRTKW 350
            MALGMGVW TFISSYVLAS LPRQQL VVQSKIYP+YF+AMAS IGMAL GHLFSRT+W
Sbjct: 208 TMALGMGVWVTFISSYVLASVLPRQQLGVVQSKIYPVYFKAMASCIGMALLGHLFSRTEW 267

Query: 351 MFPIPKNAEVVQGYVLVAALLMIFANSLYMEPQATKVMFERLKVEKEEGKGIEDIAAEPR 410
            FPIPKN+EVVQGYVLVAALLMIFANSLYMEP+ATKVMFERLK+EKEEG+GIEDIA E  
Sbjct: 268 TFPIPKNSEVVQGYVLVAALLMIFANSLYMEPRATKVMFERLKIEKEEGRGIEDIAREET 327

Query: 411 -DANDNPPAVTTSTATQVVEREAVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYL 470
            +  DN PA+T+ST TQVV+RE VKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYL
Sbjct: 328 GNVIDNSPAITSSTPTQVVDREVVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYL 377

Query: 471 SQRLCIPC 474
           SQRLC PC
Sbjct: 388 SQRLCNPC 377
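Each HSP summary reports identity and positives as a fraction of the aligned length alongside a percentage. The sketch below shows how the two relate, using the figures from the HSP above; the helper names `hsp_fractions` and `percent` are hypothetical and are not part of any BLAST tool.

```python
import re

def hsp_fractions(line):
    """Return every 'matches/aligned_length' pair found on an HSP summary line."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)/(\d+)", line)]

def percent(matches, aligned_length):
    """Format matches / aligned_length as a percentage with two decimals."""
    return f"{100.0 * matches / aligned_length:.2f}"

line = "Identity = 223/368 (60.60%), Positives = 267/368 (72.55%)"
identity, positives = hsp_fractions(line)
print(percent(*identity), percent(*positives))  # 60.60 72.55
```

The denominator is the alignment length including gaps, not the length of either protein, which is why the percentages differ between hits covering different parts of the query.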

BLAST of Cp4.1LG13g11760 vs. TrEMBL
Match: A0A0D2SDT7_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_013G138700 PE=4 SV=1)

HSP 1 Score: 305.4 bits (781), Expect = 1.2e-79
Identity = 210/492 (42.68%), Positives = 273/492 (55.49%), Query Frame = 1

Query: 32  QDVIVKEGHRVVVVEYGDQGQHNTKVSISS---------------EPTKDASPSNPLHDS 91
            DVI+KEGHRV+VVEY   G+HNTKVSISS               E  KDA+ + P +  
Sbjct: 24  DDVILKEGHRVIVVEYDQDGKHNTKVSISSPSLHQQTDQGEYFGKETMKDAASALP-NVG 83

Query: 92  LNIGIPNEDSERHRTRDLICDALGKCKHKIASAVGKAKVMVSETAQEAHDVGEAVAGAFD 151
             I      S RH   +LICDA GKC  ++A+A+GKAK  VS+TA EA+ + +A +G   
Sbjct: 84  HGISQGKAGSGRHSPGELICDAFGKCTQRVATALGKAKDKVSDTAHEANKLKQAASGTAH 143

Query: 152 EAKETVSDK----SHHVGTSFSEKGHRLRESVEKAREDADEFLEKTKETVLEKARDLKEG 211
           EAKE   DK    +  V    SE  H  R+ V   +    + L K K  V++K +D+KE 
Sbjct: 144 EAKEKAKDKAWETAQEVREKVSESAHETRDKVADKKGAIGDALGKAKGAVVQKGQDVKER 203

Query: 212 AKDVLKEGKARDLKEGAKDVLK----------EGKARDLKEGAMEKGREARQTAEKIKTG 271
           AK+ +   KA++    AKD  K            +  +++E AME   EA + A K+KT 
Sbjct: 204 AKESI--DKAKEAATTAKDTAKTMGADIVTNTSEQVENVQEKAME---EAGRAANKVKTS 263

Query: 272 GNKVKENLMGIPDGGLKLVNDSFRYLRSLESWKAAMDVLSLLGFGMALGMGVWTTFISSY 331
            NK                 D  +Y+ S+E+    M +++LLG   A GM VW TFISSY
Sbjct: 264 ANKYL---------------DGLKYMTSMEALNTVMGIVNLLGLATAYGMSVWVTFISSY 323

Query: 332 VLASALPRQQLAVVQSKIYPLYFRAMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVL 391
           +LA  LPRQQ  VVQSKIYP+YFRAMA SIGMAL GHL    K     P   EV Q   L
Sbjct: 324 ILAGQLPRQQFGVVQSKIYPVYFRAMAYSIGMALLGHLLWHRKRSISSP--PEVFQAINL 383

Query: 392 VAALLMIFANSLYMEPQATKVMFERLKVEKEEGKGIEDIAAEPRDANDNP---------- 451
           +++L M+  N LY+EP+ATKVMFER+K+EKE+G+G  D  AE   A ++P          
Sbjct: 384 LSSLFMVLVNGLYLEPKATKVMFERMKMEKEDGRGRHDFVAEGSRATESPSVADPVAKNS 443

Query: 452 -----------PAVTTSTATQVVEREAVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWH 474
                      PA   + A    E+E +K  +  LN+RLKKLN+ SS+LN+LTLMALTWH
Sbjct: 444 RKGPSTAPAPAPAPAPAVAPTSSEQEVIKRTMGRLNERLKKLNTNSSMLNILTLMALTWH 492

BLAST of Cp4.1LG13g11760 vs. TrEMBL
Match: W9RNN9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_008289 PE=4 SV=1)

HSP 1 Score: 296.2 bits (757), Expect = 7.1e-77
Identity = 210/497 (42.25%), Positives = 298/497 (59.96%), Query Frame = 1

Query: 1   MPNLFALCLVITSLTAAGLWSPSPASRQDHEQDVIVKEGHRVVVVEYGDQGQHNTKVSIS 60
           M N+ +LCLV+TSL  AG+ SP+P  +Q++  D+IVKEGHRVVVVEY  +GQ  TKVSIS
Sbjct: 1   MMNVVSLCLVVTSLVTAGVLSPTP-KKQNNGDDLIVKEGHRVVVVEYDQEGQPITKVSIS 60

Query: 61  SEP-TKDASPSNPLHDSLNIGIPN-----------------EDSERHRTRDLICDALGKC 120
            E  T+    S+ L ++ ++ +PN                  D E    ++LICDA GKC
Sbjct: 61  PEDKTRQRFNSDKLKEAASV-LPNLGQGLSTPKADGGGEGEGDGEWRSPKELICDAYGKC 120

Query: 121 KHKIASAVGKAKVMVSE----TAQEAHDVGEAVAGAFDEAKETVSDKSHHVGTSFSEKGH 180
           KHKI  A+G+ K  VSE     A++  ++ E      ++AKE VS+K+        E GH
Sbjct: 121 KHKIVDAIGRTKEAVSEKAHDVAEKTKEMKEKAEDVVEKAKEAVSEKARGFAEKTRETGH 180

Query: 181 RLRESVEKAREDADEFLEKTKETVLEKARDLKEGA---KDVLKE--GKARD-LKEGAKDV 240
             +++ E+    A  F+EKTKE   E     K+ A   ++  +E  GKA++ ++  A++V
Sbjct: 181 EAQDATERK---AHVFIEKTKEAAHEAVEKKKKAAYRMEEAAEESYGKAKEAVRNKAQEV 240

Query: 241 LKEGKARDLKEGAMEKGREARQTAEKIKTGGNKVKENLMGIPDGGLKLVNDSFRYLRSLE 300
             EG+AR+  E   E  ++A+      KT    V  N+        K    +FR L + +
Sbjct: 241 --EGQARERAEKTWEAAKDAKDVG---KTFVKDVASNVTKFAATFRKQAGATFRELVTGK 300

Query: 301 SWKAAMDVLSLLGFGMALGMGVWTTFISSYVLASALPRQQLAVVQSKIYPLYFRAMASSI 360
           +    + V+ L+ F  A G  VW TFI SYVLA ALPRQQ  VVQSKIYP+YFR MA  I
Sbjct: 301 ALNPVVGVVYLVTFSTAYGTAVWETFILSYVLAGALPRQQFGVVQSKIYPVYFRTMAWGI 360

Query: 361 GMALFGHLFSRTKWMFPIPKNAEVVQGYVLVAALLMIFANSLYMEPQATKVMFERLKVEK 420
           G A+ G L +     F     AE  Q + L+A+L+++F N LY+EP+ATKVMFER++VEK
Sbjct: 361 GTAVLGLLVTGRGKAF--SSMAEKFQIFNLLASLVLVFVNMLYLEPRATKVMFERMRVEK 420

Query: 421 EEGKGIEDIAAEPRDANDNPPAVTTSTATQVVEREAVKSRIVGLNKRLKKLNSYSSLLNL 470
           EEG+G E++ AE   A +  PAV++  A +  E+EAV++RI+ LN RLKKLN++SS LN+
Sbjct: 421 EEGRGREELPAEQPSAAE--PAVSSMPA-ETAEQEAVRNRILSLNGRLKKLNTWSSFLNI 480

BLAST of Cp4.1LG13g11760 vs. TrEMBL
Match: A0A067KS45_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_10947 PE=4 SV=1)

HSP 1 Score: 288.5 bits (737), Expect = 1.5e-74
Identity = 218/526 (41.44%), Positives = 290/526 (55.13%), Query Frame = 1

Query: 3   NLFALCLVITSLTAAGLWSPSPASRQDH--EQDVIVKEGHRVVVVEYGDQ--GQHNTKVS 62
           NL AL LV+TSL  A +WSPSP  +  +  E+DVIVKEGHRV+VVE  D   GQHNTK+ 
Sbjct: 2   NLLALSLVLTSLLTAQVWSPSPTGKHQNQKEEDVIVKEGHRVIVVETFDDEGGQHNTKIR 61

Query: 63  IS---------------SEPTKDASPSNP-LHDSLNIGIPNEDSERHRTR------DLIC 122
           IS                E  K A+   P L   L+      DS  ++ +      +LIC
Sbjct: 62  ISPPQDSISSVGVIENAKEKVKQAAHVLPNLGQGLSSSGYGPDSGSYQDKITTGPGELIC 121

Query: 123 DALGKCKHKIASAVGKAKVMVSETAQEAHDVGEAVAGAFDEAKETVSDKSHHVGTSFSEK 182
           DA GKC+HKIA  + KAK  V +          A  GA  + KE +     H     +++
Sbjct: 122 DAFGKCRHKIAKVIDKAKEKVED----------AYGGATHQTKEII-----HEAKEIAQE 181

Query: 183 GH-RLRESVEKAREDADEFLEKTKETVLEKARDLKEGAKDVLKEG------KARDLKEGA 242
           G  R +    +A++  ++  EKTKETV  KA + KE  +D  ++       KA+++KE A
Sbjct: 182 GIIRKKRIAHEAKDKVEDACEKTKETVSHKAHEAKEVVEDAYEKAKDSGTQKAQEVKESA 241

Query: 243 KDVLKEGKARDLKEGAMEKGR-----EARQTAEKIKTGGNKVKENLMGIPDGGLKLVNDS 302
           K+ L   KA+D+ + A + G+       +  +EK     N V  +L           N +
Sbjct: 242 KESL--DKAKDIAKDAKDLGKTIGVDSVKNVSEKTGQATNIVYNSL-----------NRA 301

Query: 303 FRYLRSLESWKAAMDVLSLLGFGMALGMGVWTTFISSYVLASALPRQQLAVVQSKIYPLY 362
           FR++ S +     M V++LLGF +A GM  W TFISSYVLA+AL RQQ  +VQSKIYP+Y
Sbjct: 302 FRFMGSQKGIHFLMGVINLLGFSIAYGMCFWVTFISSYVLANALNRQQFGLVQSKIYPVY 361

Query: 363 FRAMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVLVAALLMIFANSLYMEPQATKVM 422
           FRAM   I +ALFGHL  R K+ F     AEV Q   L+ ++L++  N+LY+EP ATKVM
Sbjct: 362 FRAMGFCIAVALFGHLIGRMKFSF--TSKAEVFQAINLLLSVLLVLINALYLEPLATKVM 421

Query: 423 FERLKVEKEEGKGIEDIAAEPRDANDNP------------PAVTT--STATQVVE---RE 474
           F++LK+EKEEG+G E    E   A   P             A TT  S  T+V E    E
Sbjct: 422 FDKLKIEKEEGRGRETSTTESNRAEQEPVADASPATALGSTATTTGSSAPTKVPENRKEE 481

BLAST of Cp4.1LG13g11760 vs. TrEMBL
Match: A0A103P142_CYNCS (Uncharacterized protein (Fragment) OS=Cynara cardunculus var. scolymus GN=Ccrd_026525 PE=4 SV=1)

HSP 1 Score: 272.3 bits (695), Expect = 1.1e-69
Identity = 194/508 (38.19%), Positives = 286/508 (56.30%), Query Frame = 1

Query: 1   MPNLFALCLVITSLTAAGLWSPSPASRQDHEQDVIVKEGHRVVVVEYGDQGQHNTKVSIS 60
           M N+ AL LV+TSL    +WSP P    D+ ++VIVK GHRV+VVEY  +G  NTKV IS
Sbjct: 60  MMNILALGLVLTSLVTGAVWSPPPPP--DNREEVIVKGGHRVIVVEYEKEGDGNTKVLIS 119

Query: 61  S-EPTKDASPSNPLHDSLNIGIPNEDSERH--RTRDLICDALGKCKHKIASAVGKAKVMV 120
             +P+     S     S +   P +D+E    R R+L+CDA GKCKHKIA+A G+ K  V
Sbjct: 120 PHDPSASGVDSTDTCHSADGNPPVDDAEAKFSRPRELVCDAFGKCKHKIANAFGRTKDKV 179

Query: 121 SETAQEAHD-VGEAVAGAFDEAKETVSDKSHHVGTSFSE---KGHRLRESVEKAREDADE 180
           S+TAQ+  +   EA +GA  +AK+TV      +  + +E   KG +  + + +A+    E
Sbjct: 180 SDTAQDIEEHAKEAASGAVGKAKDTVYGYEERMKGAANEAFGKGKQAAKDINEAKSKLAE 239

Query: 181 FLEKTKETVLEKAR-----DLKEGAKDVLKEG------KARDLKEGAKDVLKEGKARDLK 240
            + +  E + +KA+     +  + AKD + +G       A D+ E  +D   + K+ D+ 
Sbjct: 240 KVSEKIEGIEDKAKGAAIGESLDNAKDSIAQGIGKAKKVAGDVMETVRDSATKAKSFDMV 299

Query: 241 EGAMEKGRE-ARQTAEKIKTGGNKVKE-------NLMGIPDGGL--------KLVNDSFR 300
           +     G +  R  + +++ G   V E       N+  +    L        +++ D F 
Sbjct: 300 DSPKRIGEDIQRNVSGRVEEGAEHVMEQAKEAAANVQKVGQKSLGEIISKLKEVMYDVFW 359

Query: 301 YLRSLESWKAAMDVLSLLGFGMALGMGVWTTFISSYVLASALPRQQLAVVQSKIYPLYFR 360
           Y+ S E   A + ++ +LGF  A GM +W TF+SSY+L   LPRQQ  +VQS+IYP+YF+
Sbjct: 360 YMVSPEKVDAVVGLIHMLGFSTAYGMCMWVTFVSSYILGRYLPRQQFGMVQSRIYPVYFK 419

Query: 361 AMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVLVAALLMIFANSLYMEPQATKVMFE 420
           AMA  +G AL GHL SR K      K  E+ QG  L++ALLM+  N +++EP++TKVMFE
Sbjct: 420 AMAYCVGAALLGHLVSRRKESLSSIK--EIFQGLSLLSALLMVLINMIWLEPRSTKVMFE 479

Query: 421 RLKVEKEEGKGI-----EDIAAEPRDANDNPPAVTTSTATQVVEREAVKSRIVGLNKRLK 470
           R+K+EKEEG+GI     E IA    D    PPA             A +  ++ +N++LK
Sbjct: 480 RMKIEKEEGRGIAGAVREGIADNGSDTVVRPPANVA----------AERQDVLRMNEKLK 539

BLAST of Cp4.1LG13g11760 vs. TAIR10
Match: AT1G72100.1 (AT1G72100.1 late embryogenesis abundant domain-containing protein / LEA domain-containing protein)

HSP 1 Score: 238.8 bits (608), Expect = 6.8e-63
Identity = 192/524 (36.64%), Positives = 271/524 (51.72%), Query Frame = 1

Query: 1   MPNLFALCLVITSLTAAGLWSPSPASRQDH-----EQDVIVKEGHRVV------------ 60
           M NL ALCLV+++L AA +WSPSPA    +     E +VIVK+GH VV            
Sbjct: 1   MTNLLALCLVLSTLLAAEVWSPSPAMTTHNTAVASEGEVIVKDGHHVVVVEYDRDGKTNT 60

Query: 61  -------VVEYGDQGQHNTKVSIS-----SEPTKDASPSNPLHDSLNIGIP---NEDSER 120
                    + G++ ++  ++  S      E  K+ +   P H    I  P   +E  + 
Sbjct: 61  RVSISPPSADQGEEKENEVEMGTSMFRNVKEKAKETASYLP-HVGQGISQPVMTDEARDH 120

Query: 121 HRTR-DLICDALGKCKHKIASAVGKAKVM----VSETAQE--------AHDVGEAVAGAF 180
           H T  ++ICDA GKC+ KIAS VG+AK      V ETA +        AHDV E V  A 
Sbjct: 121 HATAGEVICDAFGKCRQKIASVVGRAKDRTVDSVGETASDVREAAAHKAHDVKETVTHAA 180

Query: 181 DEAKETVSDKSHHVGTSFSEKGHRLRESVEKAREDADEFLEKTKETVLEKARDLKEGAKD 240
            + ++TV+D++ +     +EK H  +E V     DA       KE+V +KA D KE    
Sbjct: 181 RDVEDTVADQAQYAKGRVTEKAHDPKEGVAHKAHDA-------KESVADKAHDAKESVAQ 240

Query: 241 VLKEGKARDLKEGAKDVLKEGKARDLKEGAMEKGREARQTA-EKIKTGGNKVKENLMGIP 300
                KA D KE  ++     KA D+KE   +K  E+++ A ++++    ++KE      
Sbjct: 241 -----KAHDAKEKVRE-----KAHDVKETVAQKAHESKERAKDRVREKAQELKETATHKS 300

Query: 301 DGGLKLVNDSFRYLRS-----LESWKAAMDVLSLLGFGMALGMGVWTTFISSYVLASALP 360
               + V +  R   S     L   K A  ++ L G   A G  VW TF+SSYVLAS L 
Sbjct: 301 KNAWERVKNGAREFGSATAATLSPTKVA-SIVGLTGIAAAFGTSVWVTFVSSYVLASVLG 360

Query: 361 RQQLAVVQSKIYPLYFRAMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVLVAALLMI 420
           RQQ  VVQSK+YP+YF+A +  I + LFGH+ SR + +  +    E+ QG  L+++  MI
Sbjct: 361 RQQFGVVQSKLYPVYFKATSVGILVGLFGHVLSRRRKL--LTDATEMWQGVNLLSSFFMI 420

Query: 421 FANSLYMEPQATKVMFERLKVEKEEGKGIEDIAAEPRDANDNPPAVTTSTATQVVEREAV 474
            AN  ++EP+ATK MFER+K EKEEG+G E                   T+ Q + R   
Sbjct: 421 EANKSFVEPRATKAMFERMKAEKEEGRGGE------------------RTSEQELRR--- 480

BLAST of Cp4.1LG13g11760 vs. TAIR10
Match: AT1G22600.1 (AT1G22600.1 Late embryogenesis abundant protein (LEA) family protein)

HSP 1 Score: 140.6 bits (353), Expect = 2.5e-33
Identity = 112/364 (30.77%), Positives = 186/364 (51.10%), Query Frame = 1

Query: 119 ETAQEAHDVGEAVAGAFDEAKETVS------DKSHHVGTSFSEKGHRLRESVEKAREDAD 178
           E  Q++ D GE      +E +ET S      ++ HH     +  G  + +++ K +    
Sbjct: 58  EVDQKSRDEGEVFG---NEKRETASSLPEEEEREHH-----ATPGELICDAIGKCKHKLG 117

Query: 179 EFLEKTKETVLEKARDLKEGAKDVLKEGKARDLKEGAKDVLKEGKARDLKEGAMEKGREA 238
             L + K+     A DL +   ++    +A +++E      +E + + + E A +K    
Sbjct: 118 TVLGRVKDRT---ASDLSDETPEMTVAREALEVEEKVSWKAREARGK-VNERATKKAHRV 177

Query: 239 RQTAEKIKT---GGNKVKENLMGIPDGGLKLVNDSFRYLRSLESWKAAMDVLSLLGFGMA 298
           ++  EK++    G   V    +G+   G                      V+ ++G   A
Sbjct: 178 QKVLEKVQIAVRGIGTVVATALGLTKIG---------------------SVVGIVGIAAA 237

Query: 299 LGMGVWTTFISSYVLASALPRQQLAVVQSKIYPLYFRAMASSIGMALFGHLFSRTKWMFP 358
            GM VW TF+S YVLAS L  QQ  VVQSK+YP+YF+A++  I + L GH+  R + +F 
Sbjct: 238 YGMCVWVTFVSGYVLASVLGEQQFGVVQSKMYPVYFKAVSVGILVGLLGHVIGRRRKVF- 297

Query: 359 IPKNAEVVQGYVLVAALLMIFANSLYMEPQATKVMFERLKVEKEEGKGIEDIAAEPRDAN 418
                ++ Q   L++++LM+ AN+ ++  +ATK MFE +K EKE+G+G  D + + + + 
Sbjct: 298 -TDAVDMWQSVNLLSSILMVEANASFVYTRATKAMFELIKAEKEDGRGF-DTSDQSQSSE 357

Query: 419 DNPPAVTTSTATQVVEREAVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRL 474
                      T+  + + VK R+  L++R++KLN+YSS LNLLTLM+LTWH VYL  RL
Sbjct: 358 SAGRTRGKKKVTEKTDEDVVKQRLTKLSERMRKLNAYSSRLNLLTLMSLTWHFVYLGYRL 385

BLAST of Cp4.1LG13g11760 vs. NCBI nr
Match: gi|659119380|ref|XP_008459625.1| (PREDICTED: uncharacterized protein LOC103498695 [Cucumis melo])

HSP 1 Score: 525.4 bits (1352), Expect = 1.0e-145
Identity = 312/478 (65.27%), Positives = 351/478 (73.43%), Query Frame = 1

Query: 1   MPNLFALCLVITSLTAAGLWSPSPASRQDHEQDVIVKEGHRVVVVEYGDQGQHNTKVSIS 60
           M NLFA+ L+IT+LTAAGLWSP P       Q+VIVKEGHRVVVVEY DQGQHNTKVSIS
Sbjct: 17  MTNLFAMFLIITTLTAAGLWSPPPPP-----QNVIVKEGHRVVVVEYDDQGQHNTKVSIS 76

Query: 61  SEPTKDASPSNPLHDSLNIGIPNEDSERHRTRDLICDALGKCKHKIASAVGKAKVMVSET 120
           SEP  DA                ++SERHRT+DLICD  GKCKHK+ASAV KAKVMV+ET
Sbjct: 77  SEPDLDA----------------KNSERHRTKDLICDVYGKCKHKVASAVEKAKVMVTET 136

Query: 121 AQEAHDVGEAVAGAFDEAKETVSDKSHHVGTSFSEKGHRLRESVEKAREDADEFLEKTKE 180
           AQEAHDVGE+VAGAFDEAK                                D+  E  KE
Sbjct: 137 AQEAHDVGESVAGAFDEAK--------------------------------DKLKEGAKE 196

Query: 181 TVLEKARDLKEGAKDVLKE-GKARD-LKEGAKDVLKEGKARDLKEGAMEKGRE--ARQTA 240
           T  E    LKEGAK   +  G+A+D LKEGAK+ L+  K+R+ K   + KG E  A++T 
Sbjct: 197 TFGEAKDKLKEGAKGAKETFGEAKDKLKEGAKETLEMAKSREEK---VVKGAERVAKETG 256

Query: 241 EKIKTGGNKVKENLMGIPDGGLKLVNDSFRYLRSLESWKAAMDVLSLLGFGMALGMGVWT 300
           EKI+TG NK+KENLMG+ D G K++N  FR+L         MD L LLGF MALGMGVW 
Sbjct: 257 EKIQTGENKLKENLMGLVDRGFKVMNYLFRHLG------VGMDALGLLGFAMALGMGVWV 316

Query: 301 TFISSYVLASALPRQQLAVVQSKIYPLYFRAMASSIGMALFGHLFSRTKWMFPIPKNAEV 360
           TFISSYVLAS LPRQQL VVQSKIYP+YF+AMAS IGMAL GHLFSRT+W FPIPKN+EV
Sbjct: 317 TFISSYVLASVLPRQQLGVVQSKIYPVYFKAMASCIGMALLGHLFSRTEWKFPIPKNSEV 376

Query: 361 VQGYVLVAALLMIFANSLYMEPQATKVMFERLKVEKEEGKGIEDIAAEPR-DANDNPPAV 420
           VQGYVLVAALLMIFANSLYMEP+ATKVMFERLK+EKEEG+GIEDI  E   +  DN PA+
Sbjct: 377 VQGYVLVAALLMIFANSLYMEPRATKVMFERLKIEKEEGRGIEDIGREETVNVIDNSPAI 432

Query: 421 TTSTATQVVEREAVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRLCIPC 474
           T+ST TQ+V+RE VKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRLC PC
Sbjct: 437 TSSTPTQIVDREVVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRLCNPC 432

BLAST of Cp4.1LG13g11760 vs. NCBI nr
Match: gi|778707978|ref|XP_004141640.2| (PREDICTED: uncharacterized protein LOC101208468 [Cucumis sativus])

HSP 1 Score: 362.1 bits (928), Expect = 1.5e-96
Identity = 223/368 (60.60%), Positives = 267/368 (72.55%), Query Frame = 1

Query: 111 GKAKVMVSETAQEAHDVGEAVAGAFDE-AKETVSDKSHH-VGTSFSEKGHRLRESVEKAR 170
           G   V+V    Q  H+   +++   D+ AK +   ++   +   + +  H++  +VEKA+
Sbjct: 50  GHRMVVVEYDDQGQHNTKVSISSEPDQDAKNSERHRTKDLICDVYGKCKHKVASAVEKAK 109

Query: 171 EDADEFLEKTKETVLEKARDLKEGAKDVLKEGKARDLKEGAKDVLKEGKARDLKEGAMEK 230
               E  ++  + V E   D  +GAKD         LKEGAK+ L+  K+R+ K   + K
Sbjct: 110 VMVTETAQEAHD-VGESVTDAFDGAKD--------KLKEGAKETLEMAKSREEK---VVK 169

Query: 231 GRE--ARQTAEKIKTGGNKVKENLMGIPDGGLKLVNDSFRYLRSLESWKAAMDVLSLLGF 290
           G E  A++T EKIKTG NK+KENLMG+ D G K+++  FR+L         MD L LLGF
Sbjct: 170 GAERVAKETGEKIKTGENKLKENLMGLVDRGFKVIDYLFRHLGF------GMDALGLLGF 229

Query: 291 GMALGMGVWTTFISSYVLASALPRQQLAVVQSKIYPLYFRAMASSIGMALFGHLFSRTKW 350
            MALGMGVW TFISSYVLAS LPRQQL VVQSKIYP+YF+AMAS IGMAL GHLFSRT+W
Sbjct: 230 TMALGMGVWVTFISSYVLASVLPRQQLGVVQSKIYPVYFKAMASCIGMALLGHLFSRTEW 289

Query: 351 MFPIPKNAEVVQGYVLVAALLMIFANSLYMEPQATKVMFERLKVEKEEGKGIEDIAAEPR 410
            FPIPKN+EVVQGYVLVAALLMIFANSLYMEP+ATKVMFERLK+EKEEG+GIEDIA E  
Sbjct: 290 TFPIPKNSEVVQGYVLVAALLMIFANSLYMEPRATKVMFERLKIEKEEGRGIEDIAREET 349

Query: 411 -DANDNPPAVTTSTATQVVEREAVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYL 470
            +  DN PA+T+ST TQVV+RE VKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYL
Sbjct: 350 GNVIDNSPAITSSTPTQVVDREVVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYL 399

Query: 471 SQRLCIPC 474
           SQRLC PC
Sbjct: 410 SQRLCNPC 399

BLAST of Cp4.1LG13g11760 vs. NCBI nr
Match: gi|700197506|gb|KGN52683.1| (hypothetical protein Csa_5G650480 [Cucumis sativus])

HSP 1 Score: 362.1 bits (928), Expect = 1.5e-96
Identity = 223/368 (60.60%), Positives = 267/368 (72.55%), Query Frame = 1

Query: 111 GKAKVMVSETAQEAHDVGEAVAGAFDE-AKETVSDKSHH-VGTSFSEKGHRLRESVEKAR 170
           G   V+V    Q  H+   +++   D+ AK +   ++   +   + +  H++  +VEKA+
Sbjct: 28  GHRMVVVEYDDQGQHNTKVSISSEPDQDAKNSERHRTKDLICDVYGKCKHKVASAVEKAK 87

Query: 171 EDADEFLEKTKETVLEKARDLKEGAKDVLKEGKARDLKEGAKDVLKEGKARDLKEGAMEK 230
               E  ++  + V E   D  +GAKD         LKEGAK+ L+  K+R+ K   + K
Sbjct: 88  VMVTETAQEAHD-VGESVTDAFDGAKD--------KLKEGAKETLEMAKSREEK---VVK 147

Query: 231 GRE--ARQTAEKIKTGGNKVKENLMGIPDGGLKLVNDSFRYLRSLESWKAAMDVLSLLGF 290
           G E  A++T EKIKTG NK+KENLMG+ D G K+++  FR+L         MD L LLGF
Sbjct: 148 GAERVAKETGEKIKTGENKLKENLMGLVDRGFKVIDYLFRHLGF------GMDALGLLGF 207

Query: 291 GMALGMGVWTTFISSYVLASALPRQQLAVVQSKIYPLYFRAMASSIGMALFGHLFSRTKW 350
            MALGMGVW TFISSYVLAS LPRQQL VVQSKIYP+YF+AMAS IGMAL GHLFSRT+W
Sbjct: 208 TMALGMGVWVTFISSYVLASVLPRQQLGVVQSKIYPVYFKAMASCIGMALLGHLFSRTEW 267

Query: 351 MFPIPKNAEVVQGYVLVAALLMIFANSLYMEPQATKVMFERLKVEKEEGKGIEDIAAEPR 410
            FPIPKN+EVVQGYVLVAALLMIFANSLYMEP+ATKVMFERLK+EKEEG+GIEDIA E  
Sbjct: 268 TFPIPKNSEVVQGYVLVAALLMIFANSLYMEPRATKVMFERLKIEKEEGRGIEDIAREET 327

Query: 411 -DANDNPPAVTTSTATQVVEREAVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYL 470
            +  DN PA+T+ST TQVV+RE VKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYL
Sbjct: 328 GNVIDNSPAITSSTPTQVVDREVVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYL 377

Query: 471 SQRLCIPC 474
           SQRLC PC
Sbjct: 388 SQRLCNPC 377

BLAST of Cp4.1LG13g11760 vs. NCBI nr
Match: gi|823264374|ref|XP_012464930.1| (PREDICTED: uncharacterized protein LOC105783816 isoform X2 [Gossypium raimondii])

HSP 1 Score: 307.0 bits (785), Expect = 5.8e-80
Identity = 208/481 (43.24%), Positives = 273/481 (56.76%), Query Frame = 1

Query: 33  DVIVKEGHRVVVVEYGDQGQHNTKVSISS-----EPTKDASPSNPLHDSLNIGIPNEDSE 92
           DVI+KEGHRV+VVEY   G+HNTKVSISS     +  +DA+ + P +    I      S 
Sbjct: 25  DVILKEGHRVIVVEYDQDGKHNTKVSISSPSLHQQTDQDAASALP-NVGHGISQGKAGSG 84

Query: 93  RHRTRDLICDALGKCKHKIASAVGKAKVMVSETAQEAHDVGEAVAGAFDEAKETVSDK-- 152
           RH   +LICDA GKC  ++A+A+GKAK  VS+TA EA+ + +A +G   EAKE   DK  
Sbjct: 85  RHSPGELICDAFGKCTQRVATALGKAKDKVSDTAHEANKLKQAASGTAHEAKEKAKDKAW 144

Query: 153 --SHHVGTSFSEKGHRLRESVEKAREDADEFLEKTKETVLEKARDLKEGAKDVLKEGKAR 212
             +  V    SE  H  R+ V   +    + L K K  V++K +D+KE AK+ +   KA+
Sbjct: 145 ETAQEVREKVSESAHETRDKVADKKGAIGDALGKAKGAVVQKGQDVKERAKESI--DKAK 204

Query: 213 DLKEGAKDVLK----------EGKARDLKEGAMEKGREARQTAEKIKTGGNKVKENLMGI 272
           +    AKD  K            +  +++E AME   EA + A K+KT  NK        
Sbjct: 205 EAATTAKDTAKTMGADIVTNTSEQVENVQEKAME---EAGRAANKVKTSANKYL------ 264

Query: 273 PDGGLKLVNDSFRYLRSLESWKAAMDVLSLLGFGMALGMGVWTTFISSYVLASALPRQQL 332
                    D  +Y+ S+E+    M +++LLG   A GM VW TFISSY+LA  LPRQQ 
Sbjct: 265 ---------DGLKYMTSMEALNTVMGIVNLLGLATAYGMSVWVTFISSYILAGQLPRQQF 324

Query: 333 AVVQSKIYPLYFRAMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVLVAALLMIFANS 392
            VVQSKIYP+YFRAMA SIGMAL GHL    K     P   EV Q   L+++L M+  N 
Sbjct: 325 GVVQSKIYPVYFRAMAYSIGMALLGHLLWHRKRSISSP--PEVFQAINLLSSLFMVLVNG 384

Query: 393 LYMEPQATKVMFERLKVEKEEGKGIEDIAAEPRDANDNP--------------------- 452
           LY+EP+ATKVMFER+K+EKE+G+G  D  AE   A ++P                     
Sbjct: 385 LYLEPKATKVMFERMKMEKEDGRGRHDFVAEGSRATESPSVADPVAKNSRKGPSTAPAPA 444

Query: 453 PAVTTSTATQVVEREAVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWHLVYLSQRLCIP 474
           PA   + A    E+E +K  +  LN+RLKKLN+ SS+LN+LTLMALTWHLVYL QRL   
Sbjct: 445 PAPAPAVAPTSSEQEVIKRTMGRLNERLKKLNTNSSMLNILTLMALTWHLVYLGQRLTFN 482

BLAST of Cp4.1LG13g11760 vs. NCBI nr
Match: gi|823264372|ref|XP_012464929.1| (PREDICTED: uncharacterized protein LOC105783816 isoform X1 [Gossypium raimondii])

HSP 1 Score: 305.4 bits (781), Expect = 1.7e-79
Identity = 210/492 (42.68%), Positives = 273/492 (55.49%), Query Frame = 1

Query: 32  QDVIVKEGHRVVVVEYGDQGQHNTKVSISS---------------EPTKDASPSNPLHDS 91
            DVI+KEGHRV+VVEY   G+HNTKVSISS               E  KDA+ + P +  
Sbjct: 24  DDVILKEGHRVIVVEYDQDGKHNTKVSISSPSLHQQTDQGEYFGKETMKDAASALP-NVG 83

Query: 92  LNIGIPNEDSERHRTRDLICDALGKCKHKIASAVGKAKVMVSETAQEAHDVGEAVAGAFD 151
             I      S RH   +LICDA GKC  ++A+A+GKAK  VS+TA EA+ + +A +G   
Sbjct: 84  HGISQGKAGSGRHSPGELICDAFGKCTQRVATALGKAKDKVSDTAHEANKLKQAASGTAH 143

Query: 152 EAKETVSDK----SHHVGTSFSEKGHRLRESVEKAREDADEFLEKTKETVLEKARDLKEG 211
           EAKE   DK    +  V    SE  H  R+ V   +    + L K K  V++K +D+KE 
Sbjct: 144 EAKEKAKDKAWETAQEVREKVSESAHETRDKVADKKGAIGDALGKAKGAVVQKGQDVKER 203

Query: 212 AKDVLKEGKARDLKEGAKDVLK----------EGKARDLKEGAMEKGREARQTAEKIKTG 271
           AK+ +   KA++    AKD  K            +  +++E AME   EA + A K+KT 
Sbjct: 204 AKESI--DKAKEAATTAKDTAKTMGADIVTNTSEQVENVQEKAME---EAGRAANKVKTS 263

Query: 272 GNKVKENLMGIPDGGLKLVNDSFRYLRSLESWKAAMDVLSLLGFGMALGMGVWTTFISSY 331
            NK                 D  +Y+ S+E+    M +++LLG   A GM VW TFISSY
Sbjct: 264 ANKYL---------------DGLKYMTSMEALNTVMGIVNLLGLATAYGMSVWVTFISSY 323

Query: 332 VLASALPRQQLAVVQSKIYPLYFRAMASSIGMALFGHLFSRTKWMFPIPKNAEVVQGYVL 391
           +LA  LPRQQ  VVQSKIYP+YFRAMA SIGMAL GHL    K     P   EV Q   L
Sbjct: 324 ILAGQLPRQQFGVVQSKIYPVYFRAMAYSIGMALLGHLLWHRKRSISSP--PEVFQAINL 383

Query: 392 VAALLMIFANSLYMEPQATKVMFERLKVEKEEGKGIEDIAAEPRDANDNP---------- 451
           +++L M+  N LY+EP+ATKVMFER+K+EKE+G+G  D  AE   A ++P          
Sbjct: 384 LSSLFMVLVNGLYLEPKATKVMFERMKMEKEDGRGRHDFVAEGSRATESPSVADPVAKNS 443

Query: 452 -----------PAVTTSTATQVVEREAVKSRIVGLNKRLKKLNSYSSLLNLLTLMALTWH 474
                      PA   + A    E+E +K  +  LN+RLKKLN+ SS+LN+LTLMALTWH
Sbjct: 444 RKGPSTAPAPAPAPAPAVAPTSSEQEVIKRTMGRLNERLKKLNTNSSMLNILTLMALTWH 492

The following BLAST results are available for this feature:
Match Name        E-value  Identity  Description
A0A0A0KSX1_CUCSA  1.1e-96  60.60     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G650480 PE=4 SV=1
A0A0D2SDT7_GOSRA  1.2e-79  42.68     Uncharacterized protein OS=Gossypium raimondii GN=B456_013G138700 PE=4 SV=1
W9RNN9_9ROSA      7.1e-77  42.25     Uncharacterized protein OS=Morus notabilis GN=L484_008289 PE=4 SV=1
A0A067KS45_JATCU  1.5e-74  41.44     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_10947 PE=4 SV=1
A0A103P142_CYNCS  1.1e-69  38.19     Uncharacterized protein (Fragment) OS=Cynara cardunculus var. scolymus GN=Ccrd_0...

Match Name   E-value  Identity  Description
AT1G72100.1  6.8e-63  36.64     late embryogenesis abundant domain-containing protein / LEA domain-c...
AT1G22600.1  2.5e-33  30.77     Late embryogenesis abundant protein (LEA) family protein

Match Name                        E-value   Identity  Description
gi|659119380|ref|XP_008459625.1|  1.0e-145  65.27     PREDICTED: uncharacterized protein LOC103498695 [Cucumis melo]
gi|778707978|ref|XP_004141640.2|  1.5e-96   60.60     PREDICTED: uncharacterized protein LOC101208468 [Cucumis sativus]
gi|700197506|gb|KGN52683.1|       1.5e-96   60.60     hypothetical protein Csa_5G650480 [Cucumis sativus]
gi|823264374|ref|XP_012464930.1|  5.8e-80   43.24     PREDICTED: uncharacterized protein LOC105783816 isoform X2 [Gossypium raimondii]
gi|823264372|ref|XP_012464929.1|  1.7e-79   42.68     PREDICTED: uncharacterized protein LOC105783816 isoform X1 [Gossypium raimondii]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR025423  DUF4149
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG13g11760.1  Cp4.1LG13g11760.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                     Source   Source Term     Source Description                                             Alignment
IPR025423  Domain of unknown function DUF4149  PFAM     PF13664         DUF4149                                                        coord: 285..387, score: 7.9
None       No IPR available                    unknown  Coil            Coil                                                           coord: 183..203
None       No IPR available                    PANTHER  PTHR23241       LATE EMBRYOGENESIS ABUNDANT PLANTS LEA-RELATED                 coord: 258..473, score: 5.5
None       No IPR available                    PANTHER  PTHR23241:SF50  LATE EMBRYOGENESIS ABUNDANT DOMAIN-CONTAINING PROTEIN-RELATED  coord: 258..473, score: 5.5

The following gene(s) are orthologous to this gene:
Gene             Orthologue      Organism                      Block
Cp4.1LG13g11760  CmaCh15G000020  Cucurbita maxima (Rimu)       cmacpeB305
Cp4.1LG13g11760  CmoCh15G000010  Cucurbita moschata (Rifu)     cmocpeB268
Cp4.1LG13g11760  Cla011951       Watermelon (97103) v1         cpewmB197
Cp4.1LG13g11760  ClCG11G008550   Watermelon (Charleston Gray)  cpewcgB162

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:

None