Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
TTTTTCCTCTCTTTTATGGTTTTTCTCTTGCTCTAATGGCTTCTTCTTCCAAGTGCTCCGAGGGTACCAGTTGCTCGGGTTTGAGTTCTTCTTCTTCTTCTACTTGTTCGTCTTCTTCTTCCTCCTCCATGTCGTCTTATACGGCTGTGGCGGCGGATCAGATGGTCAAGGTTGAGATTGAGGCGGCGGAGGCTCTTGCGGATTTGGCGGTTTTGGCGGTGAGGGAGAGTGGAGGTGAAGCGAAATGGGGGAGTAAAGGGAAGGGGAAACGGGCCAGGAAGAAGGTTAAGAGCGAGTTGCCGACTTGGGGTCTTGTCGACTCTTTACCTAGTCGCGCGGATCTGGACCTTCGGATTCAGGTTGTTTTTTGCTTTTTCTTTTGTTGAGGTTTTGATTGGATTTCTTTTGTATTTGATTGCATTTTTTTTTGGGATTTTGGTGTTTTTATTTTCCACTTGTTAATGGTGTAGGAAATTGAGTGATGTTCTTACATGAGCCTTGGCATTGTGTAGTATGATAGTGTGCTGTAATCGAAGAGGCGCAATCACTTGACTGATACTATTTTTCTGAATTGGGAAGACTTCTGAAGCAAAACCTAACCTTGGAATGTAGCTATTTGATTTGCATTTACTCTGTATCCTTGAAATGTGTTTACCTAGTGGGTCCATAAGTTCACATTTCTGTACCTCTTTGGTTTTTAGATTCTAAGGAAGTAGGATTCAAATTGGTGAACCTTATCCAAGAAGTTTGGAAGTTTTGTTTTGGAGCAGAATTAAGAATATGGAGAAGACTGGAGCTTATGTTGGGAGCAGGATTAAGAATATGGAGATTTTTTTTCTTTGAATCTAAAACAAGAAGTGGGCCACTGTTAGATTAAGTGACACCATTTGTTAACTCATGAAAGCACATTAAAAGTTGAACAGGTTTGGAAAAGAAGTTTCCTACACAGTTGTGGGAACAGTTATTTTTCTAGTTTCTCCTACTCTTCAAATTATTGGTGATTGGATTGGACCAGAACTGCTGAGCTTTTGGCTTGTCTTTCCTGCAAGTCCATTTTTTTCCCAATTAGGTCCCGATTTCTGTTATCATGACTTATGTCATATTTATTGACGAGTAAAATTGTATTAATATGATGTCCATATATGGTGTTTGAGCAAAAACTTAAAGCATTGATACCAAGTTACATTACTTCTGCAGGCCTTGACAAAATACTTTTGTTTCTAGCAGGATAGAGGAGTGGTAAGTCATCAGCCATCAGAAAAGGAATGTACAAATCAATCCCACCCTGAGTGGGAAACGACCAGAAAGATGTTAAAGGTGGACAAGGAGGAGGCCGAATCACATAAAGTGAGTCCTACATGCACTACAAGCTACCCATTATTTGGCTGCAGAAGGTCAAGGCGTAATCTAACTGAGGTTATGCTTACTAACTGCTTCACACACGTTTTTACTTATTCTTTTCAATCATTTTCATAAGCGACTTTCTTCTTCAGGCTGAAAAGGAAGAAAGGAGAATACGAAGAATTTTAGCAAATAGAGAGTCAGCCCGGCAGACTATTCGGCGTAGGCAGGTTCAGAATTTCTCTTAAAGAAGTACTTTTTGGTTTTATTGTTCAAATTATGTATCACATATTCACGTGTCTAGTACATTGTTGTGAGATTTTCTTGTGCAAACGTGTTATCCGAGATGAAGGCCTTTTATTGAAAATACATCTCTTTTTCATTCTCTTGTTTGTGGATTTCTATATCTTACTCTAAACCATCTGTGGGATGTCTTTGATGATTAGTTGTACAATTGCAGTAGACAGGAGTTTGAGGATACAAACCAAAAATATAATGAAACTCTAAAAATGGTAATTAAACTAAATACCCCTCAAAACTAACAAATTTTCAACTTGATTGTATACTGTTGGAAGTTACTTGTCTGATTGACCCTACAGCACACTCATATTGTCAATGCAATGTAACTGGTGTAGACCTCTTCACAACCCCAACTTTAA
TGCTCAGGAACAAGTTCACTGGAAATATGCTTACTTGGGCTAGGGAAAGAAAATACCAGGTGTTCTTAATTTTAGACACCATCAAGAAATTAACCATGTGGGCTATTCTTAGGTTTTGATTTAAACTGCTAAAAAGAAAGTAGATGTTTCAAGCAGAGCTAGCTATACAAATCATTCTGAATCATTGCCTTACTGCCAGTCATCAGAATCTCCACAAGTGAATTGGTGGTTGGATGGCTTGGCATCTTTTGGGGTATTTTACTTGGTGATGGGCTAAGTTTTAAGATAATCAGTACTAAAATGAAACTATTAACATCAAACGTAATTAGAGAGTTCAAGTTACATTGTTACTCTATTGAAATCACTCCATGTAATAATATTTTATTTTGGTTGTTTATTGTGCAGGCTCTTTGCGAGGAGCTAACCAAAAAGGCTGCCGATTTAGCTTGGGAAAATGAAAATTTAAAGAGGGTAGGTATTCAAGAGCACCCCCCCCCCCCCCCCCCCAAAAAAAAAAAAGAAAAACAGCTGACCCATTTAAATTATTGAACTTTTCTATTTGTTGTAACCTTTATTTGCTTTTATCTTGTGATCTTTCAACAATCAAGGAAAAGGAGTTGGCCCTGAAGGAGTACCAATCTCTGGAGACTACCAACAAGGAATTAAAGGAACAGGTAGGATGGGGTTGCATTTAAGAAATGTGTATGGTCATGGTCTTACCAAATCTCCCGACGATTCACTTGCTCGCTCGTTTGTGTCTAGATGGCTCAAGCAGTAAAGCCCAAGGTGGAGGAAATCCCCGGAAACAATAGATCATCTCATGTTCAGATGCCTCCTTTACCTACCAACTACCCTCTTTTCTTGTTTAGTCGCCCTCCATATGCATCGTATTTTTGGCCGTCTGTGGTTCAACCTTCAGGTCCCTATCATGAACTACACAATGTCGTTGTCGTCCCTTCAAGTATTCATTTGCCTGCTGATAATAATGTTCCTGTGTCTGATTCTTCCCATGTACAAGAAAGCTTTCCGAATGTCAATGGCCTGAGAACACCCTTTTGTATACTACCTTGTTCTTGGTTGTTGCCTCATCATGATCATAGGAATCAACACAGTCCTCAAATCTCGTGTCCCACAGGAAATAATCAAGAGGATATTTATTCGAATTCCCAAAACAGTGCTTATACTTCAAAGGTTGTTGTGAATGCAGAAAGCAGACATTCTTCTTTGCCTTCAACTGAAGAGAAAAACGAAGCTCCTGACTTGAATGAAGCTCCCAATTTAAACGAAGCTTTGATTCCAAAGGATCAAACTCAGAACACAGTTGGAGTAGTTGTGGAGGGATTCGATGCTGATGCAAGAGCTCAAGTTAGAAAAGTGCTTTCTCCTGTAAGACTTGAATGTGTTGAACCCACATCTGCCATTAAACAAGATAACCGAAACGAAGATGATCACGGTCTGTCATCAAAAACTTGTGATGACTTCTGTGATTTTGAAAAAAAGCTTGAACCAGAGATTGTACCCTGTAAGAAAACCATAGATGCAATGGCTGCAGCTGAGGCAAGGAGGAGAAGAAAAGAACTAACAAAGTTAAAGAATCTTCATGCCCGTCAATTCCGTATGCATTCTTGATCTATATGTATGGCTGAAGAGTTTGGCAACTGTTCGTCGTCAACAATCTCATCTGTGTTAAGTCCTGCATTTCACTGGCGATTGTTGCCAGAGGCAAGCACAGAGCATTGAGACATAACCGAAGTTCTGGCTTTCTTTTTGAGGCATTCCCTTTGCTTTTGATGCTGAGCTCAGAGCATGTCAAGATCTTGCTGCTAGTTGTGGCAGTGATGCGGAAGAAAATTCACGCATCGGGGCGTTTTTTTCGTGTTTGTTCCGATGAATTTACTAACAAATGAGAGAGGTACTGATGACTTAGATGCTAATCACTTCCAAGAGCTTTGAGTTGGATTTGGAAGGGCATCAAAGAGGAGGGGAATGTAAGAATTTTG
TATTTAAATCTTTTATGGCTACTTCAAACACCCTTCATTGGGTTGGCTACAGTTCTTCAATAAGCTGAAGCATAAGAACACATCTTCTGATGAGCATGGAGATGTGCTATCTACTTTATTAAGAACCAGATAACAAATAAAAGATGTAGAAAGCCAAGTGGAAATATTACATGCTGTTTTCCTAAAAAATTTGAACAGAAATACTTCATTTAAATTATAGTTGTGGTCCTAGAGTTTTTCCAATTCTTTGAAATTGGTTTCTTATGTCCCTTTTAATAACAAGGGGAGGAAGGTAGATAACAATATAAAAATTTATAAGTAAAAGTAGTGTTTACTTTCGGGTGTAAATGTTAAAAAGA
mRNA sequence
TTTTTCCTCTCTTTTATGGTTTTTCTCTTGCTCTAATGGCTTCTTCTTCCAAGTGCTCCGAGGGTACCAGTTGCTCGGGTTTGAGTTCTTCTTCTTCTTCTACTTGTTCGTCTTCTTCTTCCTCCTCCATGTCGTCTTATACGGCTGTGGCGGCGGATCAGATGGTCAAGGTTGAGATTGAGGCGGCGGAGGCTCTTGCGGATTTGGCGGTTTTGGCGGTGAGGGAGAGTGGAGGTGAAGCGAAATGGGGGAGTAAAGGGAAGGGGAAACGGGCCAGGAAGAAGGTTAAGAGCGAGTTGCCGACTTGGGGTCTTGTCGACTCTTTACCTAGTCGCGCGGATCTGGACCTTCGGATTCAGGATAGAGGAGTGGTAAGTCATCAGCCATCAGAAAAGGAATGTACAAATCAATCCCACCCTGAGTGGGAAACGACCAGAAAGATGTTAAAGGTGGACAAGGAGGAGGCCGAATCACATAAAGTGAGTCCTACATGCACTACAAGCTACCCATTATTTGGCTGCAGAAGGTCAAGGCGTAATCTAACTGAGGCTGAAAAGGAAGAAAGGAGAATACGAAGAATTTTAGCAAATAGAGAGTCAGCCCGGCAGACTATTCGGCGTAGGCAGGCTCTTTGCGAGGAGCTAACCAAAAAGGCTGCCGATTTAGCTTGGGAAAATGAAAATTTAAAGAGGGAAAAGGAGTTGGCCCTGAAGGAGTACCAATCTCTGGAGACTACCAACAAGGAATTAAAGGAACAGATGGCTCAAGCAGTAAAGCCCAAGGTGGAGGAAATCCCCGGAAACAATAGATCATCTCATGTTCAGATGCCTCCTTTACCTACCAACTACCCTCTTTTCTTGTTTAGTCGCCCTCCATATGCATCGTATTTTTGGCCGTCTGTGGTTCAACCTTCAGGTCCCTATCATGAACTACACAATGTCGTTGTCGTCCCTTCAAGTATTCATTTGCCTGCTGATAATAATGTTCCTGTGTCTGATTCTTCCCATGTACAAGAAAGCTTTCCGAATGTCAATGGCCTGAGAACACCCTTTTGTATACTACCTTGTTCTTGGTTGTTGCCTCATCATGATCATAGGAATCAACACAGTCCTCAAATCTCGTGTCCCACAGGAAATAATCAAGAGGATATTTATTCGAATTCCCAAAACAGTGCTTATACTTCAAAGGTTGTTGTGAATGCAGAAAGCAGACATTCTTCTTTGCCTTCAACTGAAGAGAAAAACGAAGCTCCTGACTTGAATGAAGCTCCCAATTTAAACGAAGCTTTGATTCCAAAGGATCAAACTCAGAACACAGTTGGAGTAGTTGTGGAGGGATTCGATGCTGATGCAAGAGCTCAAGTTAGAAAAGTGCTTTCTCCTGTAAGACTTGAATGTGTTGAACCCACATCTGCCATTAAACAAGATAACCGAAACGAAGATGATCACGGTCTGTCATCAAAAACTTGTGATGACTTCTGTGATTTTGAAAAAAAGCTTGAACCAGAGATTGTACCCTGTAAGAAAACCATAGATGCAATGGCTGCAGCTGAGGCAAGGAGGAGAAGAAAAGAACTAACAAAGTTAAAGAATCTTCATGCCCGTCAATTCCGTATGCATTCTTGATCTATATGTATGGCTGAAGAGTTTGGCAACTGTTCGTCGTCAACAATCTCATCTGTGTTAAGTCCTGCATTTCACTGGCGATTGTTGCCAGAGGCAAGCACAGAGCATTGAGACATAACCGAAGTTCTGGCTTTCTTTTTGAGGCATTCCCTTTGCTTTTGATGCTGAGCTCAGAGCATGTCAAGATCTTGCTGCTAGTTGTGGCAGTGATGCGGAAGAAAATTCACGCATCGGGGCGTTTTTTTCGTGTTTGTTCCGATGAATTTACTAACAAATGAGAGAGGTACTGATGACTTAGATGCTAATCACTTCCAAGAGCTTTGAGTTGGATTTGGAAGGGCATCAAAGAGGAGGGGAATGTAAGAATTTTGTATT
TAAATCTTTTATGGCTACTTCAAACACCCTTCATTGGGTTGGCTACAGTTCTTCAATAAGCTGAAGCATAAGAACACATCTTCTGATGAGCATGGAGATGTGCTATCTACTTTATTAAGAACCAGATAACAAATAAAAGATGTAGAAAGCCAAGTGGAAATATTACATGCTGTTTTCCTAAAAAATTTGAACAGAAATACTTCATTTAAATTATAGTTGTGGTCCTAGAGTTTTTCCAATTCTTTGAAATTGGTTTCTTATGTCCCTTTTAATAACAAGGGGAGGAAGGTAGATAACAATATAAAAATTTATAAGTAAAAGTAGTGTTTACTTTCGGGTGTAAATGTTAAAAAGA
Coding sequence (CDS)
ATGGCTTCTTCTTCCAAGTGCTCCGAGGGTACCAGTTGCTCGGGTTTGAGTTCTTCTTCTTCTTCTACTTGTTCGTCTTCTTCTTCCTCCTCCATGTCGTCTTATACGGCTGTGGCGGCGGATCAGATGGTCAAGGTTGAGATTGAGGCGGCGGAGGCTCTTGCGGATTTGGCGGTTTTGGCGGTGAGGGAGAGTGGAGGTGAAGCGAAATGGGGGAGTAAAGGGAAGGGGAAACGGGCCAGGAAGAAGGTTAAGAGCGAGTTGCCGACTTGGGGTCTTGTCGACTCTTTACCTAGTCGCGCGGATCTGGACCTTCGGATTCAGGATAGAGGAGTGGTAAGTCATCAGCCATCAGAAAAGGAATGTACAAATCAATCCCACCCTGAGTGGGAAACGACCAGAAAGATGTTAAAGGTGGACAAGGAGGAGGCCGAATCACATAAAGTGAGTCCTACATGCACTACAAGCTACCCATTATTTGGCTGCAGAAGGTCAAGGCGTAATCTAACTGAGGCTGAAAAGGAAGAAAGGAGAATACGAAGAATTTTAGCAAATAGAGAGTCAGCCCGGCAGACTATTCGGCGTAGGCAGGCTCTTTGCGAGGAGCTAACCAAAAAGGCTGCCGATTTAGCTTGGGAAAATGAAAATTTAAAGAGGGAAAAGGAGTTGGCCCTGAAGGAGTACCAATCTCTGGAGACTACCAACAAGGAATTAAAGGAACAGATGGCTCAAGCAGTAAAGCCCAAGGTGGAGGAAATCCCCGGAAACAATAGATCATCTCATGTTCAGATGCCTCCTTTACCTACCAACTACCCTCTTTTCTTGTTTAGTCGCCCTCCATATGCATCGTATTTTTGGCCGTCTGTGGTTCAACCTTCAGGTCCCTATCATGAACTACACAATGTCGTTGTCGTCCCTTCAAGTATTCATTTGCCTGCTGATAATAATGTTCCTGTGTCTGATTCTTCCCATGTACAAGAAAGCTTTCCGAATGTCAATGGCCTGAGAACACCCTTTTGTATACTACCTTGTTCTTGGTTGTTGCCTCATCATGATCATAGGAATCAACACAGTCCTCAAATCTCGTGTCCCACAGGAAATAATCAAGAGGATATTTATTCGAATTCCCAAAACAGTGCTTATACTTCAAAGGTTGTTGTGAATGCAGAAAGCAGACATTCTTCTTTGCCTTCAACTGAAGAGAAAAACGAAGCTCCTGACTTGAATGAAGCTCCCAATTTAAACGAAGCTTTGATTCCAAAGGATCAAACTCAGAACACAGTTGGAGTAGTTGTGGAGGGATTCGATGCTGATGCAAGAGCTCAAGTTAGAAAAGTGCTTTCTCCTGTAAGACTTGAATGTGTTGAACCCACATCTGCCATTAAACAAGATAACCGAAACGAAGATGATCACGGTCTGTCATCAAAAACTTGTGATGACTTCTGTGATTTTGAAAAAAAGCTTGAACCAGAGATTGTACCCTGTAAGAAAACCATAGATGCAATGGCTGCAGCTGAGGCAAGGAGGAGAAGAAAAGAACTAACAAAGTTAAAGAATCTTCATGCCCGTCAATTCCGTATGCATTCTTGA
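The mRNA and CDS sequences differ only by the untranslated regions named in the legend (five_prime_UTR and three_prime_UTR). As a minimal sketch, assuming the CDS occurs exactly once as a contiguous substring of the mRNA, the UTR lengths can be recovered by locating the CDS inside the mRNA. The function name `utr_lengths` is hypothetical and not part of any database tooling.

```python
def utr_lengths(mrna: str, cds: str) -> tuple[int, int]:
    """Return (five_prime_UTR length, three_prime_UTR length).

    Assumes the CDS is a contiguous substring of the mRNA, which holds
    once introns have been spliced out.
    """
    mrna = mrna.upper()
    cds = cds.upper()
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found in mRNA")
    # Everything before the CDS is 5' UTR, everything after it is 3' UTR.
    five_prime = start
    three_prime = len(mrna) - (start + len(cds))
    return five_prime, three_prime
```

To apply it to this feature, join the mRNA lines above into one string and pass it together with the CDS.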
Protein sequence
MASSSKCSEGTSCSGLSSSSSSTCSSSSSSSMSSYTAVAADQMVKVEIEAAEALADLAVLAVRESGGEAKWGSKGKGKRARKKVKSELPTWGLVDSLPSRADLDLRIQDRGVVSHQPSEKECTNQSHPEWETTRKMLKVDKEEAESHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYQSLETTNKELKEQMAQAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRPPYASYFWPSVVQPSGPYHELHNVVVVPSSIHLPADNNVPVSDSSHVQESFPNVNGLRTPFCILPCSWLLPHHDHRNQHSPQISCPTGNNQEDIYSNSQNSAYTSKVVVNAESRHSSLPSTEEKNEAPDLNEAPNLNEALIPKDQTQNTVGVVVEGFDADARAQVRKVLSPVRLECVEPTSAIKQDNRNEDDHGLSSKTCDDFCDFEKKLEPEIVPCKKTIDAMAAAEARRRRKELTKLKNLHARQFRMHS
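The protein sequence is the translation of the CDS. The sketch below shows one way to check this, assuming the standard nuclear genetic code and a CDS that starts in frame 0. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as X.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGGCTTCTTCTTCCAAGTGCTCC"))  # MASSSKCS
```

The last five codons of the CDS, `CGTATGCATTCTTGA`, translate to `RMHS` followed by the stop codon, which matches the end of the protein sequence.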
Homology
BLAST of Lcy06g020270 vs. ExPASy Swiss-Prot
Match:
P23922 (Transcription factor HBP-1a OS=Triticum aestivum OX=4565 PE=2 SV=1)
HSP 1 Score: 47.0 bits (110), Expect = 7.8e-04
Identity = 36/100 (36.00%), Positives = 57/100 (57.00%), Query Frame = 0
Query: 173 EKEERRIRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYQSLE 232
E+E ++ +R L+NRESAR++ R+QA CEEL ++A L EN +L+ E + KEY+ L
Sbjct: 250 ERELKKQKRKLSNRESARRSRLRKQAECEELGQRAEALKSENSSLRIELDRIKKEYEELL 309
Query: 233 TTNKELKEQMAQA-------VKPKVEEIPGNNRSSHVQMP 266
+ N LK ++ ++ P + E N SH + P
Sbjct: 310 SKNTSLKAKLGESGGGGGSDAVPDMNERGDTNGGSHQKEP 349
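Each HSP reports identity and positives as a count over the alignment length, followed by the percentage. As a rough sketch, assuming the statistics line keeps the layout shown in this report, the values can be parsed and the percentages recomputed as follows. The regular expression accepts either spelling of the positives label, and `parse_hsp_stats` is a hypothetical helper name.

```python
import re

# Matches e.g. "Identity = 36/100 (36.00%), Positives = 57/100 (57.00%)".
STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Pos(?:i)?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP statistics line and recompute percentages."""
    match = STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    identical, length, positive, _ = (int(g) for g in match.groups())
    return {
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positive / length, 2),
    }
```

For the Swiss-Prot hit above, 36 identical positions out of 100 aligned positions gives 36.00% identity, and 57 out of 100 gives 57.00% positives.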
BLAST of Lcy06g020270 vs. ExPASy TrEMBL
Match:
A0A6J1F7T1 (uncharacterized protein LOC111441650 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111441650 PE=4 SV=1)
HSP 1 Score: 740.7 bits (1911), Expect = 4.2e-210
Identity = 415/533 (77.86%), Positives = 449/533 (84.24%), Query Frame = 0
Query: 1 MASSSKCSEGTSCSGLSSSSSSTCSSSSSSSMSSYTAVAADQMVKVEIEAAEALADLAVL 60
MASSSKCSE TSCSGLSSSS+ + SSSS + ADQMVKVEIEAAEALADLAVL
Sbjct: 1 MASSSKCSEATSCSGLSSSSTRSFSSSS---------MEADQMVKVEIEAAEALADLAVL 60
Query: 61 AVRESG---GEAKWGSK-GKGKRARKKVKSELPTWGLVDSLPSRADLDLRIQDRGVVSHQ 120
AVR+SG E KW K KGKRARK+VK+E PT VDSLPSRADLDLRIQDRGV+SH
Sbjct: 61 AVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDLRIQDRGVISHH 120
Query: 121 PSEKECTNQSHPEWETTRKMLKVDKEEAESHKVSPTCTTSYPLFGCRRSRRNLTEAEKEE 180
PSEKEC + SHPEWETT++M+K +K EAES K+ S+PLFGCRRSRRNLTEAEKEE
Sbjct: 121 PSEKECADHSHPEWETTKEMIKAEK-EAESPKL------SHPLFGCRRSRRNLTEAEKEE 180
Query: 181 RRIRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYQSLETTNK 240
RRIRR+LANRESARQTIRRRQALCE+LTKKA+DLAWENENLKREKELALKEYQSLE TNK
Sbjct: 181 RRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNK 240
Query: 241 ELKEQMAQAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRPPYASYFWPSVVQPSGPY 300
ELKEQ+A A +PK+EEIPGNNRSSHVQ PPLPTNYPLFLFSRPPYASYFWPSVVQPS PY
Sbjct: 241 ELKEQIAHA-EPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPY 300
Query: 301 HELHNVVVVPSSIHLPADNNVPVSDSSHVQESFPNVNGLRTPFCILPCSWLLPHHDHRNQ 360
H+LHNV VVP S+ P++N V VSDSSHVQE+F NV GLRTPFCI+PCSWLLPHHDHRNQ
Sbjct: 301 HDLHNVAVVPPSVRSPSNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQ 360
Query: 361 HSPQISCPTGNNQEDIYSNSQNSAYTSKVVVNAESRHSSLPSTEEKNEAPDLNEAPNLNE 420
S Q SCP GN QE IYSNSQNSAYTSKVVV AESRHSSLPS EEKNEA DLNEAP+L
Sbjct: 361 QSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPSL-- 420
Query: 421 ALIPKDQTQNTVGVVVEGFDADARAQVRKVLSPVRLECVEPTSAIKQDNRNEDDHGLSSK 480
K+ TQNTVGVVV+ F+AD R QVRKVLSPVRLEC+EPTS +KQD +EDD GLSS+
Sbjct: 421 ----KEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSR 480
Query: 481 TCDDFCDF-EKKLEPEIVPCKKTIDAMAAAEARRRRKELTKLKNLHARQFRMH 529
TCDD C EKK EPEIV CKKTIDAMAA EARRRRKELTKLKNLH R RMH
Sbjct: 481 TCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKLKNLHTRPCRMH 510
BLAST of Lcy06g020270 vs. ExPASy TrEMBL
Match:
A0A6J1J476 (uncharacterized protein LOC111481617 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111481617 PE=4 SV=1)
HSP 1 Score: 736.1 bits (1899), Expect = 1.0e-208
Identity = 414/533 (77.67%), Positives = 447/533 (83.86%), Query Frame = 0
Query: 1 MASSSKCSEGTSCSGLSSSSSSTCSSSSSSSMSSYTAVAADQMVKVEIEAAEALADLAVL 60
MASSSKCSE TSCSGLSSSS+ SSSSSSM ADQMVKVEIEAAEAL DLAVL
Sbjct: 1 MASSSKCSEATSCSGLSSSST---RSSSSSSME------ADQMVKVEIEAAEALEDLAVL 60
Query: 61 AVRESG---GEAKWGSKG-KGKRARKKVKSELPTWGLVDSLPSRADLDLRIQDRGVVSHQ 120
AVR+SG E KW KG KGKRARK+VK+E PT VDSLPSRADLDLRIQDRGV+SHQ
Sbjct: 61 AVRDSGVEPSETKWRIKGKKGKRARKEVKTESPTSAFVDSLPSRADLDLRIQDRGVISHQ 120
Query: 121 PSEKECTNQSHPEWETTRKMLKVDKEEAESHKVSPTCTTSYPLFGCRRSRRNLTEAEKEE 180
PSEKEC + SHPEWETT++M+K +K E ES K+ S+PLFGCRR RRNLTEAEKEE
Sbjct: 121 PSEKECADHSHPEWETTKEMIKAEK-EVESPKL------SHPLFGCRRPRRNLTEAEKEE 180
Query: 181 RRIRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYQSLETTNK 240
RRIRR+LANRESARQTIRRRQ LCE+LTKKA+DLAWENENLKREKELALKEYQSLE TNK
Sbjct: 181 RRIRRVLANRESARQTIRRRQTLCEDLTKKASDLAWENENLKREKELALKEYQSLEITNK 240
Query: 241 ELKEQMAQAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRPPYASYFWPSVVQPSGPY 300
ELKEQ+AQA +PK+EEIPGNNRSSHVQ PPLPTNYPLF FSRPPYASYFWPSVVQPS PY
Sbjct: 241 ELKEQIAQA-EPKMEEIPGNNRSSHVQTPPLPTNYPLFFFSRPPYASYFWPSVVQPSSPY 300
Query: 301 HELHNVVVVPSSIHLPADNNVPVSDSSHVQESFPNVNGLRTPFCILPCSWLLPHHDHRNQ 360
H+LHNV VVP S+ P++N V VSDSSHVQE+F NV GLRTPFCI+PCSWLLPHHDHRNQ
Sbjct: 301 HDLHNVAVVPPSVRSPSNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQ 360
Query: 361 HSPQISCPTGNNQEDIYSNSQNSAYTSKVVVNAESRHSSLPSTEEKNEAPDLNEAPNLNE 420
S Q SCP GN QE IYSNSQNSAYTSKVVV AESR SSLPS EEKNEA DLNEAP+L
Sbjct: 361 QSSQNSCPAGNIQECIYSNSQNSAYTSKVVVRAESRRSSLPSAEEKNEAHDLNEAPSL-- 420
Query: 421 ALIPKDQTQNTVGVVVEGFDADARAQVRKVLSPVRLECVEPTSAIKQDNRNEDDHGLSSK 480
KD TQNTVGVVV+ F+AD R +VRKVLSPVRLEC+EPTS +KQD +EDD GLSS+
Sbjct: 421 ----KDHTQNTVGVVVDRFEADTRDKVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSR 480
Query: 481 TCDDFCDF-EKKLEPEIVPCKKTIDAMAAAEARRRRKELTKLKNLHARQFRMH 529
TCDD C EKK EPE+V CKKTIDAMAA EARRRRKELTKLKNLH R RMH
Sbjct: 481 TCDDLCHLAEKKHEPELVSCKKTIDAMAATEARRRRKELTKLKNLHTRPCRMH 510
BLAST of Lcy06g020270 vs. ExPASy TrEMBL
Match:
A0A6J1F2W5 (uncharacterized protein LOC111441650 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111441650 PE=4 SV=1)
HSP 1 Score: 736.1 bits (1899), Expect = 1.0e-208
Identity = 415/534 (77.72%), Positives = 449/534 (84.08%), Query Frame = 0
Query: 1 MASSSKCSEGTSCSGLSSSSSSTCSSSSSSSMSSYTAVAADQMVKVEIEAAEALADLAVL 60
MASSSKCSE TSCSGLSSSS+ + SSSS + ADQMVKVEIEAAEALADLAVL
Sbjct: 1 MASSSKCSEATSCSGLSSSSTRSFSSSS---------MEADQMVKVEIEAAEALADLAVL 60
Query: 61 AVRESG---GEAKWGSK-GKGKRARKKVKSELPTWGLVDSLPSRADLDLRI-QDRGVVSH 120
AVR+SG E KW K KGKRARK+VK+E PT VDSLPSRADLDLRI QDRGV+SH
Sbjct: 61 AVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDLRIQQDRGVISH 120
Query: 121 QPSEKECTNQSHPEWETTRKMLKVDKEEAESHKVSPTCTTSYPLFGCRRSRRNLTEAEKE 180
PSEKEC + SHPEWETT++M+K +K EAES K+ S+PLFGCRRSRRNLTEAEKE
Sbjct: 121 HPSEKECADHSHPEWETTKEMIKAEK-EAESPKL------SHPLFGCRRSRRNLTEAEKE 180
Query: 181 ERRIRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYQSLETTN 240
ERRIRR+LANRESARQTIRRRQALCE+LTKKA+DLAWENENLKREKELALKEYQSLE TN
Sbjct: 181 ERRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITN 240
Query: 241 KELKEQMAQAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRPPYASYFWPSVVQPSGP 300
KELKEQ+A A +PK+EEIPGNNRSSHVQ PPLPTNYPLFLFSRPPYASYFWPSVVQPS P
Sbjct: 241 KELKEQIAHA-EPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSP 300
Query: 301 YHELHNVVVVPSSIHLPADNNVPVSDSSHVQESFPNVNGLRTPFCILPCSWLLPHHDHRN 360
YH+LHNV VVP S+ P++N V VSDSSHVQE+F NV GLRTPFCI+PCSWLLPHHDHRN
Sbjct: 301 YHDLHNVAVVPPSVRSPSNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRN 360
Query: 361 QHSPQISCPTGNNQEDIYSNSQNSAYTSKVVVNAESRHSSLPSTEEKNEAPDLNEAPNLN 420
Q S Q SCP GN QE IYSNSQNSAYTSKVVV AESRHSSLPS EEKNEA DLNEAP+L
Sbjct: 361 QQSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPSL- 420
Query: 421 EALIPKDQTQNTVGVVVEGFDADARAQVRKVLSPVRLECVEPTSAIKQDNRNEDDHGLSS 480
K+ TQNTVGVVV+ F+AD R QVRKVLSPVRLEC+EPTS +KQD +EDD GLSS
Sbjct: 421 -----KEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSS 480
Query: 481 KTCDDFCDF-EKKLEPEIVPCKKTIDAMAAAEARRRRKELTKLKNLHARQFRMH 529
+TCDD C EKK EPEIV CKKTIDAMAA EARRRRKELTKLKNLH R RMH
Sbjct: 481 RTCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKLKNLHTRPCRMH 511
BLAST of Lcy06g020270 vs. ExPASy TrEMBL
Match:
A0A6J1J5U4 (uncharacterized protein LOC111481617 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111481617 PE=4 SV=1)
HSP 1 Score: 731.5 bits (1887), Expect = 2.5e-207
Identity = 414/534 (77.53%), Positives = 447/534 (83.71%), Query Frame = 0
Query: 1 MASSSKCSEGTSCSGLSSSSSSTCSSSSSSSMSSYTAVAADQMVKVEIEAAEALADLAVL 60
MASSSKCSE TSCSGLSSSS+ SSSSSSM ADQMVKVEIEAAEAL DLAVL
Sbjct: 1 MASSSKCSEATSCSGLSSSST---RSSSSSSME------ADQMVKVEIEAAEALEDLAVL 60
Query: 61 AVRESG---GEAKWGSKG-KGKRARKKVKSELPTWGLVDSLPSRADLDLRI-QDRGVVSH 120
AVR+SG E KW KG KGKRARK+VK+E PT VDSLPSRADLDLRI QDRGV+SH
Sbjct: 61 AVRDSGVEPSETKWRIKGKKGKRARKEVKTESPTSAFVDSLPSRADLDLRIQQDRGVISH 120
Query: 121 QPSEKECTNQSHPEWETTRKMLKVDKEEAESHKVSPTCTTSYPLFGCRRSRRNLTEAEKE 180
QPSEKEC + SHPEWETT++M+K +K E ES K+ S+PLFGCRR RRNLTEAEKE
Sbjct: 121 QPSEKECADHSHPEWETTKEMIKAEK-EVESPKL------SHPLFGCRRPRRNLTEAEKE 180
Query: 181 ERRIRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYQSLETTN 240
ERRIRR+LANRESARQTIRRRQ LCE+LTKKA+DLAWENENLKREKELALKEYQSLE TN
Sbjct: 181 ERRIRRVLANRESARQTIRRRQTLCEDLTKKASDLAWENENLKREKELALKEYQSLEITN 240
Query: 241 KELKEQMAQAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRPPYASYFWPSVVQPSGP 300
KELKEQ+AQA +PK+EEIPGNNRSSHVQ PPLPTNYPLF FSRPPYASYFWPSVVQPS P
Sbjct: 241 KELKEQIAQA-EPKMEEIPGNNRSSHVQTPPLPTNYPLFFFSRPPYASYFWPSVVQPSSP 300
Query: 301 YHELHNVVVVPSSIHLPADNNVPVSDSSHVQESFPNVNGLRTPFCILPCSWLLPHHDHRN 360
YH+LHNV VVP S+ P++N V VSDSSHVQE+F NV GLRTPFCI+PCSWLLPHHDHRN
Sbjct: 301 YHDLHNVAVVPPSVRSPSNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRN 360
Query: 361 QHSPQISCPTGNNQEDIYSNSQNSAYTSKVVVNAESRHSSLPSTEEKNEAPDLNEAPNLN 420
Q S Q SCP GN QE IYSNSQNSAYTSKVVV AESR SSLPS EEKNEA DLNEAP+L
Sbjct: 361 QQSSQNSCPAGNIQECIYSNSQNSAYTSKVVVRAESRRSSLPSAEEKNEAHDLNEAPSL- 420
Query: 421 EALIPKDQTQNTVGVVVEGFDADARAQVRKVLSPVRLECVEPTSAIKQDNRNEDDHGLSS 480
KD TQNTVGVVV+ F+AD R +VRKVLSPVRLEC+EPTS +KQD +EDD GLSS
Sbjct: 421 -----KDHTQNTVGVVVDRFEADTRDKVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSS 480
Query: 481 KTCDDFCDF-EKKLEPEIVPCKKTIDAMAAAEARRRRKELTKLKNLHARQFRMH 529
+TCDD C EKK EPE+V CKKTIDAMAA EARRRRKELTKLKNLH R RMH
Sbjct: 481 RTCDDLCHLAEKKHEPELVSCKKTIDAMAATEARRRRKELTKLKNLHTRPCRMH 511
BLAST of Lcy06g020270 vs. ExPASy TrEMBL
Match:
A0A0A0LBD4 (BZIP domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G775260 PE=4 SV=1)
HSP 1 Score: 727.2 bits (1876), Expect = 4.8e-206
Identity = 417/534 (78.09%), Positives = 453/534 (84.83%), Query Frame = 0
Query: 2 ASSSKCSEGTSCSGLSSSSSSTCSSSSSSSMSSYTAVAADQMVKVEIEAAEALADLAVLA 61
ASSSKCS+GT+ SGLSSSSSS+ SSSSSSSMSS A AADQMVKVEIEAAEALA LAVLA
Sbjct: 39 ASSSKCSDGTTSSGLSSSSSSSSSSSSSSSMSSSMAKAADQMVKVEIEAAEALAGLAVLA 98
Query: 62 VRESGG---EAKWGSKGKGKRARKKVKSELPTWGLVDSLPSRADLDLRI-QDRGVVSHQP 121
VRE+G + KWG KGKGKRARK+VK+E PT G DSLP+RADLDLRI QDRGVV HQP
Sbjct: 99 VRETGTQPFQTKWGIKGKGKRARKEVKTESPTSGFADSLPARADLDLRIEQDRGVVKHQP 158
Query: 122 SEKECTNQSHPEWETTRKMLKVDKEEAESHKVSPTCTTSYPLFGCRRSRRNLTEAEKEER 181
SEKECT QS PE ETT ++ K+DK EAES KVSP CTTSY FGCRRSRR LTEAEKEER
Sbjct: 159 SEKECTIQSQPEPETTGEVTKMDK-EAESSKVSPACTTSYQFFGCRRSRRTLTEAEKEER 218
Query: 182 RIRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYQSLETTNKE 241
RIRRILANRESARQTIRRRQALCEELT+KAADLAWENENLKREKE+ALKEYQSLETTNKE
Sbjct: 219 RIRRILANRESARQTIRRRQALCEELTRKAADLAWENENLKREKEVALKEYQSLETTNKE 278
Query: 242 LKEQMAQAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRPPYASYFWPSVVQPSGPYH 301
LKEQ+A+AVKPKVEEIPGN+RSSHVQMPPLPTN PLFLFSR P YFWPSVVQ + YH
Sbjct: 279 LKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTNCPLFLFSRLP---YFWPSVVQSTSSYH 338
Query: 302 ELHNVVVVPSSIHLPADNNVPVSDSSHVQESFPNVNGLRTPFCIL-PCSWLLPHHDHRNQ 361
EL NVVVVPSSI+ PA+NN VS SS QE+F N G R P CIL P SWLLPHHD RNQ
Sbjct: 339 ELPNVVVVPSSINPPANNNASVSGSSQTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQ 398
Query: 362 HSPQISCPTGNNQEDIYSNSQNSAYTSKVVVNAESRHSSLPSTEEKNEAPDLNEAPNLNE 421
SPQI P GN+QE +YS SQNSA TSK V AESRHSSLPS EE+NEAPDLNEAP+L+E
Sbjct: 399 QSPQIWFPAGNDQEGVYSKSQNSAITSK-DVRAESRHSSLPSAEEENEAPDLNEAPSLDE 458
Query: 422 ALIPKDQTQNTVGVVVEGFDADARAQVRKVLSPVRLECVEPTSAIKQDNRNEDDHGLSSK 481
+ PKD TQNTVGV VEGFD +ARA VRKVLSPVRLEC+EP+SA DN NEDDHG+SS+
Sbjct: 459 SSNPKDDTQNTVGVAVEGFDTNARAPVRKVLSPVRLECIEPSSAATLDNWNEDDHGVSSR 518
Query: 482 TCDDFCDF-EKKLEPEIVPCKKTIDAMAAAEARRRRKELTKLKNLHARQFRMHS 530
TCDD C F E++ EPE+VPCKKT+DAMAA EARRRRKELTKLKNL+ARQ RM S
Sbjct: 519 TCDDLCYFAERRHEPEVVPCKKTVDAMAATEARRRRKELTKLKNLYARQCRMQS 567
BLAST of Lcy06g020270 vs. NCBI nr
Match:
XP_038904851.1 (uncharacterized protein LOC120091090 isoform X2 [Benincasa hispida])
HSP 1 Score: 746.5 bits (1926), Expect = 1.6e-211
Identity = 425/535 (79.44%), Positives = 456/535 (85.23%), Query Frame = 0
Query: 1 MASSSKCSEGTSCSGLSSSSSSTCSSSSSSSMSSYTAVAADQMVKVEIEAAEALADLAVL 60
MASSSKCS GTSCS LSSSSSS+ SS AADQMVKVEIEAAEALA LAVL
Sbjct: 1 MASSSKCSNGTSCSSLSSSSSSSMLSSK----------AADQMVKVEIEAAEALAGLAVL 60
Query: 61 AVRESGG---EAKWGSKGKGKRARKKVKSELPTWGLVDSLPSRADLDLRI-QDRGVVSHQ 120
AVRE+G E KWG KGKGKRARK+VK+ELPT G DSLPS ADLDLRI QDRGVV HQ
Sbjct: 61 AVRETGVQPLETKWGIKGKGKRARKEVKTELPTSGFADSLPSCADLDLRIEQDRGVVRHQ 120
Query: 121 PSEKECTNQSHPEWETTRKMLKVDKEEAESHKVSPTCTTSYPLFGCRRSRRNLTEAEKEE 180
PSEKECTNQSHPEWETT +++K DK EAES KVSP CTTSY LFGCRRSRRNLTEAEKEE
Sbjct: 121 PSEKECTNQSHPEWETTGELIKADK-EAESCKVSPACTTSYQLFGCRRSRRNLTEAEKEE 180
Query: 181 RRIRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYQSLETTNK 240
RR+RRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYQ+LETTN
Sbjct: 181 RRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYQTLETTNM 240
Query: 241 ELKEQMAQAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRPPYASYFWPSVVQPSGPY 300
ELKEQ+A+AVKPKV EIPGNNRSSHVQMPPLPTNYPLFL SR P YFWPSVVQP+ PY
Sbjct: 241 ELKEQLAEAVKPKV-EIPGNNRSSHVQMPPLPTNYPLFLLSRLP---YFWPSVVQPTTPY 300
Query: 301 HELHNVVVVPSSIHLPADNNVPVSDSSHVQESFPNVNGLRTPFCIL-PCSWLLPHHDHRN 360
H+L NVVVVPSSI+LPA+NNV VS SSHVQE+F +V G RTP CIL PCSWLLPHHD RN
Sbjct: 301 HDLPNVVVVPSSINLPANNNVSVSGSSHVQENFMDVTGPRTPLCILPPCSWLLPHHDFRN 360
Query: 361 QHSPQISCPTGNNQEDIYSNSQNSAYTSKVVVNAESRHSSLPSTEEKNEAPDLNEAPNLN 420
Q +PQI P GNNQEDIYS SQ+SA TSK VV+AESR SLPS EE+NEAPDLNEAPNLN
Sbjct: 361 QQNPQIWFPAGNNQEDIYSKSQDSANTSK-VVHAESRRPSLPSAEEENEAPDLNEAPNLN 420
Query: 421 EALIPKDQTQNTVGVVVEGFDADARAQVRKVLSPVRLECVEPTSAIKQDNRNEDDHGLSS 480
+A PKD TQNTVGV V+GFD + RAQVRKVLSPVRLEC+EP+ A+KQDN +EDDH L S
Sbjct: 421 KASTPKDHTQNTVGVDVDGFDTNTRAQVRKVLSPVRLECIEPSPAVKQDNGSEDDHCLPS 480
Query: 481 KTCDDFCDF-EKKLEPEIVPCKKTIDAMAAAEARRRRKELTKLKNLHARQFRMHS 530
KTCDD CDF E++ EPEIV CKKTIDAMAA EARRRRKELTKLKNL+ RQ RM S
Sbjct: 481 KTCDDLCDFAERRHEPEIVLCKKTIDAMAATEARRRRKELTKLKNLYTRQCRMQS 519
BLAST of Lcy06g020270 vs. NCBI nr
Match:
XP_038904850.1 (uncharacterized protein LOC120091090 isoform X1 [Benincasa hispida])
HSP 1 Score: 746.1 bits (1925), Expect = 2.0e-211
Identity = 425/536 (79.29%), Positives = 456/536 (85.07%), Query Frame = 0
Query: 1 MASSSKCSEGTSCSGLSSSSSSTCSSSSSSSMSSYTAVAADQMVKVEIEAAEALADLAVL 60
MASSSKCS GTSCS LSSSSSS+ SS AADQMVKVEIEAAEALA LAVL
Sbjct: 1 MASSSKCSNGTSCSSLSSSSSSSMLSSK----------AADQMVKVEIEAAEALAGLAVL 60
Query: 61 AVRESGG---EAKWGSKGKGKRARKKVKSELPTWGLVDSLPSRADLDLRI--QDRGVVSH 120
AVRE+G E KWG KGKGKRARK+VK+ELPT G DSLPS ADLDLRI QDRGVV H
Sbjct: 61 AVRETGVQPLETKWGIKGKGKRARKEVKTELPTSGFADSLPSCADLDLRIEQQDRGVVRH 120
Query: 121 QPSEKECTNQSHPEWETTRKMLKVDKEEAESHKVSPTCTTSYPLFGCRRSRRNLTEAEKE 180
QPSEKECTNQSHPEWETT +++K DK EAES KVSP CTTSY LFGCRRSRRNLTEAEKE
Sbjct: 121 QPSEKECTNQSHPEWETTGELIKADK-EAESCKVSPACTTSYQLFGCRRSRRNLTEAEKE 180
Query: 181 ERRIRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYQSLETTN 240
ERR+RRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYQ+LETTN
Sbjct: 181 ERRVRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYQTLETTN 240
Query: 241 KELKEQMAQAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRPPYASYFWPSVVQPSGP 300
ELKEQ+A+AVKPKV EIPGNNRSSHVQMPPLPTNYPLFL SR P YFWPSVVQP+ P
Sbjct: 241 MELKEQLAEAVKPKV-EIPGNNRSSHVQMPPLPTNYPLFLLSRLP---YFWPSVVQPTTP 300
Query: 301 YHELHNVVVVPSSIHLPADNNVPVSDSSHVQESFPNVNGLRTPFCIL-PCSWLLPHHDHR 360
YH+L NVVVVPSSI+LPA+NNV VS SSHVQE+F +V G RTP CIL PCSWLLPHHD R
Sbjct: 301 YHDLPNVVVVPSSINLPANNNVSVSGSSHVQENFMDVTGPRTPLCILPPCSWLLPHHDFR 360
Query: 361 NQHSPQISCPTGNNQEDIYSNSQNSAYTSKVVVNAESRHSSLPSTEEKNEAPDLNEAPNL 420
NQ +PQI P GNNQEDIYS SQ+SA TSK VV+AESR SLPS EE+NEAPDLNEAPNL
Sbjct: 361 NQQNPQIWFPAGNNQEDIYSKSQDSANTSK-VVHAESRRPSLPSAEEENEAPDLNEAPNL 420
Query: 421 NEALIPKDQTQNTVGVVVEGFDADARAQVRKVLSPVRLECVEPTSAIKQDNRNEDDHGLS 480
N+A PKD TQNTVGV V+GFD + RAQVRKVLSPVRLEC+EP+ A+KQDN +EDDH L
Sbjct: 421 NKASTPKDHTQNTVGVDVDGFDTNTRAQVRKVLSPVRLECIEPSPAVKQDNGSEDDHCLP 480
Query: 481 SKTCDDFCDF-EKKLEPEIVPCKKTIDAMAAAEARRRRKELTKLKNLHARQFRMHS 530
SKTCDD CDF E++ EPEIV CKKTIDAMAA EARRRRKELTKLKNL+ RQ RM S
Sbjct: 481 SKTCDDLCDFAERRHEPEIVLCKKTIDAMAATEARRRRKELTKLKNLYTRQCRMQS 520
BLAST of Lcy06g020270 vs. NCBI nr
Match:
KAG7017441.1 (hypothetical protein SDJN02_19306 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 745.0 bits (1922), Expect = 4.6e-211
Identity = 419/533 (78.61%), Positives = 452/533 (84.80%), Query Frame = 0
Query: 1 MASSSKCSEGTSCSGLSSSSSSTCSSSSSSSMSSYTAVAADQMVKVEIEAAEALADLAVL 60
MASSSKCSE TSCSGLSSSS+ SSSSSSM ADQMVKVEIEAAEALADLAVL
Sbjct: 1 MASSSKCSEATSCSGLSSSST---RSSSSSSME------ADQMVKVEIEAAEALADLAVL 60
Query: 61 AVRESG---GEAKWGSK-GKGKRARKKVKSELPTWGLVDSLPSRADLDLRIQDRGVVSHQ 120
AVR+SG E KW K KGKRARK+VK+E PT VDSLPSRADLDLRIQDRGV+SHQ
Sbjct: 61 AVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDLRIQDRGVISHQ 120
Query: 121 PSEKECTNQSHPEWETTRKMLKVDKEEAESHKVSPTCTTSYPLFGCRRSRRNLTEAEKEE 180
PSEKEC + SHPEWETT++M+K +K EAES K+ S+PLFGCRRSRRNLTEAEKEE
Sbjct: 121 PSEKECADHSHPEWETTKEMIKAEK-EAESPKL------SHPLFGCRRSRRNLTEAEKEE 180
Query: 181 RRIRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYQSLETTNK 240
RRIRR+LANRESARQTIRRRQALCE+LTKKA+DLAWENENLKREKELALKEYQSLE TNK
Sbjct: 181 RRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNK 240
Query: 241 ELKEQMAQAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRPPYASYFWPSVVQPSGPY 300
ELKEQ+AQA +PK+EEIPGNNRSSHVQ PPLPTNYPLFLFSRPPYASYFWPSVVQPS PY
Sbjct: 241 ELKEQIAQA-EPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPY 300
Query: 301 HELHNVVVVPSSIHLPADNNVPVSDSSHVQESFPNVNGLRTPFCILPCSWLLPHHDHRNQ 360
H+LHNV VVP S+ P++N V VSDSSHVQE+F NV GLRTPFCI+PCSWLLPHHDHRNQ
Sbjct: 301 HDLHNVAVVPPSVRSPSNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQ 360
Query: 361 HSPQISCPTGNNQEDIYSNSQNSAYTSKVVVNAESRHSSLPSTEEKNEAPDLNEAPNLNE 420
S Q SCP GN QE IYSNSQNSAYTSKVVV AESRHSSLPS EEKNEA DLNEAP+L
Sbjct: 361 QSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPSL-- 420
Query: 421 ALIPKDQTQNTVGVVVEGFDADARAQVRKVLSPVRLECVEPTSAIKQDNRNEDDHGLSSK 480
K+ TQNTVGVVV+ F+AD R QVRKVLSPVRLEC+EPTS +KQD +EDD GLSS+
Sbjct: 421 ----KEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSR 480
Query: 481 TCDDFCDF-EKKLEPEIVPCKKTIDAMAAAEARRRRKELTKLKNLHARQFRMH 529
TCDD C EKK EPE+V CKKTIDAMAA EARRRRKELTKLKNLH R RMH
Sbjct: 481 TCDDLCHLAEKKHEPEVVSCKKTIDAMAATEARRRRKELTKLKNLHTRPCRMH 510
BLAST of Lcy06g020270 vs. NCBI nr
Match:
XP_022934488.1 (uncharacterized protein LOC111441650 isoform X2 [Cucurbita moschata])
HSP 1 Score: 740.7 bits (1911), Expect = 8.6e-210
Identity = 415/533 (77.86%), Positives = 449/533 (84.24%), Query Frame = 0
Query: 1 MASSSKCSEGTSCSGLSSSSSSTCSSSSSSSMSSYTAVAADQMVKVEIEAAEALADLAVL 60
MASSSKCSE TSCSGLSSSS+ + SSSS + ADQMVKVEIEAAEALADLAVL
Sbjct: 1 MASSSKCSEATSCSGLSSSSTRSFSSSS---------MEADQMVKVEIEAAEALADLAVL 60
Query: 61 AVRESG---GEAKWGSK-GKGKRARKKVKSELPTWGLVDSLPSRADLDLRIQDRGVVSHQ 120
AVR+SG E KW K KGKRARK+VK+E PT VDSLPSRADLDLRIQDRGV+SH
Sbjct: 61 AVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDLRIQDRGVISHH 120
Query: 121 PSEKECTNQSHPEWETTRKMLKVDKEEAESHKVSPTCTTSYPLFGCRRSRRNLTEAEKEE 180
PSEKEC + SHPEWETT++M+K +K EAES K+ S+PLFGCRRSRRNLTEAEKEE
Sbjct: 121 PSEKECADHSHPEWETTKEMIKAEK-EAESPKL------SHPLFGCRRSRRNLTEAEKEE 180
Query: 181 RRIRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYQSLETTNK 240
RRIRR+LANRESARQTIRRRQALCE+LTKKA+DLAWENENLKREKELALKEYQSLE TNK
Sbjct: 181 RRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNK 240
Query: 241 ELKEQMAQAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRPPYASYFWPSVVQPSGPY 300
ELKEQ+A A +PK+EEIPGNNRSSHVQ PPLPTNYPLFLFSRPPYASYFWPSVVQPS PY
Sbjct: 241 ELKEQIAHA-EPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPY 300
Query: 301 HELHNVVVVPSSIHLPADNNVPVSDSSHVQESFPNVNGLRTPFCILPCSWLLPHHDHRNQ 360
H+LHNV VVP S+ P++N V VSDSSHVQE+F NV GLRTPFCI+PCSWLLPHHDHRNQ
Sbjct: 301 HDLHNVAVVPPSVRSPSNNTVYVSDSSHVQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQ 360
Query: 361 HSPQISCPTGNNQEDIYSNSQNSAYTSKVVVNAESRHSSLPSTEEKNEAPDLNEAPNLNE 420
S Q SCP GN QE IYSNSQNSAYTSKVVV AESRHSSLPS EEKNEA DLNEAP+L
Sbjct: 361 QSSQNSCPAGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEAHDLNEAPSL-- 420
Query: 421 ALIPKDQTQNTVGVVVEGFDADARAQVRKVLSPVRLECVEPTSAIKQDNRNEDDHGLSSK 480
K+ TQNTVGVVV+ F+AD R QVRKVLSPVRLEC+EPTS +KQD +EDD GLSS+
Sbjct: 421 ----KEHTQNTVGVVVDRFEADTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSR 480
Query: 481 TCDDFCDF-EKKLEPEIVPCKKTIDAMAAAEARRRRKELTKLKNLHARQFRMH 529
TCDD C EKK EPEIV CKKTIDAMAA EARRRRKELTKLKNLH R RMH
Sbjct: 481 TCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKLKNLHTRPCRMH 510
BLAST of Lcy06g020270 vs. NCBI nr
Match:
XP_023528186.1 (uncharacterized protein LOC111791175 isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 736.5 bits (1900), Expect = 1.6e-208
Identity = 416/533 (78.05%), Positives = 448/533 (84.05%), Query Frame = 0
Query: 1 MASSSKCSEGTSCSGLSSSSSSTCSSSSSSSMSSYTAVAADQMVKVEIEAAEALADLAVL 60
MASSSKCSE TSCSGLSSSS+ SSSSSSM ADQMVKVEIEAAEALADLAV
Sbjct: 1 MASSSKCSEATSCSGLSSSST---RSSSSSSME------ADQMVKVEIEAAEALADLAVW 60
Query: 61 AVRESG---GEAKWGSK-GKGKRARKKVKSELPTWGLVDSLPSRADLDLRIQDRGVVSHQ 120
AVR+SG E KW K KGKRARK+VK+E PT VDSLPSRADLDLRIQDR V+SHQ
Sbjct: 61 AVRDSGVQPSETKWRIKEKKGKRARKEVKTESPTSAFVDSLPSRADLDLRIQDREVISHQ 120
Query: 121 PSEKECTNQSHPEWETTRKMLKVDKEEAESHKVSPTCTTSYPLFGCRRSRRNLTEAEKEE 180
PSEKEC + SHPEWETT++M+K +K EAES K+ S+PLFGCRRSRRNLTEAEKEE
Sbjct: 121 PSEKECADHSHPEWETTKEMIKAEK-EAESPKL------SHPLFGCRRSRRNLTEAEKEE 180
Query: 181 RRIRRILANRESARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYQSLETTNK 240
RRIRR+LANRESARQTIRRRQALCE+LTKKA+DLAWENENLKREKELALKEYQSLE TNK
Sbjct: 181 RRIRRVLANRESARQTIRRRQALCEDLTKKASDLAWENENLKREKELALKEYQSLEITNK 240
Query: 241 ELKEQMAQAVKPKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRPPYASYFWPSVVQPSGPY 300
ELKEQ+AQA +PK+EEIPGNNRSSHVQ PPLPTNYPLFLFSRPPYASYFWPSVVQPS PY
Sbjct: 241 ELKEQIAQA-EPKMEEIPGNNRSSHVQTPPLPTNYPLFLFSRPPYASYFWPSVVQPSSPY 300
Query: 301 HELHNVVVVPSSIHLPADNNVPVSDSSHVQESFPNVNGLRTPFCILPCSWLLPHHDHRNQ 360
H+LH V VVP S+ P++N V VSDSSH+QE+F NV GLRTPFCI+PCSWLLPHHDHRNQ
Sbjct: 301 HDLHTVAVVPPSVRSPSNNTVYVSDSSHLQENFTNVTGLRTPFCIVPCSWLLPHHDHRNQ 360
Query: 361 HSPQISCPTGNNQEDIYSNSQNSAYTSKVVVNAESRHSSLPSTEEKNEAPDLNEAPNLNE 420
S Q SCP GN QE IYSNSQNSAYTSKVVV AESRHSSLPS EEKNEA DLNEAP+L
Sbjct: 361 QSSQNSCPVGNIQEYIYSNSQNSAYTSKVVVRAESRHSSLPSAEEKNEANDLNEAPSL-- 420
Query: 421 ALIPKDQTQNTVGVVVEGFDADARAQVRKVLSPVRLECVEPTSAIKQDNRNEDDHGLSSK 480
KD TQNTVGVVV+ F+ D R QVRKVLSPVRLEC+EPTS +KQD +EDD GLSS+
Sbjct: 421 ----KDHTQNTVGVVVDRFEVDTRDQVRKVLSPVRLECIEPTSTVKQDKPSEDDRGLSSR 480
Query: 481 TCDDFCDF-EKKLEPEIVPCKKTIDAMAAAEARRRRKELTKLKNLHARQFRMH 529
TCDD C EKK EPEIV CKKTIDAMAA EARRRRKELTKLKNLH R RMH
Sbjct: 481 TCDDLCHLAEKKHEPEIVSCKKTIDAMAATEARRRRKELTKLKNLHTRPCRMH 510
BLAST of Lcy06g020270 vs. TAIR 10
Match:
AT1G19490.1 (Basic-leucine zipper (bZIP) transcription factor family protein )
HSP 1 Score: 214.2 bits (544), Expect = 2.6e-55
Identity = 204/526 (38.78%), Positives = 277/526 (52.66%), Query Frame = 0
Query: 16 LSSSSSSTCSSSSSSSMSSYTAVAADQMVKVEIEAAEALADLAVLAV-RESGGE--AKWG 75
L SSS CSSSSSSS TA A E+EAAEALADLA LA+ RE E A WG
Sbjct: 3 LEPISSSCCSSSSSSSGEENTAAAN----MTEMEAAEALADLAQLAIMREQVFESAASWG 62
Query: 76 SKGKGKRARKKVKSELPTWGLVDSLPSRADLDL----RIQDRGVVSHQPSEKECTNQSHP 135
S KGKR RK+VK+E P DSL D D + + +V + E+E +
Sbjct: 63 S--KGKRVRKRVKTESPP---SDSLLKPPDSDTLPTPDLAEERLVKEEEEEEEVEPITK- 122
Query: 136 EWETTRKMLKVD-KEEAESHKVSPTCTTSYPLFGCRRSRRNLTEAEKEERRIRRILANRE 195
E T+ +K + E ++ T GC RSR+NL+EAE+EERRIRRILANRE
Sbjct: 123 --ELTKAPVKSEINGETPKPILASTLIRCSRSNGCGRSRQNLSEAEREERRIRRILANRE 182
Query: 196 SARQTIRRRQALCEELTKKAADLAWENENLKREKELALKEYQSLETTNKELKEQMAQAVK 255
SARQTIRRRQA+CEEL+KKAADL +ENENL+REK+ ALKE+QSLET NK LKEQ+ ++VK
Sbjct: 183 SARQTIRRRQAMCEELSKKAADLTYENENLRREKDWALKEFQSLETINKHLKEQVLKSVK 242
Query: 256 PKVEEIPGNNRSSHVQMPPLPTNYPLFLFSRPPYASYFWPSVVQPSGPYHELHNVVVVPS 315
P +E + + S V+M T P + +++ PY + WP V Q S P + S
Sbjct: 243 PDTKEPEESPKPSQVEMSTSST--PFYFYNQNPYQLFCWPHVTQSSNP---------MIS 302
Query: 316 SIHLPADNNVPVSD-SSHVQESFPNVNGLRTPFCILPCSWLLPHHDHRNQHSPQISCPTG 375
+ P ++ E+ + NG +T F ++PC W LP DH N P G
Sbjct: 303 PLEFPTSGGASAKTITTQEHENAADDNGQKTHFYVVPCPWFLPPPDHSN------GVPFG 362
Query: 376 --NNQEDIYSNSQNSAYTSKVVVN-AESRHSSLPS--TEEKNEAPDLNEAPNLNEALIPK 435
+ Q +SN + +S ++ E+ S LP+ EE + +P+ +LNE
Sbjct: 363 LQDTQRGTFSNGHHIDDSSARPMDVTETPRSHLPTRIKEEDSGSPETRPLYDLNE----- 422
Query: 436 DQTQNTVGVVVEGFDADARAQVRKVLSPVRLECVEPTSAIKQDNRNEDDHGLSSKTCDDF 495
+ V+ EG D + ++K ++ +E +G++
Sbjct: 423 ----SATEVLSEGGDG--------------FPVTQQAYSLKHEDVSETTNGVTLMPPGHH 468
Query: 496 CDFEKKLEPEIVPCKKTIDAMAAAEARRRRKELTKLKNLHARQFRM 528
+P KK ++AAAEAR+RRKELT+LKNLH RQ RM
Sbjct: 483 VLIS-------LPEKKH-GSLAAAEARKRRKELTRLKNLHGRQCRM 468
The following BLAST results are available for this feature:
ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
P23922 | 7.8e-04 | 36.00 | Transcription factor HBP-1a OS=Triticum aestivum OX=4565 PE=2 SV=1
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1F7T1 | 4.2e-210 | 77.86 | uncharacterized protein LOC111441650 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1J476 | 1.0e-208 | 77.67 | uncharacterized protein LOC111481617 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1F2W5 | 1.0e-208 | 77.72 | uncharacterized protein LOC111441650 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1J5U4 | 2.5e-207 | 77.53 | uncharacterized protein LOC111481617 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A0A0LBD4 | 4.8e-206 | 78.09 | BZIP domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G775260 PE=4 S...
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038904851.1 | 1.6e-211 | 79.44 | uncharacterized protein LOC120091090 isoform X2 [Benincasa hispida]
XP_038904850.1 | 2.0e-211 | 79.29 | uncharacterized protein LOC120091090 isoform X1 [Benincasa hispida]
KAG7017441.1 | 4.6e-211 | 78.61 | hypothetical protein SDJN02_19306 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022934488.1 | 8.6e-210 | 77.86 | uncharacterized protein LOC111441650 isoform X2 [Cucurbita moschata]
XP_023528186.1 | 1.6e-208 | 78.05 | uncharacterized protein LOC111791175 isoform X2 [Cucurbita pepo subsp. pepo]
TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G19490.1 | 2.6e-55 | 38.78 | Basic-leucine zipper (bZIP) transcription factor family protein