Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTCGTGGAAAAGGAATTTCTCTTTCAGTCCCACCTCCCTCGCTCTCTCTCCTCCCCTCAGAGTGAAAGTAAAAGTCTCCGTTAGGGTTCATGGAAGCTTCCTTCTGAGGTCAGCATGGGTTTCTTTATCTTCATCTGTTCTTCCAATTTCGTCAGAGTCTCTCATTTGGGTTTCGCTTAAATTTCTTGTTTTTGATGCTGAGTTCATAGTAATTCGTCTTGTTCTTCTCTTCCATCTACGTAGAAACTGAGAAGTTTGAGGTTGAAGTTGTTTTTATCAATAAGATAATGAAGAATCCTGATCAGGATCAGCAAGATCCGAGGTACGACTCGTCTGGGTAGCTTTCATTTTAATAGCTCTCGCGCTTTTAGCTGTGATTTTCAATGTCTGAAGTGATTTTAGTTCGTGGGTTTTCTTTTAGTGTTTCTCTACGTCTTGCTCTGAGTTTCCCATTATCATTTGAGCTTGTGTAACCATGCGACTGGCGGCTTGGTCTTGAACAAAGTGCATGAGATTTATGAGATTTTATATAACGTTTTATATATGATTCTGGAGTGTGAATTTCTTGTTTGCTATCCACCAGAGGTAAGTAATCGAATTTCATATTGTAGTTAAACTTGCTTAATATCAATAATGGAAGGCGTAGACTGCTGTGCACAGGACAATTTTCATGCCTTTGTTTTTGTTTATATTTGGCCTCAACTTTTCGGATTATAATGTATTCACTCATTTTTTATATGTTTTTTTTGAGTTCCTATTGCATTTGAATCGAAGAATTTTTTCTTTCTTTCTTTATGTTTGTTTTCATAAATGTAACATCTCATTTCTTTTAAAGATTACGATCAAGTCATTTTAGCTCATTAAGGAGGTAAATTACGAGAAATTTCATGGATATATGTATTTTTTAAACATTAATTTAAAGAAGAGGCATCTGATCTAAATCTAAAATTGAAACCTTAAGTTACACACAAGAGAAATGGAACTTCAGGTTTTTATTTTAGATTTAGGTTAGATGCCTCTTCTTTAAGTTAATAAAAAAAAAAAATACACAATTTTATGGCTTATTGTGAATATGAACTGTGCTCGGACTCTGGCAGTATGTTTGGTTATAGTTAAATGCTGAAGTGCCTGTGATTGAAGTAGTTTACATTCTTGACTTGCACAGTTTTTTTTTCCTCCCCTCCTTGTTGCCTCTCTCTTCCCCCATTTTGGGTTTGTGTGTGTTTTGATGATCATTGGTGCATCAGAAATGTACCTGGTGGGGAGGACACAACTGCAATGACTATTGAGTTTCTTCGGGCTCGACTTCTATCGGAAAGATCTGTTTCAAGAAGTGCAAGACAAAGAGCTGATGAACTAGCGAAAAGGGTACAGCATTTTGTTTATTTGTTTTGTTGGTTTTCTATGGCTTCTGTCTCATGTCTTTGACCTGAAAAACTTATTTGTTAACTATAGGTTGCAGAATTGGAGGAGCAGCTTAAGATCGTGTCTCTTCAAAGAAAGATGGCTGAAAAGGCAACAGCAGATGTACTTGCCATTCTAGAAGATAATGGCGCTAGTGATATTTCTGAGACACTTGATTCAAACTCTGACCACGAAACACCACGTGAATCAAAAGTTGGGAATGACCCTGCAAGAGAAGATGTGAACTCCTCCAAATCAATACATACGAGAAATGAACATGAAGAATTTTCAGGTTCTGATATTGATACTTCTCCAGTGCTAGGTGGAAGCCTATCTTGGAAAGGACGCAATGATTCTCCACATAAATGTGAGAAGTACAAAAAATTTTCTACAAGAAGTCGAAGCAGTTTTACATCTATTGGTTCTTCTTCACCAAAACATCGTCTTGGAAGATCATGCCGCCAGATAAAACGTAGAGATACAAGGTATGCTTAAGCTGTTTATTTTTGTCATCTGTTTGTGAAAAAATAGATACAGTTGCGCGTTTATAGATGCATGACCCATTACTTCCAGTTGACTTGGCTCATCTATAAGACTCTAGGTTATTGTTCTTGGAGATAAATTGGCATCTTGTAGTGACTATTTTCCTAATTACTGAAGTGAGATTTTGGAAAGATCTGAGTTTAATTGTCATAGCTTTTCCAACCTCCATACTGACTTGCAATCTTGGAAGGTGTCAGTACATGCAATTGTTTCTTAGCTCTTGAAGAAAATTTCCTTTTCTATTTCCCATGTTATGCATACAATTGCATCATGGTTCCTTATCCTTTTTTGTTATTTAACTTGATTATTTTATTTTATATATTTTTTAACTTGATTCTTTTATTTAGTGTGCTTTTCATTTTGTCCTGCAATCAGCATGGTAGGTTTCTTTAAGTTTCCTTTTTTCTTTCTTTGGGGGGGGGGGGGGGGTAGGAACTCAAAAAAAGTACTCATCAGGAGTACCCCAACAAAATATATATAACACAGATTTTTGAAACCGTAGATGAAGATCTCTGGATAGGTGTTCATCCTCCACAAACCACGCAGGAGAAAAACTTTATCTCTTTTGGGATTGTATCTTCTTTGTGAGCATAGGACCTCACAAAACAAAACAAAAAATACAAAAATACCCCTCCAACAATTGAGAATAAAAAATAATCGCAAACAGAAAATGGCTACAGTCTTTCGTCCTATTTTCTATAGCTAGATACTCCAAATAATCAAATTGAGAGTCACATGCCAAAGAATTCGTCCACTTTTATAGAAGGGAGAATTAATAAGCACTCCTTCCATCATGTGTCAACAACCTTTACAAAGGAGCCAAACTGGCACTCTATAAAGCACCGACACTCTATATTTGGCAGAGTGTCGGTGTCGGACACCAACACGCTCGCACACGCTCCCGACGCCCTGATTTGATTGTTTGATTATTTTATTATTTTTTAAAATTTTCTGACACGCGAGGACACGTCCATGACACATGGGGACACGTTCATGACACGTTTAGGACACGTTTGTGAGGGAAAAAGAATATTAGATTGAAAAAGCCCAATGAGCCCACTTAGAAGATAAACCTAAATCCTAAGAAAGTAGGAAAAGGCTTCACGGCTTCAGCTACCCACGATTTCTTCTACGTCTCACTGGATTGCTTCCTCCTTCACCGTTTCTTCTTCACCATCTTCTTCCTCCAAGGCCATCGCCTTTGTCTGGGAGAGAGAAGCTTAGGCCATCGCACTTGTCTGTGTGAGAGACATTGCTTACCCATTGGCGAACTTGTCTATGAGAGAGAGATACCCACCCTACTCACTTGTTTGTGTGAGAGAGACAGAGAGAGTACCAGCCCCTGCTTTCGCACCTCTTTGTCTAAGAAAAGTGGAATTTGGATTTTGTTTGCTGCCCCAAAAATACTTATAAATACTTCAAATATATATATATATTAAAATAAAAAAAATTAACGTGTCTCCAGTGTGTCGTGTCCTACTTTTTTAGAAATTGACGTGTCGTCGTGTCCATGTTGTGTCGTTCCCGTGTTGCGTGTCTGTTTCCTTACTTCTTAGCTAGCATTAAACATTTCAATAAATTTGTCACACATTTTCAAAGCAAAGTGGCACCTCCATGACACATGATCCAAGTTTTCCCTCTTGCACAAAACTCAACATTAGGAACCCAAAACCATAGATGAAGATCTCTTGATAAAGTCCATGGTGTTGATCCTCCCGAAAGCCACCAAGAAGGAGAAAAACCTTGTCTTCTTTTGGATTTTAGCTGCCCAAAGATGAGCAAAATAAAGGGGGGATTTTTCCATGTTTTCTTAATTCAAGTTAAATAAAGATTTTGAATTTCATATCATCCATATGGATGATATACAATTTTAGATAGTGATTTCTATAAGTTGTTGGGAAAATTACTAAATCTACCAAAACATCATGGGTTGGCCTAGTGCTCGTAAGGGCCAATGAAAAATGTAAGGGGCTTAGAGGGAATGGGTTCAAACCCTAGTGACCACCTACCTAGAATTTAATATCCTACGAATTACCTTGACAACCAAATGTAGTAGGGTCAAGTGGTTGTCTCGTGAGATTAGTTGAGGTGTGCGTATCATGGATATCAAAACAAAAAAAAAAATTACAAAATCTACTACCAAACCATTAGGGTCAAACTTTTAGCTGTTCAATCATACCCTTAAACAATAAGTACTATTGAAATGCCACTCCTGGCGTTAGATCTGTTTTTCTATCGTTTTCACTGAAATGTTTGTTTTTTCTGGTAAAAACTATGAAGTCAGGAAAAAAGAAGAGAAAATTGTTTCTGGTAAATCTTTCATATTTTATTATTAAATAAAAAATTATGAATTTAGGAAGGGAGAGAGAGCCCCATCTCTCCTTTCTTTTATTTCTTGATGTTCTCAAACCACCCATCTAATCAATTCTTAATAATCTCTCTCAAGTAAGTCCATTTAGTTAATCTGAATATGTTCTTATGTTGCCCCTTCTACAAATCTTTTGCATCTCCTAGCACTTTCCTAATAATTTGCTACGAGCAATAGAGTAGGTTCTTTGTGGTCACCCTCTCCAGAAAAAAAGATATCTACTTATCAATGTGATTAAAGCTTCCTGTTGGATACTTGGATGGAAAGGAATAGAACTTTCGAGACACCTTTAGGGAGGAAGATGTTTTTTTTGGATAATGTGAGAATTTTGACCTCTCTTTGGTTTCTATTTCGGAAACAGTTTGTAACTGTTCCTCTTTTGTTATCTTTGCTGATTGGAGTGATTATTTATATGATATACTCCTTGCTGTTGGATAAGATTGCTTGTCCCCTTCTTCTGTATTCTTTCTCGGTAATGGAAGTATTGTCTCTTATTTTTAAAAAAATATACAACTTTAGTTGGAAGTTAATATTGACTCGAATAATATGTAGAATAAACTATGTTTACTTAAATCAATTCTTCTCATTTTAGTGCTTGATGAAATTTATCAATTAACTAGGAAATGCAGTACTTGTTAGTAATTCTAACAGGCTTACCTTGACCTGTTTGGCTTATCTCATAGGGGCGATAGATGATACTTGGCTTCTTCTGTTATGTCATAGATCTTTGAAGTGTAGGTCAATCATTTGATATGTGCCAAAGAAAGGTTGTAGAGCTTTGAACACTTTCTGAACCATGTACTCGAAACATTAATTATTATTTTTGCATACCAGACAACTGGATGGAGAGCAAGAGCTCAAATCTGAGGCACTCGTGGGTAGTTTTCAAGAGATTGCACCATCTGCATGTTCAGAAGACTCTCGAAATTGCTGTGTAAAAGGGCCTAAGATATTAAGAGATGGTTATGAACCTCATGAAAAGACATGCTCAGGTCCTTCACAAGATCATAATAGTGTAGAAAATAAAGATCAAGATCATGATTTGGATGAGTGCGAAAAAGGAAATGATATGGAGAAGGCGTTGGAATGTCAAGCACAACTCATTGATCAATATGAAGCAATGGAAAAGGCTCAAAGAGAATGGGAAGAGAAGTTCAGAGAAAATAACAACAGTACTCCCGTATGTTCCATCAGAAACTGATTACTTGTAACAGTTGACTTTTCCTTTCTTTCATTAAATCCACGATTTAATTGAGATTCCTATGAAAGTTCTACTTTGATGCATTCCCTTTTTCTTATCATCTACCTCATCTGATGAGCATTCTTTGTCATACTGGTGATTATGTTCTTCAAGTGTCTCCTGTTTTAGCTAGATAACTTCTATAAACTTAAGTAATGCGTGCCCTTTTCTTTTCTCAAGTAAGAAGCTCAAGTTCATTAGCAAACACAAATCTTCATAATATCATGAAAATAATTTTTAATCATCAGGACAGGAGTATGAATAAATAAATCTGTGGGATTACCTAAGGTTTGTTTTCTTTGTTTTTTGCAATAAAAAGACGGAAGTTATTGGCTTGCGTGACAATTATGTTTTACAGTCAATCCTGTCATGCATATGTCATTGCACTAGACCTTGATTGTATGCACCCAATTACCAGCTTTATGAGGATTATGTCCATTCAAAGTATCTTTCTCCTTGTATTAGACTGTAAATATTAGAACATTGTGGGATGGTGGGTGGGGGCAATTTGTCCTTTGGTTTTGAGCAAATTAAATTATTGTATGCACAGTGGCAGTTTCTTTAACAACTTATTCTTCTCTCTTGGTTCATTGCTTATGTAGGATTCTTGTGACCCTGGAAACCATTCAGATATCACTGAGGAAAGAGATGAGATAAGGGCACAAGCTCCAAATCTGTCTGGTAATGCTTTCCTTGCAAATGAGGCAAAATCACAGGTTGCAGTCAATTGTGTCACTAGAGATTCGTCCCAAGCTCAAACCAATGGGCTTGACCCATCTTCATGTGCTGATGTGGAAGACTTGCAGGATCAGAATACAAATAGCATTTCTACTTCACGATCACTTGAAGAATTTACCTTTCCTATGGCTAATGTGAAGCAATGCCAAGAAAGCCAAGAAAGTAGCGAACAAGAACCTTCTTGTACCTCCCAACTCAATCATGGGCTCCCTCACAGGCCATTGTCATCTCATGGTGGTATCGATTTCTATGACCAAGAAACTCCGTGCAGTAAGAATGATCTATATGCATTGGTGCCACATGAACCGCCTGCATTAAATGGTGTACTCGAGGCACTTAAACAAGCAAAGCTATCGCTAGCAAAGAAAATCAAGAAATTACCCTCCGTAGAGGGTGAATCAATTGGAACTCTTTCTGTTCCAAAAGTTGGGGGCAGGTTAGATATCCCTATTGGATGTGCTGGGCTCTTCAGACTTCCAACCGACTTTGCTGCCGAAGCTTCTACTCAAGCGAACTTCCTAGTTTCAAGTTCTCAGTTAAGATCGTCAACTCATTATCCTGGTGAGGGTGTTGCATTATCTGCAAATCAACAATTTTTCCCTAGTCATGAAATGGAGGACAGATCAAGTTTTTTAAGAGATGGTTGTTTACGCAACAGCAATTACCACACTGGTTCAGTTTTTACCCGAGATGGATTTCTGACTGACCATTTTCCTGAGAATGGATGGAAAAATCCAGGCCAGAAGCATCATTTTGATCGATACTTCGATGCAATTCAACCCTCTCCCCATGTACACAACTATCCATCACCTTCGGTATCCTCAAGCATACATCCCAATGAAAGTTTTTTAAAAACTTTTCCCAGTCGGACTGTAGGAATCCCTCCGGCCAACCAATATTCATTTTATGATGATCAAATTAGGCCAAATATGTATAGATAGTTGGATGTAGTTATATAAGTATAGACATGTGATACATGCTTCAGGTTCCAGGAGAGCTGTTTTTCTTCTGCAACAATAGACTCGTATACCATGGGAAGGCCATTTCTACATTCTCTTGGTTTCTGCAAACTTGCTGGAAGGAAATTGGCGTTGAAAGAGATTTGACCTTGTATCATACTGTAAGTCTGTAATATTATTATTGTTTGCCTGTAACTTAAGGTTGTTCTGCCTAGACATCAGGCTATTTTCAACTTTCCATACATAAAATAAACTAATGTATATTTTGGATCCTACACTCATTTGGACTGACTTTTCAGGATTACTTCACCTCAAAAGTTCAAAGCTTGTGTTTGCTTAAATGTCATCAGAAGTTCAAGAATTCCTAA
mRNA sequence
ATGGTCGTGGAAAAGGAATTTCTCTTTCAGTCCCACCTCCCTCGCTCTCTCTCCTCCCCTCAGAGTGAAAGTAAAAAAACTGAGAAGTTTGAGGTTGAAGTTGTTTTTATCAATAAGATAATGAAGAATCCTGATCAGGATCAGCAAGATCCGAGAAATGTACCTGGTGGGGAGGACACAACTGCAATGACTATTGAGTTTCTTCGGGCTCGACTTCTATCGGAAAGATCTGTTTCAAGAAGTGCAAGACAAAGAGCTGATGAACTAGCGAAAAGGGTTGCAGAATTGGAGGAGCAGCTTAAGATCGTGTCTCTTCAAAGAAAGATGGCTGAAAAGGCAACAGCAGATGTACTTGCCATTCTAGAAGATAATGGCGCTAGTGATATTTCTGAGACACTTGATTCAAACTCTGACCACGAAACACCACGTGAATCAAAAGTTGGGAATGACCCTGCAAGAGAAGATGTGAACTCCTCCAAATCAATACATACGAGAAATGAACATGAAGAATTTTCAGGTTCTGATATTGATACTTCTCCAGTGCTAGGTGGAAGCCTATCTTGGAAAGGACGCAATGATTCTCCACATAAATGTGAGAAGTACAAAAAATTTTCTACAAGAAGTCGAAGCAGTTTTACATCTATTGGTTCTTCTTCACCAAAACATCGTCTTGGAAGATCATGCCGCCAGATAAAACGTAGAGATACAAGACAACTGGATGGAGAGCAAGAGCTCAAATCTGAGGCACTCGTGGGTAGTTTTCAAGAGATTGCACCATCTGCATGTTCAGAAGACTCTCGAAATTGCTGTGTAAAAGGGCCTAAGATATTAAGAGATGGTTATGAACCTCATGAAAAGACATGCTCAGGTCCTTCACAAGATCATAATAGTGTAGAAAATAAAGATCAAGATCATGATTTGGATGAGTGCGAAAAAGGAAATGATATGGAGAAGGCGTTGGAATGTCAAGCACAACTCATTGATCAATATGAAGCAATGGAAAAGGCTCAAAGAGAATGGGAAGAGAAGTTCAGAGAAAATAACAACAGTACTCCCGATTCTTGTGACCCTGGAAACCATTCAGATATCACTGAGGAAAGAGATGAGATAAGGGCACAAGCTCCAAATCTGTCTGGTAATGCTTTCCTTGCAAATGAGGCAAAATCACAGGTTGCAGTCAATTGTGTCACTAGAGATTCGTCCCAAGCTCAAACCAATGGGCTTGACCCATCTTCATGTGCTGATGTGGAAGACTTGCAGGATCAGAATACAAATAGCATTTCTACTTCACGATCACTTGAAGAATTTACCTTTCCTATGGCTAATGTGAAGCAATGCCAAGAAAGCCAAGAAAGTAGCGAACAAGAACCTTCTTGTACCTCCCAACTCAATCATGGGCTCCCTCACAGGCCATTGTCATCTCATGGTGGTATCGATTTCTATGACCAAGAAACTCCGTGCAGTAAGAATGATCTATATGCATTGGTGCCACATGAACCGCCTGCATTAAATGGTGTACTCGAGGCACTTAAACAAGCAAAGCTATCGCTAGCAAAGAAAATCAAGAAATTACCCTCCGTAGAGGGTGAATCAATTGGAACTCTTTCTGTTCCAAAAGTTGGGGGCAGGTTAGATATCCCTATTGGATGTGCTGGGCTCTTCAGACTTCCAACCGACTTTGCTGCCGAAGCTTCTACTCAAGCGAACTTCCTAGTTTCAAGTTCTCAGTTAAGATCGTCAACTCATTATCCTGGTGAGGGTGTTGCATTATCTGCAAATCAACAATTTTTCCCTAGTCATGAAATGGAGGACAGATCAAGTTTTTTAAGAGATGGTTGTTTACGCAACAGCAATTACCACACTGGTTCAGTTTTTACCCGAGATGGATTTCTGACTGACCATTTTCCTGAGAATGGATGGAAAAATCCAGGCCAGAAGCATCATTTTGATCGATACTTCGATGCAATTCAACCCTCTCCCCATGTACACAACTATCCATCACCTTCGGTTCCAGGAGAGCTGTTTTTCTTCTGCAACAATAGACTCGTATACCATGGGAAGGCCATTTCTACATTCTCTTGGTTTCTGCAAACTTGCTGGAAGGAAATTGGCGTTGAAAGAGATTTGACCTTGTATCATACTGATTACTTCACCTCAAAAGTTCAAAGCTTGTGTTTGCTTAAATGTCATCAGAAGTTCAAGAATTCCTAA
Coding sequence (CDS)
ATGGTCGTGGAAAAGGAATTTCTCTTTCAGTCCCACCTCCCTCGCTCTCTCTCCTCCCCTCAGAGTGAAAGTAAAAAAACTGAGAAGTTTGAGGTTGAAGTTGTTTTTATCAATAAGATAATGAAGAATCCTGATCAGGATCAGCAAGATCCGAGAAATGTACCTGGTGGGGAGGACACAACTGCAATGACTATTGAGTTTCTTCGGGCTCGACTTCTATCGGAAAGATCTGTTTCAAGAAGTGCAAGACAAAGAGCTGATGAACTAGCGAAAAGGGTTGCAGAATTGGAGGAGCAGCTTAAGATCGTGTCTCTTCAAAGAAAGATGGCTGAAAAGGCAACAGCAGATGTACTTGCCATTCTAGAAGATAATGGCGCTAGTGATATTTCTGAGACACTTGATTCAAACTCTGACCACGAAACACCACGTGAATCAAAAGTTGGGAATGACCCTGCAAGAGAAGATGTGAACTCCTCCAAATCAATACATACGAGAAATGAACATGAAGAATTTTCAGGTTCTGATATTGATACTTCTCCAGTGCTAGGTGGAAGCCTATCTTGGAAAGGACGCAATGATTCTCCACATAAATGTGAGAAGTACAAAAAATTTTCTACAAGAAGTCGAAGCAGTTTTACATCTATTGGTTCTTCTTCACCAAAACATCGTCTTGGAAGATCATGCCGCCAGATAAAACGTAGAGATACAAGACAACTGGATGGAGAGCAAGAGCTCAAATCTGAGGCACTCGTGGGTAGTTTTCAAGAGATTGCACCATCTGCATGTTCAGAAGACTCTCGAAATTGCTGTGTAAAAGGGCCTAAGATATTAAGAGATGGTTATGAACCTCATGAAAAGACATGCTCAGGTCCTTCACAAGATCATAATAGTGTAGAAAATAAAGATCAAGATCATGATTTGGATGAGTGCGAAAAAGGAAATGATATGGAGAAGGCGTTGGAATGTCAAGCACAACTCATTGATCAATATGAAGCAATGGAAAAGGCTCAAAGAGAATGGGAAGAGAAGTTCAGAGAAAATAACAACAGTACTCCCGATTCTTGTGACCCTGGAAACCATTCAGATATCACTGAGGAAAGAGATGAGATAAGGGCACAAGCTCCAAATCTGTCTGGTAATGCTTTCCTTGCAAATGAGGCAAAATCACAGGTTGCAGTCAATTGTGTCACTAGAGATTCGTCCCAAGCTCAAACCAATGGGCTTGACCCATCTTCATGTGCTGATGTGGAAGACTTGCAGGATCAGAATACAAATAGCATTTCTACTTCACGATCACTTGAAGAATTTACCTTTCCTATGGCTAATGTGAAGCAATGCCAAGAAAGCCAAGAAAGTAGCGAACAAGAACCTTCTTGTACCTCCCAACTCAATCATGGGCTCCCTCACAGGCCATTGTCATCTCATGGTGGTATCGATTTCTATGACCAAGAAACTCCGTGCAGTAAGAATGATCTATATGCATTGGTGCCACATGAACCGCCTGCATTAAATGGTGTACTCGAGGCACTTAAACAAGCAAAGCTATCGCTAGCAAAGAAAATCAAGAAATTACCCTCCGTAGAGGGTGAATCAATTGGAACTCTTTCTGTTCCAAAAGTTGGGGGCAGGTTAGATATCCCTATTGGATGTGCTGGGCTCTTCAGACTTCCAACCGACTTTGCTGCCGAAGCTTCTACTCAAGCGAACTTCCTAGTTTCAAGTTCTCAGTTAAGATCGTCAACTCATTATCCTGGTGAGGGTGTTGCATTATCTGCAAATCAACAATTTTTCCCTAGTCATGAAATGGAGGACAGATCAAGTTTTTTAAGAGATGGTTGTTTACGCAACAGCAATTACCACACTGGTTCAGTTTTTACCCGAGATGGATTTCTGACTGACCATTTTCCTGAGAATGGATGGAAAAATCCAGGCCAGAAGCATCATTTTGATCGATACTTCGATGCAATTCAACCCTCTCCCCATGTACACAACTATCCATCACCTTCGGTTCCAGGAGAGCTGTTTTTCTTCTGCAACAATAGACTCGTATACCATGGGAAGGCCATTTCTACATTCTCTTGGTTTCTGCAAACTTGCTGGAAGGAAATTGGCGTTGAAAGAGATTTGACCTTGTATCATACTGATTACTTCACCTCAAAAGTTCAAAGCTTGTGTTTGCTTAAATGTCATCAGAAGTTCAAGAATTCCTAA
Protein sequence
MVVEKEFLFQSHLPRSLSSPQSESKKTEKFEVEVVFINKIMKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHETPRESKVGNDPAREDVNSSKSIHTRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHKCEKYKKFSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDTRQLDGEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHEKTCSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREWEEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQTNGLDPSSCADVEDLQDQNTNSISTSRSLEEFTFPMANVKQCQESQESSEQEPSCTSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKLPSVEGESIGTLSVPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLRSSTHYPGEGVALSANQQFFPSHEMEDRSSFLRDGCLRNSNYHTGSVFTRDGFLTDHFPENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGELFFFCNNRLVYHGKAISTFSWFLQTCWKEIGVERDLTLYHTDYFTSKVQSLCLLKCHQKFKNS
Homology
BLAST of Sgr022650 vs. NCBI nr
Match:
XP_022134072.1 (uncharacterized protein LOC111006434 [Momordica charantia] >XP_022134074.1 uncharacterized protein LOC111006434 [Momordica charantia])
HSP 1 Score: 991.9 bits (2563), Expect = 3.0e-285
Identity = 529/641 (82.53%), Postives = 565/641 (88.14%), Query Frame = 0
Query: 41 MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQL 100
M+NPDQDQQDPR+VPG EDTTAMTIEFLRARLLSERSVSRSA+QRADELAKRVAELEEQL
Sbjct: 1 MENPDQDQQDPRSVPGVEDTTAMTIEFLRARLLSERSVSRSAKQRADELAKRVAELEEQL 60
Query: 101 KIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHETPRESKVGNDPAREDVNSSK 160
K+VSLQRKMAEKATADVL+ILEDNGASDISETLDSNSDHET ESKV +DPAR DVNS+
Sbjct: 61 KVVSLQRKMAEKATADVLSILEDNGASDISETLDSNSDHETLCESKVEDDPARADVNSN- 120
Query: 161 SIHTRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHKCEKYKKFSTRSRSSFTSIGSSSP 220
S RN HEE+SGSDIDTSPVLGGSLSWKGRNDSPH EKYKK S RSRSSF+SIGSSSP
Sbjct: 121 STRRRNVHEEYSGSDIDTSPVLGGSLSWKGRNDSPHTHEKYKKISIRSRSSFSSIGSSSP 180
Query: 221 KHRLGRSCRQIKRRDTRQLDGEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDG 280
KHRLGRSCRQIKRRD R L+ EQELKSEALV S QEIAPS CSEDSRNCC+ GPKILRDG
Sbjct: 181 KHRLGRSCRQIKRRDPRPLNREQELKSEALVDSAQEIAPSTCSEDSRNCCINGPKILRDG 240
Query: 281 YEPHEKTCSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW 340
++ HE+T SG S D++ V NKD+DHDLDE EK NDMEKALECQAQLIDQYEAMEKAQREW
Sbjct: 241 HDLHEETRSGSSPDNSCVGNKDKDHDLDEYEKVNDMEKALECQAQLIDQYEAMEKAQREW 300
Query: 341 EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDS 400
EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFL NEAK+QVAV+C+ RDS
Sbjct: 301 EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLENEAKAQVAVDCIPRDS 360
Query: 401 SQAQTNGLDPSSCADVEDLQDQNTNSISTSRSLEEFTFPMANVKQCQESQESSEQEPSCT 460
QAQTNGL PS CADVE+LQDQN+NSISTSRSLEEFTFPMANVKQCQESQE+ EQEPSCT
Sbjct: 361 YQAQTNGLGPSLCADVEELQDQNSNSISTSRSLEEFTFPMANVKQCQESQENREQEPSCT 420
Query: 461 SQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKK 520
SQLN+GLP RPLSSHGGI+F+++E PCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKK
Sbjct: 421 SQLNYGLPERPLSSHGGINFHEKEPPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKK 480
Query: 521 IKKLPSVEGE----SIGTLSVPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQ 580
I KLPSVEGE SIGTLSVP VG RL++PIGCAGLFRLPTDFAAEAS+QA+FL SSSQ
Sbjct: 481 INKLPSVEGESIGKSIGTLSVPNVGDRLEVPIGCAGLFRLPTDFAAEASSQASFLSSSSQ 540
Query: 581 LRSSTHYPGEGVALSANQQFFPSHEMEDRSSFLRDGCLRNSNYHTGSVFTRDGFLTDHFP 640
RS+THYPGEG ALSAN Q FPSHE EDRSSFLRD LRN Y GF TDHFP
Sbjct: 541 SRSATHYPGEGGALSANPQIFPSHEREDRSSFLRDNRLRNGRY---------GFPTDHFP 600
Query: 641 ENGWKNP--GQKHHFDRYFDAIQPSPH-VHNYPSPSVPGEL 675
ENGW NP GQ++ FDR FDAIQPSPH VH YP P V +
Sbjct: 601 ENGWNNPGQGQRYRFDRSFDAIQPSPHVVHQYPPPPVSSSI 631
BLAST of Sgr022650 vs. NCBI nr
Match:
XP_038885028.1 (uncharacterized protein LOC120075573 [Benincasa hispida])
HSP 1 Score: 983.4 bits (2541), Expect = 1.1e-282
Identity = 532/638 (83.39%), Postives = 558/638 (87.46%), Query Frame = 0
Query: 41 MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQL 100
M+NPDQDQQDPR+VPG EDTTAMTIEFLRARLLSERSVS+SARQRADELAKRVAELEEQL
Sbjct: 1 MENPDQDQQDPRSVPGAEDTTAMTIEFLRARLLSERSVSKSARQRADELAKRVAELEEQL 60
Query: 101 KIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHETPRESKVGNDPAREDVNSSK 160
KIVSLQRKMAEKATADVLAILEDNGASDISET DSNSD ET ESKV + PARE VNSS
Sbjct: 61 KIVSLQRKMAEKATADVLAILEDNGASDISETFDSNSDRET--ESKVEDGPAREGVNSS- 120
Query: 161 SIHTRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHKCEKYKKFSTRSRSSFTSIGSSSP 220
SI RN HEE+SG DIDTSPVLGGSLSWKGRNDSPH EKYKKFS RSRSSFTSI SSSP
Sbjct: 121 SIQRRNGHEEYSGFDIDTSPVLGGSLSWKGRNDSPHTREKYKKFSIRSRSSFTSISSSSP 180
Query: 221 KHRLGRSCRQIKRRDTRQLDGEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDG 280
KH+LGRSCRQIKR+DTR LDGEQELKSEA V S QEI PS CSED+RN V G ILRDG
Sbjct: 181 KHQLGRSCRQIKRKDTRPLDGEQELKSEARVDSSQEI-PSTCSEDTRNYSVNGHNILRDG 240
Query: 281 YEPHEKTCSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW 340
YE EKT SG S HNSV NKDQDHDLD EK N+MEKAL+CQAQLIDQYEAMEKAQREW
Sbjct: 241 YELPEKTHSGSSGVHNSVGNKDQDHDLDGYEKVNEMEKALKCQAQLIDQYEAMEKAQREW 300
Query: 341 EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDS 400
EEKFRENNNSTPDSCDPGNHSDITEERDE+RAQAPNLS NAFLANEAKSQVAV+CV RD
Sbjct: 301 EEKFRENNNSTPDSCDPGNHSDITEERDEMRAQAPNLSNNAFLANEAKSQVAVDCVIRDL 360
Query: 401 SQAQTNGLDPSSCADVEDLQDQNTNSISTSRSLEEFTFPMANVKQCQESQESSEQEPSCT 460
SQAQTNGL PS CADVEDLQDQNTNS+STS+SLEEFTFPMA VKQ QESQE+S QEPSCT
Sbjct: 361 SQAQTNGLGPSMCADVEDLQDQNTNSVSTSKSLEEFTFPMAIVKQYQESQENSAQEPSCT 420
Query: 461 SQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKK 520
S L+HGLP RPLSSH GI+FYDQETP S NDLYALVPHEPPAL+GVLEAL QAKLSL KK
Sbjct: 421 SHLHHGLPERPLSSHSGINFYDQETPFSNNDLYALVPHEPPALDGVLEALNQAKLSLTKK 480
Query: 521 IKKLPSVEGE----SIGTLSVPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQ 580
I KLPSVEGE SIG LSVPKVG RL+IPIGCAGLFRLPTDFAAEAS+Q NFL SSSQ
Sbjct: 481 IIKLPSVEGESIDKSIGALSVPKVGDRLEIPIGCAGLFRLPTDFAAEASSQPNFLASSSQ 540
Query: 581 LRSSTHYPGEGVALSANQQFFPSHEMEDRSSFLRDGCLRNSNYHTGSVFTRDGFLTDHFP 640
LRSSTHYPGEGVALSAN Q F SHEMED SSFLR+ LRNS Y TGS FTRDGFLT +
Sbjct: 541 LRSSTHYPGEGVALSANHQIFHSHEMEDGSSFLRESRLRNSGYRTGSGFTRDGFLTHNVH 600
Query: 641 ENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGEL 675
EN WKNPGQKHHFD+YFDA+QPSP+VHNYPS V +
Sbjct: 601 ENRWKNPGQKHHFDQYFDAVQPSPYVHNYPSRPVSSSI 634
BLAST of Sgr022650 vs. NCBI nr
Match:
TYK11749.1 (uncharacterized protein E5676_scaffold304G00680 [Cucumis melo var. makuwa])
HSP 1 Score: 968.8 bits (2503), Expect = 2.7e-278
Identity = 527/676 (77.96%), Postives = 567/676 (83.88%), Query Frame = 0
Query: 46 QDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSL 105
Q++ + R+VPG EDTTAMTIEFLRARLLSERSVS+SARQRADELAKRVAELEEQLKIVSL
Sbjct: 7 QEKWNLRSVPGVEDTTAMTIEFLRARLLSERSVSKSARQRADELAKRVAELEEQLKIVSL 66
Query: 106 QRKMAEKATADVLAILEDNGASDISETLDSNSDHETPRESKVGNDPAREDVNSSKSIHTR 165
QRKMAEKATADVLAILEDNGASDISETLDSNSDHET E KV + P REDVNS ++ R
Sbjct: 67 QRKMAEKATADVLAILEDNGASDISETLDSNSDHET--EPKVEDGPPREDVNSG-TVRRR 126
Query: 166 NEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHKCEKYKKFSTRSRSSFTSIGSSSPKHRLG 225
NEHEE+SGS+I+TSPVLGGSLSWKGRNDSPH EKYKK S RSRSSFTSIGSSSPKH+LG
Sbjct: 127 NEHEEYSGSNINTSPVLGGSLSWKGRNDSPHTREKYKKHSIRSRSSFTSIGSSSPKHQLG 186
Query: 226 RSCRQIKRRDTRQLDGEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHE 285
RSCRQIKRRDTR LDGEQELKSEA + S +EI S EDSRN V G ILRD YE E
Sbjct: 187 RSCRQIKRRDTRPLDGEQELKSEARMDSSEEIL-STSLEDSRNYSVNGHNILRDNYEVRE 246
Query: 286 KTCSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREWEEKFR 345
KTCS S HNS+ N DQD+D+D EK +DMEKAL+CQAQLIDQYEAMEKAQREWEEKFR
Sbjct: 247 KTCSSSSGIHNSIGNSDQDNDVDGYEKVDDMEKALKCQAQLIDQYEAMEKAQREWEEKFR 306
Query: 346 ENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQT 405
ENNNSTPDSCDPGNHSDITEERDE+RAQAPNLS N ANEAK +AV C RD SQ QT
Sbjct: 307 ENNNSTPDSCDPGNHSDITEERDEMRAQAPNLSNNP--ANEAKPHIAVVCGDRDLSQVQT 366
Query: 406 NGLDPSSC-ADVEDLQDQNTNSISTSRSLEEFTFPMANVKQCQESQESSEQEPSCTSQLN 465
NGL PS C ADVEDLQDQNTNSISTS+SLEEFTFPMANVKQCQ+SQE+S QEPSCTS LN
Sbjct: 367 NGLGPSMCAADVEDLQDQNTNSISTSKSLEEFTFPMANVKQCQDSQENSAQEPSCTSHLN 426
Query: 466 HGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKL 525
HGLP RPLSSH GI+ YDQETPCS NDLYALVPHEPPAL+GVLEALKQAKLSL KKI KL
Sbjct: 427 HGLPERPLSSHSGINSYDQETPCSNNDLYALVPHEPPALDGVLEALKQAKLSLTKKIIKL 486
Query: 526 PSVEGE------SIGTLSVPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLR 585
PSV+GE SIG LSV K+G RL+IP+GCAGLFRLPTDFAAEAS+QANFL SSSQLR
Sbjct: 487 PSVDGESESIDKSIGPLSVLKMGDRLEIPVGCAGLFRLPTDFAAEASSQANFLASSSQLR 546
Query: 586 SSTHYPGEGVALSANQQFFPSHEMEDRSSFLRDGCLRNSNYHTGSVFTRDGFLTDHFPEN 645
S THYPGEGVALSAN Q FP HEMEDRSSFLRD LR S YHTGS FTRD FLTDH PEN
Sbjct: 547 SPTHYPGEGVALSANHQIFPGHEMEDRSSFLRDSRLRCSGYHTGSGFTRDAFLTDHIPEN 606
Query: 646 GWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGELFFFCNNRLVYHGKAISTFSWFLQT 705
WKNP QKHH D+YFDA+QPS +V NYPS VPGELFFFCN RL+YHG+ F +
Sbjct: 607 RWKNPSQKHHIDQYFDAVQPSSYVPNYPSRPVPGELFFFCNYRLMYHGRPFLHSIGFYKL 666
Query: 706 CWKEIGVERDLTLYHT 715
+++ DLTLYHT
Sbjct: 667 AERKLAF-GDLTLYHT 675
BLAST of Sgr022650 vs. NCBI nr
Match:
XP_004140985.1 (uncharacterized protein LOC101207733 [Cucumis sativus] >KGN46025.1 hypothetical protein Csa_005228 [Cucumis sativus])
HSP 1 Score: 966.1 bits (2496), Expect = 1.8e-277
Identity = 522/641 (81.44%), Postives = 555/641 (86.58%), Query Frame = 0
Query: 41 MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQL 100
M+NPDQDQQDPR+VPG EDTTAMTIEFLRARLLSERSVS+SARQRADELAKRVAELEEQL
Sbjct: 1 MENPDQDQQDPRSVPGVEDTTAMTIEFLRARLLSERSVSKSARQRADELAKRVAELEEQL 60
Query: 101 KIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHETPRESKVGNDPAREDVNSSK 160
KIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHET E KV + AREDV SS
Sbjct: 61 KIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHET--EPKVEDGLAREDV-SSG 120
Query: 161 SIHTRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHKCEKYKKFSTRSRSSFTSIGSSSP 220
++ RNEHEE+SGS+IDTSPVLGGSLSWKGRNDSPH EKYKK S RSRSSFTSIGSSSP
Sbjct: 121 TVRRRNEHEEYSGSNIDTSPVLGGSLSWKGRNDSPHTREKYKKHSIRSRSSFTSIGSSSP 180
Query: 221 KHRLGRSCRQIKRRDTRQLDGEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDG 280
KH+LGRSCRQIKRRDTR LDGEQELKS+ALV S +EI PS EDS+N V G ILRDG
Sbjct: 181 KHQLGRSCRQIKRRDTRPLDGEQELKSDALVDSSEEI-PSTSLEDSQNYSVNGHSILRDG 240
Query: 281 YEPHEKTCSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW 340
YE EKT S S HNSV N DQD+D+D EK +DMEKAL+CQAQLIDQYEAMEKAQREW
Sbjct: 241 YEVREKTRSSSSGVHNSVGNSDQDNDIDGYEKVDDMEKALKCQAQLIDQYEAMEKAQREW 300
Query: 341 EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDS 400
EEKFRENNNSTPDSCDPGNHSDITEERDE+RAQAPNLS N ANEAK QVA +C TRD
Sbjct: 301 EEKFRENNNSTPDSCDPGNHSDITEERDEMRAQAPNLSNNP--ANEAKPQVAFDCDTRDL 360
Query: 401 SQAQTNGLDPSSCA-DVEDLQDQNTNSISTSRSLEEFTFPMANVKQCQESQESSEQEPSC 460
SQAQTNGL PS CA DVEDLQDQNTNSISTS+SLEEFTFPMANVKQCQESQE+S QEPSC
Sbjct: 361 SQAQTNGLGPSMCAVDVEDLQDQNTNSISTSKSLEEFTFPMANVKQCQESQENSAQEPSC 420
Query: 461 TSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAK 520
TS LNHGLP RPLSSHGGI+ YDQETPCS NDLYALVPHEPPAL+GVLEALKQAKLSL K
Sbjct: 421 TSHLNHGLPERPLSSHGGINSYDQETPCSNNDLYALVPHEPPALDGVLEALKQAKLSLTK 480
Query: 521 KIKKLPSVEGE------SIGTLSVPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVS 580
KI KLPSV+GE SIG LS+PK+G RL+IP+GCAGLFRLPTDFAAEAS+QANFL S
Sbjct: 481 KIIKLPSVDGESESIDKSIGPLSIPKMGDRLEIPVGCAGLFRLPTDFAAEASSQANFLAS 540
Query: 581 SSQLRSSTHYPGEGVALSANQQFFPSHEMEDRSSFLRDGCLRNSNYHTGSVFTRDGFLTD 640
SSQLRS THYPGEG ALSAN Q FP HEMEDRSSFLRD LR+S Y GS FTRDGFLTD
Sbjct: 541 SSQLRSPTHYPGEGAALSANHQIFPGHEMEDRSSFLRDSRLRSSGYRAGSGFTRDGFLTD 600
Query: 641 HFPENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGEL 675
H PEN WKNPGQKHHFD+YFDA+QPS +VHNYP V +
Sbjct: 601 HIPENRWKNPGQKHHFDQYFDAVQPSSYVHNYPPRPVSSNI 635
BLAST of Sgr022650 vs. NCBI nr
Match:
XP_008456583.1 (PREDICTED: uncharacterized protein LOC103496496 [Cucumis melo])
HSP 1 Score: 948.3 bits (2450), Expect = 3.8e-272
Identity = 514/641 (80.19%), Postives = 547/641 (85.34%), Query Frame = 0
Query: 41 MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQL 100
M+NPDQDQQDPR+VPG EDTTAMTIEFLRARLLSERSVS+SARQRADELAKRVAELEEQL
Sbjct: 1 MENPDQDQQDPRSVPGVEDTTAMTIEFLRARLLSERSVSKSARQRADELAKRVAELEEQL 60
Query: 101 KIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHETPRESKVGNDPAREDVNSSK 160
KIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHET E KV + P REDVNS
Sbjct: 61 KIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHET--EPKVEDGPPREDVNSG- 120
Query: 161 SIHTRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHKCEKYKKFSTRSRSSFTSIGSSSP 220
++ RNEHEE+SGS+I+TSPVLGGSLSWKGRNDSPH EKYKK S RSRSSFTSIGSSSP
Sbjct: 121 TVRRRNEHEEYSGSNINTSPVLGGSLSWKGRNDSPHTREKYKKHSIRSRSSFTSIGSSSP 180
Query: 221 KHRLGRSCRQIKRRDTRQLDGEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDG 280
KH+LGRSCRQIKRRDTR LDGEQELKSEA + S +EI S EDSRN V G ILRD
Sbjct: 181 KHQLGRSCRQIKRRDTRPLDGEQELKSEARMDSSEEIL-STSLEDSRNYSVNGHNILRDN 240
Query: 281 YEPHEKTCSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW 340
YE EKTCS S HNS+ N DQD+D+D EK +DMEKAL+CQAQLIDQYEAMEKAQREW
Sbjct: 241 YEVREKTCSSSSGIHNSIGNSDQDNDVDGYEKVDDMEKALKCQAQLIDQYEAMEKAQREW 300
Query: 341 EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDS 400
EEKFRENNNSTPDSCDPGNHSDITEERDE+RAQAPNLS N ANEAK +AV C RD
Sbjct: 301 EEKFRENNNSTPDSCDPGNHSDITEERDEMRAQAPNLSNNP--ANEAKPHIAVVCGDRDL 360
Query: 401 SQAQTNGLDPSSC-ADVEDLQDQNTNSISTSRSLEEFTFPMANVKQCQESQESSEQEPSC 460
SQ QTNGL PS C ADVEDLQDQNTNSISTS+SLEEFTFPMANVKQCQ+SQE+S QEPSC
Sbjct: 361 SQVQTNGLGPSMCAADVEDLQDQNTNSISTSKSLEEFTFPMANVKQCQDSQENSAQEPSC 420
Query: 461 TSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAK 520
TS LNHGLP RPLSSH GI+ YDQETPCS NDLYALVPHEPPAL+GVLEALKQAKLSL K
Sbjct: 421 TSHLNHGLPERPLSSHSGINSYDQETPCSNNDLYALVPHEPPALDGVLEALKQAKLSLTK 480
Query: 521 KIKKLPSVEGE------SIGTLSVPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVS 580
KI KLPSV+GE SIG LSV K+G RL+IP+GCAGLFRLPTDFAAEAS+QANFL S
Sbjct: 481 KIIKLPSVDGESESIDKSIGPLSVLKMGDRLEIPVGCAGLFRLPTDFAAEASSQANFLAS 540
Query: 581 SSQLRSSTHYPGEGVALSANQQFFPSHEMEDRSSFLRDGCLRNSNYHTGSVFTRDGFLTD 640
SSQLRS THYPGEGVALSAN Q FP HEMEDRSSFLRD LR S YHTGS FTRD FLTD
Sbjct: 541 SSQLRSPTHYPGEGVALSANHQIFPGHEMEDRSSFLRDSRLRCSGYHTGSGFTRDAFLTD 600
Query: 641 HFPENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGEL 675
H PEN WKNP QKHH D+YFDA+QPS +V NYPS V +
Sbjct: 601 HIPENRWKNPSQKHHIDQYFDAVQPSSYVPNYPSRPVSSNI 635
BLAST of Sgr022650 vs. ExPASy TrEMBL
Match:
A0A6J1BXR7 (uncharacterized protein LOC111006434 OS=Momordica charantia OX=3673 GN=LOC111006434 PE=4 SV=1)
HSP 1 Score: 991.9 bits (2563), Expect = 1.4e-285
Identity = 529/641 (82.53%), Postives = 565/641 (88.14%), Query Frame = 0
Query: 41 MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQL 100
M+NPDQDQQDPR+VPG EDTTAMTIEFLRARLLSERSVSRSA+QRADELAKRVAELEEQL
Sbjct: 1 MENPDQDQQDPRSVPGVEDTTAMTIEFLRARLLSERSVSRSAKQRADELAKRVAELEEQL 60
Query: 101 KIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHETPRESKVGNDPAREDVNSSK 160
K+VSLQRKMAEKATADVL+ILEDNGASDISETLDSNSDHET ESKV +DPAR DVNS+
Sbjct: 61 KVVSLQRKMAEKATADVLSILEDNGASDISETLDSNSDHETLCESKVEDDPARADVNSN- 120
Query: 161 SIHTRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHKCEKYKKFSTRSRSSFTSIGSSSP 220
S RN HEE+SGSDIDTSPVLGGSLSWKGRNDSPH EKYKK S RSRSSF+SIGSSSP
Sbjct: 121 STRRRNVHEEYSGSDIDTSPVLGGSLSWKGRNDSPHTHEKYKKISIRSRSSFSSIGSSSP 180
Query: 221 KHRLGRSCRQIKRRDTRQLDGEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDG 280
KHRLGRSCRQIKRRD R L+ EQELKSEALV S QEIAPS CSEDSRNCC+ GPKILRDG
Sbjct: 181 KHRLGRSCRQIKRRDPRPLNREQELKSEALVDSAQEIAPSTCSEDSRNCCINGPKILRDG 240
Query: 281 YEPHEKTCSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW 340
++ HE+T SG S D++ V NKD+DHDLDE EK NDMEKALECQAQLIDQYEAMEKAQREW
Sbjct: 241 HDLHEETRSGSSPDNSCVGNKDKDHDLDEYEKVNDMEKALECQAQLIDQYEAMEKAQREW 300
Query: 341 EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDS 400
EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFL NEAK+QVAV+C+ RDS
Sbjct: 301 EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLENEAKAQVAVDCIPRDS 360
Query: 401 SQAQTNGLDPSSCADVEDLQDQNTNSISTSRSLEEFTFPMANVKQCQESQESSEQEPSCT 460
QAQTNGL PS CADVE+LQDQN+NSISTSRSLEEFTFPMANVKQCQESQE+ EQEPSCT
Sbjct: 361 YQAQTNGLGPSLCADVEELQDQNSNSISTSRSLEEFTFPMANVKQCQESQENREQEPSCT 420
Query: 461 SQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKK 520
SQLN+GLP RPLSSHGGI+F+++E PCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKK
Sbjct: 421 SQLNYGLPERPLSSHGGINFHEKEPPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKK 480
Query: 521 IKKLPSVEGE----SIGTLSVPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQ 580
I KLPSVEGE SIGTLSVP VG RL++PIGCAGLFRLPTDFAAEAS+QA+FL SSSQ
Sbjct: 481 INKLPSVEGESIGKSIGTLSVPNVGDRLEVPIGCAGLFRLPTDFAAEASSQASFLSSSSQ 540
Query: 581 LRSSTHYPGEGVALSANQQFFPSHEMEDRSSFLRDGCLRNSNYHTGSVFTRDGFLTDHFP 640
RS+THYPGEG ALSAN Q FPSHE EDRSSFLRD LRN Y GF TDHFP
Sbjct: 541 SRSATHYPGEGGALSANPQIFPSHEREDRSSFLRDNRLRNGRY---------GFPTDHFP 600
Query: 641 ENGWKNP--GQKHHFDRYFDAIQPSPH-VHNYPSPSVPGEL 675
ENGW NP GQ++ FDR FDAIQPSPH VH YP P V +
Sbjct: 601 ENGWNNPGQGQRYRFDRSFDAIQPSPHVVHQYPPPPVSSSI 631
BLAST of Sgr022650 vs. ExPASy TrEMBL
Match:
A0A5D3CNC8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold304G00680 PE=4 SV=1)
HSP 1 Score: 968.8 bits (2503), Expect = 1.3e-278
Identity = 527/676 (77.96%), Postives = 567/676 (83.88%), Query Frame = 0
Query: 46 QDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSL 105
Q++ + R+VPG EDTTAMTIEFLRARLLSERSVS+SARQRADELAKRVAELEEQLKIVSL
Sbjct: 7 QEKWNLRSVPGVEDTTAMTIEFLRARLLSERSVSKSARQRADELAKRVAELEEQLKIVSL 66
Query: 106 QRKMAEKATADVLAILEDNGASDISETLDSNSDHETPRESKVGNDPAREDVNSSKSIHTR 165
QRKMAEKATADVLAILEDNGASDISETLDSNSDHET E KV + P REDVNS ++ R
Sbjct: 67 QRKMAEKATADVLAILEDNGASDISETLDSNSDHET--EPKVEDGPPREDVNSG-TVRRR 126
Query: 166 NEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHKCEKYKKFSTRSRSSFTSIGSSSPKHRLG 225
NEHEE+SGS+I+TSPVLGGSLSWKGRNDSPH EKYKK S RSRSSFTSIGSSSPKH+LG
Sbjct: 127 NEHEEYSGSNINTSPVLGGSLSWKGRNDSPHTREKYKKHSIRSRSSFTSIGSSSPKHQLG 186
Query: 226 RSCRQIKRRDTRQLDGEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDGYEPHE 285
RSCRQIKRRDTR LDGEQELKSEA + S +EI S EDSRN V G ILRD YE E
Sbjct: 187 RSCRQIKRRDTRPLDGEQELKSEARMDSSEEIL-STSLEDSRNYSVNGHNILRDNYEVRE 246
Query: 286 KTCSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREWEEKFR 345
KTCS S HNS+ N DQD+D+D EK +DMEKAL+CQAQLIDQYEAMEKAQREWEEKFR
Sbjct: 247 KTCSSSSGIHNSIGNSDQDNDVDGYEKVDDMEKALKCQAQLIDQYEAMEKAQREWEEKFR 306
Query: 346 ENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDSSQAQT 405
ENNNSTPDSCDPGNHSDITEERDE+RAQAPNLS N ANEAK +AV C RD SQ QT
Sbjct: 307 ENNNSTPDSCDPGNHSDITEERDEMRAQAPNLSNNP--ANEAKPHIAVVCGDRDLSQVQT 366
Query: 406 NGLDPSSC-ADVEDLQDQNTNSISTSRSLEEFTFPMANVKQCQESQESSEQEPSCTSQLN 465
NGL PS C ADVEDLQDQNTNSISTS+SLEEFTFPMANVKQCQ+SQE+S QEPSCTS LN
Sbjct: 367 NGLGPSMCAADVEDLQDQNTNSISTSKSLEEFTFPMANVKQCQDSQENSAQEPSCTSHLN 426
Query: 466 HGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAKKIKKL 525
HGLP RPLSSH GI+ YDQETPCS NDLYALVPHEPPAL+GVLEALKQAKLSL KKI KL
Sbjct: 427 HGLPERPLSSHSGINSYDQETPCSNNDLYALVPHEPPALDGVLEALKQAKLSLTKKIIKL 486
Query: 526 PSVEGE------SIGTLSVPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVSSSQLR 585
PSV+GE SIG LSV K+G RL+IP+GCAGLFRLPTDFAAEAS+QANFL SSSQLR
Sbjct: 487 PSVDGESESIDKSIGPLSVLKMGDRLEIPVGCAGLFRLPTDFAAEASSQANFLASSSQLR 546
Query: 586 SSTHYPGEGVALSANQQFFPSHEMEDRSSFLRDGCLRNSNYHTGSVFTRDGFLTDHFPEN 645
S THYPGEGVALSAN Q FP HEMEDRSSFLRD LR S YHTGS FTRD FLTDH PEN
Sbjct: 547 SPTHYPGEGVALSANHQIFPGHEMEDRSSFLRDSRLRCSGYHTGSGFTRDAFLTDHIPEN 606
Query: 646 GWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGELFFFCNNRLVYHGKAISTFSWFLQT 705
WKNP QKHH D+YFDA+QPS +V NYPS VPGELFFFCN RL+YHG+ F +
Sbjct: 607 RWKNPSQKHHIDQYFDAVQPSSYVPNYPSRPVPGELFFFCNYRLMYHGRPFLHSIGFYKL 666
Query: 706 CWKEIGVERDLTLYHT 715
+++ DLTLYHT
Sbjct: 667 AERKLAF-GDLTLYHT 675
BLAST of Sgr022650 vs. ExPASy TrEMBL
Match:
A0A0A0K8I3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G045040 PE=4 SV=1)
HSP 1 Score: 966.1 bits (2496), Expect = 8.5e-278
Identity = 522/641 (81.44%), Postives = 555/641 (86.58%), Query Frame = 0
Query: 41 MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQL 100
M+NPDQDQQDPR+VPG EDTTAMTIEFLRARLLSERSVS+SARQRADELAKRVAELEEQL
Sbjct: 1 MENPDQDQQDPRSVPGVEDTTAMTIEFLRARLLSERSVSKSARQRADELAKRVAELEEQL 60
Query: 101 KIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHETPRESKVGNDPAREDVNSSK 160
KIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHET E KV + AREDV SS
Sbjct: 61 KIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHET--EPKVEDGLAREDV-SSG 120
Query: 161 SIHTRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHKCEKYKKFSTRSRSSFTSIGSSSP 220
++ RNEHEE+SGS+IDTSPVLGGSLSWKGRNDSPH EKYKK S RSRSSFTSIGSSSP
Sbjct: 121 TVRRRNEHEEYSGSNIDTSPVLGGSLSWKGRNDSPHTREKYKKHSIRSRSSFTSIGSSSP 180
Query: 221 KHRLGRSCRQIKRRDTRQLDGEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDG 280
KH+LGRSCRQIKRRDTR LDGEQELKS+ALV S +EI PS EDS+N V G ILRDG
Sbjct: 181 KHQLGRSCRQIKRRDTRPLDGEQELKSDALVDSSEEI-PSTSLEDSQNYSVNGHSILRDG 240
Query: 281 YEPHEKTCSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW 340
YE EKT S S HNSV N DQD+D+D EK +DMEKAL+CQAQLIDQYEAMEKAQREW
Sbjct: 241 YEVREKTRSSSSGVHNSVGNSDQDNDIDGYEKVDDMEKALKCQAQLIDQYEAMEKAQREW 300
Query: 341 EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDS 400
EEKFRENNNSTPDSCDPGNHSDITEERDE+RAQAPNLS N ANEAK QVA +C TRD
Sbjct: 301 EEKFRENNNSTPDSCDPGNHSDITEERDEMRAQAPNLSNNP--ANEAKPQVAFDCDTRDL 360
Query: 401 SQAQTNGLDPSSCA-DVEDLQDQNTNSISTSRSLEEFTFPMANVKQCQESQESSEQEPSC 460
SQAQTNGL PS CA DVEDLQDQNTNSISTS+SLEEFTFPMANVKQCQESQE+S QEPSC
Sbjct: 361 SQAQTNGLGPSMCAVDVEDLQDQNTNSISTSKSLEEFTFPMANVKQCQESQENSAQEPSC 420
Query: 461 TSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAK 520
TS LNHGLP RPLSSHGGI+ YDQETPCS NDLYALVPHEPPAL+GVLEALKQAKLSL K
Sbjct: 421 TSHLNHGLPERPLSSHGGINSYDQETPCSNNDLYALVPHEPPALDGVLEALKQAKLSLTK 480
Query: 521 KIKKLPSVEGE------SIGTLSVPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVS 580
KI KLPSV+GE SIG LS+PK+G RL+IP+GCAGLFRLPTDFAAEAS+QANFL S
Sbjct: 481 KIIKLPSVDGESESIDKSIGPLSIPKMGDRLEIPVGCAGLFRLPTDFAAEASSQANFLAS 540
Query: 581 SSQLRSSTHYPGEGVALSANQQFFPSHEMEDRSSFLRDGCLRNSNYHTGSVFTRDGFLTD 640
SSQLRS THYPGEG ALSAN Q FP HEMEDRSSFLRD LR+S Y GS FTRDGFLTD
Sbjct: 541 SSQLRSPTHYPGEGAALSANHQIFPGHEMEDRSSFLRDSRLRSSGYRAGSGFTRDGFLTD 600
Query: 641 HFPENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGEL 675
H PEN WKNPGQKHHFD+YFDA+QPS +VHNYP V +
Sbjct: 601 HIPENRWKNPGQKHHFDQYFDAVQPSSYVHNYPPRPVSSNI 635
BLAST of Sgr022650 vs. ExPASy TrEMBL
Match:
A0A1S3C3K3 (uncharacterized protein LOC103496496 OS=Cucumis melo OX=3656 GN=LOC103496496 PE=4 SV=1)
HSP 1 Score: 948.3 bits (2450), Expect = 1.8e-272
Identity = 514/641 (80.19%), Postives = 547/641 (85.34%), Query Frame = 0
Query: 41 MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQL 100
M+NPDQDQQDPR+VPG EDTTAMTIEFLRARLLSERSVS+SARQRADELAKRVAELEEQL
Sbjct: 1 MENPDQDQQDPRSVPGVEDTTAMTIEFLRARLLSERSVSKSARQRADELAKRVAELEEQL 60
Query: 101 KIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHETPRESKVGNDPAREDVNSSK 160
KIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHET E KV + P REDVNS
Sbjct: 61 KIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHET--EPKVEDGPPREDVNSG- 120
Query: 161 SIHTRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHKCEKYKKFSTRSRSSFTSIGSSSP 220
++ RNEHEE+SGS+I+TSPVLGGSLSWKGRNDSPH EKYKK S RSRSSFTSIGSSSP
Sbjct: 121 TVRRRNEHEEYSGSNINTSPVLGGSLSWKGRNDSPHTREKYKKHSIRSRSSFTSIGSSSP 180
Query: 221 KHRLGRSCRQIKRRDTRQLDGEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDG 280
KH+LGRSCRQIKRRDTR LDGEQELKSEA + S +EI S EDSRN V G ILRD
Sbjct: 181 KHQLGRSCRQIKRRDTRPLDGEQELKSEARMDSSEEIL-STSLEDSRNYSVNGHNILRDN 240
Query: 281 YEPHEKTCSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW 340
YE EKTCS S HNS+ N DQD+D+D EK +DMEKAL+CQAQLIDQYEAMEKAQREW
Sbjct: 241 YEVREKTCSSSSGIHNSIGNSDQDNDVDGYEKVDDMEKALKCQAQLIDQYEAMEKAQREW 300
Query: 341 EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDS 400
EEKFRENNNSTPDSCDPGNHSDITEERDE+RAQAPNLS N ANEAK +AV C RD
Sbjct: 301 EEKFRENNNSTPDSCDPGNHSDITEERDEMRAQAPNLSNNP--ANEAKPHIAVVCGDRDL 360
Query: 401 SQAQTNGLDPSSC-ADVEDLQDQNTNSISTSRSLEEFTFPMANVKQCQESQESSEQEPSC 460
SQ QTNGL PS C ADVEDLQDQNTNSISTS+SLEEFTFPMANVKQCQ+SQE+S QEPSC
Sbjct: 361 SQVQTNGLGPSMCAADVEDLQDQNTNSISTSKSLEEFTFPMANVKQCQDSQENSAQEPSC 420
Query: 461 TSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAK 520
TS LNHGLP RPLSSH GI+ YDQETPCS NDLYALVPHEPPAL+GVLEALKQAKLSL K
Sbjct: 421 TSHLNHGLPERPLSSHSGINSYDQETPCSNNDLYALVPHEPPALDGVLEALKQAKLSLTK 480
Query: 521 KIKKLPSVEGE------SIGTLSVPKVGGRLDIPIGCAGLFRLPTDFAAEASTQANFLVS 580
KI KLPSV+GE SIG LSV K+G RL+IP+GCAGLFRLPTDFAAEAS+QANFL S
Sbjct: 481 KIIKLPSVDGESESIDKSIGPLSVLKMGDRLEIPVGCAGLFRLPTDFAAEASSQANFLAS 540
Query: 581 SSQLRSSTHYPGEGVALSANQQFFPSHEMEDRSSFLRDGCLRNSNYHTGSVFTRDGFLTD 640
SSQLRS THYPGEGVALSAN Q FP HEMEDRSSFLRD LR S YHTGS FTRD FLTD
Sbjct: 541 SSQLRSPTHYPGEGVALSANHQIFPGHEMEDRSSFLRDSRLRCSGYHTGSGFTRDAFLTD 600
Query: 641 HFPENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSPSVPGEL 675
H PEN WKNP QKHH D+YFDA+QPS +V NYPS V +
Sbjct: 601 HIPENRWKNPSQKHHIDQYFDAVQPSSYVPNYPSRPVSSNI 635
BLAST of Sgr022650 vs. ExPASy TrEMBL
Match:
A0A6J1FEU7 (uncharacterized protein LOC111445070 OS=Cucurbita moschata OX=3662 GN=LOC111445070 PE=4 SV=1)
HSP 1 Score: 928.3 bits (2398), Expect = 2.0e-266
Identity = 503/634 (79.34%), Postives = 545/634 (85.96%), Query Frame = 0
Query: 41 MKNPDQDQQDPRNVPGGEDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQL 100
M+NPDQDQQDPR+VPG EDTTAMTIEFLRARLLSERSVS+ ARQRADELAKRVAELEEQL
Sbjct: 1 MENPDQDQQDPRSVPGVEDTTAMTIEFLRARLLSERSVSKCARQRADELAKRVAELEEQL 60
Query: 101 KIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHETPRESKVGNDPAREDVNSSK 160
++VS QR+MAEKATADVLAILEDNGA+DISETLDSNSDHET +KV + P R D NSS
Sbjct: 61 RVVSFQRRMAEKATADVLAILEDNGATDISETLDSNSDHET---AKVEDGPVRGDANSS- 120
Query: 161 SIHTRNEHEEFSGSDIDTSPVLGGSLSWKGRNDSPHKCEKYKKFSTRSRSSFTSIGSSSP 220
SI RNEHEE+SGS DTSP+LG SLSWKGRND PH EKYKKFS RS+S+FTSIGSSSP
Sbjct: 121 SIGRRNEHEEYSGS--DTSPMLGASLSWKGRNDRPHIREKYKKFSIRSQSNFTSIGSSSP 180
Query: 221 KHRLGRSCRQIKRRDTRQLDGEQELKSEALVGSFQEIAPSACSEDSRNCCVKGPKILRDG 280
KH+LGRSCRQIKRRDTR LDGEQELKS+ V S QEI PS CSEDSRN V G KI RDG
Sbjct: 181 KHQLGRSCRQIKRRDTRPLDGEQELKSKTCVDSCQEI-PSTCSEDSRNYSVNGDKISRDG 240
Query: 281 YEPHEKTCSGPSQDHNSVENKDQDHDLDECEKGNDMEKALECQAQLIDQYEAMEKAQREW 340
YE HEKT SG S+ HNSV NKDQDHDLD EK +DMEK+L+CQAQLIDQYEAMEKAQREW
Sbjct: 241 YELHEKTRSGSSEVHNSVGNKDQDHDLDGYEKVSDMEKSLKCQAQLIDQYEAMEKAQREW 300
Query: 341 EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVTRDS 400
EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNL ++ LANE KSQV+ +CVTRD
Sbjct: 301 EEKFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLPNSSLLANEPKSQVSADCVTRDL 360
Query: 401 SQAQTN-GLDPSSCADVEDLQDQNTNSISTSRSLEEFTFPMANVKQCQESQESSEQEPSC 460
SQAQT+ GL PS C+DV DLQDQN NS+STSRSLEEFTFPMANVKQCQES E+ EQEPSC
Sbjct: 361 SQAQTSGGLGPSLCSDVNDLQDQNMNSVSTSRSLEEFTFPMANVKQCQESHENREQEPSC 420
Query: 461 TSQLNHGLPHRPLSSHGGIDFYDQETPCSKNDLYALVPHEPPALNGVLEALKQAKLSLAK 520
TS LNHGLP R LSSH GI+ YDQETPCS+ DLYALVPHEPPAL+GVLEALKQAKLSL K
Sbjct: 421 TSNLNHGLPERLLSSHTGINIYDQETPCSRTDLYALVPHEPPALDGVLEALKQAKLSLTK 480
Query: 521 KIKKLPSVEG----ESIGTLSVPKVGGRLDIPIGCAGLFRLPTDFAAEA-STQANFLVSS 580
KI KLP VEG +SIG LSVPKV L+IPIGCAGLFRLPTDFAAEA STQ NFL SS
Sbjct: 481 KINKLPFVEGGSVDKSIGALSVPKVEDSLEIPIGCAGLFRLPTDFAAEASSTQPNFLASS 540
Query: 581 SQLRSSTHYPGEGVALSANQQFFPSHEMEDRSSFLRDGCLRNSNYHTGSVFTRDGFLTDH 640
S+LRS+ Y GE VALSA Q FP+HEMEDRSSFL LR+S+YHTGS TRDG+LTDH
Sbjct: 541 SELRSTARYTGESVALSATHQIFPAHEMEDRSSFLT--ALRSSHYHTGSGSTRDGYLTDH 600
Query: 641 FPENGWKNPGQKHHFDRYFDAIQPSPHVHNYPSP 669
FPE+ WKNPGQ HHFD+YFDAIQPSP+VH+YPSP
Sbjct: 601 FPESRWKNPGQNHHFDQYFDAIQPSPYVHSYPSP 625
BLAST of Sgr022650 vs. TAIR 10
Match:
AT3G52240.1 (unknown protein; Has 220 Blast hits to 193 proteins in 66 species: Archae - 0; Bacteria - 15; Metazoa - 53; Fungi - 33; Plants - 66; Viruses - 0; Other Eukaryotes - 53 (source: NCBI BLink). )
HSP 1 Score: 242.7 bits (618), Expect = 9.5e-64
Identity = 217/594 (36.53%), Postives = 295/594 (49.66%), Query Frame = 0
Query: 58 EDTTAMTIEFLRARLLSERSVSRSARQRADELAKRVAELEEQLKIVSLQRKMAEKATADV 117
+D T++TIEFLRARLL+ER+VS+SAR + D LA +VAELEEQLKIVSLQRK AE+ATADV
Sbjct: 18 QDPTSVTIEFLRARLLAERAVSKSARAKLDGLADKVAELEEQLKIVSLQRKKAEQATADV 77
Query: 118 LAILEDNGASDISETLDSNSDHETPRESKVGNDPAREDVNSSKSIHTRNEHEEFSGSDID 177
LAILE+NG +D+S+ DSNSDHE +S
Sbjct: 78 LAILEENGYNDVSDDYDSNSDHECYSQS-------------------------------- 137
Query: 178 TSPVLGGSLSWKGRNDSPHKCEKYKK-FSTRSRSSFTSIGSSSPKHRLGRSCRQIKRRDT 237
+ VLG SLSWKGR P +K K+ + R F SSP+HR GRSCRQI+R +
Sbjct: 138 -NSVLGKSLSWKGRRREPGSSDKIKENRNRRHHRGFECAYFSSPRHRQGRSCRQIRRGEA 197
Query: 238 RQLDGEQELKSEALVGSFQ------EIAPSACSEDSRN----CCVKGPKILRDGYEPHEK 297
R + ++ K + FQ E+ P + SR VKG L
Sbjct: 198 RNV--SEDYKRDGNPVEFQENGVRTEMLPQKNDQVSRTVVDVAVVKGDDSL--------- 257
Query: 298 TCSGPSQDHNSVENKDQDHDLDECEKGN----DMEKALECQAQLIDQYEAMEKAQREWEE 357
N + N + EKGN ++E+ALE +AQ+I +E ME+ QREWE+
Sbjct: 258 ---------NKLSNS------NGLEKGNSTDINLERALENRAQVIGSFEEMEETQREWEK 317
Query: 358 KFRENNNSTPDSCDPGNHSDITEERDEIRAQAPNLSGNAFLANEAKSQVAVNCVT-RDSS 417
FREN +S D CD GNHSD+T+E + +AQ+P L G+ + + ++ N V R+S
Sbjct: 318 NFRENKSSALDLCDVGNHSDVTDESNGEKAQSP-LQGSTVVPSLRDTRSIANEVDFRESF 377
Query: 418 QAQTNGLDPSSCADVEDLQDQNTNSISTSRSLEEFTFPMANV-KQCQESQESSEQEPSCT 477
+ ++G +S D+ NS SRS+E+ + + K ES S +P +
Sbjct: 378 ETLSHGSPDNSVTS----PDKCCNSCG-SRSVEQDAYSSRDKGKHISESPTSEYSQPQSS 437
Query: 478 SQLN-------HGLPHRPLSSHGGIDFYDQETPCSKND--LYALVPHEPPALNGVLEALK 537
+N P +S GG F T + D L ++ +P VL ALK
Sbjct: 438 KGINEHSSSTIRSPPVTQPNSRGGF-FGSTTTTIHEVDDPLVSVKDVKPDTCETVLTALK 497
Query: 538 QAKLSLAKKIKKL-------------PSVEGESIGTLSVP--------------KVGGRL 597
QAKLSL +K+ L PS G + T ++P VG +
Sbjct: 498 QAKLSLQEKVNSLHIRNPDCHSESSYPSTPGSYMNTYALPMEPAFSTKPSLPASSVGSMV 545
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022134072.1 | 3.0e-285 | 82.53 | uncharacterized protein LOC111006434 [Momordica charantia] >XP_022134074.1 uncha... | [more] |
XP_038885028.1 | 1.1e-282 | 83.39 | uncharacterized protein LOC120075573 [Benincasa hispida] | [more] |
TYK11749.1 | 2.7e-278 | 77.96 | uncharacterized protein E5676_scaffold304G00680 [Cucumis melo var. makuwa] | [more] |
XP_004140985.1 | 1.8e-277 | 81.44 | uncharacterized protein LOC101207733 [Cucumis sativus] >KGN46025.1 hypothetical ... | [more] |
XP_008456583.1 | 3.8e-272 | 80.19 | PREDICTED: uncharacterized protein LOC103496496 [Cucumis melo] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1BXR7 | 1.4e-285 | 82.53 | uncharacterized protein LOC111006434 OS=Momordica charantia OX=3673 GN=LOC111006... | [more] |
A0A5D3CNC8 | 1.3e-278 | 77.96 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A0A0K8I3 | 8.5e-278 | 81.44 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G045040 PE=4 SV=1 | [more] |
A0A1S3C3K3 | 1.8e-272 | 80.19 | uncharacterized protein LOC103496496 OS=Cucumis melo OX=3656 GN=LOC103496496 PE=... | [more] |
A0A6J1FEU7 | 2.0e-266 | 79.34 | uncharacterized protein LOC111445070 OS=Cucurbita moschata OX=3662 GN=LOC1114450... | [more] |
Match Name | E-value | Identity | Description | |
AT3G52240.1 | 9.5e-64 | 36.53 | unknown protein; Has 220 Blast hits to 193 proteins in 66 species: Archae - 0; B... | [more] |