Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGAACAAAATCTGCTGATATACCTTGAGAAATTTCGGCCATCGCTTGATAATAACAATCAAATTACATTTGGTGGTAAAGTTACAATACTATATTTCTTCACACTTAAACTCATCACAATATATATATCTATGATTTGTGTCTCTTCTATCTTCTATCTTTTGCAGCTGTTGACAAATCTCAATTTAAGATGGCCACAAGAGATTATTATTGTGTGTTAAACGGTGGACAAGGACATTCCTACAAAGATATGTCCAAAAGACTGTGTAAGAATGTCACAACCACAAATTTAACTTCAATAAAATAATTAATTATTAACCTTATATTAAAAGTACTAAATATCTATATATATTGTAATTATGGTTTAAAAATTTGCAGTGCAGTTGAGTGCTACATTGATGGGGCACGAAAATGAATATGTTGGCATATTACAAGGAAAGAGCTCAAAGAAGAAAGAGGATGTATTACAAAACATAGGTAAATTTCGTTACATATGAATTATTTTTACAGAGAGTATTGACGTAACTTTTTGTTGGTTAATATAATCAATTCTAGATTTGCAGTCCTTTCCAGAAGAGAATTGGAGGATGCAGAAAAGGTATATTTAATCTCACGATGGAGACATGTTCCCGTGCGTCCTCTATTTTCCAATTATTCTTTGGATTATACCAGATTATTATCTCATATGAGTAAGTGGAAGAGTAAATTTAGGGATGTTTTTAAATAGAGTTTGAAAATATTTATAAAATATAGTAAAATTTTACTTGTTATTTTTATAAATATTTTAACAATAATTTTCTGTAAATTTATTTATTATTATTATTAGAGGTAGAGTAAAATTTCATTTTGTTTAAAATGTTTTTGCCCAATCTTAATATTTGGATTTTTTTATTCTTGAGTTTTGAAAATTAAAGAGACTTCTTCTTAATTTCTCACAATGTTTTGCACAGTTATGATTTTTTAGTCAAATTCCAACAACAAAAACAAGTTTTTAAATACTTGTTTTAGCTTGCAAAATCTAGCTTAGTTTTTTAAAACATTGATAAATACTAGATAACAAATTAAAAAATTTAGAAGTGGAAGGAGGTTTTATAGGCTTAATTTAAAAAAAAAAAAAAAAAAAAAAAATCAAATAGTTAACAAACCAAACCTTAGCTAATTGAATATTTGTTTAATAGCTATCTATCTTAGCAAATAAACTTCTTTTGCGTATATATATAAATATATAAAAGGATACGTAGTGTGACACTCCCATTTAGATTACTTTATTCAACTTACAATAATAGGCAGTTTAGATGGGGAATGAGAATCTATATAGAATAAGAAAATAAATTTTGTAACAAAATATTTATTCAAACTACACTTGAGAAGCCTGTGGAAAAGTAATGGTAAAAGAATGAGCTAATAAAACCCAATCAGTGGTAAATTGAATAGTAAAATAATTCTGAGTGTAAATAAAAGTTCAAACTTTGTATAATTTAAACATCAGATATATCTGTTAAATAATTTTGAAAAGCAAACACATAATAATCATTAATTAGCTGATAGAAATATTACCAAATCTATACTCAAATCTGTAGTGATATGGAAATTGGAAATTTTAGGACAAAACCATTGGGAAACGCAGAATCAGTAGCTAATAAAAAAGAAAAAAAAAAAAAGTTTCCGTTGAAAATATATTGATGGCAAGTTTAAAGGCGTAAAGAAGCTGATATTTTTATTCTTGTAAAACAAAACCATTGGGAACCTTTACTAATAATAATTGGTTTACCAGCAACTGCGAAAACAGAAATGATGAAATTGGAATGTGAATATATTGAGTTATTGCAAGATGGGGCGACTTTCTCAGAGGATATACTCTCACAATACCCTGAAGATATATTGCAAGGCGCAGGTAAACTTTAATTACACCAAATAAATTTATCTTATTTGTTTACTATTGAGGCAAGAAATGGAGTTGTGAACTCTAAAGAGTCAATATAAGATATAAGATTGATGT
TTTTTAATACTAATTGAGACTCACAAACTCATTCCACCAAATAAAGGAGTAGAGTTCACAATTCCACTTTTCTCAACTCCTACTCTCTGGTTCCTCCGCCCCAAACACCTTTTTAATATGTGAGGTAGTTGCAAATATAACAATCAAACTCAAAATATTAACAGGTATAACACAATGCAAAAGAATTTACATATGTAGCAAAATTTAGATATCCAACTCTTGAAGTCTAGAGTCTATGACCAATAAATTTTGCTATATTTGAAGTTGTTTAAAAATGTTACTATATATTTAATTATTAGCTATAAAAATACTACCCATCTCAACTACCCTAATATATTTCAATATTTGTAGTAAGTATGGAAAGGGAGTTATTGGTCTTGGACCATCCAGAGCTTATTGAGATGATCACAGGAGAGAACCACTCCTCAATGCCTGATTCTCTCCGCAGCCTAGCAAGTGTAACAGACACGATGAGTAAGTTAAAGTAGTCAAAATTTAACATATATATATATAAATAACAGACATCGGATGTGATTAATATTAATATAGTATTACTGTTGGGACAGAACATGCGAGATCATATTCATTGTGGGAACAACGAGAAACGTACCCAAATTTTGTTGAAGACCTCAACATAGACTACAAAATAAATCGACTAGAAAATCTTGGTAATTGATTTATGATTTAATTATTTTAATTTTTATTGGATACGATTCGAATTCATTTTATTTTAAGGTTTGATCCATTGGGTATATGATTTGGCAGTAAATTCGAGGAAAGGATATTTAGCGCTAGAAGACGAATACTTACGGCTCTTACGAGAGAGACATACCAAGAACCATGTCCACCAAATTCATTTAGAATACATTTGGGAACATATATTGAAGTTAACATGTAAGTTAATTAAAGGGTTGTTTTCAAATATAACAAAATGAACTAACTTATATATAAATATAGTAAAATTATGTTACTATCTGTCACAATATACACTAATAAACTATTATCATCCAATCCATCACTATCTATTAGCGATAGTTTTATCAATAAATATTTATGTAAATAAAAATTATAAAAGTTATTTAATTAAATTACTTTTACTTTTCAACTAAATTATATATATATAATTTACATACATACATTGATTTATTCAACCTTTCAATTCATGGAAATATGTGATTTGCAGTATCTTGGAAGAAGAATTTGAGATTGATGGAAGATGCTCTTGCTCCCGACCGTAGGGAGAAACACCCACTCGAATTTGGTAAGTTTATAGCAAATATACAAGAATACACTATTTTGTTTTGTATTGAAACTTATATTATTAATTTTTAGATGAGTTTGTTAATGTAATTCAAAGAGAATACAAGATAAGGAAGTTATACCTAAATTATTTTAGGATTATACTCTAATTAATTTAAGATTTTATTTAACCTAGAAATAATTCTTTAAATTACGTTAACATTAACAGAATACGCAGAGGGGATAAAGAAAATGATTAAGGAAGCAGAACAGGAATATTATGAGCTTATACAGAAGCAACATTTAACCATCCTCAATGGCATTTCAACAATGGATGCGACTCAAGTATATGCTAAATACATTCATAAAGAACTACTGGCTCGAATAGGTAAATTAAGAATACATTTAAAAGATTAAATTAAGTGCTAGTATATAAATTATGATTAATTCAATTAATTTATTTTGCAGCATATATGAGAAGAAAAATCAAGCCGTTGGAAAGGGAGCATAATCACATCCTAAGAAAAGTTATGTACGACCGAATAGACAAACAAGCCATTGTAAGTCGAGGTAATAATAGCTACATATTTAAGTCCAAAATAATATAATCTATAGCACAATGATAAAATAATTGCATAAATAGCACAAAAATTGGTAAAAAACTAAAAAATCCAAGCCCAACCACCATTTTGCACGCTTCTTCTCGATTTTCTCGAATTGAAAGCTATCTTTGATAGCAACTATCAATGCTATCAATGCTA
TCAATGATAGCCACTGATAGCTGCTACCAGTTGTTATCACTGATAGCCGCTATCAGCGATAACTTTCAATTTGAGAAATGTGTTATCAGTTATAGCCTTCAATTTGAAAAATGTGTTATAATTTGTTATCAGCGATAGCTGCTATCAGTGATAGCTTTCAATTTGAGAAATATTAAGTTTGATGTCAATGATACATGATTATCACTGATATCTACCTAAGTTTTATCACCAATATCATTGATAATTAATATGCTGATATATAATTATTACTGATATACGGATTTCACTAATATCAACCTAAGGTATATCACTGTTATCATTGATCATTGATATTAGTATTACCAATACACGAATATCACTGATATCTACCTAAGTCATACCACTGATGTCTACCTAAGATATATCACTTAAATCAATAATATTTTGACATTATGGCTATCAATGATATCACGACTATTATTGATATAAGGATATCACTAATATTTACCCAAGTTCTATTATTGATACATGGATATCATTGATATATACCTAAGTTATATCACTAATTTTCATTGATAATTGATATAACTGATACATGGATATCACTAATATCTGCCTAAATCTTATCACTAATATCATTGATGTCATTGATATCTACCTAAATCAAATCACTAATATAAAGATATAACTGGTATTTACCTAAGTTATATCACTAATATCATCGATTTTAAACACTAGAATTGCCAAATTTATATTTTTACATTCAAATAGGACTTTAGTATATCAATGGCTATCATCGATAGACTATTATTATGTCATAATAATTGAAGTCTATTGCTGATAGCTACTAATAGATTTTAGAGGTTTTAGACTTTTAGTTTGAACTTAAGTATTTACTAATAAATTATTATACATTAGTTGATATAAGTCTATAAGGAATAGACTTTTATCACTGATTTAAGTAGTCTGTCAATGATAGAACCATATCACTGATAGAGTATGACTTAGTCTATCATAGGCTATCAATGATAATAGTGTATTACTAATAGATTTCATGCTTTTAAACTCCTACAAAAAAAAAAAAAAAAAAAAATTACGGGCCCGTTTGATGATGTTCTCATTTCTCATTTTTTGAGAAACAAACTTTGTTTGATAAGCATTATTTTGTTTCTTATTTCTAATTTTAAAAAGCAGTTCTAAAAAATGTGCAAAATCAATAATTTAAAAAATTGGTTTCTTTTAAATTCGTTTTCATTTGGTATTTATATTTATTGTTTCGTTTCAAATATAAAATTTGAAAACTGACTTACTGGGGCTAAGGCACTTATAAAAATGATGATGATGATTACGAACTCTCAAATCTTGAGTCTGAGACTCCAACTCGTCTATTATTATTTTTTAAGTAAAAAGATAGATACTTTCATTAGGGAGAGAGGGAGATAAATTTGTGTTTTTTAAGTAAAAAGTTAATTATTTGATGAAGGAGAGAGAAGGAGATAAATTTGTGAGTGGGAGGGGGAGAGAGAGAAGGGATAACAATCGGGGTGGGAGAGGAAGAAGTGGGAGAGAGGAGTGAGAAGGGGAGAAATTTGGGGAGACATAGAGAAAATCGGGGAGAGAGGAGCGAAAGGGAGAGAGAAAATAGGAGTGAAGGGGAGAGAGAAGCTCAAATTTGGAATTTTAAAAAAATTTAAAATTTGATTAGTCAACCTTTCCTATATTTACAAATATTTTAAAGATGTGCTATATTTTTAAATTATTTTTCTAATTGTGCTATATGCTATAATTTTTCAATAATTTATGAAAAAATATTTTCAATTGAATTATATTTTTTAAAATAAGATTTTGTTTCAATTTTAATTTACCAAAATTATTTTAAATATATTTGTAAATATTTTTGTTCATTTTTTATATTTGAAAAGAATATTTTTATTTTGCTATATTTGTAAATATTTTAATTTATTTTTTTATTTGAAAACAACCTTTTTAATTTTTAATGAGTTGATTATCGATGCAAAGGA
ATCACATTGGAGTTATGGAAGGCAGTACGCCAATATTTTGATCGCTTGATATACAACATTCAAACAAGGAATTTCCTTCAACCACCTGTAAAGATGGAAGGTATTTAATTGACATAGTGTAATAAGGGATTTGGTAGTTAATTAATTGATTATTAATTTGTTGAACCCATTATTCTACCAGCAGATTCTGAATCATTGAATAACAGTGTTGAAGAAGAAGAAGAATTCATGACAACGTTATATTATTCAGTCAAATCGCTTAGCATCTATTGTGTGGAGTGTATGAGTTGTATCAACAATTTAATTCAACAAGCTTTGGGATTAGAAGATGACGATGATTATAAATTAAGTCCCAAAATTTCATATGAAGGTGAAAAGTGTAATGAAGTTGTTGATAAGTTTCTATATTTGCGTAGCATATATTGTCCTAATTGCATCAGGGTTGTGAGGAGGGTCACACTTCAACATGTTCCTTGGCAAACTTCTACAACTTCCATTCCTAATCAAGGTGAGTTTACTGTTAGAGAGAATAGTTGGGATGATATATATCAAGGATACCTCTAGTGTTAAAAAGTTTTTATAGCGTACTTTTCTTTAAACTCTAAGGTTAAAACATAAATATTACTTTATTTCTTGACTTTAGTATTGAATGTAATCATTTGTAGCTTCTAATTAATCCTTTGAGTTCTTCAATATGAATTTTTGGAATATTTGAATGTTATGCAGTTTTGTAGAGAGACTATTTTTCCTAGGTCATCAAACTAGGATTCTTAGAGACTTCTATTGCCACCTATAAATATAGGTGTTTTCTATCCTTTGGAAGCAAGTAACATTTAGATTAGACTAATATCAATAGCCTTCTTTTCTAAGGAATTACCATCGCAGTCCACTGTATTCCACATCTATATATTGTCTAATACAACCAAATTTTCCTTACATAAATTACTGTCTTATCACAAACTCTCCTGCATAATTACAAAGCTAGCCTTGAATGTCTCCTCCACCGTCAAATAACATTTCTTCCATCAAGCAATGACTATCTCCACAAATCAACATGAGTTGTTTTTTTAGTATGTTTTGTTCTTACTCACATGCTTCTTAAGAAAATTTTTAGAAAGTCACCCAACATAGAATTTATTCCAAATCAAACACACACATATCTTTGAATGATACTTTCATAGCCTAAGATCCTCTTTTCATTGAGAATATGGTCTCGATTCATTATTCATGTAATACCCACTGGGGTATTACATGTACACCAATTTCTGCCTTGGTTCATTTCTAAAACTATATCCTACTACTACAAGAGGTTCTGCTCTAATCCAAATCCTTTTTTAGGGGAGTATTAGAGTTTAGTTAATTATATATATTATTTTACCTTTGTTAGTTGGGTTAATTACAGTTTGTCGTAAATAGTTGTTTCTCAACTGATTGACACTTGTATCATCACTACCTATTTATACGCGAGCTTTGTGATTCAATAAAGTAAGAAACTTGTATCATCACTGTGTTTCTGAGTACATTATTATTAAATAAGGTGGTGAAGCAATATGCAATGAAGAAGGTTCAGTTATCACAATGCCACCCAATGAAACAAAAACTATTATTCGACAAAGCAGTAGAGTTGGAAGAAGGTGGAAGATTGGAAGTTGTATAGTAGTTGGTGGTTTAATGGAATCAATTACAAGTTTAGTTATTGTAACTGCAGCGACAACTGCTAATATGTCTAACGGTAATTATGGCCTCATTTGGTAAATATTTTATTTTTTGTTTTTAATTTTTGAAAATTATCTACTTTTCATCTTTATTTTCAAAAACTAAGTCAGATTTTAAAAACTAGAAAAAATTGTTTTAAGAATTTTTTTTTTCTCTTTTAAATTTGGCTAACAATTCAACTCTTCTACTTAAGATATACAAATCATAGTAAGAAAATGAGAGAAAATAAACTTAAATTTTAAAAACCAAAAACTAAAAACGAAAGGGTTACCAAATAAAGTTTT
TTTTTCAATTCTGTTTTCCCTTTTTTCTTTTATCTTAAATAATTAATTAGGTGATCTCTAGTTGATCTTGGTTTGGGATTATTCTTGCAGGAAGTATATATGTTAGCTTTGACATTCCAAATTTGATTACTAGATTCTTCATTGTTAGGCACGATGTAAGTAATTTCTTTCTCTATTGTATGGAATCAATTATACGTTCTGGTCATTTATATTTTAATTTCTCCACACCTAGGCTTTTAATTTTTCTTAGGACAATTGA
mRNA sequence
ATGGAACAAAATCTGCTGATATACCTTGAGAAATTTCGGCCATCGCTTGATAATAACAATCAAATTACATTTGGTGCTGTTGACAAATCTCAATTTAAGATGGCCACAAGAGATTATTATTGTGTGTTAAACGGTGGACAAGGACATTCCTACAAAGATATGTCCAAAAGACTGTTGCAGTTGAGTGCTACATTGATGGGGCACGAAAATGAATATGTTGGCATATTACAAGGAAAGAGCTCAAAGAAGAAAGAGGATGTATTACAAAACATAGTCCTTTCCAGAAGAGAATTGGAGGATGCAGAAAAGGTATATTTAATCTCACGATGGAGACATGTTCCCGTGCGTCCTCTATTTTCCAATTATTCTTTGGATTATACCAGATTATTATCTCATATGACAACTGCGAAAACAGAAATGATGAAATTGGAATGTGAATATATTGAGTTATTGCAAGATGGGGCGACTTTCTCAGAGGATATACTCTCACAATACCCTGAAGATATATTGCAAGGCGCAGTAAGTATGGAAAGGGAGTTATTGGTCTTGGACCATCCAGAGCTTATTGAGATGATCACAGGAGAGAACCACTCCTCAATGCCTGATTCTCTCCGCAGCCTAGCAAGTGTAACAGACACGATGAAACATGCGAGATCATATTCATTGTGGGAACAACGAGAAACGTACCCAAATTTTGTTGAAGACCTCAACATAGACTACAAAATAAATCGACTAGAAAATCTTGTATCTTGGAAGAAGAATTTGAGATTGATGGAAGATGCTCTTGCTCCCGACCGTAGGGAGAAACACCCACTCGAATTTGAATACGCAGAGGGGATAAAGAAAATGATTAAGGAAGCAGAACAGGAATATTATGAGCTTATACAGAAGCAACATTTAACCATCCTCAATGGCATTTCAACAATGGATGCGACTCAAGTATATGCTAAATACATTCATAAAGAACTACTGGCTCGAATAGCATATATGAGAAGAAAAATCAAGCCGTTGGAAAGGGAGCATAATCACATCCTAAGAAAAGTTATGTACGACCGAATAGACAAACAAGCCATTGTAAGTCGAGGAATCACATTGGAGTTATGGAAGGCAGTACGCCAATATTTTGATCGCTTGATATACAACATTCAAACAAGGAATTTCCTTCAACCACCTGTAAAGATGGAAGATTCTGAATCATTGAATAACAGTGTTGAAGAAGAAGAAGAATTCATGACAACGTTATATTATTCAGTCAAATCGCTTAGCATCTATTGTGTGGAGTGTATGAGTTGTATCAACAATTTAATTCAACAAGCTTTGGGATTAGAAGATGACGATGATTATAAATTAAGTCCCAAAATTTCATATGAAGGTGAAAAGTGTAATGAAGTTGTTGATAAGTTTCTATATTTGCGTAGCATATATTGTCCTAATTGCATCAGGGTTGTGAGGAGGGTCACACTTCAACATGTTCCTTGGCAAACTTCTACAACTTCCATTCCTAATCAAGGTGGTGAAGCAATATGCAATGAAGAAGGTTCAGTTATCACAATGCCACCCAATGAAACAAAAACTATTATTCGACAAAGCAGTAGAGTTGGAAGAAGGTGGAAGATTGGAAGTTGTATAGTAGTTGGTGGTTTAATGGAATCAATTACAAGTTTAGTTATTGTAACTGCAGCGACAACTGCTAATATGTCTAACGGAAGTATATATGTTAGCTTTGACATTCCAAATTTGATTACTAGATTCTTCATTGTTAGGCACGATGACAATTGA
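The gene sequence and the mRNA sequence agree up to the end of the first exon, so comparing them from the start shows where the first intron begins. The sketch below is a minimal illustration in Python and not part of the database record. The function name `common_prefix_length` is hypothetical, and the two short excerpts are copied from the region around the first exon/intron junction of the sequences above. A shared prefix can overshoot the true boundary if the next exon happens to begin with the same bases as the intron, so treat the result as a hint rather than an annotation.

```python
def common_prefix_length(a: str, b: str) -> int:
    """Return the number of leading characters shared by a and b."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


# Excerpts around the first exon/intron junction, copied from the
# gene and mRNA sequences above.
gene_excerpt = "ACATTTGGTGGTAAAGTTACA"  # genomic: exon end, then intron
mrna_excerpt = "ACATTTGGTGCTGTTGACAAA"  # mRNA: exon end, then next exon

k = common_prefix_length(gene_excerpt, mrna_excerpt)
print(gene_excerpt[:k])        # bases shared by both, i.e. the exon portion
print(gene_excerpt[k:k + 2])   # first two bases where the gene diverges
```

In this excerpt the genomic sequence diverges at `GT`, which matches the canonical GT dinucleotide at the 5' end of an intron.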
Coding sequence (CDS)
ATGGAACAAAATCTGCTGATATACCTTGAGAAATTTCGGCCATCGCTTGATAATAACAATCAAATTACATTTGGTGCTGTTGACAAATCTCAATTTAAGATGGCCACAAGAGATTATTATTGTGTGTTAAACGGTGGACAAGGACATTCCTACAAAGATATGTCCAAAAGACTGTTGCAGTTGAGTGCTACATTGATGGGGCACGAAAATGAATATGTTGGCATATTACAAGGAAAGAGCTCAAAGAAGAAAGAGGATGTATTACAAAACATAGTCCTTTCCAGAAGAGAATTGGAGGATGCAGAAAAGGTATATTTAATCTCACGATGGAGACATGTTCCCGTGCGTCCTCTATTTTCCAATTATTCTTTGGATTATACCAGATTATTATCTCATATGACAACTGCGAAAACAGAAATGATGAAATTGGAATGTGAATATATTGAGTTATTGCAAGATGGGGCGACTTTCTCAGAGGATATACTCTCACAATACCCTGAAGATATATTGCAAGGCGCAGTAAGTATGGAAAGGGAGTTATTGGTCTTGGACCATCCAGAGCTTATTGAGATGATCACAGGAGAGAACCACTCCTCAATGCCTGATTCTCTCCGCAGCCTAGCAAGTGTAACAGACACGATGAAACATGCGAGATCATATTCATTGTGGGAACAACGAGAAACGTACCCAAATTTTGTTGAAGACCTCAACATAGACTACAAAATAAATCGACTAGAAAATCTTGTATCTTGGAAGAAGAATTTGAGATTGATGGAAGATGCTCTTGCTCCCGACCGTAGGGAGAAACACCCACTCGAATTTGAATACGCAGAGGGGATAAAGAAAATGATTAAGGAAGCAGAACAGGAATATTATGAGCTTATACAGAAGCAACATTTAACCATCCTCAATGGCATTTCAACAATGGATGCGACTCAAGTATATGCTAAATACATTCATAAAGAACTACTGGCTCGAATAGCATATATGAGAAGAAAAATCAAGCCGTTGGAAAGGGAGCATAATCACATCCTAAGAAAAGTTATGTACGACCGAATAGACAAACAAGCCATTGTAAGTCGAGGAATCACATTGGAGTTATGGAAGGCAGTACGCCAATATTTTGATCGCTTGATATACAACATTCAAACAAGGAATTTCCTTCAACCACCTGTAAAGATGGAAGATTCTGAATCATTGAATAACAGTGTTGAAGAAGAAGAAGAATTCATGACAACGTTATATTATTCAGTCAAATCGCTTAGCATCTATTGTGTGGAGTGTATGAGTTGTATCAACAATTTAATTCAACAAGCTTTGGGATTAGAAGATGACGATGATTATAAATTAAGTCCCAAAATTTCATATGAAGGTGAAAAGTGTAATGAAGTTGTTGATAAGTTTCTATATTTGCGTAGCATATATTGTCCTAATTGCATCAGGGTTGTGAGGAGGGTCACACTTCAACATGTTCCTTGGCAAACTTCTACAACTTCCATTCCTAATCAAGGTGGTGAAGCAATATGCAATGAAGAAGGTTCAGTTATCACAATGCCACCCAATGAAACAAAAACTATTATTCGACAAAGCAGTAGAGTTGGAAGAAGGTGGAAGATTGGAAGTTGTATAGTAGTTGGTGGTTTAATGGAATCAATTACAAGTTTAGTTATTGTAACTGCAGCGACAACTGCTAATATGTCTAACGGAAGTATATATGTTAGCTTTGACATTCCAAATTTGATTACTAGATTCTTCATTGTTAGGCACGATGACAATTGA
Protein sequence
MEQNLLIYLEKFRPSLDNNNQITFGAVDKSQFKMATRDYYCVLNGGQGHSYKDMSKRLLQLSATLMGHENEYVGILQGKSSKKKEDVLQNIVLSRRELEDAEKVYLISRWRHVPVRPLFSNYSLDYTRLLSHMTTAKTEMMKLECEYIELLQDGATFSEDILSQYPEDILQGAVSMERELLVLDHPELIEMITGENHSSMPDSLRSLASVTDTMKHARSYSLWEQRETYPNFVEDLNIDYKINRLENLVSWKKNLRLMEDALAPDRREKHPLEFEYAEGIKKMIKEAEQEYYELIQKQHLTILNGISTMDATQVYAKYIHKELLARIAYMRRKIKPLEREHNHILRKVMYDRIDKQAIVSRGITLELWKAVRQYFDRLIYNIQTRNFLQPPVKMEDSESLNNSVEEEEEFMTTLYYSVKSLSIYCVECMSCINNLIQQALGLEDDDDYKLSPKISYEGEKCNEVVDKFLYLRSIYCPNCIRVVRRVTLQHVPWQTSTTSIPNQGGEAICNEEGSVITMPPNETKTIIRQSSRVGRRWKIGSCIVVGGLMESITSLVIVTAATTANMSNGSIYVSFDIPNLITRFFIVRHDDN
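The protein sequence is the conceptual translation of the CDS. A minimal Python sketch of that translation is given below, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are chosen here for illustration and are not part of the database record. Unknown or ambiguous codons are reported as `X`, and translation stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the CDS above.
print(translate("ATGGAACAAAATCTGCTG"))  # start of the CDS
print(translate("CACGATGACAATTGA"))     # end of the CDS, including the stop
```

Passing the full CDS string to `translate` and comparing the result with the protein sequence is a quick consistency check on a downloaded record.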
Homology
BLAST of CmUC09G168380 vs. NCBI nr
Match:
XP_038897356.1 (uncharacterized protein LOC120085457 isoform X3 [Benincasa hispida])
HSP 1 Score: 583.2 bits (1502), Expect = 2.6e-162
Identity = 360/670 (53.73%), Positives = 439/670 (65.52%), Query Frame = 0
Query: 1 MEQNLLIYLEKFRPSLDNNNQITFGAVDKSQFKMATRDYYCVLNGGQGHSYKDMSKRLLQ 60
MEQNLL +L++FR SLDN N+ITFG D+ QFK AT DY+ +L GGQG S+ D+ +LL+
Sbjct: 97 MEQNLLFHLKRFRSSLDNTNKITFGVNDRHQFKKATSDYFSLLIGGQGSSHIDLLTKLLR 156
Query: 61 LSATLMGHENEYVGILQGKSSKKKEDVLQNIVLSRRELEDAEKVYLISRWRHVP-----V 120
L TL+ ++NEY+ ILQGK SK KEDVLQNIV +RRELE AEKVYLISR R++P V
Sbjct: 157 LRDTLVEYQNEYINILQGKWSKNKEDVLQNIVQTRRELEYAEKVYLISRRRYIPKLSTGV 216
Query: 121 RPLFSNYSLDYTRLLSHMTTAKTEMMKLECEYIELLQDGATFSEDILSQYPEDILQGAVS 180
+ S YS DY LLS M K EMMKLE EY+ LL+ GATFS D+LS+Y EDILQ AVS
Sbjct: 217 DLISSKYSSDYDGLLSQMICRKEEMMKLEGEYVALLRAGATFSNDVLSRYSEDILQRAVS 276
Query: 181 MERELLVLDHPELIEMITGENHSSM----PDSLRSLASVTDTMKHARSYSLWEQRETYPN 240
+R+ L++D E+I GE S+ P R L V M R YSLWE+ + YPN
Sbjct: 277 SKRDSLIIDELSSNEII-GERTWSLFDVQPIYFRRLCDVISMMLSMRFYSLWERPDMYPN 336
Query: 241 FVEDLNIDYKINRLENLVS----------------------------------------- 300
VED IDYK NRLENLV+
Sbjct: 337 LVEDFKIDYKTNRLENLVNLRKEYLMVEDEYLWVLQERHTKNHANTSTSFSVHQIHSEYV 396
Query: 301 ----------WKKNLRLMEDALAPDRREKHPLEFEY-------AEGIKKMIKEAEQEYYE 360
WK +LRL+ED A EK+ ++ Y E +KK +KEAE+E E
Sbjct: 397 WEYLFDTILCWKNDLRLLEDTCARKLGEKYSNKWSYECDDLEEIEKLKKELKEAEEECCE 456
Query: 361 LIQKQHLTILNGISTMDATQVYAKYIHKELLARIAYMRRKIKPLEREHNHILRK--VMYD 420
LIQK+HLTILN I +MD TQ+YAKYIH+ELL I +R KIK LER+++ IL K +Y+
Sbjct: 457 LIQKKHLTILNDIPSMDVTQIYAKYIHQELLDEIVSLRVKIKMLERDYHRILSKPSYVYN 516
Query: 421 RIDKQAIVSRGITLE-LWKAVRQYFDRLIYNIQTRNFLQPP-VKMEDSESLNNSVEEEEE 480
+ D+Q S G E LW +QYFDRL YNI +NFLQPP +KME ESL+N+V EEEE
Sbjct: 517 QQDRQEFESVGTGRERLWDIEQQYFDRLKYNIAAKNFLQPPDLKMEAFESLDNNV-EEEE 576
Query: 481 FMTTLYYSVKSLSIYCVECMSCINNLIQQALGLEDDDDYKLSPKISYEGEK----CNEVV 540
F+TTLYYSVKS +IYCVECMS INNLI++ALGLEDD++ LS IS+EGE CN+VV
Sbjct: 577 FLTTLYYSVKSFNIYCVECMSYINNLIKRALGLEDDEE--LSAIISFEGENVGEMCNDVV 636
Query: 541 DKFLYLRSIYCPNCIRVVRRVTLQHVPWQT----STTSIPNQGGEAICNEEGSVITMPPN 592
DKFLY RSI CP CI+ VR T+Q T ++TSIPNQG EAICN+E S ITMP +
Sbjct: 637 DKFLYSRSICCPICIKAVRMATIQQKNTSTASGQTSTSIPNQGDEAICNQEDSTITMPLD 696
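The percentages in an HSP statistics line are the match counts divided by the alignment length, which includes gap columns. The Python sketch below recomputes them from the counts as a sanity check. It is an illustration only: the regular expression, `percent`, and `check_hsp_line` are hypothetical helpers written for this page's line layout, and the label before the second count is matched loosely so that spelling variants of it are accepted.

```python
import re

# Matches e.g. "Identity = 360/670 (53.73%), Positives = 439/670 (65.52%)".
# "Pos\w+" accepts any spelling of the second label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches: int, length: int) -> float:
    """Percentage of alignment columns that are matches, to 2 decimals."""
    return round(100.0 * matches / length, 2)


def check_hsp_line(line: str) -> bool:
    """Return True if the reported percentages agree with the counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ilen, ipct, pos, plen, ppct = m.groups()
    return (
        abs(percent(int(ident), int(ilen)) - float(ipct)) < 0.01
        and abs(percent(int(pos), int(plen)) - float(ppct)) < 0.01
    )


line = "Identity = 360/670 (53.73%), Positives = 439/670 (65.52%), Query Frame = 0"
print(percent(360, 670), percent(439, 670), check_hsp_line(line))
```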
BLAST of CmUC09G168380 vs. NCBI nr
Match:
XP_038897352.1 (uncharacterized protein LOC120085457 isoform X1 [Benincasa hispida] >XP_038897353.1 uncharacterized protein LOC120085457 isoform X1 [Benincasa hispida] >XP_038897354.1 uncharacterized protein LOC120085457 isoform X1 [Benincasa hispida])
HSP 1 Score: 580.9 bits (1496), Expect = 1.3e-161
Identity = 359/669 (53.66%), Positives = 438/669 (65.47%), Query Frame = 0
Query: 1 MEQNLLIYLEKFRPSLDNNNQITFGAVDKSQFKMATRDYYCVLNGGQGHSYKDMSKRLLQ 60
MEQNLL +L++FR SLDN N+ITFG D+ QFK AT DY+ +L GGQG S+ D+ +LL+
Sbjct: 97 MEQNLLFHLKRFRSSLDNTNKITFGVNDRHQFKKATSDYFSLLIGGQGSSHIDLLTKLLR 156
Query: 61 LSATLMGHENEYVGILQGKSSKKKEDVLQNIVLSRRELEDAEKVYLISRWRHVP-----V 120
L TL+ ++NEY+ ILQGK SK KEDVLQNIV +RRELE AEKVYLISR R++P V
Sbjct: 157 LRDTLVEYQNEYINILQGKWSKNKEDVLQNIVQTRRELEYAEKVYLISRRRYIPKLSTGV 216
Query: 121 RPLFSNYSLDYTRLLSHMTTAKTEMMKLECEYIELLQDGATFSEDILSQYPEDILQGAVS 180
+ S YS DY LLS M K EMMKLE EY+ LL+ GATFS D+LS+Y EDILQ AVS
Sbjct: 217 DLISSKYSSDYDGLLSQMICRKEEMMKLEGEYVALLRAGATFSNDVLSRYSEDILQRAVS 276
Query: 181 MERELLVLDHPELIEMITGENHSSM----PDSLRSLASVTDTMKHARSYSLWEQRETYPN 240
+R+ L++D E+I GE S+ P R L V M R YSLWE+ + YPN
Sbjct: 277 SKRDSLIIDELSSNEII-GERTWSLFDVQPIYFRRLCDVISMMLSMRFYSLWERPDMYPN 336
Query: 241 FVEDLNIDYKINRLENLVS----------------------------------------- 300
VED IDYK NRLENLV+
Sbjct: 337 LVEDFKIDYKTNRLENLVNLRKEYLMVEDEYLWVLQERHTKNHANTSTSFSVHQIHSEYV 396
Query: 301 ----------WKKNLRLMEDALAPDRREKHPLEFEY-------AEGIKKMIKEAEQEYYE 360
WK +LRL+ED A EK+ ++ Y E +KK +KEAE+E E
Sbjct: 397 WEYLFDTILCWKNDLRLLEDTCARKLGEKYSNKWSYECDDLEEIEKLKKELKEAEEECCE 456
Query: 361 LIQKQHLTILNGISTMDATQVYAKYIHKELLARIAYMRRKIKPLEREHNHILRK--VMYD 420
LIQK+HLTILN I +MD TQ+YAKYIH+ELL I +R KIK LER+++ IL K +Y+
Sbjct: 457 LIQKKHLTILNDIPSMDVTQIYAKYIHQELLDEIVSLRVKIKMLERDYHRILSKPSYVYN 516
Query: 421 RIDKQAIVSRGITLE-LWKAVRQYFDRLIYNIQTRNFLQPP-VKMEDSESLNNSVEEEEE 480
+ D+Q S G E LW +QYFDRL YNI +NFLQPP +KME ESL+N+V EEEE
Sbjct: 517 QQDRQEFESVGTGRERLWDIEQQYFDRLKYNIAAKNFLQPPDLKMEAFESLDNNV-EEEE 576
Query: 481 FMTTLYYSVKSLSIYCVECMSCINNLIQQALGLEDDDDYKLSPKISYEGEK----CNEVV 540
F+TTLYYSVKS +IYCVECMS INNLI++ALGLEDD++ LS IS+EGE CN+VV
Sbjct: 577 FLTTLYYSVKSFNIYCVECMSYINNLIKRALGLEDDEE--LSAIISFEGENVGEMCNDVV 636
Query: 541 DKFLYLRSIYCPNCIRVVRRVTLQHVPWQT----STTSIPNQGGEAICNEEGSVITMPPN 591
DKFLY RSI CP CI+ VR T+Q T ++TSIPNQG EAICN+E S ITMP +
Sbjct: 637 DKFLYSRSICCPICIKAVRMATIQQKNTSTASGQTSTSIPNQGDEAICNQEDSTITMPLD 696
BLAST of CmUC09G168380 vs. NCBI nr
Match:
XP_038897355.1 (uncharacterized protein LOC120085457 isoform X2 [Benincasa hispida])
HSP 1 Score: 580.9 bits (1496), Expect = 1.3e-161
Identity = 359/669 (53.66%), Positives = 438/669 (65.47%), Query Frame = 0
Query: 1 MEQNLLIYLEKFRPSLDNNNQITFGAVDKSQFKMATRDYYCVLNGGQGHSYKDMSKRLLQ 60
MEQNLL +L++FR SLDN N+ITFG D+ QFK AT DY+ +L GGQG S+ D+ +LL+
Sbjct: 97 MEQNLLFHLKRFRSSLDNTNKITFGVNDRHQFKKATSDYFSLLIGGQGSSHIDLLTKLLR 156
Query: 61 LSATLMGHENEYVGILQGKSSKKKEDVLQNIVLSRRELEDAEKVYLISRWRHVP-----V 120
L TL+ ++NEY+ ILQGK SK KEDVLQNIV +RRELE AEKVYLISR R++P V
Sbjct: 157 LRDTLVEYQNEYINILQGKWSKNKEDVLQNIVQTRRELEYAEKVYLISRRRYIPKLSTGV 216
Query: 121 RPLFSNYSLDYTRLLSHMTTAKTEMMKLECEYIELLQDGATFSEDILSQYPEDILQGAVS 180
+ S YS DY LLS M K EMMKLE EY+ LL+ GATFS D+LS+Y EDILQ AVS
Sbjct: 217 DLISSKYSSDYDGLLSQMICRKEEMMKLEGEYVALLRAGATFSNDVLSRYSEDILQRAVS 276
Query: 181 MERELLVLDHPELIEMITGENHSSM----PDSLRSLASVTDTMKHARSYSLWEQRETYPN 240
+R+ L++D E+I GE S+ P R L V M R YSLWE+ + YPN
Sbjct: 277 SKRDSLIIDELSSNEII-GERTWSLFDVQPIYFRRLCDVISMMLSMRFYSLWERPDMYPN 336
Query: 241 FVEDLNIDYKINRLENLVS----------------------------------------- 300
VED IDYK NRLENLV+
Sbjct: 337 LVEDFKIDYKTNRLENLVNLRKEYLMVEDEYLWVLQERHTKNHANTSTSFSVHQIHSEYV 396
Query: 301 ----------WKKNLRLMEDALAPDRREKHPLEFEY-------AEGIKKMIKEAEQEYYE 360
WK +LRL+ED A EK+ ++ Y E +KK +KEAE+E E
Sbjct: 397 WEYLFDTILCWKNDLRLLEDTCARKLGEKYSNKWSYECDDLEEIEKLKKELKEAEEECCE 456
Query: 361 LIQKQHLTILNGISTMDATQVYAKYIHKELLARIAYMRRKIKPLEREHNHILRK--VMYD 420
LIQK+HLTILN I +MD TQ+YAKYIH+ELL I +R KIK LER+++ IL K +Y+
Sbjct: 457 LIQKKHLTILNDIPSMDVTQIYAKYIHQELLDEIVSLRVKIKMLERDYHRILSKPSYVYN 516
Query: 421 RIDKQAIVSRGITLE-LWKAVRQYFDRLIYNIQTRNFLQPP-VKMEDSESLNNSVEEEEE 480
+ D+Q S G E LW +QYFDRL YNI +NFLQPP +KME ESL+N+V EEEE
Sbjct: 517 QQDRQEFESVGTGRERLWDIEQQYFDRLKYNIAAKNFLQPPDLKMEAFESLDNNV-EEEE 576
Query: 481 FMTTLYYSVKSLSIYCVECMSCINNLIQQALGLEDDDDYKLSPKISYEGEK----CNEVV 540
F+TTLYYSVKS +IYCVECMS INNLI++ALGLEDD++ LS IS+EGE CN+VV
Sbjct: 577 FLTTLYYSVKSFNIYCVECMSYINNLIKRALGLEDDEE--LSAIISFEGENVGEMCNDVV 636
Query: 541 DKFLYLRSIYCPNCIRVVRRVTLQHVPWQT----STTSIPNQGGEAICNEEGSVITMPPN 591
DKFLY RSI CP CI+ VR T+Q T ++TSIPNQG EAICN+E S ITMP +
Sbjct: 637 DKFLYSRSICCPICIKAVRMATIQQKNTSTASGQTSTSIPNQGDEAICNQEDSTITMPLD 696
BLAST of CmUC09G168380 vs. NCBI nr
Match:
XP_038897359.1 (uncharacterized protein LOC120085457 isoform X5 [Benincasa hispida])
HSP 1 Score: 580.9 bits (1496), Expect = 1.3e-161
Identity = 359/669 (53.66%), Positives = 438/669 (65.47%), Query Frame = 0
Query: 1 MEQNLLIYLEKFRPSLDNNNQITFGAVDKSQFKMATRDYYCVLNGGQGHSYKDMSKRLLQ 60
MEQNLL +L++FR SLDN N+ITFG D+ QFK AT DY+ +L GGQG S+ D+ +LL+
Sbjct: 97 MEQNLLFHLKRFRSSLDNTNKITFGVNDRHQFKKATSDYFSLLIGGQGSSHIDLLTKLLR 156
Query: 61 LSATLMGHENEYVGILQGKSSKKKEDVLQNIVLSRRELEDAEKVYLISRWRHVP-----V 120
L TL+ ++NEY+ ILQGK SK KEDVLQNIV +RRELE AEKVYLISR R++P V
Sbjct: 157 LRDTLVEYQNEYINILQGKWSKNKEDVLQNIVQTRRELEYAEKVYLISRRRYIPKLSTGV 216
Query: 121 RPLFSNYSLDYTRLLSHMTTAKTEMMKLECEYIELLQDGATFSEDILSQYPEDILQGAVS 180
+ S YS DY LLS M K EMMKLE EY+ LL+ GATFS D+LS+Y EDILQ AVS
Sbjct: 217 DLISSKYSSDYDGLLSQMICRKEEMMKLEGEYVALLRAGATFSNDVLSRYSEDILQRAVS 276
Query: 181 MERELLVLDHPELIEMITGENHSSM----PDSLRSLASVTDTMKHARSYSLWEQRETYPN 240
+R+ L++D E+I GE S+ P R L V M R YSLWE+ + YPN
Sbjct: 277 SKRDSLIIDELSSNEII-GERTWSLFDVQPIYFRRLCDVISMMLSMRFYSLWERPDMYPN 336
Query: 241 FVEDLNIDYKINRLENLVS----------------------------------------- 300
VED IDYK NRLENLV+
Sbjct: 337 LVEDFKIDYKTNRLENLVNLRKEYLMVEDEYLWVLQERHTKNHANTSTSFSVHQIHSEYV 396
Query: 301 ----------WKKNLRLMEDALAPDRREKHPLEFEY-------AEGIKKMIKEAEQEYYE 360
WK +LRL+ED A EK+ ++ Y E +KK +KEAE+E E
Sbjct: 397 WEYLFDTILCWKNDLRLLEDTCARKLGEKYSNKWSYECDDLEEIEKLKKELKEAEEECCE 456
Query: 361 LIQKQHLTILNGISTMDATQVYAKYIHKELLARIAYMRRKIKPLEREHNHILRK--VMYD 420
LIQK+HLTILN I +MD TQ+YAKYIH+ELL I +R KIK LER+++ IL K +Y+
Sbjct: 457 LIQKKHLTILNDIPSMDVTQIYAKYIHQELLDEIVSLRVKIKMLERDYHRILSKPSYVYN 516
Query: 421 RIDKQAIVSRGITLE-LWKAVRQYFDRLIYNIQTRNFLQPP-VKMEDSESLNNSVEEEEE 480
+ D+Q S G E LW +QYFDRL YNI +NFLQPP +KME ESL+N+V EEEE
Sbjct: 517 QQDRQEFESVGTGRERLWDIEQQYFDRLKYNIAAKNFLQPPDLKMEAFESLDNNV-EEEE 576
Query: 481 FMTTLYYSVKSLSIYCVECMSCINNLIQQALGLEDDDDYKLSPKISYEGEK----CNEVV 540
F+TTLYYSVKS +IYCVECMS INNLI++ALGLEDD++ LS IS+EGE CN+VV
Sbjct: 577 FLTTLYYSVKSFNIYCVECMSYINNLIKRALGLEDDEE--LSAIISFEGENVGEMCNDVV 636
Query: 541 DKFLYLRSIYCPNCIRVVRRVTLQHVPWQT----STTSIPNQGGEAICNEEGSVITMPPN 591
DKFLY RSI CP CI+ VR T+Q T ++TSIPNQG EAICN+E S ITMP +
Sbjct: 637 DKFLYSRSICCPICIKAVRMATIQQKNTSTASGQTSTSIPNQGDEAICNQEDSTITMPLD 696
BLAST of CmUC09G168380 vs. NCBI nr
Match:
XP_038897357.1 (uncharacterized protein LOC120085457 isoform X4 [Benincasa hispida])
HSP 1 Score: 580.9 bits (1496), Expect = 1.3e-161
Identity = 359/669 (53.66%), Positives = 438/669 (65.47%), Query Frame = 0
Query: 1 MEQNLLIYLEKFRPSLDNNNQITFGAVDKSQFKMATRDYYCVLNGGQGHSYKDMSKRLLQ 60
MEQNLL +L++FR SLDN N+ITFG D+ QFK AT DY+ +L GGQG S+ D+ +LL+
Sbjct: 97 MEQNLLFHLKRFRSSLDNTNKITFGVNDRHQFKKATSDYFSLLIGGQGSSHIDLLTKLLR 156
Query: 61 LSATLMGHENEYVGILQGKSSKKKEDVLQNIVLSRRELEDAEKVYLISRWRHVP-----V 120
L TL+ ++NEY+ ILQGK SK KEDVLQNIV +RRELE AEKVYLISR R++P V
Sbjct: 157 LRDTLVEYQNEYINILQGKWSKNKEDVLQNIVQTRRELEYAEKVYLISRRRYIPKLSTGV 216
Query: 121 RPLFSNYSLDYTRLLSHMTTAKTEMMKLECEYIELLQDGATFSEDILSQYPEDILQGAVS 180
+ S YS DY LLS M K EMMKLE EY+ LL+ GATFS D+LS+Y EDILQ AVS
Sbjct: 217 DLISSKYSSDYDGLLSQMICRKEEMMKLEGEYVALLRAGATFSNDVLSRYSEDILQRAVS 276
Query: 181 MERELLVLDHPELIEMITGENHSSM----PDSLRSLASVTDTMKHARSYSLWEQRETYPN 240
+R+ L++D E+I GE S+ P R L V M R YSLWE+ + YPN
Sbjct: 277 SKRDSLIIDELSSNEII-GERTWSLFDVQPIYFRRLCDVISMMLSMRFYSLWERPDMYPN 336
Query: 241 FVEDLNIDYKINRLENLVS----------------------------------------- 300
VED IDYK NRLENLV+
Sbjct: 337 LVEDFKIDYKTNRLENLVNLRKEYLMVEDEYLWVLQERHTKNHANTSTSFSVHQIHSEYV 396
Query: 301 ----------WKKNLRLMEDALAPDRREKHPLEFEY-------AEGIKKMIKEAEQEYYE 360
WK +LRL+ED A EK+ ++ Y E +KK +KEAE+E E
Sbjct: 397 WEYLFDTILCWKNDLRLLEDTCARKLGEKYSNKWSYECDDLEEIEKLKKELKEAEEECCE 456
Query: 361 LIQKQHLTILNGISTMDATQVYAKYIHKELLARIAYMRRKIKPLEREHNHILRK--VMYD 420
LIQK+HLTILN I +MD TQ+YAKYIH+ELL I +R KIK LER+++ IL K +Y+
Sbjct: 457 LIQKKHLTILNDIPSMDVTQIYAKYIHQELLDEIVSLRVKIKMLERDYHRILSKPSYVYN 516
Query: 421 RIDKQAIVSRGITLE-LWKAVRQYFDRLIYNIQTRNFLQPP-VKMEDSESLNNSVEEEEE 480
+ D+Q S G E LW +QYFDRL YNI +NFLQPP +KME ESL+N+V EEEE
Sbjct: 517 QQDRQEFESVGTGRERLWDIEQQYFDRLKYNIAAKNFLQPPDLKMEAFESLDNNV-EEEE 576
Query: 481 FMTTLYYSVKSLSIYCVECMSCINNLIQQALGLEDDDDYKLSPKISYEGEK----CNEVV 540
F+TTLYYSVKS +IYCVECMS INNLI++ALGLEDD++ LS IS+EGE CN+VV
Sbjct: 577 FLTTLYYSVKSFNIYCVECMSYINNLIKRALGLEDDEE--LSAIISFEGENVGEMCNDVV 636
Query: 541 DKFLYLRSIYCPNCIRVVRRVTLQHVPWQT----STTSIPNQGGEAICNEEGSVITMPPN 591
DKFLY RSI CP CI+ VR T+Q T ++TSIPNQG EAICN+E S ITMP +
Sbjct: 637 DKFLYSRSICCPICIKAVRMATIQQKNTSTASGQTSTSIPNQGDEAICNQEDSTITMPLD 696
BLAST of CmUC09G168380 vs. ExPASy TrEMBL
Match:
A0A6J1GQF8 (uncharacterized protein LOC111456533 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111456533 PE=3 SV=1)
HSP 1 Score: 316.6 bits (810), Expect = 2.2e-82
Identity = 244/664 (36.75%), Positives = 350/664 (52.71%), Query Frame = 0
Query: 1 MEQNLLIYLEKFRPSLDNNNQITFGAVDKSQFKMATRDYYCVLNGGQGHSYKDMSKRLLQ 60
ME++LL L + S +ITF V + + + + DY+ +L GG S DM ++LL+
Sbjct: 137 MERSLLFRLNPEKFSSYGRYEITF-MVGRLRLRDLSTDYFWLLQGGVERSCIDMMEKLLR 196
Query: 61 LSATLMGHENEYVGILQGKSSKKKEDVLQNIVLSRRELEDAEKVYLISRWRHV------P 120
L L+ E+EY IL+GKS K KEDVL+ +V +RRELE AEKVYLISR R++
Sbjct: 197 LRKQLIEQEDEYGDILKGKSLKNKEDVLRELVNTRRELEHAEKVYLISRRRNLRSQSSSE 256
Query: 121 VRPLFSNYSLDYTRLLSHMTTAKTEMMKLECEYIELLQDGATFSEDILSQYPEDILQGAV 180
V + S S + L+ M K EMMKLE ++ LLQ+ + + L Y +D+L AV
Sbjct: 257 VDDVISKQSDE--EFLTDMIRRKKEMMKLEAIFVGLLQNSTKYLVEKLLGYSKDVLGLAV 316
Query: 181 SMERELLVLDHPELIEMITGENHSSMPDSLRSLASVTDTMKHARSYSLWEQRETYPNFVE 240
S+ RELL LD E+ N S+ D + L S R Y W QR+ YPN
Sbjct: 317 SLTRELLALDELSDSELREQMNFGSLDDIIWMLMS-------WRQYYSWGQRQIYPNLWL 376
Query: 241 DLNID--YKINRLENLVSWKKNLRLMEDA---LAPDRREK-----------HPLEFEYA- 300
+++ D Y+I LE +V+ ++ +ED + +R K H + EY
Sbjct: 377 EMHTDKEYQIYTLERMVNLREQYLRVEDEYLWVLEERHTKKHANTYTSFDVHQIHLEYIW 436
Query: 301 -----------------------------------EGIKKMIKEAEQEYYELIQKQHLTI 360
I + +KEAE+ Y + I+++H+T
Sbjct: 437 EKVLQTSLNWRNDWILRLDAYTRELQKSFEHDVLDNEIMEELKEAERAYCDFIERRHITN 496
Query: 361 LNGISTMDATQVYAKYIHKELLARIAYMRRKIKPLEREHNHILRKVMYDRIDKQAI---- 420
L+ I MD +Q+YAKY H +LL I +R K+K LE + + +L KV Y+ ++ +
Sbjct: 497 LDNIPLMDVSQIYAKYNHPKLLVDINSLRLKMKKLESDCDRMLLKVTYEYKQQERVEYEM 556
Query: 421 VSRGITLELWKAVRQYFDRLIYNIQTRNFLQPPVKMEDSESLNNSVEEEEEFMTTLYYSV 480
+++ L LWK +YFDR +N++++ FLQ +S S E F L S+
Sbjct: 557 MAKARKL-LWKIDYRYFDRFKHNMESKKFLQLLPTTSKVQSFQAS--ESFNFFKMLNDSI 616
Query: 481 KSLSIYCVECMSCINNLIQQALGLEDDDDYKLSPKISYEGE-----KCNEVVDKFLYLRS 540
KS +IYC +C +CINNLIQ L LED ++ S K SYE E KC EV+ KFL L +
Sbjct: 617 KSYNIYCSKCQNCINNLIQSDLKLEDVEE--SSAKFSYEDEKDIPKKCYEVIYKFLNLHT 676
Query: 541 IYCPNCIRVVRRVTLQHVPWQTST------TSIPNQGGEAICNEEGSVITM-PPNETKTI 591
+CP C VR VT+QH P Q++T SI E I ++EG VIT+ P+E T
Sbjct: 677 THCPACTNYVREVTIQHAPKQSTTLVQVPKPSIHGTDEEIIDHQEGLVITIFHPDEPSTR 736
BLAST of CmUC09G168380 vs. ExPASy TrEMBL
Match:
A0A6J1GQA4 (uncharacterized protein LOC111456533 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111456533 PE=3 SV=1)
HSP 1 Score: 314.3 bits (804), Expect = 1.1e-81
Identity = 243/665 (36.54%), Positives = 350/665 (52.63%), Query Frame = 0
Query: 1 MEQNLLIYLEKFRPSLDNNNQITFGAVDKSQFKMATRDYYCVLNGGQGHSYKDMSKRLLQ 60
ME++LL L + S +ITF V + + + + DY+ +L GG S DM ++LL+
Sbjct: 137 MERSLLFRLNPEKFSSYGRYEITF-MVGRLRLRDLSTDYFWLLQGGVERSCIDMMEKLLR 196
Query: 61 LSATLMGHENEYVGILQGKSSKKKEDVLQNIVLSRRELEDAEKVYLISRWRHV------P 120
L L+ E+EY IL+GKS K KEDVL+ +V +RRELE AEKVYLISR R++
Sbjct: 197 LRKQLIEQEDEYGDILKGKSLKNKEDVLRELVNTRRELEHAEKVYLISRRRNLRSQSSSE 256
Query: 121 VRPLFSNYSLDYTRLLSHMTTAKTEMMKLECEYIELLQDGATFSEDILSQYPEDILQGAV 180
V + S S + L+ M K EMMKLE ++ LLQ+ + + L Y +D+L AV
Sbjct: 257 VDDVISKQSDE--EFLTDMIRRKKEMMKLEAIFVGLLQNSTKYLVEKLLGYSKDVLGLAV 316
Query: 181 SMERELLVLDHPELIEMITGENHSSMPDSLRSLASVTDTMKHARSYSLWEQRETYPNFVE 240
S+ RELL LD E+ N S+ D + L S R Y W QR+ YPN
Sbjct: 317 SLTRELLALDELSDSELREQMNFGSLDDIIWMLMS-------WRQYYSWGQRQIYPNLWL 376
Query: 241 DLNID--YKINRLENLVSWKKNLRLMEDA---LAPDRREK-----------HPLEFEYA- 300
+++ D Y+I LE +V+ ++ +ED + +R K H + EY
Sbjct: 377 EMHTDKEYQIYTLERMVNLREQYLRVEDEYLWVLEERHTKKHANTYTSFDVHQIHLEYIW 436
Query: 301 -----------------------------------EGIKKMIKEAEQEYYELIQKQHLTI 360
I + +KEAE+ Y + I+++H+T
Sbjct: 437 EKVLQTSLNWRNDWILRLDAYTRELQKSFEHDVLDNEIMEELKEAERAYCDFIERRHITN 496
Query: 361 LNGISTMDATQVYAKYIHKELLARIAYMRRKIKPLEREHNHILRKVMYDRIDKQAI---- 420
L+ I MD +Q+YAKY H +LL I +R K+K LE + + +L KV Y+ ++ +
Sbjct: 497 LDNIPLMDVSQIYAKYNHPKLLVDINSLRLKMKKLESDCDRMLLKVTYEYKQQERVEYEM 556
Query: 421 VSRGITLELWKAVRQYFDRLIYNIQTRNFLQPPVKMEDSESLNNSVEEEEEFMTTLYYSV 480
+++ L LWK +YFDR +N++++ FLQ +S S E F L S+
Sbjct: 557 MAKARKL-LWKIDYRYFDRFKHNMESKKFLQLLPTTSKVQSFQAS--ESFNFFKMLNDSI 616
Query: 481 KSLSIYCVECMSCINNLIQQALGLEDDDDYKLSPKISYEGE-----KCNEVVDKFLYLRS 540
KS +IYC +C +CINNLIQ L LED ++ S K SYE E KC EV+ KFL L +
Sbjct: 617 KSYNIYCSKCQNCINNLIQSDLKLEDVEE--SSAKFSYEDEKDIPKKCYEVIYKFLNLHT 676
Query: 541 IYCPNCIRVVRRVTLQHVPWQTST-TSIPNQG------GEAICNEEGSVITM-PPNETKT 591
+CP C VR VT+QH P Q++T +P E I ++EG VIT+ P+E T
Sbjct: 677 THCPACTNYVREVTIQHAPKQSTTLVQVPKPSIHGTADEEIIDHQEGLVITIFHPDEPST 736
BLAST of CmUC09G168380 vs. ExPASy TrEMBL
Match:
A0A6J1GQF7 (uncharacterized protein LOC111456533 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111456533 PE=3 SV=1)
HSP 1 Score: 294.7 bits (753), Expect = 8.8e-76
Identity = 226/612 (36.93%), Positives = 323/612 (52.78%), Query Frame = 0
Query: 54 MSKRLLQLSATLMGHENEYVGILQGKSSKKKEDVLQNIVLSRRELEDAEKVYLISRWRHV 113
M ++LL+L L+ E+EY IL+GKS K KEDVL+ +V +RRELE AEKVYLISR R++
Sbjct: 1 MMEKLLRLRKQLIEQEDEYGDILKGKSLKNKEDVLRELVNTRRELEHAEKVYLISRRRNL 60
Query: 114 ------PVRPLFSNYSLDYTRLLSHMTTAKTEMMKLECEYIELLQDGATFSEDILSQYPE 173
V + S S + L+ M K EMMKLE ++ LLQ+ + + L Y +
Sbjct: 61 RSQSSSEVDDVISKQSDE--EFLTDMIRRKKEMMKLEAIFVGLLQNSTKYLVEKLLGYSK 120
Query: 174 DILQGAVSMERELLVLDHPELIEMITGENHSSMPDSLRSLASVTDTMKHARSYSLWEQRE 233
D+L AVS+ RELL LD E+ N S+ D + L S R Y W QR+
Sbjct: 121 DVLGLAVSLTRELLALDELSDSELREQMNFGSLDDIIWMLMS-------WRQYYSWGQRQ 180
Query: 234 TYPNFVEDLNID--YKINRLENLVSWKKNLRLMEDA---LAPDRREK-----------HP 293
YPN +++ D Y+I LE +V+ ++ +ED + +R K H
Sbjct: 181 IYPNLWLEMHTDKEYQIYTLERMVNLREQYLRVEDEYLWVLEERHTKKHANTYTSFDVHQ 240
Query: 294 LEFEYA------------------------------------EGIKKMIKEAEQEYYELI 353
+ EY I + +KEAE+ Y + I
Sbjct: 241 IHLEYIWEKVLQTSLNWRNDWILRLDAYTRELQKSFEHDVLDNEIMEELKEAERAYCDFI 300
Query: 354 QKQHLTILNGISTMDATQVYAKYIHKELLARIAYMRRKIKPLEREHNHILRKVMYDRIDK 413
+++H+T L+ I MD +Q+YAKY H +LL I +R K+K LE + + +L KV Y+ +
Sbjct: 301 ERRHITNLDNIPLMDVSQIYAKYNHPKLLVDINSLRLKMKKLESDCDRMLLKVTYEYKQQ 360
Query: 414 QAI----VSRGITLELWKAVRQYFDRLIYNIQTRNFLQPPVKMEDSESLNNSVEEEEEFM 473
+ + +++ L LWK +YFDR +N++++ FLQ +S S E F
Sbjct: 361 ERVEYEMMAKARKL-LWKIDYRYFDRFKHNMESKKFLQLLPTTSKVQSFQAS--ESFNFF 420
Query: 474 TTLYYSVKSLSIYCVECMSCINNLIQQALGLEDDDDYKLSPKISYEGE-----KCNEVVD 533
L S+KS +IYC +C +CINNLIQ L LED ++ S K SYE E KC EV+
Sbjct: 421 KMLNDSIKSYNIYCSKCQNCINNLIQSDLKLEDVEE--SSAKFSYEDEKDIPKKCYEVIY 480
Query: 534 KFLYLRSIYCPNCIRVVRRVTLQHVPWQTST-TSIPNQG------GEAICNEEGSVITM- 591
KFL L + +CP C VR VT+QH P Q++T +P E I ++EG VIT+
Sbjct: 481 KFLNLHTTHCPACTNYVREVTIQHAPKQSTTLVQVPKPSIHGTADEEIIDHQEGLVITIF 540
BLAST of CmUC09G168380 vs. ExPASy TrEMBL
Match:
A0A6J1CWM1 (uncharacterized protein LOC111015471 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111015471 PE=4 SV=1)
HSP 1 Score: 166.0 bits (419), Expect = 4.7e-37
Identity = 126/371 (33.96%), Positives = 186/371 (50.13%), Query Frame = 0
Query: 36 TRDYYCVLNGGQGHSYKDMSKRLLQLSATLMGHENEYVGILQGKSSKKKEDVLQNIVLSR 95
T DY+C+L GG+G SY+D+ ++L L E+EY+GILQGK S + DVLQ IV SR
Sbjct: 25 TLDYFCLLKGGEGLSYQDLVHKILCFRNQLKKWEDEYIGILQGKFSNNENDVLQEIVRSR 84
Query: 96 RELEDAEKVYLISRWRHVPVRPLFSNYSLDYT--RLLSHMTTAKTEMMKLECEYIELLQD 155
RELE+AEKVYLI+R R + + +T LL M + K E+ KLE EY+ LL+
Sbjct: 85 RELENAEKVYLITRMRSFHYKSSIEVDDISFTDENLLKKMLSLKRELTKLEGEYVALLEA 144
Query: 156 GATFSEDILSQYPEDILQGAVSMERELLVLDHPELIEM---ITGENHSSMPDSLRSLASV 215
AT+S LS Y + +LQ AVS RELLVL+ + + I+ + SM S L V
Sbjct: 145 EATYSYHALSTYSKHVLQSAVSRRRELLVLEEECFMPLKGRISVFDVGSM--SFGRLQEV 204
Query: 216 TDTMKHARSYSLWEQRETYPNFVED-LNIDYKINRLENLVSWKKNLRLMED---ALAPDR 275
++ + Y ++ YP+F++D NI YKI LE +V +++ +ED L DR
Sbjct: 205 IRMIQRMKFYPPLKEERMYPSFMQDNNNIHYKIEFLETMVRLREDFLTLEDKYLCLLQDR 264
Query: 276 REKH-----------------------------------------PLEFEYA-------- 335
K+ LE Y+
Sbjct: 265 YIKNCSYTSLLKFNLIYSEYLWEYLLEIILNWRNYLTLLEDSYGCKLEMNYSCMGIDSST 324
Query: 336 --EGIKKM-IKEAEQEYYELIQKQHLTILNGISTMDATQVYAKYIHKELLARIAYMRRKI 346
G++ K+ E+EY++LI++ LT N IS++D Q+Y KYI E+L I + + +I
Sbjct: 325 LKHGLQNTEFKQMEEEYHKLIREWRLTSHNNISSIDVAQIYEKYIKPEVLDNIVFSKNEI 384
BLAST of CmUC09G168380 vs. ExPASy TrEMBL
Match:
A0A6J1CYL4 (uncharacterized protein LOC111015471 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111015471 PE=4 SV=1)
HSP 1 Score: 159.1 bits (401), Expect = 5.8e-35
Identity = 128/383 (33.42%), Postives = 188/383 (49.09%), Query Frame = 0
Query: 36 TRDYYCVLNGGQGHSYKDMSKRLLQLSATLMGHENEYVGILQGKSSKKKEDVLQNIVLSR 95
T DY+C+L GG+G SY+D+ ++L L E+EY+GILQGK S + DVLQ IV SR
Sbjct: 25 TLDYFCLLKGGEGLSYQDLVHKILCFRNQLKKWEDEYIGILQGKFSNNENDVLQEIVRSR 84
Query: 96 RELEDAEKVYLISRWRHVPVRPLFSNYSLDYT--RLLSHMTTAKTEMMKLECEYIELLQD 155
RELE+AEKVYLI+R R + + +T LL M + K E+ KLE EY+ LL+
Sbjct: 85 RELENAEKVYLITRMRSFHYKSSIEVDDISFTDENLLKKMLSLKRELTKLEGEYVALLEA 144
Query: 156 GATFSEDILSQYPEDILQGAVSMERELLVLDHPELIEM---ITGENHSSMPDSLRSLASV 215
AT+S LS Y + +LQ AVS RELLVL+ + + I+ + SM S L V
Sbjct: 145 EATYSYHALSTYSKHVLQSAVSRRRELLVLEEECFMPLKGRISVFDVGSM--SFGRLQEV 204
Query: 216 TDTMKHARSYSLWEQRETYPNFVED-LNIDYKINRLENLVSWKKNLRLMED---ALAPDR 275
++ + Y ++ YP+F++D NI YKI LE +V +++ +ED L DR
Sbjct: 205 IRMIQRMKFYPPLKEERMYPSFMQDNNNIHYKIEFLETMVRLREDFLTLEDKYLCLLQDR 264
Query: 276 REKH-----------------------------------------PLEFEYA-------- 335
K+ LE Y+
Sbjct: 265 YIKNCSYTSLLKFNLIYSEYLWEYLLEIILNWRNYLTLLEDSYGCKLEMNYSCMGIDSST 324
Query: 336 --EGIKKM-IKEAEQEYYELIQKQHLTILNGISTMDATQVYAKYIHKELLARIA------ 351
G++ K+ E+EY++LI++ LT N IS++D Q+Y KYI E+L I
Sbjct: 325 LKHGLQNTEFKQMEEEYHKLIREWRLTSHNNISSIDVAQIYEKYIKPEVLDNIGKFLFFF 384
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038897356.1 | 2.6e-162 | 53.73 | uncharacterized protein LOC120085457 isoform X3 [Benincasa hispida] | [more] |
XP_038897352.1 | 1.3e-161 | 53.66 | uncharacterized protein LOC120085457 isoform X1 [Benincasa hispida] >XP_03889735... | [more] |
XP_038897355.1 | 1.3e-161 | 53.66 | uncharacterized protein LOC120085457 isoform X2 [Benincasa hispida] | [more] |
XP_038897359.1 | 1.3e-161 | 53.66 | uncharacterized protein LOC120085457 isoform X5 [Benincasa hispida] | [more] |
XP_038897357.1 | 1.3e-161 | 53.66 | uncharacterized protein LOC120085457 isoform X4 [Benincasa hispida] | [more] |
Match Name | E-value | Identity (%) | Description
A0A6J1GQF8 | 2.2e-82 | 36.75 | uncharacterized protein LOC111456533 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1GQA4 | 1.1e-81 | 36.54 | uncharacterized protein LOC111456533 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1GQF7 | 8.8e-76 | 36.93 | uncharacterized protein LOC111456533 isoform X3 OS=Cucurbita moschata OX=3662 GN...
A0A6J1CWM1 | 4.7e-37 | 33.96 | uncharacterized protein LOC111015471 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1CYL4 | 5.8e-35 | 33.42 | uncharacterized protein LOC111015471 isoform X2 OS=Momordica charantia OX=3673 G...
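The summary rows are pipe-delimited, so they can be read into records for sorting or filtering. The Python sketch below is a minimal example under that assumption. `parse_hit_row` is a hypothetical helper, not a tool provided by the database; it uses only the first four fields of a row and ignores anything after them. Header rows are not handled and would raise a `ValueError` on the numeric conversion.

```python
def parse_hit_row(row: str) -> dict:
    """Parse one 'name | e-value | identity | description' summary row."""
    fields = [f.strip() for f in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return {
        "name": name,
        "evalue": float(evalue),       # e.g. "4.7e-37"
        "identity": float(identity),   # percent identity
        "description": description,    # may be truncated with "..."
    }


rows = [
    "A0A6J1CWM1 | 4.7e-37 | 33.96 | uncharacterized protein LOC111015471 isoform X1",
    "A0A6J1GQF8 | 2.2e-82 | 36.75 | uncharacterized protein LOC111456533 isoform X2",
]
hits = sorted((parse_hit_row(r) for r in rows), key=lambda h: h["evalue"])
for hit in hits:
    print(hit["name"], hit["evalue"], hit["identity"])
```

Sorting by E-value in ascending order puts the most significant match first.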