Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTTCTTCCCCATGATTTCTGGGAAGCTTTATCCATCCTACTCCACATCATGCATTTTCCCACCAACCCATCTTCATCTTCATCTTCCTTCCCCATTTCCTCTTCTCAAAATCTCTTCCCAACTCAAATTGATCAGCTCTGAATCCGTCTCCCTCTCTTTTCCAACCTCATTTGCTTCTAAACCCAGTGCCAAGTCCATTAGATTTTCGAATTTAGTGGCCAAAGTTTATAGCTATGAGGACCAAAACCCCACTGCTTTGTCGGATTTGGATGACTTGTCCGAGAATGGAGTTGTCTATAAGAAGACATTGGCGATGGTCGAGTGCTCCATGTTCGCTGCACTTAATGGCTTGGTCTACTTCTTGAGTAATTCCCTTGCTCTTGAGGTTTGTAGTTCTCAGTTTCAACGAATTTGATTTCTTTCCTCCGATGCTGCTGTTTCTGTCATTTTTCATTCTTTTTTGGATTACTTTTTTTCTGGTTGTTTTCTTTAAATGAGAGGGATGCCCTAAGTCCATGTGGTGTTTGGTTTACTTTGGAAGAGATTCTTCTCCATTGATGTCAATTTGAGATGCCTTTTTCCCGTTGATTTTAGTTTCACTTAAAATTTATCATAATTGACTAAATGTATGTGAATAGTTAATACTAATGATGAAATTCTTTTTGTTTGTCAATATGGCGATTTTCTTAAACAGGACATCTTGGGGCTTGTATAGGGTCCTCCTAAACTATCCTCAATAACAAGGAAATAAACTGAAGACAATTCACTTATGTATTCTTCCATGGAGTATGCAATACATATACAACTCATAGTCTAACTAATCCTGGTAAACCTATTCTCTAAAGCTAACTGACTAATAACCAAATAACAATAATACAACAGCACTAATACTCCCTCTCTATATGGGAATGAATGTTGAAAAGCTCCTGTCTTCTGTAATAGAGAGAAAGAGCACTAAACTGGGAAGAGTATTGGTGATTATATCAGCAAGCTCTAAATGACAACAAATTGTTAAGAAGTTTGAGAATACCATCAGATAACTTTGACACAGACAAAGAAAGCCAAAATTCTTTTAGCAATTACGCAAGTTTGGTGACAGCTTCATGAGAAATCATTCTGTGATGAAGAGATGGAAAAATTTGTATAATCCTTGGAAAGAGATTACTAAAGCCTCTGATTTTGTGTGGGGAAATTGTTCCTTCAACGTGGGTTTGGGTGATAAGGCCCTTTCTTGGGAAGATAGATGGAGGGGTGATCAACCTCTCAAGTGTGTTTTTCCTAATTTGTACAGAATCTCCTTGAGCAAAGCGTTCATTATTAAGGACCTTCGGAAAGATAGCACCTGGGATCTTCGTTTCCCTAGAAATTTGCTGAATAGAGAGCTTCAAGAGTGGGATACCTTGGCTTCTTTGATAGGCAGCTTTAATCCTTCAGGAAGGGAGGATGTTCTGACTTGGAGGTTGGATAAATCAGGGATGTTTTCTGTCAAGTCAGCCTTGGAAGAGATTCAAACTAAAAGAAGAATTCTGGAGGAAGATCTCAGCAGTCAAATCTGGGAAGGCAACATTCCTCAGAAAGTTAAGTTTTTCTTGTGGTCTTCGGCCCTTAATAGTATTAACACCATTGATAGAATCCAGAGGAGGTTTCCTTGTCTTAATCTTTCCCGGCTGGTGTGTGATGTGTGGCAATAATGAGGAATCTCTCCCTCATTTGTTAATTCACTGTGAGTTTGCTGCCTCTGTTTGGGATCATTTCAGTAGGACTTTTGGGTGGCAAGGTTGTAAACCGAAGTGCATGGAAAAGTGGCTGGCTGATATCCCAGATAGTTGGTTCATGAGAGACAAGGCAAGAGTGCTTTGGAAGAATATGGTTCGAGCTATCTCTTGGCTTCTTTGAAAGGAAAGAAATTTGAGGATCTTCACAGATAAGAGCTCTTCTTTTGCTATCTTTTGTGACCTTATACAGTTTACGGTCTCTTCTTGGAGACACAATCAATTTAGATTGGAAGGTCTTTTTGTAATCCTTCCTTCGGGAGAGGGGATGTCTTATCCCTTGGTCCCTTAGGCTGTTTTGTTGGAGCTCTTTGTTTTTTAATATATTTCTTCGTCTCTTATCCCAAAAAAAAAATAATAAAAATTACGCAATTCACACTGATTCACCAATAACAGCAGCCAAAGCACAATACTATCTTTTCACTTTCCACAAACAAGAGCATAACCTTGGAAAGCCCAAAAACCAGTGCCAGAATGACGTGTATCAAGACAAACATCCTAACCAATATCCACAGTGTTATGTTGAGATGAGGAAGAAGCTTTTAAAGGAATATTCTTGGCCAGGATTTCCCTTAAGATACAAAAGAAAATGATGAACAACAATCGTGTAGAGTCTTGGGTCTAGCAACAAACTAACTTTGACGATGGATGACGAAAGTAATATCAGGATGGGAAACTTGAGAATTTAACAACTGCCTGATAAACTTTTGGAAGTGGCCTGCTGGAAACCCTCCGGCTTTATATGATGATGATCCAAGACGTGTGATGTATATGCAATTTTGAAGTGACCTTTCTCTTAAGAACAACAACTAAAGGTTGTGAAGCAATCCCCCGCCTATGACAATCTCAAAAAGGAAGTCAATCAATCCTTACGCAACTAAGGAGATATGAAGGAAATTTAAATCTACAAAGAAAGAAAGAGATGGCTATCTCAAAAGAAGGTTACAAAAAAGCGCTCAACTCTAGATTGATGGTAGCAATAGAGTAATTACAAAAAAGCCTGTTTTGAGCACTCCAAGGCCATAAACAGTACAAAATCTTAAAACTAAGAAAGGGAAGTCTTCTTCTCTGAGAAAACTCTTGAATTTCTCTCCCTCCAAAGAAGCCATAAGATAGCTCTAGTGAGGTCCGTATGTAAAACTTCGGTTGTCTTTAAAGCACCACCCAATTTTCCACACTTTGGTTTACACTCCTGCAGACTGAACACATTTCCAAAGTGGTTCCAACCCTTCTCAACAAACTCAAAGTGAAGCAATAAGTTAGGGGAGATTCCTCACTTCTGAAGCACATAACAAACACTTTTGGGGAGATGTGGAGATTTGGGAACCTCCTTTGAACCTTATCCAGGGTATTGATGCTGCTAAGGGCTAAAGTCCAAGTAAAAAAATTCACCCTTTTTGAATTAGACCGTCCCAATCTGCTTGCACAAAATTTCTTCCAGTACCTCCTTTTCTTATGAATAACTAACAAGCAGAATTCACAGAAAATCTACCCGAATCTTCCAAAATTTCTTCCAGTACCTCCTTTTCTTATGAATAACTAACAAGCAGACTTCACAGAAAATCTACCCGAATCTTCCATCTTCCAAATCAGCACATCATCTTCTTGAGTAGGAAGGAATTGGTTCAGCAAGAAGGAAAGAATAGTCCATTCTTCCAATTCTCTGTCAAAAACACCCCTTCAACATCAAATTCCACGCAGACCCATTTCACAAATCTTGAACTTTCTTATCTTGAGTCATAGATATTTTGAACATATTAGGAAACACCCGCTTAAGGGGGTACTCGGTCAGCCAGCTATCACTCCAAAAGAGGACCTTGTTCCCTCTCCCCACTTTGTACTTGCTATTTTCCCAAAAAAAAACAGCCGCCTTAGCAATGTCCTTCCATGGACTAAACTTACCTTCCCACTTTTTGTGAGCTTGGCGATCCCTATGGAAAGACTCTCCATACTTACATCTAACTGCTGTTCTCCGTAAAGCTGAATTTTCCTTGTCGAAAAGCGCCAAAGCTGCTTGGTTAGGAGGGCCAAGTTTTTAGTTCTTGTGGTCACACGATCCCAAATGATTAAATGACTTCCTTTAACCTCGTCAGCCCCTTTCCATAGAAAATTTCTGAAGCTCCACCTTCTTGCACACCCCCACTTGAGCTTTGAATATAGATAGGTAATGAGCTGGAACACTTGAAAGGGTGGCCTTAATTAAGGTAGCTCTACCTCCTTGAATAGGCTGAAGGAAAATGTTCACTTCCTTCATAGCATTTGGGCAAGGACCCATGTGATAATATCCTGATTATCATCAACATAAATCTGAAGAGCCACAAAAGAACCATCCCCCTTAGTAAAGAGAGAACAATCAACCTTAGATTGAATAAAACCATGAGATAACAATACTGTAGAAAACTTTGTACACTTCTTGTTTGATACGGTAAATTGAAATTATGACCCAATGAGAAGGAAAAAAAAAAAAAAAAAAAAAAAAAGGAGAGACTGTGAATATACTTGTTTAATTTTCAAATGAATAATTACCCCTTGCTACAAACATGCTCACAAACTTTATAACCATTAGGAAGAGCCATGTGAACTTCCGAATTAATCAAAGTCACCATTTAAACAAGTATTGTCAACATCTATTTTTGCATGTATTCTTCCTTCCATGAAGTATGTATGAGACAAATACAACTCGTAGTCTAACCAATTGTTAAAACTAATGACAAAATAACAATAACACGACAGCATGTGTTGCAAAGAATAGTTTTTAAAAATAGGATTTTTTTTCTTTTTCAAAAGATAGATAGTGTTGTTATTCATCTTTCTACAATGAATTCTATAATTTCAAAAAAAGAATAAGAAAAGGAAAAAAAATTCTAAAAAATGGCTTGTTTTATAGCTTTCTGAACATGAATGCTAATTTTATGAAAGTTTTCATACTCATTACATACCTGTTCCTTCCTTAAAAAAAACATACTTACCTGTTCCTTGTAATTTTTCACTCGTTTAAAGAAATTTAAAGAAATGTACATTTTTTATCAACTTAGATGTCTGCAAATTAAGCTACCGCTTAGATTTCTTCATTTTTTTTAAAAAAAATAGTTTAAGTTTCTTGAATTTTAACTGATTGCTGTTTGTTATTTATTTGATATCGGACAGAATTACTTCGGCTGTTTCTTCTGTCTACCAATAGTTATCTCTTCAATGAGATGGGGCGTAGCAGCTGGGAGAAAAACCATGGTCTGTAGTTTTCCAGGACTCGATTGTTGAACTATTGATTTTATGTTGTCTTGTTATTCGTACATTCTCTCATTGTAGTAATGTAAATATCTAAGCTGTCACATCTGATCTTCTTTCAAGACAGGTCGCAACATTCTTGCTGCTCCTGGTTTTGTCTGGTCCAGTGAAAGCTTTAACCTATATGGTGAGATGCTTTAAGATTTCATCTTCTCAATCTTAATTGCATCTGGATTTTTCTTGTTTCTGAAATATAAAGATTTTTATTTTTAGCCTATCAACCCGGGAAATTCTCTCTTATATTAGAGCCAAAGGAGAATCAGCTCAATGGCCTTCTTAAACTAACTTCAAAAAGCTAAAGGCAGCCTTGAGGGGTTTTCCCCCCTGATAATTAAATAAGCGTTAATAAAAAGGATAAAATACATAATCAGGTATTATCAAACATAATTATGTAAAAGTACAAAACAGAAAAATCACTTATATCAATTAAAACTTTTAAAATTAACGATCGTAAAGGCTTTTGAGTTTGTTTCTTCAATATTTTCTAGTCCTTGGTGAAAAAAAAATATTAACAGGCTGAAGTACTACAACTTATGGTGCAGTACCATTCCATGAATTTCCATACTTTTAAGAGTAAACGAAAAATTAATGTTATTCTTCCAACCATTACATCATAAGGCATAGACTCCAAACTGAAATGTGTTAATGCATTCTCATAGAGGCACATGCATCTTATGATCCAAAGAAAGCAAAAGCGTCTAGGCTAAAAATTTGAGCCTTTCTCCAAGGAATGAGCCTCACAGGGCTTTTATAACCTTGTTCTCAGAACATAGTTATATGTAATGAGAACTTCTTTCCATGCCCTCTGGTACTTGCTTTTATTTAAGAAATAGTAGCATATTTAGGTTCTGCTAAACCTATGGATATAAGCACAACAATGGTTTTGCTCCATTAGCTGCCATCTAGGAAGCATAGACACTCTATTTTATGCCGAGTGTCCATGTCAGACACGTCTCGGACACGTCTAGAACATGCCTTGGACACGTTTGGGACACACTTGAAAATTATAATTCATTCTGAAAAAGATTAAAAGTCAAGCCTATTAATCCCAACATAAAAAGTTTAAAAAAAAAACCATAAACTTTTAAATAGAAAATCCTAAAAGCTAAAAACCATCTCAAGTGATCGTTGGTTTGCCAACTGCACTTATCACTAACCACTTCATACTGTAAGGTTTGATTTTTTTTTTCTCTCTTTCTAAACTTCATAAAAGACAAAAGTTATAGAAGATCTCTAAATTCTAATTTTTCCAACCATCATATATATTTATCATTTATAAACCATTAGAAGATATCTTTAATGTCTAACCATAGCCATAGCCACTAAAATCCTTTGAGGTTTCCTAATGTTGAAGATGAATCAATATATACATATATGCCTTTAAAAAAAATATTCCCAATGTGTCTGTGTCCTACTTTTTTAGAAATTGATGTATCGCCGTATCCGTATTGTGTTGTGTTCGTATTCCGTGTCTGTGTTCGTGCCTCTTAGATTGCCATCATAACTTCAGTCTTGAGCATTTTGTTCATTATTTTTTTTGAGTTATTAATATTATGGAGTAATGCGAATTATGATTTGAATGGTAAAATTGTTGATCTCTGACTATTCTTTGCAAATATAAGTTTAATATCAAATGTATTCCTTCTCAACCGTACAGACCTGTTTTACATTGCTGTATTCTGAATGTCCATGAAGATGCTTAATGTCAACAGATCATTAGAGAAAATCTGAGACATCATAGTTGTTGTGGCCTCAGCTTGTAGTGCCATTTCTAGATCTTTATGTAATTGCTGTCTATACCATTTGTGCCAATTTGAGAGTATTTTTTTAACTGTTTTAGTTTGATGTTTCTCCTTTTGGTGTATCTCTAATATATCTTTTGATGTTTTTTAATCCTTGTCCTATTGAAAGGGGAACAGTATTCAATTGTTTACCACCTCCTGCAGAACGCCTGACATTGTTTTATGCTTCCTTGTGCAGCTTAGGCATGGTTTAGTGGGGCTGACAATGGGCTCCTTGTGGAGGTAGGTGATTCACTATGCATTCAACTATCCATAAATATATAACTAACCTTGATTTAGGCATATGGCATTAAGGATTCGTAAAGTTTGTTAAATCTCTATATGCTTGTTGGTCAAGTTTCTCAGGGTATTGTAATGAAATTTGGTGATATGCATTAACCAAAGGCATTAATGGAGTTACTATTCTAATCTCTATATAGCTTTACTTGAATTTTGTGCTTTGAACTTATCCTAATTAGGTAAGAACAGAATAATAATAAAAACAATAAGAAAAGTAAAGAAGAATTCTGTGCTTTGGACCACTTCTACTTGCTTTTGAAAAAATTTACATTGGATCTCATTCATCGACCTACGGAATATTTGGGGAGGGCATATTCCTATTGGAGGATGGTCGTTATAATGGCTGCTTTCTGGTTTTGTATCTTCTCAGGCTTGGAGCAAATTGGAGTACTTCAATCTTCCTGTGCACAATCGTATTATTTCTCTTCTCCAACACCCACGTGAACAAGTAATTTAATCCCAGTTCCTTTTCTTTTAATAAATTCTACTAAGAACATAACTGTTTGTTTAGGTTCGGGCGCTCGGGGCAGTGGGGTATGTCTTAATATCTTCATTCTTGATAAGAGAGAACATACTAGCTCTGGTAACAAGTTATCCTGAATCTCTGGTGCTATGCTTCAAGGAACTCACCTCTGAAATCTCAAAACACCTTCCATTTTTTTGCAGATCACTATAAATATTCATGCTTCCCTCACCCTTATCTTCACTGCCTGGGGTGTAAACTTGATTCCTTCAATGAATGCAATATATGTTATCTTTGGGACACTGGTATGTTTTATATTCCCCTCCATTAGAAATTCTTTTAATGAAATTGGTGAAAACAGACTGTACTTGTAGGACATGTATTATAGAGATTTAATTAATATTTAATTTAATTGGTTAAGCACAAGCATCATTTATTTGAACGTTCCTCCCCCTCTTTGTGTTCTGAAAGGTATTGCTGAACTCTGGATGCTTCATGTTTTTGCTTCACCTTTTGTACTCCGTATTCCTTACAAGACTTGGTCTGAAGACTTCATTGACATTGCCAAGGTGGCTGGAGAAGGCGATGTAAACGTACGATGGGTATTAACTCGTTCGTGATGATAAATACACCATTTACTCTTAACTTTTATGGAGGAAAGAGGAAGATATTGGTTAGGCTTAATGCAAAGTTTCATTTGCACCCGATATTTTCGAGCACATTGGCTAGCGAAACAGATTGTATTTCATGTGTTATCATGTTCTAAGTTCAATATTCTAAGCTAGTCTGAATGTGAAAAGTGCTTGGCCATTTGTTCCATTGTATAAGAAAAGGTATGTATGTATGGAGGATGCCTTTGCACAACCTGATAAAATGGAAGGAACTATAGTGGAGGGAGGACTTCAGATGTCCACTAGTGGTCATCTTGAGCAAGCTTAA
mRNA sequence
ATGATTTCTGGGAAGCTTTATCCATCCTACTCCACATCATGCATTTTCCCACCAACCCATCTTCATCTTCATCTTCCTTCCCCATTTCCTCTTCTCAAAATCTCTTCCCAACTCAAATTGATCAGCTCTGAATCCGTCTCCCTCTCTTTTCCAACCTCATTTGCTTCTAAACCCAGTGCCAAGTCCATTAGATTTTCGAATTTAGTGGCCAAAGTTTATAGCTATGAGGACCAAAACCCCACTGCTTTGTCGGATTTGGATGACTTGTCCGAGAATGGAGTTGTCTATAAGAAGACATTGGCGATGGTCGAGTGCTCCATGTTCGCTGCACTTAATGGCTTGGTCTACTTCTTGAGTAATTCCCTTGCTCTTGAGAGATGGAAAAATTTGTATAATCCTTGGAAAGAGATTACTAAAGCCTCTGATTTTGTGTGGGGAAATTGTTCCTTCAACGTGGGTTTGGGTGATAAGGCCCTTTCTTGGGAAGATAGATGGAGGGGTGATCAACCTCTCAAGTGTGTTTTTCCTAATTTGTACAGAATCTCCTTGAGCAAAGCGTTCATTATTAAGGACCTTCGGAAAGATAGCACCTGGGATCTTCGTTTCCCTAGAAATTTGCTGAATAGAGAGCTTCAAGAGTGGGATACCTTGGCTTCTTTGATAGGCAGCTTTAATCCTTCAGGAAGGGAGGATGTTCTGACTTGGAGGTTGGATAAATCAGGGATGTTTTCTGTCAAGTCAGCCTTGGAAGAGATTCAAACTAAAAGAAGAATTCTGGAGGAAGATCTCAGCAGTCAAATCTGGGAAGGCAACATTCCTCAGAAAGTTAAGTTTTTCTTGTGGTCTTCGGCCCTTAATAGTATTAACACCATTGATAGAATCCAGAGGAGGTTTCCTTGTCTTAATCTTTCCCGGCTGCTGAATTTTCCTTGTCGAAAAGCGCCAAAGCTGCTTGAATTACTTCGGCTGTTTCTTCTGTCTACCAATAGTTATCTCTTCAATGAGATGGGGCGTAGCAGCTGGGAGAAAAACCATGGTCGCAACATTCTTGCTGCTCCTGGTTTTGTCTGGTCCAGTGAAAGCTTTAACCTATATGAAATTGATGTATCGCCGTATCCGTATTGTGTTGTGTTCCTTAGGCATGGTTTAGTGGGGCTGACAATGGGCTCCTTGTGGAGGCTTGGAGCAAATTGGAGTACTTCAATCTTCCTGTGCACAATCGTTCGGGCGCTCGGGGCAGTGGGGTATGTCTTAATATCTTCATTCTTGATAAGAGAGAACATACTAGCTCTGATCACTATAAATATTCATGCTTCCCTCACCCTTATCTTCACTGCCTGGGGTGTAAACTTGATTCCTTCAATGAATGCAATATATGTTATCTTTGGGACACTGGTATGTATTGCTGAACTCTGGATGCTTCATGTTTTTGCTTCACCTTTTGTACTCCGTATTCCTTACAAGACTTGGTCTGAAGACTTCATTGACATTGCCAAGGTGGCTGGAGAAGGCGATGTAAACGTACGATGGAAAAGGTATGTATGTATGGAGGATGCCTTTGCACAACCTGATAAAATGGAAGGAACTATAGTGGAGGGAGGACTTCAGATGTCCACTAGTGGTCATCTTGAGCAAGCTTAA
Coding sequence (CDS)
ATGATTTCTGGGAAGCTTTATCCATCCTACTCCACATCATGCATTTTCCCACCAACCCATCTTCATCTTCATCTTCCTTCCCCATTTCCTCTTCTCAAAATCTCTTCCCAACTCAAATTGATCAGCTCTGAATCCGTCTCCCTCTCTTTTCCAACCTCATTTGCTTCTAAACCCAGTGCCAAGTCCATTAGATTTTCGAATTTAGTGGCCAAAGTTTATAGCTATGAGGACCAAAACCCCACTGCTTTGTCGGATTTGGATGACTTGTCCGAGAATGGAGTTGTCTATAAGAAGACATTGGCGATGGTCGAGTGCTCCATGTTCGCTGCACTTAATGGCTTGGTCTACTTCTTGAGTAATTCCCTTGCTCTTGAGAGATGGAAAAATTTGTATAATCCTTGGAAAGAGATTACTAAAGCCTCTGATTTTGTGTGGGGAAATTGTTCCTTCAACGTGGGTTTGGGTGATAAGGCCCTTTCTTGGGAAGATAGATGGAGGGGTGATCAACCTCTCAAGTGTGTTTTTCCTAATTTGTACAGAATCTCCTTGAGCAAAGCGTTCATTATTAAGGACCTTCGGAAAGATAGCACCTGGGATCTTCGTTTCCCTAGAAATTTGCTGAATAGAGAGCTTCAAGAGTGGGATACCTTGGCTTCTTTGATAGGCAGCTTTAATCCTTCAGGAAGGGAGGATGTTCTGACTTGGAGGTTGGATAAATCAGGGATGTTTTCTGTCAAGTCAGCCTTGGAAGAGATTCAAACTAAAAGAAGAATTCTGGAGGAAGATCTCAGCAGTCAAATCTGGGAAGGCAACATTCCTCAGAAAGTTAAGTTTTTCTTGTGGTCTTCGGCCCTTAATAGTATTAACACCATTGATAGAATCCAGAGGAGGTTTCCTTGTCTTAATCTTTCCCGGCTGCTGAATTTTCCTTGTCGAAAAGCGCCAAAGCTGCTTGAATTACTTCGGCTGTTTCTTCTGTCTACCAATAGTTATCTCTTCAATGAGATGGGGCGTAGCAGCTGGGAGAAAAACCATGGTCGCAACATTCTTGCTGCTCCTGGTTTTGTCTGGTCCAGTGAAAGCTTTAACCTATATGAAATTGATGTATCGCCGTATCCGTATTGTGTTGTGTTCCTTAGGCATGGTTTAGTGGGGCTGACAATGGGCTCCTTGTGGAGGCTTGGAGCAAATTGGAGTACTTCAATCTTCCTGTGCACAATCGTTCGGGCGCTCGGGGCAGTGGGGTATGTCTTAATATCTTCATTCTTGATAAGAGAGAACATACTAGCTCTGATCACTATAAATATTCATGCTTCCCTCACCCTTATCTTCACTGCCTGGGGTGTAAACTTGATTCCTTCAATGAATGCAATATATGTTATCTTTGGGACACTGGTATGTATTGCTGAACTCTGGATGCTTCATGTTTTTGCTTCACCTTTTGTACTCCGTATTCCTTACAAGACTTGGTCTGAAGACTTCATTGACATTGCCAAGGTGGCTGGAGAAGGCGATGTAAACGTACGATGGAAAAGGTATGTATGTATGGAGGATGCCTTTGCACAACCTGATAAAATGGAAGGAACTATAGTGGAGGGAGGACTTCAGATGTCCACTAGTGGTCATCTTGAGCAAGCTTAA
Protein sequence
MISGKLYPSYSTSCIFPPTHLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSFASKPSAKSIRFSNLVAKVYSYEDQNPTALSDLDDLSENGVVYKKTLAMVECSMFAALNGLVYFLSNSLALERWKNLYNPWKEITKASDFVWGNCSFNVGLGDKALSWEDRWRGDQPLKCVFPNLYRISLSKAFIIKDLRKDSTWDLRFPRNLLNRELQEWDTLASLIGSFNPSGREDVLTWRLDKSGMFSVKSALEEIQTKRRILEEDLSSQIWEGNIPQKVKFFLWSSALNSINTIDRIQRRFPCLNLSRLLNFPCRKAPKLLELLRLFLLSTNSYLFNEMGRSSWEKNHGRNILAAPGFVWSSESFNLYEIDVSPYPYCVVFLRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYVIFGTLVCIAELWMLHVFASPFVLRIPYKTWSEDFIDIAKVAGEGDVNVRWKRYVCMEDAFAQPDKMEGTIVEGGLQMSTSGHLEQA
Homology
BLAST of Spg020865 vs. NCBI nr
Match:
XP_022929150.1 (uncharacterized protein LOC111435817 [Cucurbita moschata])
HSP 1 Score: 273.9 bits (699), Expect = 3.1e-69
Identity = 211/498 (42.37%), Postives = 230/498 (46.18%), Query Frame = 0
Query: 1 MISGKLYPSYSTSCIFPP------THLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSF 60
MISG LYPS STSCIFPP T H+HL P PLLKISS+L+LIS ESVSLSFPT
Sbjct: 1 MISGNLYPSVSTSCIFPPKPTSTSTTAHVHL--PLPLLKISSKLRLISFESVSLSFPTFI 60
Query: 61 ASKPSAKSIRFSNLVAKVYSYEDQNPTALSDLDDLSENGVVYKKTLAMVECSMFAALNGL 120
ASK S KS RFSN VAKVYS+E QNPT+LSDL+DLSENGVVYKKTLAMVECSMFAALNGL
Sbjct: 61 ASKSSVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGL 120
Query: 121 VYFLSNSLALERWKNLYNPWKEITKASDFVWGNCSFNVGLGDKALSWEDRWRGDQPLKCV 180
VYFLSNSLALE + C
Sbjct: 121 VYFLSNSLALENY-------------------------------------------FGCF 180
Query: 181 FPNLYRISLSKAFIIKDLRKDSTWDLRFPRNLLNRELQEWDTLASLIGSFNPSGREDVLT 240
F
Sbjct: 181 F----------------------------------------------------------- 240
Query: 241 WRLDKSGMFSVKSALEEIQTKRRILEEDLSSQIWEGNIPQKVKFFLWSSALNSINTIDRI 300
Sbjct: 241 ------------------------------------------------------------ 290
Query: 301 QRRFPCLNLSRLLNFPCRKAPKLLELLRLFLLSTNSYLFNEMGRSSWEKNHGRNILAAPG 360
CL P ++ +R W GR + A
Sbjct: 301 -----CL-------------PIVISSMR------------------WGIAAGRKTMVA-- 290
Query: 361 FVWSSESFNLYEIDVSPYPYCVVFLRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGAV 420
+F L + P LRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGAV
Sbjct: 361 ------TFLLLLVLSGPVKALTFLLRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGAV 290
Query: 421 GYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYVIFGTLV---CIAEL 480
GYVLISSFLIRENILALITINIHASLTLIFTA GVNLIPSMNAIY IFGTLV C +
Sbjct: 421 GYVLISSFLIRENILALITINIHASLTLIFTASGVNLIPSMNAIYAIFGTLVMLNCGCFM 290
Query: 481 WMLHVFASPFVLRIPYKT 490
++LH+ S F+ R+ KT
Sbjct: 481 FLLHLLYSIFLTRLGLKT 290
BLAST of Spg020865 vs. NCBI nr
Match:
XP_016900134.1 (PREDICTED: uncharacterized protein LOC103488678 isoform X3 [Cucumis melo])
HSP 1 Score: 273.9 bits (699), Expect = 3.1e-69
Identity = 207/523 (39.58%), Postives = 233/523 (44.55%), Query Frame = 0
Query: 1 MISGKLYPSYSTSCIFPPTHLHLHLPSPFP-------LLKISSQLKLISSESVSLSFPTS 60
MISGKLY S S+SCIFPPT P+P P LKISS L+LIS +SVSLS P+S
Sbjct: 1 MISGKLYSSSSSSCIFPPT----PTPTPTPTGNLHLSFLKISSTLRLISFQSVSLSVPSS 60
Query: 61 FASKPSAKSIRFSNLVAKVYSYEDQNPTALSDLDDLSENGVVYKKTLAMVECSMFAALNG 120
FASK SAKS RFSN + +VYSYE QN LSDLDDLSENGVVYKKTLAMVECSMFAALNG
Sbjct: 61 FASKSSAKSTRFSNSLVEVYSYEGQNSITLSDLDDLSENGVVYKKTLAMVECSMFAALNG 120
Query: 121 LVYFLSNSLALERWKNLYNPWKEITKASDFVWGNCSFNVGLGDKALSWEDRWRGDQPLKC 180
LVYFLSNSLALE + C F + + ++ W G + + C
Sbjct: 121 LVYFLSNSLALEN------------------YFGCFFCLPIVISSMRWGIS-AGRKTMVC 180
Query: 181 VFPNLYRISLSKAFIIKDLRKDSTWDLRFPRNLLNRELQEWDTLASLIGSFNPSGREDVL 240
FP L +++
Sbjct: 181 SFPGLKQVA--------------------------------------------------- 240
Query: 241 TWRLDKSGMFSVKSALEEIQTKRRILEEDLSSQIWEGNIPQKVKFFLWSSALNSINTIDR 300
Sbjct: 241 ------------------------------------------------------------ 299
Query: 301 IQRRFPCLNLSRLLNFPCRKAPKLLELLRLFLLSTNSYLFNEMGRSSWEKNHGRNILAAP 360
L L +LS
Sbjct: 301 -------------------------TFLLLLVLS-------------------------- 299
Query: 361 GFVWSSESFNLYEIDVSPYPYCVVFLRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGA 420
P LRHGLVG TMGSLWRLGANWSTSIFLCTIVRA GA
Sbjct: 361 ----------------GPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCTIVRAFGA 299
Query: 421 VGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYVIFGTLVCIAELWM 480
VGYVL+SSFLIRENIL+LITINIHASLTLIFTAWGVNLIPSMNAIY IFGTL
Sbjct: 421 VGYVLVSSFLIRENILSLITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTL-------- 299
Query: 481 LHVFASPFVLRIPYKTWSEDFIDIAKVAGEGDVNV--RWKRYV 515
TW EDFIDI VAG+G+VN+ W+ ++
Sbjct: 481 ---------------TWFEDFIDITTVAGKGNVNILSGWRSFL 299
BLAST of Spg020865 vs. NCBI nr
Match:
XP_016900141.1 (PREDICTED: uncharacterized protein LOC103488678 isoform X7 [Cucumis melo])
HSP 1 Score: 273.1 bits (697), Expect = 5.2e-69
Identity = 206/515 (40.00%), Postives = 229/515 (44.47%), Query Frame = 0
Query: 1 MISGKLYPSYSTSCIFPPTHLHLHLPSPFP-------LLKISSQLKLISSESVSLSFPTS 60
MISGKLY S S+SCIFPPT P+P P LKISS L+LIS +SVSLS P+S
Sbjct: 1 MISGKLYSSSSSSCIFPPT----PTPTPTPTGNLHLSFLKISSTLRLISFQSVSLSVPSS 60
Query: 61 FASKPSAKSIRFSNLVAKVYSYEDQNPTALSDLDDLSENGVVYKKTLAMVECSMFAALNG 120
FASK SAKS RFSN + +VYSYE QN LSDLDDLSENGVVYKKTLAMVECSMFAALNG
Sbjct: 61 FASKSSAKSTRFSNSLVEVYSYEGQNSITLSDLDDLSENGVVYKKTLAMVECSMFAALNG 120
Query: 121 LVYFLSNSLALERWKNLYNPWKEITKASDFVWGNCSFNVGLGDKALSWEDRWRGDQPLKC 180
LVYFLSNSLALE + C F + + ++ W G + + C
Sbjct: 121 LVYFLSNSLALEN------------------YFGCFFCLPIVISSMRWGIS-AGRKTMVC 180
Query: 181 VFPNLYRISLSKAFIIKDLRKDSTWDLRFPRNLLNRELQEWDTLASLIGSFNPSGREDVL 240
FP L +++
Sbjct: 181 SFPGLKQVA--------------------------------------------------- 240
Query: 241 TWRLDKSGMFSVKSALEEIQTKRRILEEDLSSQIWEGNIPQKVKFFLWSSALNSINTIDR 300
Sbjct: 241 ------------------------------------------------------------ 291
Query: 301 IQRRFPCLNLSRLLNFPCRKAPKLLELLRLFLLSTNSYLFNEMGRSSWEKNHGRNILAAP 360
L L +LS
Sbjct: 301 -------------------------TFLLLLVLS-------------------------- 291
Query: 361 GFVWSSESFNLYEIDVSPYPYCVVFLRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGA 420
P LRHGLVG TMGSLWRLGANWSTSIFLCTIVRA GA
Sbjct: 361 ----------------GPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCTIVRAFGA 291
Query: 421 VGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYVIFGTLVCIAELWM 480
VGYVL+SSFLIRENIL+LITINIHASLTLIFTAWGVNLIPSMNAIY IFGTL
Sbjct: 421 VGYVLVSSFLIRENILSLITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTL-------- 291
Query: 481 LHVFASPFVLRIPYKTWSEDFIDIAKVAGEGDVNV 509
TW EDFIDI VAG+G+VN+
Sbjct: 481 ---------------TWFEDFIDITTVAGKGNVNI 291
BLAST of Spg020865 vs. NCBI nr
Match:
XP_016900135.1 (PREDICTED: uncharacterized protein LOC103488678 isoform X4 [Cucumis melo])
HSP 1 Score: 273.1 bits (697), Expect = 5.2e-69
Identity = 206/515 (40.00%), Postives = 229/515 (44.47%), Query Frame = 0
Query: 1 MISGKLYPSYSTSCIFPPTHLHLHLPSPFP-------LLKISSQLKLISSESVSLSFPTS 60
MISGKLY S S+SCIFPPT P+P P LKISS L+LIS +SVSLS P+S
Sbjct: 1 MISGKLYSSSSSSCIFPPT----PTPTPTPTGNLHLSFLKISSTLRLISFQSVSLSVPSS 60
Query: 61 FASKPSAKSIRFSNLVAKVYSYEDQNPTALSDLDDLSENGVVYKKTLAMVECSMFAALNG 120
FASK SAKS RFSN + +VYSYE QN LSDLDDLSENGVVYKKTLAMVECSMFAALNG
Sbjct: 61 FASKSSAKSTRFSNSLVEVYSYEGQNSITLSDLDDLSENGVVYKKTLAMVECSMFAALNG 120
Query: 121 LVYFLSNSLALERWKNLYNPWKEITKASDFVWGNCSFNVGLGDKALSWEDRWRGDQPLKC 180
LVYFLSNSLALE + C F + + ++ W G + + C
Sbjct: 121 LVYFLSNSLALEN------------------YFGCFFCLPIVISSMRWGIS-AGRKTMVC 180
Query: 181 VFPNLYRISLSKAFIIKDLRKDSTWDLRFPRNLLNRELQEWDTLASLIGSFNPSGREDVL 240
FP L +++
Sbjct: 181 SFPGLKQVA--------------------------------------------------- 240
Query: 241 TWRLDKSGMFSVKSALEEIQTKRRILEEDLSSQIWEGNIPQKVKFFLWSSALNSINTIDR 300
Sbjct: 241 ------------------------------------------------------------ 291
Query: 301 IQRRFPCLNLSRLLNFPCRKAPKLLELLRLFLLSTNSYLFNEMGRSSWEKNHGRNILAAP 360
L L +LS
Sbjct: 301 -------------------------TFLLLLVLS-------------------------- 291
Query: 361 GFVWSSESFNLYEIDVSPYPYCVVFLRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGA 420
P LRHGLVG TMGSLWRLGANWSTSIFLCTIVRA GA
Sbjct: 361 ----------------GPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCTIVRAFGA 291
Query: 421 VGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYVIFGTLVCIAELWM 480
VGYVL+SSFLIRENIL+LITINIHASLTLIFTAWGVNLIPSMNAIY IFGTL
Sbjct: 421 VGYVLVSSFLIRENILSLITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTL-------- 291
Query: 481 LHVFASPFVLRIPYKTWSEDFIDIAKVAGEGDVNV 509
TW EDFIDI VAG+G+VN+
Sbjct: 481 ---------------TWFEDFIDITTVAGKGNVNI 291
BLAST of Spg020865 vs. NCBI nr
Match:
XP_016900133.1 (PREDICTED: uncharacterized protein LOC103488678 isoform X1 [Cucumis melo])
HSP 1 Score: 273.1 bits (697), Expect = 5.2e-69
Identity = 206/515 (40.00%), Postives = 229/515 (44.47%), Query Frame = 0
Query: 1 MISGKLYPSYSTSCIFPPTHLHLHLPSPFP-------LLKISSQLKLISSESVSLSFPTS 60
MISGKLY S S+SCIFPPT P+P P LKISS L+LIS +SVSLS P+S
Sbjct: 1 MISGKLYSSSSSSCIFPPT----PTPTPTPTGNLHLSFLKISSTLRLISFQSVSLSVPSS 60
Query: 61 FASKPSAKSIRFSNLVAKVYSYEDQNPTALSDLDDLSENGVVYKKTLAMVECSMFAALNG 120
FASK SAKS RFSN + +VYSYE QN LSDLDDLSENGVVYKKTLAMVECSMFAALNG
Sbjct: 61 FASKSSAKSTRFSNSLVEVYSYEGQNSITLSDLDDLSENGVVYKKTLAMVECSMFAALNG 120
Query: 121 LVYFLSNSLALERWKNLYNPWKEITKASDFVWGNCSFNVGLGDKALSWEDRWRGDQPLKC 180
LVYFLSNSLALE + C F + + ++ W G + + C
Sbjct: 121 LVYFLSNSLALEN------------------YFGCFFCLPIVISSMRWGIS-AGRKTMVC 180
Query: 181 VFPNLYRISLSKAFIIKDLRKDSTWDLRFPRNLLNRELQEWDTLASLIGSFNPSGREDVL 240
FP L +++
Sbjct: 181 SFPGLKQVA--------------------------------------------------- 240
Query: 241 TWRLDKSGMFSVKSALEEIQTKRRILEEDLSSQIWEGNIPQKVKFFLWSSALNSINTIDR 300
Sbjct: 241 ------------------------------------------------------------ 291
Query: 301 IQRRFPCLNLSRLLNFPCRKAPKLLELLRLFLLSTNSYLFNEMGRSSWEKNHGRNILAAP 360
L L +LS
Sbjct: 301 -------------------------TFLLLLVLS-------------------------- 291
Query: 361 GFVWSSESFNLYEIDVSPYPYCVVFLRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGA 420
P LRHGLVG TMGSLWRLGANWSTSIFLCTIVRA GA
Sbjct: 361 ----------------GPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCTIVRAFGA 291
Query: 421 VGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYVIFGTLVCIAELWM 480
VGYVL+SSFLIRENIL+LITINIHASLTLIFTAWGVNLIPSMNAIY IFGTL
Sbjct: 421 VGYVLVSSFLIRENILSLITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTL-------- 291
Query: 481 LHVFASPFVLRIPYKTWSEDFIDIAKVAGEGDVNV 509
TW EDFIDI VAG+G+VN+
Sbjct: 481 ---------------TWFEDFIDITTVAGKGNVNI 291
BLAST of Spg020865 vs. ExPASy Swiss-Prot
Match:
P0C2F6 (Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1g65750 PE=3 SV=1)
HSP 1 Score: 53.9 bits (128), Expect = 6.6e-06
Identity = 42/151 (27.81%), Postives = 67/151 (44.37%), Query Frame = 0
Query: 153 GLGDKALSWEDRWRGDQPLKCVFPNLYRISLSKAFIIKDL-RKDSTWDLR--FPRNLLNR 212
G G + W DRW +PL N R + + KDL WD P N
Sbjct: 178 GDGQQIRFWTDRWVSGKPL-LELDNGERPTDCDTVVAKDLWIPGRGWDFAKIDPYTTNNT 237
Query: 213 ELQEWDTLASLIGSFNPSGREDVLTWRLDKSGMFSVKSALEEIQTKRRILEEDLSS---Q 272
L+ + L+ +G D L+W+ + G FSV+SA E + T + +++S
Sbjct: 238 RLELRAVVLDLV-----TGARDRLSWKFSQDGQFSVRSAYEML-TVDEVPRPNMASFFNC 297
Query: 273 IWEGNIPQKVKFFLWSSALNSINTIDRIQRR 298
+W+ +P++VK FLW ++ T + RR
Sbjct: 298 LWKVRVPERVKTFLWLVGNQAVMTEEERHRR 321
BLAST of Spg020865 vs. ExPASy TrEMBL
Match:
A0A6J1ETG3 (uncharacterized protein LOC111435817 OS=Cucurbita moschata OX=3662 GN=LOC111435817 PE=4 SV=1)
HSP 1 Score: 273.9 bits (699), Expect = 1.5e-69
Identity = 211/498 (42.37%), Postives = 230/498 (46.18%), Query Frame = 0
Query: 1 MISGKLYPSYSTSCIFPP------THLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSF 60
MISG LYPS STSCIFPP T H+HL P PLLKISS+L+LIS ESVSLSFPT
Sbjct: 1 MISGNLYPSVSTSCIFPPKPTSTSTTAHVHL--PLPLLKISSKLRLISFESVSLSFPTFI 60
Query: 61 ASKPSAKSIRFSNLVAKVYSYEDQNPTALSDLDDLSENGVVYKKTLAMVECSMFAALNGL 120
ASK S KS RFSN VAKVYS+E QNPT+LSDL+DLSENGVVYKKTLAMVECSMFAALNGL
Sbjct: 61 ASKSSVKSTRFSNSVAKVYSFEGQNPTSLSDLEDLSENGVVYKKTLAMVECSMFAALNGL 120
Query: 121 VYFLSNSLALERWKNLYNPWKEITKASDFVWGNCSFNVGLGDKALSWEDRWRGDQPLKCV 180
VYFLSNSLALE + C
Sbjct: 121 VYFLSNSLALENY-------------------------------------------FGCF 180
Query: 181 FPNLYRISLSKAFIIKDLRKDSTWDLRFPRNLLNRELQEWDTLASLIGSFNPSGREDVLT 240
F
Sbjct: 181 F----------------------------------------------------------- 240
Query: 241 WRLDKSGMFSVKSALEEIQTKRRILEEDLSSQIWEGNIPQKVKFFLWSSALNSINTIDRI 300
Sbjct: 241 ------------------------------------------------------------ 290
Query: 301 QRRFPCLNLSRLLNFPCRKAPKLLELLRLFLLSTNSYLFNEMGRSSWEKNHGRNILAAPG 360
CL P ++ +R W GR + A
Sbjct: 301 -----CL-------------PIVISSMR------------------WGIAAGRKTMVA-- 290
Query: 361 FVWSSESFNLYEIDVSPYPYCVVFLRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGAV 420
+F L + P LRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGAV
Sbjct: 361 ------TFLLLLVLSGPVKALTFLLRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGAV 290
Query: 421 GYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYVIFGTLV---CIAEL 480
GYVLISSFLIRENILALITINIHASLTLIFTA GVNLIPSMNAIY IFGTLV C +
Sbjct: 421 GYVLISSFLIRENILALITINIHASLTLIFTASGVNLIPSMNAIYAIFGTLVMLNCGCFM 290
Query: 481 WMLHVFASPFVLRIPYKT 490
++LH+ S F+ R+ KT
Sbjct: 481 FLLHLLYSIFLTRLGLKT 290
BLAST of Spg020865 vs. ExPASy TrEMBL
Match:
A0A1S4DVX3 (uncharacterized protein LOC103488678 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103488678 PE=4 SV=1)
HSP 1 Score: 273.9 bits (699), Expect = 1.5e-69
Identity = 207/523 (39.58%), Postives = 233/523 (44.55%), Query Frame = 0
Query: 1 MISGKLYPSYSTSCIFPPTHLHLHLPSPFP-------LLKISSQLKLISSESVSLSFPTS 60
MISGKLY S S+SCIFPPT P+P P LKISS L+LIS +SVSLS P+S
Sbjct: 1 MISGKLYSSSSSSCIFPPT----PTPTPTPTGNLHLSFLKISSTLRLISFQSVSLSVPSS 60
Query: 61 FASKPSAKSIRFSNLVAKVYSYEDQNPTALSDLDDLSENGVVYKKTLAMVECSMFAALNG 120
FASK SAKS RFSN + +VYSYE QN LSDLDDLSENGVVYKKTLAMVECSMFAALNG
Sbjct: 61 FASKSSAKSTRFSNSLVEVYSYEGQNSITLSDLDDLSENGVVYKKTLAMVECSMFAALNG 120
Query: 121 LVYFLSNSLALERWKNLYNPWKEITKASDFVWGNCSFNVGLGDKALSWEDRWRGDQPLKC 180
LVYFLSNSLALE + C F + + ++ W G + + C
Sbjct: 121 LVYFLSNSLALEN------------------YFGCFFCLPIVISSMRWGIS-AGRKTMVC 180
Query: 181 VFPNLYRISLSKAFIIKDLRKDSTWDLRFPRNLLNRELQEWDTLASLIGSFNPSGREDVL 240
FP L +++
Sbjct: 181 SFPGLKQVA--------------------------------------------------- 240
Query: 241 TWRLDKSGMFSVKSALEEIQTKRRILEEDLSSQIWEGNIPQKVKFFLWSSALNSINTIDR 300
Sbjct: 241 ------------------------------------------------------------ 299
Query: 301 IQRRFPCLNLSRLLNFPCRKAPKLLELLRLFLLSTNSYLFNEMGRSSWEKNHGRNILAAP 360
L L +LS
Sbjct: 301 -------------------------TFLLLLVLS-------------------------- 299
Query: 361 GFVWSSESFNLYEIDVSPYPYCVVFLRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGA 420
P LRHGLVG TMGSLWRLGANWSTSIFLCTIVRA GA
Sbjct: 361 ----------------GPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCTIVRAFGA 299
Query: 421 VGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYVIFGTLVCIAELWM 480
VGYVL+SSFLIRENIL+LITINIHASLTLIFTAWGVNLIPSMNAIY IFGTL
Sbjct: 421 VGYVLVSSFLIRENILSLITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTL-------- 299
Query: 481 LHVFASPFVLRIPYKTWSEDFIDIAKVAGEGDVNV--RWKRYV 515
TW EDFIDI VAG+G+VN+ W+ ++
Sbjct: 481 ---------------TWFEDFIDITTVAGKGNVNILSGWRSFL 299
BLAST of Spg020865 vs. ExPASy TrEMBL
Match:
A0A1S4DVX6 (uncharacterized protein LOC103488678 isoform X7 OS=Cucumis melo OX=3656 GN=LOC103488678 PE=4 SV=1)
HSP 1 Score: 273.1 bits (697), Expect = 2.5e-69
Identity = 206/515 (40.00%), Postives = 229/515 (44.47%), Query Frame = 0
Query: 1 MISGKLYPSYSTSCIFPPTHLHLHLPSPFP-------LLKISSQLKLISSESVSLSFPTS 60
MISGKLY S S+SCIFPPT P+P P LKISS L+LIS +SVSLS P+S
Sbjct: 1 MISGKLYSSSSSSCIFPPT----PTPTPTPTGNLHLSFLKISSTLRLISFQSVSLSVPSS 60
Query: 61 FASKPSAKSIRFSNLVAKVYSYEDQNPTALSDLDDLSENGVVYKKTLAMVECSMFAALNG 120
FASK SAKS RFSN + +VYSYE QN LSDLDDLSENGVVYKKTLAMVECSMFAALNG
Sbjct: 61 FASKSSAKSTRFSNSLVEVYSYEGQNSITLSDLDDLSENGVVYKKTLAMVECSMFAALNG 120
Query: 121 LVYFLSNSLALERWKNLYNPWKEITKASDFVWGNCSFNVGLGDKALSWEDRWRGDQPLKC 180
LVYFLSNSLALE + C F + + ++ W G + + C
Sbjct: 121 LVYFLSNSLALEN------------------YFGCFFCLPIVISSMRWGIS-AGRKTMVC 180
Query: 181 VFPNLYRISLSKAFIIKDLRKDSTWDLRFPRNLLNRELQEWDTLASLIGSFNPSGREDVL 240
FP L +++
Sbjct: 181 SFPGLKQVA--------------------------------------------------- 240
Query: 241 TWRLDKSGMFSVKSALEEIQTKRRILEEDLSSQIWEGNIPQKVKFFLWSSALNSINTIDR 300
Sbjct: 241 ------------------------------------------------------------ 291
Query: 301 IQRRFPCLNLSRLLNFPCRKAPKLLELLRLFLLSTNSYLFNEMGRSSWEKNHGRNILAAP 360
L L +LS
Sbjct: 301 -------------------------TFLLLLVLS-------------------------- 291
Query: 361 GFVWSSESFNLYEIDVSPYPYCVVFLRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGA 420
P LRHGLVG TMGSLWRLGANWSTSIFLCTIVRA GA
Sbjct: 361 ----------------GPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCTIVRAFGA 291
Query: 421 VGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYVIFGTLVCIAELWM 480
VGYVL+SSFLIRENIL+LITINIHASLTLIFTAWGVNLIPSMNAIY IFGTL
Sbjct: 421 VGYVLVSSFLIRENILSLITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTL-------- 291
Query: 481 LHVFASPFVLRIPYKTWSEDFIDIAKVAGEGDVNV 509
TW EDFIDI VAG+G+VN+
Sbjct: 481 ---------------TWFEDFIDITTVAGKGNVNI 291
BLAST of Spg020865 vs. ExPASy TrEMBL
Match:
A0A1S4DWP1 (uncharacterized protein LOC103488678 isoform X4 OS=Cucumis melo OX=3656 GN=LOC103488678 PE=4 SV=1)
HSP 1 Score: 273.1 bits (697), Expect = 2.5e-69
Identity = 206/515 (40.00%), Postives = 229/515 (44.47%), Query Frame = 0
Query: 1 MISGKLYPSYSTSCIFPPTHLHLHLPSPFP-------LLKISSQLKLISSESVSLSFPTS 60
MISGKLY S S+SCIFPPT P+P P LKISS L+LIS +SVSLS P+S
Sbjct: 1 MISGKLYSSSSSSCIFPPT----PTPTPTPTGNLHLSFLKISSTLRLISFQSVSLSVPSS 60
Query: 61 FASKPSAKSIRFSNLVAKVYSYEDQNPTALSDLDDLSENGVVYKKTLAMVECSMFAALNG 120
FASK SAKS RFSN + +VYSYE QN LSDLDDLSENGVVYKKTLAMVECSMFAALNG
Sbjct: 61 FASKSSAKSTRFSNSLVEVYSYEGQNSITLSDLDDLSENGVVYKKTLAMVECSMFAALNG 120
Query: 121 LVYFLSNSLALERWKNLYNPWKEITKASDFVWGNCSFNVGLGDKALSWEDRWRGDQPLKC 180
LVYFLSNSLALE + C F + + ++ W G + + C
Sbjct: 121 LVYFLSNSLALEN------------------YFGCFFCLPIVISSMRWGIS-AGRKTMVC 180
Query: 181 VFPNLYRISLSKAFIIKDLRKDSTWDLRFPRNLLNRELQEWDTLASLIGSFNPSGREDVL 240
FP L +++
Sbjct: 181 SFPGLKQVA--------------------------------------------------- 240
Query: 241 TWRLDKSGMFSVKSALEEIQTKRRILEEDLSSQIWEGNIPQKVKFFLWSSALNSINTIDR 300
Sbjct: 241 ------------------------------------------------------------ 291
Query: 301 IQRRFPCLNLSRLLNFPCRKAPKLLELLRLFLLSTNSYLFNEMGRSSWEKNHGRNILAAP 360
L L +LS
Sbjct: 301 -------------------------TFLLLLVLS-------------------------- 291
Query: 361 GFVWSSESFNLYEIDVSPYPYCVVFLRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGA 420
P LRHGLVG TMGSLWRLGANWSTSIFLCTIVRA GA
Sbjct: 361 ----------------GPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCTIVRAFGA 291
Query: 421 VGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYVIFGTLVCIAELWM 480
VGYVL+SSFLIRENIL+LITINIHASLTLIFTAWGVNLIPSMNAIY IFGTL
Sbjct: 421 VGYVLVSSFLIRENILSLITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTL-------- 291
Query: 481 LHVFASPFVLRIPYKTWSEDFIDIAKVAGEGDVNV 509
TW EDFIDI VAG+G+VN+
Sbjct: 481 ---------------TWFEDFIDITTVAGKGNVNI 291
BLAST of Spg020865 vs. ExPASy TrEMBL
Match:
A0A1S4DVY9 (uncharacterized protein LOC103488678 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103488678 PE=4 SV=1)
HSP 1 Score: 273.1 bits (697), Expect = 2.5e-69
Identity = 206/515 (40.00%), Postives = 229/515 (44.47%), Query Frame = 0
Query: 1 MISGKLYPSYSTSCIFPPTHLHLHLPSPFP-------LLKISSQLKLISSESVSLSFPTS 60
MISGKLY S S+SCIFPPT P+P P LKISS L+LIS +SVSLS P+S
Sbjct: 1 MISGKLYSSSSSSCIFPPT----PTPTPTPTGNLHLSFLKISSTLRLISFQSVSLSVPSS 60
Query: 61 FASKPSAKSIRFSNLVAKVYSYEDQNPTALSDLDDLSENGVVYKKTLAMVECSMFAALNG 120
FASK SAKS RFSN + +VYSYE QN LSDLDDLSENGVVYKKTLAMVECSMFAALNG
Sbjct: 61 FASKSSAKSTRFSNSLVEVYSYEGQNSITLSDLDDLSENGVVYKKTLAMVECSMFAALNG 120
Query: 121 LVYFLSNSLALERWKNLYNPWKEITKASDFVWGNCSFNVGLGDKALSWEDRWRGDQPLKC 180
LVYFLSNSLALE + C F + + ++ W G + + C
Sbjct: 121 LVYFLSNSLALEN------------------YFGCFFCLPIVISSMRWGIS-AGRKTMVC 180
Query: 181 VFPNLYRISLSKAFIIKDLRKDSTWDLRFPRNLLNRELQEWDTLASLIGSFNPSGREDVL 240
FP L +++
Sbjct: 181 SFPGLKQVA--------------------------------------------------- 240
Query: 241 TWRLDKSGMFSVKSALEEIQTKRRILEEDLSSQIWEGNIPQKVKFFLWSSALNSINTIDR 300
Sbjct: 241 ------------------------------------------------------------ 291
Query: 301 IQRRFPCLNLSRLLNFPCRKAPKLLELLRLFLLSTNSYLFNEMGRSSWEKNHGRNILAAP 360
L L +LS
Sbjct: 301 -------------------------TFLLLLVLS-------------------------- 291
Query: 361 GFVWSSESFNLYEIDVSPYPYCVVFLRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGA 420
P LRHGLVG TMGSLWRLGANWSTSIFLCTIVRA GA
Sbjct: 361 ----------------GPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCTIVRAFGA 291
Query: 421 VGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYVIFGTLVCIAELWM 480
VGYVL+SSFLIRENIL+LITINIHASLTLIFTAWGVNLIPSMNAIY IFGTL
Sbjct: 421 VGYVLVSSFLIRENILSLITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTL-------- 291
Query: 481 LHVFASPFVLRIPYKTWSEDFIDIAKVAGEGDVNV 509
TW EDFIDI VAG+G+VN+
Sbjct: 481 ---------------TWFEDFIDITTVAGKGNVNI 291
BLAST of Spg020865 vs. TAIR 10
Match:
AT1G26180.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2232, membrane (InterPro:IPR018710); Has 285 Blast hits to 285 proteins in 90 species: Archae - 0; Bacteria - 140; Metazoa - 0; Fungi - 0; Plants - 40; Viruses - 0; Other Eukaryotes - 105 (source: NCBI BLink). )
HSP 1 Score: 135.2 bits (339), Expect = 1.6e-31
Identity = 71/122 (58.20%), Postives = 93/122 (76.23%), Query Frame = 0
Query: 371 PYPYCVVFLRHGLVGLTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIRENILA 430
P FL HGLVGL +GSLW +GA+W SIFLCT+VRALG +GYVL SSFLIRENILA
Sbjct: 156 PVKALTYFLTHGLVGLALGSLWSMGASWRLSIFLCTMVRALGLIGYVLTSSFLIRENILA 215
Query: 431 LITINIHASLTLIFTAWGVNLIPSMNAIYVIFGTLVCIAE---LWMLHVFASPFVLRIPY 490
+ITINIHASL+ +FTA G+N++PSM+ IY+IFGT++ + + +LH+ S F+ R+
Sbjct: 216 VITINIHASLSYVFTAMGLNIMPSMSLIYMIFGTVLLLNSGFFVLLLHLLYSIFLTRLGM 275
HSP 2 Score: 55.1 bits (131), Expect = 2.1e-07
Identity = 25/34 (73.53%), Postives = 31/34 (91.18%), Query Frame = 0
Query: 94 VVYKKTLAMVECSMFAALNGLVYFLSNSLALERW 128
VVY+KTL +VEC+MFAA+ GLVYFLSNSLA+E +
Sbjct: 85 VVYQKTLRLVECAMFAAVTGLVYFLSNSLAIENY 118
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022929150.1 | 3.1e-69 | 42.37 | uncharacterized protein LOC111435817 [Cucurbita moschata] | [more] |
XP_016900134.1 | 3.1e-69 | 39.58 | PREDICTED: uncharacterized protein LOC103488678 isoform X3 [Cucumis melo] | [more] |
XP_016900141.1 | 5.2e-69 | 40.00 | PREDICTED: uncharacterized protein LOC103488678 isoform X7 [Cucumis melo] | [more] |
XP_016900135.1 | 5.2e-69 | 40.00 | PREDICTED: uncharacterized protein LOC103488678 isoform X4 [Cucumis melo] | [more] |
XP_016900133.1 | 5.2e-69 | 40.00 | PREDICTED: uncharacterized protein LOC103488678 isoform X1 [Cucumis melo] | [more] |
Match Name | E-value | Identity | Description | |
P0C2F6 | 6.6e-06 | 27.81 | Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1ETG3 | 1.5e-69 | 42.37 | uncharacterized protein LOC111435817 OS=Cucurbita moschata OX=3662 GN=LOC1114358... | [more] |
A0A1S4DVX3 | 1.5e-69 | 39.58 | uncharacterized protein LOC103488678 isoform X3 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A1S4DVX6 | 2.5e-69 | 40.00 | uncharacterized protein LOC103488678 isoform X7 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A1S4DWP1 | 2.5e-69 | 40.00 | uncharacterized protein LOC103488678 isoform X4 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A1S4DVY9 | 2.5e-69 | 40.00 | uncharacterized protein LOC103488678 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
Match Name | E-value | Identity | Description | |
AT1G26180.1 | 1.6e-31 | 58.20 | unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2232... | [more] |