Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
CCAAAATTCTGAATTTCTCAAGCAAGAAGAAGAAGTGGCAATGGAAGAACTTCAATCTCCACCAGAACCATCATCCACTGACATTACCTCCAACAAAGGGAGGAAGAAACCACAGGCTAAGGAGAAGAAGGAACCGGAGAAAAGAGCTAAGAAGACCTCCAACAAAGGGAAGAAGAAACCACCGGCTAAGGAGAAGAAGGAACCGGAGAAAAGAGCTAAGAAGAAGACACCAGTGGCTACTACTACTGCTGCTGCTGCTACTACTACTTCTACTTCAGTCAACGAACACCAACGCACTGATCGTTTAAATGATGTTTTGCCCAAGGTTAAGGTTTCAGAGTTTGATCCTTGTGTTGAAAATCATTTTAGAGCCATGGATGCAATTGTTGAGCTCTGTTGTGAAGCAGAGGAGGGCGATGGTGGAATTGACGAAAGTGACATTCAGCGCTTTTCATCGTCCACAATTTTCTTGAGGTACTCAATGGTTGCATGTTACTAGCTTAACGTATGTTTGATTGGTGAAGATTTTGTGGTTTGTGCTTGGCATGCAATGTATTGAATTTAATCCCATTTAATCAGGGAATGGAGGTTCTACAATTATGAGGCGAAAACTATCAAGTTCGCTAATGATTCGACAGGTCCTGAGGGTAAGGATGCTGATATCACGATAAACTTACCACAATTTTCTTCTGCAGCTGTTCTAAAGGTGCTGGATTTAGGCCTGTTGTTTTCTTTATAATTTTGTTTCCTGCTGCTTGGCTTGTTTTTTCCTTTTAGAATATGAACTGTTACTCTCTTGCAGCATTATGAATTAACATAAGTCTTAATTGATATACAGAAAGGAGCACCGCCTGGAGCCTCTACATCTCTGGATTTTCGGTATGGTGTTCTGGGTTGTTGTTTTTTGTATTTTTTAGGAAATTATGTTTTTTGTTGGTACCATCCTTGGAAATATAAGGAGTGACAAAATCTGTATTTGACTTTCAAGGTCGTACTGATGTAATGCTTACATAATGACCTGTGTCTAAGGTACTGTATGCCTCATCTAATGCACAAGTACTATTAAAAAGACTGCATACCATGGAAATTTGGTATTAAGCTCTTACTAGTTGGATTCCCTCCCTCGAGACATTTGTGAGTGGAACTTTCTGCCCATCATGTGATTAAAATATGTAATATTAAATCGCCAATTGACTCAAGCTATTGGGTATGGGTAAATTTAATCATGTATTATTTGACACTCCCCACACATGTAGACTTGAAATATGAAAGAAGACCCCATAAGTGGAAATTAATATTAAATAGAAAGTAAATAACCATCATGGGTTGGTCTAGTGGTAGAAAAAGGAGATATGGTCTTGACAACTAACTAAGCGGTCATTTGTTCAATCCATGATGGCTACCTACTGAAGAATTAATTTCTAGAAGTTTTCTTGGCATTCGAATGTTGTTGGGTCAGACAGGTTGCTTATTGAGAATAGTCGGGATGAGCGTAAGAGCATACACACCACAATGTAACTTATGGTACTACTTTGAGTTTTGATATCTTCTGCTGTTAGTTACAGTAGTTTTCCATTGGTGGGATAGTGTTCTCTGTTGTGTATGAAATGAAGCTAGTTTTTTTGGTGTCTAATGTAATTGTGCCTTCCGCAGTTGATGTTGATATGTACATTCAATCAATCATTTTTGTTGATTTTTGACAGAAACTTTGCTATGCATGTCGGTGGGCCTGTTTGGGCCATAGATTGGTGTCCTCAAGTTCATGGAAGGACCAACTCCCTTATCAAATGTGAGGTATCTCTCCTCTTCTATTTCCCTTCTTTAAGCTAAACTGTATGAATACCTTATAATCCAAGTGTAAAATATTGTTTTATTCATCTTCACTTACGACTTTCATTTTCATTCACTTAGATAGGCTCTGATTTCTATTTGTTCATATTTTCTCTCTATTTATAAGTTAATATTCGACTCAACTTGTGTGACAGTTTATTGCTGTTTCT
GCTCATCCACCTGGTTCTTCTTATCACAAGATGGGTATCCCGCTCACAGGAAGAGGTATGGTACAGATATGGTGTTTAGTGCATGGCACCGAAAACTATGAACCGATCGATGTAGGAGAGCCTCCTTCAGATTTGTCTTCTCAACCAAAGAAGCCTAGAGGAAGGCCACCAGGGCGCAAAAAAAAGGAGGCATCAGGCTTGCCATCTCCACCGAAGAGGCCTAGAGGAAGACCTAAAAAGGAACAAAAAGAATCCACTGACAAGAAGGGTGACAATTGCCAGCTTGTTCAGGAATTTTCTATGGAAAACCCAGTTGGTTCATCCAGCTTGCTTGAGATTGATGGCGTCCCCAAAAATACTGAAAATTTTGTATTACTGGAAAACAATGTTGAAAGAGAGAGGAGTACCTTACAAGAAGTTTCTACATGTAATTCTGAAGATGAAGTTCCTGCTAAGAAAAGGAGAGTGAGAAGAAAAGTTAAGTCTAGGAATCTTGTCGATGACGTGGGAGTGTCATCACTTACAGAGTATCAAGAAGATGGATCCATTGCTAACAACCATGAGGCGGATGAGAATGTTAAAAGTGAATATTCTGGGGAAGACAATCTGTTATGCAAGGACATTTCAGAGAATGTTGTCTTAGACGCTAGCTCAATTGAATTTTCTATTCCCGAGAGTGTTGCTTTGCCAAGAGTCGTACTGTGCTTAGCTCACAATGGAAAGGTAGCATGGGATTTGAAATGGAAGCCAATCAATGCATGTACTGACAATTGCAAGCACCGAATGGGCTACCTTGCTGTCTTGCTAGGCAATGGATCTCTAGAAGTGTAATTAAACTTCTTTGACTGTCTTGTGACATGTTTGCTCAGTTACCAACTTACCGTATCTTTACTTTTCTTTTCGTCCATTTTCCCATAACTTTTAATTTATTATTTTTCCTGCGTATATTTAGTTATTCTCTTGAGTTGCTTAAGATTTATAATAAGTTAGATTTGGAAGTATGCTGGATAATCTACTCTGATGTTTCCTTTTGAGATCAGTTAAAACTTGTGTTGAATATGACCCATATTTTTCTTGCTAAACCGAGTTTCGGAATTGCTATCATAGTGGATCTGATTTTTTGACATTACGTATTTGCTAGTAGAACCGCTTTTTATTTGAGCATGCCATAGCATAAAAACTACTTTTGAAGTATGTAGTTTTTGGAGAAATTATACAAATAAGATTACACTATATACAATACAAAGTAAGTTCTTAAAATACTGGTGTTATCCTAGTTAAATTGCTGTTCTTGATCCTTTGGTTGTTCTAAGGTTCTGACTTTTAATGTTTACTACGTGCCTCCAATAATTTGCAATATTTGTTCGTTATTGGGAACTGCACTAAATAATTGATCATATGAACTAATTGCAGCTGGGAGGTTCCTTTTCCCCATGCAGTGAAGACCATCTATTCTAAATTCAATGGGGAGGGTACAGATCCTCGCTTTGTGAAGTTGAAGCCTATTTTCAGATGTTCGAGGTTGAGAACTGCAAATACACAGAGGTATCCGGTGTTTATTTTTAATATAATGACATGACAACTCTTCAAGTTTAACACGGTGCAGTTTTATGAAAAGATAGTGCAATGTGTTCAGTATTGCGATAAATTGAAAAGTAGAATAGGCTGCCTGTTCAAGTTTATCACGGTGCTGTTTTGAATGAATTTCTTGGCTTTGCCTGTTGTTAGGAGTGGTATTATGAACTTATGAAGTACGTTTAAGACTATTCACTTTGTTCTTGTTGAATCCAAGAATTAATCTTTGGTTCCACGATCAACCACATCCTGTAAAGGAAAAATTATATCATTGATTGGATTATATTTAAGTGATTACGAAAACTTTTTCATTGATTTTTCTTTTAAATATATTCTGGGTTGTTACAGTTGTCATTATTTTGGTAATATTGCATCACGTGTTAACTATTTCTCACGAGTTTTATTTGTCCCTAACTTTCTTTGTT
ACTGCTAATTCATTGTCCATTGCTGGTGCTGCAGCATCCCTCTGACAGTGGAATGGTCTCTCGCACCTCCTTATGATTATCTACTCGCTGGATGTCATGATGGAACGGTTATTATTTAATTTCCCTTTTCCTTTTAATGTTGCATTCAGGGAGACTTAGTTCTTCATTTGGTTTCAGATATTTCTAGAACTGCTTACTCATTCTTTTGCTCAAATGTTGTTCGGAGCGTGAGAAAATTGAAGTTCCTGAAGATTTCTCTTTCCAATTATATACTTTTTAGGTTTTTTCTTGTTTTTGATCATCACCCATACCGATAAAACATTTTTTCTGACAATTCCTGATACAGTGTTGGGTAGTTCTGTCTCTTGTTGTTTTGAATTTTAAATTTATGACGACTAAAACTCAGTTCATCCATCCAGGTCGCATTGTGGAAGTTCTCTGCCAATAGTTCTTGTGAAGGTTGTTTTGCTTCTCTTAACTTGTGTTTCTAATTCTCTCTTGTTTGTACTTTAAAATTTGGCTTGTGCAATTACAAGTGATATGGCTCTCATTTTCTTCTTATTTTCTTCAGATACGAGGCCTTTACTTCGTTTTAGTGCAGATACAGTCCCAATAAGAGCAGTTGCATGGGCACCAAGTGAAAGGTTTGTAGTGGAGTTCAAGTCAACCCCTCGCCCCTTCCAAAGAAAAAAAAAAAAAACAGAGAGGGAGAGAGTAGAAATAAAAAGGGATGTCATATTTCTTAAATTTAGTTATTGACATTTCTGCTTATTCACAGTAATTAGGAAATGTTACCCTAAGAAAATGGTCACAATGCCAATCCTCCCACCCTAGTGCTGTTTACATAATCGGGAAAACATTTTTACCTTTGATGACAAAGAATAATAACTGGACTATTAACCATGAAATGGGTTGTCTGGTGAAAATAGTTGAGGGTGCATCCTGACAATCATAGTATAGGATAGGATTAGATATAAAAAAATTATACTGTTAGATGACCAACCTATCCTATATTATTAAGGTCGAAATTATAAATATTCATGTATTTTCATATTATGTGCAGCAACCTCGAAAGTGCAAATGTGATACTTACTGCTGGTCATGGAGGTTTAAAATTTTGGGACCTAAGGTTGGTGGAGTTATTTATTTTCTTTTATTTTTCATTATTATTATTTTTGGATTATATATAAAGTCAAGTTTTCATTAAGACAAGATGAAAGAAATATATAAGGCCATAGAAAAAAGTTTCCACCTGACAAGTTAAAAGTGAACTAAATTATACAAGAAGGGCATCTTATTCAGAAAACACCTAGTGGAGAAAAAACATTGTGTTGATTGACGAGGTGGAATTACAAAGACGAAAAGCTCCTGCTTTCCAAAATCATTTCCAATTGGCAATAAGAGAAGACATAATTGTGATTTGTGAAAGTCCTGATAACTATTATAACATCATTAGTCAAAACAATAGTGAATAGAAAAATCTATCAAAGCATCTTTTTCATCCTTGCAAATGCGTTTTTCTCCTTTTCTTCCATGTGGAACACAGGAAGGTTCTCTAATGATACGATTTCTTAATCCATTTATTCTGAGATCATTGCATATGTTTTGAATTATTTTGGTTTTCAGTTCACATTATATTGTTTCATTTCAACTCAGTGGCAATCTGATATCAATCATTACAATATTATATCAGAGATCCTTTCCGCCCCTTGTGGGACCTTCATCCAGCACCGAGGATTATATATAGTCTGGATTGGCTTCCTAATCCTAGGTACATACTTTATCTCGATAGAATGCAGATTAGCCACATTAATGTATGGTATTACTAACTGTCTTTTTTTACAATGAACCTATTCCGATTAGATGCGTTTTTTTATCCTTTGATGATGGAACATTGAGACTTCTCAGTTGCTAAAGGCTGCAAATGATGTTCCAGCAACTGGCCAACCCTTTACAGCGATAAAACAAAAAGGTTTACACACTTACATTTGTTCATCATA
TGCCATCTGGAGTATTCAAGTGTCAAGGCAGACAGGTATATCTGTAATTCATGGTTTGTTAGCTTTAGTTTGCTAGAGATCTAATTTTTAACATCAACCACTGTAATTTGGGTTTTCCATTTGCCATTGAATTGATTTACGTTTATTACTGTCATCAGTGGTCTGAGGTCTCTACAGCTCTAGTGTAGCGTTAGTCTTGATAACTGAAAATGTGTTTATATTGCTATCGCTCTCCTGATTATTTATGTTTCTTCATGTCATTAAACAGATGAAAAACGAATTCTCATTTGTGTGGTTATATGCCTGTCTTGATTTAATTTCAGCCGTGCGGAAATAAACAACAATTGTCTCACATTATCATTATGTCTGATTTGGTTTTACTTTTAATTTTTAAATTTCAACACAATCAAGTGAATATTTGAATTTTATCATGTACATTTAAGATATAGCAACTGATTATTTCAGGCATGGTTGCATACTGCGGTGCTGATGGAGCTGTTGTCCGTTTCCAAGTAAGTTCAACTTGCACAGTAGATTTTGCTATGGAGGTGAAAGTTTGCCTCTTCGGCACTTGTCATTTATTTGGTCTCAGTTATTTTATTTTATTTTTTTAATCTTAGCTTATAGTTCTGTTAGTTTATTCTACTTTGCTGCTAAGTATTTATTTATGTGTGTATATATTAATATAATAAACTACTTCTCCTTTGAAGGAGAGAATTACAAAAGGGGTCTAGACAAAGGCATCCTATCCCAGATCTTGGGATGGTTTCAGCAATCCCTCTTTTTATTATTAAAATAATAAATGCATTGTTTGATCAGTTGAAAATTATCAATTCTTTGATAAAGAGGTTTTTGCCATAAAAATTAGTTTTGATAGTTGAAGTATATAAATAGCTGAAGAGTTCATATATGCTTTCTTTTTCTCTATGGACCACAGATGGAGTTGAAGATAATTGATACCACTTCAAATCATACCTTGAATTATCATTTCATTATTTGGTTTTTGTTCTTGTTTAATTATCAGCTTACTACGAAAGCAGCGGACAAAGAGAATTCACGCCATCGTACCCCACATTATGTATGTGAATACTTAACCGAGGAGGAATCAATTATTACATTCCGCTCGCCACCACCAAATGTTCCAATCCCTTTGAAAAAGCTGTCCAACAAATCTGAACACCCACTGTCCATGCGAGCTATTTTATCTGATTCAATGCAGTCTAATGAAGGAAATCATAAAACAGCCACAGCTTCAACATTGGAAAATGAAGCATCCATTTGCTCGGATGTCGATGTCGGTGTTGAATCTGGATCTGAGGATACACCGTTGTCCACCAAGAAGAAGAACCGAACTCAACCGAAGTGCAAGAAGAAGGGAGTTGAGAACCTAGAATTGGAATGTAACGTTGAGCCTAAAGATGATGCGCATATAGATGCTGACGTAGAAGCACAAACAGATGCTGTCTTAGAAGCACGGATGGATGCTGACGTAGTGCCCAGTTCGGGGGATCACTTTGAAAATCTTCCTCCCAAATCAGTTGCAATGCATAGAGTGAGATGGAACATGAACATGGGGAGTGAAAAATGGTTGTGCTACGGTGGAGCATCTGGAATTCTACGCTGTCAGGAGATGGTGCTGTCGGCCCTCGATATGAAGTTGATGAAGAAAAAATGAAATTTATTTGAAAAAAATGGCTTCCTAAAAATGTTTATTGGGCAGCTGATTTTCACGAAGCGACTTGTGATCTTGCTCAGTTCTTTGTACAGTTGTAAGATCAGAGAAGTCATTTTGAATTGGAATAATTGTTATTGCCATCGACCCGCAGTGCTTTCATCCTCAATAGGCTTAAATTAGAGGAGCAGCTAAACGAATAAAAGAGCATGACCTTGGGATAGCAGAAGACCAATAACTAGGAAGGTTATTTAATAAGAGAAGTTCAAGAGCCTTAGCAGATAAACTTGATTTGGTTTTGTGTTCAGCAGAAATGGAATGGGCAACT
GACATCGATTTTTATTCGCATGAATAGTTAGAGGAACATCAGCAGTCATGGTAAAACTTTAGCCACTTGTGTATTCTAAAATAACTGACACAAAGCTCGCCTCCTTAGCGATTGGGTGTCGGGTTTAGTATGTCGTGAGGCTGTTCGTCTAGAAGATGAGTTTCTAATAAATAAGGCTATTATCGGAGCTTGATTATTTTCTTTTTTATATGTAATCTGTTTTTCTTAACTCTTCTATATTGTAAATGAGCTTTCAAATATTGTTTCCTTCCTAACTTTGTATCAG
mRNA sequence
CCAAAATTCTGAATTTCTCAAGCAAGAAGAAGAAGTGGCAATGGAAGAACTTCAATCTCCACCAGAACCATCATCCACTGACATTACCTCCAACAAAGGGAGGAAGAAACCACAGGCTAAGGAGAAGAAGGAACCGGAGAAAAGAGCTAAGAAGACCTCCAACAAAGGGAAGAAGAAACCACCGGCTAAGGAGAAGAAGGAACCGGAGAAAAGAGCTAAGAAGAAGACACCAGTGGCTACTACTACTGCTGCTGCTGCTACTACTACTTCTACTTCAGTCAACGAACACCAACGCACTGATCGTTTAAATGATGTTTTGCCCAAGGTTAAGGTTTCAGAGTTTGATCCTTGTGTTGAAAATCATTTTAGAGCCATGGATGCAATTGTTGAGCTCTGTTGTGAAGCAGAGGAGGGCGATGGTGGAATTGACGAAAGTGACATTCAGCGCTTTTCATCGTCCACAATTTTCTTGAGGGAATGGAGGTTCTACAATTATGAGGCGAAAACTATCAAGTTCGCTAATGATTCGACAGGTCCTGAGGGTAAGGATGCTGATATCACGATAAACTTACCACAATTTTCTTCTGCAGCTGTTCTAAAGAAAGGAGCACCGCCTGGAGCCTCTACATCTCTGGATTTTCGAAACTTTGCTATGCATGTCGGTGGGCCTGTTTGGGCCATAGATTGGTGTCCTCAAGTTCATGGAAGGACCAACTCCCTTATCAAATGTGAGTTTATTGCTGTTTCTGCTCATCCACCTGGTTCTTCTTATCACAAGATGGGTATCCCGCTCACAGGAAGAGGTATGGTACAGATATGGTGTTTAGTGCATGGCACCGAAAACTATGAACCGATCGATGTAGGAGAGCCTCCTTCAGATTTGTCTTCTCAACCAAAGAAGCCTAGAGGAAGGCCACCAGGGCGCAAAAAAAAGGAGGCATCAGGCTTGCCATCTCCACCGAAGAGGCCTAGAGGAAGACCTAAAAAGGAACAAAAAGAATCCACTGACAAGAAGGGTGACAATTGCCAGCTTGTTCAGGAATTTTCTATGGAAAACCCAGTTGGTTCATCCAGCTTGCTTGAGATTGATGGCGTCCCCAAAAATACTGAAAATTTTGTATTACTGGAAAACAATGTTGAAAGAGAGAGGAGTACCTTACAAGAAGTTTCTACATGTAATTCTGAAGATGAAGTTCCTGCTAAGAAAAGGAGAGTGAGAAGAAAAGTTAAGTCTAGGAATCTTGTCGATGACGTGGGAGTGTCATCACTTACAGAGTATCAAGAAGATGGATCCATTGCTAACAACCATGAGGCGGATGAGAATGTTAAAAGTGAATATTCTGGGGAAGACAATCTGTTATGCAAGGACATTTCAGAGAATGTTGTCTTAGACGCTAGCTCAATTGAATTTTCTATTCCCGAGAGTGTTGCTTTGCCAAGAGTCGTACTGTGCTTAGCTCACAATGGAAAGGTAGCATGGGATTTGAAATGGAAGCCAATCAATGCATGTACTGACAATTGCAAGCACCGAATGGGCTACCTTGCTGTCTTGCTAGGCAATGGATCTCTAGAAGTCTGGGAGGTTCCTTTTCCCCATGCAGTGAAGACCATCTATTCTAAATTCAATGGGGAGGGTACAGATCCTCGCTTTGTGAAGTTGAAGCCTATTTTCAGATGTTCGAGGTTGAGAACTGCAAATACACAGAGCATCCCTCTGACAGTGGAATGGTCTCTCGCACCTCCTTATGATTATCTACTCGCTGGATGTCATGATGGAACGGTCGCATTGTGGAAGTTCTCTGCCAATAGTTCTTGTGAAGATACGAGGCCTTTACTTCGTTTTAGTGCAGATACAGTCCCAATAAGAGCAGTTGCATGGGCACCAAGTGAAAGCAACCTCGAAAGTGCAAATGTGATACTTACTGCTGGTCATGGAGGTTTAAAATTTTGGGACCTAAGAGATCCTTTCCGCCCCTTGTGGGACCTTCATCCAGCACCGA
GGATTATATATAGTCTGGATTGGCTTCCTAATCCTAGATGCGTTTTTTTATCCTTTGATGATGGAACATTGAGACTTCTCAGTTGCTAAAGGCTGCAAATGATGTTCCAGCAACTGGCCAACCCTTTACAGCGATAAAACAAAAAGGTTTACACACTTACATTTGTTCATCATATGCCATCTGGAGTATTCAAGTGTCAAGGCAGACAGGCATGGTTGCATACTGCGGTGCTGATGGAGCTGTTGTCCGTTTCCAACTTACTACGAAAGCAGCGGACAAAGAGAATTCACGCCATCGTACCCCACATTATGTATGTGAATACTTAACCGAGGAGGAATCAATTATTACATTCCGCTCGCCACCACCAAATGTTCCAATCCCTTTGAAAAAGCTGTCCAACAAATCTGAACACCCACTGTCCATGCGAGCTATTTTATCTGATTCAATGCAGTCTAATGAAGGAAATCATAAAACAGCCACAGCTTCAACATTGGAAAATGAAGCATCCATTTGCTCGGATGTCGATGTCGGTGTTGAATCTGGATCTGAGGATACACCGTTGTCCACCAAGAAGAAGAACCGAACTCAACCGAAGTGCAAGAAGAAGGGAGTTGAGAACCTAGAATTGGAATGTAACGTTGAGCCTAAAGATGATGCGCATATAGATGCTGACGTAGAAGCACAAACAGATGCTGTCTTAGAAGCACGGATGGATGCTGACGTAGTGCCCAGTTCGGGGGATCACTTTGAAAATCTTCCTCCCAAATCAGTTGCAATGCATAGAGTGAGATGGAACATGAACATGGGGAGTGAAAAATGGTTGTGCTACGGTGGAGCATCTGGAATTCTACGCTGTCAGGAGATGGTGCTGTCGGCCCTCGATATGAAGTTGATGAAGAAAAAATGAAATTTATTTGAAAAAAATGGCTTCCTAAAAATGTTTATTGGGCAGCTGATTTTCACGAAGCGACTTGTGATCTTGCTCAGTTCTTTGTACAGTTGTAAGATCAGAGAAGTCATTTTGAATTGGAATAATTGTTATTGCCATCGACCCGCAGTGCTTTCATCCTCAATAGGCTTAAATTAGAGGAGCAGCTAAACGAATAAAAGAGCATGACCTTGGGATAGCAGAAGACCAATAACTAGGAAGGTTATTTAATAAGAGAAGTTCAAGAGCCTTAGCAGATAAACTTGATTTGGTTTTGTGTTCAGCAGAAATGGAATGGGCAACTGACATCGATTTTTATTCGCATGAATAGTTAGAGGAACATCAGCAGTCATGGTAAAACTTTAGCCACTTGTGTATTCTAAAATAACTGACACAAAGCTCGCCTCCTTAGCGATTGGGTGTCGGGTTTAGTATGTCGTGAGGCTGTTCGTCTAGAAGATGAGTTTCTAATAAATAAGGCTATTATCGGAGCTTGATTATTTTCTTTTTTATATGTAATCTGTTTTTCTTAACTCTTCTATATTGTAAATGAGCTTTCAAATATTGTTTCCTTCCTAACTTTGTATCAG
Coding sequence (CDS)
ATGGAAGAACTTCAATCTCCACCAGAACCATCATCCACTGACATTACCTCCAACAAAGGGAGGAAGAAACCACAGGCTAAGGAGAAGAAGGAACCGGAGAAAAGAGCTAAGAAGACCTCCAACAAAGGGAAGAAGAAACCACCGGCTAAGGAGAAGAAGGAACCGGAGAAAAGAGCTAAGAAGAAGACACCAGTGGCTACTACTACTGCTGCTGCTGCTACTACTACTTCTACTTCAGTCAACGAACACCAACGCACTGATCGTTTAAATGATGTTTTGCCCAAGGTTAAGGTTTCAGAGTTTGATCCTTGTGTTGAAAATCATTTTAGAGCCATGGATGCAATTGTTGAGCTCTGTTGTGAAGCAGAGGAGGGCGATGGTGGAATTGACGAAAGTGACATTCAGCGCTTTTCATCGTCCACAATTTTCTTGAGGGAATGGAGGTTCTACAATTATGAGGCGAAAACTATCAAGTTCGCTAATGATTCGACAGGTCCTGAGGGTAAGGATGCTGATATCACGATAAACTTACCACAATTTTCTTCTGCAGCTGTTCTAAAGAAAGGAGCACCGCCTGGAGCCTCTACATCTCTGGATTTTCGAAACTTTGCTATGCATGTCGGTGGGCCTGTTTGGGCCATAGATTGGTGTCCTCAAGTTCATGGAAGGACCAACTCCCTTATCAAATGTGAGTTTATTGCTGTTTCTGCTCATCCACCTGGTTCTTCTTATCACAAGATGGGTATCCCGCTCACAGGAAGAGGTATGGTACAGATATGGTGTTTAGTGCATGGCACCGAAAACTATGAACCGATCGATGTAGGAGAGCCTCCTTCAGATTTGTCTTCTCAACCAAAGAAGCCTAGAGGAAGGCCACCAGGGCGCAAAAAAAAGGAGGCATCAGGCTTGCCATCTCCACCGAAGAGGCCTAGAGGAAGACCTAAAAAGGAACAAAAAGAATCCACTGACAAGAAGGGTGACAATTGCCAGCTTGTTCAGGAATTTTCTATGGAAAACCCAGTTGGTTCATCCAGCTTGCTTGAGATTGATGGCGTCCCCAAAAATACTGAAAATTTTGTATTACTGGAAAACAATGTTGAAAGAGAGAGGAGTACCTTACAAGAAGTTTCTACATGTAATTCTGAAGATGAAGTTCCTGCTAAGAAAAGGAGAGTGAGAAGAAAAGTTAAGTCTAGGAATCTTGTCGATGACGTGGGAGTGTCATCACTTACAGAGTATCAAGAAGATGGATCCATTGCTAACAACCATGAGGCGGATGAGAATGTTAAAAGTGAATATTCTGGGGAAGACAATCTGTTATGCAAGGACATTTCAGAGAATGTTGTCTTAGACGCTAGCTCAATTGAATTTTCTATTCCCGAGAGTGTTGCTTTGCCAAGAGTCGTACTGTGCTTAGCTCACAATGGAAAGGTAGCATGGGATTTGAAATGGAAGCCAATCAATGCATGTACTGACAATTGCAAGCACCGAATGGGCTACCTTGCTGTCTTGCTAGGCAATGGATCTCTAGAAGTCTGGGAGGTTCCTTTTCCCCATGCAGTGAAGACCATCTATTCTAAATTCAATGGGGAGGGTACAGATCCTCGCTTTGTGAAGTTGAAGCCTATTTTCAGATGTTCGAGGTTGAGAACTGCAAATACACAGAGCATCCCTCTGACAGTGGAATGGTCTCTCGCACCTCCTTATGATTATCTACTCGCTGGATGTCATGATGGAACGGTCGCATTGTGGAAGTTCTCTGCCAATAGTTCTTGTGAAGATACGAGGCCTTTACTTCGTTTTAGTGCAGATACAGTCCCAATAAGAGCAGTTGCATGGGCACCAAGTGAAAGCAACCTCGAAAGTGCAAATGTGATACTTACTGCTGGTCATGGAGGTTTAAAATTTTGGGACCTAAGAGATCCTTTCCGCCCCTTGTGGGACCTTCATCCAGCACCGAGGATTATATATAGTCTGGATTGGCTTCCTAATCCTAGATG
CGTTTTTTTATCCTTTGATGATGGAACATTGAGACTTCTCAGTTGCTAA
Protein sequence
MEELQSPPEPSSTDITSNKGRKKPQAKEKKEPEKRAKKTSNKGKKKPPAKEKKEPEKRAKKKTPVATTTAAAATTTSTSVNEHQRTDRLNDVLPKVKVSEFDPCVENHFRAMDAIVELCCEAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTIKFANDSTGPEGKDADITINLPQFSSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHGRTNSLIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDVGEPPSDLSSQPKKPRGRPPGRKKKEASGLPSPPKRPRGRPKKEQKESTDKKGDNCQLVQEFSMENPVGSSSLLEIDGVPKNTENFVLLENNVERERSTLQEVSTCNSEDEVPAKKRRVRRKVKSRNLVDDVGVSSLTEYQEDGSIANNHEADENVKSEYSGEDNLLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKVAWDLKWKPINACTDNCKHRMGYLAVLLGNGSLEVWEVPFPHAVKTIYSKFNGEGTDPRFVKLKPIFRCSRLRTANTQSIPLTVEWSLAPPYDYLLAGCHDGTVALWKFSANSSCEDTRPLLRFSADTVPIRAVAWAPSESNLESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLSC
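The protein sequence above should correspond to an in-frame translation of the coding sequence. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Remove line breaks and whitespace left over from wrapped sequence lines.
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons containing ambiguous bases (e.g. N) are reported as X.
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGAAGAACTTCAATCTCCACCAGAACCA"))  # MEELQSPPEP
```

Because the CDS is wrapped over two lines in this listing, the function strips whitespace before translating, so both lines can be pasted in together.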
Homology
BLAST of IVF0020322 vs. ExPASy Swiss-Prot
Match:
Q8BL74 (General transcription factor 3C polypeptide 2 OS=Mus musculus OX=10090 GN=Gtf3c2 PE=2 SV=2)
HSP 1 Score: 63.9 bits (154), Expect = 7.9e-09
Identity = 44/190 (23.16%), Positives = 79/190 (41.58%), Query Frame = 0
Query: 480 WDLKWKPINACTD-------NCKHRMGYLAVLLGNGSLEVWEVPFPHAVKTIYSKFNGEG 539
WDLK+ P A R+G LA+ +G + ++ +P P A + ++ +
Sbjct: 467 WDLKFCPSGAWEHPETLRKAPLLPRLGLLALACSDGKVLLFSLPHPEA---LLAQQPPDA 526
Query: 540 TDPRFVKLKPIFRCSRLRTANTQSIP-------LTVEWSLAPPYDYLLAGCHDGTVALWK 599
P K++ + + L+ + Q+ L++ W P+ +L AG ++G V W
Sbjct: 527 MKPAIYKVQCL---ATLQVGSVQASDPSECGQCLSLAWMPTRPHHHLAAGYYNGMVVFWN 586
Query: 600 FSANSSCEDTR---------PLLRFSADTVPIRAVAWAPSESNLESANVILTAGHGGLKF 647
NS + R P F A +R + W + S+ ++ +KF
Sbjct: 587 LPTNSPLQRIRLSDGSLKLYPFQCFLAHDQAVRTIQWCKANSHF----LVSAGSDRKIKF 646
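The percentages in an HSP summary line are the identical and positive-scoring residue counts divided by the alignment length. The sketch below recomputes them from the Swiss-Prot hit above; `parse_hsp_stats` is a hypothetical helper, and the regular expressions assume the "count/length" layout used in this report.

```python
import re


def parse_hsp_stats(line: str) -> dict:
    """Extract identity and positive counts from an HSP summary line."""
    ident = re.search(r"Identity\s*=\s*(\d+)/(\d+)", line)
    # "Posi?tives" also accepts the "Postives" spelling seen in some reports.
    posit = re.search(r"Posi?tives\s*=\s*(\d+)/(\d+)", line)
    if ident is None or posit is None:
        raise ValueError("not an HSP summary line")
    n_ident, length = int(ident.group(1)), int(ident.group(2))
    n_posit = int(posit.group(1))
    return {
        "identities": n_ident,
        "positives": n_posit,
        "alignment_length": length,
        "pct_identity": round(100 * n_ident / length, 2),
        "pct_positives": round(100 * n_posit / length, 2),
    }


stats = parse_hsp_stats("Identity = 44/190 (23.16%), Positives = 79/190 (41.58%)")
print(stats["pct_identity"], stats["pct_positives"])  # 23.16 41.58
```

The alignment length (190 here) counts aligned columns including gaps, so it can exceed the span of query residues covered by the hit.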
BLAST of IVF0020322 vs. ExPASy TrEMBL
Match:
A0A5D3DPQ1 (DNA binding protein, putative isoform 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G003700 PE=4 SV=1)
HSP 1 Score: 1370.1 bits (3545), Expect = 0.0e+00
Identity = 681/681 (100.00%), Positives = 681/681 (100.00%), Query Frame = 0
Query: 1 MEELQSPPEPSSTDITSNKGRKKPQAKEKKEPEKRAKKTSNKGKKKPPAKEKKEPEKRAK 60
MEELQSPPEPSSTDITSNKGRKKPQAKEKKEPEKRAKKTSNKGKKKPPAKEKKEPEKRAK
Sbjct: 1 MEELQSPPEPSSTDITSNKGRKKPQAKEKKEPEKRAKKTSNKGKKKPPAKEKKEPEKRAK 60
Query: 61 KKTPVATTTAAAATTTSTSVNEHQRTDRLNDVLPKVKVSEFDPCVENHFRAMDAIVELCC 120
KKTPVATTTAAAATTTSTSVNEHQRTDRLNDVLPKVKVSEFDPCVENHFRAMDAIVELCC
Sbjct: 61 KKTPVATTTAAAATTTSTSVNEHQRTDRLNDVLPKVKVSEFDPCVENHFRAMDAIVELCC 120
Query: 121 EAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTIKFANDSTGPEGKDADITINLPQF 180
EAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTIKFANDSTGPEGKDADITINLPQF
Sbjct: 121 EAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTIKFANDSTGPEGKDADITINLPQF 180
Query: 181 SSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHGRTNSLIKCEFIAVSAHPP 240
SSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHGRTNSLIKCEFIAVSAHPP
Sbjct: 181 SSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHGRTNSLIKCEFIAVSAHPP 240
Query: 241 GSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDVGEPPSDLSSQPKKPRGRPPGRKKKEA 300
GSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDVGEPPSDLSSQPKKPRGRPPGRKKKEA
Sbjct: 241 GSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDVGEPPSDLSSQPKKPRGRPPGRKKKEA 300
Query: 301 SGLPSPPKRPRGRPKKEQKESTDKKGDNCQLVQEFSMENPVGSSSLLEIDGVPKNTENFV 360
SGLPSPPKRPRGRPKKEQKESTDKKGDNCQLVQEFSMENPVGSSSLLEIDGVPKNTENFV
Sbjct: 301 SGLPSPPKRPRGRPKKEQKESTDKKGDNCQLVQEFSMENPVGSSSLLEIDGVPKNTENFV 360
Query: 361 LLENNVERERSTLQEVSTCNSEDEVPAKKRRVRRKVKSRNLVDDVGVSSLTEYQEDGSIA 420
LLENNVERERSTLQEVSTCNSEDEVPAKKRRVRRKVKSRNLVDDVGVSSLTEYQEDGSIA
Sbjct: 361 LLENNVERERSTLQEVSTCNSEDEVPAKKRRVRRKVKSRNLVDDVGVSSLTEYQEDGSIA 420
Query: 421 NNHEADENVKSEYSGEDNLLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKVAW 480
NNHEADENVKSEYSGEDNLLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKVAW
Sbjct: 421 NNHEADENVKSEYSGEDNLLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKVAW 480
Query: 481 DLKWKPINACTDNCKHRMGYLAVLLGNGSLEVWEVPFPHAVKTIYSKFNGEGTDPRFVKL 540
DLKWKPINACTDNCKHRMGYLAVLLGNGSLEVWEVPFPHAVKTIYSKFNGEGTDPRFVKL
Sbjct: 481 DLKWKPINACTDNCKHRMGYLAVLLGNGSLEVWEVPFPHAVKTIYSKFNGEGTDPRFVKL 540
Query: 541 KPIFRCSRLRTANTQSIPLTVEWSLAPPYDYLLAGCHDGTVALWKFSANSSCEDTRPLLR 600
KPIFRCSRLRTANTQSIPLTVEWSLAPPYDYLLAGCHDGTVALWKFSANSSCEDTRPLLR
Sbjct: 541 KPIFRCSRLRTANTQSIPLTVEWSLAPPYDYLLAGCHDGTVALWKFSANSSCEDTRPLLR 600
Query: 601 FSADTVPIRAVAWAPSESNLESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLD 660
FSADTVPIRAVAWAPSESNLESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLD
Sbjct: 601 FSADTVPIRAVAWAPSESNLESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLD 660
Query: 661 WLPNPRCVFLSFDDGTLRLLS 682
WLPNPRCVFLSFDDGTLRLLS
Sbjct: 661 WLPNPRCVFLSFDDGTLRLLS 681
BLAST of IVF0020322 vs. ExPASy TrEMBL
Match:
A0A1S3B6M4 (uncharacterized protein LOC103486595 OS=Cucumis melo OX=3656 GN=LOC103486595 PE=4 SV=1)
HSP 1 Score: 1363.2 bits (3527), Expect = 0.0e+00
Identity = 678/681 (99.56%), Positives = 679/681 (99.71%), Query Frame = 0
Query: 1 MEELQSPPEPSSTDITSNKGRKKPQAKEKKEPEKRAKKTSNKGKKKPPAKEKKEPEKRAK 60
MEELQSPPEPSSTDITSNKGRKKPQAKEKKEPEKRAKKTSNKGKKKPPAKEKKEPEKRAK
Sbjct: 1 MEELQSPPEPSSTDITSNKGRKKPQAKEKKEPEKRAKKTSNKGKKKPPAKEKKEPEKRAK 60
Query: 61 KKTPVATTTAAAATTTSTSVNEHQRTDRLNDVLPKVKVSEFDPCVENHFRAMDAIVELCC 120
KKTPVATTTAAAATTTSTSVNEHQRTDRLNDVLPKVKVSEFDPCVENHFRAMDAIVELCC
Sbjct: 61 KKTPVATTTAAAATTTSTSVNEHQRTDRLNDVLPKVKVSEFDPCVENHFRAMDAIVELCC 120
Query: 121 EAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTIKFANDSTGPEGKDADITINLPQF 180
EAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTIKFANDSTGPEGKDADITINLPQF
Sbjct: 121 EAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTIKFANDSTGPEGKDADITINLPQF 180
Query: 181 SSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHGRTNSLIKCEFIAVSAHPP 240
SSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHGRTNSLIKCEFIAVSAHPP
Sbjct: 181 SSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHGRTNSLIKCEFIAVSAHPP 240
Query: 241 GSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDVGEPPSDLSSQPKKPRGRPPGRKKKEA 300
GSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDVGEPPSDLSSQPKKPRGRPPGRKKKEA
Sbjct: 241 GSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDVGEPPSDLSSQPKKPRGRPPGRKKKEA 300
Query: 301 SGLPSPPKRPRGRPKKEQKESTDKKGDNCQLVQEFSMENPVGSSSLLEIDGVPKNTENFV 360
SGLPSPPKRPRGRPKKEQKESTDKKGDNCQLVQEFSMENPVGSSSLLEIDGVPKNTENFV
Sbjct: 301 SGLPSPPKRPRGRPKKEQKESTDKKGDNCQLVQEFSMENPVGSSSLLEIDGVPKNTENFV 360
Query: 361 LLENNVERERSTLQEVSTCNSEDEVPAKKRRVRRKVKSRNLVDDVGVSSLTEYQEDGSIA 420
LLENNVERERSTLQEVSTCNSEDEVPAKKRRVRRKVKSRNLVDDVGVSSLTEYQEDGSIA
Sbjct: 361 LLENNVERERSTLQEVSTCNSEDEVPAKKRRVRRKVKSRNLVDDVGVSSLTEYQEDGSIA 420
Query: 421 NNHEADENVKSEYSGEDNLLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKVAW 480
NNHEADENVKSEYSGEDNLLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKVAW
Sbjct: 421 NNHEADENVKSEYSGEDNLLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKVAW 480
Query: 481 DLKWKPINACTDNCKHRMGYLAVLLGNGSLEVWEVPFPHAVKTIYSKFNGEGTDPRFVKL 540
DLKWKPINACTDNCKHRMGYLAVLLGNGSLEVWEVPFPHAVKTIYSKFNGEGTDPRFVKL
Sbjct: 481 DLKWKPINACTDNCKHRMGYLAVLLGNGSLEVWEVPFPHAVKTIYSKFNGEGTDPRFVKL 540
Query: 541 KPIFRCSRLRTANTQSIPLTVEWSLAPPYDYLLAGCHDGTVALWKFSANSSCEDTRPLLR 600
KPIFRCSRLRTANTQSIPLTVEWSLAPPYDYLLAGCHDGTVALWKFSANSSCEDTRPLLR
Sbjct: 541 KPIFRCSRLRTANTQSIPLTVEWSLAPPYDYLLAGCHDGTVALWKFSANSSCEDTRPLLR 600
Query: 601 FSADTVPIRAVAWAPSESNLESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLD 660
FSADTVPIRAVAWAPSESNLESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLD
Sbjct: 601 FSADTVPIRAVAWAPSESNLESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLD 660
Query: 661 WLPNPRCVFLSFDDGTLRLLS 682
WLPNPR + LSFDDGTLRLLS
Sbjct: 661 WLPNPRYILLSFDDGTLRLLS 681
BLAST of IVF0020322 vs. ExPASy TrEMBL
Match:
A0A0A0LGM2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G775290 PE=4 SV=1)
HSP 1 Score: 1272.7 bits (3292), Expect = 0.0e+00
Identity = 638/683 (93.41%), Positives = 654/683 (95.75%), Query Frame = 0
Query: 1 MEELQSPPEPSSTDITSNKG-RKKPQAKEKKEPEKRAKKTSNKGKKKPPAKEKKEPEKRA 60
MEELQSPPEPSSTDITSNKG +KKP AKEKKEPEKRAKKTSNKGKKKPPAKEKKE EKRA
Sbjct: 1 MEELQSPPEPSSTDITSNKGKKKKPPAKEKKEPEKRAKKTSNKGKKKPPAKEKKELEKRA 60
Query: 61 KKKTPVATTTAAAATTTSTSVNEHQRTDRLNDVLPKVKVSEFDPCVENHFRAMDAIVELC 120
KKKTPV T A TTST VN+HQ T RL+DV+P+VKVSEFDPCVENHFRAMDAIVELC
Sbjct: 61 KKKTPVTATVVTA--TTSTEVNKHQSTARLDDVVPEVKVSEFDPCVENHFRAMDAIVELC 120
Query: 121 CEAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTIKFANDSTGPEGKDADITINLPQ 180
CEAE+GDGGIDESDIQRFSSSTIFLREWRFYNYE KTIKFANDS GPEGKDADITI+LPQ
Sbjct: 121 CEAEDGDGGIDESDIQRFSSSTIFLREWRFYNYEPKTIKFANDSRGPEGKDADITIDLPQ 180
Query: 181 FSSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHGRTNSLIKCEFIAVSAHP 240
FSSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVH RTNSLIKCEFIAVSAHP
Sbjct: 181 FSSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHERTNSLIKCEFIAVSAHP 240
Query: 241 PGSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDVGEPPSDLSSQPKKPRGRPPGRKKKE 300
PGSSYHKMGIPLTGRGMVQIWCLVHGTE+YEPIDVGEPPSDLSSQPK+PRGRPPGRK+K
Sbjct: 241 PGSSYHKMGIPLTGRGMVQIWCLVHGTESYEPIDVGEPPSDLSSQPKRPRGRPPGRKEKG 300
Query: 301 ASGLPSPPKRPRGRPKKEQKESTD-KKGDNCQLVQEFSMENPVGSSSLLEIDGVPKNTEN 360
AS LPS PKRPRGRPKKEQKES D KKGDNCQLVQEFSMENPVGSS+LLEIDGVPKNTEN
Sbjct: 301 ASVLPSQPKRPRGRPKKEQKESNDKKKGDNCQLVQEFSMENPVGSSNLLEIDGVPKNTEN 360
Query: 361 FVLLENNVERERSTLQEVSTCNSEDEVPAKKRRVRRKVKSRNLVDDVGVSSLTEYQEDGS 420
FVLLENNVERE STLQEVSTC+SEDEVPAKKRRVRRKVK RNLVDDVGV SL EYQEDGS
Sbjct: 361 FVLLENNVERESSTLQEVSTCHSEDEVPAKKRRVRRKVKPRNLVDDVGVLSLAEYQEDGS 420
Query: 421 IANNHEADENVKSEYSGEDNLLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKV 480
IANNHEA+ENVKSEYSGEDNLLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKV
Sbjct: 421 IANNHEANENVKSEYSGEDNLLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKV 480
Query: 481 AWDLKWKPINACTDNCKHRMGYLAVLLGNGSLEVWEVPFPHAVKTIYSKFNGEGTDPRFV 540
AWDLKWKP+NACTDNCKHRMGYLAVLLGNGSLEVWEVPFPHAVK IYSKFNGEGTDPRF+
Sbjct: 481 AWDLKWKPMNACTDNCKHRMGYLAVLLGNGSLEVWEVPFPHAVKAIYSKFNGEGTDPRFM 540
Query: 541 KLKPIFRCSRLRTANTQSIPLTVEWSLAPPYDYLLAGCHDGTVALWKFSANSSCEDTRPL 600
KLKPIFRCSRLRT NTQSIPLTVEWS PPYDYLLAGCHDGTVALWKFSANSSCEDTRPL
Sbjct: 541 KLKPIFRCSRLRTTNTQSIPLTVEWSRTPPYDYLLAGCHDGTVALWKFSANSSCEDTRPL 600
Query: 601 LRFSADTVPIRAVAWAPSESNLESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYS 660
LRFSADTVPIRAVAWAPSES+LESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYS
Sbjct: 601 LRFSADTVPIRAVAWAPSESDLESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYS 660
Query: 661 LDWLPNPRCVFLSFDDGTLRLLS 682
LDWLPNPRCVFLSFDDGTLRLLS
Sbjct: 661 LDWLPNPRCVFLSFDDGTLRLLS 681
BLAST of IVF0020322 vs. ExPASy TrEMBL
Match:
A0A6J1F7U5 (uncharacterized protein LOC111441649 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111441649 PE=4 SV=1)
HSP 1 Score: 943.7 bits (2438), Expect = 4.2e-271
Identity = 485/644 (75.31%), Positives = 528/644 (81.99%), Query Frame = 0
Query: 42 KGKKKPPAKEKKEPEKRAKKKTPVATTTAAAATTTSTSVNEHQRTDRLNDVLPKVKVSEF 101
KGKKK + E EP+KRAKKK +TSVNE Q T RL+D +VKVSEF
Sbjct: 18 KGKKKSVSLE--EPQKRAKKK------------GGATSVNEVQPTGRLDD--SRVKVSEF 77
Query: 102 DPCVENHFRAMDAIVELCCEAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTIKFAN 161
D CVENHFRA+DAI EL EAE G+GG+DESD QRFSSST FLREW+FYNYE KT+KF +
Sbjct: 78 DHCVENHFRAIDAIAELYGEAENGEGGVDESDFQRFSSSTTFLREWKFYNYEPKTVKFTS 137
Query: 162 DSTGPEGKDADITINLPQFSSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVH 221
DS PEGKDADIT+ LPQFSSAAVLK GAPPGA+ SLDFRNF MHVGGPVWAIDWCP VH
Sbjct: 138 DSRVPEGKDADITMELPQFSSAAVLKNGAPPGATASLDFRNFIMHVGGPVWAIDWCPLVH 197
Query: 222 GRTNSLIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDVGE---PP 281
RT+SLIKCEFIAVSAHPPGSSYH MGIPL+GRGMVQIWCLVHGTE++E
Sbjct: 198 ERTDSLIKCEFIAVSAHPPGSSYHTMGIPLSGRGMVQIWCLVHGTESHESETTSATECKD 257
Query: 282 SDLSSQPKKPRGRPPGRKKKEASGLPSPPKRPRGRPKKEQKE-STDKKGDNCQLVQEFSM 341
SDL SQPK+PRGRPPGRKK AS LPS PKRPRGRPKK+Q+E + D K + QLVQ S+
Sbjct: 258 SDL-SQPKRPRGRPPGRKKNGASALPSQPKRPRGRPKKKQEEPNDDNKVASYQLVQPLSV 317
Query: 342 ENPVGSSSLLEIDGVPKNTENFVLLENNVERERSTLQEVSTCNSEDEVPAKKRRVRRKVK 401
E P SS+LLEID V N+E V LEN+VER ST++E+STCNSEDEVP +KRRVRR
Sbjct: 318 EYPDVSSNLLEIDDVSHNSEKPVSLENSVERGSSTIEEISTCNSEDEVPVQKRRVRRNAD 377
Query: 402 SRNLVDDVGVSSLTEYQEDGSIANNHEADENVKSEYSGEDNLLCKDISENVVLDASSIEF 461
++N VDDVG SL E +EDGS A NHEA+ENV SEYSGED LCK+ISE +LD S F
Sbjct: 378 TKNHVDDVGTLSLIENREDGSNATNHEANENVTSEYSGEDTRLCKNISEKAILDTGSTGF 437
Query: 462 SIPESVALPRVVLCLAHNGKVAWDLKWKPINACTDNCKHRMGYLAVLLGNGSLEVWEVPF 521
SIPE+VALPR+VLCLAHNGKVAWDLKWKP NA T CK RMGYLAVLLGNGSLEVWEVPF
Sbjct: 438 SIPETVALPRLVLCLAHNGKVAWDLKWKPTNARTTKCKQRMGYLAVLLGNGSLEVWEVPF 497
Query: 522 PHAVKTIYSKFNGEGTDPRFVKLKPIFRCSRLRTANTQSIPLTVEWSLAPPYDYLLAGCH 581
PH VK IYSK NGEGTDPRFVKLKP FRCS LR+A+TQSIPLTVEWS PPYDYLLAGCH
Sbjct: 498 PHVVKAIYSKLNGEGTDPRFVKLKPTFRCSMLRSADTQSIPLTVEWSPTPPYDYLLAGCH 557
Query: 582 DGTVALWKFSANSSCEDTRPLLRFSADTVPIRAVAWAPSESNLESANVILTAGHGGLKFW 641
DGTVALWKFSA+S+ EDTRPLLRFSADTVPIRAVAWAPSES ES NVIL A HGG+KFW
Sbjct: 558 DGTVALWKFSASSTAEDTRPLLRFSADTVPIRAVAWAPSESEPESENVILIASHGGIKFW 617
Query: 642 DLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLS 682
DLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLS
Sbjct: 618 DLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLS 644
BLAST of IVF0020322 vs. ExPASy TrEMBL
Match:
A0A6J1CU50 (uncharacterized protein LOC111014310 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111014310 PE=4 SV=1)
HSP 1 Score: 936.4 bits (2419), Expect = 6.7e-269
Identity = 481/669 (71.90%), Positives = 533/669 (79.67%), Query Frame = 0
Query: 28 EKKEPEKRAKKTSNKGK----KKPPAKEKKEPEKRAKKKTPVATTTAAAATTTSTSVNEH 87
E EP A TS GK K+P A++KKE RAKKK A S +E
Sbjct: 9 EVVEPPVPAASTSTGGKRGKRKQPVARQKKEAPGRAKKKPGGA------------SADEE 68
Query: 88 QRTDRLNDVLPKVKVSEFDPCVENHFRAMDAIVELCCEAEEGDGGIDESDIQRFSSSTIF 147
Q T RL+ V +KV EFD C ENHFRAMD I ELC EAE+GDGGIDESDIQRFSSS F
Sbjct: 69 QPTGRLDGV--GIKVLEFDHCAENHFRAMDTIAELCGEAEDGDGGIDESDIQRFSSSAFF 128
Query: 148 LREWRFYNYEAKTIKFANDSTGPEGKDADITINLPQFSSAAVLKKGAPPGASTSLDFRNF 207
LREWRFYNYE KT+KFA+D G EGKD DITINLPQFSSAAVLK G P GA+TSLD+RNF
Sbjct: 129 LREWRFYNYEPKTVKFASDLRGSEGKDGDITINLPQFSSAAVLKNGTPSGAATSLDWRNF 188
Query: 208 AMHVGGPVWAIDWCPQVHGRTNSLIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLV 267
M+VGGPVWA+DWCPQV +T++LIKCEFIAVSAHPPGSSYHKMG PL GRGMVQIWCLV
Sbjct: 189 VMYVGGPVWALDWCPQVLEKTDALIKCEFIAVSAHPPGSSYHKMGTPLIGRGMVQIWCLV 248
Query: 268 HGTENYEPIDV-----------GEPPSDLSSQPKKPRGRPPGRKKKEASGLPSPPKRPRG 327
HGTEN+EP E SDLSSQPK+PRGRPPG KKK AS LPS PKRPRG
Sbjct: 249 HGTENHEPEPAYATKCKSKPKKDEVSSDLSSQPKRPRGRPPGTKKKGASDLPSQPKRPRG 308
Query: 328 RPKKEQKESTDKKGDNCQLVQEFSMENPVGSSSLLEIDGVPKNTENFVLLENNVERERST 387
RPKK+Q+ S D GDN Q+VQ S+E P GSS+LLEIDG PKN+E +LL N+VER++ST
Sbjct: 309 RPKKKQEGSNDNMGDNNQIVQSLSVEYPAGSSNLLEIDGDPKNSEELLLLGNSVERQKST 368
Query: 388 LQEVSTCNSEDEVPAKKRRVRRKVKSRNLVDDVGVSSLTEYQEDGSIANNHEADENVKSE 447
LQ VSTCNS+DE PA+KRRVRRKV ++N +DD+G T +EDGS + + +ENV SE
Sbjct: 369 LQAVSTCNSKDEGPAQKRRVRRKVGTKNHIDDMGTLPFTVNREDGSSTISFQENENVISE 428
Query: 448 YSGEDNLLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKVAWDLKWKPINACTD 507
YSGED LLC +IS+N EFSIPESVALPRVVLCLAHNGKVAWDLKWKP NACT
Sbjct: 429 YSGEDTLLCNNISKNA-------EFSIPESVALPRVVLCLAHNGKVAWDLKWKPSNACTT 488
Query: 508 NCKHRMGYLAVLLGNGSLEVWEVPFPHAVKTIYSKFNGEGTDPRFVKLKPIFRCSRLRTA 567
NCKHRMGYLAVLLGNGSLEVWE+PFPH VK IYSKFN EGTDPRFVKLKPIFR + L++A
Sbjct: 489 NCKHRMGYLAVLLGNGSLEVWEIPFPHVVKAIYSKFNREGTDPRFVKLKPIFRSTMLKSA 548
Query: 568 NTQSIPLTVEWSLAPPYDYLLAGCHDGTVALWKFSANSSCEDTRPLLRFSADTVPIRAVA 627
N QSIPLTVEWS PPYDYL AGC+DGTVALWKFSANS+CEDTRPLLRFSADTVPIR VA
Sbjct: 549 NIQSIPLTVEWSSTPPYDYLFAGCNDGTVALWKFSANSTCEDTRPLLRFSADTVPIRRVA 608
Query: 628 WAPSESNLESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSF 682
WAP+ES+ ESANV+LTA HGGLKFWDLRDPFRPLWD+HPAPR+IYSLDWLP+PRCV LSF
Sbjct: 609 WAPNESDPESANVVLTASHGGLKFWDLRDPFRPLWDIHPAPRMIYSLDWLPDPRCVILSF 656
BLAST of IVF0020322 vs. NCBI nr
Match:
KAA0043896.1 (DNA binding protein, putative isoform 1 [Cucumis melo var. makuwa] >TYK25240.1 DNA binding protein, putative isoform 1 [Cucumis melo var. makuwa])
HSP 1 Score: 1363 bits (3527), Expect = 0.0
Identity = 681/681 (100.00%), Positives = 681/681 (100.00%), Query Frame = 0
Query: 1 MEELQSPPEPSSTDITSNKGRKKPQAKEKKEPEKRAKKTSNKGKKKPPAKEKKEPEKRAK 60
MEELQSPPEPSSTDITSNKGRKKPQAKEKKEPEKRAKKTSNKGKKKPPAKEKKEPEKRAK
Sbjct: 1 MEELQSPPEPSSTDITSNKGRKKPQAKEKKEPEKRAKKTSNKGKKKPPAKEKKEPEKRAK 60
Query: 61 KKTPVATTTAAAATTTSTSVNEHQRTDRLNDVLPKVKVSEFDPCVENHFRAMDAIVELCC 120
KKTPVATTTAAAATTTSTSVNEHQRTDRLNDVLPKVKVSEFDPCVENHFRAMDAIVELCC
Sbjct: 61 KKTPVATTTAAAATTTSTSVNEHQRTDRLNDVLPKVKVSEFDPCVENHFRAMDAIVELCC 120
Query: 121 EAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTIKFANDSTGPEGKDADITINLPQF 180
EAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTIKFANDSTGPEGKDADITINLPQF
Sbjct: 121 EAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTIKFANDSTGPEGKDADITINLPQF 180
Query: 181 SSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHGRTNSLIKCEFIAVSAHPP 240
SSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHGRTNSLIKCEFIAVSAHPP
Sbjct: 181 SSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHGRTNSLIKCEFIAVSAHPP 240
Query: 241 GSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDVGEPPSDLSSQPKKPRGRPPGRKKKEA 300
GSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDVGEPPSDLSSQPKKPRGRPPGRKKKEA
Sbjct: 241 GSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDVGEPPSDLSSQPKKPRGRPPGRKKKEA 300
Query: 301 SGLPSPPKRPRGRPKKEQKESTDKKGDNCQLVQEFSMENPVGSSSLLEIDGVPKNTENFV 360
SGLPSPPKRPRGRPKKEQKESTDKKGDNCQLVQEFSMENPVGSSSLLEIDGVPKNTENFV
Sbjct: 301 SGLPSPPKRPRGRPKKEQKESTDKKGDNCQLVQEFSMENPVGSSSLLEIDGVPKNTENFV 360
Query: 361 LLENNVERERSTLQEVSTCNSEDEVPAKKRRVRRKVKSRNLVDDVGVSSLTEYQEDGSIA 420
LLENNVERERSTLQEVSTCNSEDEVPAKKRRVRRKVKSRNLVDDVGVSSLTEYQEDGSIA
Sbjct: 361 LLENNVERERSTLQEVSTCNSEDEVPAKKRRVRRKVKSRNLVDDVGVSSLTEYQEDGSIA 420
Query: 421 NNHEADENVKSEYSGEDNLLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKVAW 480
NNHEADENVKSEYSGEDNLLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKVAW
Sbjct: 421 NNHEADENVKSEYSGEDNLLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKVAW 480
Query: 481 DLKWKPINACTDNCKHRMGYLAVLLGNGSLEVWEVPFPHAVKTIYSKFNGEGTDPRFVKL 540
DLKWKPINACTDNCKHRMGYLAVLLGNGSLEVWEVPFPHAVKTIYSKFNGEGTDPRFVKL
Sbjct: 481 DLKWKPINACTDNCKHRMGYLAVLLGNGSLEVWEVPFPHAVKTIYSKFNGEGTDPRFVKL 540
Query: 541 KPIFRCSRLRTANTQSIPLTVEWSLAPPYDYLLAGCHDGTVALWKFSANSSCEDTRPLLR 600
KPIFRCSRLRTANTQSIPLTVEWSLAPPYDYLLAGCHDGTVALWKFSANSSCEDTRPLLR
Sbjct: 541 KPIFRCSRLRTANTQSIPLTVEWSLAPPYDYLLAGCHDGTVALWKFSANSSCEDTRPLLR 600
Query: 601 FSADTVPIRAVAWAPSESNLESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLD 660
FSADTVPIRAVAWAPSESNLESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLD
Sbjct: 601 FSADTVPIRAVAWAPSESNLESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLD 660
Query: 661 WLPNPRCVFLSFDDGTLRLLS 681
WLPNPRCVFLSFDDGTLRLLS
Sbjct: 661 WLPNPRCVFLSFDDGTLRLLS 681
BLAST of IVF0020322 vs. NCBI nr
Match:
XP_008442823.1 (PREDICTED: uncharacterized protein LOC103486595 [Cucumis melo])
HSP 1 Score: 1356 bits (3509), Expect = 0.0
Identity = 678/681 (99.56%), Positives = 679/681 (99.71%), Query Frame = 0
Query: 1 MEELQSPPEPSSTDITSNKGRKKPQAKEKKEPEKRAKKTSNKGKKKPPAKEKKEPEKRAK 60
MEELQSPPEPSSTDITSNKGRKKPQAKEKKEPEKRAKKTSNKGKKKPPAKEKKEPEKRAK
Sbjct: 1 MEELQSPPEPSSTDITSNKGRKKPQAKEKKEPEKRAKKTSNKGKKKPPAKEKKEPEKRAK 60
Query: 61 KKTPVATTTAAAATTTSTSVNEHQRTDRLNDVLPKVKVSEFDPCVENHFRAMDAIVELCC 120
KKTPVATTTAAAATTTSTSVNEHQRTDRLNDVLPKVKVSEFDPCVENHFRAMDAIVELCC
Sbjct: 61 KKTPVATTTAAAATTTSTSVNEHQRTDRLNDVLPKVKVSEFDPCVENHFRAMDAIVELCC 120
Query: 121 EAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTIKFANDSTGPEGKDADITINLPQF 180
EAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTIKFANDSTGPEGKDADITINLPQF
Sbjct: 121 EAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTIKFANDSTGPEGKDADITINLPQF 180
Query: 181 SSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHGRTNSLIKCEFIAVSAHPP 240
SSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHGRTNSLIKCEFIAVSAHPP
Sbjct: 181 SSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHGRTNSLIKCEFIAVSAHPP 240
Query: 241 GSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDVGEPPSDLSSQPKKPRGRPPGRKKKEA 300
GSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDVGEPPSDLSSQPKKPRGRPPGRKKKEA
Sbjct: 241 GSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDVGEPPSDLSSQPKKPRGRPPGRKKKEA 300
Query: 301 SGLPSPPKRPRGRPKKEQKESTDKKGDNCQLVQEFSMENPVGSSSLLEIDGVPKNTENFV 360
SGLPSPPKRPRGRPKKEQKESTDKKGDNCQLVQEFSMENPVGSSSLLEIDGVPKNTENFV
Sbjct: 301 SGLPSPPKRPRGRPKKEQKESTDKKGDNCQLVQEFSMENPVGSSSLLEIDGVPKNTENFV 360
Query: 361 LLENNVERERSTLQEVSTCNSEDEVPAKKRRVRRKVKSRNLVDDVGVSSLTEYQEDGSIA 420
LLENNVERERSTLQEVSTCNSEDEVPAKKRRVRRKVKSRNLVDDVGVSSLTEYQEDGSIA
Sbjct: 361 LLENNVERERSTLQEVSTCNSEDEVPAKKRRVRRKVKSRNLVDDVGVSSLTEYQEDGSIA 420
Query: 421 NNHEADENVKSEYSGEDNLLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKVAW 480
NNHEADENVKSEYSGEDNLLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKVAW
Sbjct: 421 NNHEADENVKSEYSGEDNLLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKVAW 480
Query: 481 DLKWKPINACTDNCKHRMGYLAVLLGNGSLEVWEVPFPHAVKTIYSKFNGEGTDPRFVKL 540
DLKWKPINACTDNCKHRMGYLAVLLGNGSLEVWEVPFPHAVKTIYSKFNGEGTDPRFVKL
Sbjct: 481 DLKWKPINACTDNCKHRMGYLAVLLGNGSLEVWEVPFPHAVKTIYSKFNGEGTDPRFVKL 540
Query: 541 KPIFRCSRLRTANTQSIPLTVEWSLAPPYDYLLAGCHDGTVALWKFSANSSCEDTRPLLR 600
KPIFRCSRLRTANTQSIPLTVEWSLAPPYDYLLAGCHDGTVALWKFSANSSCEDTRPLLR
Sbjct: 541 KPIFRCSRLRTANTQSIPLTVEWSLAPPYDYLLAGCHDGTVALWKFSANSSCEDTRPLLR 600
Query: 601 FSADTVPIRAVAWAPSESNLESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLD 660
FSADTVPIRAVAWAPSESNLESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLD
Sbjct: 601 FSADTVPIRAVAWAPSESNLESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLD 660
Query: 661 WLPNPRCVFLSFDDGTLRLLS 681
WLPNPR + LSFDDGTLRLLS
Sbjct: 661 WLPNPRYILLSFDDGTLRLLS 681
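The Identity and Positives figures in the HSP header above can be recomputed from the alignment midline, the unlabeled line between each Query and Sbjct row. The following is a minimal sketch in Python, assuming the usual BLAST midline convention in which a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch. The function names `count_midline` and `percent` are hypothetical and not part of any BLAST tool, and the approach only works on rows where the midline's spacing has been preserved.

```python
def count_midline(query, midline):
    """Return (identities, positives, aligned_length) for one alignment row."""
    # Pad the midline so trailing blank (mismatch) columns are still counted.
    midline = midline.ljust(len(query))
    # Under the assumed convention, every letter in the midline is an identity.
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives are identities plus conservative substitutions marked '+'.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


def percent(part, whole):
    """Format a ratio as a percentage with two decimals, as in the HSP header."""
    return f"{100 * part / whole:.2f}"


# Final row of the XP_008442823.1 alignment shown above.
ident, pos, length = count_midline("WLPNPRCVFLSFDDGTLRLLS",
                                   "WLPNPR + LSFDDGTLRLLS")
print(ident, pos, length)                    # 18 19 21
print(percent(678, 681), percent(679, 681))  # 99.56 99.71
```

The final row has three non-identical columns, one of which is a `+`. This appears consistent with the header totals of 678/681 identities and 679/681 positives, since every earlier row in this alignment is fully identical.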
BLAST of IVF0020322 vs. NCBI nr
Match:
XP_004149225.3 (uncharacterized protein LOC101210135 isoform X1 [Cucumis sativus] >KAE8651086.1 hypothetical protein Csa_002356 [Cucumis sativus])
HSP 1 Score: 1255 bits (3247), Expect = 0.0
Identity = 640/714 (89.64%), Positives = 655/714 (91.74%), Query Frame = 0
Query: 1 MEELQSPPEPSSTDITSNKGRKK-PQAKEKKEPEKRAKKTSNKGKKKPPAKEKKEPEKRA 60
MEELQSPPEPSSTDITSNKG+KK P AKEKKEPEKRAKKTSNKGKKKPPAKEKKEPEKRA
Sbjct: 1 MEELQSPPEPSSTDITSNKGKKKKPPAKEKKEPEKRAKKTSNKGKKKPPAKEKKEPEKRA 60
Query: 61 KKKTPVATTTAAAATTTSTSVNEHQRTDRLNDVLPKVKVSEFDPCVENHFRAMDAIVELC 120
KKKTPV T A TTST VN+HQ T RL+DV+P+VKVSEFDPCVENHFRAMDAIVELC
Sbjct: 61 KKKTPVTATVVTA--TTSTEVNKHQSTARLDDVVPEVKVSEFDPCVENHFRAMDAIVELC 120
Query: 121 CEAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTIKFANDSTGPEGKDADITINLPQ 180
CEAE+GDGGIDESDIQRFSSSTIFLREWRFYNYE KTIKFANDS GPEGKDADITI+LPQ
Sbjct: 121 CEAEDGDGGIDESDIQRFSSSTIFLREWRFYNYEPKTIKFANDSRGPEGKDADITIDLPQ 180
Query: 181 FSSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHGRTNSLIKCEFIAVSAHP 240
FSSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVH RTNSLIKCEFIAVSAHP
Sbjct: 181 FSSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHERTNSLIKCEFIAVSAHP 240
Query: 241 PGSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDVGEPPSDLSSQPKKPRGRPPGRKKKE 300
PGSSYHKMGIPLTGRGMVQIWCLVHGTE+YEPIDVGEPPSDLSSQPK+PRGRPPGRK+K
Sbjct: 241 PGSSYHKMGIPLTGRGMVQIWCLVHGTESYEPIDVGEPPSDLSSQPKRPRGRPPGRKEKG 300
Query: 301 ASGLPSPPKRPRGRPKKEQKESTDKK-GDNCQLVQEFSMENPVGSSSLLEIDGVPKNTEN 360
AS LPS PKRPRGRPKKEQKES DKK GDNCQLVQEFSMENPVGSS+LLEIDGVPKNTEN
Sbjct: 301 ASVLPSQPKRPRGRPKKEQKESNDKKKGDNCQLVQEFSMENPVGSSNLLEIDGVPKNTEN 360
Query: 361 FVLLENNVERERSTLQEVSTC-------------------------------NSEDEVPA 420
FVLLENNVERE STLQEVSTC NSEDEVPA
Sbjct: 361 FVLLENNVERESSTLQEVSTCHSEDEVPAKKRRVRRKVKPRNLVDDVGVLSPNSEDEVPA 420
Query: 421 KKRRVRRKVKSRNLVDDVGVSSLTEYQEDGSIANNHEADENVKSEYSGEDNLLCKDISEN 480
KKRRVRRKVK RNLVDDVGV SL EYQEDGSIANNHEA+ENVKSEYSGEDNLLCKDISEN
Sbjct: 421 KKRRVRRKVKPRNLVDDVGVLSLAEYQEDGSIANNHEANENVKSEYSGEDNLLCKDISEN 480
Query: 481 VVLDASSIEFSIPESVALPRVVLCLAHNGKVAWDLKWKPINACTDNCKHRMGYLAVLLGN 540
VVLDASSIEFSIPESVALPRVVLCLAHNGKVAWDLKWKP+NACTDNCKHRMGYLAVLLGN
Sbjct: 481 VVLDASSIEFSIPESVALPRVVLCLAHNGKVAWDLKWKPMNACTDNCKHRMGYLAVLLGN 540
Query: 541 GSLEVWEVPFPHAVKTIYSKFNGEGTDPRFVKLKPIFRCSRLRTANTQSIPLTVEWSLAP 600
GSLEVWEVPFPHAVK IYSKFNGEGTDPRF+KLKPIFRCSRLRT NTQSIPLTVEWS P
Sbjct: 541 GSLEVWEVPFPHAVKAIYSKFNGEGTDPRFMKLKPIFRCSRLRTTNTQSIPLTVEWSRTP 600
Query: 601 PYDYLLAGCHDGTVALWKFSANSSCEDTRPLLRFSADTVPIRAVAWAPSESNLESANVIL 660
PYDYLLAGCHDGTVALWKFSANSSCEDTRPLLRFSADTVPIRAVAWAPSES+LESANVIL
Sbjct: 601 PYDYLLAGCHDGTVALWKFSANSSCEDTRPLLRFSADTVPIRAVAWAPSESDLESANVIL 660
Query: 661 TAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLS 681
TAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLS
Sbjct: 661 TAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLS 712
BLAST of IVF0020322 vs. NCBI nr
Match:
XP_038903194.1 (uncharacterized protein LOC120089853 [Benincasa hispida])
HSP 1 Score: 1084 bits (2803), Expect = 0.0
Identity = 544/643 (84.60%), Positives = 577/643 (89.74%), Query Frame = 0
Query: 39 TSNKGKKKPPAKEKKEPEKRAKKKTPVATTTAAAATTTSTSVNEHQRTDRLNDVLPKVKV 98
+S KGKKKPPA+EKK+ EK A+ K ATTT+TSVN+HQ T RL+ PKVKV
Sbjct: 15 SSKKGKKKPPAREKKKSEKTAQNK--------PGATTTTTSVNKHQPTGRLDG--PKVKV 74
Query: 99 SEFDPCVENHFRAMDAIVELCCEAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTIK 158
SEFD C+ENHF AMD IVELCCEAE DGGIDESDIQRF+SSTIFLREWRFYNYE K IK
Sbjct: 75 SEFDHCIENHFNAMDTIVELCCEAE--DGGIDESDIQRFASSTIFLREWRFYNYEPKFIK 134
Query: 159 FANDSTGPEGKDADITINLPQFSSAAVLKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCP 218
FA+DS GPEGKDADITI LPQFSSAAVLK GAPPGA+TSLDFRNFAMHVGGPVWA+DWCP
Sbjct: 135 FASDSRGPEGKDADITITLPQFSSAAVLKNGAPPGATTSLDFRNFAMHVGGPVWALDWCP 194
Query: 219 QVHGRTNSLIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDVGEPP 278
QVH RT+SLIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWC VHGTE+YEP +V EPP
Sbjct: 195 QVHERTDSLIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCFVHGTESYEPTNVEEPP 254
Query: 279 SDLSSQPKKPRGRPPGRKKKEASGLPSPPKRPRGRPKKEQKESTDKKGDNCQLVQEFSME 338
+DLSSQPK+PRGRP GRKK ASGLP PKRPRGRPKK+Q+ES DKKGD+C LVQ FS+E
Sbjct: 255 ADLSSQPKRPRGRPSGRKKNGASGLPPQPKRPRGRPKKKQEESNDKKGDSCPLVQAFSIE 314
Query: 339 NPVGSSSLLEIDGVPKNTENFVLLENNVERERSTLQEVSTCNSEDEVPAKKRRVRRKVKS 398
NPVGSS+LLE+DGVPKN+EN VLLEN+VERERSTLQEVSTCNSEDEVPA+KRRVRRK +
Sbjct: 315 NPVGSSNLLEMDGVPKNSENIVLLENSVERERSTLQEVSTCNSEDEVPAQKRRVRRKTEP 374
Query: 399 RNLVDDVGVSSLTEYQEDGSIANNHEADENVKSEYSGEDNLLCKDISENVVLDASSIEFS 458
+N V DVG+ SLTE +EDGS A + EA+ENV EYSGEDNLLCK+IS N VLD SSIEFS
Sbjct: 375 KNHVGDVGMLSLTENREDGSNAISLEANENVVCEYSGEDNLLCKNISGNAVLDTSSIEFS 434
Query: 459 IPESVALPRVVLCLAHNGKVAWDLKWKPINACTDNCKHRMGYLAVLLGNGSLEVWEVPFP 518
IPESVALPRVVLCLAHNGKVAWDLKWKP NA TDNCK RMGYLAVLLGNGSLEVWEVPFP
Sbjct: 435 IPESVALPRVVLCLAHNGKVAWDLKWKPTNASTDNCKLRMGYLAVLLGNGSLEVWEVPFP 494
Query: 519 HAVKTIYSKFNGEGTDPRFVKLKPIFRCSRLRTANTQSIPLTVEWSLAPPYDYLLAGCHD 578
HAVK IYSKFNGEGTDPRFVKLKPIFRCS LR ANTQSIPLTVEWS PPYDYLLAGCHD
Sbjct: 495 HAVKAIYSKFNGEGTDPRFVKLKPIFRCSMLRNANTQSIPLTVEWSQTPPYDYLLAGCHD 554
Query: 579 GTVALWKFSANSSCEDTRPLLRFSADTVPIRAVAWAPSESNLESANVILTAGHGGLKFWD 638
GTVALWKFSANSSCEDTRPLLRFSADTVPIRAVAWAPSES ESANVILTAGHGGLKFWD
Sbjct: 555 GTVALWKFSANSSCEDTRPLLRFSADTVPIRAVAWAPSESGSESANVILTAGHGGLKFWD 614
Query: 639 LRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLS 681
LRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLS
Sbjct: 615 LRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLS 645
BLAST of IVF0020322 vs. NCBI nr
Match:
XP_011652007.2 (uncharacterized protein LOC101210135 isoform X2 [Cucumis sativus])
HSP 1 Score: 957 bits (2474), Expect = 0.0
Identity = 474/528 (89.77%), Positives = 483/528 (91.48%), Query Frame = 0
Query: 186 LKKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHGRTNSLIKCEFIAVSAHPPGSSYH 245
L+KGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVH RTNSLIKCEFIAVSAHPPGSSYH
Sbjct: 15 LRKGAPPGASTSLDFRNFAMHVGGPVWAIDWCPQVHERTNSLIKCEFIAVSAHPPGSSYH 74
Query: 246 KMGIPLTGRGMVQIWCLVHGTENYEPIDVGEPPSDLSSQPKKPRGRPPGRKKKEASGLPS 305
KMGIPLTGRGMVQIWCLVHGTE+YEPIDVGEPPSDLSSQPK+PRGRPPGRK+K AS LPS
Sbjct: 75 KMGIPLTGRGMVQIWCLVHGTESYEPIDVGEPPSDLSSQPKRPRGRPPGRKEKGASVLPS 134
Query: 306 PPKRPRGRPKKEQKESTDKK-GDNCQLVQEFSMENPVGSSSLLEIDGVPKNTENFVLLEN 365
PKRPRGRPKKEQKES DKK GDNCQLVQEFSMENPVGSS+LLEIDGVPKNTENFVLLEN
Sbjct: 135 QPKRPRGRPKKEQKESNDKKKGDNCQLVQEFSMENPVGSSNLLEIDGVPKNTENFVLLEN 194
Query: 366 NVERERSTLQEVSTC-------------------------------NSEDEVPAKKRRVR 425
NVERE STLQEVSTC NSEDEVPAKKRRVR
Sbjct: 195 NVERESSTLQEVSTCHSEDEVPAKKRRVRRKVKPRNLVDDVGVLSPNSEDEVPAKKRRVR 254
Query: 426 RKVKSRNLVDDVGVSSLTEYQEDGSIANNHEADENVKSEYSGEDNLLCKDISENVVLDAS 485
RKVK RNLVDDVGV SL EYQEDGSIANNHEA+ENVKSEYSGEDNLLCKDISENVVLDAS
Sbjct: 255 RKVKPRNLVDDVGVLSLAEYQEDGSIANNHEANENVKSEYSGEDNLLCKDISENVVLDAS 314
Query: 486 SIEFSIPESVALPRVVLCLAHNGKVAWDLKWKPINACTDNCKHRMGYLAVLLGNGSLEVW 545
SIEFSIPESVALPRVVLCLAHNGKVAWDLKWKP+NACTDNCKHRMGYLAVLLGNGSLEVW
Sbjct: 315 SIEFSIPESVALPRVVLCLAHNGKVAWDLKWKPMNACTDNCKHRMGYLAVLLGNGSLEVW 374
Query: 546 EVPFPHAVKTIYSKFNGEGTDPRFVKLKPIFRCSRLRTANTQSIPLTVEWSLAPPYDYLL 605
EVPFPHAVK IYSKFNGEGTDPRF+KLKPIFRCSRLRT NTQSIPLTVEWS PPYDYLL
Sbjct: 375 EVPFPHAVKAIYSKFNGEGTDPRFMKLKPIFRCSRLRTTNTQSIPLTVEWSRTPPYDYLL 434
Query: 606 AGCHDGTVALWKFSANSSCEDTRPLLRFSADTVPIRAVAWAPSESNLESANVILTAGHGG 665
AGCHDGTVALWKFSANSSCEDTRPLLRFSADTVPIRAVAWAPSES+LESANVILTAGHGG
Sbjct: 435 AGCHDGTVALWKFSANSSCEDTRPLLRFSADTVPIRAVAWAPSESDLESANVILTAGHGG 494
Query: 666 LKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLS 681
LKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLS
Sbjct: 495 LKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLRLLS 542
BLAST of IVF0020322 vs. TAIR 10
Match:
AT1G19485.1 (Transducin/WD40 repeat-like superfamily protein )
HSP 1 Score: 510.8 bits (1314), Expect = 1.7e-144
Identity = 295/603 (48.92%), Positives = 366/603 (60.70%), Query Frame = 0
Query: 98 VSEFDPCVENHFRAMDAIVELCCEAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTI 157
+S FD E+H +A+++I +LC EA + IDE+DI SSS FLREWR YN+E K+
Sbjct: 8 ISLFDYSAESHLKAVESITDLCGEA---NADIDENDINILSSSVTFLREWRHYNFEPKSF 67
Query: 158 KFANDS-TGPEGKDADITINLPQFSSAAVLKKGAPPGASTSLD--FRNFAMHVGGPVWAI 217
F N++ + KD + + LPQFSSA K S+S ++F MHVGG VWA+
Sbjct: 68 AFYNEAEKNHQPKDIN-SQTLPQFSSARAPKVKIHDDESSSSGEISKDFVMHVGGSVWAM 127
Query: 218 DWCPQVHGRTNSLIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDV 277
+WCP+VHG ++ KCEF+AV+ HPP S HK+GIPL GRG++QIWC+++ T + V
Sbjct: 128 EWCPRVHGNPDAQAKCEFLAVATHPPDSYSHKIGIPLIGRGIIQIWCIINATCKKDSGQV 187
Query: 278 GEPPSDL--------------SSQPKKPRGRPPGRKKKEASGLPSPPKRPRGRP-KKEQK 337
+ L +++PKKPRGRP RK + + PK+PRGRP KK
Sbjct: 188 SDKGKKLTGKSRKQPSGETTETTEPKKPRGRP--RKHPVET---TEPKKPRGRPRKKSTA 247
Query: 338 ESTDKKGDNCQLVQEFSMENPVGSSSLLEIDGVP-KNTENFVLLENNVERERSTLQEVST 397
E + D+ V+ S+ P S + P + + E V E S Q +S+
Sbjct: 248 ELPVELDDDVLYVEALSVRYPENS----VVPATPLRILRETPVTETKVNNEGSG-QVLSS 307
Query: 398 CNSEDEVPAKKRRVRRKVKSRNLVDDVGVSSLTEYQEDGSIANNHEADENVKSEYSGEDN 457
N+ ++P RR R+K KS TE I EA NV S+ S
Sbjct: 308 DNANIKLPV--RRKRQKTKS------------TEESCTPMILEYSEAVGNVPSKPS---- 367
Query: 458 LLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKVAWDLKWKPINACTDNCKHRM 517
ISE++ VALPRVVLCLAHNGKV WD+KW+P A KH M
Sbjct: 368 ---SGISEDI--------------VALPRVVLCLAHNGKVVWDMKWRPSYAGDSLNKHSM 427
Query: 518 GYLAVLLGNGSLEVWEVPFPHAVKTIYSKFNGEGTDPRFVKLKPIFRCSRLRTANTQSIP 577
GYLAVLLGNGSLEVW+VP P A +Y TDPRFVKL P+F+CS L+ +T+SIP
Sbjct: 428 GYLAVLLGNGSLEVWDVPMPKATSALYLSSKKAATDPRFVKLAPVFKCSNLKCGDTKSIP 487
Query: 578 LTVEWSLAPPYDYLLAGCHDGTVALWKFSANSSCEDTRPLLRFSADTVPIRAVAWAPSES 637
LTVEWS D+LLAGCHDGTVALWKFS S EDTRPLL FSADT PIRAVAWAP ES
Sbjct: 488 LTVEWSTLGNPDFLLAGCHDGTVALWKFSTTKSSEDTRPLLFFSADTAPIRAVAWAPGES 547
Query: 638 NLESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLR 682
+ ESAN++ TAGH GLKFWDLRDPFRPLWDLHP PR IYSLDWL +P CV LSFDDGTLR
Sbjct: 548 DQESANIVATAGHAGLKFWDLRDPFRPLWDLHPVPRFIYSLDWLQDPSCVLLSFDDGTLR 561
BLAST of IVF0020322 vs. TAIR 10
Match:
AT1G19485.2 (Transducin/WD40 repeat-like superfamily protein )
HSP 1 Score: 510.8 bits (1314), Expect = 1.7e-144
Identity = 295/603 (48.92%), Positives = 366/603 (60.70%), Query Frame = 0
Query: 98 VSEFDPCVENHFRAMDAIVELCCEAEEGDGGIDESDIQRFSSSTIFLREWRFYNYEAKTI 157
+S FD E+H +A+++I +LC EA + IDE+DI SSS FLREWR YN+E K+
Sbjct: 8 ISLFDYSAESHLKAVESITDLCGEA---NADIDENDINILSSSVTFLREWRHYNFEPKSF 67
Query: 158 KFANDS-TGPEGKDADITINLPQFSSAAVLKKGAPPGASTSLD--FRNFAMHVGGPVWAI 217
F N++ + KD + + LPQFSSA K S+S ++F MHVGG VWA+
Sbjct: 68 AFYNEAEKNHQPKDIN-SQTLPQFSSARAPKVKIHDDESSSSGEISKDFVMHVGGSVWAM 127
Query: 218 DWCPQVHGRTNSLIKCEFIAVSAHPPGSSYHKMGIPLTGRGMVQIWCLVHGTENYEPIDV 277
+WCP+VHG ++ KCEF+AV+ HPP S HK+GIPL GRG++QIWC+++ T + V
Sbjct: 128 EWCPRVHGNPDAQAKCEFLAVATHPPDSYSHKIGIPLIGRGIIQIWCIINATCKKDSGQV 187
Query: 278 GEPPSDL--------------SSQPKKPRGRPPGRKKKEASGLPSPPKRPRGRP-KKEQK 337
+ L +++PKKPRGRP RK + + PK+PRGRP KK
Sbjct: 188 SDKGKKLTGKSRKQPSGETTETTEPKKPRGRP--RKHPVET---TEPKKPRGRPRKKSTA 247
Query: 338 ESTDKKGDNCQLVQEFSMENPVGSSSLLEIDGVP-KNTENFVLLENNVERERSTLQEVST 397
E + D+ V+ S+ P S + P + + E V E S Q +S+
Sbjct: 248 ELPVELDDDVLYVEALSVRYPENS----VVPATPLRILRETPVTETKVNNEGSG-QVLSS 307
Query: 398 CNSEDEVPAKKRRVRRKVKSRNLVDDVGVSSLTEYQEDGSIANNHEADENVKSEYSGEDN 457
N+ ++P RR R+K KS TE I EA NV S+ S
Sbjct: 308 DNANIKLPV--RRKRQKTKS------------TEESCTPMILEYSEAVGNVPSKPS---- 367
Query: 458 LLCKDISENVVLDASSIEFSIPESVALPRVVLCLAHNGKVAWDLKWKPINACTDNCKHRM 517
ISE++ VALPRVVLCLAHNGKV WD+KW+P A KH M
Sbjct: 368 ---SGISEDI--------------VALPRVVLCLAHNGKVVWDMKWRPSYAGDSLNKHSM 427
Query: 518 GYLAVLLGNGSLEVWEVPFPHAVKTIYSKFNGEGTDPRFVKLKPIFRCSRLRTANTQSIP 577
GYLAVLLGNGSLEVW+VP P A +Y TDPRFVKL P+F+CS L+ +T+SIP
Sbjct: 428 GYLAVLLGNGSLEVWDVPMPKATSALYLSSKKAATDPRFVKLAPVFKCSNLKCGDTKSIP 487
Query: 578 LTVEWSLAPPYDYLLAGCHDGTVALWKFSANSSCEDTRPLLRFSADTVPIRAVAWAPSES 637
LTVEWS D+LLAGCHDGTVALWKFS S EDTRPLL FSADT PIRAVAWAP ES
Sbjct: 488 LTVEWSTLGNPDFLLAGCHDGTVALWKFSTTKSSEDTRPLLFFSADTAPIRAVAWAPGES 547
Query: 638 NLESANVILTAGHGGLKFWDLRDPFRPLWDLHPAPRIIYSLDWLPNPRCVFLSFDDGTLR 682
+ ESAN++ TAGH GLKFWDLRDPFRPLWDLHP PR IYSLDWL +P CV LSFDDGTLR
Sbjct: 548 DQESANIVATAGHAGLKFWDLRDPFRPLWDLHPVPRFIYSLDWLQDPSCVLLSFDDGTLR 561
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q8BL74 | 7.9e-09 | 23.16 | General transcription factor 3C polypeptide 2 OS=Mus musculus OX=10090 GN=Gtf3c2...
Match Name | E-value | Identity | Description
A0A5D3DPQ1 | 0.0e+00 | 100.00 | DNA binding protein, putative isoform 1 OS=Cucumis melo var. makuwa OX=1194695 G...
A0A1S3B6M4 | 0.0e+00 | 99.56 | uncharacterized protein LOC103486595 OS=Cucumis melo OX=3656 GN=LOC103486595 PE=...
A0A0A0LGM2 | 0.0e+00 | 93.41 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G775290 PE=4 SV=1
A0A6J1F7U5 | 4.2e-271 | 75.31 | uncharacterized protein LOC111441649 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1CU50 | 6.7e-269 | 71.90 | uncharacterized protein LOC111014310 isoform X1 OS=Momordica charantia OX=3673 G...
Match Name | E-value | Identity | Description
KAA0043896.1 | 0.0 | 100.00 | DNA binding protein, putative isoform 1 [Cucumis melo var. makuwa] >TYK25240.1 D...
XP_008442823.1 | 0.0 | 99.56 | PREDICTED: uncharacterized protein LOC103486595 [Cucumis melo]
XP_004149225.3 | 0.0 | 89.64 | uncharacterized protein LOC101210135 isoform X1 [Cucumis sativus] >KAE8651086.1 ...
XP_038903194.1 | 0.0 | 84.60 | uncharacterized protein LOC120089853 [Benincasa hispida]
XP_011652007.2 | 0.0 | 89.77 | uncharacterized protein LOC101210135 isoform X2 [Cucumis sativus]
Match Name | E-value | Identity | Description
AT1G19485.1 | 1.7e-144 | 48.92 | Transducin/WD40 repeat-like superfamily protein
AT1G19485.2 | 1.7e-144 | 48.92 | Transducin/WD40 repeat-like superfamily protein