Cp4.1LG07g08040 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG07g08040
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Splicing factor 3B subunit 2
Location: Cp4.1LG07 : 7216696 .. 7222950 (+)
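
The location string above can be read as a sequence ID, a start, an end, and a strand. The following is a minimal Python sketch of that reading, not the database's own tooling. It assumes the coordinates are 1-based and inclusive, which the record does not state, and parse_location is a hypothetical helper name.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cp4.1LG07 : 7216696 .. 7222950 (+)")

# Assumption: both endpoints are included, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)  # span is 6255 under the inclusive assumption
```

If the coordinates were instead 0-based or half-open, the span would differ by one, so the convention should be confirmed against the database documentation before relying on the number.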
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
GACATAATTTCGGGAATCCCTTTCTTCTTCTTCTCTTCTTCTCTTCTTCGTCTGTTCTTCGTTCTCATAGTCGTGTCCGCGGTTTGGTTTCTGATTGATTTGTGGGAAGCTTGAATCCAACATGACGGCGGAGGTGATTTCTCAGCCGAATGGAGTTGTCGCGAATGGTGGTAACTTGGACCTTAATTCTAACCCTAAATCTGGCGCCGCCAAGAAATCGCGGGAAAGTGAACGACGTCGTCGTCGGCGAAAGCAGAAGAAGAACCAGAAGGCTTCTAAGGTGAAGGAGGCTGCTTCTGGAGAGGATAGTGATGCTTCTGGCGATAACACAAAGGAGAATGATGATCTACTCCAGGTTTTTGGTTTTTGGTTTTTTATTTTTTATTTTTTTTTACGTTGAGGACATGTTTGTCGGTTTGTGGTTTTAGGGATACATTTTCTCAATGCTTCGGTGTTGTTGGGTAACAGTAGTTTTTCGTACGAAGATTGAACTAGATAGGTAGAATGTGTGTGTTGTCATGTGGGAAGGAGCTTGATGTCTAAGAAATCTATCATTTTCAACCATTATAAGGGGATTTAAGCTGAAATTAAGATGATACCCTCTGTGAGTTAGTAAGCAAGTAACACCAAGAAATCTATCATTTTCAACCATTAGAAGGGGATTTAAGTTGAAATTAAGACGATACCCTCTGTGAGTTAGTATAGTAAGCAAGTAGCACCAAGAAATCTATCATTTTCAACCATTAGAAGGGGATTTAAGTTGATATTTCTGCCTGGAATTTGTTTAAGAACAATTAAATTAACTTAATTTTGATTGTTGGATAGCAAAAACATACATGGGTCTATTCCCTCTTCTTCTTCTTTGCTTCATTTTTTTTTGTAAGAAACTAAATTTTCATTTGAGGAAAAAATGAAAAAAATACAAGGGTATAAAGAAAACGAAGCCCACAAACGAACTCGATCAAAAGAAAAGGGTTCCAATAAAGTAAAATGAGGCCAAACGAATAGTTACAAAAGACCTTAGAAACCGATGCCTAAAGGGAAACATAGAACCTGATAAAGGTCCGAGCATCACTATGGAATCTTTTGATTCCTCTGAAGATTTTATTATTCTTCTCCCCTCACAAATCCCATAACAGAGCACAAATCCTTGCTTCATTTCATTTAGTTAAATGATTTTTTTAATGTTAACTAACCATTCATAACGCATTGTTTGCTTTTGAAATTATAGGTTGTTGAGAAAGTAGAAATTGAATATGTACCTGAGAAGGCTGAATTAGATGATAGCTTGGACGAAGAATTTAGAAGAGTTTTTGAGAAATTCAATTTCAGTGATGTAGTTGGTGTTGAGGTAAGTATAAGCTTGCCAGAATAATTTTAGTTATCAATTCAATTTCTTCTGTGTCCAATCCTCTGAACCCTCTGGTTGATATTTCTACATAGGAGAATGAAAACAAAGATGAGTCTGCCCAAAATGCAGCATCTAAGAAGTCGGACTCAGATTCTGATGATGAAGAACTTGATAACCAGCAAAAGGAAAAAGGAGGCTTATCAAACAAGAAAAAGAAGGTAATAGGCTATTATCTTATACATATAAATAACGCATTGGTTACTACTTGCCACAATGATCTGAAAGTATATTGTTTTTGTCATTTTTCAGTTGCAACGGCGTATGAAGATTGCAGAACTGAAGCAGATTTGTTTGAGACCCGATGTTGTTGAGGTAGAAATGGACGCGTTGCACTTTTCTTTTACCTTAAAAAATCATTATAAAAACCTGACACCAGAGTGTCTGGCTACATTTTCTAGTTATGGCATTCTTATGTGCTAATTTATTTACTGTTCAGATATGGGATGCAACTGCAGCTGATCCCAAGTTACTTGTTTATCTAAAATCTTATCGCAACACAGTTCCTGTGCCAAGGCATTGGTGTCAGAAAAGGAAATTTTTACAGGTCTGTAATTCCGCTAATATTAAATTTACAGAATATACGTCTATG
CACAATTGATGGGTATGCTAAACACGAGGGAAATGAGAGAGTTGAGTGAGGGAAGATCGTGTTGATAAGGTTTCTCTTATGTATACATATCTGGTTCTTGCAGGGGTGTGGCTGTGACACTGTTGGAGCCGGAACTTGTAGGTGTTTGCTAAGAGCACCCTTGTATTATTTTTGTTTTAGTATATTTGGTGGTATGGGTTGGGATATGAACATTTTTATGTTGCATTGATGGGCGGGTTTACTTGTGTATTGCATGATTCTTTTTTCTTATATATATCTACATCAATTCAATAATCAGTCTGCTTTGAAGTTTTAGATACCTTTCTGTTATTAGTGACCCTGCATTCTATAATTCTAGTGATGAAATAATATCTTAGTTTAGATTGTATCTTATTGGTAGTTGGATTTATTTTGTAGGGGAAGCGTGGTATTGAAAAACAGCCATTCCAACTTCCAGATTTTATTGCAGCAACAGGAATTGAGAAGATTAGACAGGTTGCTTTCACTACAATTTCTAGTTCTTGTTATGCTGGAAAAATCTTGTAGTCAGTCTCAATATATTTCTATACTCATGTTAGACCCAATGTGATGCCTGACACTGCCAATTCTCTATTATATTTGTTATGCAGGCTTACATAGAAAAAGAGGATAGTAAGAAGTTGAAGCAAAAGCAAAGAGAACGAATGCAGCCAAAGATGGGAAAAATGGATATTGATTATCAGGTTTTTACTGCCTTCTGATTTTGATTTGATGTATATTTCAATACTCGAGTTCTGCTGCTAGGTGTCTGTGTTGTTGTGTCCAGAACGTGAACATTCTAAGTTATTGAATTGGCTCTTGTGCATTAGGTTCTTCATGATGCTTTTTTCAAGTACCAAACGAAGCCAAAGCTGACAACACTTGGAGATCTGTACTATGAAGGGAAAGAATTCGAGGTATTGGGTTGTTACTATTATTTATCAACGATGCTCTTTGGTCTCTAGGACCAGTGCATCTGGGTGTTTTTCATGCATTAAGATATTGTTTGGGCCAATGTGCATTTTAAGTTTATAAAGAAGAAGCTGGTACAGAAGTAGCTATTTGTTCAGATACTGAGTAGTTAATCCTAATGGAGATTTCTTCCTTCCTCACTTATTTTTTCCTCGGTGTACAGGTTAAGTTAAGGGAGATGAAACCCGGCATGCTATCACAAGAACTGAAAGAAGCACTTGGTATGCCAGAGGGCGCTCCTCCCCCATGGCTCATTAACATGCAGGTTGGACAGTTCAATTTATCGTCAAACTCAAATCTGATCTCATGTTTTCTTAAAGCTGTTTATTCGCGATCTTATAAATTATATTTTCCTTTCATAGAGATACGGTCCTCCACCATCCTACCCAGATCTAAAAATTCCAGGACTCAATGCTCCCATTCCACCTGGAGCTAGCTTTGGTTACCATCCTGGTGGCTGGGGAAAGCCTCCCGTCGATGAAGTAAGTATTTGACTGTCATATTCTATCTTCAGGTTAAGGTCTGTCTTCTAAATCTTACCATTGTTCTCCGTTTTTATGAATAGTATGGCCGTCCACTGTATGGAGATGTGTTTGGTGTTCAGCAGCAGGAGCAAGCTAACTACGAGGTATCTTATTTATTTGTCTAACAACAGATTGTTTATTGATTTAGTGAAGGATAGATACATGAAAAGAAAAAACTAAAATTCCCAACCTTTTAAGTTCTTATAAGACTTCCTCAAACTATAGGAAAGGGGTAGAATTACATAAGTAGGTGGTGAACACCAGTCTGAAGACAGAATTTATTGAATCCAATATTTTCTCTTAGTCTCTCTACGTTTGAAATGCTTCTTTGCTGATTTCTTCTTTTTTGTCTCGAATGTGATTTTTTTTTTTNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTATGGCAACACCTTCTGTGCCGCACTCATATTTTAAGACATTCTGTTATACTAAGTTCTTAAAAATAGATT
AGTGTGATTCTCTCAACTTCCACTTTTGAGATTTTTGTATGTCTGTTACTTGATTGTTTGATGCCTTGTATAGGAGGAACCTGTTGATAAGACCAAGCATTGGGGTGATTTGGAGGAAGAGGAAGAGGAAGAGGTGGAGGAGGAGGATGAAGAGGAAATTGAAGAAGAGGAAATGCAAGATGGCATTGAATCTGTGGATAGTCTATCGAGGTATGCCATCATTGCCATTTTATCTCAAGCTCAGTTTTAAAATCTCTTTCTTGGTGTAATCATCTTGAATGTAATTATATGAGAAAATGAAATATTCTTGTTGGAATTCTGCAAAGGGATAAAAAGTAAAATATTCTTGAATGTTCAAATTTCCAAGGATGGGCACGATAAAGAAAAGTTAAAGATATTACATGTTATTTCCACAAGATCAAACATATGTGAGATCTTGGATTAAGACACTTCAATTAATATGAATGCTTTTGATATTTTTGAATGCTTTTGATATTTTTGAATGCTTTGGTCAATTTCATATACTGACTAGAATAGATTGAAGTTAAACAATTAAAACAGTATAACAAATAGATTGAATGCGTATTTAGCCTTGTCTTAAATGCGGCTGTGGTGAGACATGCCTTTGTGGTTGTGCGTCCCAGACCTTTTGGATATCCAAACCTGATCTTTGGAGTTTGTCTATGCTTTCCAGTTATTCTCCATTCTCCGGATTGATTCAGTAACGAGTTCCAATATTTTGCTTATACCGTATACTGGACGTATGCGATTAAGCCTTTAAGCAGTTCAATAATTGACTTCTCTGTTTCATGCAGCACTCCTACTGGTGTGGAGACACCAGATGTTATTGACCTTCGGAAACAACAGAGGAAGGAGCCGGACAGGCCTCTTTACCAAGTAAGTAAATAATCTGTGAATGCGACACTTCATCAGAACTTGCCTTCTCTCTATTATTCATTTCAAACTCAAATGTGATTTACTGTTCGGTTTCTTGAAGGTTCTCGAAGAAAAAGAAGAGAAGGTTGCTCCTGGGACATTGCTTGGAACTACACATACGTATGCAACTTATTCGCTTCATCTTTTTATGATAGCCATCAACTTTTCTTTCTTCTTGTACTCAAATTGACACTTTGATGTGCATGTCTGTTCAGTTATGTTATTAGTGGCGGTACTCAAGATAAGACGGGAGCCAAAAGGGTAAGAAACTTTTTAATGGTTCGTGGTATAATATGAACTCGGTGATTATTTGTCGACAATTAGTAATAATCTTCCTTTTCATGGCTATTTATCTTCCAGGTTGATTTGCTTAGAGGTCAAAAATCTGATAAAGTGGACGTTACTTTACGACCCGAAGAATTGGAAGCTATGGAAAATGTTCTACCTGCGAAGTATGTGATATCCACTTAGATGTGTAACTAACTAATTCGTGTCATAATGGAGCTCCTAATTATCTTTTTTCTCCTCTCAGATACGAGGAAGCTAGGGAAGAGGAGAAGTTGCGGAGTCAAAGGGAGGACTTCAGCGACATGGTTGCGGAGGTGTGATTCTTTTGTTAAACGAATCGACCGATTCGTTCTTGATACTCAACATATTAACAATTAGAACATTTTTTCTTGCAGAATGAGAAGAAAAGAAAACGTAAGATGCAGGAGAAGGATGGAAAATCTAAGAAGAAAGATTTCAAGTTCTAGGCCATTTTTTGTTTCGGTAATCGAACCACTCTATCGACTACTACTACCGCCATACCGAGCTGGCTTTTGCTTTCGTTGGAGATGTCGATCTTGGTAGTATCAAGTAATGTGATATCCTATAAAACTTCCAACACTGGCTCTTCCTAAAACCACCCAGATCCCTCTTTGCAGGGGTACTCTACATAGATAGTCGCATATACAGACTCTTCAATGGAAGTAGTTTGTCATATTAGTAGGTTCTGTATTCTTCGTTCTTTTTCGATTCCCGAGGTTCGAAAGGGCTGCTGTGCTATGACTAAAATGAATAGT
AGCTACTATAGATTTATCTACAAATGAGTTTTATGTTGTTTTGTCACGGCGGCTCTGTTGTACTTAAAACACCAATTAAACTTTGACTTGTTTAAGTGGAAGTTAATTGTTCCAAATTGATTCAATTTTCAGCTGATTAGTCTTGTGCCCCATCACGAAGCAAAGATTCGGATTACCTAGGATTCGGATTAACCTGTTGATCGAATATCTCAAGACAAAAATACTTGTTTAAGATTTGAATCACTCAACAAACAA

mRNA sequence

GACATAATTTCGGGAATCCCTTTCTTCTTCTTCTCTTCTTCTCTTCTTCGTCTGTTCTTCGTTCTCATAGTCGTGTCCGCGGTTTGGTTTCTGATTGATTTGTGGGAAGCTTGAATCCAACATGACGGCGGAGGTGATTTCTCAGCCGAATGGAGTTGTCGCGAATGGTGGTAACTTGGACCTTAATTCTAACCCTAAATCTGGCGCCGCCAAGAAATCGCGGGAAAGTGAACGACGTCGTCGTCGGCGAAAGCAGAAGAAGAACCAGAAGGCTTCTAAGGTGAAGGAGGCTGCTTCTGGAGAGGATAGTGATGCTTCTGGCGATAACACAAAGGAGAATGATGATCTACTCCAGGTTGTTGAGAAAGTAGAAATTGAATATGTACCTGAGAAGGCTGAATTAGATGATAGCTTGGACGAAGAATTTAGAAGAGTTTTTGAGAAATTCAATTTCAGTGATGTAGTTGGTGTTGAGGAGAATGAAAACAAAGATGAGTCTGCCCAAAATGCAGCATCTAAGAAGTCGGACTCAGATTCTGATGATGAAGAACTTGATAACCAGCAAAAGGAAAAAGGAGGCTTATCAAACAAGAAAAAGAAGTTGCAACGGCGTATGAAGATTGCAGAACTGAAGCAGATTTGTTTGAGACCCGATGTTGTTGAGATATGGGATGCAACTGCAGCTGATCCCAAGTTACTTGTTTATCTAAAATCTTATCGCAACACAGTTCCTGTGCCAAGGCATTGGTGTCAGAAAAGGAAATTTTTACAGGGGAAGCGTGGTATTGAAAAACAGCCATTCCAACTTCCAGATTTTATTGCAGCAACAGGAATTGAGAAGATTAGACAGGCTTACATAGAAAAAGAGGATAGTAAGAAGTTGAAGCAAAAGCAAAGAGAACGAATGCAGCCAAAGATGGGAAAAATGGATATTGATTATCAGGTTCTTCATGATGCTTTTTTCAAGTACCAAACGAAGCCAAAGCTGACAACACTTGGAGATCTGTACTATGAAGGGAAAGAATTCGAGGTTAAGTTAAGGGAGATGAAACCCGGCATGCTATCACAAGAACTGAAAGAAGCACTTGGTATGCCAGAGGGCGCTCCTCCCCCATGGCTCATTAACATGCAGAGATACGGTCCTCCACCATCCTACCCAGATCTAAAAATTCCAGGACTCAATGCTCCCATTCCACCTGGAGCTAGCTTTGGTTACCATCCTGGTGGCTGGGGAAAGCCTCCCGTCGATGAATATGGCCGTCCACTGTATGGAGATGTGTTTGGTGTTCAGCAGCAGGAGCAAGCTAACTACGAGGAGGAACCTGTTGATAAGACCAAGCATTGGGGTGATTTGGAGGAAGAGGAAGAGGAAGAGGTGGAGGAGGAGGATGAAGAGGAAATTGAAGAAGAGGAAATGCAAGATGGCATTGAATCTGTGGATAGTCTATCGAGCACTCCTACTGGTGTGGAGACACCAGATGTTATTGACCTTCGGAAACAACAGAGGAAGGAGCCGGACAGGCCTCTTTACCAAGTTCTCGAAGAAAAAGAAGAGAAGGTTGCTCCTGGGACATTGCTTGGAACTACACATACTTATGTTATTAGTGGCGGTACTCAAGATAAGACGGGAGCCAAAAGGGTTGATTTGCTTAGAGGTCAAAAATCTGATAAAGTGGACGTTACTTTACGACCCGAAGAATTGGAAGCTATGGAAAATGTTCTACCTGCGAAATACGAGGAAGCTAGGGAAGAGGAGAAGTTGCGGAGTCAAAGGGAGGACTTCAGCGACATGGTTGCGGAGAATGAGAAGAAAAGAAAACGTAAGATGCAGGAGAAGGATGGAAAATCTAAGAAGAAAGATTTCAAGTTCTAGGCCATTTTTTGTTTCGGTAATCGAACCACTCTATCGACTACTACTACCGCCATACCGAGCTGGCTTTTGCTTTCGTTGGAGATGTCGATCTTGGTAGTATCAAGTAATGTGATATCCTATAAAACTT
CCAACACTGGCTCTTCCTAAAACCACCCAGATCCCTCTTTGCAGGGGTACTCTACATAGATAGTCGCATATACAGACTCTTCAATGGAAGTAGTTTGTCATATTAGTAGGTTCTGTATTCTTCGTTCTTTTTCGATTCCCGAGGTTCGAAAGGGCTGCTGTGCTATGACTAAAATGAATAGTAGCTACTATAGATTTATCTACAAATGAGTTTTATGTTGTTTTGTCACGGCGGCTCTGTTGTACTTAAAACACCAATTAAACTTTGACTTGTTTAAGTGGAAGTTAATTGTTCCAAATTGATTCAATTTTCAGCTGATTAGTCTTGTGCCCCATCACGAAGCAAAGATTCGGATTACCTAGGATTCGGATTAACCTGTTGATCGAATATCTCAAGACAAAAATACTTGTTTAAGATTTGAATCACTCAACAAACAA

Coding sequence (CDS)

ATGACGGCGGAGGTGATTTCTCAGCCGAATGGAGTTGTCGCGAATGGTGGTAACTTGGACCTTAATTCTAACCCTAAATCTGGCGCCGCCAAGAAATCGCGGGAAAGTGAACGACGTCGTCGTCGGCGAAAGCAGAAGAAGAACCAGAAGGCTTCTAAGGTGAAGGAGGCTGCTTCTGGAGAGGATAGTGATGCTTCTGGCGATAACACAAAGGAGAATGATGATCTACTCCAGGTTGTTGAGAAAGTAGAAATTGAATATGTACCTGAGAAGGCTGAATTAGATGATAGCTTGGACGAAGAATTTAGAAGAGTTTTTGAGAAATTCAATTTCAGTGATGTAGTTGGTGTTGAGGAGAATGAAAACAAAGATGAGTCTGCCCAAAATGCAGCATCTAAGAAGTCGGACTCAGATTCTGATGATGAAGAACTTGATAACCAGCAAAAGGAAAAAGGAGGCTTATCAAACAAGAAAAAGAAGTTGCAACGGCGTATGAAGATTGCAGAACTGAAGCAGATTTGTTTGAGACCCGATGTTGTTGAGATATGGGATGCAACTGCAGCTGATCCCAAGTTACTTGTTTATCTAAAATCTTATCGCAACACAGTTCCTGTGCCAAGGCATTGGTGTCAGAAAAGGAAATTTTTACAGGGGAAGCGTGGTATTGAAAAACAGCCATTCCAACTTCCAGATTTTATTGCAGCAACAGGAATTGAGAAGATTAGACAGGCTTACATAGAAAAAGAGGATAGTAAGAAGTTGAAGCAAAAGCAAAGAGAACGAATGCAGCCAAAGATGGGAAAAATGGATATTGATTATCAGGTTCTTCATGATGCTTTTTTCAAGTACCAAACGAAGCCAAAGCTGACAACACTTGGAGATCTGTACTATGAAGGGAAAGAATTCGAGGTTAAGTTAAGGGAGATGAAACCCGGCATGCTATCACAAGAACTGAAAGAAGCACTTGGTATGCCAGAGGGCGCTCCTCCCCCATGGCTCATTAACATGCAGAGATACGGTCCTCCACCATCCTACCCAGATCTAAAAATTCCAGGACTCAATGCTCCCATTCCACCTGGAGCTAGCTTTGGTTACCATCCTGGTGGCTGGGGAAAGCCTCCCGTCGATGAATATGGCCGTCCACTGTATGGAGATGTGTTTGGTGTTCAGCAGCAGGAGCAAGCTAACTACGAGGAGGAACCTGTTGATAAGACCAAGCATTGGGGTGATTTGGAGGAAGAGGAAGAGGAAGAGGTGGAGGAGGAGGATGAAGAGGAAATTGAAGAAGAGGAAATGCAAGATGGCATTGAATCTGTGGATAGTCTATCGAGCACTCCTACTGGTGTGGAGACACCAGATGTTATTGACCTTCGGAAACAACAGAGGAAGGAGCCGGACAGGCCTCTTTACCAAGTTCTCGAAGAAAAAGAAGAGAAGGTTGCTCCTGGGACATTGCTTGGAACTACACATACTTATGTTATTAGTGGCGGTACTCAAGATAAGACGGGAGCCAAAAGGGTTGATTTGCTTAGAGGTCAAAAATCTGATAAAGTGGACGTTACTTTACGACCCGAAGAATTGGAAGCTATGGAAAATGTTCTACCTGCGAAATACGAGGAAGCTAGGGAAGAGGAGAAGTTGCGGAGTCAAAGGGAGGACTTCAGCGACATGGTTGCGGAGAATGAGAAGAAAAGAAAACGTAAGATGCAGGAGAAGGATGGAAAATCTAAGAAGAAAGATTTCAAGTTCTAG

Protein sequence

MTAEVISQPNGVVANGGNLDLNSNPKSGAAKKSRESERRRRRRKQKKNQKASKVKEAASGEDSDASGDNTKENDDLLQVVEKVEIEYVPEKAELDDSLDEEFRRVFEKFNFSDVVGVEENENKDESAQNAASKKSDSDSDDEELDNQQKEKGGLSNKKKKLQRRMKIAELKQICLRPDVVEIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLPDFIAATGIEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLTTLGDLYYEGKEFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGPPPSYPDLKIPGLNAPIPPGASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEPVDKTKHWGDLEEEEEEEVEEEDEEEIEEEEMQDGIESVDSLSSTPTGVETPDVIDLRKQQRKEPDRPLYQVLEEKEEKVAPGTLLGTTHTYVISGGTQDKTGAKRVDLLRGQKSDKVDVTLRPEELEAMENVLPAKYEEAREEEKLRSQREDFSDMVAENEKKRKRKMQEKDGKSKKKDFKF
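
The protein sequence corresponds to a codon-by-codon translation of the coding sequence. The sketch below illustrates this with the standard genetic code on only the first 33 and the last 15 nucleotides of the CDS shown above, to keep the example short. The function name translate and the choice to stop at the first stop codon are assumptions made for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3].upper()]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGACGGCGGAGGTGATTTCTCAGCCGAATGGA"  # first 33 nt of the CDS
cds_end = "GATTTCAAGTTCTAG"                      # last 15 nt of the CDS

print(translate(cds_start))  # MTAEVISQPNG
print(translate(cds_end))    # DFKF
```

Both fragments agree with the start (MTAEVISQPNG...) and end (...DFKF) of the protein sequence listed above.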
BLAST of Cp4.1LG07g08040 vs. Swiss-Prot
Match: SF3B2_HUMAN (Splicing factor 3B subunit 2 OS=Homo sapiens GN=SF3B2 PE=1 SV=2)

HSP 1 Score: 446.8 bits (1148), Expect = 3.6e-124
Identity = 294/586 (50.17%), Positives = 383/586 (65.36%), Query Frame = 1

Query: 35  ESERRRRRRKQKKNQKASKVKEAASGEDSDASGDNTKENDDLLQVVEKVEIEYVPEKAEL 94
           + E+ R+RR +KK +K  +V+  +S    D   D+T+         + VEIEYV E+ E+
Sbjct: 320 KKEKNRKRRNRKKKKKPQRVRGVSSESSGDREKDSTRSRGSDSPAAD-VEIEYVTEEPEI 379

Query: 95  DDSLDEEFRRVFEKFNFSDVVGVE---ENENKDESAQNAASKKSD-----SDSDDEELDN 154
            +     F+R+FE F  +D V  E   E E  D+   +AA KK        DSDD+  D+
Sbjct: 380 YEPNFIFFKRIFEAFKLTDDVKKEKEKEPEKLDKLENSAAPKKKGFEEEHKDSDDDSSDD 439

Query: 155 QQKEKGG---LSNKKKKLQRRMKIAELKQICLRPDVVEIWDATAADPKLLVYLKSYRNTV 214
           +Q++K     LS KK +   R  +AELKQ+  RPDVVE+ D TA DPKLLV+LK+ RN+V
Sbjct: 440 EQEKKPEAPKLSKKKLRRMNRFTVAELKQLVARPDVVEMHDVTAQDPKLLVHLKATRNSV 499

Query: 215 PVPRHWCQKRKFLQGKRGIEKQPFQLPDFIAATGIEKIRQAYIEKEDSKKLKQKQRERMQ 274
           PVPRHWC KRK+LQGKRGIEK PF+LPDFI  TGI+++R+A  EKE+ K +K K RE+++
Sbjct: 500 PVPRHWCFKRKYLQGKRGIEKPPFELPDFIKRTGIQEMREALQEKEEQKTMKSKMREKVR 559

Query: 275 PKMGKMDIDYQVLHDAFFKYQTKPKLTTLGDLYYEGKEFEVKLREMKPGMLSQELKEALG 334
           PKMGK+DIDYQ LHDAFFK+QTKPKLT  GDLYYEGKEFE +L+E KPG LS EL+ +LG
Sbjct: 560 PKMGKIDIDYQKLHDAFFKWQTKPKLTIHGDLYYEGKEFETRLKEKKPGDLSDELRISLG 619

Query: 335 MPEG-----APPPWLINMQRYGPPPSYPDLKIPGLNAPIPPGASFGYHPGGWGKPPVDEY 394
           MP G      PPPWLI MQRYGPPPSYP+LKIPGLN+PIP   SFGYH GGWGKPPVDE 
Sbjct: 620 MPVGPNAHKVPPPWLIAMQRYGPPPSYPNLKIPGLNSPIPESCSFGYHAGGWGKPPVDET 679

Query: 395 GRPLYGDVFGVQQQE-QANYEEEPVDKTKHWGDLEEEEEEEVEEEDEEEIEEEEMQD--- 454
           G+PLYGDVFG    E Q   EEE +D+T  WG+LE  +EE  EEE+EEE +E++  +   
Sbjct: 680 GKPLYGDVFGTNAAEFQTKTEEEEIDRTP-WGELEPSDEESSEEEEEEESDEDKPDETGF 739

Query: 455 ------GIESVDSLSSTPTGVETPDVIDLRKQQRKE----PDRP-LYQVLEEKEEKVAPG 514
                 G+ +    SS P G+ETP++I+LRK++ +E     + P L+ VL EK      G
Sbjct: 740 ITPADSGLITPGGFSSVPAGMETPELIELRKKKIEEAMDGSETPQLFTVLPEKRTATVGG 799

Query: 515 TLLGTTHTYVISGGTQDKTGAKRVDLLRGQKSDKVDVTLRPEELEAMENVLPAKYEEARE 574
            ++G+TH Y +S     K  A     L+G     V+V L PEELE     +  KYEE   
Sbjct: 800 AMMGSTHIYDMSTVMSRKGPAPE---LQG-----VEVALAPEELELDPMAMTQKYEEHVR 859

Query: 575 EEKLRSQREDFSDMVAEN---EKKRKRKMQEKD---GKSKKKDFKF 584
           E++ + ++EDFSDMVAE+   +K++KRK Q +D   G  K K+FKF
Sbjct: 860 EQQAQVEKEDFSDMVAEHAAKQKQKKRKAQPQDSRGGSKKYKEFKF 895

BLAST of Cp4.1LG07g08040 vs. Swiss-Prot
Match: SA145_SCHPO (Pre-mRNA-splicing factor sap145 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=sap145 PE=1 SV=1)

HSP 1 Score: 327.0 bits (837), Expect = 4.1e-88
Identity = 250/602 (41.53%), Positives = 347/602 (57.64%), Query Frame = 1

Query: 1   MTAEVISQPNGVVANGGNLDLNSNPKSGAAKKSRESERRRRRRK-QKKNQKASKVKEAAS 60
           + AE+ +  N +      L+ N+  K+   KKSR   RR +++  ++K    +K+ E   
Sbjct: 16  LMAEIQTAQNPLKELEKILERNNKQKN---KKSRNQVRREKKKLLREKTNSGAKLAE--- 75

Query: 61  GEDSDASGDNTKENDDLLQ---------------VVEKVEIEYVPEKAELD--DSLDEEF 120
            ++SD     T+ ND+L                  V+ +    + +  ELD  D L E+F
Sbjct: 76  -KNSDDKDQLTENNDNLYNDKKSNGNFYDTNKTDSVDGMVYTTIVDSVELDPNDPLIEQF 135

Query: 121 RRVFEKFNFSDVVGVEEN-ENKDESAQNAASKKSDSDSDDEELDNQQKEKGGLSNKKKKL 180
           + VF +F      G E++ E+ D+     +  +  S+ +++ L  QQ+EK  LS KK + 
Sbjct: 136 KDVFNRFKAD---GQEKDFEDTDKGQIMYSDDEILSEGEEDALQKQQEEK--LSKKKLRK 195

Query: 181 QRRMKIAELKQICLRPDVVEIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQGKRG 240
            +RM +A+LK +  + DVVE WD ++ DP  L +LK+Y NTVPVPRHW QKR +L G+RG
Sbjct: 196 LKRMTVAQLKMLSEKADVVEWWDVSSLDPLFLTHLKAYPNTVPVPRHWNQKRDYLSGQRG 255

Query: 241 IEKQPFQLPDFIAATGIEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFF 300
           IE+Q F+LP +I ATGI ++R A  E E    L+QK RER+QPKMGK+DIDYQ LHDAFF
Sbjct: 256 IERQLFELPSYIRATGIVQMRNAVHENEADMPLRQKMRERVQPKMGKLDIDYQKLHDAFF 315

Query: 301 KYQTKPKLTTLGDLYYEGKEFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGP 360
           +YQTKP LT  G+ Y+EGKE E  ++E +PG +S+EL+EALG+  GAPPPWL  MQRYGP
Sbjct: 316 RYQTKPVLTGFGECYFEGKELEADVKEKRPGDISEELREALGIAPGAPPPWLFAMQRYGP 375

Query: 361 PPSYPDLKIPGLNAPIPPGASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEP 420
           PPSYPDLKIPG+N PIP GA +G+HPGGWGKPPVD++ RPLYGDVFG  +         P
Sbjct: 376 PPSYPDLKIPGVNCPIPTGAQWGFHPGGWGKPPVDQFNRPLYGDVFGNVKPRIHAGTGSP 435

Query: 421 VDKTKHWGDLEEEEEEEVEEEDE--------EEIEEEEMQDGIESV-------DSLSSTP 480
           V  T+HWG+LEE EEEE  EE+E        EEI E E  +  +S        + L + P
Sbjct: 436 V-STQHWGELEEFEEEESSEEEESEDVEYPTEEITERETIEEYQSASEPRSQREDLHAEP 495

Query: 481 ------TGVETPDVIDLRKQQRKEPD---RPLYQVLEEKEEKVAPGTLLGTTHTYVISGG 540
                 + VE  D ++LRK  +   D   R LYQVL EK   ++    +G  H Y I   
Sbjct: 496 LTYFNQSNVEV-DNVELRKNTQPSSDAANRDLYQVLPEKSTNIS--GFMGPQHQYDIP-T 555

Query: 541 TQDKTGAKRVDLLRGQKSDKVDVTLR-----PEELEAMENVLPAKYEEAREEEKLRSQRE 555
            +D    KR        ++K DV L       +EL  + +    K   A+  +K +S+R+
Sbjct: 556 AEDTLPQKRNAHSMLSSTNKGDVALNQSSNWQDELSELVSEQAMKVGAAK-RQKTQSKRD 599

BLAST of Cp4.1LG07g08040 vs. Swiss-Prot
Match: CUS1_YEAST (Cold sensitive U2 snRNA suppressor 1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=CUS1 PE=1 SV=1)

HSP 1 Score: 134.4 bits (337), Expect = 3.9e-30
Identity = 121/403 (30.02%), Positives = 199/403 (49.38%), Query Frame = 1

Query: 94  LDDSLDEEFRRVFEKFNFSD------VVGVEENENKDESAQNAASKKSDSDSDDEELDNQ 153
           +D  L++EF+ V ++F   +      +   E+N +     +N    +  +  D+ E    
Sbjct: 55  VDAKLEKEFKDVLQRFQVQENDTPKEITKDEKNNHVVIVEKNPVMNRKHTAEDELEDTPS 114

Query: 154 QKEKGGLSNKKKKLQRRMKIAELKQICLRPDVVEIWDATAADPKLLVYLKSYRNTVPVPR 213
              +  LS +K++   +  +++LK     P ++E +D  A  P LL  +K  +N +PVP 
Sbjct: 115 DGIEEHLSARKRRKTEKPSLSQLKSQVPYPQIIEWYDCDARYPGLLASIKCTKNVIPVPS 174

Query: 214 HWCQKRKFLQGKRGIEKQPFQLPDFIAATGIEKIR----QAYIEKEDSKKLKQKQRERMQ 273
           HW  K+++L G+  + K+PF+LPD I  T IE++R    Q+ ++ +D K LK+  R R+Q
Sbjct: 175 HWQSKKEYLSGRSLLGKRPFELPDIIKKTNIEQMRSTLPQSGLDGQDEKSLKEASRARVQ 234

Query: 274 PKMGKMDIDYQVLHDAFFKYQTKPK---LTTLGDLYYEGKEF--EVKLREM----KPGML 333
           PKMG +D+DY+ LHD FFK     K   L   GD+YYE +    E   + M    +PG +
Sbjct: 235 PKMGALDLDYKKLHDVFFKIGANWKPDHLLCFGDVYYENRNLFEETNWKRMVDHKRPGRI 294

Query: 334 SQELKEALGMPEGAPPPWLINMQRYGPPPSYPDLKIPGLNAPIP--PGASFG-YHPGGWG 393
           SQEL+  + +PEG  PPW + M+  G P  YPDLKI GLN  I    G  +G   P    
Sbjct: 295 SQELRAIMNLPEGQLPPWCMKMKDIGLPTGYPDLKIAGLNWDITNLKGDVYGKIIPNHHS 354

Query: 394 KPPVDEYGRPLYGDVFGVQQQEQANYEEEPVDKTKHWGDLEEEEEEEVEEEDEEEIEEEE 453
           +    + GR  +G +   +  E  N +E+            E   ++ + +DE E + + 
Sbjct: 355 RS--KKQGRNYFGALISFETPEFENSKEDTQANA-------ENGRQDDKIDDEVEHKLDH 414

Query: 454 MQDGIESVDSLSSTPTGVETPDVIDLRKQQRKEPDRPLYQVLE 475
            Q+ I  V S              +  ++  +E ++ LY VL+
Sbjct: 415 FQEDISEVTSAE------------EKLERNEEESEKQLYTVLK 436

BLAST of Cp4.1LG07g08040 vs. TrEMBL
Match: A0A0A0LMT1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G075450 PE=4 SV=1)

HSP 1 Score: 1030.0 bits (2662), Expect = 1.1e-297
Identity = 555/583 (95.20%), Positives = 570/583 (97.77%), Query Frame = 1

Query: 1   MTAEVISQPNGVVANGGNLDLNSNPKSGAAKKSRESERRRRRRKQKKNQKASKVKEAASG 60
           MT EV SQPNGVV+NG +LDLNSNPKSGA KKSRE+ERRRRRRKQKKNQKASKVKEAA G
Sbjct: 1   MTVEV-SQPNGVVSNG-DLDLNSNPKSGAVKKSRENERRRRRRKQKKNQKASKVKEAAGG 60

Query: 61  EDSDASGDNTKENDDLLQVVEKVEIEYVPEKAELDDSLDEEFRRVFEKFNFSDVVGVEEN 120
           EDSDASGD+TKENDD LQVVEKVEIEY+PEKAELDDSLDE+FR+VFEKF+FS+V G EEN
Sbjct: 61  EDSDASGDDTKENDDPLQVVEKVEIEYIPEKAELDDSLDEDFRKVFEKFSFSEVAGAEEN 120

Query: 121 ENKDESAQNAASKKSDSDSDDEELDNQQKEKGGLSNKKKKLQRRMKIAELKQICLRPDVV 180
           E+KDESAQNA SKKSDSDSDDEE DNQQKEK GLSNKKKKLQRRMKIAELKQIC RPDVV
Sbjct: 121 EDKDESAQNATSKKSDSDSDDEEHDNQQKEK-GLSNKKKKLQRRMKIAELKQICSRPDVV 180

Query: 181 EIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLPDFIAATGIEK 240
           EIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLPDFIAATGIEK
Sbjct: 181 EIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLPDFIAATGIEK 240

Query: 241 IRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLTTLGDLYYEGK 300
           IRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLTTLGDLYYEGK
Sbjct: 241 IRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLTTLGDLYYEGK 300

Query: 301 EFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGPPPSYPDLKIPGLNAPIPPG 360
           EFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGPPPSYPDLKIPGLNAPIPPG
Sbjct: 301 EFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGPPPSYPDLKIPGLNAPIPPG 360

Query: 361 ASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEPVDKTKHWGDLEEEEEEEVE 420
           ASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEPVDKTKHWGDLEEEEEEEVE
Sbjct: 361 ASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEPVDKTKHWGDLEEEEEEEVE 420

Query: 421 EEDEEEIEEEEMQDGIESVDSLSSTPTGVETPDVIDLRKQQRKEPDRPLYQVLEEKEEKV 480
           EEDEEE+EEEEM+DGIESVDS SSTPTGVETPDVIDLRKQQRKEPDRPLYQVLEEKEE+V
Sbjct: 421 EEDEEELEEEEMEDGIESVDSQSSTPTGVETPDVIDLRKQQRKEPDRPLYQVLEEKEERV 480

Query: 481 APGTLLGTTHTYVISGGTQDKTGAKRVDLLRGQKSDKVDVTLRPEELEAMENVLPAKYEE 540
           APGTLLGT+HTYVISGGTQDKTGAKRVDLLRGQKSDKVDVTLRPEELEAMENVLPAKYEE
Sbjct: 481 APGTLLGTSHTYVISGGTQDKTGAKRVDLLRGQKSDKVDVTLRPEELEAMENVLPAKYEE 540

Query: 541 AREEEKLRSQREDFSDMVAENEKKRKRKMQEKDGKSKKKDFKF 584
           AREEEKLRSQREDFSDMVAENEKKRKRKMQEK+GKSKKKDFKF
Sbjct: 541 AREEEKLRSQREDFSDMVAENEKKRKRKMQEKEGKSKKKDFKF 580

BLAST of Cp4.1LG07g08040 vs. TrEMBL
Match: W9RNM3_9ROSA (Splicing factor 3B subunit 2 OS=Morus notabilis GN=L484_012118 PE=4 SV=1)

HSP 1 Score: 916.8 bits (2368), Expect = 1.4e-263
Identity = 493/587 (83.99%), Positives = 541/587 (92.16%), Query Frame = 1

Query: 1   MTAEVISQPNGVVANGGNLDLNSNPKSGAAKKSRESERRRRRRKQKKNQKASKVKEAASG 60
           MTAE +  PNGVV NG   DL+ NP S A KKSRESERRRRRRKQKKN+ +     AASG
Sbjct: 1   MTAETLPHPNGVVPNG---DLDVNPSSNATKKSRESERRRRRRKQKKNKPSKAPDSAASG 60

Query: 61  EDSDASGDNTKENDDLLQVVEKVEIEYVPEKAELDDSLDEEFRRVFEKFNFSDVVGVEEN 120
           ++SDA+ D+ KE+ +  Q+V++VE+EYVPEKAEL+D +DEEFR++FEKF+F D  G EE+
Sbjct: 61  DESDAADDDAKEHVNPQQIVDQVEVEYVPEKAELEDGMDEEFRKIFEKFSFQDSAGAEED 120

Query: 121 ENKDESAQNAASKK---SDSDSDDEELDNQQKEKGGLSNKKKKLQRRMKIAELKQICLRP 180
           + KDESA+N  + K   SDSDS+++E D+QQKEK GLSNKKKKLQRRMKIAELKQIC RP
Sbjct: 121 K-KDESAENTTANKKADSDSDSEEDEQDDQQKEK-GLSNKKKKLQRRMKIAELKQICSRP 180

Query: 181 DVVEIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLPDFIAATG 240
           DVVE+WDAT+ADPKLLV+LKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLPDFIAATG
Sbjct: 181 DVVEVWDATSADPKLLVFLKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLPDFIAATG 240

Query: 241 IEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLTTLGDLYY 300
           IEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLT+LGDLY+
Sbjct: 241 IEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLTSLGDLYH 300

Query: 301 EGKEFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGPPPSYPDLKIPGLNAPI 360
           EGKEFEVKLREMKPGMLSQELKEALGMP+GAPPPWLINMQRYGPPPSYP LKIPGLNAPI
Sbjct: 301 EGKEFEVKLREMKPGMLSQELKEALGMPDGAPPPWLINMQRYGPPPSYPHLKIPGLNAPI 360

Query: 361 PPGASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEPVDKTKHWGDLEEEEEE 420
           PPGASFGYHPGGWGKPPVDEYG+PLYGDVFG+QQQEQ NYEEEPVDKTKHWGDLEEEEEE
Sbjct: 361 PPGASFGYHPGGWGKPPVDEYGQPLYGDVFGIQQQEQPNYEEEPVDKTKHWGDLEEEEEE 420

Query: 421 EVEEEDEEE-IEEEEMQDGIESVDSLSSTPTGVETPDVIDLRKQQRKEPDRPLYQVLEEK 480
           E EEE+EEE IEEEE++DGI+SVDSLSSTPTGVETPDVIDLRKQQRKEP+RPLYQVLEEK
Sbjct: 421 EEEEEEEEEQIEEEELEDGIQSVDSLSSTPTGVETPDVIDLRKQQRKEPERPLYQVLEEK 480

Query: 481 EEKVAPGTLLGTTHTYVISGGTQDKTGAKRVDLLRGQKSDKVDVTLRPEELEAMENVLPA 540
           EEK+APGTLLGTTHTYV++ G Q+K GAKRVDLLRGQK+DKVDVTL+PEELE MENVLPA
Sbjct: 481 EEKIAPGTLLGTTHTYVVASGAQEKLGAKRVDLLRGQKTDKVDVTLQPEELEVMENVLPA 540

Query: 541 KYEEAREEEKLRSQREDFSDMVAENEKKRKRKMQEKDGKSKKKDFKF 584
           KYEEAREEEK RSQREDFSDMVAENEKKRKRKMQ+K+GKSKKKDFKF
Sbjct: 541 KYEEAREEEKQRSQREDFSDMVAENEKKRKRKMQDKEGKSKKKDFKF 582

BLAST of Cp4.1LG07g08040 vs. TrEMBL
Match: A0A067JLP2_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21406 PE=4 SV=1)

HSP 1 Score: 912.1 bits (2356), Expect = 3.4e-262
Identity = 494/593 (83.31%), Positives = 541/593 (91.23%), Query Frame = 1

Query: 1   MTAEVISQPNGVVANGGNLDLNSNPKSGAAKKSRESERRRRRRKQKKNQK-ASKVKEA-- 60
           MT + +S  NGVV+NG   + N    +  AKKSRESERRRRRRKQKKN K AS+  ++  
Sbjct: 1   MTVDTLSYQNGVVSNGDLANSNPTSNNAVAKKSRESERRRRRRKQKKNNKVASRDPDSSA 60

Query: 61  -ASGEDSD-----ASGDNTKENDDLLQVVEKVEIEYVPEKAELDDSLDEEFRRVFEKFNF 120
            A+G +SD     A+GD++KEN D  Q +E+V IEYVPEKAEL D +DEEFR++FEKF+F
Sbjct: 61  NANGNESDDDASAANGDDSKENADPQQALEQVVIEYVPEKAELGDGMDEEFRKIFEKFSF 120

Query: 121 SDVVGVEENENKDESAQNAAS-KKSDSDSDDEELDNQQKEKGGLSNKKKKLQRRMKIAEL 180
            ++ G EEN+ KDESAQN  S KK+DSDS++EELDNQQKEK G+SNKKKKL RRMKIAEL
Sbjct: 121 QEIAGSEENDKKDESAQNVDSKKKADSDSEEEELDNQQKEK-GVSNKKKKLLRRMKIAEL 180

Query: 181 KQICLRPDVVEIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLP 240
           KQIC RPDVVE+WDATAADPKLLV+LKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLP
Sbjct: 181 KQICSRPDVVEVWDATAADPKLLVFLKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLP 240

Query: 241 DFIAATGIEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLT 300
           DFIAATGIEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLT
Sbjct: 241 DFIAATGIEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLT 300

Query: 301 TLGDLYYEGKEFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGPPPSYPDLKI 360
           T GDLY+EGKEFEVKLREMKPG LS+ELK+ALGMPEGAPPPWLINMQRYGPPPSYP LKI
Sbjct: 301 THGDLYHEGKEFEVKLREMKPGTLSKELKDALGMPEGAPPPWLINMQRYGPPPSYPHLKI 360

Query: 361 PGLNAPIPPGASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEPVDKTKHWGD 420
           PGLNAPIPPGASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQ NYEEEPVDKTKHWGD
Sbjct: 361 PGLNAPIPPGASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQPNYEEEPVDKTKHWGD 420

Query: 421 LEEEEEEEVEEEDEEEIEEEEMQDGIESVDSLSSTPTGVETPDVIDLRKQQRKEPDRPLY 480
           LEE+EEEE EEE+EEEIEEEE++DGI+SVDSLSSTPTGVETPDVIDLRKQQRKEP+RPLY
Sbjct: 421 LEEDEEEEEEEEEEEEIEEEELEDGIQSVDSLSSTPTGVETPDVIDLRKQQRKEPERPLY 480

Query: 481 QVLEEKEEKVAPGTLLGTTHTYVISGGTQDKTGAKRVDLLRGQKSDKVDVTLRPEELEAM 540
           QVLEEKEE++APGTLLGTTHTYV+  GTQDK+GAKRVDLLRGQK+D+VDVTL+PEEL+ M
Sbjct: 481 QVLEEKEERIAPGTLLGTTHTYVVGTGTQDKSGAKRVDLLRGQKTDRVDVTLQPEELDVM 540

Query: 541 ENVLPAKYEEAREEEKLRSQREDFSDMVAENEKKRKRKMQEKDGKSKKKDFKF 584
           +NVLPAKYEEAREEEKLRSQREDFSDMVAENEKKRKRKMQEK+GKSKKKDFKF
Sbjct: 541 DNVLPAKYEEAREEEKLRSQREDFSDMVAENEKKRKRKMQEKEGKSKKKDFKF 592

BLAST of Cp4.1LG07g08040 vs. TrEMBL
Match: A0A061GFX9_THECC (Proline-rich spliceosome-associated family protein isoform 2 OS=Theobroma cacao GN=TCM_029917 PE=4 SV=1)

HSP 1 Score: 902.1 bits (2330), Expect = 3.5e-259
Identity = 494/593 (83.31%), Positives = 541/593 (91.23%), Query Frame = 1

Query: 1   MTAEVISQPNGVVANGGNLDL---NSNPKSGAAKKSRESERRRRRRKQKKNQKASKVK-E 60
           MTAEV+S  NG V + G+L+    N+NP S A+KKSRESERRRRRRKQKKN+K S ++ +
Sbjct: 1   MTAEVLSHQNGAVVSNGDLNKKTSNANP-SVASKKSRESERRRRRRKQKKNKKTSHLQND 60

Query: 61  AASGE-----DSDASGDNTKENDDLLQVVEKVEIEYVPEKAELDDSLDEEFRRVFEKFNF 120
           AA+G      DSDA  D TKEN D  Q+ E+V +EYVPEKAELDD +DEEFR+VFEKF+F
Sbjct: 61  AANGAVSDAGDSDAGEDETKENSDPQQITEQVVVEYVPEKAELDDGIDEEFRKVFEKFSF 120

Query: 121 SDVVGVEENENKDESAQNA-ASKKSDSDSDDEELDNQQKEKGGLSNKKKKLQRRMKIAEL 180
            +  G EE + KDESA++A A KK DSDSD+EE DNQQKEK G+SNKKKKLQRRMKIAEL
Sbjct: 121 WEAAGSEETDKKDESAEDADAKKKDDSDSDEEEQDNQQKEK-GVSNKKKKLQRRMKIAEL 180

Query: 181 KQICLRPDVVEIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLP 240
           KQIC RPDVVE+WDATA+DPKLLV+LK+YRNTVPVPRHWCQKRK+LQGKRGIEKQPFQLP
Sbjct: 181 KQICSRPDVVEVWDATASDPKLLVFLKAYRNTVPVPRHWCQKRKYLQGKRGIEKQPFQLP 240

Query: 241 DFIAATGIEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLT 300
           DFIAATGIEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLT
Sbjct: 241 DFIAATGIEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLT 300

Query: 301 TLGDLYYEGKEFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGPPPSYPDLKI 360
           T GDLY+EGKEFEVKLREMKPG LS ELKEALGMPEGAPPPWLINMQRYGPPPSYP LKI
Sbjct: 301 THGDLYHEGKEFEVKLREMKPGSLSHELKEALGMPEGAPPPWLINMQRYGPPPSYPQLKI 360

Query: 361 PGLNAPIPPGASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEPVDKTKHWGD 420
           PGLNAPIP GA FGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQ NYEEEPVDK+KHWGD
Sbjct: 361 PGLNAPIPLGAIFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQPNYEEEPVDKSKHWGD 420

Query: 421 LEEEEEEEVEEEDEEEIEEEEMQDGIESVDSLSSTPTGVETPDVIDLRKQQRKEPDRPLY 480
           LEEEEEEE EEE+EEEIEEEE++DGI+SVDSLSSTPTGVETPDVIDLRKQQRKEP+RPLY
Sbjct: 421 LEEEEEEE-EEEEEEEIEEEELEDGIQSVDSLSSTPTGVETPDVIDLRKQQRKEPERPLY 480

Query: 481 QVLEEKEEKVAPGTLLGTTHTYVISGGTQDKTGAKRVDLLRGQKSDKVDVTLRPEELEAM 540
           QVLEEKEE++APGTLLGTTHTYV++ GTQDK+ AKRVDLL+GQKSD+V+V+L+PEELE M
Sbjct: 481 QVLEEKEERIAPGTLLGTTHTYVVNTGTQDKSAAKRVDLLKGQKSDRVEVSLQPEELELM 540

Query: 541 ENVLPAKYEEAREEEKLRSQREDFSDMVAENEKKRKRKMQEKDGKSKKKDFKF 584
           +NVLPAKYEEAREEEKLRSQREDFSDMVAENEKKRKRKMQEK+GKSKKKDFKF
Sbjct: 541 DNVLPAKYEEAREEEKLRSQREDFSDMVAENEKKRKRKMQEKEGKSKKKDFKF 590

BLAST of Cp4.1LG07g08040 vs. TrEMBL
Match: A0A151SEI3_CAJCA (Splicing factor 3B subunit 2 OS=Cajanus cajan GN=KK1_024869 PE=4 SV=1)

HSP 1 Score: 901.7 bits (2329), Expect = 4.5e-259
Identity = 492/588 (83.67%), Positives = 539/588 (91.67%), Query Frame = 1

Query: 1   MTAEVISQPNGVVANGGNLDLNSNPKSGAA--KKSRESERRRRRRKQKKNQKASKVKEAA 60
           MTAE ++  NGVV+NG  +  +++  S AA  KKSRESERRRRRRKQKKN KASK     
Sbjct: 1   MTAETLAYQNGVVSNGDLVTTSNSSSSSAAATKKSRESERRRRRRKQKKNNKASK----E 60

Query: 61  SGEDSDASGDNTKENDDLLQVVEKVEIEYVPEKAELDDSLDEEFRRVFEKFNFSDVVGVE 120
           + ED    GD+TKEN +  QVVE+VEIEYVPE+AELD+ LDEEFR++FEKF+F +V G E
Sbjct: 61  TAED----GDDTKENAEPQQVVEQVEIEYVPERAELDEGLDEEFRKIFEKFSFGEVTGSE 120

Query: 121 ENENKDESAQNA-ASKKSDSDSDDEELDNQQKEKGGLSNKKKKLQRRMKIAELKQICLRP 180
           +N+ KDESA+NA  +KK+DSDS++EE DN+Q+EK G+SNKKKKLQRRMKIAELKQI  RP
Sbjct: 121 DNDKKDESAENATTNKKADSDSEEEENDNEQREK-GISNKKKKLQRRMKIAELKQISSRP 180

Query: 181 DVVEIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLPDFIAATG 240
           DVVE+WDATA+DPKLLV+LKSYRNTVPVPRHWCQKRKFLQGKRGIEK PFQLPDFIAATG
Sbjct: 181 DVVEVWDATASDPKLLVFLKSYRNTVPVPRHWCQKRKFLQGKRGIEKPPFQLPDFIAATG 240

Query: 241 IEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLTTLGDLYY 300
           IEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLT+LGDLY+
Sbjct: 241 IEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLTSLGDLYH 300

Query: 301 EGKEFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGPPPSYPDLKIPGLNAPI 360
           EGKEFEVKLREMKPG LS ELKEALGMPEGAPPPWLINMQRYGPPPSYP LKIPGLNAPI
Sbjct: 301 EGKEFEVKLREMKPGCLSHELKEALGMPEGAPPPWLINMQRYGPPPSYPHLKIPGLNAPI 360

Query: 361 PPGASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEPVDKTKHWGDL--EEEE 420
           PPGASFGYHPGGWGKPPVDEYGRPLYGDVFGV QQEQ NYEEEPVDKTKHWGDL  EEEE
Sbjct: 361 PPGASFGYHPGGWGKPPVDEYGRPLYGDVFGVHQQEQPNYEEEPVDKTKHWGDLEEEEEE 420

Query: 421 EEEVEEEDEEEIEEEEMQDGIESVDSLSSTPTGVETPDVIDLRKQQRKEPDRPLYQVLEE 480
           EEE EEE+EEE+EEEE++ GI+SVDSLSSTPTGVETPDVIDLRKQQRKEP+RPLYQVLEE
Sbjct: 421 EEEEEEEEEEEMEEEELEAGIQSVDSLSSTPTGVETPDVIDLRKQQRKEPERPLYQVLEE 480

Query: 481 KEEKVAPGTLLGTTHTYVISGGTQDKTGAKRVDLLRGQKSDKVDVTLRPEELEAMENVLP 540
           KEEK+APGTLLGTTHTYV++ GTQDK+GAKRVDLLRGQKSDKVDVTL PEEL+AMENVLP
Sbjct: 481 KEEKIAPGTLLGTTHTYVVNTGTQDKSGAKRVDLLRGQKSDKVDVTLLPEELDAMENVLP 540

Query: 541 AKYEEAREEEKLRSQREDFSDMVAENEKKRKRKMQEKDGKSKKKDFKF 584
           AKYEEAREEEKLR+QREDFSDMVAENEK++KRKMQEK+GKSKKKDFKF
Sbjct: 541 AKYEEAREEEKLRNQREDFSDMVAENEKRKKRKMQEKEGKSKKKDFKF 579

BLAST of Cp4.1LG07g08040 vs. TAIR10
Match: AT4G21660.2 (AT4G21660.2 proline-rich spliceosome-associated (PSP) family protein)

HSP 1 Score: 785.0 bits (2026), Expect = 3.1e-227
Identity = 442/605 (73.06%), Positives = 505/605 (83.47%), Query Frame = 1

Query: 1   MTAE-VISQPNGVVANGGNLDLNSNPKSGAAKKSRESERRRRRRKQKKNQKASKVKEAAS 60
           MTA+  ++  + VV+NG   D+++   S ++KKSRE +RRRRRRKQKKN KAS+    AS
Sbjct: 1   MTADSTVALVHSVVSNG---DVSNGNTSASSKKSREIDRRRRRRKQKKNNKASQADVDAS 60

Query: 61  GEDSDASGDNTKENDDLLQVVEKVEIEYVPEKAELDDSLDEEFRRVFEKFNFSDVVGVEE 120
             D  A+ ++ +  D   QV E++ IEYVPE+AE +D  ++EF+ +FEKFNF + +  EE
Sbjct: 61  --DVSAASESKENTDPQPQVCEQIVIEYVPEQAEFEDGFNDEFKEIFEKFNFREPLASEE 120

Query: 121 NENKDESAQNAASKK---SDSDSDDEELDNQQKEKGGLSNKKKKLQRRMKIAELKQICLR 180
           +  KDES +    KK   SDSDSDD+E DNQ KEK G+SNKKKKLQRRMKIAELKQ+  R
Sbjct: 121 DGTKDESEEKEDVKKKVNSDSDSDDDEQDNQNKEK-GISNKKKKLQRRMKIAELKQVSAR 180

Query: 181 PDVVEIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQ--------------GKRGI 240
           PDVVE+WDAT+ADPKLLV+LKSYRNTVPVPRHW QKRK+LQ              GKRGI
Sbjct: 181 PDVVEVWDATSADPKLLVFLKSYRNTVPVPRHWSQKRKYLQGNYIKRDRNVCDNHGKRGI 240

Query: 241 EKQPFQLPDFIAATGIEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFK 300
           EKQPF LPDFIAATGIEKIRQAYIEKED KKLKQKQRERMQPKMGKMDIDYQVLHDAFFK
Sbjct: 241 EKQPFHLPDFIAATGIEKIRQAYIEKEDGKKLKQKQRERMQPKMGKMDIDYQVLHDAFFK 300

Query: 301 YQTKPKLTTLGDLYYEGKEFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGPP 360
           YQTKPKL+ LGDLY+EGKEFEVKLRE KPG LS +LKEALGMPEGAPPPWLINMQRYGPP
Sbjct: 301 YQTKPKLSALGDLYFEGKEFEVKLRETKPGFLSNDLKEALGMPEGAPPPWLINMQRYGPP 360

Query: 361 PSYPDLKIPGLNAPIPPGASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEPV 420
           PSYP LKIPGLNAPIP GASFG+H GGWGKPPVDEYGRPLYGDVFGVQQQ+Q NYEEEP+
Sbjct: 361 PSYPHLKIPGLNAPIPIGASFGFHAGGWGKPPVDEYGRPLYGDVFGVQQQDQPNYEEEPI 420

Query: 421 DKTKHWGDLEEEEEEEVEEED--EEEIEEEEMQDGIESVDSLSSTPTGVETPDVIDLRKQ 480
           DK+KHWGDLEEEEEEE EEE+  EEE++EEE++DG ESVD+LSSTPTG+ETPD I+LRK 
Sbjct: 421 DKSKHWGDLEEEEEEEEEEEEEQEEEMDEEELEDGTESVDTLSSTPTGIETPDAIELRKD 480

Query: 481 QRKEPDRPLYQVLEEKEEKVAPGTLLGTTHTYVISGGTQDKTGAKRVDLLRGQKSDKVDV 540
           QRKEPDR LYQVLEEK E VAPGTLLGT+HTYVI  GTQ+KTGAKRVDLLRGQK+D+VDV
Sbjct: 481 QRKEPDRALYQVLEEKGESVAPGTLLGTSHTYVIKTGTQEKTGAKRVDLLRGQKTDRVDV 540

Query: 541 TLRPEELEAMENVLPAKYEEAREEEKLRSQREDFSDMVAEN--EKKRKRKMQEKDGKSKK 584
           +L+PEEL+AMENVLPAKYEEAREEEKLR++  D SDMV E+  +  RKRKM +K+GK KK
Sbjct: 541 SLQPEELDAMENVLPAKYEEAREEEKLRNKPVDLSDMVVEHVQQNSRKRKMHDKEGK-KK 598

BLAST of Cp4.1LG07g08040 vs. TAIR10
Match: AT1G11520.1 (AT1G11520.1 pliceosome associated protein-related)

HSP 1 Score: 155.2 bits (391), Expect = 1.2e-37
Identity = 86/114 (75.44%), Positives = 97/114 (85.09%), Query Frame = 1

Query: 434 DGIESVDSLSSTPTGVETPDVIDLRKQQRKEPDRPLYQVLEEKEEKV-APGTLLGTTHTY 493
           D ++   SLSSTPTG+ETPD I+LRK+QRKEPDR LYQVLEEK E V APGTLL TTHTY
Sbjct: 85  DAMDVSKSLSSTPTGIETPDAIELRKEQRKEPDRALYQVLEEKGESVVAPGTLLRTTHTY 144

Query: 494 VISGGTQDKTGAKRVDLLRGQKSDKVDVTLRPEELEAMENVLPAKYEEAREEEK 547
           VI  GTQDKTG KRVDLLRGQK+D+VD +L+PEEL+AM NVL  +YEEAREEEK
Sbjct: 145 VIKTGTQDKTGTKRVDLLRGQKTDRVDFSLQPEELDAMGNVL--QYEEAREEEK 196

BLAST of Cp4.1LG07g08040 vs. NCBI nr
Match: gi|659082396|ref|XP_008441818.1| (PREDICTED: splicing factor 3B subunit 2 isoform X1 [Cucumis melo])

HSP 1 Score: 1036.2 bits (2678), Expect = 2.2e-299
Identity = 559/583 (95.88%), Positives = 570/583 (97.77%), Query Frame = 1

Query: 1   MTAEVISQPNGVVANGGNLDLNSNPKSGAAKKSRESERRRRRRKQKKNQKASKVKEAASG 60
           MT EV SQPNGVV+NG +LDLNSNPKSGA KKSRESERRRRRRKQKKNQKASKVKEAA G
Sbjct: 1   MTVEV-SQPNGVVSNG-DLDLNSNPKSGAVKKSRESERRRRRRKQKKNQKASKVKEAAGG 60

Query: 61  EDSDASGDNTKENDDLLQVVEKVEIEYVPEKAELDDSLDEEFRRVFEKFNFSDVVGVEEN 120
           +DSDASGD+TKENDD LQVVEKVEIEYVPEKAELDDSLDE+FR+VFEKF FS+V G EEN
Sbjct: 61  DDSDASGDDTKENDDPLQVVEKVEIEYVPEKAELDDSLDEDFRKVFEKFTFSEVAGAEEN 120

Query: 121 ENKDESAQNAASKKSDSDSDDEELDNQQKEKGGLSNKKKKLQRRMKIAELKQICLRPDVV 180
           ENKDESAQNA SKKSDSDSDDEELDNQQKEK GLSNKKKKLQRRMKIAELKQIC RPDVV
Sbjct: 121 ENKDESAQNATSKKSDSDSDDEELDNQQKEK-GLSNKKKKLQRRMKIAELKQICSRPDVV 180

Query: 181 EIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLPDFIAATGIEK 240
           EIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLPDFIAATGIEK
Sbjct: 181 EIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLPDFIAATGIEK 240

Query: 241 IRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLTTLGDLYYEGK 300
           IRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLTTLGDLYYEGK
Sbjct: 241 IRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLTTLGDLYYEGK 300

Query: 301 EFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGPPPSYPDLKIPGLNAPIPPG 360
           EFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGPPPSYPDLKIPGLNAPIPPG
Sbjct: 301 EFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGPPPSYPDLKIPGLNAPIPPG 360

Query: 361 ASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEPVDKTKHWGDLEEEEEEEVE 420
           ASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEPVDKTKHWGDLEEEEEEEVE
Sbjct: 361 ASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEPVDKTKHWGDLEEEEEEEVE 420

Query: 421 EEDEEEIEEEEMQDGIESVDSLSSTPTGVETPDVIDLRKQQRKEPDRPLYQVLEEKEEKV 480
           EEDEEE+EEEEM+DGIESVDS SSTPTGVETPDVIDLRKQQRKEPDRPLYQVLEEKEE+V
Sbjct: 421 EEDEEELEEEEMEDGIESVDSQSSTPTGVETPDVIDLRKQQRKEPDRPLYQVLEEKEERV 480

Query: 481 APGTLLGTTHTYVISGGTQDKTGAKRVDLLRGQKSDKVDVTLRPEELEAMENVLPAKYEE 540
           APGTLLGTTHTYVISGGTQDKTGAKRVDLLRGQKSDKVDVTLRPEELEAMENVLPAKYEE
Sbjct: 481 APGTLLGTTHTYVISGGTQDKTGAKRVDLLRGQKSDKVDVTLRPEELEAMENVLPAKYEE 540

Query: 541 AREEEKLRSQREDFSDMVAENEKKRKRKMQEKDGKSKKKDFKF 584
           AREEEKLRSQREDFSDMVAENEKKRKRKMQEK+GKSKKKDFKF
Sbjct: 541 AREEEKLRSQREDFSDMVAENEKKRKRKMQEKEGKSKKKDFKF 580

BLAST of Cp4.1LG07g08040 vs. NCBI nr
Match: gi|449470216|ref|XP_004152814.1| (PREDICTED: splicing factor 3B subunit 2 isoform X1 [Cucumis sativus])

HSP 1 Score: 1030.0 bits (2662), Expect = 1.6e-297
Identity = 555/583 (95.20%), Positives = 570/583 (97.77%), Query Frame = 1

Query: 1   MTAEVISQPNGVVANGGNLDLNSNPKSGAAKKSRESERRRRRRKQKKNQKASKVKEAASG 60
           MT EV SQPNGVV+NG +LDLNSNPKSGA KKSRE+ERRRRRRKQKKNQKASKVKEAA G
Sbjct: 1   MTVEV-SQPNGVVSNG-DLDLNSNPKSGAVKKSRENERRRRRRKQKKNQKASKVKEAAGG 60

Query: 61  EDSDASGDNTKENDDLLQVVEKVEIEYVPEKAELDDSLDEEFRRVFEKFNFSDVVGVEEN 120
           EDSDASGD+TKENDD LQVVEKVEIEY+PEKAELDDSLDE+FR+VFEKF+FS+V G EEN
Sbjct: 61  EDSDASGDDTKENDDPLQVVEKVEIEYIPEKAELDDSLDEDFRKVFEKFSFSEVAGAEEN 120

Query: 121 ENKDESAQNAASKKSDSDSDDEELDNQQKEKGGLSNKKKKLQRRMKIAELKQICLRPDVV 180
           E+KDESAQNA SKKSDSDSDDEE DNQQKEK GLSNKKKKLQRRMKIAELKQIC RPDVV
Sbjct: 121 EDKDESAQNATSKKSDSDSDDEEHDNQQKEK-GLSNKKKKLQRRMKIAELKQICSRPDVV 180

Query: 181 EIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLPDFIAATGIEK 240
           EIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLPDFIAATGIEK
Sbjct: 181 EIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLPDFIAATGIEK 240

Query: 241 IRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLTTLGDLYYEGK 300
           IRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLTTLGDLYYEGK
Sbjct: 241 IRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLTTLGDLYYEGK 300

Query: 301 EFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGPPPSYPDLKIPGLNAPIPPG 360
           EFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGPPPSYPDLKIPGLNAPIPPG
Sbjct: 301 EFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGPPPSYPDLKIPGLNAPIPPG 360

Query: 361 ASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEPVDKTKHWGDLEEEEEEEVE 420
           ASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEPVDKTKHWGDLEEEEEEEVE
Sbjct: 361 ASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEPVDKTKHWGDLEEEEEEEVE 420

Query: 421 EEDEEEIEEEEMQDGIESVDSLSSTPTGVETPDVIDLRKQQRKEPDRPLYQVLEEKEEKV 480
           EEDEEE+EEEEM+DGIESVDS SSTPTGVETPDVIDLRKQQRKEPDRPLYQVLEEKEE+V
Sbjct: 421 EEDEEELEEEEMEDGIESVDSQSSTPTGVETPDVIDLRKQQRKEPDRPLYQVLEEKEERV 480

Query: 481 APGTLLGTTHTYVISGGTQDKTGAKRVDLLRGQKSDKVDVTLRPEELEAMENVLPAKYEE 540
           APGTLLGT+HTYVISGGTQDKTGAKRVDLLRGQKSDKVDVTLRPEELEAMENVLPAKYEE
Sbjct: 481 APGTLLGTSHTYVISGGTQDKTGAKRVDLLRGQKSDKVDVTLRPEELEAMENVLPAKYEE 540

Query: 541 AREEEKLRSQREDFSDMVAENEKKRKRKMQEKDGKSKKKDFKF 584
           AREEEKLRSQREDFSDMVAENEKKRKRKMQEK+GKSKKKDFKF
Sbjct: 541 AREEEKLRSQREDFSDMVAENEKKRKRKMQEKEGKSKKKDFKF 580

BLAST of Cp4.1LG07g08040 vs. NCBI nr
Match: gi|659082398|ref|XP_008441819.1| (PREDICTED: splicing factor 3B subunit 2 isoform X2 [Cucumis melo])

HSP 1 Score: 1024.6 bits (2648), Expect = 6.7e-296
Identity = 559/602 (92.86%), Positives = 570/602 (94.68%), Query Frame = 1

Query: 1   MTAEVISQPNGVVANGGNLDLNSNPKSGAAKKSRESERRRRRRKQKKNQKASKVKEAASG 60
           MT EV SQPNGVV+NG +LDLNSNPKSGA KKSRESERRRRRRKQKKNQKASKVKEAA G
Sbjct: 1   MTVEV-SQPNGVVSNG-DLDLNSNPKSGAVKKSRESERRRRRRKQKKNQKASKVKEAAGG 60

Query: 61  EDSDASGDNTKENDDLLQVVEKVEIEYVPEKAELDDSLDEEFRRVFEKFNFSDVVGVEEN 120
           +DSDASGD+TKENDD LQVVEKVEIEYVPEKAELDDSLDE+FR+VFEKF FS+V G EEN
Sbjct: 61  DDSDASGDDTKENDDPLQVVEKVEIEYVPEKAELDDSLDEDFRKVFEKFTFSEVAGAEEN 120

Query: 121 ENKDESAQNAASKKSDSDSDDEELDNQQKEKGGLSNKKKKLQRRMKIAELKQICLRPDVV 180
           ENKDESAQNA SKKSDSDSDDEELDNQQKEKG LSNKKKKLQRRMKIAELKQIC RPDVV
Sbjct: 121 ENKDESAQNATSKKSDSDSDDEELDNQQKEKG-LSNKKKKLQRRMKIAELKQICSRPDVV 180

Query: 181 EIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQ-------------------GKRG 240
           EIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQ                   GKRG
Sbjct: 181 EIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQVSNSAGIKGVAVTLLEPELGKRG 240

Query: 241 IEKQPFQLPDFIAATGIEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFF 300
           IEKQPFQLPDFIAATGIEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFF
Sbjct: 241 IEKQPFQLPDFIAATGIEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFF 300

Query: 301 KYQTKPKLTTLGDLYYEGKEFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGP 360
           KYQTKPKLTTLGDLYYEGKEFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGP
Sbjct: 301 KYQTKPKLTTLGDLYYEGKEFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGP 360

Query: 361 PPSYPDLKIPGLNAPIPPGASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEP 420
           PPSYPDLKIPGLNAPIPPGASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEP
Sbjct: 361 PPSYPDLKIPGLNAPIPPGASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEP 420

Query: 421 VDKTKHWGDLEEEEEEEVEEEDEEEIEEEEMQDGIESVDSLSSTPTGVETPDVIDLRKQQ 480
           VDKTKHWGDLEEEEEEEVEEEDEEE+EEEEM+DGIESVDS SSTPTGVETPDVIDLRKQQ
Sbjct: 421 VDKTKHWGDLEEEEEEEVEEEDEEELEEEEMEDGIESVDSQSSTPTGVETPDVIDLRKQQ 480

Query: 481 RKEPDRPLYQVLEEKEEKVAPGTLLGTTHTYVISGGTQDKTGAKRVDLLRGQKSDKVDVT 540
           RKEPDRPLYQVLEEKEE+VAPGTLLGTTHTYVISGGTQDKTGAKRVDLLRGQKSDKVDVT
Sbjct: 481 RKEPDRPLYQVLEEKEERVAPGTLLGTTHTYVISGGTQDKTGAKRVDLLRGQKSDKVDVT 540

Query: 541 LRPEELEAMENVLPAKYEEAREEEKLRSQREDFSDMVAENEKKRKRKMQEKDGKSKKKDF 584
           LRPEELEAMENVLPAKYEEAREEEKLRSQREDFSDMVAENEKKRKRKMQEK+GKSKKKDF
Sbjct: 541 LRPEELEAMENVLPAKYEEAREEEKLRSQREDFSDMVAENEKKRKRKMQEKEGKSKKKDF 599

BLAST of Cp4.1LG07g08040 vs. NCBI nr
Match: gi|778667945|ref|XP_011649013.1| (PREDICTED: splicing factor 3B subunit 2 isoform X2 [Cucumis sativus])

HSP 1 Score: 1017.7 bits (2630), Expect = 8.1e-294
Identity = 555/604 (91.89%), Positives = 570/604 (94.37%), Query Frame = 1

Query: 1   MTAEVISQPNGVVANGGNLDLNSNPKSGAAKKSRESERRRRRRKQKKNQKASKVKEAASG 60
           MT EV SQPNGVV+NG +LDLNSNPKSGA KKSRE+ERRRRRRKQKKNQKASKVKEAA G
Sbjct: 1   MTVEV-SQPNGVVSNG-DLDLNSNPKSGAVKKSRENERRRRRRKQKKNQKASKVKEAAGG 60

Query: 61  EDSDASGDNTKENDDLLQVVEKVEIEYVPEKAELDDSLDEEFRRVFEKFNFSDVVGVEEN 120
           EDSDASGD+TKENDD LQVVEKVEIEY+PEKAELDDSLDE+FR+VFEKF+FS+V G EEN
Sbjct: 61  EDSDASGDDTKENDDPLQVVEKVEIEYIPEKAELDDSLDEDFRKVFEKFSFSEVAGAEEN 120

Query: 121 ENKDESAQNAASKKSDSDSDDEELDNQQKEKGGLSNKKKKLQRRMKIAELKQICLRPDVV 180
           E+KDESAQNA SKKSDSDSDDEE DNQQKEKG LSNKKKKLQRRMKIAELKQIC RPDVV
Sbjct: 121 EDKDESAQNATSKKSDSDSDDEEHDNQQKEKG-LSNKKKKLQRRMKIAELKQICSRPDVV 180

Query: 181 EIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQ---------------------GK 240
           EIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQ                     GK
Sbjct: 181 EIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQVSNSAGIKFMGVAVTLLEPELGK 240

Query: 241 RGIEKQPFQLPDFIAATGIEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDA 300
           RGIEKQPFQLPDFIAATGIEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDA
Sbjct: 241 RGIEKQPFQLPDFIAATGIEKIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDA 300

Query: 301 FFKYQTKPKLTTLGDLYYEGKEFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRY 360
           FFKYQTKPKLTTLGDLYYEGKEFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRY
Sbjct: 301 FFKYQTKPKLTTLGDLYYEGKEFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRY 360

Query: 361 GPPPSYPDLKIPGLNAPIPPGASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEE 420
           GPPPSYPDLKIPGLNAPIPPGASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEE
Sbjct: 361 GPPPSYPDLKIPGLNAPIPPGASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEE 420

Query: 421 EPVDKTKHWGDLEEEEEEEVEEEDEEEIEEEEMQDGIESVDSLSSTPTGVETPDVIDLRK 480
           EPVDKTKHWGDLEEEEEEEVEEEDEEE+EEEEM+DGIESVDS SSTPTGVETPDVIDLRK
Sbjct: 421 EPVDKTKHWGDLEEEEEEEVEEEDEEELEEEEMEDGIESVDSQSSTPTGVETPDVIDLRK 480

Query: 481 QQRKEPDRPLYQVLEEKEEKVAPGTLLGTTHTYVISGGTQDKTGAKRVDLLRGQKSDKVD 540
           QQRKEPDRPLYQVLEEKEE+VAPGTLLGT+HTYVISGGTQDKTGAKRVDLLRGQKSDKVD
Sbjct: 481 QQRKEPDRPLYQVLEEKEERVAPGTLLGTSHTYVISGGTQDKTGAKRVDLLRGQKSDKVD 540

Query: 541 VTLRPEELEAMENVLPAKYEEAREEEKLRSQREDFSDMVAENEKKRKRKMQEKDGKSKKK 584
           VTLRPEELEAMENVLPAKYEEAREEEKLRSQREDFSDMVAENEKKRKRKMQEK+GKSKKK
Sbjct: 541 VTLRPEELEAMENVLPAKYEEAREEEKLRSQREDFSDMVAENEKKRKRKMQEKEGKSKKK 600

BLAST of Cp4.1LG07g08040 vs. NCBI nr
Match: gi|731439529|ref|XP_002270799.3| (PREDICTED: splicing factor 3B subunit 2 [Vitis vinifera])

HSP 1 Score: 927.9 bits (2397), Expect = 8.5e-267
Identity = 503/584 (86.13%), Positives = 542/584 (92.81%), Query Frame = 1

Query: 1   MTAEVISQPNGVVANGGNLDLNSNPKSGAAKKSRESERRRRRRKQKKNQKASKVKEAASG 60
           MTA+ +S PNGVV+N G+   NSN    + KKSRESERRRRRRKQKKN KASK  +A +G
Sbjct: 1   MTADTLSLPNGVVSN-GDPKQNSN---ASTKKSRESERRRRRRKQKKNSKASKAFDATAG 60

Query: 61  EDSDASGDNTKENDDLLQVVEKVEIEYVPEKAELDDSLDEEFRRVFEKFNFSDVVGVEEN 120
           +DSDA  D+ KEN+D  Q VEKVE+EYVPEKAELDD+ DEEFR++FEKF+F D+ G+EEN
Sbjct: 61  DDSDAGDDDAKENNDPQQAVEKVEVEYVPEKAELDDN-DEEFRKIFEKFSFHDIAGLEEN 120

Query: 121 ENKDESAQNAA-SKKSDSDSDDEELDNQQKEKGGLSNKKKKLQRRMKIAELKQICLRPDV 180
           + KDE+A  AA +KK+DSDS++EE D QQKEKGGLSNKKKKLQRRMKIAELKQIC RPDV
Sbjct: 121 DKKDETAPAAALNKKADSDSEEEEQDAQQKEKGGLSNKKKKLQRRMKIAELKQICSRPDV 180

Query: 181 VEIWDATAADPKLLVYLKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLPDFIAATGIE 240
           VE+WDATAADPKLLV+LKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLPDFIAATGIE
Sbjct: 181 VEVWDATAADPKLLVFLKSYRNTVPVPRHWCQKRKFLQGKRGIEKQPFQLPDFIAATGIE 240

Query: 241 KIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLTTLGDLYYEG 300
           KIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLT  GDLY+EG
Sbjct: 241 KIRQAYIEKEDSKKLKQKQRERMQPKMGKMDIDYQVLHDAFFKYQTKPKLTNHGDLYHEG 300

Query: 301 KEFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGPPPSYPDLKIPGLNAPIPP 360
           KEFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGPPPSYP LKIPGLNAPIPP
Sbjct: 301 KEFEVKLREMKPGMLSQELKEALGMPEGAPPPWLINMQRYGPPPSYPHLKIPGLNAPIPP 360

Query: 361 GASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQANYEEEPVDKTKHWGDLEEEEEEEV 420
           GASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQ NYEEEPVDKTKHWGDLEEEEEEE 
Sbjct: 361 GASFGYHPGGWGKPPVDEYGRPLYGDVFGVQQQEQPNYEEEPVDKTKHWGDLEEEEEEE- 420

Query: 421 EEEDEEEIEEEEMQDGIESVDSLSSTPTGVETPDVIDLRKQQRKEPDRPLYQVLEEKEEK 480
           EEE+EEEIEEEE++ GI+SVDSLSSTPTGVETPDVIDLRKQQRKEP+RPLYQVLEEKEEK
Sbjct: 421 EEEEEEEIEEEELEAGIQSVDSLSSTPTGVETPDVIDLRKQQRKEPERPLYQVLEEKEEK 480

Query: 481 VAPGTLLGTTHTYVISGGTQDKTGAKRVDLLRGQKSDKVDVTLRPEELEAMENVLPAKYE 540
           +APGTLLGTTHTYV++ GTQDKT AKRVDLLRGQK+DKVDVTL+PEELE +ENV+ AKYE
Sbjct: 481 IAPGTLLGTTHTYVVNTGTQDKTAAKRVDLLRGQKTDKVDVTLQPEELEVLENVVAAKYE 540

Query: 541 EAREEEKLRSQREDFSDMVAENEKKRKRKMQEKDGKSKKKDFKF 584
           EAREEEK RSQREDFSDMVAENEKKRKRKMQEK+GKSKKKDFKF
Sbjct: 541 EAREEEKQRSQREDFSDMVAENEKKRKRKMQEKEGKSKKKDFKF 578

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
SF3B2_HUMAN  3.6e-124  50.17         Splicing factor 3B subunit 2 OS=Homo sapiens GN=SF3B2 PE=1 SV=2
SA145_SCHPO  4.1e-88   41.53         Pre-mRNA-splicing factor sap145 OS=Schizosaccharomyces pombe (strain 972 / ATCC ...
CUS1_YEAST   3.9e-30   30.02         Cold sensitive U2 snRNA suppressor 1 OS=Saccharomyces cerevisiae (strain ATCC 20...

Match Name        E-value   Identity (%)  Description
A0A0A0LMT1_CUCSA  1.1e-297  95.20         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G075450 PE=4 SV=1
W9RNM3_9ROSA      1.4e-263  83.99         Splicing factor 3B subunit 2 OS=Morus notabilis GN=L484_012118 PE=4 SV=1
A0A067JLP2_JATCU  3.4e-262  83.31         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21406 PE=4 SV=1
A0A061GFX9_THECC  3.5e-259  83.31         Proline-rich spliceosome-associated family protein isoform 2 OS=Theobroma cacao ...
A0A151SEI3_CAJCA  4.5e-259  83.67         Splicing factor 3B subunit 2 OS=Cajanus cajan GN=KK1_024869 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT4G21660.2  3.1e-227  73.06         proline-rich spliceosome-associated (PSP) family protein
AT1G11520.1  1.2e-37   75.44         pliceosome associated protein-related

Match Name                        E-value   Identity (%)  Description
gi|659082396|ref|XP_008441818.1|  2.2e-299  95.88         PREDICTED: splicing factor 3B subunit 2 isoform X1 [Cucumis melo]
gi|449470216|ref|XP_004152814.1|  1.6e-297  95.20         PREDICTED: splicing factor 3B subunit 2 isoform X1 [Cucumis sativus]
gi|659082398|ref|XP_008441819.1|  6.7e-296  92.86         PREDICTED: splicing factor 3B subunit 2 isoform X2 [Cucumis melo]
gi|778667945|ref|XP_011649013.1|  8.1e-294  91.89         PREDICTED: splicing factor 3B subunit 2 isoform X2 [Cucumis sativus]
gi|731439529|ref|XP_002270799.3|  8.5e-267  86.13         PREDICTED: splicing factor 3B subunit 2 [Vitis vinifera]
The following terms have been associated with this gene:
Vocabulary: Cellular Component
Term        Definition
GO:0005634  nucleus
Vocabulary: INTERPRO
Term       Definition
IPR007180  DUF382
IPR006568  PSP_pro-rich
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005634      nucleus
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG07g08040.1  Cp4.1LG07g08040.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                    Source   Source Term    Source Description   Alignment
IPR006568  PSP, proline-rich                  PFAM     PF04046        PSP                  coord: 310..355; score: 1.8
IPR006568  PSP, proline-rich                  SMART    SM00581        testneu              coord: 306..359; score: 1.2
IPR007180  Domain of unknown function DUF382  PFAM     PF04037        DUF382               coord: 176..301; score: 8.7
None       No IPR available                   unknown  Coil           Coil                 coord: 411..433; score: -
None       No IPR available                   unknown  Coil           Coil                 coord: 144..171; score:
None       No IPR available                   PANTHER  PTHR12785      FAMILY NOT NAMED     coord: 15..583; score:
None       No IPR available                   PANTHER  PTHR12785:SF8  SUBFAMILY NOT NAMED  coord: 15..583; score: