Tan0009824 (gene) Snake gourd v1

Overview
Name: Tan0009824
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: U4/U6.U5 tri-snRNP-associated protein 2-like
Location: LG09: 46860841 .. 46870368 (-)
RNA-Seq Expression: Tan0009824
Synteny: Tan0009824
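
The Location field can be turned into a gene span length with a few lines of Python. This is a minimal sketch, not part of the database record: it assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state), and the function names `parse_location` and `span_length` are hypothetical.

```python
import re

# Location string as given in the Overview. The pattern below is an assumption
# about this page's "<chrom>: <start> .. <end> (<strand>)" layout.
LOCATION = "LG09: 46860841 .. 46870368 (-)"

def parse_location(text):
    """Split a location string into (chromosome, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location(LOCATION)
print(chrom, strand, span_length(start, end))  # LG09 - 9528
```

Under the inclusive-coordinate assumption the gene spans 9,528 bp on the minus strand of LG09, introns included.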
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CAAACATAGACTCAACTCAATTCAATTATATACTCTAAACATAAACTTCCTGCCTCAACTCAATTTTGCTCTCGAAACGAGACTTTTGAATTAACTCATAAACTAACTTTGTTCAACTCAAACTTGCCTCATCTCAAATTCTAAAGGGGAAAACCCCTAAGATTGCTTTGCAAAATAAAATAAAATTGCATAAAATAAAGAGGAAAACCCTAACTGCTCCACGCCGCTCCTTCTTCTTCACTCCCTCTCCTCCTCCCGCCGTGAGTCATCTGGCCGACGATTCTTCTTTCCTCGCCTAAATACAAAGAAACTACAGTTCAGTCGAAGTAAACAACTCCAGGGTTTGCAACTGGAAGAGGATTTCAGTTGTCCACGTTTTTTCTTCGTTTGAATCGATTGCTCTGCGTGAGCCGGAACGTTTGAGCTACATCAAAACATGCTGGAATTTGATAGACGACCAGCCAGATGAGAGAGAATAAAGAGTCAATAGAATGGGATCAAAGAGACGAGACGATAATGGGGTAGATGAGGAAGAGTTTGCTTCAGACTTAAAGAGGCAGAAATTACTAGGGGAGTTCTCATCATCTTCTTCTCCACCTGCCTCAGAGAACCCTCGGCTTCCCGGTTTTAACTATGGCGATGATGATGAAGAAGAAGATTATAAATTGAAACAAAATGGAAGTAGATATGATGGAGATAAAGGGGATGGCAATGATGATGAGGAAGATGATGAAGACCATGATGATGATGCGAATCATGTAAGGCGAAGCCGTGATGTTGAAGTTCGAAAAGATTGTCCTTATCTTGATACTGTTAACCGTCAGGTGCATTGCTTGTTTCTTATTTGTATAGTCGTTCTAAATTTCTGCATCTCAGGGTTAGGGCCATTTGTTACTGCAATGAGTTTGTTACCTAAGGTTTCTTTGTTTCTTATTTGTATATTCGTTCTGCATTTCTGCATCTCAGGGTTAGGGCCATTTGTTACTGCAATGAGTTTGTTACCTAAGTTTTCTTTACTTCCTGTCAATTCATTTTTGTTCATGCTTTTCAGGTTTTGGATTTCGATTTCGAGAAGTTTTGCTCTGTCTGTCTCTCGAATCTGAATGTTTATGCCTGCCTGGTATGTGGTAAGTACTATCAAGGGAGGGGGAAGAAGTCTCATGCTTACACTCACAGTCTTGAAGCAGGACACCACGTGTATATCAACCTTCGGACAGAGAAAGTTTACTGCCTTCCTGATGGATATGAGATTAATGACCCATCCTTAGATGATATTCGATATGTCCTGAATCCAAGGTTTGCTTATACTTGATTCATTATTTGGTTTTGCATACTATTAGAGCATGGTACACTAATGTTACATCTGCAATGTTGAAGGAAAATTTGTACTAGAGCCAACACAGCACTCAAAAAGAAAGAAAAGAAAGAAGAAGAAAACATGAGATATAAGAAAATCCTTGAGTTCTATGTTTGACACCTTTCATAGGCTGTCTTTAGCTAGGATACACATCCTCAAGCATTTTGGAACTTTATCATCTTTAAGACCTTTTTCTCTTGCAATCTAATATAGAGTTCTGGTTGGAATTTGCGAGCAGGGAAAAGAAAACTGAGAGTGATTTGGCTGTATGTTATTGGTCCTTTGAAAATCTAATTTTGAGTTTTATCACAATCTAACACAGTCTAAGAGTAGAGTTGCAGTTAATCTACTATTTACACTTTGGCACTATGAATTCAAGTAGCCCTATTATTGCCATTGTAAGAGTTAGAAACCATCTAAAAAGAGTTCAGGATGGCCCAGCTACGCCAAGAAAAAGAATAGAAGAAGCATCTGACTCCTTTGTTGCATAGAAATGGATTTATATTAAACTCTTACTTCTCTGATAATATCATTTTTGTGATTGCCTCCTGGTGTAGGACTGACAATTTATTTTGTTATCATCCTATTTCTGACTTCTGTGCTGATTGGAGGGCTAGTTGTAACTGTAACTTCCCTTTGTCTG
GGTTGTGGCTTTCCTCCCCAATCTTTTGTACATTTCATATCTTATTAAATTTCTTTTGAAAGAAAATACATCCACCTTACTCCATATTTTGATTTTCTCATTCAAGTTTTCAGTTTTTTCATGAGCTTAAAACTCATCTTTCTGATCTATCCACCTCCCAGGCCTTACCAATTGAGTCTAGATCTTGGGGCCAAGCGTTTTGAAATTTGAAAGGTGTTTGGCCCTGAAGAACACCAGTGAATCCAACAGAAATGGGGTGTCGTCTGAAATCATTCATCTTTTTCAAAACTTTGGAGTCTTTAAATATGTTTACCCATTCTCTACTTACTAAGAAGCAATCAACCAGGGACTCTATTGCCTACTCTCTTATATTAGACCAAGTGTAGTATCCATTACTGAAGGGAAGATCTACCAAATTCAGTTTTTCTATCAACTCACAAGAGCATCTTGTGCTTTTTGTTCTTCTAAGTACCACCCTGGGTCCCCGTGGGATGACGATCTGTGCCACCTTCTTTTGCTCTTACAGAGAAAGTAATGCATTTTATCTTTGATCAGTTGATGGGTTTTTTATAACCCATTATTTTGTATGAAATTTGATTTGATGGATCTACCACAACACAAACGTTGACAGTTCCATTATATAAATTATGTTTGCACTCATCATCTTTGGTAAAATAGGGATGTGAAATATTATCATGGGTCATTAAATCATGAAATCCCCAATACGAAATTGATGTTGGACAATGGTAGGAAGTTAATTGGGGAAAGATATGAGAGATCTTGAAGTGCTCATGGCTGTTGAAGCAAAATTGACATTAAAAGAAAAATTGAAGGGATCATTGTAGAATTCCCTACTGGACGTTAGTAGTCCACCTTGGTAATATCTCTATAAAAACTGTCATAACTTTTTTTCAGTTTTAGAGAAAAATTCATTGCATTTAAAAAAGAATAGTTCCTACTCATGTAAGATACTTATTAGACTTAAATGAGACTAGTTATTAAAAATTACCTTCAAACTTGAGCAAAATATGAATCATAAAATACTAAAGTAATAATAAGAATATAATAAATTCTAGTGCTTGACTATGAATAGATATCATACTGTATTTAATCAAATGTGCTTTAGTGATCCTTTTGGCACTATTTGCAGGTTTGCCAAAGAGCAGGTAAAGCAGCTTGACAAGAACAAGCAATGGTCTAGGGCACTTGATGGTTCTGATTACCTTCCTGGAATGGTAATTATGTTTTCATTTTATCTTGGTTATCTTATCTGTTAGATTGGCAACAAGGCCTAAATTTTGGTTTTGTTAAACCAGGTAGGGCTTAACAACATTAAGGAAACTGATTTTGTAAATGTGACAATTCAGTCCTTAATGAGGGTTACACCACTCAGGAACTTCTTCCTAATACCTGAGAACTATCAGCACTGCAGATCTCCCCTTGTCCACCGGTTTGGTGAACTCACACGTAAGATTTGGCATGCAAGAAACTTCAAAGGACAGGTTTGACTGTACTGTCATTGACATATTCTTCCCGATTGTGGTTGGGTTTCTACTTTTGAATTCATAGATTTTAAATCCTGAAGAATTTATAGGTAAGCCCGCATGAATTCCTGCAAGCAGTTATGAAGGCTAGTAAAAAACGTTTCCGAATAGGTGCGCAGTCAGATCCTGTTGAATTTATGTCATGGTTTCTTAATACACTTCATTCAGAATTGCGAATTTCAAAGAAAAGTAGCAGTATAATCTACGAATGTTTCCAGGTTCTCTTCAATTTCCTAGAGTAATGTTGTGAATTATGTTCTATTGGTACTGGTAAAATGAAACATTTTTAGTCTTATTGTTATTATTTTGATAGGGGGAATTGGAGGTTGTGAAAGAGATTCACTCGAAAGCTCTCACTGAGAAGAAAGAAAATGGTGACGATCAGGATGCTGGAAGTGAAGGCAGCAGTGTTATAATGGAAACATCCAGAATGCCATTCTTAATGCTTGGATTGGA
TTTGCCACCACCACCTCTTTTCAAAGACGTTATGGAGAAAAATATAATACCACAGGTAGGTGTAATTATGTTGCTGGTGTTCTTTAGCGCAGTAGGATACTCATACAACCCTTACTTCTTATTGTCTTTTGGAAATTTTCATTTATTGCTCAATTCAATATTTAATATTTATTTTGGGACAATTACAAACATTGACCATACTCAAGATAGCGAGGCTTTAGAAGAGAACATGGAAAGAACCTGAGGAAGACATCGCTAATTTGTTTGTTTATTGTGCTTGTACTTTCTTCTATTTGGTAATAATGGTTAGGGATTTGAGTTGGTTGGGCTGAAGAATCAAAGACCTTCTAATGCTAATGGTTGCTAGAGCACTCTAAAGACCTTCTCATGGTTGCTGCTAGTTAGGAGATCGAGACAACAAAAGGCTTGGGATTTTTGAAAATTAGAGAGGACTCCTTTTCTTTTTTCTTTTCCTTTCATAATGTAGGATATTACTGATTCTCAAGCTTCTTCTTCCTTTTCTTTTTTTTTTCTTTTTAAATTATTATTATTATTATTTCTTTTTGGTGAGAAATATCAAGGAAAATCTATATTAACAAAAGAAGGTGAACATCCTAAAGGTCTTAGGGATGAGGGACCTCTAAGCCAGAAGAGGAGGAATCAAAGAAAAGGCTGATAGAATCTGAATATATTAAAATGTGAAAGATAACCCAACTATAAGATAAATCAAGTAATTTGATGAACTTTGGACCCTTCCCTTTCTCGAGCACTCTCACTTATGCCCTAAATCAAAACTCACTGAATTAATGCCTACCTTCTCCCTATCTCACTCCCTCTATTTATAACAAAATATCATAACTCACTCCTTAATTATTTACTTTTATACCCTTAATACCAATCCTGTCATAGTTCTTTCTTCATCCTCCCATTCGTAAGAAAGTAAAGTTTTTGTGGCAAGTTAGGGTGTTTTCTAACTTGTGGGACTGTGGGGAAAGAGAAACATTATAATCTTTCAAGGGCATGAGCGGTCTTCGTATGACGTTTAGTCCCTGGCTAGATTCTATGCTTCTCTTTGGGCTTCAACGGTGAAGCCGTTTTGTAATCATCTTTTAGGTCTTATTTTAATTGATTGGAGACCCTTCTAGTAGTTAGTTGGTCTTCATCTTTTGTGGGCTCTCTTTTCGTATGCCTTTGTATTCTTTCATTCCTTCTCAAATCTCAATGAAGGCTCAGTTTCTTATAAAAAATGGTGTGTAAGCTGTAATGTGCCATAGACCATACAAAGAACTAACTTTCTTGGGACTGGTTTCTCTCCACATCAACTTTACTAAGTGGAGGTCTAGATGAGATTCCCCTGTTTGGAGGATCTTGGTATCTGACTTAACTCAGAACGTACCAGGTGTATCTTGGTTCCAATTTATAGAGTCTATAGAATGGTTTGAATGTATTCATTCAATGATCGTCATGAGCCTTTCCCATTCTTTAATCTCTCCTTCTGTATACTCTCTTCTAAAATGAAGACTCTAGGTTTTGTTTTATTTCGAGCAACAATCTTTAATGACAAAAGATCTAGAGTTTGCTACCATGTAAAGGGAAGGGAACATGGTGCTCAAAGGACTGGAAGTGATCCATCTCTCTTCCTAAAATCTTATTTTCCTTTCATTTCCCAGTTTGTAGCTGATGTTATTTTCAACGAAACTTCTAGTCTAGGGTTGTATTTTCGTTTGCAAATTTGAACATATTTGGGAGGAACCAATTGCTGTCCTTCTGACTATATTTTCTATTGACAATCAACTTCCTAAGGGCTGCATCTTTTTGAGTGTATTTCCAAACCCATTTCGTTAAAGGGGCAATATTCTGTTTTTCACCCTCTAAACTTGTCTATGCTCGGACCTCCCAATGATAGTGGAAATGTTTAAGTGGGGGAGGTTTGTTTACGAGGGTGTTTTTGAAATATGCGGCTAGGTGAGGCTAGGTCTGAGCAACACACAGATGGGAAG
AAGGTATGGTCGAAAGACATACCTTAGGTATTGATGTACAATCAGTAGCCCATGACCATGAGAATGCATGTTTTGCACAGTAATAGGTAGCATAGAGCTTGATAGTGGAATGTGGGTGGGCATAGGTAGATGCACCCCTACCAAAGAAACGGGCGCTCGTGGATCGACAACAGCAGTGGCGAGCATGCACCCAGACATACGCAACAAGCCATTGATGCTATTATCCACAAGCGCATGTTCCTATGGTATGGGCGTGTCTGCCTGTGCCCTTGCCCTTGTTCAATTGTTGCACTCTGCATGCTTGCCTGCATGCTTGCTTGCCAACCATGGTTGTAACAAACGTCCATGCCCATTTGTCATCCATGTGTTGCTCACGCCTAGCCTTGCCAACCGCGTATTTTGAAATGATGCTCGTAACAAACCTCCCTGACTTAAGCCTCCCGGCGTCTCGTTGAGCTAACCTCTATCAGCTTCTTGGACGAGTCTTTGGGCTAAATGTTTCTACCTAACACCTAGTTTCATGATCATATGCTAATCTTTACACGTAGAGTAAAATTAATACCATATTTTGAACTTTACAATATCATTTGCAAATTTTAGTTAAGACTAAAGCCTATTGAAGAAAAAAAAATCCCCAACTTGAACTTACGAAAGTTTCACATACTCTTCTCCCCCATTCTTAGTAGGCGTGATCCTACCCTGATCATGTGTCCCTCCTACCCTCCCCGTCCCAGGCCTATTGTGACTAGGGGTAGAATGAATTGCGTATTTCTCCCGTAGCTTCTTTTGGTGCTACCCTACCCAGATAAGTCTCTACGGTGTGGAAGAGATCACACAACCAACACCCAACTCTGCTATATGGACACGACCCTTGACTCTCCTTAGGATATTCCTCTTACTCATCGTTTTAATCATTTATTTTGGTGAGTTACGTGCTCATTAAGTTTCATCATGATTCCAGGAACCATGTTGAATGAATGTGTCATTCCTACGGAGTAGCGGTCAACAGCCACTTCCAACCTCTTATCCTTTGGTGATGGTTCTTGGCAATAAAACAACGTGCTTGATCCTGGTATTCCTTAATGCACTTGCATGCTCATACAACATGCTCACCAAGCGATTGTTTGATCATCTCACGTTCGAGTTTTGCAATGATTGCTTCCAATGATCCGTCTAACTGTACAAGATAAAAGGAACAAATTAATTTCCCACAAAGGATAATCTTCTTCAAGGTAAAAACCGCACTCTAATACCTACAGTTATAGGATAGCCCACGAGTGCTCCTCTTAGTCTTCACCATTCCTAGAAACTAGAGACAATATTTAGGAAGAACACAATATACCAACAAAAGTCAACACAATTGGCAAAGGGAGGATTTCCCAACTTTTACCTTTTTCTCTCAAACTGCTTTTGGATCACAACGGGAGCCTCTTTGCTTGTTTCCGATTCCTGGCAAGCAAATTTTTTGAAAAAGGCTTTAGGAAACTTACATGAAATATCAGGTGGATCTTCAATCTGTCTGGGAATTTCAAATGAAGTGCCCGTTCTTTTGATCACTTCAAAGGGTCTGTCATACCTCGACACCCCCTTGTTGACGATGCTCTTGATGATGTTTTTTCCATTTCTGTGCAAAGCCCAATGCTGAAGAAAATTTGTGTTCATATCTCCTGTCAGCCACCTCAGTTTACTTTTCTGAAGCCATAATCTCTGTTCCTTGATGATTAGATCTGTGTAAGCTGTCTTCAAGGGTTTTCTTTCAATTCTTTGAGCCTCATTTAGCTGCTTTTGAGTTTCTTCTAGAGAGTCAAGTTGATCTATCAAGGATAAGGTGTAATTTTTTCCTCCTTTTATATTCTCGAAGGAGTCTTTGGTCCACTAAGTTTTCTCTTAAGAAGTTTAAGCTTTCCATTTAGTTTTCCCATCTTTTTTAGTCTCGAAGGTGTCATTTTCAAGCTTCTCATCTTCAAATCTACAAGTTTTGTAAACTTTTTATTTTCTAA
AAGAAGATTATGTAATTACTCAGCTTTCTAATTTCCACGAGAAGGATTCCTTGTCTAAGAGGTAAAACTTGTTGACAATAAGAAGTAATTGTGCAACTTCTTATTAGCTGGTTTTGGGAGCTTTTGCTAATTTGAAGCTGGTTTCTAGTTCTTGGGCCTGGGTTTTGTCAGCTGGGTTTTCACCTTCATTTTCAGACATATTGGGGGCTATTTCTTTTTAGGTTCTCTGGCTATTTGTTTCATCTCAAGGCTTCTTCCTGATGTGATCTTTCCTTATCTTTTGAAGGCTATTCTTCTTATGTTTTTAGTTTGAATTGACGTGTCCTTTATATGTTTTGTTTTATTTTTTTTCTTTTATTCTCTCCTCCATGGAGTTTTTTGTATTTTGAGCATTTGTCTCTTTTTCATTATTTCGATGAATTCTTTATTTCCTTGTAAAAAAAAGTAATTATGCAACATCTTAGTGTATCCCAGGTGATCTAATTTCTGGAGGTTTGTTTGCATTTTTATTTTCTAGGTTCCACTCTTCAATATTTTGAAGAAATTTGATGGTGAAACTATCACAGAAGTTGTCCGTCCACGTATAGCAAGAATGCGCTACCGTGTCATTCGATTGCCTCAATATTTAATTCTTCATATGCGACGATTTACAAAGAACAACTTTTTTGTGGAGAAAAATCCCACATTAGGTAGTTATTAGATTGCATTTTATTTGTTAAAAGAATAATGCAGAATGAAACCATCTTAATCTAATAATTAACATTGTGATCTCAGTGAACTTTCCTGTCAAGAATCTGGAATTGAAGGATTACATCCCCCTGCCAACACCTAAAGAGAATGAAAAATTGTGTTCAAAGTACGATTTGATTGCAAATATTGTTCATGATGGCAAACCGGACGAAGGGTACTACAGGGTATTTGTACAGAGGAAGTCGGAAGAATTATGGTAATCTTCTTTCCCTTTTTGTAAATGGTACCCACCCAAGGATCTAGAATTTGTCTGATTCATTGCAATCCAATGTTGACATTACAGGTACGAGATGCAGGATCTTCATGTCTCAGAAACACTTCCTCAGATGGTTGCTCTCTCTGAGGCTTATATGCAAATATATGAACGGCAGCAATAGATAGGAAGTTCGATCTCTGCATCTAACTGCGTAGTTGGTTTATTTCCCCTCTAGAACATATGTATCTGTACCTAAACAGGAGATGGGAGAAGATGACGTTTTGAAATGGTGGATATAAAGGAAACTTATTCATTCAGCATATTTGAAGCTGAAGCTGTGATTGAGGATAATATTGCTTATTGCTGCTAATATCTTTGGTAAATTGTAGAATCAATTGTCATTTTAAACCTACTTTGAAAAGTTAAACTGAAAAAGGCAGGTGAAATTTTGTTGCCTAAATTACTTCAAAATCTCTGTGTTGGGGCAACTTAATCGTTTGGAAGACTATTTGCAGAGTTGCCAGTGCGTCTTAGAGCTTTGATTTCATGGTAGCATTCAACTAAAGCTTAATTTTGAATTTC

mRNA sequence

CAAACATAGACTCAACTCAATTCAATTATATACTCTAAACATAAACTTCCTGCCTCAACTCAATTTTGCTCTCGAAACGAGACTTTTGAATTAACTCATAAACTAACTTTGTTCAACTCAAACTTGCCTCATCTCAAATTCTAAAGGGGAAAACCCCTAAGATTGCTTTGCAAAATAAAATAAAATTGCATAAAATAAAGAGGAAAACCCTAACTGCTCCACGCCGCTCCTTCTTCTTCACTCCCTCTCCTCCTCCCGCCGTGAGTCATCTGGCCGACGATTCTTCTTTCCTCGCCTAAATACAAAGAAACTACAGTTCAGTCGAAGTAAACAACTCCAGGGTTTGCAACTGGAAGAGGATTTCAGTTGTCCACGTTTTTTCTTCGTTTGAATCGATTGCTCTGCGTGAGCCGGAACGTTTGAGCTACATCAAAACATGCTGGAATTTGATAGACGACCAGCCAGATGAGAGAGAATAAAGAGTCAATAGAATGGGATCAAAGAGACGAGACGATAATGGGGTAGATGAGGAAGAGTTTGCTTCAGACTTAAAGAGGCAGAAATTACTAGGGGAGTTCTCATCATCTTCTTCTCCACCTGCCTCAGAGAACCCTCGGCTTCCCGGTTTTAACTATGGCGATGATGATGAAGAAGAAGATTATAAATTGAAACAAAATGGAAGTAGATATGATGGAGATAAAGGGGATGGCAATGATGATGAGGAAGATGATGAAGACCATGATGATGATGCGAATCATGTAAGGCGAAGCCGTGATGTTGAAGTTCGAAAAGATTGTCCTTATCTTGATACTGTTAACCGTCAGGTTTTGGATTTCGATTTCGAGAAGTTTTGCTCTGTCTGTCTCTCGAATCTGAATGTTTATGCCTGCCTGGTATGTGGTAAGTACTATCAAGGGAGGGGGAAGAAGTCTCATGCTTACACTCACAGTCTTGAAGCAGGACACCACGTGTATATCAACCTTCGGACAGAGAAAGTTTACTGCCTTCCTGATGGATATGAGATTAATGACCCATCCTTAGATGATATTCGATATGTCCTGAATCCAAGGTTTGCCAAAGAGCAGGTAAAGCAGCTTGACAAGAACAAGCAATGGTCTAGGGCACTTGATGGTTCTGATTACCTTCCTGGAATGGTAGGGCTTAACAACATTAAGGAAACTGATTTTGTAAATGTGACAATTCAGTCCTTAATGAGGGTTACACCACTCAGGAACTTCTTCCTAATACCTGAGAACTATCAGCACTGCAGATCTCCCCTTGTCCACCGGTTTGGTGAACTCACACGTAAGATTTGGCATGCAAGAAACTTCAAAGGACAGGTAAGCCCGCATGAATTCCTGCAAGCAGTTATGAAGGCTAGTAAAAAACGTTTCCGAATAGGTGCGCAGTCAGATCCTGTTGAATTTATGTCATGGTTTCTTAATACACTTCATTCAGAATTGCGAATTTCAAAGAAAAGTAGCAGTATAATCTACGAATGTTTCCAGGGGGAATTGGAGGTTGTGAAAGAGATTCACTCGAAAGCTCTCACTGAGAAGAAAGAAAATGGTGACGATCAGGATGCTGGAAGTGAAGGCAGCAGTGTTATAATGGAAACATCCAGAATGCCATTCTTAATGCTTGGATTGGATTTGCCACCACCACCTCTTTTCAAAGACGTTATGGAGAAAAATATAATACCACAGGTTCCACTCTTCAATATTTTGAAGAAATTTGATGGTGAAACTATCACAGAAGTTGTCCGTCCACGTATAGCAAGAATGCGCTACCGTGTCATTCGATTGCCTCAATATTTAATTCTTCATATGCGACGATTTACAAAGAACAACTTTTTTGTGGAGAAAAATCCCACATTAGTGAACTTTCCTGTCAAGAATCTGGAATTGAAGGATTACATCCCCCTGCCAACACCTAAAGAGAATGAAAAATTGTGTTCAAAGTACGATTTGATTGCAAATATTGTTCATGATGGCAAACCG
GACGAAGGGTACTACAGGGTATTTGTACAGAGGAAGTCGGAAGAATTATGGTACGAGATGCAGGATCTTCATGTCTCAGAAACACTTCCTCAGATGGTTGCTCTCTCTGAGGCTTATATGCAAATATATGAACGGCAGCAATAGATAGGAAGTTCGATCTCTGCATCTAACTGCGTAGTTGGTTTATTTCCCCTCTAGAACATATGTATCTGTACCTAAACAGGAGATGGGAGAAGATGACGTTTTGAAATGGTGGATATAAAGGAAACTTATTCATTCAGCATATTTGAAGCTGAAGCTGTGATTGAGGATAATATTGCTTATTGCTGCTAATATCTTTGGTAAATTGTAGAATCAATTGTCATTTTAAACCTACTTTGAAAAGTTAAACTGAAAAAGGCAGGTGAAATTTTGTTGCCTAAATTACTTCAAAATCTCTGTGTTGGGGCAACTTAATCGTTTGGAAGACTATTTGCAGAGTTGCCAGTGCGTCTTAGAGCTTTGATTTCATGGTAGCATTCAACTAAAGCTTAATTTTGAATTTC

Coding sequence (CDS)

ATGGGATCAAAGAGACGAGACGATAATGGGGTAGATGAGGAAGAGTTTGCTTCAGACTTAAAGAGGCAGAAATTACTAGGGGAGTTCTCATCATCTTCTTCTCCACCTGCCTCAGAGAACCCTCGGCTTCCCGGTTTTAACTATGGCGATGATGATGAAGAAGAAGATTATAAATTGAAACAAAATGGAAGTAGATATGATGGAGATAAAGGGGATGGCAATGATGATGAGGAAGATGATGAAGACCATGATGATGATGCGAATCATGTAAGGCGAAGCCGTGATGTTGAAGTTCGAAAAGATTGTCCTTATCTTGATACTGTTAACCGTCAGGTTTTGGATTTCGATTTCGAGAAGTTTTGCTCTGTCTGTCTCTCGAATCTGAATGTTTATGCCTGCCTGGTATGTGGTAAGTACTATCAAGGGAGGGGGAAGAAGTCTCATGCTTACACTCACAGTCTTGAAGCAGGACACCACGTGTATATCAACCTTCGGACAGAGAAAGTTTACTGCCTTCCTGATGGATATGAGATTAATGACCCATCCTTAGATGATATTCGATATGTCCTGAATCCAAGGTTTGCCAAAGAGCAGGTAAAGCAGCTTGACAAGAACAAGCAATGGTCTAGGGCACTTGATGGTTCTGATTACCTTCCTGGAATGGTAGGGCTTAACAACATTAAGGAAACTGATTTTGTAAATGTGACAATTCAGTCCTTAATGAGGGTTACACCACTCAGGAACTTCTTCCTAATACCTGAGAACTATCAGCACTGCAGATCTCCCCTTGTCCACCGGTTTGGTGAACTCACACGTAAGATTTGGCATGCAAGAAACTTCAAAGGACAGGTAAGCCCGCATGAATTCCTGCAAGCAGTTATGAAGGCTAGTAAAAAACGTTTCCGAATAGGTGCGCAGTCAGATCCTGTTGAATTTATGTCATGGTTTCTTAATACACTTCATTCAGAATTGCGAATTTCAAAGAAAAGTAGCAGTATAATCTACGAATGTTTCCAGGGGGAATTGGAGGTTGTGAAAGAGATTCACTCGAAAGCTCTCACTGAGAAGAAAGAAAATGGTGACGATCAGGATGCTGGAAGTGAAGGCAGCAGTGTTATAATGGAAACATCCAGAATGCCATTCTTAATGCTTGGATTGGATTTGCCACCACCACCTCTTTTCAAAGACGTTATGGAGAAAAATATAATACCACAGGTTCCACTCTTCAATATTTTGAAGAAATTTGATGGTGAAACTATCACAGAAGTTGTCCGTCCACGTATAGCAAGAATGCGCTACCGTGTCATTCGATTGCCTCAATATTTAATTCTTCATATGCGACGATTTACAAAGAACAACTTTTTTGTGGAGAAAAATCCCACATTAGTGAACTTTCCTGTCAAGAATCTGGAATTGAAGGATTACATCCCCCTGCCAACACCTAAAGAGAATGAAAAATTGTGTTCAAAGTACGATTTGATTGCAAATATTGTTCATGATGGCAAACCGGACGAAGGGTACTACAGGGTATTTGTACAGAGGAAGTCGGAAGAATTATGGTACGAGATGCAGGATCTTCATGTCTCAGAAACACTTCCTCAGATGGTTGCTCTCTCTGAGGCTTATATGCAAATATATGAACGGCAGCAATAG

Protein sequence

MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
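
The relationship between the coding sequence and the protein sequence can be spot-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly; the `translate` helper is hypothetical. Only the first ten codons and the last five codons of the CDS are used here, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGGATCAAAGAGACGAGACGATAATGGG"  # first 30 nt of the CDS
cds_end = "GAACGGCAGCAATAG"                   # last 15 nt of the CDS
print(translate(cds_start))  # MGSKRRDDNG
print(translate(cds_end))    # ERQQ
```

Both fragments agree with the corresponding ends of the protein sequence above (it begins MGSKRRDDNG and ends ERQQ). Passing the full CDS string to `translate` would allow the same comparison over the whole protein.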
Homology
BLAST of Tan0009824 vs. ExPASy Swiss-Prot
Match: Q53GS9 (U4/U6.U5 tri-snRNP-associated protein 2 OS=Homo sapiens OX=9606 GN=USP39 PE=1 SV=2)

HSP 1 Score: 473.4 bits (1217), Expect = 3.5e-132
Identity = 251/485 (51.75%), Positives = 328/485 (67.63%), Query Frame = 0

Query: 76  DEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLV 135
           DE+ + + +  A + R   +    + CPYLDT+NR VLDFDFEK CS+ LS++N YACLV
Sbjct: 79  DEDSEPEREVRAKNGRVDSEDRRSRHCPYLDTINRSVLDFDFEKLCSISLSHINAYACLV 138

Query: 136 CGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFA 195
           CGKY+QGRG KSHAY HS++  HHV++NL T K YCLPD YEI D SL+DI YVL P F 
Sbjct: 139 CGKYFQGRGLKSHAYIHSVQFSHHVFLNLHTLKFYCLPDNYEIIDSSLEDITYVLKPTFT 198

Query: 196 KEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPEN 255
           K+Q+  LDK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL  +N
Sbjct: 199 KQQIANLDKQAKLSRAYDGTTYLPGIVGLNNIKANDYANAVLQALSNVPPLRNYFLEEDN 258

Query: 256 YQHCRSP-------LVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSD 315
           Y++ + P       LV RFGEL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D
Sbjct: 259 YKNIKRPPGDIMFLLVQRFGELMRKLWNPRNFKAHVSPHEMLQAVVLCSKKTFQITKQGD 318

Query: 316 PVEFMSWFLNTLHSEL-RISKKSSSIIYECFQGELEVV--KEIHSKALTEKKENGDDQDA 375
            V+F+SWFLN LHS L    KK  +I+ + FQG + +   K  H     E+KE     D 
Sbjct: 319 GVDFLSWFLNALHSALGGTKKKKKTIVTDVFQGSMRIFTKKLPHPDLPAEEKEQLLHND- 378

Query: 376 GSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVR 435
             E    ++E++   F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E   
Sbjct: 379 --EYQETMVEST---FMYLTLDLPTAPLYKDEKEQLIIPQVPLFNILAKFNGITEKEYKT 438

Query: 436 PRIARM-RYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEN 495
            +   + R+++ +LP YLI  ++RFTKNNFFVEKNPT+VNFP+ N++L++Y+       +
Sbjct: 439 YKENFLKRFQLTKLPPYLIFCIKRFTKNNFFVEKNPTIVNFPITNVDLREYLSEEVQAVH 498

Query: 496 EKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQ 550
           +   + YDLIANIVHDGKP EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+Q
Sbjct: 499 KN--TTYDLIANIVHDGKPSEGSYRIHVLHHGTGKWYELQDLQVTDILPQMITLSEAYIQ 555
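
The percentages in an HSP summary line are the matched-column counts divided by the alignment length. The following sketch recomputes them for the first hit, using the numbers from its summary line; the regular expression and the helper names `parse_hsp_stats` and `percent` are illustrative assumptions, not part of any BLAST tooling.

```python
import re

# Matches "<label> = <matched>/<length> (<percent>%)"; fields without a
# fraction, such as "Query Frame = 0", are skipped.
HSP_STATS = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")

def parse_hsp_stats(line):
    """Return {label: (matched, alignment_length, reported_percent)}."""
    stats = {}
    for label, matched, length, reported in HSP_STATS.findall(line):
        stats[label] = (int(matched), int(length), float(reported))
    return stats

def percent(matched, length):
    """Matched columns as a percentage of alignment length, to 2 decimals."""
    return round(100.0 * matched / length, 2)

line = "Identity = 251/485 (51.75%), Positives = 328/485 (67.63%), Query Frame = 0"
for label, (matched, length, reported) in parse_hsp_stats(line).items():
    print(label, percent(matched, length), reported)
```

The alignment length (485 here) counts gap columns as well, so it can exceed the number of query residues covered by the HSP.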

BLAST of Tan0009824 vs. ExPASy Swiss-Prot
Match: Q5R761 (U4/U6.U5 tri-snRNP-associated protein 2 OS=Pongo abelii OX=9601 GN=USP39 PE=2 SV=1)

HSP 1 Score: 471.9 bits (1213), Expect = 1.0e-131
Identity = 251/485 (51.75%), Positives = 327/485 (67.42%), Query Frame = 0

Query: 76  DEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLV 135
           DE+ + + +  A + R   +    + CPYLDT+NR VLDFDFEK CS+  S++N YACLV
Sbjct: 79  DEDSEPEREVRAKNGRVDSEDRRSRHCPYLDTINRSVLDFDFEKLCSISPSHVNAYACLV 138

Query: 136 CGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFA 195
           CGKY+QGRG KSHAY HS++  HHV++NL T K YCLPD YEI D SL+DI YVL P F 
Sbjct: 139 CGKYFQGRGLKSHAYIHSVQFSHHVFLNLHTLKFYCLPDNYEIIDSSLEDITYVLKPTFT 198

Query: 196 KEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPEN 255
           K+Q+  LDK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL  +N
Sbjct: 199 KQQIANLDKQAKLSRAYDGTTYLPGIVGLNNIKANDYANAVLQALSNVPPLRNYFLEEDN 258

Query: 256 YQHCRSP-------LVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSD 315
           Y++ + P       LV RFGEL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D
Sbjct: 259 YKNIKRPPGDIMFLLVQRFGELMRKLWNPRNFKAHVSPHEMLQAVVLCSKKTFQITKQGD 318

Query: 316 PVEFMSWFLNTLHSEL-RISKKSSSIIYECFQGELEVV--KEIHSKALTEKKENGDDQDA 375
            V+F+SWFLN LHS L    KK  +I+ + FQG + +   K  H     E+KE     D 
Sbjct: 319 GVDFLSWFLNALHSALGGTKKKKKTIVTDVFQGSMRIFTKKLPHPDLPAEEKEQLLHND- 378

Query: 376 GSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVR 435
             E    ++E++   F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E   
Sbjct: 379 --EYQETMVEST---FMYLTLDLPTAPLYKDEKEQLIIPQVPLFNILAKFNGITEKEYKT 438

Query: 436 PRIARM-RYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEN 495
            +   + R+++ +LP YLI  ++RFTKNNFFVEKNPT+VNFP+ N++L++Y+       +
Sbjct: 439 YKENFLKRFQLTKLPPYLIFCIKRFTKNNFFVEKNPTIVNFPITNVDLREYLSEEVQAVH 498

Query: 496 EKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQ 550
           E   + YDLIANIVHDGKP EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+Q
Sbjct: 499 EN--TTYDLIANIVHDGKPSEGSYRIHVLHHGTGKWYELQDLQVTDILPQMITLSEAYIQ 555

BLAST of Tan0009824 vs. ExPASy Swiss-Prot
Match: Q3TIX9 (U4/U6.U5 tri-snRNP-associated protein 2 OS=Mus musculus OX=10090 GN=Usp39 PE=1 SV=2)

HSP 1 Score: 470.7 bits (1210), Expect = 2.3e-131
Identity = 252/491 (51.32%), Positives = 330/491 (67.21%), Query Frame = 0

Query: 70  KGDGNDDEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVCLSNLN 129
           K +   DE+ + + +  A + R   +    + CPYLDT+NR VLDFDFEK CS+ LS++N
Sbjct: 72  KREREADEDSEPEREVRAKNGRVDSEDRRSRHCPYLDTINRSVLDFDFEKLCSISLSHIN 131

Query: 130 VYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYV 189
            YACLVCGKY+QGRG KSHAY HS++  HHV++NL T K YCLPD YEI D SL+DI YV
Sbjct: 132 AYACLVCGKYFQGRGLKSHAYIHSVQFSHHVFLNLHTLKFYCLPDNYEIIDSSLEDITYV 191

Query: 190 LNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNF 249
           L P F K+Q+  LDK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+
Sbjct: 192 LKPTFTKQQIANLDKQAKLSRAYDGTTYLPGIVGLNNIKANDYANAVLQALSNVPPLRNY 251

Query: 250 FLIPENYQHCRSP-------LVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFR 309
           FL  +NY++ + P       LV RFGEL RK+W+ RNFK  VSPHE LQAV+  SKK F+
Sbjct: 252 FLEEDNYKNIKRPPGDIMFLLVQRFGELMRKLWNPRNFKAHVSPHEMLQAVVLCSKKTFQ 311

Query: 310 IGAQSDPVEFMSWFLNTLHSEL-RISKKSSSIIYECFQGELEVV--KEIHSKALTEKKEN 369
           I  Q D V+F+SWFLN LHS L    KK  +I+ + FQG + +   K  H     E+KE 
Sbjct: 312 ITKQGDGVDFLSWFLNALHSALGGTKKKKKTIVNDVFQGSMRIFTKKLPHPDLPAEEKEQ 371

Query: 370 GDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGET 429
               D   E    ++E++   F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T
Sbjct: 372 LLHND---EYQETMVEST---FMYLTLDLPTAPLYKDEKEQLIIPQVPLFNILAKFNGIT 431

Query: 430 ITEVVRPRIARM-RYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPL 489
             E    +   + R+++ +LP YLI  ++RFTKNNFFVEKNPT+VNFP+ N++L++Y+  
Sbjct: 432 EKEYKTYKENFLKRFQLTKLPPYLIFCIKRFTKNNFFVEKNPTIVNFPITNVDLREYLSE 491

Query: 490 PTPKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVAL 549
                ++   + YDLIANIVHDGKP EG YR+ V       WYE+QDL V++ LPQM+ L
Sbjct: 492 EVQAVHKN--TTYDLIANIVHDGKPSEGSYRIHVLHHGTGKWYELQDLQVTDILPQMITL 551

BLAST of Tan0009824 vs. ExPASy Swiss-Prot
Match: Q9USR2 (Probable mRNA-splicing protein ubp10 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=ubp10 PE=3 SV=1)

HSP 1 Score: 372.5 bits (955), Expect = 8.4e-102
Identity = 212/504 (42.06%), Positives = 303/504 (60.12%), Query Frame = 0

Query: 54  EEDYKLKQNGSRYDGDKGDGNDDEEDDED-HDDDANHVRRSRDVEVRKDCPYLDTVNRQV 113
           EED  +  NG R   + G      +D ED HD  +  +       +     YLDT+NR++
Sbjct: 16  EEDNNI-DNGKRKKLELG------KDMEDVHDIASKEMEEHETTPIISQNLYLDTINRKL 75

Query: 114 LDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCL 173
           LDFDFEK CSV L+NL+VYACLVCG+Y+QGRG  SHAY H+L   HHV++N  T K Y L
Sbjct: 76  LDFDFEKVCSVSLTNLSVYACLVCGRYFQGRGPSSHAYFHALTENHHVFVNCSTLKFYVL 135

Query: 174 PDGYEINDPSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDF 233
           P+ Y++   +L DI YV+ P F K +V++LD   Q S  L    Y+PG VG+NNIK  D+
Sbjct: 136 PESYQVESSALQDIAYVMRPTFTKLEVQRLDHTPQLSYDLMLKPYVPGFVGMNNIKNNDY 195

Query: 234 VNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQA 293
            NV I  L  V P RN+FL+ +N+ +C   LV R   L RK+W+ + FK  VSP E +Q 
Sbjct: 196 FNVVIHMLAHVKPFRNYFLL-KNFDNC-PQLVQRLAILIRKLWNHKAFKSHVSPQELIQE 255

Query: 294 VMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRISK----KSSSIIYECFQGELEVVKEI 353
           V   S K++ I  Q DPVEF+SWFLNTLH+ L   K    K +SI++  FQG +     I
Sbjct: 256 VTVLSHKKYSINEQKDPVEFLSWFLNTLHNCLGGKKSTIAKPTSIVHYSFQGFV----RI 315

Query: 354 HSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPL 413
            S+ + +  E G  +     G  VI +T+ +PFL L LDLPP P+F+D  E NIIPQV L
Sbjct: 316 ESQKIRQHAEKG--EQVVFTGDRVI-QTNVVPFLYLTLDLPPKPIFQDEFEGNIIPQVEL 375

Query: 414 FNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVK 473
             IL K++G    E+      R R+ ++  P Y I H++RF KNN+F E+N T+V FP+ 
Sbjct: 376 KEILNKYNGVHTQELAG---MRRRFHLMTAPPYFIFHIKRFMKNNYFTERNQTIVTFPLD 435

Query: 474 NLELKDYIPLPTPKENEKLCSKYDLIANIVHD----GKPDEGYYRVFVQRKSEELWYEMQ 533
           + ++  +I     + N K+ +KY+L+ANI+H+     + +   +R+ ++  S   WY++Q
Sbjct: 436 DFDMSPFIDDSFIQSNPKISTKYNLVANIIHESVTHAEEEFHNFRIQIRNPSTNKWYQIQ 495

Query: 534 DLHVSETLPQMVALSEAYMQIYER 549
           DL+V E    M+ L E+++Q++ER
Sbjct: 496 DLYVEEISSDMIRLGESFIQLWER 500

BLAST of Tan0009824 vs. ExPASy Swiss-Prot
Match: P43589 (Pre-mRNA-splicing factor SAD1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=SAD1 PE=1 SV=1)

HSP 1 Score: 181.4 bits (459), Expect = 2.7e-44
Identity = 136/466 (29.18%), Positives = 220/466 (47.21%), Query Frame = 0

Query: 104 YLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYIN 163
           YL+TV R+ LDFD EK C + LS LNVY CLVCG YYQGR +KS A+ HS++  HHV++N
Sbjct: 31  YLETVVREKLDFDSEKICCITLSPLNVYCCLVCGHYYQGRHEKSPAFIHSIDENHHVFLN 90

Query: 164 LRTEKVYCLPDGYEINDPS----LDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLP 223
           L + K Y LP   +I        L+ I++   P +     K L+   +    L    YL 
Sbjct: 91  LTSLKFYMLPQNVQILHDGEVQLLNSIKFAAYPTYCP---KDLEDFPRQCFDLSNRTYLN 150

Query: 224 GMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARN 283
           G +G  N    D+ +  +  +  + P+R+ FL+  N+   +   + R     +KIW  + 
Sbjct: 151 GFIGFTNAATYDYAHSVLLLISHMVPVRDHFLL--NHFDNQGEFIKRLSICVKKIWSPKL 210

Query: 284 FKGQVSPHEFLQAVMKASKKRFRIGAQSDPVE---FMSWFLNTLHSELRISKKSSSIIYE 343
           FK  +S  +F+      S  + R G   +P++   F+ W  N + S    S    SI+  
Sbjct: 211 FKHHLSVDDFV------SYLKVREGLNLNPIDPRLFLLWLFNKICSS---SNDLKSILNH 270

Query: 344 CFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKD 403
             +G++++ K        E K    +   G     VI++    PF +L LDLP    F+D
Sbjct: 271 SCKGKVKIAK-------VENKPEASESVTG----KVIVK----PFWVLTLDLPEFSPFED 330

Query: 404 VMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFV 463
               + +PQ+ +  +L KF         R       + + RLPQ+LI H  RF +N+   
Sbjct: 331 GNSVDDLPQINITKLLTKFTKS------RSSSTSTVFELTRLPQFLIFHFNRFDRNS--- 390

Query: 464 EKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVH--------DGK----PD 523
                  + PVKN   ++   +    E E L  KY L AN+VH        DG      +
Sbjct: 391 -------DHPVKN---RNQTLVEFSSELEILHVKYRLKANVVHVVIKQPSTDGNAFNGDE 448

Query: 524 EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ 551
           + ++   +     E W E+  ++ +E   +++ L E ++Q++E+Q+
Sbjct: 451 KSHWITQLYDNKSEKWIEIDGINTTEREAELLFLKETFIQVWEKQE 448

BLAST of Tan0009824 vs. NCBI nr
Match: XP_022142503.1 (U4/U6.U5 tri-snRNP-associated protein 2-like [Momordica charantia])

HSP 1 Score: 1047.7 bits (2708), Expect = 3.4e-302
Identity = 525/550 (95.45%), Positives = 532/550 (96.73%), Query Frame = 0

Query: 1   MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLK 60
           MGSKRRDDN VDEEE A ++KRQKLLGEFS SS PPASENPRLPGFNYGDDDEEEDYK K
Sbjct: 1   MGSKRRDDNAVDEEELAPEIKRQKLLGEFSPSSPPPASENPRLPGFNYGDDDEEEDYKFK 60

Query: 61  QNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGSR  GD GD NDDEEDDE +DDDANHV+RSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 61  QNGSRNGGDGGDDNDDEEDDE-YDDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120

Query: 121 CSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180

Query: 181 PSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRF KEQV+ LDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 181 PSLDDIRYVLNPRFTKEQVELLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240

Query: 241 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300
           MRVTPLRNFFLIPENYQHC+SPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Sbjct: 241 MRVTPLRNFFLIPENYQHCKSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENG 360
           FRIGAQSDPVEFMSWFLNTLHSELR+SKKSSSIIYECFQGELEVVKEIHSKALTEKKENG
Sbjct: 301 FRIGAQSDPVEFMSWFLNTLHSELRMSKKSSSIIYECFQGELEVVKEIHSKALTEKKENG 360

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420
           DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGE I
Sbjct: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGEAI 420

Query: 421 TEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRPRIARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 421 TEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480

Query: 481 PKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PKENEKL SKYDLIANIVHDGKPDEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 481 PKENEKLRSKYDLIANIVHDGKPDEGCYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 541 AYMQIYERQQ 549

BLAST of Tan0009824 vs. NCBI nr
Match: XP_038894256.1 (U4/U6.U5 tri-snRNP-associated protein 2-like isoform X2 [Benincasa hispida] >XP_038894257.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X2 [Benincasa hispida])

HSP 1 Score: 1047.0 bits (2706), Expect = 5.8e-302
Identity = 520/550 (94.55%), Positives = 536/550 (97.45%), Query Frame = 0

Query: 1   MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLK 60
           MGSKRR+++ +DEEE   DLKR KLLGE  S SSPPASENP+LPGFNYGDD+EEE+YK K
Sbjct: 1   MGSKRRNNSLLDEEELGPDLKRHKLLGEV-SPSSPPASENPQLPGFNYGDDEEEEEYKFK 60

Query: 61  QNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGSRYDGD+GD NDDEEDDE+HDDDANHV+RSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 61  QNGSRYDGDEGDDNDDEEDDEEHDDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120

Query: 121 CSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180

Query: 181 PSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRFAKEQV+QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 181 PSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240

Query: 241 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300
           MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Sbjct: 241 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENG 360
           FRIGAQSDPVEFMSWFLNTLHSELRI+KKSSSIIYECFQGELEVVKEIHSKAL EKKENG
Sbjct: 301 FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENG 360

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420
           DDQDAG+E SSV+METSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI
Sbjct: 361 DDQDAGTEDSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420

Query: 421 TEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRPRIARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 421 TEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480

Query: 481 PKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PK+NEKLCSKYDLIANIVHDGKP+EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 481 PKDNEKLCSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 541 AYMQIYERQQ 549

BLAST of Tan0009824 vs. NCBI nr
Match: XP_038894254.1 (U4/U6.U5 tri-snRNP-associated protein 2-like isoform X1 [Benincasa hispida] >XP_038894255.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X1 [Benincasa hispida])

HSP 1 Score: 1047.0 bits (2706), Expect = 5.8e-302
Identity = 520/550 (94.55%), Positives = 536/550 (97.45%), Query Frame = 0

Query: 1   MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLK 60
           MGSKRR+++ +DEEE   DLKR KLLGE  S SSPPASENP+LPGFNYGDD+EEE+YK K
Sbjct: 47  MGSKRRNNSLLDEEELGPDLKRHKLLGEV-SPSSPPASENPQLPGFNYGDDEEEEEYKFK 106

Query: 61  QNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGSRYDGD+GD NDDEEDDE+HDDDANHV+RSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 107 QNGSRYDGDEGDDNDDEEDDEEHDDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 166

Query: 121 CSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 167 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 226

Query: 181 PSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRFAKEQV+QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 227 PSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 286

Query: 241 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300
           MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Sbjct: 287 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 346

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENG 360
           FRIGAQSDPVEFMSWFLNTLHSELRI+KKSSSIIYECFQGELEVVKEIHSKAL EKKENG
Sbjct: 347 FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENG 406

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420
           DDQDAG+E SSV+METSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI
Sbjct: 407 DDQDAGTEDSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 466

Query: 421 TEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRPRIARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 467 TEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 526

Query: 481 PKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PK+NEKLCSKYDLIANIVHDGKP+EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 527 PKDNEKLCSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 586

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 587 AYMQIYERQQ 595

BLAST of Tan0009824 vs. NCBI nr
Match: XP_011655023.1 (U4/U6.U5 tri-snRNP-associated protein 2 [Cucumis sativus] >KGN65053.1 hypothetical protein Csa_013002 [Cucumis sativus])

HSP 1 Score: 1031.2 bits (2665), Expect = 3.3e-297
Identity = 512/550 (93.09%), Positives = 532/550 (96.73%), Query Frame = 0

Query: 1   MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLK 60
           MGSKRR ++ +DEEE   DLKR KLLGE S SSSPPASENP+LPGFNYGDDDEEED+K K
Sbjct: 1   MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFK 60

Query: 61  QNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGS+YDGD+GD NDDEEDDE++D++ N V+RSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 61  QNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120

Query: 121 CSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180

Query: 181 PSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRFAKEQV+QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 181 PSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240

Query: 241 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300
           MRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Sbjct: 241 MRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENG 360
           FRIGAQSDPVEFMSWFLNTLHSELRI+KKSSSIIYECFQGELEVVKEIHSKAL EKKENG
Sbjct: 301 FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENG 360

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420
           ++QDAG+EGSSV METSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI
Sbjct: 361 EEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420

Query: 421 TEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRPRIARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 421 TEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480

Query: 481 PKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PK+N+KL SKYDLIANIVHDGKP+EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 481 PKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 541 AYMQIYERQQ 550

BLAST of Tan0009824 vs. NCBI nr
Match: XP_008463627.1 (PREDICTED: U4/U6.U5 tri-snRNP-associated protein 2-like [Cucumis melo] >XP_008463628.1 PREDICTED: U4/U6.U5 tri-snRNP-associated protein 2-like [Cucumis melo] >XP_008463629.1 PREDICTED: U4/U6.U5 tri-snRNP-associated protein 2-like [Cucumis melo] >XP_008463630.1 PREDICTED: U4/U6.U5 tri-snRNP-associated protein 2-like [Cucumis melo])

HSP 1 Score: 1031.2 bits (2665), Expect = 3.3e-297
Identity = 511/550 (92.91%), Positives = 531/550 (96.55%), Query Frame = 0

Query: 1   MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLK 60
           MGSKRR+++ +DEEE   DLKR KLLGE S SSSPPASENP+LPGFNYGDDDEEED+K K
Sbjct: 1   MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFK 60

Query: 61  QNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGS+YD D+GD NDDEEDDE+HDD  N V+RSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 61  QNGSKYDADEGDYNDDEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120

Query: 121 CSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180

Query: 181 PSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRFAKEQV+QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 181 PSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240

Query: 241 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300
           MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Sbjct: 241 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENG 360
           FRIGAQSDPVEFMSWFLNTLHSELRI+KKSSSIIYECFQGELEVVKEIHSKAL EKKENG
Sbjct: 301 FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENG 360

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420
           D+QDAG++GSSV+METSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI
Sbjct: 361 DEQDAGTQGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420

Query: 421 TEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRP IARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 421 TEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480

Query: 481 PKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PK+N+KL SKYDLIAN+VHDGKP+EG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 481 PKDNDKLRSKYDLIANVVHDGKPNEGCYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 541 AYMQIYERQQ 550

BLAST of Tan0009824 vs. ExPASy TrEMBL
Match: A0A6J1CND9 (U4/U6.U5 tri-snRNP-associated protein 2-like OS=Momordica charantia OX=3673 GN=LOC111012606 PE=4 SV=1)

HSP 1 Score: 1047.7 bits (2708), Expect = 1.7e-302
Identity = 525/550 (95.45%), Positives = 532/550 (96.73%), Query Frame = 0

Query: 1   MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLK 60
           MGSKRRDDN VDEEE A ++KRQKLLGEFS SS PPASENPRLPGFNYGDDDEEEDYK K
Sbjct: 1   MGSKRRDDNAVDEEELAPEIKRQKLLGEFSPSSPPPASENPRLPGFNYGDDDEEEDYKFK 60

Query: 61  QNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGSR  GD GD NDDEEDDE +DDDANHV+RSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 61  QNGSRNGGDGGDDNDDEEDDE-YDDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120

Query: 121 CSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180

Query: 181 PSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRF KEQV+ LDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 181 PSLDDIRYVLNPRFTKEQVELLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240

Query: 241 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300
           MRVTPLRNFFLIPENYQHC+SPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Sbjct: 241 MRVTPLRNFFLIPENYQHCKSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENG 360
           FRIGAQSDPVEFMSWFLNTLHSELR+SKKSSSIIYECFQGELEVVKEIHSKALTEKKENG
Sbjct: 301 FRIGAQSDPVEFMSWFLNTLHSELRMSKKSSSIIYECFQGELEVVKEIHSKALTEKKENG 360

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420
           DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGE I
Sbjct: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGEAI 420

Query: 421 TEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRPRIARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 421 TEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480

Query: 481 PKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PKENEKL SKYDLIANIVHDGKPDEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 481 PKENEKLRSKYDLIANIVHDGKPDEGCYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 541 AYMQIYERQQ 549

BLAST of Tan0009824 vs. ExPASy TrEMBL
Match: A0A1S3CJP7 (U4/U6.U5 tri-snRNP-associated protein 2-like OS=Cucumis melo OX=3656 GN=LOC103501728 PE=4 SV=1)

HSP 1 Score: 1031.2 bits (2665), Expect = 1.6e-297
Identity = 511/550 (92.91%), Positives = 531/550 (96.55%), Query Frame = 0

Query: 1   MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLK 60
           MGSKRR+++ +DEEE   DLKR KLLGE S SSSPPASENP+LPGFNYGDDDEEED+K K
Sbjct: 1   MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFK 60

Query: 61  QNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGS+YD D+GD NDDEEDDE+HDD  N V+RSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 61  QNGSKYDADEGDYNDDEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120

Query: 121 CSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180

Query: 181 PSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRFAKEQV+QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 181 PSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240

Query: 241 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300
           MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Sbjct: 241 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENG 360
           FRIGAQSDPVEFMSWFLNTLHSELRI+KKSSSIIYECFQGELEVVKEIHSKAL EKKENG
Sbjct: 301 FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENG 360

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420
           D+QDAG++GSSV+METSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI
Sbjct: 361 DEQDAGTQGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420

Query: 421 TEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRP IARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 421 TEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480

Query: 481 PKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PK+N+KL SKYDLIAN+VHDGKP+EG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 481 PKDNDKLRSKYDLIANVVHDGKPNEGCYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 541 AYMQIYERQQ 550

BLAST of Tan0009824 vs. ExPASy TrEMBL
Match: A0A0A0LTF9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G185102 PE=4 SV=1)

HSP 1 Score: 1031.2 bits (2665), Expect = 1.6e-297
Identity = 512/550 (93.09%), Positives = 532/550 (96.73%), Query Frame = 0

Query: 1   MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLK 60
           MGSKRR ++ +DEEE   DLKR KLLGE S SSSPPASENP+LPGFNYGDDDEEED+K K
Sbjct: 1   MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFK 60

Query: 61  QNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGS+YDGD+GD NDDEEDDE++D++ N V+RSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 61  QNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120

Query: 121 CSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180

Query: 181 PSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRFAKEQV+QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 181 PSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240

Query: 241 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300
           MRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Sbjct: 241 MRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENG 360
           FRIGAQSDPVEFMSWFLNTLHSELRI+KKSSSIIYECFQGELEVVKEIHSKAL EKKENG
Sbjct: 301 FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENG 360

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420
           ++QDAG+EGSSV METSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI
Sbjct: 361 EEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420

Query: 421 TEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRPRIARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 421 TEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480

Query: 481 PKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PK+N+KL SKYDLIANIVHDGKP+EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 481 PKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 541 AYMQIYERQQ 550

BLAST of Tan0009824 vs. ExPASy TrEMBL
Match: A0A6J1EKT5 (U4/U6.U5 tri-snRNP-associated protein 2-like OS=Cucurbita moschata OX=3662 GN=LOC111434213 PE=4 SV=1)

HSP 1 Score: 1029.6 bits (2661), Expect = 4.7e-297
Identity = 517/550 (94.00%), Positives = 529/550 (96.18%), Query Frame = 0

Query: 1   MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLK 60
           MGSKR++D+ VDEEE   DLKR K LGE SS SSPPASENP+LPGFNYGDDDEEEDYK K
Sbjct: 1   MGSKRQNDSVVDEEELGPDLKRHKSLGE-SSPSSPPASENPQLPGFNYGDDDEEEDYKSK 60

Query: 61  QNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGS YDGD+GDG DDEE+DED     NH+ RSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 61  QNGSGYDGDEGDGTDDEENDEDE----NHIMRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120

Query: 121 CSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180

Query: 181 PSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRFAKEQV+QLDK KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 181 PSLDDIRYVLNPRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240

Query: 241 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300
           MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Sbjct: 241 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENG 360
           FRIGAQSDPVEFMSWFLNTLHSELR+SKKSSSIIYECFQGELEVVKEIHSKALTEKKENG
Sbjct: 301 FRIGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENG 360

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420
           DDQDAG+EGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI
Sbjct: 361 DDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420

Query: 421 TEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRPRIARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 421 TEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480

Query: 481 PKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PKE+EKL SKYDLIANIVHDGKP+EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 481 PKESEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 541 AYMQIYERQQ 545

BLAST of Tan0009824 vs. ExPASy TrEMBL
Match: A0A6J1KIZ3 (U4/U6.U5 tri-snRNP-associated protein 2-like OS=Cucurbita maxima OX=3661 GN=LOC111495660 PE=4 SV=1)

HSP 1 Score: 1023.1 bits (2644), Expect = 4.4e-295
Identity = 514/550 (93.45%), Positives = 527/550 (95.82%), Query Frame = 0

Query: 1   MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLK 60
           MGSKR++D+ VDEEE   DLKR K LGE  S SSPPASENP+LPGFNYGDDDEEEDYK K
Sbjct: 1   MGSKRQNDSVVDEEELGPDLKRHKSLGEL-SPSSPPASENPQLPGFNYGDDDEEEDYKSK 60

Query: 61  QNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGS YDGD+GD  DDEE+DED     NH+ RSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 61  QNGSGYDGDEGDVTDDEENDEDE----NHIMRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120

Query: 121 CSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180

Query: 181 PSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRFAKEQV+QLDK KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 181 PSLDDIRYVLNPRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240

Query: 241 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300
           MRVTPLRNFFLIP+NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Sbjct: 241 MRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENG 360
           FRIGAQSDPVEFMSWFLNTLHSELR+SKKSSSIIYECFQGELEVVKEIHSKALTEKKENG
Sbjct: 301 FRIGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENG 360

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420
           DDQDAG+EGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI
Sbjct: 361 DDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420

Query: 421 TEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRPRIARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 421 TEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480

Query: 481 PKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PKE+EKL SKYDLIANIVHDGKP+EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 481 PKESEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 541 AYMQIYERQQ 545

BLAST of Tan0009824 vs. TAIR 10
Match: AT4G22285.1 (Ubiquitin C-terminal hydrolases superfamily protein )

HSP 1 Score: 782.7 bits (2020), Expect = 1.9e-226
Identity = 407/562 (72.42%), Positives = 464/562 (82.56%), Query Frame = 0

Query: 1   MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFN-YGDDDEEEDYKL 60
           M  +R   NGV EEE   ++KR++++    S   P    NP LP  N Y DDDEEE+ + 
Sbjct: 1   MKGEREVKNGVSEEE--REVKRKRVMERSDSPPPPLGFNNPLLPLANTYDDDDEEEENEQ 60

Query: 61  KQNGSRYDG-DKGDGNDD-------EEDDEDHDDDAN--HVRRSRDVEVRKDCPYLDTVN 120
           K++ +R +G  KG+GN +       EE D+D DDD +    + SR VEVR+DCPYLDTVN
Sbjct: 61  KKSQARGNGVAKGEGNGNKVKGEAQEEVDDDEDDDVSKGKGKHSRHVEVRRDCPYLDTVN 120

Query: 121 RQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKV 180
           RQVLDFDFE+FCSV LSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKV
Sbjct: 121 RQVLDFDFERFCSVSLSNLNVYACLVCGKYFQGRSQKSHAYTHSLEAGHHVYINLLTEKV 180

Query: 181 YCLPDGYEINDPSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKE 240
           YCLPD YEINDPSLDDIR+VLNPRF++ QV +LDKN+QWSRALDGSDYLPGMVGLNNI++
Sbjct: 181 YCLPDSYEINDPSLDDIRHVLNPRFSRAQVNELDKNRQWSRALDGSDYLPGMVGLNNIQK 240

Query: 241 TDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEF 300
           T+FVNVTIQSLMRVTPLRNFFLIPENYQHC+SPLVHRFGELTRKIWHARNFKGQVSPHEF
Sbjct: 241 TEFVNVTIQSLMRVTPLRNFFLIPENYQHCKSPLVHRFGELTRKIWHARNFKGQVSPHEF 300

Query: 301 LQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIH 360
           LQAVMKASKKRFRIG QSDPVEFMSW LNTLH +LR SK +SSII++CFQGELEVVKE  
Sbjct: 301 LQAVMKASKKRFRIGQQSDPVEFMSWLLNTLHMDLRTSKDASSIIHKCFQGELEVVKEFQ 360

Query: 361 SKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLF 420
                           G+E      E SRM FLMLGLDLPPPPLFKDVMEKNIIPQV LF
Sbjct: 361 ----------------GNENK----EISRMSFLMLGLDLPPPPLFKDVMEKNIIPQVALF 420

Query: 421 NILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKN 480
           ++LKKFDGET+TEVVRP++ARMRYRVI+ P+YL+ HM RF KNNFF EKNPTLVNFPVK+
Sbjct: 421 DLLKKFDGETVTEVVRPKLARMRYRVIKSPRYLMFHMVRFKKNNFFKEKNPTLVNFPVKD 480

Query: 481 LELKDYIP-LPTPKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHV 540
           +EL+DYIP LP   E E +CSKY+LIANIVHDGKP++GY+RVFVQRKS+ELWYEMQDLHV
Sbjct: 481 MELRDYIPSLPRAPEGENVCSKYNLIANIVHDGKPEDGYFRVFVQRKSQELWYEMQDLHV 540

Query: 541 SETLPQMVALSEAYMQIYERQQ 551
           +ETLPQMV LSEAYMQIYE+++
Sbjct: 541 AETLPQMVELSEAYMQIYEQEE 540
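
The Identity and Positives totals reported for each HSP can be read off the middle line of every alignment block. By the usual BLAST convention, a letter on the middle line marks an identical residue, a `+` marks a non-identical pair with a positive substitution score, and a space marks a mismatch or gap. The following is a minimal Python sketch of that counting, applied to the final block of the AT4G22285.1 alignment; the function name `count_midline` is hypothetical and not part of any BLAST tool.

```python
def count_midline(midline):
    """Count identities and positives on one BLAST alignment middle line.

    Letters are identical residues; '+' marks a conservative substitution.
    Positives include identities, as in the BLAST summary line.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return identities, identities + conservative


# Middle line of the last block (query residues 541 onward) of the
# AT4G22285.1 alignment shown above.
midline = "+ETLPQMV LSEAYMQIYE+++"
identities, positives = count_midline(midline)
print(identities, positives, len(midline))
```

Summing these per-block counts over all blocks of an HSP should reproduce the numerators of the Identity and Positives fractions, provided the middle lines are copied with their spacing intact.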

BLAST of Tan0009824 vs. TAIR 10
Match: AT4G22350.2 (Ubiquitin C-terminal hydrolases superfamily protein )

HSP 1 Score: 777.3 bits (2006), Expect = 8.0e-225
Identity = 403/560 (71.96%), Positives = 456/560 (81.43%), Query Frame = 0

Query: 1   MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLK 60
           M  +R   NGV EEE   ++KR++++    S   P    NP LP  N  DDD  +  K +
Sbjct: 1   MKGEREVKNGVSEEE--REVKRKRVMERSDSPPPPLGFNNPLLPFANAYDDDNNQQNKSQ 60

Query: 61  QNGSRYDGDKGDGND-------DEEDDEDHDDDANHVR--RSRDVEVRKDCPYLDTVNRQ 120
              +     +G+GN        + +DDED DDDA+  R   SR VEVR+DCPYLDTVNRQ
Sbjct: 61  TRCNVVAKGEGNGNKVKGEAQVEVDDDEDDDDDASKGRGKHSRHVEVRRDCPYLDTVNRQ 120

Query: 121 VLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYC 180
           VLDFDFE+FCSV LSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYC
Sbjct: 121 VLDFDFERFCSVSLSNLNVYACLVCGKYFQGRSQKSHAYTHSLEAGHHVYINLLTEKVYC 180

Query: 181 LPDGYEINDPSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETD 240
           LPD YEINDPSLDDIR+VLNPRF++ QV +LDKN+QWSRALDGSDYLPGMVGLNNI++T+
Sbjct: 181 LPDSYEINDPSLDDIRHVLNPRFSRAQVNELDKNRQWSRALDGSDYLPGMVGLNNIQKTE 240

Query: 241 FVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQ 300
           FVNVTIQSLMRVTPLRNFFLIPENYQHC+SPL HRFGELTRKIWHARNFKGQVSPHEFLQ
Sbjct: 241 FVNVTIQSLMRVTPLRNFFLIPENYQHCKSPLGHRFGELTRKIWHARNFKGQVSPHEFLQ 300

Query: 301 AVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSK 360
           AVMKASKKRFRIG QSDPVEFMSW LNTLH +LR SK +SSII++CFQGELEVVKE    
Sbjct: 301 AVMKASKKRFRIGQQSDPVEFMSWLLNTLHMDLRTSKDASSIIHKCFQGELEVVKEYQ-- 360

Query: 361 ALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNI 420
                         G+E      E SRMPFLMLGLDLPPPPLFKDVMEKNIIPQV LF++
Sbjct: 361 --------------GNENK----EISRMPFLMLGLDLPPPPLFKDVMEKNIIPQVALFDL 420

Query: 421 LKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLE 480
           LKKFDGET+TEVVRP++ARMRYRVI+ P+YL+ HM RF KNNFF EKNPTLVNFPVK++E
Sbjct: 421 LKKFDGETVTEVVRPKLARMRYRVIKSPRYLMFHMVRFKKNNFFKEKNPTLVNFPVKDME 480

Query: 481 LKDYIP-LPTPKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSE 540
           L+DYIP LP   E E +CSKY+LIANIVHDGKP++GY+RVFVQRKS+ELWYEMQDLHV+E
Sbjct: 481 LRDYIPSLPRAPEGENVCSKYNLIANIVHDGKPEDGYFRVFVQRKSQELWYEMQDLHVAE 538

Query: 541 TLPQMVALSEAYMQIYERQQ 551
           TLPQMV LSEAYMQIYE+Q+
Sbjct: 541 TLPQMVELSEAYMQIYEQQE 538

BLAST of Tan0009824 vs. TAIR 10
Match: AT4G22350.1 (Ubiquitin C-terminal hydrolases superfamily protein )

HSP 1 Score: 767.3 bits (1980), Expect = 8.3e-222
Identity = 399/553 (72.15%), Positives = 451/553 (81.56%), Query Frame = 0

Query: 1   MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLK 60
           M  +R   NGV EEE   ++KR++++ E S S  PP                     K +
Sbjct: 1   MKGEREVKNGVSEEE--REVKRKRVM-ERSDSPPPPLVA------------------KGE 60

Query: 61  QNGSRYDGDKGDGNDDEEDDEDHDDDANHVR--RSRDVEVRKDCPYLDTVNRQVLDFDFE 120
            NG++    KG+   + +DDED DDDA+  R   SR VEVR+DCPYLDTVNRQVLDFDFE
Sbjct: 61  GNGNKV---KGEAQVEVDDDEDDDDDASKGRGKHSRHVEVRRDCPYLDTVNRQVLDFDFE 120

Query: 121 KFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEI 180
           +FCSV LSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEI
Sbjct: 121 RFCSVSLSNLNVYACLVCGKYFQGRSQKSHAYTHSLEAGHHVYINLLTEKVYCLPDSYEI 180

Query: 181 NDPSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQ 240
           NDPSLDDIR+VLNPRF++ QV +LDKN+QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQ
Sbjct: 181 NDPSLDDIRHVLNPRFSRAQVNELDKNRQWSRALDGSDYLPGMVGLNNIQKTEFVNVTIQ 240

Query: 241 SLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASK 300
           SLMRVTPLRNFFLIPENYQHC+SPL HRFGELTRKIWHARNFKGQVSPHEFLQAVMKASK
Sbjct: 241 SLMRVTPLRNFFLIPENYQHCKSPLGHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASK 300

Query: 301 KRFRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKE 360
           KRFRIG QSDPVEFMSW LNTLH +LR SK +SSII++CFQGELEVVKE           
Sbjct: 301 KRFRIGQQSDPVEFMSWLLNTLHMDLRTSKDASSIIHKCFQGELEVVKEYQ--------- 360

Query: 361 NGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGE 420
                  G+E      E SRMPFLMLGLDLPPPPLFKDVMEKNIIPQV LF++LKKFDGE
Sbjct: 361 -------GNENK----EISRMPFLMLGLDLPPPPLFKDVMEKNIIPQVALFDLLKKFDGE 420

Query: 421 TITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP- 480
           T+TEVVRP++ARMRYRVI+ P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DYIP 
Sbjct: 421 TVTEVVRPKLARMRYRVIKSPRYLMFHMVRFKKNNFFKEKNPTLVNFPVKDMELRDYIPS 480

Query: 481 LPTPKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVA 540
           LP   E E +CSKY+LIANIVHDGKP++GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV 
Sbjct: 481 LPRAPEGENVCSKYNLIANIVHDGKPEDGYFRVFVQRKSQELWYEMQDLHVAETLPQMVE 509

Query: 541 LSEAYMQIYERQQ 551
           LSEAYMQIYE+Q+
Sbjct: 541 LSEAYMQIYEQQE 509

BLAST of Tan0009824 vs. TAIR 10
Match: AT4G22410.1 (Ubiquitin C-terminal hydrolases superfamily protein )

HSP 1 Score: 572.8 bits (1475), Expect = 3.0e-163
Identity = 282/355 (79.44%), Positives = 307/355 (86.48%), Query Frame = 0

Query: 108 VNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE 167
           V  QVLDF FE+FCSV LSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TE
Sbjct: 2   VEFQVLDFHFERFCSVSLSNLNVYACLVCGKYFQGRSQKSHAYTHSLEAGHHVYINLLTE 61

Query: 168 KVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNI 227
           KVYCLPD YEINDPSLDDIR+VLNPRF++ QV +LDKN+QWSRALDGSDYLPGMVGLNNI
Sbjct: 62  KVYCLPDSYEINDPSLDDIRHVLNPRFSRAQVNELDKNRQWSRALDGSDYLPGMVGLNNI 121

Query: 228 KETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPH 287
           ++T+FVNVTIQSLMRVTPLRNFF IPENYQHC+SPLVH FGELTRKIWHARNFKGQVSPH
Sbjct: 122 QKTEFVNVTIQSLMRVTPLRNFFHIPENYQHCKSPLVHCFGELTRKIWHARNFKGQVSPH 181

Query: 288 EFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKE 347
           EFLQAVMKASKKRFRIG QSDPVEFMSW LNTLH +LR SK +SSII++CFQGELEVVKE
Sbjct: 182 EFLQAVMKASKKRFRIGQQSDPVEFMSWLLNTLHMDLRTSKDASSIIHKCFQGELEVVKE 241

Query: 348 IHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVP 407
                             G+E      E SRM FLMLGLDLPPPPLFKDVMEKNIIPQV 
Sbjct: 242 FQ----------------GNENK----EISRMSFLMLGLDLPPPPLFKDVMEKNIIPQVA 301

Query: 408 LFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTL 463
           LF++LKKFDGET+TEVVRP++ARMRYRVI+ P+YL+ HM RF KNNFF EKNPTL
Sbjct: 302 LFDLLKKFDGETVTEVVRPKLARMRYRVIKSPRYLMFHMVRFKKNNFFKEKNPTL 336

BLAST of Tan0009824 vs. TAIR 10
Match: AT4G22420.1 (Ubiquitin-specific protease family C19-related protein )

HSP 1 Score: 61.2 bits (147), Expect = 2.9e-09
Identity = 52/108 (48.15%), Positives = 67/108 (62.04%), Query Frame = 0

Query: 23  QKLLGEFSSSSSPPAS-ENPRLPGFN-YGDDDEEEDYKLKQNGSRYDG-DKGDGN----- 82
           +K + E S S  PP    N  LP  N Y DDDEEE  +LK++ +R +G  KG+GN     
Sbjct: 39  EKRVIERSDSPPPPLGFNNHLLPLANAYDDDDEEEGNELKKSQARRNGVAKGEGNGNKVN 98

Query: 83  ----DDEEDDEDHDDDANHVR--RSRDVEVRKDCPYLDTVNRQVLDFD 117
               ++ +D+ED DDDA+  R   SR VEVR+DCPYLDTVNRQV+  D
Sbjct: 99  GEAQEEVDDEEDDDDDASKGRGKHSRHVEVRRDCPYLDTVNRQVIIID 146
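
Each HSP statistics line above reports counts as a fraction followed by a percentage, for example 52/108 (48.15%). As a rough consistency check, the percentages can be recomputed from the fractions. The Python sketch below does this; the helper names and the rounding tolerance are assumptions made for illustration, and the regular expression only targets the line layout used on this page.

```python
import re

# Matches fragments such as "Identity = 52/108 (48.15%)". The label is
# captured generically, so the sketch does not depend on its exact text.
FRACTION = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_fractions(line):
    """Return {label: (matches, aligned_length, reported_percent)}."""
    return {
        label: (int(num), int(den), float(pct))
        for label, num, den, pct in FRACTION.findall(line)
    }


def percent_is_consistent(matches, aligned_length, reported_percent, tol=0.006):
    """True if the reported percentage agrees with matches / aligned_length.

    tol is a hypothetical allowance for rounding to two decimal places.
    """
    return abs(100.0 * matches / aligned_length - reported_percent) <= tol


line = "Identity = 52/108 (48.15%), Positives = 67/108 (62.04%), Query Frame = 0"
for label, (matches, length, pct) in parse_hsp_fractions(line).items():
    print(label, matches, length, percent_is_consistent(matches, length, pct))
```

The denominator is the alignment length including gaps, which is why the Arabidopsis hits report lengths such as 562 or 560 rather than the 550 seen for the gap-poor cucurbit hits.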

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
Q53GS9	3.5e-132	51.75	U4/U6.U5 tri-snRNP-associated protein 2 OS=Homo sapiens OX=9606 GN=USP39 PE=1 SV...
Q5R761	1.0e-131	51.75	U4/U6.U5 tri-snRNP-associated protein 2 OS=Pongo abelii OX=9601 GN=USP39 PE=2 SV...
Q3TIX9	2.3e-131	51.32	U4/U6.U5 tri-snRNP-associated protein 2 OS=Mus musculus OX=10090 GN=Usp39 PE=1 S...
Q9USR2	8.4e-102	42.06	Probable mRNA-splicing protein ubp10 OS=Schizosaccharomyces pombe (strain 972 / ...
P43589	2.7e-44	29.18	Pre-mRNA-splicing factor SAD1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / ...

Match Name	E-value	Identity (%)	Description
XP_022142503.1	3.4e-302	95.45	U4/U6.U5 tri-snRNP-associated protein 2-like [Momordica charantia]
XP_038894256.1	5.8e-302	94.55	U4/U6.U5 tri-snRNP-associated protein 2-like isoform X2 [Benincasa hispida] >XP_...
XP_038894254.1	5.8e-302	94.55	U4/U6.U5 tri-snRNP-associated protein 2-like isoform X1 [Benincasa hispida] >XP_...
XP_011655023.1	3.3e-297	93.09	U4/U6.U5 tri-snRNP-associated protein 2 [Cucumis sativus] >KGN65053.1 hypothetic...
XP_008463627.1	3.3e-297	92.91	PREDICTED: U4/U6.U5 tri-snRNP-associated protein 2-like [Cucumis melo] >XP_00846...

Match Name	E-value	Identity (%)	Description
A0A6J1CND9	1.7e-302	95.45	U4/U6.U5 tri-snRNP-associated protein 2-like OS=Momordica charantia OX=3673 GN=L...
A0A1S3CJP7	1.6e-297	92.91	U4/U6.U5 tri-snRNP-associated protein 2-like OS=Cucumis melo OX=3656 GN=LOC10350...
A0A0A0LTF9	1.6e-297	93.09	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G185102 PE=4 SV=1
A0A6J1EKT5	4.7e-297	94.00	U4/U6.U5 tri-snRNP-associated protein 2-like OS=Cucurbita moschata OX=3662 GN=LO...
A0A6J1KIZ3	4.4e-295	93.45	U4/U6.U5 tri-snRNP-associated protein 2-like OS=Cucurbita maxima OX=3661 GN=LOC1...

Match Name	E-value	Identity (%)	Description
AT4G22285.1	1.9e-226	72.42	Ubiquitin C-terminal hydrolases superfamily protein
AT4G22350.2	8.0e-225	71.96	Ubiquitin C-terminal hydrolases superfamily protein
AT4G22350.1	8.3e-222	72.15	Ubiquitin C-terminal hydrolases superfamily protein
AT4G22410.1	3.0e-163	79.44	Ubiquitin C-terminal hydrolases superfamily protein
AT4G22420.1	2.9e-09	48.15	Ubiquitin-specific protease family C19-related protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001607	Zinc finger, UBP-type	SMART	SM00290	Zf_UBP_1	coord: 120..169; e-value: 1.2E-19; score: 81.3
IPR001607	Zinc finger, UBP-type	PFAM	PF02148	zf-UBP	coord: 121..182; e-value: 3.3E-15; score: 56.1
IPR001607	Zinc finger, UBP-type	PROSITE	PS50271	ZF_UBP	coord: 119..180; score: 18.881624
None	No IPR available	GENE3D	3.90.70.10	Cysteine proteinases	coord: 191..550; e-value: 1.3E-92; score: 312.6
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 53..71
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 25..40
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..92
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..22
None	No IPR available	PANTHER	PTHR21646	UBIQUITIN CARBOXYL-TERMINAL HYDROLASE	coord: 80..549
None	No IPR available	PANTHER	PTHR21646:SF71	BNAA06G13940D PROTEIN	coord: 80..549
None	No IPR available	SUPERFAMILY	57850	RING/U-box	coord: 100..194
IPR013083	Zinc finger, RING/FYVE/PHD-type	GENE3D	3.30.40.10	Zinc/RING finger domain, C3HC4 (zinc finger)	coord: 102..190; e-value: 1.6E-41; score: 142.1
IPR001394	Peptidase C19, ubiquitin carboxyl-terminal hydrolase	PFAM	PF00443	UCH	coord: 222..546; e-value: 4.3E-36; score: 124.8
IPR028889	Ubiquitin specific protease domain	PROSITE	PS50235	USP_3	coord: 222..549; score: 35.182213
IPR033809	USP39	CDD	cd02669	Peptidase_C19M	coord: 103..547; e-value: 0.0; score: 679.426
IPR038765	Papain-like cysteine peptidase superfamily	SUPERFAMILY	54001	Cysteine proteinases	coord: 214..548
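
The coordinate ranges in the InterPro table place the UBP-type zinc finger near residues 119..182 and the ubiquitin hydrolase (UCH/USP) domain near residues 222..549. The Python sketch below shows one way to ask which signatures cover a given residue. It assumes the coordinates are 1-based and inclusive on the protein sequence, which the page does not state; the `DomainHit` type and helper names are hypothetical, and only a subset of the table rows is included.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    source: str
    source_term: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive


# A subset of the InterPro rows above, copied by hand.
HITS = [
    DomainHit("SMART", "SM00290", 120, 169),
    DomainHit("PFAM", "PF02148", 121, 182),
    DomainHit("PROSITE", "PS50271", 119, 180),
    DomainHit("PFAM", "PF00443", 222, 546),
    DomainHit("PROSITE", "PS50235", 222, 549),
    DomainHit("CDD", "cd02669", 103, 547),
]


def span_length(hit):
    """Number of residues covered by a hit under the inclusive assumption."""
    return hit.end - hit.start + 1


def hits_covering(position, hits=HITS):
    """Return the hits whose coordinate range contains the given residue."""
    return [h for h in hits if h.start <= position <= h.end]


for pos in (150, 200, 300):
    print(pos, [h.source_term for h in hits_covering(pos)])
```

Under these assumptions, a residue between the two domains, such as 200, is covered only by the full-length CDD model cd02669.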

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Tan0009824.1	Tan0009824.1	mRNA

GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0016579	protein deubiquitination
biological_process	GO:0000245	spliceosomal complex assembly
biological_process	GO:0006397	mRNA processing
cellular_component	GO:0005681	spliceosomal complex
molecular_function	GO:0004843	thiol-dependent deubiquitinase
molecular_function	GO:0008270	zinc ion binding