CcUC03G053630.1 (mRNA) Watermelon (PI 537277) v1

Overview
Name: CcUC03G053630.1
Type: mRNA
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: L10-interacting MYB domain-containing protein
Location: CicolChr03: 23229539 .. 23234944 (-)
Sequence length: 2180
RNA-Seq Expression: CcUC03G053630.1
Synteny: CcUC03G053630.1
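
The Location field packs the chromosome, coordinates, and strand into one string. The sketch below shows one way to split it and compute the genomic span. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

# Pattern for strings such as "CicolChr03: 23229539 .. 23234944 (-)".
LOCATION = re.compile(r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into (chromosome, start, end, strand)."""
    match = LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Genomic span in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("CicolChr03: 23229539 .. 23234944 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the span is 5406 bases on the minus strand. The listed sequence length of 2180 is shorter, as expected for a spliced mRNA whose gene sequence includes introns.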
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTCTGGTTATTTGGGACCCTATTCATCTCTTTGGGCTTCTATTGAATCGAGAAATAGGTTTGATTGTCCATCTTTTTGATATAATAGTAAGGCATCCTCCGGATAATTCAAATCGAAGATGCATGGTGCAATATCATGGGTGAGCAAGCAATATCAACCTTTTCAGATGGTGGTGCAGAAGCTAAAGAAATCCGGACAAAAAGTAAGGACGGCTCAGACTCTTTGATTGTATTAGATGTTCAGAGTAGTGAGAAGCTGGCAAGAAGTTAAGATGGACTAGTGATATGGATCATTACCTTGGGAGGACTCTCGCGGGATATGCGATGAAGGGGTGTAAACTTGATAGAACTTTACAATGTGGGGTAGTTGATTTAGCTGTTTCAGCTTTAAATGAGAAATATGGGTCAGACTTGACAAAAGAACACATAAGAAACAGGCTAAAAACTTGGAGGAAGCAATATCGTAATCTTTCTCATGATGGGTTCAGGTGGGACGAAACAAGAAAGGTTATCATTGCTAATAACTCAGTGTGGGATGATTATGTCAAGGTAAGTTATAAGTAGAAACTCTGCAGCCTTTACCTAATGGCATGAGTACTGTTGTCTTTCCATGCATTTGATCATGTTCTTTATTGTTGTTGTTACTGTTTGTTTTGTGAAATGAAGCTTTTCATCGAGAATCATAAAAAGAAATACAAGGCATACAAAAAAAACAGCCCCACAAATGGGGCTGGTTTGATCATGTTCTTCTTGTTGGGATTACTAGCATCAAACTTTCTCTTTTTTTTTCCTCCTTTTTGGTTTGTGGTAAGAGATAGTTCATAGCCACAAAAGAGATGTACAATACGGGAGTGAGATAAGAAATTCTACCTCACTGATACCAAGGTGGTGAGGAAAAAAAATATCTTTTGTTGGCTTGAATTGGAAAAAGTAATGGTTACAATACCTTAGAAGGGTGTCTTCTTCTCTCGGCATTGATACTAAGCCCCATTCAAAAATAATTTCATATCTTTGGTGCTTTTCCAAACGCCCATGAACAACCAAAATAAGGAAAACGTTGTTTTCCAACATGTCCTGCTGTGAGTACAATGAAATAACTAGTGTCACTGGTCCTTCCTTACCTTTATTCATAAGACAACAATTAGAGTATAGAGAAAATTCTCGATGATCTCTGAAATCAATACATATCACTTTGGAATTGTCTTTGAAACCAGCATTTAGCAGGTAAATGGAATCTGATAAATGGAAGACTTAATAGATATTAATATCATTGAGAAGGTAGCTAGAGTCTAGAGAGTTCATGATGTTCTCTAAATCCTTTTGCATGAAAAGGCATACTATCAACTGTCGTGTACAAGGGACAACAAATTTAACGAAGCTAAATGGTGAAATCTTGATAGACAACCAACCATGCTGCTTCAGTCTCATTTTGTATATGGTCTGAACTACCTAAACAAGTTACCATTCAATGTATAGGTCGTTTCAATGATGTTCCCTGATGTATTCTACCTGAATTTTAGCTCCAGAATAACAATTTGTTATCATTTCTTTTGCACATAGATCAATTCTGAGGCCAAAAGCTTTCATGGTAGAGTATTTGAAAACTATGATCAGTTTTGCATTTTCTTTAGATACTACAAATGGAGGCGTTGGATTTCCCCATTGCCGCTAATGATGGAAAGACTGGATGTGAAAGGAACTCTTTGAGGTGGACTCGTGAAATGGACCATTGCCTTAGAAGAGTCGTCATGCAGCATGTAATTCTTGGGGACAAAGGCATGGTAGATAATAAATTCAGTCCCCTTGTATATGATGCAGCTATATTGGATTTAAGAGAAAGCCTGGCTCTTGAATTGACCAAAGAACAAGTTGAGGATCGTTTTAATTCATGGAAAAGAGAATATGGTTTGATAAGGGACCTGCTGGACCAAGGTGACTTTGAATGGGATGATCACCAAAAGATGCTAGTTGCAAAGGACTCAGTATGGGATGCGTCTATTG
AGGTATATTATATTTCAGTTAATCTTTCAGTGGTCCATGTCCATAAAAGAATGATGCATTCGATTTTTCTTTTCCTTTTTCCTTTTTCCTTTTTCATTGATGGTGTTCAAAATGGTACTTGCCTATAATTGGAACAGAGAAACCCGGATACTAGACATCTTAGAGGGAAGGTCATTGAGAACTACAATGAATTGTGTGCTATTGCTGGGTGTGACAATCCATCTGAAAGTTCTCTCAATGCTGCTGCTAATTCTTTGGATTTATCTGTAGACGAAGCTATAAATGCCAGAGATGTGTGTCACAATCAAAGTAACAGGGCAGCAGATAATGAAAATTACGTAACTTGGACCAAGGAAATGGATACCTGCTTATTAAAGATGCTGGTTAAGCAAGTGAGTCTTGGAAATAAGATTGACAAAAACTTTAAGCCTGCAGCTTACACAGCTGCTCTTACATTTTTGAATGAGAGATTTGCATTGGACTTGACAAAGGAAAACGTCAAAAGCAGGTTAAACACGTGGAAGAAGCAGTATGGAATAGTGAAGTCACTCCTCTCTCATGATGGATTTGAGTGGGATGAAAAACACAAGATGATTGTTGCTACCGACTTTGATTGGACTGCACACACTAAGGTATTTGTATATCATTTTACCTTTCAAGATGGTCTGATTTTATTTTTTAGGATGAAAAATCATCATTGGTTAAGATAAATGAAATAAGCAAAATAAGGGAAACACAAAAACACTAGCTGAGGCGAGACAAAACTAGTTACGAAATAGTTTTGTTGGGATGCCCAGCAATAGGTTTTGAAATGGATCAAACTTCACCTCCCATTCCAGGATCTCTCAATAACCTTGTATCTCTTCCCAACCATATCTACAACAAAATAGTTAAAAGGGAAAAAAAAAAAAAAAGGAAAAAAAATCTCAACATGTGGACTGAGATACTGTTGGCAGAGAGCATTTTGATAGGAATGTTACTAGTATTATTAGGATATTAAGGGTATAATGGTACGTAGTTAGGGAAGTTTTTATGGTATTCAGTTATAAATAGAGGAAGCGAGATAGGAGAAAATTTGGCAATGATTTGGCAAATGAATTAGGGCTTGAAAAAGATATATGAAAACATTAACGATAAGTATATTGCAATGTCAAGAGAAATTACAATATAAACAAGTAATTTGAGGTACTTGCACTATCCCTCCTGAGATATTTTCCAAGCCCTAATTCACTCACCACAATGTAATACCCCCTCCTCATTCCTCAGTACTCTATTTGTAGTCAAACCCATAACCACTTCTCTAACTAATTACCACTATACGTTTACTAATAAGCTCTACAAAATCTCCTATCAGTACTCTCACAAAAGGGAGAAGGGATGTTAGAGTTGACTACTCTTCTTTCTTTACTTGAGGGTCACCCTTTAGAAGAGGGAGAAGGTATGTTAGAGTTTGGAGCTCTAATCCTTTAGAAGGTTCTCATGTAAGTCGGTTTTTCCTTGTTTGGTTGATTCTTCTCCCGTAGGTGTGTCAGTCTTTACGGTACTTTGGAGGATTAAAATTCTAAGGAAGGTGAGGTTTTTTACCTACAAAGTTCTTCATGATCGTGCTAACACATTGGATTGGCTTGTGAGGAAGTTGCCTTTGTTTGTTGGGTCTTTTTGTTGTATTCTTTGTCGGAAGGCGAAGGAAGACTTGGACCATATTTCTAGTGCTATGATTATGCGAGTAGTCTTTGGGATTCCTTCCTTCAAGAGTTTGGTGTGAAGTATGTTCATCACAGAATCATTAGCGATATGATCGAGGAGTTCCTCCTCAATCCACTCTTTGAGGAGAGGGGCCAATTTTTATGGCTTTCGAGGTGTGTGCAATTATGTGGGTGCTGTGGGGTGAGTGGAGTGGAATAGTAGGGTTTTCAGGGGTTTGGGTAGGGCTCCTTTGGAGACTTGGTCCCTTGTTCATTTCCATGTCTCATTATGAGCTTCAATTTCGAAGACGTTCT
GTAATTATTCTATAATCATTATTATGTTTAGTTGGAGTCCCTTCTTGTAGAGAGATGTCCCTTTTTTGTGGACTTCGTTCTTTGTACGCCTGTGTATTTTTTCATTTTTTCTCAATGAAAGTTGCCTTTTTCATTAAGAAAAAGAACAAAAACCCCCAAAAAAAACCCCAAAAACAAAAATAAATTAATAAATAAACCTCAAGAGGGTTTGAACACTTGGTTTATCTAGTAATTTGGTTGTCCTATTATCTTTCAATGCGTTTGGGTTCTATCATCTTTTTCTTCACATTTTGTTGATCTAATAATGTAATACAGATGGGGATATGTCCATTTCGGATATGTCCCATTTAGATTGTTTTACCAGTTAATAGCCGGAGTCCATCTTATCAATTTCATTTGTCATATGACTCATATCTTGCCTCTATCTAAAGTTTTTTATTAATATTATTGACTTGACACCGATTTTAATGGAATTATGGATCAACTTCTAGAAGAAGAGAGAATACATTATGAAAAAATATTTTACTTACACTTATTGCATCTCCTTTTATTATTCCGTATTATTTTTTGTTTAATCTGGTAGGCTTAAGGACATGTGAAATTTTGCACATTCATTCAGGGTTTTTTAACATTCATTTCAGGGACACCTTGATGTGCAGGAATTGCAAGCCAAGACGATTGAGAATTACAATGAGCTGTGTATGATTTTTGGCAACGAGGAGAAAACTGAAGGTTGGTCAATTGGTGAAAAACTCGATAAGGACCGTACATTGGACAACCACAACCATACAGAACTCCAAGTAGGGATATCAGATGATGATGCAGGGGGAGGTGATGGTTGCAGTGATGCTGATAGCATGGAGGCTTCATCTCAACAAACAGGAACTAGACCATCCTCCTCTTCGCATTCTCGCAAGTCTTTAAAGCGAAGTTGCAATGGCGATCTCATGGTGCAAATAATGAGTGTCATGGCTGCTAATGTTGCTCGGATAGCTGATGCATTGTCAGACAGGCCAACATGCTTAGATCAAGTGTTTGATGTAGTTCAAACCATGCCTGGGTTGGACGACAATCTGATCCTCGATGCCTGTGAGTTTCTCTCCCTCGATGATAAAAGGGCTGTGATGTTTATGAAATTGGATGAGAGGTTGAGAAGAAAGTGGCTACTAAAAAAGTTGCACAGTTAGGATTGCATATAAGATCTATAGTGTTCCTTGTAGATATTTATACCTATTTTTCTTCAAGTCATTAGCTAGATGAAGGAATTATCATGATTATTATTTGCCCTATTCTTTCTAATATAATTTTTACTGCATTTATTGCATAGACTGGCTTGGCTTCATAGCAGTCATTTGGTTGGATGAATCCGGGGCTATAGAAACAATTATAGGATGTTTTTATGAGAG

mRNA sequence

CTCTGGTTATTTGGGACCCTATTCATCTCTTTGGGCTTCTATTGAATCGAGAAATAGGTTTGATTGTCCATCTTTTTGATATAATAGTAAGGCATCCTCCGGATAATTCAAATCGAAGATGCATGGTGCAATATCATGGGTGAGCAAGCAATATCAACCTTTTCAGATGGTGGTGCAGAAGCTAAAGAAATCCGGACAAAAAGTAAGGACGGCTCAGACTCTTTGATTGTATTAGATGTTCAGAGTAGTGAGAAGCTGGCAAGAAGTTAAGATGGACTAGTGATATGGATCATTACCTTGGGAGGACTCTCGCGGGATATGCGATGAAGGGGTGTAAACTTGATAGAACTTTACAATGTGGGGTAGTTGATTTAGCTGTTTCAGCTTTAAATGAGAAATATGGGTCAGACTTGACAAAAGAACACATAAGAAACAGGCTAAAAACTTGGAGGAAGCAATATCGTAATCTTTCTCATGATGGGTTCAGGTGGGACGAAACAAGAAAGGTTATCATTGCTAATAACTCAGTGTGGGATGATTATGTCAAGATACTACAAATGGAGGCGTTGGATTTCCCCATTGCCGCTAATGATGGAAAGACTGGATGTGAAAGGAACTCTTTGAGGTGGACTCGTGAAATGGACCATTGCCTTAGAAGAGTCGTCATGCAGCATGTAATTCTTGGGGACAAAGGCATGGTAGATAATAAATTCAGTCCCCTTGTATATGATGCAGCTATATTGGATTTAAGAGAAAGCCTGGCTCTTGAATTGACCAAAGAACAAGTTGAGGATCGTTTTAATTCATGGAAAAGAGAATATGGTTTGATAAGGGACCTGCTGGACCAAGGTGACTTTGAATGGGATGATCACCAAAAGATGCTAGTTGCAAAGGACTCAGTATGGGATGCGTCTATTGAGAGAAACCCGGATACTAGACATCTTAGAGGGAAGGTCATTGAGAACTACAATGAATTGTGTGCTATTGCTGGGTGTGACAATCCATCTGAAAGTTCTCTCAATGCTGCTGCTAATTCTTTGGATTTATCTGTAGACGAAGCTATAAATGCCAGAGATGTGTGTCACAATCAAAGTAACAGGGCAGCAGATAATGAAAATTACGTAACTTGGACCAAGGAAATGGATACCTGCTTATTAAAGATGCTGGTTAAGCAAGTGAGTCTTGGAAATAAGATTGACAAAAACTTTAAGCCTGCAGCTTACACAGCTGCTCTTACATTTTTGAATGAGAGATTTGCATTGGACTTGACAAAGGAAAACGTCAAAAGCAGGTTAAACACGTGGAAGAAGCAGTATGGAATAGTGAAGTCACTCCTCTCTCATGATGGATTTGAGTGGGATGAAAAACACAAGATGATTGTTGCTACCGACTTTGATTGGACTGCACACACTAAGGGACACCTTGATGTGCAGGAATTGCAAGCCAAGACGATTGAGAATTACAATGAGCTGTGTATGATTTTTGGCAACGAGGAGAAAACTGAAGGTTGGTCAATTGGTGAAAAACTCGATAAGGACCGTACATTGGACAACCACAACCATACAGAACTCCAAGTAGGGATATCAGATGATGATGCAGGGGGAGGTGATGGTTGCAGTGATGCTGATAGCATGGAGGCTTCATCTCAACAAACAGGAACTAGACCATCCTCCTCTTCGCATTCTCGCAAGTCTTTAAAGCGAAGTTGCAATGGCGATCTCATGGTGCAAATAATGAGTGTCATGGCTGCTAATGTTGCTCGGATAGCTGATGCATTGTCAGACAGGCCAACATGCTTAGATCAAGTGTTTGATGTAGTTCAAACCATGCCTGGGTTGGACGACAATCTGATCCTCGATGCCTGTGAGTTTCTCTCCCTCGATGATAAAAGGGCTGTGATGTTTATGAAATTGGATGAGAGGTTGAGAAGAAAGTGGCTACTAAAAAAGTTGCACAGTTAGGATTGCATATAAGATCTATAGTGTTCCTTGTAGATATTT
ATACCTATTTTTCTTCAAGTCATTAGCTAGATGAAGGAATTATCATGATTATTATTTGCCCTATTCTTTCTAATATAATTTTTACTGCATTTATTGCATAGACTGGCTTGGCTTCATAGCAGTCATTTGGTTGGATGAATCCGGGGCTATAGAAACAATTATAGGATGTTTTTATGAGAG

Coding sequence (CDS)

ATGGATCATTACCTTGGGAGGACTCTCGCGGGATATGCGATGAAGGGGTGTAAACTTGATAGAACTTTACAATGTGGGGTAGTTGATTTAGCTGTTTCAGCTTTAAATGAGAAATATGGGTCAGACTTGACAAAAGAACACATAAGAAACAGGCTAAAAACTTGGAGGAAGCAATATCGTAATCTTTCTCATGATGGGTTCAGGTGGGACGAAACAAGAAAGGTTATCATTGCTAATAACTCAGTGTGGGATGATTATGTCAAGATACTACAAATGGAGGCGTTGGATTTCCCCATTGCCGCTAATGATGGAAAGACTGGATGTGAAAGGAACTCTTTGAGGTGGACTCGTGAAATGGACCATTGCCTTAGAAGAGTCGTCATGCAGCATGTAATTCTTGGGGACAAAGGCATGGTAGATAATAAATTCAGTCCCCTTGTATATGATGCAGCTATATTGGATTTAAGAGAAAGCCTGGCTCTTGAATTGACCAAAGAACAAGTTGAGGATCGTTTTAATTCATGGAAAAGAGAATATGGTTTGATAAGGGACCTGCTGGACCAAGGTGACTTTGAATGGGATGATCACCAAAAGATGCTAGTTGCAAAGGACTCAGTATGGGATGCGTCTATTGAGAGAAACCCGGATACTAGACATCTTAGAGGGAAGGTCATTGAGAACTACAATGAATTGTGTGCTATTGCTGGGTGTGACAATCCATCTGAAAGTTCTCTCAATGCTGCTGCTAATTCTTTGGATTTATCTGTAGACGAAGCTATAAATGCCAGAGATGTGTGTCACAATCAAAGTAACAGGGCAGCAGATAATGAAAATTACGTAACTTGGACCAAGGAAATGGATACCTGCTTATTAAAGATGCTGGTTAAGCAAGTGAGTCTTGGAAATAAGATTGACAAAAACTTTAAGCCTGCAGCTTACACAGCTGCTCTTACATTTTTGAATGAGAGATTTGCATTGGACTTGACAAAGGAAAACGTCAAAAGCAGGTTAAACACGTGGAAGAAGCAGTATGGAATAGTGAAGTCACTCCTCTCTCATGATGGATTTGAGTGGGATGAAAAACACAAGATGATTGTTGCTACCGACTTTGATTGGACTGCACACACTAAGGGACACCTTGATGTGCAGGAATTGCAAGCCAAGACGATTGAGAATTACAATGAGCTGTGTATGATTTTTGGCAACGAGGAGAAAACTGAAGGTTGGTCAATTGGTGAAAAACTCGATAAGGACCGTACATTGGACAACCACAACCATACAGAACTCCAAGTAGGGATATCAGATGATGATGCAGGGGGAGGTGATGGTTGCAGTGATGCTGATAGCATGGAGGCTTCATCTCAACAAACAGGAACTAGACCATCCTCCTCTTCGCATTCTCGCAAGTCTTTAAAGCGAAGTTGCAATGGCGATCTCATGGTGCAAATAATGAGTGTCATGGCTGCTAATGTTGCTCGGATAGCTGATGCATTGTCAGACAGGCCAACATGCTTAGATCAAGTGTTTGATGTAGTTCAAACCATGCCTGGGTTGGACGACAATCTGATCCTCGATGCCTGTGAGTTTCTCTCCCTCGATGATAAAAGGGCTGTGATGTTTATGAAATTGGATGAGAGGTTGAGAAGAAAGTGGCTACTAAAAAAGTTGCACAGTTAG

Protein sequence

MDHYLGRTLAGYAMKGCKLDRTLQCGVVDLAVSALNEKYGSDLTKEHIRNRLKTWRKQYRNLSHDGFRWDETRKVIIANNSVWDDYVKILQMEALDFPIAANDGKTGCERNSLRWTREMDHCLRRVVMQHVILGDKGMVDNKFSPLVYDAAILDLRESLALELTKEQVEDRFNSWKREYGLIRDLLDQGDFEWDDHQKMLVAKDSVWDASIERNPDTRHLRGKVIENYNELCAIAGCDNPSESSLNAAANSLDLSVDEAINARDVCHNQSNRAADNENYVTWTKEMDTCLLKMLVKQVSLGNKIDKNFKPAAYTAALTFLNERFALDLTKENVKSRLNTWKKQYGIVKSLLSHDGFEWDEKHKMIVATDFDWTAHTKGHLDVQELQAKTIENYNELCMIFGNEEKTEGWSIGEKLDKDRTLDNHNHTELQVGISDDDAGGGDGCSDADSMEASSQQTGTRPSSSSHSRKSLKRSCNGDLMVQIMSVMAANVARIADALSDRPTCLDQVFDVVQTMPGLDDNLILDACEFLSLDDKRAVMFMKLDERLRRKWLLKKLHS
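
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this on the first seven codons only. It assumes the standard genetic code, which the page does not name explicitly; `translate` is a hypothetical helper written for this example.

```python
# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGGATCATTACCTTGGGAGG"))  # MDHYLGR
```

The output matches the first seven residues of the protein sequence. Passing the full CDS string to the same function should give the full protein, since the CDS ends in a TAG stop codon.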
Homology
BLAST of CcUC03G053630.1 vs. NCBI nr
Match: XP_030959168.1 (uncharacterized protein LOC115981123 [Quercus lobata])

HSP 1 Score: 573.9 bits (1478), Expect = 1.5e-159
Identity = 294/595 (49.41%), Positives = 413/595 (69.41%), Query Frame = 0

Query: 1   MDHYLGRTLAGYAMKGCKLDRTLQCGVVDLAVSALNEKYGSDLTKEHIRNRLKTWRKQY- 60
           MD  LG+ L     KG K+D+ LQ    D AV ALNE++G DLTKEHIRNRL+TWRKQY 
Sbjct: 191 MDRCLGKILVEQVNKGHKIDKILQREAYDAAVLALNERFGPDLTKEHIRNRLRTWRKQYL 250

Query: 61  ---RNLSHDGFRWDETRKVIIANNSVWDDYVK-----------ILQ-------------- 120
                LSH GF+WD  +K+IIA++SVWDDYVK            +Q              
Sbjct: 251 ILKELLSHSGFKWDAMQKMIIASDSVWDDYVKTHPDARIFRNRFIQNYDQLFIIFGDSHE 310

Query: 121 ----MEALDFPIAANDGKTGCERNSLRWTREMDHCLRRVVMQHVILGDKGMVDNKFSPLV 180
               ++ +D       GK      ++RWT EMD CL +V+++ VILG+K  +DNKF P  
Sbjct: 311 AAEPVDVIDVSPVRCGGKVKDLGKNVRWTFEMDRCLGKVLVEQVILGNKNRLDNKFKPAA 370

Query: 181 YDAAILDLRESLALELTKEQVEDRFNSWKREYGLIRDLLDQGDFEWDDHQKMLVAKDSVW 240
           Y+AA+L ++E   L+LTK+ V +R  +WK++Y ++++LLDQ DFEWD+ +KM++A DS W
Sbjct: 371 YEAAVLAIKERFHLDLTKDHVRNRLKTWKKQYDILQELLDQRDFEWDERRKMVIANDSAW 430

Query: 241 DASIERNPDTRHLRGKVIENYNELCAIAGCDNPSESSLNAAANSLDL-SVDEAINARDVC 300
           +  I+ NPD R ++G+VI NY ELC I GC++P ESS+N A N+LDL + +EA+ A +  
Sbjct: 431 NEYIKINPDARTVQGRVINNYEELCVIIGCNDPPESSVNIAENNLDLIAENEAVVAEEKY 490

Query: 301 HNQSNRAADNENYVTWTKEMDTCLLKMLVKQVSLGNKIDKNFKPAAYTAALTFLNERFAL 360
           +N+ + A D   Y++WT EMD CL ++LV+QV LGNK+DKNFKP AY AALT LNE+F L
Sbjct: 491 YNEVDNAKDKVKYISWTDEMDRCLTQLLVQQVMLGNKLDKNFKPVAYMAALTVLNEKFGL 550

Query: 361 DLTKENVKSRLNTWKKQYGIVKSLLSHDGFEWDEKHKMIVATDFDWTAHTKGHLDVQELQ 420
           DLTKEN+++RL TWKKQYG+VK LLSH GFEWD+++KM+VATD DW  + K + D ++L+
Sbjct: 551 DLTKENIRNRLKTWKKQYGLVKELLSHGGFEWDDRYKMVVATDSDWNEYIKRYPDARQLR 610

Query: 421 AKTIENYNELCMIFGNEEKTEGW-SIGE--KLDKDRTLDNHNHTELQVGISDDDAGGGDG 480
           A++IENY++L +I GNE     W   G   +L+ + T ++  H E  V +  ++    + 
Sbjct: 611 ARSIENYDDLRIIVGNEAPDGHWFEAGSTLRLEGNSTFNDEEHVETPVQMFANEEMSHED 670

Query: 481 CSDADSMEASSQQTGTRPSSSSHSRKSLKRSCNGDLMVQIMSVMAANVARIADALSD--R 540
            S  D M+ SSQQT  RPSSSSHS++ LKR  + D+M+++MS MAA++ RIADAL++  +
Sbjct: 671 TS--DGMQGSSQQTRARPSSSSHSKRLLKRRRSSDVMLKMMSAMAADIGRIADALTENNK 730

Query: 541 PTCLDQVFDVVQTMPGLDDNLILDACEFLSLDDKRAVMFMKLDERLRRKWLLKKL 557
             CLD++F++VQT+PG DD+LI++ACE+LS D++RA+MFMKL+ERLR+KWLLK+L
Sbjct: 731 TVCLDELFEMVQTIPGFDDDLIIEACEYLSFDERRAMMFMKLNERLRKKWLLKRL 783
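
The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length. The sketch below recomputes them from a header line. `parse_hsp_stats` and `percent` are hypothetical helpers, and the pattern only assumes the line layout used on this page; it accepts both "Positives" and the "Postives" spelling.

```python
import re

# Matches e.g. "Identity = 294/595 (49.41%), Positives = 413/595 (69.41%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def percent(count, total):
    """Format count/total as a percentage with two decimals."""
    return f"{100 * count / total:.2f}"


def parse_hsp_stats(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"no identity statistics found in: {line!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return percent(ident, ident_len), percent(pos, pos_len)


line = "Identity = 294/595 (49.41%), Positives = 413/595 (69.41%), Query Frame = 0"
print(parse_hsp_stats(line))  # ('49.41', '69.41')
```

For the first hit, 294 identical and 413 similar positions over an alignment length of 595 give 49.41% and 69.41%, in agreement with the values reported in the header.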

BLAST of CcUC03G053630.1 vs. NCBI nr
Match: XP_023877154.1 (uncharacterized protein LOC111989590 [Quercus suber])

HSP 1 Score: 570.1 bits (1468), Expect = 2.1e-158
Identity = 290/595 (48.74%), Positives = 411/595 (69.08%), Query Frame = 0

Query: 1   MDHYLGRTLAGYAMKGCKLDRTLQCGVVDLAVSALNEKYGSDLTKEHIRNRLKTWRKQY- 60
           MD  LG+ L     KG K+D+ LQ    D AV ALNE++G DLTKEHIRNRL+TWRKQY 
Sbjct: 191 MDRCLGKILVEQVNKGHKIDKILQREAYDAAVLALNERFGPDLTKEHIRNRLRTWRKQYL 250

Query: 61  ---RNLSHDGFRWDETRKVIIANNSVWDDYVK-----------ILQ-------------- 120
                LSH+GF+WD  +K+IIA++SVWDDYVK            +Q              
Sbjct: 251 ILKELLSHNGFKWDAMQKMIIASDSVWDDYVKTHPDARIFRNRFIQNYDQLFIIFGDSHE 310

Query: 121 ----MEALDFPIAANDGKTGCERNSLRWTREMDHCLRRVVMQHVILGDKGMVDNKFSPLV 180
               ++ +D       GK      ++RWT EMD CL +V+++ VILG+K  +DNKF P  
Sbjct: 311 AAEPVDVIDVSPVRCGGKAKDLGKNVRWTFEMDRCLGKVLVEQVILGNKNRLDNKFKPAA 370

Query: 181 YDAAILDLRESLALELTKEQVEDRFNSWKREYGLIRDLLDQGDFEWDDHQKMLVAKDSVW 240
           Y+AA+L ++E   L+LTK+ V +R  +WK+++ ++++LLDQ DFEWD+ +KM++A DS W
Sbjct: 371 YEAAVLAIKERFHLDLTKDHVRNRLKTWKKQFDILQELLDQRDFEWDERRKMVIANDSAW 430

Query: 241 DASIERNPDTRHLRGKVIENYNELCAIAGCDNPSESSLNAAANSLDL-SVDEAINARDVC 300
           +  ++ NPD R ++G+VI NY ELC I GC++P ESS+N A N+LDL + +EA+ A +  
Sbjct: 431 NEYVKINPDARTVQGRVINNYEELCVIIGCNDPPESSVNIAENNLDLIAENEAVVAEETY 490

Query: 301 HNQSNRAADNENYVTWTKEMDTCLLKMLVKQVSLGNKIDKNFKPAAYTAALTFLNERFAL 360
           +N+ + A D   Y++WT EMD CL ++LV+QV LGNK+DKNFKP AY AA+T LNE+F L
Sbjct: 491 YNEVDNAKDKGKYISWTDEMDRCLTQLLVQQVMLGNKLDKNFKPVAYMAAVTVLNEKFGL 550

Query: 361 DLTKENVKSRLNTWKKQYGIVKSLLSHDGFEWDEKHKMIVATDFDWTAHTKGHLDVQELQ 420
           DLTKEN+++RL TWKKQYG+VK LLS  GF+WDE++KM+VATD DW  + K + D ++LQ
Sbjct: 551 DLTKENIRNRLKTWKKQYGLVKELLSQGGFKWDERYKMVVATDSDWNEYIKRYPDARQLQ 610

Query: 421 AKTIENYNELCMIFGNEEKTEGW---SIGEKLDKDRTLDNHNHTELQVGISDDDAGGGDG 480
           A++IENY++L +I GNE     W       +L  + T ++  H E  V +  ++    + 
Sbjct: 611 ARSIENYDDLRIIVGNEAPDGHWFEAGATLRLQGNSTFNDEEHVETPVQMFANEEMSHED 670

Query: 481 CSDADSMEASSQQTGTRPSSSSHSRKSLKRSCNGDLMVQIMSVMAANVARIADALSD--R 540
            S  D M+ SSQQT  RPSSSSHS++ LKR  + D+M+++MS MAA++ RIADAL++  +
Sbjct: 671 TS--DGMQGSSQQTRARPSSSSHSKRLLKRRRSSDVMLKMMSAMAADIGRIADALTENNK 730

Query: 541 PTCLDQVFDVVQTMPGLDDNLILDACEFLSLDDKRAVMFMKLDERLRRKWLLKKL 557
             CLD++F++VQT+PG DD+LI++ACE+LS D++RA+MFMKL+ERLR+KWLLK+L
Sbjct: 731 TVCLDELFEMVQTIPGFDDDLIIEACEYLSFDERRAIMFMKLNERLRKKWLLKRL 783

BLAST of CcUC03G053630.1 vs. NCBI nr
Match: KAF3973412.1 (hypothetical protein CMV_003146 [Castanea mollissima])

HSP 1 Score: 568.2 bits (1463), Expect = 8.1e-158
Identity = 291/595 (48.91%), Positives = 409/595 (68.74%), Query Frame = 0

Query: 1   MDHYLGRTLAGYAMKGCKLDRTLQCGVVDLAVSALNEKYGSDLTKEHIRNRLKTWRKQY- 60
           MD  LG+ L     KG K+D+ LQ    D AV ALNE++G DLTKEHIRNRL+TWRKQY 
Sbjct: 205 MDRCLGKILVEQVNKGNKIDKILQREAYDAAVLALNERFGPDLTKEHIRNRLRTWRKQYL 264

Query: 61  ---RNLSHDGFRWDETRKVIIANNSVWDDYVK-----------ILQ-------------- 120
                LSH GF+WD  +K+IIA++SVWDDYVK            +Q              
Sbjct: 265 ILKELLSHSGFKWDAMQKMIIASDSVWDDYVKTHPDARIFRNRFIQNYDQLFIIFGDSHE 324

Query: 121 ----MEALDFPIAANDGKTGCERNSLRWTREMDHCLRRVVMQHVILGDKGMVDNKFSPLV 180
               ++ +D       GK      ++RWT EMD CL +V+++ VILG+K  +DNKF P  
Sbjct: 325 AAEPVDVIDVSPVRCGGKAKDLGKNVRWTFEMDRCLGKVLVEQVILGNKNRLDNKFKPAA 384

Query: 181 YDAAILDLRESLALELTKEQVEDRFNSWKREYGLIRDLLDQGDFEWDDHQKMLVAKDSVW 240
           Y+AA+  ++E   L+LTK+ V +R  +WK++Y ++++LLDQ DFEWD+ +KM++A DS  
Sbjct: 385 YEAAVFTIKERFHLDLTKDHVRNRLKTWKKQYDILQELLDQRDFEWDERRKMVIANDSAC 444

Query: 241 DASIERNPDTRHLRGKVIENYNELCAIAGCDNPSESSLNAAANSLDL-SVDEAINARDVC 300
           +  ++ NPD R ++G+VI NY ELC I GC++P ESS+N A N+LDL + +EA+ A +  
Sbjct: 445 NEYVKINPDARTVQGRVINNYEELCVIIGCNDPPESSVNIAENNLDLIAENEAVVAEETY 504

Query: 301 HNQSNRAADNENYVTWTKEMDTCLLKMLVKQVSLGNKIDKNFKPAAYTAALTFLNERFAL 360
           +N+ + A D   Y++WT EMD CL ++LV+QV LGNK+DKNFKP AY AALT LNE+F L
Sbjct: 505 YNEVDNAKDKGKYISWTDEMDRCLTQLLVQQVMLGNKLDKNFKPVAYMAALTVLNEKFGL 564

Query: 361 DLTKENVKSRLNTWKKQYGIVKSLLSHDGFEWDEKHKMIVATDFDWTAHTKGHLDVQELQ 420
           DLTKEN+++RL TWKKQYG+VK LLSH GFEWDE++KM+VATD DW  + K   D ++L+
Sbjct: 565 DLTKENIRNRLKTWKKQYGLVKELLSHGGFEWDERYKMVVATDSDWNEYIKRSPDARQLR 624

Query: 421 AKTIENYNELCMIFGNEEKTEGW---SIGEKLDKDRTLDNHNHTELQVGISDDDAGGGDG 480
           A++IENY++L +I GNE     W       +L+ + T ++  H E  V +  ++    + 
Sbjct: 625 ARSIENYDDLRIIVGNEAPDGHWFEAGATLRLEGNSTFNDEEHVETPVQMFANEEMSHED 684

Query: 481 CSDADSMEASSQQTGTRPSSSSHSRKSLKRSCNGDLMVQIMSVMAANVARIADALSD--R 540
            S  D M+ SSQQT  RPSSSSHS++ LKR  + D+M+++MS MAA++ RIADAL++  +
Sbjct: 685 TS--DGMQGSSQQTRARPSSSSHSKRLLKRRRSSDVMLKMMSAMAADIGRIADALAENNK 744

Query: 541 PTCLDQVFDVVQTMPGLDDNLILDACEFLSLDDKRAVMFMKLDERLRRKWLLKKL 557
             CLD++F++VQT+PG DD+LI++ACE+LS D++RA+MFMKL+ERLR+KWLLK+L
Sbjct: 745 TVCLDELFEMVQTIPGFDDDLIIEACEYLSFDERRAMMFMKLNERLRKKWLLKRL 797

BLAST of CcUC03G053630.1 vs. NCBI nr
Match: KAA8550002.1 (hypothetical protein F0562_001686 [Nyssa sinensis])

HSP 1 Score: 529.3 bits (1362), Expect = 4.2e-146
Identity = 288/612 (47.06%), Positives = 385/612 (62.91%), Query Frame = 0

Query: 1   MDHYLGRTLAGYAMKGCKLDRTLQCGVVDLAVSALNEKYGSDLTKEHIRNRLKTWRKQY- 60
           MD  LG+ L     KG K+D  +Q    + AV+ALNEK+G D+TK+HI+NRLKTW+KQY 
Sbjct: 209 MDRCLGKILVEEVEKGHKVDNIIQTEAYNTAVTALNEKFGPDITKDHIKNRLKTWKKQYG 268

Query: 61  ---RNLSHDGFRWDETRKVIIANNSVWDDYVKILQ------------------------- 120
                LSH GF+WDE RK++I N+S W+DY+K                            
Sbjct: 269 ILKELLSHIGFKWDEARKMVIGNDSAWNDYIKTHHDAHPFRGRVVENYDHLCIIFGNNHA 328

Query: 121 ---------------------MEALD-FPIAANDGKTGCERNSLRWTREMDHCLRRVVMQ 180
                                +EA++  PI    G    E+N + WT EMD CL  ++++
Sbjct: 329 TGSYSRTVDDIVHSLAGDSEGVEAINASPIRCYSGLRDEEKN-MEWTNEMDRCLSTILVK 388

Query: 181 HVILGDKGMVDNKFSPLVYDAAILDLRESLALELTKEQVEDRFNSWKREYGLIRDLLDQG 240
            V LG+K  +DNKF P  Y AA+L L E   L+ T + V +R  +WK+ YG ++++LDQ 
Sbjct: 389 QVKLGNKSKLDNKFKPAAYAAAVLALSERFQLDFTNDHVRNRIKTWKKLYGSVKEILDQS 448

Query: 241 DFEWDDHQKMLVAKDSVWDASIERNPDTRHLRGKVIENYNELCAIAGCDNPSESSLNAAA 300
           +F+WD  +KM+   DSVW   I+ NPD R L G+VIENY+ELCAI G DNP+ESS N A 
Sbjct: 449 EFKWDKERKMITTNDSVWHDYIKINPDARLLHGRVIENYDELCAIIGNDNPTESSKNDAE 508

Query: 301 NSLDLSVD-EAINARDVCHNQSNRAADNENYVTWTKEMDTCLLKMLVKQVSLGNKIDKNF 360
             +D + D E +       +Q + A +   Y+ WT EMD CL++ LV+QV LGNK++KNF
Sbjct: 509 ADMDWAADNEDVETEVAYQSQRDNAKERGKYIIWTDEMDCCLMEKLVEQVKLGNKLEKNF 568

Query: 361 KPAAYTAALTFLNERFALDLTKENVKSRLNTWKKQYGIVKSLLSHDGFEWDEKHKMIVAT 420
           KP AYTA LT LNE F LDLTKEN+KSRL TWKK YG+VK +LSH GF WDEK KM+VAT
Sbjct: 569 KPVAYTAVLTALNENFVLDLTKENIKSRLKTWKKVYGLVKEVLSHRGFVWDEKRKMVVAT 628

Query: 421 DFDWTAHTKGHLDVQELQAKTIENYNELCMIFGNEEKTEGWS-IGEKLDKDRTLDNHNHT 480
           D  W  + K H D + L+A++IENY+EL +I  N+  T  +S  G K D +   +N  H 
Sbjct: 629 DSVWNEYIKMHPDAKFLRARSIENYDELRIIIDNDHATRSFSNAGTKGDVNPASNNEEHE 688

Query: 481 ELQV-GISDDDAGGGDGCSDADSMEASSQQTGTRPSSSSHSRKSLKRSCNGDLMVQIMSV 540
           E  +  +  D+    D  +DA  M+ SSQQT  RPSSSSHS++  K+   GDLMV++MS 
Sbjct: 689 ETPLQNVFVDEEMSKDNTNDA--MQGSSQQTRARPSSSSHSKQPSKKRHGGDLMVEMMSA 748

Query: 541 MAANVARIADAL--SDRPTCLDQVFDVVQTMPGLDDNLILDACEFLSLDDKRAVMFMKLD 557
           MAAN+ RIADAL  S++   LD++F++VQ +PG DD+LI++ACEFLS D+KRA MF+KLD
Sbjct: 749 MAANIGRIADALTGSNQSVSLDELFEMVQHIPGFDDDLIIEACEFLSFDEKRAKMFLKLD 808

BLAST of CcUC03G053630.1 vs. NCBI nr
Match: EOY32978.1 (Uncharacterized protein TCM_040985 [Theobroma cacao])

HSP 1 Score: 508.4 bits (1308), Expect = 7.6e-140
Identity = 268/588 (45.58%), Positives = 383/588 (65.14%), Query Frame = 0

Query: 1   MDHYLGRTLAGYAMKGCKLDRTLQCGVVDLAVSALNEKYGSDLTKEHIRNRLKTWRKQY- 60
           MD+YLG++L     +G KLD TLQ    D A+S LNEK G +LTK+HIRNRL+TW+KQY 
Sbjct: 191 MDYYLGKSLVEKVKEGYKLDGTLQREAYDAALSTLNEKIGLELTKDHIRNRLRTWKKQYV 250

Query: 61  ---RNLSHDGFRWDETRKVIIANNSVWDDYVK-----------ILQ-------------- 120
              +  SH GF+WD+TRK+IIA+ SVW  YVK           +++              
Sbjct: 251 VLKQLFSHPGFKWDKTRKMIIADGSVWTTYVKAHPDARIYRGRVIENYDNLCTIFGSDNE 310

Query: 121 -MEALDFPIAANDGKTGCERNSLRWTREMDHCLRRVVMQHVILGDKGMVDNKFSPLVYDA 180
             E +D     N  K   +  ++ WT EMD  L +V+++ V LG+K  +DNK  PL Y+A
Sbjct: 311 VAEGVDISPLQNGVKVKDQAKNMMWTYEMDQYLSKVLVEQVKLGNKSKLDNKLRPLAYEA 370

Query: 181 AILDLRESLALELTKEQVEDRFNSWKREYGLIRDLLDQGDFEWDDHQKMLVAKDSVWDAS 240
           A+  L +   L+LTKE + +R  +WK++Y ++++LL   +FEWD  Q M++A DS W+  
Sbjct: 371 AVSALSKRFQLDLTKEHIRNRLKTWKKQYEILKELLHHSEFEWDKTQNMVIANDSAWNRY 430

Query: 241 IERNPDTRHLRGKVIENYNELCAIAGCDNPSESSLNAAANSLDLSV-DEAINARDVCHNQ 300
           I+  PD R  RG+VI NY EL AI GCD+  ESSLN++ + ++L+  +EA +  ++ + Q
Sbjct: 431 IKITPDARSFRGRVIRNYYELFAIFGCDDLPESSLNSSNDDVNLTANNEAADTEELFYGQ 490

Query: 301 SNRAADNENYVTWTKEMDTCLLKMLVKQVSLGNKIDKNFKPAAYTAALTFLNERFALDLT 360
           S+ A D   Y+ WT EMD CL + LV+QV++GNK  K+FKP A+ AAL+ LN++F+LDLT
Sbjct: 491 SDVAKDKGKYILWTDEMDQCLTEQLVQQVTIGNKHQKSFKPVAFRAALSVLNKKFSLDLT 550

Query: 361 KENVKSRLNTWKKQYGIVKSLLSHDGFEWDEKHKMIVATDFDWTAHTKGHLDVQELQAKT 420
            EN+ +RL TWKKQY +VK LLS  GFEWDE  KM++A D +W    K + DV  ++ + 
Sbjct: 551 TENIGNRLRTWKKQYRLVKELLSQRGFEWDEGQKMVIANDSEWRECIKRNPDVSRIRGRC 610

Query: 421 IENYNELCMIFGNEEKTEGWSIGEKLDKDRTLDNHNHTELQVGISDDDAGGGDGCSDADS 480
           I+N++EL +I GNE     WS       +   +N    +  V +  D+  G D     D 
Sbjct: 611 IDNFSELNIIVGNELAVGHWSEAGDRVVNPIQNNEEPVDAPVQVVVDEEMGHDNTD--DD 670

Query: 481 MEASSQQTGTRPSSSSHSRKSLKRSCNGDLMVQIMSVMAANVARIADALSD-RPTCLDQV 540
           M+ SSQQT  RPSSSSH++++LKR    D+M+++MS MAAN+ RIADAL++ +  CLD++
Sbjct: 671 MQVSSQQTRARPSSSSHAKEALKRRRTSDVMLEMMSDMAANIGRIADALTESKAVCLDEL 730

Query: 541 FDVVQTMPGLDDNLILDACEFLSLDDKRAVMFMKLDERLRRKWLLKKL 557
           F +VQ++P  DD+LI+DACE+LS D+KRA+MF+KLDERLR+KWLLK+L
Sbjct: 731 FQMVQSIPEFDDDLIIDACEYLSFDEKRAMMFVKLDERLRKKWLLKRL 776

BLAST of CcUC03G053630.1 vs. ExPASy TrEMBL
Match: A0A7N2KMQ1 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 573.9 bits (1478), Expect = 7.1e-160
Identity = 294/595 (49.41%), Positives = 413/595 (69.41%), Query Frame = 0

Query: 1   MDHYLGRTLAGYAMKGCKLDRTLQCGVVDLAVSALNEKYGSDLTKEHIRNRLKTWRKQY- 60
           MD  LG+ L     KG K+D+ LQ    D AV ALNE++G DLTKEHIRNRL+TWRKQY 
Sbjct: 191 MDRCLGKILVEQVNKGHKIDKILQREAYDAAVLALNERFGPDLTKEHIRNRLRTWRKQYL 250

Query: 61  ---RNLSHDGFRWDETRKVIIANNSVWDDYVK-----------ILQ-------------- 120
                LSH GF+WD  +K+IIA++SVWDDYVK            +Q              
Sbjct: 251 ILKELLSHSGFKWDAMQKMIIASDSVWDDYVKTHPDARIFRNRFIQNYDQLFIIFGDSHE 310

Query: 121 ----MEALDFPIAANDGKTGCERNSLRWTREMDHCLRRVVMQHVILGDKGMVDNKFSPLV 180
               ++ +D       GK      ++RWT EMD CL +V+++ VILG+K  +DNKF P  
Sbjct: 311 AAEPVDVIDVSPVRCGGKVKDLGKNVRWTFEMDRCLGKVLVEQVILGNKNRLDNKFKPAA 370

Query: 181 YDAAILDLRESLALELTKEQVEDRFNSWKREYGLIRDLLDQGDFEWDDHQKMLVAKDSVW 240
           Y+AA+L ++E   L+LTK+ V +R  +WK++Y ++++LLDQ DFEWD+ +KM++A DS W
Sbjct: 371 YEAAVLAIKERFHLDLTKDHVRNRLKTWKKQYDILQELLDQRDFEWDERRKMVIANDSAW 430

Query: 241 DASIERNPDTRHLRGKVIENYNELCAIAGCDNPSESSLNAAANSLDL-SVDEAINARDVC 300
           +  I+ NPD R ++G+VI NY ELC I GC++P ESS+N A N+LDL + +EA+ A +  
Sbjct: 431 NEYIKINPDARTVQGRVINNYEELCVIIGCNDPPESSVNIAENNLDLIAENEAVVAEEKY 490

Query: 301 HNQSNRAADNENYVTWTKEMDTCLLKMLVKQVSLGNKIDKNFKPAAYTAALTFLNERFAL 360
           +N+ + A D   Y++WT EMD CL ++LV+QV LGNK+DKNFKP AY AALT LNE+F L
Sbjct: 491 YNEVDNAKDKVKYISWTDEMDRCLTQLLVQQVMLGNKLDKNFKPVAYMAALTVLNEKFGL 550

Query: 361 DLTKENVKSRLNTWKKQYGIVKSLLSHDGFEWDEKHKMIVATDFDWTAHTKGHLDVQELQ 420
           DLTKEN+++RL TWKKQYG+VK LLSH GFEWD+++KM+VATD DW  + K + D ++L+
Sbjct: 551 DLTKENIRNRLKTWKKQYGLVKELLSHGGFEWDDRYKMVVATDSDWNEYIKRYPDARQLR 610

Query: 421 AKTIENYNELCMIFGNEEKTEGW-SIGE--KLDKDRTLDNHNHTELQVGISDDDAGGGDG 480
           A++IENY++L +I GNE     W   G   +L+ + T ++  H E  V +  ++    + 
Sbjct: 611 ARSIENYDDLRIIVGNEAPDGHWFEAGSTLRLEGNSTFNDEEHVETPVQMFANEEMSHED 670

Query: 481 CSDADSMEASSQQTGTRPSSSSHSRKSLKRSCNGDLMVQIMSVMAANVARIADALSD--R 540
            S  D M+ SSQQT  RPSSSSHS++ LKR  + D+M+++MS MAA++ RIADAL++  +
Sbjct: 671 TS--DGMQGSSQQTRARPSSSSHSKRLLKRRRSSDVMLKMMSAMAADIGRIADALTENNK 730

Query: 541 PTCLDQVFDVVQTMPGLDDNLILDACEFLSLDDKRAVMFMKLDERLRRKWLLKKL 557
             CLD++F++VQT+PG DD+LI++ACE+LS D++RA+MFMKL+ERLR+KWLLK+L
Sbjct: 731 TVCLDELFEMVQTIPGFDDDLIIEACEYLSFDERRAMMFMKLNERLRKKWLLKRL 783

BLAST of CcUC03G053630.1 vs. ExPASy TrEMBL
Match: A0A2N9FX33 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS19734 PE=4 SV=1)

HSP 1 Score: 566.2 bits (1458), Expect = 1.5e-157
Identity = 288/592 (48.65%), Positives = 408/592 (68.92%), Query Frame = 0

Query: 1   MDHYLGRTLAGYAMKGCKLDRTLQCGVVDLAVSALNEKYGSDLTKEHIRNRLKTWRKQY- 60
           MD  LG+ L     KG K+D+ LQ    D AV  LNE++G +L+KEHIRNRL+TWRKQY 
Sbjct: 191 MDRCLGKILVEQVRKGHKIDKILQREAYDAAVLDLNERFGPELSKEHIRNRLRTWRKQYL 250

Query: 61  ---RNLSHDGFRWDETRKVIIANNSVWDDYVK------------ILQMEAL--------- 120
                LSH+GF+WDE +K+IIA++S+WDDYVK            I   + L         
Sbjct: 251 ILNELLSHNGFKWDEMQKMIIASDSIWDDYVKTHPDARIFRNRFIQNYDQLYIIFGNYNE 310

Query: 121 ---DFPIAAN----DGKTGCERNSLRWTREMDHCLRRVVMQHVILGDKGMVDNKFSPLVY 180
                PI A+     GK   +  ++RWT EMD CL +V+++ VILG+K  +DNKF P  Y
Sbjct: 311 TREPIPIDASPVQCGGKARDQGKNMRWTYEMDRCLGKVLVEQVILGNKNKLDNKFKPAAY 370

Query: 181 DAAILDLRESLALELTKEQVEDRFNSWKREYGLIRDLLDQGDFEWDDHQKMLVAKDSVWD 240
           +AA+L +++   ++L K+ V +R  +WK++Y ++++LLDQ  FEWD  +KM++A DS W+
Sbjct: 371 EAAVLAIKKQFHIDLMKDHVRNRLKTWKKQYDILQELLDQSGFEWDGRRKMVIANDSAWN 430

Query: 241 ASIERNPDTRHLRGKVIENYNELCAIAGCDNPSESSLNAAANSLDLSVD-EAINARDVCH 300
             ++ NPD R ++G+VI NY ELC I G ++P ESSLN A N+LDL V+ EA+ A +  +
Sbjct: 431 EYLKINPDARTVQGRVINNYEELCVIIGYNDPPESSLNIAENNLDLIVENEAVVAEEAYY 490

Query: 301 NQSNRAADNENYVTWTKEMDTCLLKMLVKQVSLGNKIDKNFKPAAYTAALTFLNERFALD 360
           N+ + A D   Y++WT EMD CL ++LV+QV LGNK++KNFKP AY  ALT LNE+F LD
Sbjct: 491 NEIDNAKDKGKYISWTDEMDRCLTQLLVEQVMLGNKLEKNFKPVAYMTALTVLNEKFGLD 550

Query: 361 LTKENVKSRLNTWKKQYGIVKSLLSHDGFEWDEKHKMIVATDFDWTAHTKGHLDVQELQA 420
           LT+EN+++RL TWKKQYG+VK LLSH GFEWDE++KM+VA D DW  + K H D ++L+A
Sbjct: 551 LTRENIRNRLKTWKKQYGLVKELLSHSGFEWDERYKMVVAPDSDWNEYIKRHPDARQLRA 610

Query: 421 KTIENYNELCMIFGNEEKTEGWS-IGEKLDKDRTLDNHNHTELQVGISDDDAGGGDGCSD 480
           ++IENY+EL +I GNE     WS  G +L+ + T ++  H E    +  ++    D  S 
Sbjct: 611 RSIENYDELRIIVGNEPPGRHWSEAGARLEGNSTFNDEEHVETPAQMFGNEEMSQDNAS- 670

Query: 481 ADSMEASSQQTGTRPSSSSHSRKSLKRSCNGDLMVQIMSVMAANVARIADALSD--RPTC 540
            D M+ SS QT  RPSSSS+S++ LKR  + D M+++MS MAA++ RIADAL++  +  C
Sbjct: 671 -DGMQGSSHQTRARPSSSSYSKQLLKRRRSSDAMLEMMSAMAADIGRIADALTENNKTVC 730

Query: 541 LDQVFDVVQTMPGLDDNLILDACEFLSLDDKRAVMFMKLDERLRRKWLLKKL 557
           LD++F++VQT+PG DD+LI++ACE+LS D++RA+MFMKL+ERLR+KWLLK+L
Sbjct: 731 LDELFEMVQTIPGFDDDLIIEACEYLSFDERRAMMFMKLNERLRKKWLLKRL 780

BLAST of CcUC03G053630.1 vs. ExPASy TrEMBL
Match: A0A5J5C7S2 (Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_001686 PE=4 SV=1)

HSP 1 Score: 529.3 bits (1362), Expect = 2.0e-146
Identity = 288/612 (47.06%), Positives = 385/612 (62.91%), Query Frame = 0

Query: 1   MDHYLGRTLAGYAMKGCKLDRTLQCGVVDLAVSALNEKYGSDLTKEHIRNRLKTWRKQY- 60
           MD  LG+ L     KG K+D  +Q    + AV+ALNEK+G D+TK+HI+NRLKTW+KQY 
Sbjct: 209 MDRCLGKILVEEVEKGHKVDNIIQTEAYNTAVTALNEKFGPDITKDHIKNRLKTWKKQYG 268

Query: 61  ---RNLSHDGFRWDETRKVIIANNSVWDDYVKILQ------------------------- 120
                LSH GF+WDE RK++I N+S W+DY+K                            
Sbjct: 269 ILKELLSHIGFKWDEARKMVIGNDSAWNDYIKTHHDAHPFRGRVVENYDHLCIIFGNNHA 328

Query: 121 ---------------------MEALD-FPIAANDGKTGCERNSLRWTREMDHCLRRVVMQ 180
                                +EA++  PI    G    E+N + WT EMD CL  ++++
Sbjct: 329 TGSYSRTVDDIVHSLAGDSEGVEAINASPIRCYSGLRDEEKN-MEWTNEMDRCLSTILVK 388

Query: 181 HVILGDKGMVDNKFSPLVYDAAILDLRESLALELTKEQVEDRFNSWKREYGLIRDLLDQG 240
            V LG+K  +DNKF P  Y AA+L L E   L+ T + V +R  +WK+ YG ++++LDQ 
Sbjct: 389 QVKLGNKSKLDNKFKPAAYAAAVLALSERFQLDFTNDHVRNRIKTWKKLYGSVKEILDQS 448

Query: 241 DFEWDDHQKMLVAKDSVWDASIERNPDTRHLRGKVIENYNELCAIAGCDNPSESSLNAAA 300
           +F+WD  +KM+   DSVW   I+ NPD R L G+VIENY+ELCAI G DNP+ESS N A 
Sbjct: 449 EFKWDKERKMITTNDSVWHDYIKINPDARLLHGRVIENYDELCAIIGNDNPTESSKNDAE 508

Query: 301 NSLDLSVD-EAINARDVCHNQSNRAADNENYVTWTKEMDTCLLKMLVKQVSLGNKIDKNF 360
             +D + D E +       +Q + A +   Y+ WT EMD CL++ LV+QV LGNK++KNF
Sbjct: 509 ADMDWAADNEDVETEVAYQSQRDNAKERGKYIIWTDEMDCCLMEKLVEQVKLGNKLEKNF 568

Query: 361 KPAAYTAALTFLNERFALDLTKENVKSRLNTWKKQYGIVKSLLSHDGFEWDEKHKMIVAT 420
           KP AYTA LT LNE F LDLTKEN+KSRL TWKK YG+VK +LSH GF WDEK KM+VAT
Sbjct: 569 KPVAYTAVLTALNENFVLDLTKENIKSRLKTWKKVYGLVKEVLSHRGFVWDEKRKMVVAT 628

Query: 421 DFDWTAHTKGHLDVQELQAKTIENYNELCMIFGNEEKTEGWS-IGEKLDKDRTLDNHNHT 480
           D  W  + K H D + L+A++IENY+EL +I  N+  T  +S  G K D +   +N  H 
Sbjct: 629 DSVWNEYIKMHPDAKFLRARSIENYDELRIIIDNDHATRSFSNAGTKGDVNPASNNEEHE 688

Query: 481 ELQV-GISDDDAGGGDGCSDADSMEASSQQTGTRPSSSSHSRKSLKRSCNGDLMVQIMSV 540
           E  +  +  D+    D  +DA  M+ SSQQT  RPSSSSHS++  K+   GDLMV++MS 
Sbjct: 689 ETPLQNVFVDEEMSKDNTNDA--MQGSSQQTRARPSSSSHSKQPSKKRHGGDLMVEMMSA 748

Query: 541 MAANVARIADAL--SDRPTCLDQVFDVVQTMPGLDDNLILDACEFLSLDDKRAVMFMKLD 557
           MAAN+ RIADAL  S++   LD++F++VQ +PG DD+LI++ACEFLS D+KRA MF+KLD
Sbjct: 749 MAANIGRIADALTGSNQSVSLDELFEMVQHIPGFDDDLIIEACEFLSFDEKRAKMFLKLD 808

BLAST of CcUC03G053630.1 vs. ExPASy TrEMBL
Match: A0A5B7BRF2 (Uncharacterized protein OS=Davidia involucrata OX=16924 GN=Din_039932 PE=4 SV=1)

HSP 1 Score: 518.1 bits (1333), Expect = 4.6e-143
Identity = 282/612 (46.08%), Positives = 384/612 (62.75%), Query Frame = 0

Query: 1   MDHYLGRTLAGYAMKGCKLDRTLQCGVVDLAVSALNEKYGSDLTKEHIRNRLKTWRKQY- 60
           MD  LG+ L     KG K+D  +Q    + AV+ALNEK+G D+TK+HI+NRLKTW+KQY 
Sbjct: 191 MDRCLGKILVEEVEKGRKVDNIIQTEAYNTAVTALNEKFGPDITKDHIKNRLKTWKKQYG 250

Query: 61  ---RNLSHDGFRWDETRKVIIANNSVWDDYVKILQ------------------------- 120
                LSH GF+WDE RK++I ++S+W+DY+K                            
Sbjct: 251 ILKELLSHTGFKWDEARKMVIGDDSIWNDYIKTHHDAHLFRGRVVENYDHLCIIFGNNHA 310

Query: 121 ---------------------MEALD-FPIAANDGKTGCERNSLRWTREMDHCLRRVVMQ 180
                                +EA++  PI    G    E+N ++WT EMD+CL  ++++
Sbjct: 311 TGSYSRTADDIVHSLAGDSEGVEAINASPIRCYSGLRDQEKN-MKWTNEMDYCLSTILVE 370

Query: 181 HVILGDKGMVDNKFSPLVYDAAILDLRESLALELTKEQVEDRFNSWKREYGLIRDLLDQG 240
            V LG+K  +DNKF P  YDAA+  L E   L+ TK+ V +R  +WK+ YG +++LLD  
Sbjct: 371 QVKLGNKSKLDNKFKPAAYDAAVSALSERFQLDFTKDHVRNRIKTWKKLYGSMKELLDHS 430

Query: 241 DFEWDDHQKMLVAKDSVWDASIERNPDTRHLRGKVIENYNELCAIAGCDNPSESSLNAAA 300
           +F+WD+  KM+ A DSVW   I+  PD R L+G VIENY+ELC I G DNP+ESS N A 
Sbjct: 431 EFKWDEELKMVTANDSVWHDYIKIKPDARLLQGLVIENYDELCVIIGNDNPTESSKNDAE 490

Query: 301 NSLDLSVD-EAINARDVCHNQSNRAADNENYVTWTKEMDTCLLKMLVKQVSLGNKIDKNF 360
             +D + D E I       +Q +   +   Y+ WT EMD CL + LV+QV LGNK++KNF
Sbjct: 491 ADMDWAADNEGIETEVAYQSQPDNGKERGKYIIWTDEMDRCLTEKLVEQVKLGNKLEKNF 550

Query: 361 KPAAYTAALTFLNERFALDLTKENVKSRLNTWKKQYGIVKSLLSHDGFEWDEKHKMIVAT 420
           KP AYTA +T LNE FALDLTKEN+KSRL TWKK YG+VK +LSH GF WDE+ KM+VAT
Sbjct: 551 KPVAYTAVVTTLNENFALDLTKENIKSRLKTWKKLYGLVKEVLSHRGFVWDEERKMVVAT 610

Query: 421 DFDWTAHTKGHLDVQELQAKTIENYNELCMIFGNEEKTEGWSI-GEKLDKDRTLDNHNHT 480
           D  W  + K H D + L+A++IE ++EL +I  N   T  + + G K D + T +N  H 
Sbjct: 611 DSVWNEYIKMHPDAKFLRARSIEYFDELRIIIDNNHATGCFCVTGAKGDMNPTSNNEEHE 670

Query: 481 ELQV-GISDDDAGGGDGCSDADSMEASSQQTGTRPSSSSHSRKSLKRSCNGDLMVQIMSV 540
           E  +  +  D+    D  +  +  + SSQQT  RPSSSSHS++  K+    DLMV++MS 
Sbjct: 671 ETPLQNVFVDEEMSKDNTN--NGTQGSSQQTRARPSSSSHSKQPSKKRHGSDLMVEMMST 730

Query: 541 MAANVARIADAL--SDRPTCLDQVFDVVQTMPGLDDNLILDACEFLSLDDKRAVMFMKLD 557
           MAAN+ RIADAL  S++  CLD++F++VQ +PG DD+LI++ACEFLSLD+KRA MF+KLD
Sbjct: 731 MAANIGRIADALTGSNQSVCLDELFEMVQNIPGFDDDLIIEACEFLSLDEKRAKMFLKLD 790

BLAST of CcUC03G053630.1 vs. ExPASy TrEMBL
Match: A0A061GU73 (Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_040985 PE=4 SV=1)

HSP 1 Score: 508.4 bits (1308), Expect = 3.7e-140
Identity = 268/588 (45.58%), Positives = 383/588 (65.14%), Query Frame = 0

Query: 1   MDHYLGRTLAGYAMKGCKLDRTLQCGVVDLAVSALNEKYGSDLTKEHIRNRLKTWRKQY- 60
           MD+YLG++L     +G KLD TLQ    D A+S LNEK G +LTK+HIRNRL+TW+KQY 
Sbjct: 191 MDYYLGKSLVEKVKEGYKLDGTLQREAYDAALSTLNEKIGLELTKDHIRNRLRTWKKQYV 250

Query: 61  ---RNLSHDGFRWDETRKVIIANNSVWDDYVK-----------ILQ-------------- 120
              +  SH GF+WD+TRK+IIA+ SVW  YVK           +++              
Sbjct: 251 VLKQLFSHPGFKWDKTRKMIIADGSVWTTYVKAHPDARIYRGRVIENYDNLCTIFGSDNE 310

Query: 121 -MEALDFPIAANDGKTGCERNSLRWTREMDHCLRRVVMQHVILGDKGMVDNKFSPLVYDA 180
             E +D     N  K   +  ++ WT EMD  L +V+++ V LG+K  +DNK  PL Y+A
Sbjct: 311 VAEGVDISPLQNGVKVKDQAKNMMWTYEMDQYLSKVLVEQVKLGNKSKLDNKLRPLAYEA 370

Query: 181 AILDLRESLALELTKEQVEDRFNSWKREYGLIRDLLDQGDFEWDDHQKMLVAKDSVWDAS 240
           A+  L +   L+LTKE + +R  +WK++Y ++++LL   +FEWD  Q M++A DS W+  
Sbjct: 371 AVSALSKRFQLDLTKEHIRNRLKTWKKQYEILKELLHHSEFEWDKTQNMVIANDSAWNRY 430

Query: 241 IERNPDTRHLRGKVIENYNELCAIAGCDNPSESSLNAAANSLDLSV-DEAINARDVCHNQ 300
           I+  PD R  RG+VI NY EL AI GCD+  ESSLN++ + ++L+  +EA +  ++ + Q
Sbjct: 431 IKITPDARSFRGRVIRNYYELFAIFGCDDLPESSLNSSNDDVNLTANNEAADTEELFYGQ 490

Query: 301 SNRAADNENYVTWTKEMDTCLLKMLVKQVSLGNKIDKNFKPAAYTAALTFLNERFALDLT 360
           S+ A D   Y+ WT EMD CL + LV+QV++GNK  K+FKP A+ AAL+ LN++F+LDLT
Sbjct: 491 SDVAKDKGKYILWTDEMDQCLTEQLVQQVTIGNKHQKSFKPVAFRAALSVLNKKFSLDLT 550

Query: 361 KENVKSRLNTWKKQYGIVKSLLSHDGFEWDEKHKMIVATDFDWTAHTKGHLDVQELQAKT 420
            EN+ +RL TWKKQY +VK LLS  GFEWDE  KM++A D +W    K + DV  ++ + 
Sbjct: 551 TENIGNRLRTWKKQYRLVKELLSQRGFEWDEGQKMVIANDSEWRECIKRNPDVSRIRGRC 610

Query: 421 IENYNELCMIFGNEEKTEGWSIGEKLDKDRTLDNHNHTELQVGISDDDAGGGDGCSDADS 480
           I+N++EL +I GNE     WS       +   +N    +  V +  D+  G D     D 
Sbjct: 611 IDNFSELNIIVGNELAVGHWSEAGDRVVNPIQNNEEPVDAPVQVVVDEEMGHDNTD--DD 670

Query: 481 MEASSQQTGTRPSSSSHSRKSLKRSCNGDLMVQIMSVMAANVARIADALSD-RPTCLDQV 540
           M+ SSQQT  RPSSSSH++++LKR    D+M+++MS MAAN+ RIADAL++ +  CLD++
Sbjct: 671 MQVSSQQTRARPSSSSHAKEALKRRRTSDVMLEMMSDMAANIGRIADALTESKAVCLDEL 730

Query: 541 FDVVQTMPGLDDNLILDACEFLSLDDKRAVMFMKLDERLRRKWLLKKL 557
           F +VQ++P  DD+LI+DACE+LS D+KRA+MF+KLDERLR+KWLLK+L
Sbjct: 731 FQMVQSIPEFDDDLIIDACEYLSFDEKRAMMFVKLDERLRKKWLLKRL 776

BLAST of CcUC03G053630.1 vs. TAIR 10
Match: AT2G24960.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 12 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 187.6 bits (475), Expect = 2.8e-47
Identity = 164/629 (26.07%), Positives = 270/629 (42.93%), Query Frame = 0

Query: 1   MDHYLGRTLAGYAMKGCKLDRTL-QCGVVDLAVSALNEKYGSDLTKEHIRNRLKTWRKQY 60
           MD Y    +     +G K      +   +D+ V   N ++     K  +R+R     K Y
Sbjct: 176 MDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLV-LFNARFSGQYGKRVLRHRYNKLLKYY 235

Query: 61  RN----LSHDGFRWDETRKVIIANNSVWDDYVK------ILQMEALD--------FPIAA 120
           ++    L  DGF WDETR +I A+++VWD Y+K        +M++L         F   A
Sbjct: 236 KDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSYNDLDTIFACQA 295

Query: 121 NDG---------------KTGCERNSLR----WTREMDHCLRRVVMQHVILGDKGMVDNK 180
             G               K   E+NS R    WT  MD+ L  ++++ V  G++  V   
Sbjct: 296 EQGTDHRDDGSAAQTSETKASQEQNSDRTRIFWTPPMDYHLIDLLVEQVNNGNR--VGQT 355

Query: 181 FSPLVYDAAILDLRESLALELTKEQVEDRFNSWKREYGLIRDLLDQGDFEWDDHQKMLVA 240
           F    ++  +         +  K+ +++R+   +R Y  I+ LL+Q  F WD  + M++A
Sbjct: 356 FITSAWNEMVTAFNAKFGSQHNKDVLKNRYKHLRRLYNDIKFLLEQNGFSWDARRDMVIA 415

Query: 241 KDSVWDASIERNPDTRHLRGKVIENYNELCAIAGCDNPSESSLNAAANSLDLSVDEAINA 300
            D +W+  I+ +P+ R  R K I +Y  LC I G +  S+      A + D S  E +  
Sbjct: 416 DDDIWNTYIQAHPEARSYRVKTIPSYPNLCFIFGKET-SDGRYTRLAQAFDPSPAETVRM 475

Query: 301 ---------RDVCHNQSNRAADNEN-----------YVTWTKEMDTCLLKMLVKQVSLGN 360
                    +D    Q      NE             + WT+ MD CL+ ++++QVS GN
Sbjct: 476 NESGSTDGFKDTRSFQKVVYTSNEKNDYPCSNIGPPCIEWTRVMDHCLIDLMLEQVSRGN 535

Query: 361 KIDKNFKPAAYTAALTFLNERFALDLTKENVKSRLNTWKKQYGIVKSLLSHDGFEWDEKH 420
           KI + F   A+       N +F L      +++R     K+   + ++L+ DGF WD + 
Sbjct: 536 KIGETFTEQAWADMAESFNAKFGLQTDMFMLENRYILLMKERDDINNILNLDGFTWDVEK 595

Query: 421 KMIVATDFDWTAHTKGHLDVQELQAKTIENYNELCMIFGNEEKTEGWSIGEKLDKDRTLD 480
           + IVA D  W A+ K H D    + KT+++Y  LC                KL++  + +
Sbjct: 596 QTIVAEDEYWEAYIKEHPDATIYKGKTLDSYGNLC----------------KLNEHLSQE 655

Query: 481 NHNHTELQVGISDDDAGGGDGCSDADSMEASSQQTGTRPS---------------SSSHS 540
           + N   L + + +     G+     D   +  +Q   RP+               +   +
Sbjct: 656 SFNCENLMIELEN----YGNEMEIVDDFSSPHKQQNKRPNPITPPLGIVVCKAQKTGVET 715

Query: 541 RKSLKRSCNGDLMVQIMSVMAANVARIADALSDRPTCLDQVFDVVQTMPGLDDNLILDAC 557
           RK L  +   D             +RI +AL           D +Q +P +DD L+LDAC
Sbjct: 716 RKPLCETEGDDDDCTKPMPQIEIYSRIGNAL-----------DALQALPDMDDELLLDAC 768

BLAST of CcUC03G053630.1 vs. TAIR 10
Match: AT2G24960.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes - 50 (source: NCBI BLink). )

HSP 1 Score: 174.5 bits (441), Expect = 2.4e-43
Identity = 163/652 (25.00%), Positives = 270/652 (41.41%), Query Frame = 0

Query: 1   MDHYLGRTLAGYAMKGCKLDRTL-QCGVVDLAVSALNEKYGSDLTKEHIRNRLKTWRKQY 60
           MD Y    +     +G K      +   +D+ V   N ++     K  +R+R     K Y
Sbjct: 176 MDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLV-LFNARFSGQYGKRVLRHRYNKLLKYY 235

Query: 61  RN----LSHDGFRWDETRKVIIANNSVWDDYVK------ILQMEALD--------FPIAA 120
           ++    L  DGF WDETR +I A+++VWD Y+K        +M++L         F   A
Sbjct: 236 KDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSYNDLDTIFACQA 295

Query: 121 NDG---------------KTGCERNSLR----WTREMDHCLRRVVMQHVILGDKGMVDNK 180
             G               K   E+NS R    WT  MD+ L  ++++ V  G++  V   
Sbjct: 296 EQGTDHRDDGSAAQTSETKASQEQNSDRTRIFWTPPMDYHLIDLLVEQVNNGNR--VGQT 355

Query: 181 FSPLVYDAAILDLRESLALELTKEQVEDRFNSWKREYGLIRDLLDQGDFEWDDHQKMLVA 240
           F    ++  +         +  K+ +++R+   +R Y  I+ LL+Q  F WD  + M++A
Sbjct: 356 FITSAWNEMVTAFNAKFGSQHNKDVLKNRYKHLRRLYNDIKFLLEQNGFSWDARRDMVIA 415

Query: 241 KDSVWDA-----------------------SIERNPDTRHLRGKVIENYNELCAIAGCDN 300
            D +W+                         ++ +P+ R  R K I +Y  LC I G + 
Sbjct: 416 DDDIWNTYIQACHILFLFKISVICLCLQMKHVQAHPEARSYRVKTIPSYPNLCFIFGKET 475

Query: 301 PSESSLNAAANSLDLSVDEAINA---------RDVCHNQSNRAADNEN-----------Y 360
            S+      A + D S  E +           +D    Q      NE             
Sbjct: 476 -SDGRYTRLAQAFDPSPAETVRMNESGSTDGFKDTRSFQKVVYTSNEKNDYPCSNIGPPC 535

Query: 361 VTWTKEMDTCLLKMLVKQVSLGNKIDKNFKPAAYTAALTFLNERFALDLTKENVKSRLNT 420
           + WT+ MD CL+ ++++QVS GNKI + F   A+       N +F L      +++R   
Sbjct: 536 IEWTRVMDHCLIDLMLEQVSRGNKIGETFTEQAWADMAESFNAKFGLQTDMFMLENRYIL 595

Query: 421 WKKQYGIVKSLLSHDGFEWDEKHKMIVATDFDWTAHTKGHLDVQELQAKTIENYNELCMI 480
             K+   + ++L+ DGF WD + + IVA D  W A+ K H D    + KT+++Y  LC  
Sbjct: 596 LMKERDDINNILNLDGFTWDVEKQTIVAEDEYWEAYIKEHPDATIYKGKTLDSYGNLC-- 655

Query: 481 FGNEEKTEGWSIGEKLDKDRTLDNHNHTELQVGISDDDAGGGDGCSDADSMEASSQQTGT 540
                         KL++  + ++ N   L + + +     G+     D   +  +Q   
Sbjct: 656 --------------KLNEHLSQESFNCENLMIELEN----YGNEMEIVDDFSSPHKQQNK 715

Query: 541 RPS---------------SSSHSRKSLKRSCNGDLMVQIMSVMAANVARIADALSDRPTC 557
           RP+               +   +RK L  +   D             +RI +AL      
Sbjct: 716 RPNPITPPLGIVVCKAQKTGVETRKPLCETEGDDDDCTKPMPQIEIYSRIGNAL------ 775

BLAST of CcUC03G053630.1 vs. TAIR 10
Match: AT4G02210.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes - 26 (source: NCBI BLink). )

HSP 1 Score: 161.8 bits (408), Expect = 1.6e-39
Identity = 116/465 (24.95%), Positives = 215/465 (46.24%), Query Frame = 0

Query: 105 KTGCERNSLRWTREMDHCLRRVVMQHVILGDKGMVDNKFSPLVYDAAILDLRESLALELT 164
           + G ER    WT EMD     ++++ V  G++   D+ FS   +                
Sbjct: 4   RNGNERLRTVWTPEMDQYFIELMVEQVRKGNR-FEDHLFSKRAWKFMSCSFTAKFKFLYG 63

Query: 165 KEQVEDRFNSWKREYGLIRDLLDQGDFEWDDHQKMLVAKDSVWDASIERNPDTRHLRGKV 224
           K+ +++R  + +  +  + +LL +  F WDD ++M+VA + VWD  ++ +PD+R  R K 
Sbjct: 64  KDVLKNRHKTLRNLFKSVNNLLIEDGFSWDDTRQMVVADNCVWDEYLKIHPDSRSFRIKS 123

Query: 225 IENYNELCAIAG---CDNPSESSLNAAANSLDLSVDEAINARDVCHNQSNRAADNENYV- 284
           I  Y +LC +      ++ +E S++   +   +  D+  N   +C + + R+    + V 
Sbjct: 124 IPCYKDLCLVYSDGMSEHKAEESISEGESKTLIQEDDGYNR--ICESSTVRSNSKGSSVT 183

Query: 285 ----TWTKEMDTCLLKMLVKQVSLGNKIDKNFKPAAYTAALTFLNERFALDLTKENVKSR 344
               TW   MD   + +++ Q   GN+I+  F+  A+T  +   N +F  +   + +K+R
Sbjct: 184 RCRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNR 243

Query: 345 LNTWKKQYGIVKSLLSHDGFEWDEKHKMIVATDFDWTAHTKGHLDVQELQAKTIENYNEL 404
             + ++Q+  +KS+L  DGF WD + +M+ A +  W  + K H D ++   + I  Y +L
Sbjct: 244 YKSLRRQFNAIKSILRSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDL 303

Query: 405 CMIFGNE--EKTEGWSIGEKLDKD---RTLDNHNHTELQVGISDDDAGGGDGCSDADSME 464
           C++ G+   E+ E +   +  D +   +   +   T+L +   ++D+       D  +  
Sbjct: 304 CVLCGDSGIEENECFVAMDWFDPETEFQEFKSSGTTDLSISAEEEDS--NSLLFDPKNKR 363

Query: 465 ASSQQTGTRPSSSSHSRKSLKRSCNGDLMVQIMSVMAANVARIADALSDRPTCLDQVFDV 524
                T T P +    R             Q MS+                   +   + 
Sbjct: 364 DQLANTDTSPINPKKPRVD---------ETQTMSI-------------------EDTVEA 423

Query: 525 VQTMPGLDDNLILDACEFLSLDDKRAVMFMKLDERLRRKWLLKKL 557
           +Q +P +DD LILDAC+ L  D  +A  F+ LD +LR+KWLL+KL
Sbjct: 424 IQALPDMDDELILDACDLLE-DKLKAKTFLALDVKLRKKWLLRKL 434

BLAST of CcUC03G053630.1 vs. TAIR 10
Match: AT4G02210.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24960.2). )

HSP 1 Score: 161.8 bits (408), Expect = 1.6e-39
Identity = 116/465 (24.95%), Positives = 215/465 (46.24%), Query Frame = 0

Query: 105 KTGCERNSLRWTREMDHCLRRVVMQHVILGDKGMVDNKFSPLVYDAAILDLRESLALELT 164
           + G ER    WT EMD     ++++ V  G++   D+ FS   +                
Sbjct: 4   RNGNERLRTVWTPEMDQYFIELMVEQVRKGNR-FEDHLFSKRAWKFMSCSFTAKFKFLYG 63

Query: 165 KEQVEDRFNSWKREYGLIRDLLDQGDFEWDDHQKMLVAKDSVWDASIERNPDTRHLRGKV 224
           K+ +++R  + +  +  + +LL +  F WDD ++M+VA + VWD  ++ +PD+R  R K 
Sbjct: 64  KDVLKNRHKTLRNLFKSVNNLLIEDGFSWDDTRQMVVADNCVWDEYLKIHPDSRSFRIKS 123

Query: 225 IENYNELCAIAG---CDNPSESSLNAAANSLDLSVDEAINARDVCHNQSNRAADNENYV- 284
           I  Y +LC +      ++ +E S++   +   +  D+  N   +C + + R+    + V 
Sbjct: 124 IPCYKDLCLVYSDGMSEHKAEESISEGESKTLIQEDDGYNR--ICESSTVRSNSKGSSVT 183

Query: 285 ----TWTKEMDTCLLKMLVKQVSLGNKIDKNFKPAAYTAALTFLNERFALDLTKENVKSR 344
               TW   MD   + +++ Q   GN+I+  F+  A+T  +   N +F  +   + +K+R
Sbjct: 184 RCRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNR 243

Query: 345 LNTWKKQYGIVKSLLSHDGFEWDEKHKMIVATDFDWTAHTKGHLDVQELQAKTIENYNEL 404
             + ++Q+  +KS+L  DGF WD + +M+ A +  W  + K H D ++   + I  Y +L
Sbjct: 244 YKSLRRQFNAIKSILRSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDL 303

Query: 405 CMIFGNE--EKTEGWSIGEKLDKD---RTLDNHNHTELQVGISDDDAGGGDGCSDADSME 464
           C++ G+   E+ E +   +  D +   +   +   T+L +   ++D+       D  +  
Sbjct: 304 CVLCGDSGIEENECFVAMDWFDPETEFQEFKSSGTTDLSISAEEEDS--NSLLFDPKNKR 363

Query: 465 ASSQQTGTRPSSSSHSRKSLKRSCNGDLMVQIMSVMAANVARIADALSDRPTCLDQVFDV 524
                T T P +    R             Q MS+                   +   + 
Sbjct: 364 DQLANTDTSPINPKKPRVD---------ETQTMSI-------------------EDTVEA 423

Query: 525 VQTMPGLDDNLILDACEFLSLDDKRAVMFMKLDERLRRKWLLKKL 557
           +Q +P +DD LILDAC+ L  D  +A  F+ LD +LR+KWLL+KL
Sbjct: 424 IQALPDMDDELILDACDLLE-DKLKAKTFLALDVKLRKKWLLRKL 434

BLAST of CcUC03G053630.1 vs. TAIR 10
Match: AT4G02550.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 370 Blast hits to 300 proteins in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 10; Plants - 354; Viruses - 0; Other Eukaryotes - 6 (source: NCBI BLink). )

HSP 1 Score: 116.3 bits (290), Expect = 7.8e-26
Identity = 87/283 (30.74%), Positives = 143/283 (50.53%), Query Frame = 0

Query: 280 VTWTKEMDTCLLKMLVKQVSLGNKIDKNFKPAAYTAALTFLNERFALDLTKENVKSRLNT 339
           V W+  MD CL++ L  Q   GNK+DK F   AYTAA   +N RF L+LT +   +RL T
Sbjct: 20  VIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKT 79

Query: 340 WKKQYGIVKSLLSHDGFEWDEKHKMI-VATDFDWTAHTKGHLDVQELQAKTIENYNELCM 399
            KK+Y +++ +LS DGF W+   KMI   +D  W  +   + D +  + K IE Y EL  
Sbjct: 80  IKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMYEELRT 139

Query: 400 IFGNEEKTEGWSIGEKLDKDRTLDNHNHTELQVGI---SDDDAGGGDGC-SDADSMEASS 459
           + G+ +    ++  +K       D     E  V     S ++    DG  S A + E   
Sbjct: 140 VCGDYQTPGKYNKVKKESSHHLNDVKQFEEDSVSFPLGSSEEHSDTDGTESYAGASEYMH 199

Query: 460 QQTGTRPSSSSHSRKSLKRSCNGDLMVQIMSVMAANVARIADALSDRPTCL--DQVFDVV 519
           +++   P      R+  KRS N D   + M V+A+++ R+ADA+    T +  +++   V
Sbjct: 200 EESQDLPPPRDPLRRPSKRSRNSDPCQEAMLVVASSIRRLADAVVQSKTLINTEELLKAV 259

Query: 520 QTMPGLDDNLILDACEFLSLDDKRAVMFMKLDERLRRKWLLKK 556
             +  L++   + A E+L+ D  +A  FM  + R+R+ +L ++
Sbjct: 260 MEIDELEEAKQMYAFEYLNGDPVKARAFMAYNNRMRKMFLFRQ 302

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_030959168.1	1.5e-159	49.41	uncharacterized protein LOC115981123 [Quercus lobata]
XP_023877154.1	2.1e-158	48.74	uncharacterized protein LOC111989590 [Quercus suber]
KAF3973412.1	8.1e-158	48.91	hypothetical protein CMV_003146 [Castanea mollissima]
KAA8550002.1	4.2e-146	47.06	hypothetical protein F0562_001686 [Nyssa sinensis]
EOY32978.1	7.6e-140	45.58	Uncharacterized protein TCM_040985 [Theobroma cacao]
Match Name	E-value	Identity	Description
A0A7N2KMQ1	7.1e-160	49.41	Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1
A0A2N9FX33	1.5e-157	48.65	Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS19734 PE=4 SV=1
A0A5J5C7S2	2.0e-146	47.06	Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_001686 PE=4 SV=1
A0A5B7BRF2	4.6e-143	46.08	Uncharacterized protein OS=Davidia involucrata OX=16924 GN=Din_039932 PE=4 SV=1
A0A061GU73	3.7e-140	45.58	Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_040985 PE=4 SV=1
Match Name	E-value	Identity	Description
AT2G24960.2	2.8e-47	26.07	unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G24960.1	2.4e-43	25.00	unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G02210.1	1.6e-39	24.95	unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G02210.2	1.6e-39	24.95	unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G02550.1	7.8e-26	30.74	unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 45..65
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 428..471
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 448..470
None	No IPR available	PANTHER	PTHR46929:SF12	MYB/SANT-LIKE DNA-BINDING DOMAIN PROTEIN	coord: 105..264
None	No IPR available	PANTHER	PTHR46929:SF12	MYB/SANT-LIKE DNA-BINDING DOMAIN PROTEIN	coord: 1..89; coord: 268..557
None	No IPR available	PANTHER	PTHR46929	EXPRESSED PROTEIN	coord: 105..264; coord: 268..557
None	No IPR available	PANTHER	PTHR46929	EXPRESSED PROTEIN	coord: 1..89
IPR024752	Myb/SANT-like domain	PFAM	PF12776	Myb_DNA-bind_3	coord: 114..209, e-value: 6.0E-16, score: 59.2; coord: 1..86, e-value: 7.4E-11, score: 42.9; coord: 281..375, e-value: 7.9E-23, score: 81.3

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name	Unique Name	Type
CcUC03G053630	CcUC03G053630	gene


The following exon feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CcUC03G053630.1-exon	CcUC03G053630.1-exon-CicolChr03:23229539..23230303	exon
CcUC03G053630.1-exon	CcUC03G053630.1-exon-CicolChr03:23232313..23232807	exon
CcUC03G053630.1-exon	CcUC03G053630.1-exon-CicolChr03:23232943..23233314	exon
CcUC03G053630.1-exon	CcUC03G053630.1-exon-CicolChr03:23234397..23234944	exon


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CcUC03G053630.1-three_prime_utr	CcUC03G053630.1-three_prime_utr-CicolChr03:23229539..23229757	three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CcUC03G053630.1-cds	CcUC03G053630.1-cds-CicolChr03:23229758..23230303	CDS
CcUC03G053630.1-cds	CcUC03G053630.1-cds-CicolChr03:23232313..23232807	CDS
CcUC03G053630.1-cds	CcUC03G053630.1-cds-CicolChr03:23232943..23233314	CDS
CcUC03G053630.1-cds	CcUC03G053630.1-cds-CicolChr03:23234397..23234660	CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CcUC03G053630.1-five_prime_utr	CcUC03G053630.1-five_prime_utr-CicolChr03:23234661..23234944	five_prime_UTR
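
The exon, CDS and UTR coordinates listed for this mRNA can be cross-checked against the sequence length of 2180 given in the Overview. The Python sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive (so a feature spanning start..end covers end - start + 1 bases), and the helper name total_length is hypothetical, not part of any database tooling.

```python
# Feature coordinates copied from the tables on this page.
# Assumption: coordinates are 1-based and inclusive on CicolChr03.
EXONS = [
    (23229539, 23230303),
    (23232313, 23232807),
    (23232943, 23233314),
    (23234397, 23234944),
]
CDS = [
    (23229758, 23230303),
    (23232313, 23232807),
    (23232943, 23233314),
    (23234397, 23234660),
]
THREE_PRIME_UTR = [(23229539, 23229757)]
FIVE_PRIME_UTR = [(23234661, 23234944)]


def total_length(intervals):
    """Sum the lengths of 1-based inclusive (start, end) intervals."""
    return sum(end - start + 1 for start, end in intervals)


exon_len = total_length(EXONS)
cds_len = total_length(CDS)
utr_len = total_length(THREE_PRIME_UTR) + total_length(FIVE_PRIME_UTR)

print("exons:", exon_len)           # spliced mRNA length
print("CDS:", cds_len)              # coding bases
print("UTRs:", utr_len)             # 5' + 3' untranslated bases
print("CDS + UTRs:", cds_len + utr_len)
```

Under that assumption the four exons sum to 2180 bases, matching the stated sequence length, and the CDS plus the two UTRs account for the same total.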


The following polypeptide feature(s) derive from this mRNA:

Feature Name	Unique Name	Type
CcUC03G053630.1	CcUC03G053630.1-protein	polypeptide