Clc05G01970 (gene) Watermelon (cordophanus) v2

Overview
NameClc05G01970
Typegene
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of unknown function (DUF604)
LocationClcChr05: 1345148 .. 1348797 (+)
RNA-Seq ExpressionClc05G01970
SyntenyClc05G01970
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAAATTTAACGGCTAAAATTGTAATTCAATCTCATAAATGCTTACGGACAGTACACACAAATGAAGCAAACCGCCATGGAATTCCATTAATTTTCTTCATTTCAATACGCTAAGATTTGAATCAATTTGAGTGAAACAAGCAAATTTTCGAAAACAAACTAAGTTCGAATCGCGAGCTTCAAATGTAGCGAACAGATTACTGATTTTCCATCACTTTCTCGAGAAATTTCTGCGCGGGAATTCGTAGTTTCATCAGCAATGAAAATCCGTCAGAAAAATGCACAAGATCGTCAACCGGAGAGAAATCCCCTCACCGTCTCGATTAAATATCTTCCGAAGTCGATGCTTTACTTTCTCATTGCATTCTCCTCGATCATCTGTTTATTTTACTCTCCAAAATTTCTCTATTATTCTTCATTTTATTGTCACAGCCGTGGATCTTCATCTTCATCCATGAATATCGGCAACTCTGTTCTTGATCCGGCACCAGAATCGCCATTGCCGAAACCTCGAGAGGAAACAAATCTCTCTCATGTAGTTTTCGGCATCGCCGCTTCCGCTAAGCTCTGGAATCAAAGAAAAAATTACATCAAGCTATGGTGGAGATCCAACGAAATGCAAGGAATCGTCTGGCTAGATGAGGCAGTGGAAAAAGCAGAAGACGATCATCTCCTCCCGACGGTAAGAATTTCCGGCGATACTTCTGAATTTGTTTACGGAAATCCGAAAGGGCACAGATCTGCGATAAGGATTTCGCGGATTGTCTCTGAAACTTTGAGGCTAGGGTTTTCCGCTAGGGCAGAGGTACGGTGGCTAGTAATGGGCGACGATGACACTTTTTTTGCCGTCCATAACTTGGTTAAGGTGCTGCGGAAGTACGATGATAATCAATTTTACTATATTGGAAGTTCGTCGGAAAGTCATTTACAGAATATGCATTTCTCTTACAATATGGCTTACGGCGGAGGCGGATTCGCCATAAGTTATCCATTGGCGAAAGAACTGGAGAAAATGCAGGATAATTGCTTGCAGAGGTATCCGAAATTGTACGGTTCCGATGATCGGATTCAGGCTTGTATGGCGGAGCTCGGCGTTCCTCTCACCAGAGAACCAGGATTTCATCAGGTGCGTTTTCATCGCTTTCCAATTCCATTATTAACTCTACATTCACACTCCTACTTCACCTTGATTCTAGGTTTGTCCTCAAACCATTAATAATTTAGTCCACTAAAATTAGTTGATAACCATTTAATCTCAATTTACTTTTTGAAGTTTAATGAGTCTCATCTAATTTTATTATAATTAAGATTTATTAAAATGTCTTATACATGCAGATAGAAAGATTCGAAGTCAAATTATTTATAAATTAGATCTTCCTGTTAAAAATTTCAAAGGTCAAAATGGATTAAATCATTATCAATTAAAATTTATAGACTAAATTTTTATAAAAATAAAAATTTAGGAATTGAAGTAGATGTAATTTGACCTTATCTAAGTAACAGAAGCTAGTAATAATATATATACAAAATTTTGAAGTAATAATTAATAGCATTTGGCTAAGTGGTTTTCCTATGAAGTATTTGTGCATATGACCTTAGGTGGTAGGGACACTAAAGTCCGTTAATTTACACTCTATTGTTGACCTTAGCATTATGAATAGAGAATCAAAAGTTAATCACTTTTAGGGTGTGTTTTGAAATACCTTTTCAAATGTTTAATTTTAAAATAAGTCATTTAGAAAGAAATATAAGTGCTTCCTAATCACTTAAAATGGATTTAAAAGTTATTAATATTTTTCTTTTAAACAATTCTTATAAAATTGTTTAAAAGAAAATGAATTTTTGAGAAACATTTATTTTTTAAGTTAATACAAACAGGCCATCAATACTTCCTAAATTGACGGTTGAAAACCGGCACAATACTTCAATTTCAGAATTCCTCTAACCAAATTTAATTTAAGAAGTTTAGTTATTGAATTTGAGATTTATATCTATTTAATTTCTTAACTTTAAAAATTAGTTAATTGGTCTATGAACTTTTAATTTTATATAAAATAGATCCTTAAACCTTAAATTTCACATATTCGATATTTTAGAACATTCATGATATCATTAGACACAAATTTGAATGTTTAGAATTCCATTGGACAGACTTTTCCAAAAAAAAAAAAAAAAAAAATTCCATTAGACGTAAAATTCAATTTTATACCTAAATACATCAATTGATTTTTTCTTTTTTAAAAAGTCATATGCAATATTGGACTTTGACACATAAATGAATTCATAAGGTACATAAAATTAAAAGTTCAGGAACTTATTAGACATTTTTAAAAGTTGAGGAACCGAATAGTTACAAATCTCAAACTTTAGTAACTAATAATTTAATTAATTAAATAAATATATATTTGAAGGTTAAGTTGCATTTTTAGTCCCCATGATATTGAGAAAGTTAGATTTTGATCCTTATGATTTATATTTGAAATTTAGTTCATATAATTTGACAAAACTCTCACAAATAGTCCCTATAGTAGAGACTATGTATAATAATTTTATCCAATCATAAAGATTATTTACAAAGTTTTATCAAACATAAGGATTAAACACTAACTTTTAAAACCATGGATTAAATTCTAACTTTTTCAAACCAAATTTATAATTAAAGCTTATTTTAATCCACATTTATCACGATATTGTGATTAGTGCGATGTTTACGGTAATCTGTTTGGGCTTCTAACGGCCCATCCATTGGCACCTTTGGTCTCTCTTCACCACTTGGACATAGTTGATCCAATATTCCCAGTAATGGGCCGTTTTGAGGCCCTAAAGAAACTTGGGCCCGCAATGAATCTGGACTCAGCTGGGCTCATGCAGCAATCCATATGCTACCACAAATCTCGGCTTTGGACTGTTTCCATTGCATGGGGCTACGCTGTTCAAATATTTCTCACCAAATTATTCCCTAAGGACGTCCAACTTCCAGCCACCACCTTCCTCAATTGGTACCCGAACGCCGGCACCTCCGCCCACGCCTTCAACACCCGATCGATTAACCTCCACGCTTGCCAAACTCCCTTTGTTTATTATTTACACGACATCACGTTCGATGCAAGTATGAATCGGACGGTCAGCCGATACACCCGCTACCGAATACCTGAACCCGAATGCAAGTGGAAAATGTTGGATAATTCAATGATAGAAAGAGTCGATGTTTTGAAAAAGGCCGACCCATTCTTATGGGACAAGGTAGTAGAAGGAAATGAAATTATATATTTAATGTAGTAACTTGACAATTTATTTAATGTTATTATTATTTTGTATGTTGTTTAGGCTCCGAGAAGGCAATGCTGTAGAATTTTGGGGACAGAAAAACAAGGTGAAATGTTAGTTGAAATAGGAGAATGCGAAGAAGATGAAATAATTGGGTAATTTCATTGCCCTTTTAATTCTCTTTATATAATAAAGATATTTAGGTTATATTAGATGGTTTTTTTTCTTCCATATAATCCTTTTTCTTTCCTCCCTTGTTTTTTTCTCTCCATTCAGGGACGTTTGGTATTTGACAACTTTCATTATCCCTAATTGTGTAAGAGAAAAATGGAAATAGAGACGGAGAAAAAAAATATATATATATATACAACTAATCCATATA

mRNA sequence

AAAAAATTTAACGGCTAAAATTGTAATTCAATCTCATAAATGCTTACGGACAGTACACACAAATGAAGCAAACCGCCATGGAATTCCATTAATTTTCTTCATTTCAATACGCTAAGATTTGAATCAATTTGAGTGAAACAAGCAAATTTTCGAAAACAAACTAAGTTCGAATCGCGAGCTTCAAATGTAGCGAACAGATTACTGATTTTCCATCACTTTCTCGAGAAATTTCTGCGCGGGAATTCGTAGTTTCATCAGCAATGAAAATCCGTCAGAAAAATGCACAAGATCGTCAACCGGAGAGAAATCCCCTCACCGTCTCGATTAAATATCTTCCGAAGTCGATGCTTTACTTTCTCATTGCATTCTCCTCGATCATCTGTTTATTTTACTCTCCAAAATTTCTCTATTATTCTTCATTTTATTGTCACAGCCGTGGATCTTCATCTTCATCCATGAATATCGGCAACTCTGTTCTTGATCCGGCACCAGAATCGCCATTGCCGAAACCTCGAGAGGAAACAAATCTCTCTCATGTAGTTTTCGGCATCGCCGCTTCCGCTAAGCTCTGGAATCAAAGAAAAAATTACATCAAGCTATGGTGGAGATCCAACGAAATGCAAGGAATCGTCTGGCTAGATGAGGCAGTGGAAAAAGCAGAAGACGATCATCTCCTCCCGACGGTAAGAATTTCCGGCGATACTTCTGAATTTGTTTACGGAAATCCGAAAGGGCACAGATCTGCGATAAGGATTTCGCGGATTGTCTCTGAAACTTTGAGGCTAGGGTTTTCCGCTAGGGCAGAGGTACGGTGGCTAGTAATGGGCGACGATGACACTTTTTTTGCCGTCCATAACTTGGTTAAGGTGCTGCGGAAGTACGATGATAATCAATTTTACTATATTGGAAGTTCGTCGGAAAGTCATTTACAGAATATGCATTTCTCTTACAATATGGCTTACGGCGGAGGCGGATTCGCCATAAGTTATCCATTGGCGAAAGAACTGGAGAAAATGCAGGATAATTGCTTGCAGAGGTATCCGAAATTGTACGGTTCCGATGATCGGATTCAGGCTTGTATGGCGGAGCTCGGCGTTCCTCTCACCAGAGAACCAGGATTTCATCAGTGCGATGTTTACGGTAATCTGTTTGGGCTTCTAACGGCCCATCCATTGGCACCTTTGGTCTCTCTTCACCACTTGGACATAGTTGATCCAATATTCCCAGTAATGGGCCGTTTTGAGGCCCTAAAGAAACTTGGGCCCGCAATGAATCTGGACTCAGCTGGGCTCATGCAGCAATCCATATGCTACCACAAATCTCGGCTTTGGACTGTTTCCATTGCATGGGGCTACGCTGTTCAAATATTTCTCACCAAATTATTCCCTAAGGACGTCCAACTTCCAGCCACCACCTTCCTCAATTGGTACCCGAACGCCGGCACCTCCGCCCACGCCTTCAACACCCGATCGATTAACCTCCACGCTTGCCAAACTCCCTTTGTTTATTATTTACACGACATCACGTTCGATGCAAGTATGAATCGGACGGTCAGCCGATACACCCGCTACCGAATACCTGAACCCGAATGCAAGTGGAAAATGTTGGATAATTCAATGATAGAAAGAGTCGATGTTTTGAAAAAGGCCGACCCATTCTTATGGGACAAGGCTCCGAGAAGGCAATGCTGTAGAATTTTGGGGACAGAAAAACAAGGTGAAATGTTAGTTGAAATAGGAGAATGCGAAGAAGATGAAATAATTGGGTAATTTCATTGCCCTTTTAATTCTCTTTATATAATAAAGATATTTAGGTTATATTAGATGGTTTTTTTTCTTCCATATAATCCTTTTTCTTTCCTCCCTTGTTTTTTTCTCTCCATTCAGGGACGTTTGGTATTTGACAACTTTCATTATCCCTAATTGTGTAAGAGAAAAATGGAAATAGAGACGGAGAAAAAAAATATATATATATATACAACTAATCCATATA

Coding sequence (CDS)

ATGAAAATCCGTCAGAAAAATGCACAAGATCGTCAACCGGAGAGAAATCCCCTCACCGTCTCGATTAAATATCTTCCGAAGTCGATGCTTTACTTTCTCATTGCATTCTCCTCGATCATCTGTTTATTTTACTCTCCAAAATTTCTCTATTATTCTTCATTTTATTGTCACAGCCGTGGATCTTCATCTTCATCCATGAATATCGGCAACTCTGTTCTTGATCCGGCACCAGAATCGCCATTGCCGAAACCTCGAGAGGAAACAAATCTCTCTCATGTAGTTTTCGGCATCGCCGCTTCCGCTAAGCTCTGGAATCAAAGAAAAAATTACATCAAGCTATGGTGGAGATCCAACGAAATGCAAGGAATCGTCTGGCTAGATGAGGCAGTGGAAAAAGCAGAAGACGATCATCTCCTCCCGACGGTAAGAATTTCCGGCGATACTTCTGAATTTGTTTACGGAAATCCGAAAGGGCACAGATCTGCGATAAGGATTTCGCGGATTGTCTCTGAAACTTTGAGGCTAGGGTTTTCCGCTAGGGCAGAGGTACGGTGGCTAGTAATGGGCGACGATGACACTTTTTTTGCCGTCCATAACTTGGTTAAGGTGCTGCGGAAGTACGATGATAATCAATTTTACTATATTGGAAGTTCGTCGGAAAGTCATTTACAGAATATGCATTTCTCTTACAATATGGCTTACGGCGGAGGCGGATTCGCCATAAGTTATCCATTGGCGAAAGAACTGGAGAAAATGCAGGATAATTGCTTGCAGAGGTATCCGAAATTGTACGGTTCCGATGATCGGATTCAGGCTTGTATGGCGGAGCTCGGCGTTCCTCTCACCAGAGAACCAGGATTTCATCAGTGCGATGTTTACGGTAATCTGTTTGGGCTTCTAACGGCCCATCCATTGGCACCTTTGGTCTCTCTTCACCACTTGGACATAGTTGATCCAATATTCCCAGTAATGGGCCGTTTTGAGGCCCTAAAGAAACTTGGGCCCGCAATGAATCTGGACTCAGCTGGGCTCATGCAGCAATCCATATGCTACCACAAATCTCGGCTTTGGACTGTTTCCATTGCATGGGGCTACGCTGTTCAAATATTTCTCACCAAATTATTCCCTAAGGACGTCCAACTTCCAGCCACCACCTTCCTCAATTGGTACCCGAACGCCGGCACCTCCGCCCACGCCTTCAACACCCGATCGATTAACCTCCACGCTTGCCAAACTCCCTTTGTTTATTATTTACACGACATCACGTTCGATGCAAGTATGAATCGGACGGTCAGCCGATACACCCGCTACCGAATACCTGAACCCGAATGCAAGTGGAAAATGTTGGATAATTCAATGATAGAAAGAGTCGATGTTTTGAAAAAGGCCGACCCATTCTTATGGGACAAGGCTCCGAGAAGGCAATGCTGTAGAATTTTGGGGACAGAAAAACAAGGTGAAATGTTAGTTGAAATAGGAGAATGCGAAGAAGATGAAATAATTGGGTAA

Protein sequence

MKIRQKNAQDRQPERNPLTVSIKYLPKSMLYFLIAFSSIICLFYSPKFLYYSSFYCHSRGSSSSSMNIGNSVLDPAPESPLPKPREETNLSHVVFGIAASAKLWNQRKNYIKLWWRSNEMQGIVWLDEAVEKAEDDHLLPTVRISGDTSEFVYGNPKGHRSAIRISRIVSETLRLGFSARAEVRWLVMGDDDTFFAVHNLVKVLRKYDDNQFYYIGSSSESHLQNMHFSYNMAYGGGGFAISYPLAKELEKMQDNCLQRYPKLYGSDDRIQACMAELGVPLTREPGFHQCDVYGNLFGLLTAHPLAPLVSLHHLDIVDPIFPVMGRFEALKKLGPAMNLDSAGLMQQSICYHKSRLWTVSIAWGYAVQIFLTKLFPKDVQLPATTFLNWYPNAGTSAHAFNTRSINLHACQTPFVYYLHDITFDASMNRTVSRYTRYRIPEPECKWKMLDNSMIERVDVLKKADPFLWDKAPRRQCCRILGTEKQGEMLVEIGECEEDEIIG
Homology
BLAST of Clc05G01970 vs. NCBI nr
Match: XP_038876379.1 (uncharacterized protein LOC120068817 [Benincasa hispida])

HSP 1 Score: 869.8 bits (2246), Expect = 1.2e-248
Identity = 420/475 (88.42%), Postives = 441/475 (92.84%), Query Frame = 0

Query: 1   MKIRQKNAQDRQPERNPLTVSIKYLPKSMLYFLIAFSSIICLFYSPKFLYYSSFYCHSRG 60
           MKIRQKNA DR  ERNPLT SIK LP  MLYFL+ F S+ICLFYSPKFLYYSSFYC SRG
Sbjct: 1   MKIRQKNAHDRPLERNPLTASIKSLPMLMLYFLLMFFSMICLFYSPKFLYYSSFYCRSRG 60

Query: 61  SSSSSMNIGNSVLDPAPESPLPKPREETNLSHVVFGIAASAKLWNQRKNYIKLWWRSNEM 120
           SSSSSM I +SVLDPAPES LPKPREETN+SHV+FGIAASAKLWN RKNYIKLWWRSNEM
Sbjct: 61  SSSSSMKIDDSVLDPAPESLLPKPREETNISHVIFGIAASAKLWNHRKNYIKLWWRSNEM 120

Query: 121 QGIVWLDEAVEKAEDDHLLPTVRISGDTSEFVYGNPKGHRSAIRISRIVSETLRLGFSAR 180
           +GIVWLDEAVE A DD  LPTV IS DTSEF+Y NP+GHRSAIRISRIVSETLRLG  AR
Sbjct: 121 RGIVWLDEAVESAGDDDFLPTVMISSDTSEFLYENPEGHRSAIRISRIVSETLRLGLFAR 180

Query: 181 AEVRWLVMGDDDTFFAVHNLVKVLRKYDDNQFYYIGSSSESHLQNMHFSYNMAYGGGGFA 240
           AEVRW+VMGDDDTFFAV NLVKVLRKYDDNQFYYIGSSSESHLQNMHFSYNMAYGGGGFA
Sbjct: 181 AEVRWIVMGDDDTFFAVDNLVKVLRKYDDNQFYYIGSSSESHLQNMHFSYNMAYGGGGFA 240

Query: 241 ISYPLAKELEKMQDNCLQRYPKLYGSDDRIQACMAELGVPLTREPGFHQCDVYGNLFGLL 300
           ISYPLAKELEKMQDNCLQRYPKLYGSDDRIQACMAELGVPLTREPGFHQCDVYGNLFGLL
Sbjct: 241 ISYPLAKELEKMQDNCLQRYPKLYGSDDRIQACMAELGVPLTREPGFHQCDVYGNLFGLL 300

Query: 301 TAHPLAPLVSLHHLDIVDPIFPVMGRFEALKKLGPAMNLDSAGLMQQSICYHKSRLWTVS 360
           TAHPLAPLVSLHHLDIVDPIFP MGRFEAL+KLGP MNLDSAGLMQQSICYHKS LWTVS
Sbjct: 301 TAHPLAPLVSLHHLDIVDPIFPEMGRFEALQKLGPPMNLDSAGLMQQSICYHKSGLWTVS 360

Query: 361 IAWGYAVQIFLTKLFPKDVQLPATTFLNWYPNAGTSAHAFNTRSINLHACQTPFVYYLHD 420
           IAWGYAVQIFLTKLFP+DVQLPATTFLNWYPNAGTSAHAFNTRSI+L ACQTPFVYYLH+
Sbjct: 361 IAWGYAVQIFLTKLFPRDVQLPATTFLNWYPNAGTSAHAFNTRSISLDACQTPFVYYLHN 420

Query: 421 ITFDASMNRTVSRYTRYRIPEPECKWKMLDNSMIERVDVLKKADPFLWDKAPRRQ 476
           ITF+AS N+TVSRYT YRIPEPECKWKM D++MIE VDVL+K DP LWDKAP+++
Sbjct: 421 ITFNASTNQTVSRYTHYRIPEPECKWKMSDHAMIEIVDVLRKPDPLLWDKAPKKE 475

BLAST of Clc05G01970 vs. NCBI nr
Match: XP_022147245.1 (uncharacterized protein LOC111016241 [Momordica charantia])

HSP 1 Score: 752.7 bits (1942), Expect = 2.1e-213
Identity = 366/486 (75.31%), Postives = 407/486 (83.74%), Query Frame = 0

Query: 1   MKIRQKNAQDRQPERNPLTVSIKYLPKSMLYFLIAFSSIICLFYSPKFLYYSSFYCHSRG 60
           MKI +K A+DR  ++NPL  SIK LP  M Y L+  SSI+CLFYS KF+ YSS  C+SR 
Sbjct: 1   MKISEKTAEDRPQQKNPLAASIKSLPILMFYLLLLISSIVCLFYSAKFV-YSSSSCNSR- 60

Query: 61  SSSSSMNIGNSVLDPAPESPLPKPREETNLSHVVFGIAASAKLWNQRKNYIKLWWRSNEM 120
             S S  IG+SVLDP PE+P PKP +ETNLSHVVFGIAASAKLWN RKNYI LWWRS+EM
Sbjct: 61  -ISFSKEIGDSVLDPEPETPSPKPNQETNLSHVVFGIAASAKLWNHRKNYIMLWWRSSEM 120

Query: 121 QGIVWLDEAVEKAEDDHLLPTVRISGDTSEFVYGNPKGHRSAIRISRIVSETLRLGFSAR 180
           +GI+WLDE +E A+DDHLLP+ RIS DTS+F Y NP+GHRSAIR SRIVSE+LRLG   R
Sbjct: 121 RGIIWLDEPIESADDDHLLPSTRISADTSKFAYRNPEGHRSAIRTSRIVSESLRLGIFGR 180

Query: 181 AEVRWLVMGDDDTFFAVHNLVKVLRKYDDNQFYYIGSSSESHLQNMHFSYNMAYGGGGFA 240
           AEVRWLVMGDDDTFFAV NLV+VLRKYD NQFYYIG SSESHLQNMHFSY+MAYGGGGFA
Sbjct: 181 AEVRWLVMGDDDTFFAVDNLVQVLRKYDHNQFYYIGGSSESHLQNMHFSYDMAYGGGGFA 240

Query: 241 ISYPLAKELEKMQDNCLQRYPKLYGSDDRIQACMAELGVPLTREPGFHQCDVYGNLFGLL 300
           ISYPLAKELEKMQD+CLQRYP LYGSDDRIQACMAELGVPLTREPGFHQCDVYG+LFGLL
Sbjct: 241 ISYPLAKELEKMQDSCLQRYPNLYGSDDRIQACMAELGVPLTREPGFHQCDVYGDLFGLL 300

Query: 301 TAHPLAPLVSLHHLDIVDPIFPVMGRFEALKKLGPAMNLDSAGLMQQSICYHKSRLWTVS 360
           TAHPLAPLVSLHHLDIVDPIFP MGR +ALK+LG  M LD AGLMQQSICY KS  WTVS
Sbjct: 301 TAHPLAPLVSLHHLDIVDPIFPEMGRVQALKRLGSPMKLDPAGLMQQSICYDKSGAWTVS 360

Query: 361 IAWGYAVQIFLTKLFPKDVQLPATTFLNWYPNAGTSAHAFNTRSINLHACQTPFVYYLHD 420
           I+WGYAVQIF   LFP++VQLPATTFLNWY +AG+SAHAFNTR +    CQ PFVYYL  
Sbjct: 361 ISWGYAVQIFRGILFPRNVQLPATTFLNWYRSAGSSAHAFNTRPVTADVCQKPFVYYLRH 420

Query: 421 ITFDASMNRTVSRYTRYRIPEPECKWKMLDNSMIERVDVLKKADPFLWDKAPRRQCCRIL 480
           + F AS N T+S Y+ YR+ EPECKWK+LD + +E+V V K+ D  LWDKAPRRQCCRIL
Sbjct: 421 VAFGASTNMTISDYSWYRVHEPECKWKLLDRT-VEKVVVWKRPDSSLWDKAPRRQCCRIL 480

Query: 481 GTEKQG 487
             EKQG
Sbjct: 481 AMEKQG 482

BLAST of Clc05G01970 vs. NCBI nr
Match: XP_007051739.2 (PREDICTED: uncharacterized protein LOC18614099 isoform X1 [Theobroma cacao])

HSP 1 Score: 611.3 bits (1575), Expect = 7.5e-171
Identity = 289/468 (61.75%), Postives = 356/468 (76.07%), Query Frame = 0

Query: 34  IAFSSIICLFYSPKFLYYSSFYCHSRGSSSSSMNIGNSVLDPAPESPLPKPREETNLSHV 93
           I   SI+C+FY+  F   S+          +   I   V+ P   SP PKP+E+T L H+
Sbjct: 35  ILLISILCIFYTLSFSNVSNSSNQELKIIKTIHGIDQDVIPPV-SSPKPKPQEKTGLQHI 94

Query: 94  VFGIAASAKLWNQRKNYIKLWWRSNEMQGIVWLDEAVEKAEDDHLLPTVRISGDTSEFVY 153
           VFGIAASA+LW+ RKNYIKLWW+  EM+GIVWLD+AVE  +DDHLLP ++IS DT EF Y
Sbjct: 95  VFGIAASARLWDHRKNYIKLWWKPQEMRGIVWLDKAVENGDDDHLLPPIKISRDTPEFKY 154

Query: 154 GNPKGHRSAIRISRIVSETLRLGFSARAEVRWLVMGDDDTFFAVHNLVKVLRKYDDNQFY 213
            NPKGHRSAIRISRIVSETLRLG      VRW VMGDDDTFF   NLV+VL KYD NQFY
Sbjct: 155 RNPKGHRSAIRISRIVSETLRLGLEG---VRWFVMGDDDTFFVPDNLVRVLSKYDHNQFY 214

Query: 214 YIGSSSESHLQNMHFSYNMAYGGGGFAISYPLAKELEKMQDNCLQRYPKLYGSDDRIQAC 273
           YIGSSSESHLQN++FSY MAYGGGGFAISYPLAK LEKMQD C+QRYP+LYGSDDRI AC
Sbjct: 215 YIGSSSESHLQNINFSYGMAYGGGGFAISYPLAKALEKMQDRCIQRYPRLYGSDDRIHAC 274

Query: 274 MAELGVPLTREPGFHQCDVYGNLFGLLTAHPLAPLVSLHHLDIVDPIFPVMGRFEALKKL 333
           MAELGV LT+E GFHQ DVYG+L GLL AHP+APLVS+HHLD+V PIFP + R +AL++L
Sbjct: 275 MAELGVALTKERGFHQYDVYGSLLGLLGAHPVAPLVSIHHLDVVQPIFPNVNRVQALQRL 334

Query: 334 GPAMNLDSAGLMQQSICYHKSRLWTVSIAWGYAVQIFLTKLFPKDVQLPATTFLNWYPNA 393
              +NLDSA +MQQS+CY K+R WT+S++WGYAVQI+      +++++PA TFLNWY  A
Sbjct: 335 KVPINLDSAAVMQQSVCYDKTRSWTISVSWGYAVQIYRGIFSVREMEMPARTFLNWYRRA 394

Query: 394 GTSAHAFNTRSINLHACQTPFVYYLHDITFDASMNRTVSRYTRYRIPEPECKWKMLDNSM 453
             +  +FNTR  + + CQ PFVYYL +   + + N+T S Y ++++   ECKW+M D S 
Sbjct: 395 DYTGFSFNTRPFSRNVCQKPFVYYLSNALHNKNTNQTASEYVQHQVSSSECKWRMADPSR 454

Query: 454 IERVDVLKKADPFLWDKAPRRQCCRILGTEKQGEMLVEIGECEEDEII 502
           IERV+V KK DP LWDK+PRR CCR+L T+K+G M++++GEC EDE+I
Sbjct: 455 IERVEVYKKPDPNLWDKSPRRNCCRVLPTKKKGTMVIDVGECGEDEVI 498

BLAST of Clc05G01970 vs. NCBI nr
Match: XP_017637457.1 (PREDICTED: uncharacterized protein LOC108479398 [Gossypium arboreum])

HSP 1 Score: 609.4 bits (1570), Expect = 2.8e-170
Identity = 297/499 (59.52%), Postives = 364/499 (72.95%), Query Frame = 0

Query: 12  QPERNPLTVSIKYLPKSMLYFLIAFSSIICLFYSPKFLYYSSFYCHSRGSSSSSMNIGN- 71
           Q  RN  T   K+L  S+L       SI+C+FY+  F            SS+  +NI   
Sbjct: 15  QTMRNHFTTFPKFLLSSILLI-----SILCIFYTVSF----------SNSSNKDLNIITA 74

Query: 72  --------SVLDPAPESPLPKPREETNLSHVVFGIAASAKLWNQRKNYIKLWWRSNEMQG 131
                   +V  P   SP P P  +T L  +VFGIAASA+LW+ RKNYIKLWW++ +M+G
Sbjct: 75  VHGGREEVAVAPPPVPSPKPSPSSKTTLRQIVFGIAASARLWDHRKNYIKLWWKA-QMRG 134

Query: 132 IVWLDEAVEKAEDDHLLPTVRISGDTSEFVYGNPKGHRSAIRISRIVSETLRLGFSARAE 191
           +VWLD+ V+   DDHLLP   ISGDTS+F Y NPKGHRSAIRISRIVSETLRLG     +
Sbjct: 135 VVWLDKGVKPGIDDHLLPQKMISGDTSKFKYNNPKGHRSAIRISRIVSETLRLGLD---D 194

Query: 192 VRWLVMGDDDTFFAVHNLVKVLRKYDDNQFYYIGSSSESHLQNMHFSYNMAYGGGGFAIS 251
           VRW VMGDDDTFF   NLV+VL KYD NQFYYIGSSSESHLQN++FSY MAYGGGGFAIS
Sbjct: 195 VRWFVMGDDDTFFVPDNLVRVLSKYDHNQFYYIGSSSESHLQNINFSYGMAYGGGGFAIS 254

Query: 252 YPLAKELEKMQDNCLQRYPKLYGSDDRIQACMAELGVPLTREPGFHQCDVYGNLFGLLTA 311
           YPLAK L KMQD C+QRYP LYGSDDRI ACMAELGVPLT+EPGFHQ DV+GNL GLL+A
Sbjct: 255 YPLAKALAKMQDRCIQRYPGLYGSDDRIHACMAELGVPLTKEPGFHQYDVFGNLLGLLSA 314

Query: 312 HPLAPLVSLHHLDIVDPIFPVMGRFEALKKLGPAMNLDSAGLMQQSICYHKSRLWTVSIA 371
           HP+APLVS+HHLD V+PIFP M R +ALK+L   +NLDSA LMQQS+CY K+R WTVS++
Sbjct: 315 HPVAPLVSIHHLDKVEPIFPNMNRVQALKRLNIPINLDSAALMQQSVCYDKTRSWTVSVS 374

Query: 372 WGYAVQIFLTKLFPKDVQLPATTFLNWYPNAGTSAHAFNTRSINLHACQTPFVYYLHDIT 431
           WGY VQI+      +++++PA TFLNWY  A  +  AFNTR +  H CQ PFVYYL   +
Sbjct: 375 WGYTVQIYRGIFSVREMEMPARTFLNWYKRADYTGFAFNTRPVTRHVCQKPFVYYLSKAS 434

Query: 432 FDASMNRTVSRYTRYRIPEPECKWKMLDNSMIERVDVLKKADPFLWDKAPRRQCCRILGT 491
           ++  MN+TVS + ++++  P+CKWKM D S IERV+V +K DP LWDK PRR CCR+L T
Sbjct: 435 YNKVMNQTVSEHVQHQVSNPDCKWKMADPSRIERVEVYRKPDPNLWDKPPRRNCCRVLPT 494

Query: 492 EKQGEMLVEIGECEEDEII 502
           +K+G M++++G C +DE+I
Sbjct: 495 KKKGTMVIDVGVCGDDEVI 494

BLAST of Clc05G01970 vs. NCBI nr
Match: XP_016709358.2 (uncharacterized protein LOC107923692 [Gossypium hirsutum] >KAG4169850.1 hypothetical protein ERO13_A12G107800v2 [Gossypium hirsutum])

HSP 1 Score: 609.4 bits (1570), Expect = 2.8e-170
Identity = 298/499 (59.72%), Postives = 364/499 (72.95%), Query Frame = 0

Query: 12  QPERNPLTVSIKYLPKSMLYFLIAFSSIICLFYSPKFLYYSSFYCHSRGSSSSSMNIGN- 71
           Q  RN  T   K+L  S+L       SI+C+FY+  F            SS+  +NI   
Sbjct: 15  QTMRNHFTTFPKFLISSILLI-----SILCIFYTVSF----------SNSSNKDLNIITA 74

Query: 72  --------SVLDPAPESPLPKPREETNLSHVVFGIAASAKLWNQRKNYIKLWWRSNEMQG 131
                   +V  P   SP P P  +T L  +VFGIAASA+LW+ RKNYIKLWW++ +M+G
Sbjct: 75  VHGGREEVAVAPPPVPSPKPSPSSKTTLRQIVFGIAASARLWDHRKNYIKLWWKA-QMRG 134

Query: 132 IVWLDEAVEKAEDDHLLPTVRISGDTSEFVYGNPKGHRSAIRISRIVSETLRLGFSARAE 191
           +VWLD+ V+   DDHLLP   ISGDTS+F Y NPKGHRSAIRISRIVSETLRLG     +
Sbjct: 135 VVWLDKGVKPGIDDHLLPQKMISGDTSKFKYNNPKGHRSAIRISRIVSETLRLGLD---D 194

Query: 192 VRWLVMGDDDTFFAVHNLVKVLRKYDDNQFYYIGSSSESHLQNMHFSYNMAYGGGGFAIS 251
           VRW VMGDDDTFF   NLV+VL KYD NQFYYIGSSSESHLQN++FSY MAYGGGGFAIS
Sbjct: 195 VRWFVMGDDDTFFVPDNLVRVLSKYDHNQFYYIGSSSESHLQNINFSYGMAYGGGGFAIS 254

Query: 252 YPLAKELEKMQDNCLQRYPKLYGSDDRIQACMAELGVPLTREPGFHQCDVYGNLFGLLTA 311
           YPLAK L KMQD C+QRYP LYGSDDRI ACMAELGVPLT+EPGFHQ DV+GNL GLL+A
Sbjct: 255 YPLAKALAKMQDRCIQRYPGLYGSDDRIHACMAELGVPLTKEPGFHQYDVFGNLLGLLSA 314

Query: 312 HPLAPLVSLHHLDIVDPIFPVMGRFEALKKLGPAMNLDSAGLMQQSICYHKSRLWTVSIA 371
           HP+APLVS+HHLD V+PIFP M R +ALK+L   +NLDSA LMQQS+CY K+R WTVSI+
Sbjct: 315 HPIAPLVSIHHLDKVEPIFPNMNRVQALKRLNIPINLDSAALMQQSVCYDKTRSWTVSIS 374

Query: 372 WGYAVQIFLTKLFPKDVQLPATTFLNWYPNAGTSAHAFNTRSINLHACQTPFVYYLHDIT 431
           WGY VQI       +++++PA TFLNWY  A  +  AFNTR +  H CQ PFVYYL  ++
Sbjct: 375 WGYTVQINRGIFSVREMEMPARTFLNWYKRADYTGFAFNTRPVTRHVCQKPFVYYLSKVS 434

Query: 432 FDASMNRTVSRYTRYRIPEPECKWKMLDNSMIERVDVLKKADPFLWDKAPRRQCCRILGT 491
           ++  MN+TVS + ++++  P+CKWKM D S IERV+V +K DP LWDK PRR CCR+L T
Sbjct: 435 YNKVMNQTVSEHVQHQVSNPDCKWKMADPSRIERVEVYRKPDPNLWDKPPRRNCCRVLPT 494

Query: 492 EKQGEMLVEIGECEEDEII 502
           +K+G M++++G C +DE+I
Sbjct: 495 KKKGTMVIDVGVCGDDEVI 494

BLAST of Clc05G01970 vs. ExPASy Swiss-Prot
Match: Q8BHT6 (Beta-1,3-glucosyltransferase OS=Mus musculus OX=10090 GN=B3glct PE=1 SV=3)

HSP 1 Score: 53.1 bits (126), Expect = 1.0e-05
Identity = 34/108 (31.48%), Postives = 48/108 (44.44%), Query Frame = 0

Query: 182 EVRWLVMGDDDTFFAVHNLVKVLRKYDDNQFYYIGSSSESHLQNMHFSYNMAYGGGGFAI 241
           ++ WLV+ DDDT  ++  L  +L  YD +   ++G      L    +SY    GGGG   
Sbjct: 332 KISWLVIVDDDTLISISRLRHLLSCYDSSDPVFLGERYGYGLGTGGYSY--VTGGGGMVF 391

Query: 242 SYPLAKELEKMQDNCLQRYPKLYGSDDRIQACMAELGVPLTREPGFHQ 290
           S    + L      C   Y      D  +  C + LGVP+T  P FHQ
Sbjct: 392 SREAIRRLLVSSCRC---YSNDAPDDMVLGMCFSGLGVPVTHSPLFHQ 434

BLAST of Clc05G01970 vs. ExPASy TrEMBL
Match: A0A6J1CZL8 (uncharacterized protein LOC111016241 OS=Momordica charantia OX=3673 GN=LOC111016241 PE=3 SV=1)

HSP 1 Score: 752.7 bits (1942), Expect = 1.0e-213
Identity = 366/486 (75.31%), Postives = 407/486 (83.74%), Query Frame = 0

Query: 1   MKIRQKNAQDRQPERNPLTVSIKYLPKSMLYFLIAFSSIICLFYSPKFLYYSSFYCHSRG 60
           MKI +K A+DR  ++NPL  SIK LP  M Y L+  SSI+CLFYS KF+ YSS  C+SR 
Sbjct: 1   MKISEKTAEDRPQQKNPLAASIKSLPILMFYLLLLISSIVCLFYSAKFV-YSSSSCNSR- 60

Query: 61  SSSSSMNIGNSVLDPAPESPLPKPREETNLSHVVFGIAASAKLWNQRKNYIKLWWRSNEM 120
             S S  IG+SVLDP PE+P PKP +ETNLSHVVFGIAASAKLWN RKNYI LWWRS+EM
Sbjct: 61  -ISFSKEIGDSVLDPEPETPSPKPNQETNLSHVVFGIAASAKLWNHRKNYIMLWWRSSEM 120

Query: 121 QGIVWLDEAVEKAEDDHLLPTVRISGDTSEFVYGNPKGHRSAIRISRIVSETLRLGFSAR 180
           +GI+WLDE +E A+DDHLLP+ RIS DTS+F Y NP+GHRSAIR SRIVSE+LRLG   R
Sbjct: 121 RGIIWLDEPIESADDDHLLPSTRISADTSKFAYRNPEGHRSAIRTSRIVSESLRLGIFGR 180

Query: 181 AEVRWLVMGDDDTFFAVHNLVKVLRKYDDNQFYYIGSSSESHLQNMHFSYNMAYGGGGFA 240
           AEVRWLVMGDDDTFFAV NLV+VLRKYD NQFYYIG SSESHLQNMHFSY+MAYGGGGFA
Sbjct: 181 AEVRWLVMGDDDTFFAVDNLVQVLRKYDHNQFYYIGGSSESHLQNMHFSYDMAYGGGGFA 240

Query: 241 ISYPLAKELEKMQDNCLQRYPKLYGSDDRIQACMAELGVPLTREPGFHQCDVYGNLFGLL 300
           ISYPLAKELEKMQD+CLQRYP LYGSDDRIQACMAELGVPLTREPGFHQCDVYG+LFGLL
Sbjct: 241 ISYPLAKELEKMQDSCLQRYPNLYGSDDRIQACMAELGVPLTREPGFHQCDVYGDLFGLL 300

Query: 301 TAHPLAPLVSLHHLDIVDPIFPVMGRFEALKKLGPAMNLDSAGLMQQSICYHKSRLWTVS 360
           TAHPLAPLVSLHHLDIVDPIFP MGR +ALK+LG  M LD AGLMQQSICY KS  WTVS
Sbjct: 301 TAHPLAPLVSLHHLDIVDPIFPEMGRVQALKRLGSPMKLDPAGLMQQSICYDKSGAWTVS 360

Query: 361 IAWGYAVQIFLTKLFPKDVQLPATTFLNWYPNAGTSAHAFNTRSINLHACQTPFVYYLHD 420
           I+WGYAVQIF   LFP++VQLPATTFLNWY +AG+SAHAFNTR +    CQ PFVYYL  
Sbjct: 361 ISWGYAVQIFRGILFPRNVQLPATTFLNWYRSAGSSAHAFNTRPVTADVCQKPFVYYLRH 420

Query: 421 ITFDASMNRTVSRYTRYRIPEPECKWKMLDNSMIERVDVLKKADPFLWDKAPRRQCCRIL 480
           + F AS N T+S Y+ YR+ EPECKWK+LD + +E+V V K+ D  LWDKAPRRQCCRIL
Sbjct: 421 VAFGASTNMTISDYSWYRVHEPECKWKLLDRT-VEKVVVWKRPDSSLWDKAPRRQCCRIL 480

Query: 481 GTEKQG 487
             EKQG
Sbjct: 481 AMEKQG 482

BLAST of Clc05G01970 vs. ExPASy TrEMBL
Match: A0A1U8L7J9 (uncharacterized protein LOC107923692 OS=Gossypium hirsutum OX=3635 GN=LOC107923692 PE=4 SV=1)

HSP 1 Score: 611.3 bits (1575), Expect = 3.6e-171
Identity = 299/499 (59.92%), Postives = 365/499 (73.15%), Query Frame = 0

Query: 12  QPERNPLTVSIKYLPKSMLYFLIAFSSIICLFYSPKFLYYSSFYCHSRGSSSSSMNIGNS 71
           Q  RN  T   K+L  S+L       SI+C+FY+  F            SS+  +NI  +
Sbjct: 15  QTMRNHFTTFPKFLLSSILLI-----SILCIFYTVSF----------SNSSNKDLNIITA 74

Query: 72  VLD---------PAPESPLPKPREETNLSHVVFGIAASAKLWNQRKNYIKLWWRSNEMQG 131
           V D         P   SP P P  +T L  +VFGIAASA+LW+ RKNYIKLWW++ +M+G
Sbjct: 75  VHDGREEVAVAPPPVPSPKPSPSSKTTLRQIVFGIAASARLWDHRKNYIKLWWKA-QMRG 134

Query: 132 IVWLDEAVEKAEDDHLLPTVRISGDTSEFVYGNPKGHRSAIRISRIVSETLRLGFSARAE 191
           +VWLD+ V+   DDHLLP   ISGDTS+F Y NPKGHRSAIRISRIVSETLRLG     +
Sbjct: 135 VVWLDKGVKPGIDDHLLPQKMISGDTSKFKYNNPKGHRSAIRISRIVSETLRLGLD---D 194

Query: 192 VRWLVMGDDDTFFAVHNLVKVLRKYDDNQFYYIGSSSESHLQNMHFSYNMAYGGGGFAIS 251
           VRW VMGDDDTFF   NLV+VL KYD NQFYYIGSSSESHLQN++FSY MAYGGGGFAIS
Sbjct: 195 VRWFVMGDDDTFFVPDNLVRVLSKYDHNQFYYIGSSSESHLQNINFSYGMAYGGGGFAIS 254

Query: 252 YPLAKELEKMQDNCLQRYPKLYGSDDRIQACMAELGVPLTREPGFHQCDVYGNLFGLLTA 311
           YPLAK L KMQD C+QRYP LYGSDDRI ACMAELGVPLT+EPGFHQ DV+GNL GLL+A
Sbjct: 255 YPLAKALAKMQDRCIQRYPGLYGSDDRIHACMAELGVPLTKEPGFHQYDVFGNLLGLLSA 314

Query: 312 HPLAPLVSLHHLDIVDPIFPVMGRFEALKKLGPAMNLDSAGLMQQSICYHKSRLWTVSIA 371
           HP+APLVS+HHLD V+PIFP M R +ALK+L   +NLDSA LMQQS+CY K+R WTVSI+
Sbjct: 315 HPIAPLVSIHHLDKVEPIFPNMNRVQALKRLNIPINLDSAALMQQSVCYDKTRSWTVSIS 374

Query: 372 WGYAVQIFLTKLFPKDVQLPATTFLNWYPNAGTSAHAFNTRSINLHACQTPFVYYLHDIT 431
           WGY VQI       +++++PA TFLNWY  A  +  AFNTR +  H CQ PFVYYL  ++
Sbjct: 375 WGYTVQINRGIFSVREMEMPARTFLNWYKRADYTGFAFNTRPVTRHVCQKPFVYYLSKVS 434

Query: 432 FDASMNRTVSRYTRYRIPEPECKWKMLDNSMIERVDVLKKADPFLWDKAPRRQCCRILGT 491
           ++  MN+TVS + ++++  P+CKWKM D S IERV+V +K DP LWDK PRR CCR+L T
Sbjct: 435 YNKVMNQTVSEHVQHQVSNPDCKWKMADPSRIERVEVYRKPDPNLWDKPPRRNCCRVLPT 494

Query: 492 EKQGEMLVEIGECEEDEII 502
           +K+G M++++G C +DE+I
Sbjct: 495 KKKGTMVIDVGVCGDDEVI 494

BLAST of Clc05G01970 vs. ExPASy TrEMBL
Match: A0A6P4PNE6 (uncharacterized protein LOC108479398 OS=Gossypium arboreum OX=29729 GN=LOC108479398 PE=4 SV=1)

HSP 1 Score: 609.4 bits (1570), Expect = 1.4e-170
Identity = 297/499 (59.52%), Postives = 364/499 (72.95%), Query Frame = 0

Query: 12  QPERNPLTVSIKYLPKSMLYFLIAFSSIICLFYSPKFLYYSSFYCHSRGSSSSSMNIGN- 71
           Q  RN  T   K+L  S+L       SI+C+FY+  F            SS+  +NI   
Sbjct: 15  QTMRNHFTTFPKFLLSSILLI-----SILCIFYTVSF----------SNSSNKDLNIITA 74

Query: 72  --------SVLDPAPESPLPKPREETNLSHVVFGIAASAKLWNQRKNYIKLWWRSNEMQG 131
                   +V  P   SP P P  +T L  +VFGIAASA+LW+ RKNYIKLWW++ +M+G
Sbjct: 75  VHGGREEVAVAPPPVPSPKPSPSSKTTLRQIVFGIAASARLWDHRKNYIKLWWKA-QMRG 134

Query: 132 IVWLDEAVEKAEDDHLLPTVRISGDTSEFVYGNPKGHRSAIRISRIVSETLRLGFSARAE 191
           +VWLD+ V+   DDHLLP   ISGDTS+F Y NPKGHRSAIRISRIVSETLRLG     +
Sbjct: 135 VVWLDKGVKPGIDDHLLPQKMISGDTSKFKYNNPKGHRSAIRISRIVSETLRLGLD---D 194

Query: 192 VRWLVMGDDDTFFAVHNLVKVLRKYDDNQFYYIGSSSESHLQNMHFSYNMAYGGGGFAIS 251
           VRW VMGDDDTFF   NLV+VL KYD NQFYYIGSSSESHLQN++FSY MAYGGGGFAIS
Sbjct: 195 VRWFVMGDDDTFFVPDNLVRVLSKYDHNQFYYIGSSSESHLQNINFSYGMAYGGGGFAIS 254

Query: 252 YPLAKELEKMQDNCLQRYPKLYGSDDRIQACMAELGVPLTREPGFHQCDVYGNLFGLLTA 311
           YPLAK L KMQD C+QRYP LYGSDDRI ACMAELGVPLT+EPGFHQ DV+GNL GLL+A
Sbjct: 255 YPLAKALAKMQDRCIQRYPGLYGSDDRIHACMAELGVPLTKEPGFHQYDVFGNLLGLLSA 314

Query: 312 HPLAPLVSLHHLDIVDPIFPVMGRFEALKKLGPAMNLDSAGLMQQSICYHKSRLWTVSIA 371
           HP+APLVS+HHLD V+PIFP M R +ALK+L   +NLDSA LMQQS+CY K+R WTVS++
Sbjct: 315 HPVAPLVSIHHLDKVEPIFPNMNRVQALKRLNIPINLDSAALMQQSVCYDKTRSWTVSVS 374

Query: 372 WGYAVQIFLTKLFPKDVQLPATTFLNWYPNAGTSAHAFNTRSINLHACQTPFVYYLHDIT 431
           WGY VQI+      +++++PA TFLNWY  A  +  AFNTR +  H CQ PFVYYL   +
Sbjct: 375 WGYTVQIYRGIFSVREMEMPARTFLNWYKRADYTGFAFNTRPVTRHVCQKPFVYYLSKAS 434

Query: 432 FDASMNRTVSRYTRYRIPEPECKWKMLDNSMIERVDVLKKADPFLWDKAPRRQCCRILGT 491
           ++  MN+TVS + ++++  P+CKWKM D S IERV+V +K DP LWDK PRR CCR+L T
Sbjct: 435 YNKVMNQTVSEHVQHQVSNPDCKWKMADPSRIERVEVYRKPDPNLWDKPPRRNCCRVLPT 494

Query: 492 EKQGEMLVEIGECEEDEII 502
           +K+G M++++G C +DE+I
Sbjct: 495 KKKGTMVIDVGVCGDDEVI 494

BLAST of Clc05G01970 vs. ExPASy TrEMBL
Match: A0A061DUY9 (Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_005286 PE=4 SV=1)

HSP 1 Score: 609.0 bits (1569), Expect = 1.8e-170
Identity = 288/468 (61.54%), Postives = 355/468 (75.85%), Query Frame = 0

Query: 34  IAFSSIICLFYSPKFLYYSSFYCHSRGSSSSSMNIGNSVLDPAPESPLPKPREETNLSHV 93
           I   SI+C+FY+  F   S+          +   I   V+ P   SP PKP+E+T L H+
Sbjct: 35  ILLISILCIFYTLSFSNVSNSSNQELKIIKTIHGIDQDVIPPV-SSPKPKPQEKTGLQHI 94

Query: 94  VFGIAASAKLWNQRKNYIKLWWRSNEMQGIVWLDEAVEKAEDDHLLPTVRISGDTSEFVY 153
           VFGIAASA+LW+ RKNYIKLWW+  EM+GIVWLD+AVE  +DDHLLP ++IS DT EF Y
Sbjct: 95  VFGIAASARLWDHRKNYIKLWWKPQEMRGIVWLDKAVENGDDDHLLPPIKISRDTPEFKY 154

Query: 154 GNPKGHRSAIRISRIVSETLRLGFSARAEVRWLVMGDDDTFFAVHNLVKVLRKYDDNQFY 213
            NPKGHR AIRISRIVSETLRLG      VRW VMGDDDTFF   NLV+VL KYD NQFY
Sbjct: 155 RNPKGHRYAIRISRIVSETLRLGLEG---VRWFVMGDDDTFFVPDNLVRVLSKYDHNQFY 214

Query: 214 YIGSSSESHLQNMHFSYNMAYGGGGFAISYPLAKELEKMQDNCLQRYPKLYGSDDRIQAC 273
           YIGSSSESHLQN++FSY MAYGGGGFAISYPLAK LEKMQD C+QRYP+LYGSDDRI AC
Sbjct: 215 YIGSSSESHLQNINFSYGMAYGGGGFAISYPLAKALEKMQDRCIQRYPRLYGSDDRIHAC 274

Query: 274 MAELGVPLTREPGFHQCDVYGNLFGLLTAHPLAPLVSLHHLDIVDPIFPVMGRFEALKKL 333
           MAELGV LT+E GFHQ DVYG+L GLL AHP+APLVS+HHLD+V PIFP + R +AL++L
Sbjct: 275 MAELGVALTKERGFHQYDVYGSLLGLLGAHPVAPLVSIHHLDVVQPIFPNVNRVQALQRL 334

Query: 334 GPAMNLDSAGLMQQSICYHKSRLWTVSIAWGYAVQIFLTKLFPKDVQLPATTFLNWYPNA 393
              +NLDSA +MQQS+CY K+R WT+S++WGYAVQI+      +++++PA TFLNWY  A
Sbjct: 335 KVPINLDSAAVMQQSVCYDKTRSWTISVSWGYAVQIYRGIFSVREMEMPARTFLNWYRRA 394

Query: 394 GTSAHAFNTRSINLHACQTPFVYYLHDITFDASMNRTVSRYTRYRIPEPECKWKMLDNSM 453
             +  +FNTR  + + CQ PFVYYL +   + + N+T S Y ++++   ECKW+M D S 
Sbjct: 395 DYTGFSFNTRPFSRNVCQKPFVYYLSNALHNKNTNQTASEYVQHQVSSSECKWRMADPSR 454

Query: 454 IERVDVLKKADPFLWDKAPRRQCCRILGTEKQGEMLVEIGECEEDEII 502
           IERV+V KK DP LWDK+PRR CCR+L T+K+G M++++GEC EDE+I
Sbjct: 455 IERVEVYKKPDPNLWDKSPRRNCCRVLPTKKKGTMVIDVGECGEDEVI 498

BLAST of Clc05G01970 vs. ExPASy TrEMBL
Match: A0A5D2MWI9 (Uncharacterized protein OS=Gossypium tomentosum OX=34277 GN=ES332_A12G125900v1 PE=4 SV=1)

HSP 1 Score: 608.6 bits (1568), Expect = 2.3e-170
Identity = 297/499 (59.52%), Postives = 364/499 (72.95%), Query Frame = 0

Query: 12  QPERNPLTVSIKYLPKSMLYFLIAFSSIICLFYSPKFLYYSSFYCHSRGSSSSSMNIGN- 71
           Q  RN  T   K+L  S+L       SI+C+FY+  F            SS+  +NI   
Sbjct: 15  QTMRNHFTTFPKFLLSSILLI-----SILCIFYTVSF----------SNSSNKDLNIITA 74

Query: 72  --------SVLDPAPESPLPKPREETNLSHVVFGIAASAKLWNQRKNYIKLWWRSNEMQG 131
                   +V  P   SP P P  +T L  +VFGIAASA+LW+ RKNYIKLWW++ +M+G
Sbjct: 75  VHGGREEVAVAPPPVPSPKPSPSSKTTLRQIVFGIAASARLWDHRKNYIKLWWKA-QMRG 134

Query: 132 IVWLDEAVEKAEDDHLLPTVRISGDTSEFVYGNPKGHRSAIRISRIVSETLRLGFSARAE 191
           +VWLD+ V+   DDHLLP   ISGDTS+F Y NPKGHRSAIRISRIVSETLRLG     +
Sbjct: 135 VVWLDKGVKPGIDDHLLPQKMISGDTSKFKYNNPKGHRSAIRISRIVSETLRLGLD---D 194

Query: 192 VRWLVMGDDDTFFAVHNLVKVLRKYDDNQFYYIGSSSESHLQNMHFSYNMAYGGGGFAIS 251
           VRW VMGDDDTFF   NLV+VL KYD NQFYYIGSSSESHLQN++FSY MAYGGGGFAIS
Sbjct: 195 VRWFVMGDDDTFFVPDNLVRVLSKYDHNQFYYIGSSSESHLQNINFSYGMAYGGGGFAIS 254

Query: 252 YPLAKELEKMQDNCLQRYPKLYGSDDRIQACMAELGVPLTREPGFHQCDVYGNLFGLLTA 311
           YPLAK L KMQD C+QRYP LYGSDDRI ACMAELGVPLT+EPGFHQ DV+GNL GLL+A
Sbjct: 255 YPLAKALAKMQDRCIQRYPGLYGSDDRIHACMAELGVPLTKEPGFHQYDVFGNLLGLLSA 314

Query: 312 HPLAPLVSLHHLDIVDPIFPVMGRFEALKKLGPAMNLDSAGLMQQSICYHKSRLWTVSIA 371
           HP+APLVS+HHLD V+PIFP M R +ALK+L   +NLDSA LMQQS+CY K+R WTVS++
Sbjct: 315 HPVAPLVSIHHLDKVEPIFPNMNRVQALKRLNIPINLDSAALMQQSVCYDKTRSWTVSVS 374

Query: 372 WGYAVQIFLTKLFPKDVQLPATTFLNWYPNAGTSAHAFNTRSINLHACQTPFVYYLHDIT 431
           WGY VQI       +++++PA TFLNWY  A  +  AFNTR +  H CQ PFVYYL  ++
Sbjct: 375 WGYTVQINRGIFSVREMEMPARTFLNWYKRADYTGFAFNTRPVTRHVCQKPFVYYLSKVS 434

Query: 432 FDASMNRTVSRYTRYRIPEPECKWKMLDNSMIERVDVLKKADPFLWDKAPRRQCCRILGT 491
           ++  MN+TVS + ++++  P+CKWKM D S IERV+V +K DP LWDK PRR CCR+L T
Sbjct: 435 YNKVMNQTVSEHVQHQVSNPDCKWKMADPSRIERVEVYRKPDPNLWDKPPRRNCCRVLPT 494

Query: 492 EKQGEMLVEIGECEEDEII 502
           +K+G M++++G C +DE+I
Sbjct: 495 KKKGTMVIDVGVCGDDEVI 494

BLAST of Clc05G01970 vs. TAIR 10
Match: AT5G41460.1 (Protein of unknown function (DUF604) )

HSP 1 Score: 554.3 bits (1427), Expect = 1.0e-157
Identity = 262/433 (60.51%), Postives = 322/433 (74.36%), Query Frame = 0

Query: 71  SVLDPAPESPLPKPREETNLSHVVFGIAASAKLWNQRKNYIKLWWRSNEMQGIVWLDEAV 130
           S   P P  P P P  +T   HVVFGIAASA+LW QRK YIK+W++ N+M+  VWL++ V
Sbjct: 93  SYASPPPSPPPPPPPPQTGFQHVVFGIAASARLWKQRKEYIKIWYKPNQMRSYVWLEKPV 152

Query: 131 --EKAEDDHLLPTVRISGDTSEFVYGNPKGHRSAIRISRIVSETLRLGFSARAEVRWLVM 190
             E  ED+  LP V+ISGDTS+F Y N +GHRSAIRISRIV+ETL+LG     +VRW VM
Sbjct: 153 TEEDEEDEISLPPVKISGDTSKFPYKNKQGHRSAIRISRIVTETLKLGLK---DVRWFVM 212

Query: 191 GDDDTFFAVHNLVKVLRKYDDNQFYYIGSSSESHLQNMHFSYNMAYGGGGFAISYPLAKE 250
           GDDDT F   NL++VLRKYD NQ YYIGS SESHLQN++FSY MAYGGGGFAISYPLA  
Sbjct: 213 GDDDTVFVAENLIRVLRKYDHNQMYYIGSLSESHLQNIYFSYGMAYGGGGFAISYPLAVA 272

Query: 251 LEKMQDNCLQRYPKLYGSDDRIQACMAELGVPLTREPGFHQCDVYGNLFGLLTAHPLAPL 310
           L KMQD C++RYP LYGSDDR+QACMAELGVPLT+E GFHQ DVYGNLFGLL AHP+APL
Sbjct: 273 LSKMQDRCIKRYPALYGSDDRMQACMAELGVPLTKELGFHQYDVYGNLFGLLAAHPVAPL 332

Query: 311 VSLHHLDIVDPIFPVMGRFEALKKLGPAMNLDSAGLMQQSICYHKSRLWTVSIAWGYAVQ 370
           V+LHHLD+V+PIFP M R +ALK L     LDSAGLMQQSICY K R WTVS++WG+AVQ
Sbjct: 333 VTLHHLDVVEPIFPNMTRVDALKHLQVPAKLDSAGLMQQSICYDKRRKWTVSVSWGFAVQ 392

Query: 371 IFLTKLFPKDVQLPATTFLNWYPNAGTSAHAFNTRSINLHACQTPFVYYLHDITFDASMN 430
           IF      +++++P+ TFLNWY  A  +A+AFNTR ++ H CQ PFV+Y+         N
Sbjct: 393 IFRGIFSAREIEMPSRTFLNWYRRADYTAYAFNTRPVSRHPCQKPFVFYMTSTRVHRVTN 452

Query: 431 RTVSRYTRYRIPEPECKWKMLDNSMIERVDVLKKADPFLWDKAPRRQCCRILGTEKQGEM 490
            TVSRY  +R+  PEC+WKM + S I+ V V KK DP LWD++PRR CCR+  ++K   +
Sbjct: 453 MTVSRYEIHRVAHPECRWKMANPSDIKTVIVYKKPDPHLWDRSPRRNCCRV-KSKKNNTL 512

Query: 491 LVEIGECEEDEII 502
            + +  C+E E++
Sbjct: 513 EISVAVCKEGEVV 521

BLAST of Clc05G01970 vs. TAIR 10
Match: AT4G11350.1 (Protein of unknown function (DUF604) )

HSP 1 Score: 547.4 bits (1409), Expect = 1.2e-155
Identity = 269/505 (53.27%), Postives = 353/505 (69.90%), Query Frame = 0

Query: 1   MKIRQKNAQDRQP-ERNPLTVSIKYLPKSMLYFLIAFSSIICLFYSPKFLYYSSFYCHSR 60
           MK  QK++ ++   +R+   +S+   P  ++ +LI F S+  + Y+ K +  ++  C   
Sbjct: 1   MKGNQKDSSEKPIWDRSSSGISMT-RPGRLIIWLILFISVTYIIYTLK-IVSTTHPCEDL 60

Query: 61  GSSSSSMNIGNSVLDPAPESPLPKPREETNLSHVVFGIAASAKLWNQRKNYIKLWWRSNE 120
            S S                 +P  +E T+L+HVVFGIAAS+KLW QRK YIK+W++  +
Sbjct: 61  TSESILQQRPEKKAVTVTVKAVPAEQEATDLNHVVFGIAASSKLWKQRKEYIKIWYKPKK 120

Query: 121 MQGIVWLDEAVE-KAE--DDHLLPTVRISGDTSEFVYGNPKGHRSAIRISRIVSETL-RL 180
           M+G VWLDE V+ K+E  D   LP+VRISGDTS F Y N +GHRSAIRISRIVSETL  L
Sbjct: 121 MRGYVWLDEEVKIKSETGDQESLPSVRISGDTSSFPYTNKQGHRSAIRISRIVSETLMSL 180

Query: 181 GFSARAEVRWLVMGDDDTFFAVHNLVKVLRKYDDNQFYYIGSSSESHLQNMHFSYNMAYG 240
              ++  VRW VMGDDDT F   NL++VLRKYD  Q YYIGS SESHLQN+ FSY MAYG
Sbjct: 181 DSESKKNVRWFVMGDDDTVFVTDNLIRVLRKYDHEQMYYIGSLSESHLQNIIFSYGMAYG 240

Query: 241 GGGFAISYPLAKELEKMQDNCLQRYPKLYGSDDRIQACMAELGVPLTREPGFHQCDVYGN 300
           GGGFAISYPLA  L KMQD C+QRYP LYGSDDR+QACMAELGVPLT+E GFHQ DV+GN
Sbjct: 241 GGGFAISYPLAVALSKMQDQCIQRYPALYGSDDRMQACMAELGVPLTKEIGFHQYDVHGN 300

Query: 301 LFGLLTAHPLAPLVSLHHLDIVDPIFPVMGRFEALKKLGPAMNLDSAGLMQQSICYHKSR 360
           LFGLL AHP+ P VS+HHLD+V+PIFP M R  A+KKL   M +DSA L+QQSICY K +
Sbjct: 301 LFGLLAAHPITPFVSMHHLDVVEPIFPNMTRVRAIKKLTTPMKIDSAALLQQSICYDKHK 360

Query: 361 LWTVSIAWGYAVQIFLTKLFPKDVQLPATTFLNWYPNAGTSAHAFNTRSINLHACQTPFV 420
            WT+S++WG+AVQ+F     P+++++P+ TFLNWY  A  +A+AFNTR ++ + CQ PFV
Sbjct: 361 SWTISVSWGFAVQVFRGSFSPREMEMPSRTFLNWYKRADYTAYAFNTRPVSRNHCQKPFV 420

Query: 421 YYLHDITFDASMNRTVSRYTRYRIPEPECKWKMLDNSMIERVDVLKKADPFLWDKAPRRQ 480
           +++    FD  +N TVS YTR+R+P+P C+W M +   I  + V KK DP LW+++PRR 
Sbjct: 421 FHMSSAKFDPQLNTTVSEYTRHRVPQPACRWDMANPEEINTIVVYKKPDPHLWNRSPRRN 480

Query: 481 CCRILGTEKQGEMLVEIGECEEDEI 501
           CCR+L T++   + + +G C   E+
Sbjct: 481 CCRVLQTKRNNTLWINVGVCRAGEV 503

BLAST of Clc05G01970 vs. TAIR 10
Match: AT4G23490.1 (Protein of unknown function (DUF604) )

HSP 1 Score: 545.4 bits (1404), Expect = 4.7e-155
Identity = 271/493 (54.97%), Postives = 343/493 (69.57%), Query Frame = 0

Query: 26  PKSMLYFLIAFSSIICLFYSPKFLYYS-------SFYCHSRGSSSSSMNIGNSVLDPAPE 85
           PK M++ LI F     + Y  K +  S       SF   S  S++ S N+ +     A  
Sbjct: 34  PKLMVW-LICFIVFTYIIYMLKLVSTSRSCDDSTSFTTVSALSTNVSSNVSSLSTSLASR 93

Query: 86  SPLPKPREE-------TNLSHVVFGIAASAKLWNQRKNYIKLWWRSNEMQGIVWLDEAVE 145
               +  EE       T+L+HVVFGIAAS+KLW QRK YIK+W++   M+G VWLD+ V+
Sbjct: 94  RRNWEEEEEDTVVDKLTDLNHVVFGIAASSKLWKQRKEYIKIWYKPKRMRGYVWLDKEVK 153

Query: 146 KA----EDDHLLPTVRISGDTSEFVYGNPKGHRSAIRISRIVSETLRLGFSARAEVRWLV 205
           K+    +D+ LLP V+ISG T+ F Y N +G RSA+RISRIVSETLRLG      VRW V
Sbjct: 154 KSLSDDDDEKLLPPVKISGGTASFPYTNKQGQRSALRISRIVSETLRLG---PKNVRWFV 213

Query: 206 MGDDDTFFAVHNLVKVLRKYDDNQFYYIGSSSESHLQNMHFSYNMAYGGGGFAISYPLAK 265
           MGDDDT F + NL++VLRKYD  Q YYIGS SESHLQN+ FSY MAYGGGGFAISYPLAK
Sbjct: 214 MGDDDTVFVIDNLIRVLRKYDHEQMYYIGSLSESHLQNIFFSYGMAYGGGGFAISYPLAK 273

Query: 266 ELEKMQDNCLQRYPKLYGSDDRIQACMAELGVPLTREPGFHQCDVYGNLFGLLTAHPLAP 325
            L KMQD C+QRYP LYGSDDR+QACMAELGVPLT+E GFHQ DVYGNLFGLL AHP+ P
Sbjct: 274 ALSKMQDRCIQRYPALYGSDDRMQACMAELGVPLTKELGFHQYDVYGNLFGLLAAHPVTP 333

Query: 326 LVSLHHLDIVDPIFPVMGRFEALKKLGPAMNLDSAGLMQQSICYHKSRLWTVSIAWGYAV 385
            VS+HHLD+V+PIFP M R  ALKK+   M LDSAGL+QQSICY K + WT+S++WGYAV
Sbjct: 334 FVSMHHLDVVEPIFPNMTRVRALKKITEPMKLDSAGLLQQSICYDKHKSWTISVSWGYAV 393

Query: 386 QIFLTKLFPKDVQLPATTFLNWYPNAGTSAHAFNTRSINLHACQTPFVYYLHDITFDASM 445
           QIF     P+++++P+ TFLNWY  A  +A+AFNTR ++ + CQ PFV+Y+    FD  +
Sbjct: 394 QIFRGIFSPREMEMPSRTFLNWYKRADYTAYAFNTRPVSRNPCQKPFVFYMSSTKFDQQL 453

Query: 446 NRTVSRYTRYRIPEPECKWKMLDNSMIERVDVLKKADPFLWDKAPRRQCCRILGTEKQGE 501
           N TVS YT +R+  P C+WKM + + I  + V KK DP LW+++PRR CCR+L T++   
Sbjct: 454 NTTVSEYTIHRVSHPSCRWKMTNPAEINTIVVYKKPDPHLWERSPRRNCCRVLQTKRNNT 513

BLAST of Clc05G01970 vs. TAIR 10
Match: AT1G01570.1 (Protein of unknown function (DUF604) )

HSP 1 Score: 528.5 bits (1360), Expect = 5.9e-150
Identity = 268/468 (57.26%), Postives = 326/468 (69.66%), Query Frame = 0

Query: 39  IICLFYSPKFLYYSSFYCHSRGSSSSSMNIGNSVLDPAPESPLPKPREETNLSHVVFGIA 98
           I+ + +S +F++Y   +     SSSS   I  SV      S      ++T L HVVFGIA
Sbjct: 8   ILAILFSLQFVFYPLNFI----SSSSQPLIKFSVSPVVSGSGSVHEPDQTELKHVVFGIA 67

Query: 99  ASAKLWNQRKNYIKLWWRSN-EMQGIVWLDEAVEKAED-DHLLPTVRISGDTSEFVYGNP 158
           ASAK W  RK+Y+KLWW+ N EM G+VWLD+ + + ++    LP +RIS DTS F Y  P
Sbjct: 68  ASAKFWKHRKDYVKLWWKPNGEMNGVVWLDQHINQNDNVSKTLPPIRISSDTSRFQYRYP 127

Query: 159 KGHRSAIRISRIVSETLRL--GFSARAEVRWLVMGDDDTFFAVHNLVKVLRKYDDNQFYY 218
           KG RSAIRI+RIVSET+RL  G      VRW+VMGDDDT F   NLVKVLRKYD NQFYY
Sbjct: 128 KGLRSAIRITRIVSETVRLLNGTELEKNVRWIVMGDDDTVFFPENLVKVLRKYDHNQFYY 187

Query: 219 IGSSSESHLQNMHFSYNMAYGGGGFAISYPLAKELEKMQDNCLQRYPKLYGSDDRIQACM 278
           IGSSSESH+QN+ FSY MAYGGGGFAISYPLAK LEKMQD C+QRY +LYGSDDRI ACM
Sbjct: 188 IGSSSESHIQNLKFSYGMAYGGGGFAISYPLAKALEKMQDRCIQRYSELYGSDDRIHACM 247

Query: 279 AELGVPLTREPGFHQCDVYGNLFGLLTAHPLAPLVSLHHLDIVDPIFPVMGRFEALKKLG 338
           +ELGVPLT+E GFHQ D+YG L GLL+AHPLAPLVS+HHLD+VDP+FP MGR  A+++  
Sbjct: 248 SELGVPLTKEVGFHQIDLYGKLLGLLSAHPLAPLVSIHHLDLVDPVFPNMGRVNAMRRFM 307

Query: 339 PAMNLDSAGLMQQSICYHKSRLWTVSIAWGYAVQIFLTKLFPKDVQLPATTFLNWYPNAG 398
               LDS  L QQSICY     WTVS++WGY VQI    L  +++ +P  TF++WY  A 
Sbjct: 308 VPAKLDSPSLAQQSICYDADHRWTVSVSWGYTVQIIRGVLSAREMVIPTRTFIDWYKQAD 367

Query: 399 TSAHAFNTRSINLHACQTPFVYYLHDITFDASMNRTVSRYTR-YRIPEPECKWKMLDNSM 458
             ++AFNTR I   ACQ P VYYL +   D ++ RT S Y R Y + EPEC W M D S 
Sbjct: 368 ERSYAFNTRPIAKSACQRPRVYYLSNALPDLALRRTASEYVRWYDMWEPECDWDMSDPSE 427

Query: 459 IERVDVLKKADPFLWDK--APRRQCCRILGTEKQGEMLVEIGECEEDE 500
            ERV V KK DP  W+K  APRR CCR+L T K G M++++G C++DE
Sbjct: 428 FERVIVYKKPDPDRWNKHRAPRRDCCRVLPTTKNGTMVIDVGACKDDE 471

BLAST of Clc05G01970 vs. TAIR 10
Match: AT1G07850.1 (Protein of unknown function (DUF604) )

HSP 1 Score: 492.7 bits (1267), Expect = 3.6e-139
Identity = 239/460 (51.96%), Postives = 310/460 (67.39%), Query Frame = 0

Query: 46  PKFLYYSSFYCHSRGSSSSSMNIGNSVLDPAPESPLPKPRE----ETNLSHVVFGIAASA 105
           P    YS      + +SSSS+       D  P  P   P+      T L H+VFGIAAS+
Sbjct: 77  PSSSTYSHLSLVDKANSSSSVVPEEEDDDVPPRVPALYPQRPRMFNTTLDHIVFGIAASS 136

Query: 106 KLWNQRKNYIKLWWRSNEMQGIVWLDEAVEKAEDDHLLPTVRISGDTSEFVYGNPKGHRS 165
            LW  RK YIK WWR  + +G+VW+D+ V    +D  LP +RIS DTS F Y +P G RS
Sbjct: 137 VLWETRKEYIKSWWRPGKTRGVVWIDKRVRTYRNDP-LPEIRISQDTSRFRYTHPVGDRS 196

Query: 166 AIRISRIVSETLRLGFSARAEVRWLVMGDDDTFFAVHNLVKVLRKYDDNQFYYIGSSSES 225
           A+RISR+V+ETLRLG   +  VRW VMGDDDT F V N+V VL KYD  QFYY+GSSSE+
Sbjct: 197 AVRISRVVTETLRLG---KKGVRWFVMGDDDTVFVVDNVVNVLSKYDHTQFYYVGSSSEA 256

Query: 226 HLQNMHFSYNMAYGGGGFAISYPLAKELEKMQDNCLQRYPKLYGSDDRIQACMAELGVPL 285
           H+QN+ FSY+MA+GGGGFAISY LA EL +MQD C+QRYP LYGSDDRIQACM ELGVPL
Sbjct: 257 HVQNIFFSYSMAFGGGGFAISYALALELLRMQDRCIQRYPGLYGSDDRIQACMTELGVPL 316

Query: 286 TREPGFHQCDVYGNLFGLLTAHPLAPLVSLHHLDIVDPIFPVMGRFEALKKLGPAMNLDS 345
           T+EPGFHQ DVYG+L GLL AHP+APLVSLHH+D+V PIFP M R  AL+ L  +  LD 
Sbjct: 317 TKEPGFHQYDVYGDLLGLLGAHPVAPLVSLHHIDVVQPIFPKMKRSRALRHLMSSAVLDP 376

Query: 346 AGLMQQSICYHKSRLWTVSIAWGYAVQIFLTKLFPKDVQLPATTFLNWYPNAGTSAHAFN 405
           A + QQSICY ++R W++S++WG+ VQI    + P+++++P+ TFLNW+  A    +AFN
Sbjct: 377 ASIFQQSICYDQNRFWSISVSWGFVVQIIRGIISPRELEMPSRTFLNWFRKADYIGYAFN 436

Query: 406 TRSINLHACQTPFVYYLHDITFDASMNRTVSRYTRYRIPE-PECKWKMLDNSMIERVDVL 465
           TR ++ H CQ PFV+YL+   +D    + +  Y   +    P C+W++     I+ V VL
Sbjct: 437 TRPVSRHPCQRPFVFYLNSAKYDEGRRQVIGYYNLDKTRRIPGCRWRLDSPGKIDSVVVL 496

Query: 466 KKADPFLWDKAPRRQCCRILGTEKQGEMLVEIGECEEDEI 501
           K+ DP  W K+PRR CCR+L + +   M + +G C + EI
Sbjct: 497 KRPDPLRWHKSPRRDCCRVLPSRRNQTMYIWVGNCADGEI 532

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038876379.11.2e-24888.42uncharacterized protein LOC120068817 [Benincasa hispida][more]
XP_022147245.12.1e-21375.31uncharacterized protein LOC111016241 [Momordica charantia][more]
XP_007051739.27.5e-17161.75PREDICTED: uncharacterized protein LOC18614099 isoform X1 [Theobroma cacao][more]
XP_017637457.12.8e-17059.52PREDICTED: uncharacterized protein LOC108479398 [Gossypium arboreum][more]
XP_016709358.22.8e-17059.72uncharacterized protein LOC107923692 [Gossypium hirsutum] >KAG4169850.1 hypothet... [more]
Match NameE-valueIdentityDescription
Q8BHT61.0e-0531.48Beta-1,3-glucosyltransferase OS=Mus musculus OX=10090 GN=B3glct PE=1 SV=3[more]
Match NameE-valueIdentityDescription
A0A6J1CZL81.0e-21375.31uncharacterized protein LOC111016241 OS=Momordica charantia OX=3673 GN=LOC111016... [more]
A0A1U8L7J93.6e-17159.92uncharacterized protein LOC107923692 OS=Gossypium hirsutum OX=3635 GN=LOC1079236... [more]
A0A6P4PNE61.4e-17059.52uncharacterized protein LOC108479398 OS=Gossypium arboreum OX=29729 GN=LOC108479... [more]
A0A061DUY91.8e-17061.54Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_005286 PE=4 SV=1[more]
A0A5D2MWI92.3e-17059.52Uncharacterized protein OS=Gossypium tomentosum OX=34277 GN=ES332_A12G125900v1 P... [more]
Match NameE-valueIdentityDescription
AT5G41460.11.0e-15760.51Protein of unknown function (DUF604) [more]
AT4G11350.11.2e-15553.27Protein of unknown function (DUF604) [more]
AT4G23490.14.7e-15554.97Protein of unknown function (DUF604) [more]
AT1G01570.15.9e-15057.26Protein of unknown function (DUF604) [more]
AT1G07850.13.6e-13951.96Protein of unknown function (DUF604) [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006740Protein of unknown function DUF604PFAMPF04646DUF604coord: 224..477
e-value: 8.6E-110
score: 366.1
NoneNo IPR availableGENE3D3.90.550.50coord: 80..324
e-value: 4.6E-45
score: 156.0
NoneNo IPR availablePANTHERPTHR10811FRINGE-RELATEDcoord: 26..501
NoneNo IPR availablePANTHERPTHR10811:SF81TRANSFERRING GLYCOSYL GROUP TRANSFERASEcoord: 26..501

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Clc05G01970.1Clc05G01970.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0010951 negative regulation of endopeptidase activity
biological_process GO:0009611 response to wounding
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0008375 acetylglucosaminyltransferase activity
molecular_function GO:0004867 serine-type endopeptidase inhibitor activity