Cla019665.1 (mRNA) Watermelon (97103) v1

Name: Cla019665
Type: mRNA
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Unknown Protein (AHRD V1); contains Interpro domain(s) IPR007789 Protein of unknown function DUF688
Location: Chr3 : 8200589 .. 8202780 (-)
Sequence length: 2028
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGGAAAGAAAACTCAATTTTAATGCACCGCTCATGTCTGTGAGGAGATTTTCCAAGGCAGCTAGTTCTTTAGCTAAAGTGAATGAAAAGAAAGCTGAAAATTCCCATTTTAGTAGACGGAGTACCTTCCTAATTTCTAGACCACAATTCAATTTAGATCAAGTAACAGAACCAGTGGCAGTCCCCTTCCACTGGGAGCAGATTCCAGGAAGAGCTAAGAATGATAGTGGCTCAGCCTCACCTGAGGTTCAGCTGCCTCAGCTTCCTGAGAGGACTTGTTCTACTCCAAGGATTTCCTTTGGGAGGGCTTTGGATGTTAACAGATATAGTTTGGAAATGGAAACTTGTCATCAAAATGGGTGTGAAGCATCTTCTTCAAATGCCATTGTTGTTAGATTAGAGTCAACAAAAGCTAGCGATGTGAGGAGCTTGGCATCTGAGAATGATGAAGATGATGATGATTTCTCTGATGCACGTGAGACATTGTCCCTCACTGGTTCATTCTCTGTTAACAACTGTAGTGTTAGTGGTATAAGTGGATGCAATGGTCCCATGGTAAAACCATCAGGAACTTTTCGAACGGATCCTCAAACTCGAGATTTCATGATGAGCCGCTTCTTGCCTGCTGCCAAGGCAATGGTTTTGGAGCCTGCTAAGTATTCCTTAAAGAAGAAACTTGTAGCAGTTGAGCAACCTAGACAAGTTAAGAAGGTGATGTCTGAGAATAGGAGGATGTCTCCAATTAAACGACTTGAGTCTACCCTGTTACTACAGTATGGCAAAGATGAAGTGCATGGAGTAGATGAAGTAGATGAAGAAAGTGACTCTGTGGATGATGAATATGACAATTCAGGTAATATATCAGCTAGAGGTTGTGGTCTAATACCCAATATATGCTTCAAAAACTCTTTGGGCCTTCTTAATCCTGTGCCTGGGCTGAGAATCAGGACCGAGGGACCTATGTCTGTCACTAATAAAGTTGGAGGATCTAGCAGAACAATGCACCATTCACACAGCCAAAAGACCAACAAGGTTACTGTTTAATTTATTTTTTTCCTTCCCATTCACTTATTATGTATGCAAGCATAATTGCTGATAAATTTTACATCTTACAGCATGCTTGGGATGCTGCCTACAAGCAAAAATCAGAAGCTGCTGTTGGATCTCCTAAGCTGCCCGAGGTGAAAGATAAGTGGATGGGTGAATCCAAACATTTTCCTTCCTCCACCGACCCACAAATGAGAGGTAGGTCTTCTCCCTTCAGGCATTCGAGGGCCGCTTCTCCCTTCCGGAATGAAGCATCACAGTCTCCTTACAGAAAGCAGTCACTTGTAGTTCCTAAAGAAGTCGACATTATCTCCAAATCCAAAGAGGATACTGATTTTCATGATACGTCATCCATTCAAGCAACTAAACACGGGGTTGACATGGCAACTACCCTGATAGAGAAGACACTCTATATAGATACTGCAAGTGTTGCTGAAGCGAATTCTCCATTTAACTCAACCCATTTGGATGGCGAGAAGAAATCTGATCGCCCTAGTGGGAAGAATGAGACTGCATTTGAGACGAGAGTGATGGAAGAAAGTACCATTGTAGAACCTTCTTTCCTAGAAATAAAATGCTTAACTCTGGTTGAAGAAGGGAGGCTGGAGCGTGAAGCTGCAGAATCTAAAAGCCAATATGCTATTGATGATGGCCCCAATGTGGGAACTGGGCTTTATAAAGAAGATCATACTGGATACGCTAATTTGGGCACTGCTGATGAAGAAGATTACTCCAAGGCCAATTATCAGCTAGTCAAAGTAGAAGATCCAGCAAGTGCCATAGTAACTTCTGTGATATCTTCTCAACCTCCGCCACTACCAAAGTCTCCTTCCGAGTCTTGGCTCTGGCGTACCCTGCCTTCAGTTTCCTCTAAAAAGTTACTAGCAGGATCAAATCTTGGAAACAAGTTTTATCAAAAGCCACAGAGCCCTAGAACATCAGTTAGTACC
AAGTGGGAAACCATTGTAAAATCTTCGAATTTGCGTCACGATCATGTTCGTTACTCCGAGGTAATGTCATCAACTCTCAAACATACTATATACTGGCTTCGGAAAAGGGATTAACCTGTTTCTTTTCTGGTTTGTTTGCAGGAATTAATTCCTCGTGTTTCTCAGCACTCAACAACAGAAAATTTCAAGTAG

mRNA sequence

ATGGAGGAAAGAAAACTCAATTTTAATGCACCGCTCATGTCTGTGAGGAGATTTTCCAAGGCAGCTAGTTCTTTAGCTAAAGTGAATGAAAAGAAAGCTGAAAATTCCCATTTTAGTAGACGGAGTACCTTCCTAATTTCTAGACCACAATTCAATTTAGATCAAGTAACAGAACCAGTGGCAGTCCCCTTCCACTGGGAGCAGATTCCAGGAAGAGCTAAGAATGATAGTGGCTCAGCCTCACCTGAGGTTCAGCTGCCTCAGCTTCCTGAGAGGACTTGTTCTACTCCAAGGATTTCCTTTGGGAGGGCTTTGGATGTTAACAGATATAGTTTGGAAATGGAAACTTGTCATCAAAATGGGTGTGAAGCATCTTCTTCAAATGCCATTGTTGTTAGATTAGAGTCAACAAAAGCTAGCGATGTGAGGAGCTTGGCATCTGAGAATGATGAAGATGATGATGATTTCTCTGATGCACGTGAGACATTGTCCCTCACTGGTTCATTCTCTGTTAACAACTGTAGTGTTAGTGGTATAAGTGGATGCAATGGTCCCATGGTAAAACCATCAGGAACTTTTCGAACGGATCCTCAAACTCGAGATTTCATGATGAGCCGCTTCTTGCCTGCTGCCAAGGCAATGGTTTTGGAGCCTGCTAAGTATTCCTTAAAGAAGAAACTTGTAGCAGTTGAGCAACCTAGACAAGTTAAGAAGGTGATGTCTGAGAATAGGAGGATGTCTCCAATTAAACGACTTGAGTCTACCCTGTTACTACAGTATGGCAAAGATGAAGTGCATGGAGTAGATGAAGTAGATGAAGAAAGTGACTCTGTGGATGATGAATATGACAATTCAGGTAATATATCAGCTAGAGGTTGTGGTCTAATACCCAATATATGCTTCAAAAACTCTTTGGGCCTTCTTAATCCTGTGCCTGGGCTGAGAATCAGGACCGAGGGACCTATGTCTGTCACTAATAAAGTTGGAGGATCTAGCAGAACAATGCACCATTCACACAGCCAAAAGACCAACAAGCATGCTTGGGATGCTGCCTACAAGCAAAAATCAGAAGCTGCTGTTGGATCTCCTAAGCTGCCCGAGGTGAAAGATAAGTGGATGGGTGAATCCAAACATTTTCCTTCCTCCACCGACCCACAAATGAGAGGTAGGTCTTCTCCCTTCAGGCATTCGAGGGCCGCTTCTCCCTTCCGGAATGAAGCATCACAGTCTCCTTACAGAAAGCAGTCACTTGTAGTTCCTAAAGAAGTCGACATTATCTCCAAATCCAAAGAGGATACTGATTTTCATGATACGTCATCCATTCAAGCAACTAAACACGGGGTTGACATGGCAACTACCCTGATAGAGAAGACACTCTATATAGATACTGCAAGTGTTGCTGAAGCGAATTCTCCATTTAACTCAACCCATTTGGATGGCGAGAAGAAATCTGATCGCCCTAGTGGGAAGAATGAGACTGCATTTGAGACGAGAGTGATGGAAGAAAGTACCATTGTAGAACCTTCTTTCCTAGAAATAAAATGCTTAACTCTGGTTGAAGAAGGGAGGCTGGAGCGTGAAGCTGCAGAATCTAAAAGCCAATATGCTATTGATGATGGCCCCAATGTGGGAACTGGGCTTTATAAAGAAGATCATACTGGATACGCTAATTTGGGCACTGCTGATGAAGAAGATTACTCCAAGGCCAATTATCAGCTAGTCAAAGTAGAAGATCCAGCAAGTGCCATAGTAACTTCTGTGATATCTTCTCAACCTCCGCCACTACCAAAGTCTCCTTCCGAGTCTTGGCTCTGGCGTACCCTGCCTTCAGTTTCCTCTAAAAAGTTACTAGCAGGATCAAATCTTGGAAACAAGTTTTATCAAAAGCCACAGAGCCCTAGAACATCAGTTAGTACCAAGTGGGAAACCATTGTAAAATCTTCGAATTTGCGTCACGATCATGTTCGTTACTCCGAGGAATTAATTCCTCGTGTTTCTCA
GCACTCAACAACAGAAAATTTCAAGTAG

Coding sequence (CDS)

ATGGAGGAAAGAAAACTCAATTTTAATGCACCGCTCATGTCTGTGAGGAGATTTTCCAAGGCAGCTAGTTCTTTAGCTAAAGTGAATGAAAAGAAAGCTGAAAATTCCCATTTTAGTAGACGGAGTACCTTCCTAATTTCTAGACCACAATTCAATTTAGATCAAGTAACAGAACCAGTGGCAGTCCCCTTCCACTGGGAGCAGATTCCAGGAAGAGCTAAGAATGATAGTGGCTCAGCCTCACCTGAGGTTCAGCTGCCTCAGCTTCCTGAGAGGACTTGTTCTACTCCAAGGATTTCCTTTGGGAGGGCTTTGGATGTTAACAGATATAGTTTGGAAATGGAAACTTGTCATCAAAATGGGTGTGAAGCATCTTCTTCAAATGCCATTGTTGTTAGATTAGAGTCAACAAAAGCTAGCGATGTGAGGAGCTTGGCATCTGAGAATGATGAAGATGATGATGATTTCTCTGATGCACGTGAGACATTGTCCCTCACTGGTTCATTCTCTGTTAACAACTGTAGTGTTAGTGGTATAAGTGGATGCAATGGTCCCATGGTAAAACCATCAGGAACTTTTCGAACGGATCCTCAAACTCGAGATTTCATGATGAGCCGCTTCTTGCCTGCTGCCAAGGCAATGGTTTTGGAGCCTGCTAAGTATTCCTTAAAGAAGAAACTTGTAGCAGTTGAGCAACCTAGACAAGTTAAGAAGGTGATGTCTGAGAATAGGAGGATGTCTCCAATTAAACGACTTGAGTCTACCCTGTTACTACAGTATGGCAAAGATGAAGTGCATGGAGTAGATGAAGTAGATGAAGAAAGTGACTCTGTGGATGATGAATATGACAATTCAGGTAATATATCAGCTAGAGGTTGTGGTCTAATACCCAATATATGCTTCAAAAACTCTTTGGGCCTTCTTAATCCTGTGCCTGGGCTGAGAATCAGGACCGAGGGACCTATGTCTGTCACTAATAAAGTTGGAGGATCTAGCAGAACAATGCACCATTCACACAGCCAAAAGACCAACAAGCATGCTTGGGATGCTGCCTACAAGCAAAAATCAGAAGCTGCTGTTGGATCTCCTAAGCTGCCCGAGGTGAAAGATAAGTGGATGGGTGAATCCAAACATTTTCCTTCCTCCACCGACCCACAAATGAGAGGTAGGTCTTCTCCCTTCAGGCATTCGAGGGCCGCTTCTCCCTTCCGGAATGAAGCATCACAGTCTCCTTACAGAAAGCAGTCACTTGTAGTTCCTAAAGAAGTCGACATTATCTCCAAATCCAAAGAGGATACTGATTTTCATGATACGTCATCCATTCAAGCAACTAAACACGGGGTTGACATGGCAACTACCCTGATAGAGAAGACACTCTATATAGATACTGCAAGTGTTGCTGAAGCGAATTCTCCATTTAACTCAACCCATTTGGATGGCGAGAAGAAATCTGATCGCCCTAGTGGGAAGAATGAGACTGCATTTGAGACGAGAGTGATGGAAGAAAGTACCATTGTAGAACCTTCTTTCCTAGAAATAAAATGCTTAACTCTGGTTGAAGAAGGGAGGCTGGAGCGTGAAGCTGCAGAATCTAAAAGCCAATATGCTATTGATGATGGCCCCAATGTGGGAACTGGGCTTTATAAAGAAGATCATACTGGATACGCTAATTTGGGCACTGCTGATGAAGAAGATTACTCCAAGGCCAATTATCAGCTAGTCAAAGTAGAAGATCCAGCAAGTGCCATAGTAACTTCTGTGATATCTTCTCAACCTCCGCCACTACCAAAGTCTCCTTCCGAGTCTTGGCTCTGGCGTACCCTGCCTTCAGTTTCCTCTAAAAAGTTACTAGCAGGATCAAATCTTGGAAACAAGTTTTATCAAAAGCCACAGAGCCCTAGAACATCAGTTAGTACCAAGTGGGAAACCATTGTAAAATCTTCGAATTTGCGTCACGATCATGTTCGTTACTCCGAGGAATTAATTCCTCGTGTTTCTCA
GCACTCAACAACAGAAAATTTCAAGTAG

Protein sequence

MEERKLNFNAPLMSVRRFSKAASSLAKVNEKKAENSHFSRRSTFLISRPQFNLDQVTEPVAVPFHWEQIPGRAKNDSGSASPEVQLPQLPERTCSTPRISFGRALDVNRYSLEMETCHQNGCEASSSNAIVVRLESTKASDVRSLASENDEDDDDFSDARETLSLTGSFSVNNCSVSGISGCNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKKVMSENRRMSPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGNISARGCGLIPNICFKNSLGLLNPVPGLRIRTEGPMSVTNKVGGSSRTMHHSHSQKTNKHAWDAAYKQKSEAAVGSPKLPEVKDKWMGESKHFPSSTDPQMRGRSSPFRHSRAASPFRNEASQSPYRKQSLVVPKEVDIISKSKEDTDFHDTSSIQATKHGVDMATTLIEKTLYIDTASVAEANSPFNSTHLDGEKKSDRPSGKNETAFETRVMEESTIVEPSFLEIKCLTLVEEGRLEREAAESKSQYAIDDGPNVGTGLYKEDHTGYANLGTADEEDYSKANYQLVKVEDPASAIVTSVISSQPPPLPKSPSESWLWRTLPSVSSKKLLAGSNLGNKFYQKPQSPRTSVSTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK
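The protein sequence above is the conceptual translation of the coding sequence (CDS). The following is a minimal sketch of that step, assuming the standard genetic code; the function name `translate` and the two short test fragments (the first 18 and last 15 nucleotides of the CDS) are illustrative choices, not part of the database record.

```python
from itertools import product

# Standard genetic code, written in the conventional TCAG ordering:
# the first base varies slowest, the third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Codons containing characters outside A/C/G/T are reported as 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Fragments copied from the start and end of the CDS shown above.
first_codons = "ATGGAGGAAAGAAAACTC"
last_codons = "GAAAATTTCAAGTAG"

print(translate(first_codons))  # MEERKL
print(translate(last_codons))   # ENFK
```

The two fragments translate to the first six and last four residues of the protein sequence (MEERKL ... ENFK). Passing the full CDS to the same function should reproduce the whole protein under the same assumption.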
BLAST of Cla019665 vs. TrEMBL
Match: A0A0A0LX77_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G470450 PE=4 SV=1)

HSP 1 Score: 1146.7 bits (2965), Expect = 0.0e+00
Identity = 585/678 (86.28%), Positives = 614/678 (90.56%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSLAKVNEKKAENSHFSRRSTFLISRPQFNLDQVTEPV 60
           MEERKLNFNAPLMSVRRFSKAASS++K NEKK+ENSHFSRRSTF +SRPQFNLDQVTEPV
Sbjct: 1   MEERKLNFNAPLMSVRRFSKAASSISKANEKKSENSHFSRRSTFPVSRPQFNLDQVTEPV 60

Query: 61  AVPFHWEQIPGRAKNDSGSASPEVQLPQLPERTCSTPRISFGRALDVNRYSLEMETCHQN 120
           AVPF+WEQIPGRAKNDSGSASPEV LP  PERTCSTPR+SFG ALD N+YS EME CHQ+
Sbjct: 61  AVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANKYSSEMEACHQD 120

Query: 121 GCEASSSNAIVVRLESTKASDVRSLASENDEDDDD--FSDARETLSLTGSFSVNNCSVSG 180
           GCE+SSSNAIVVRLES KAS  RSLASEND+DDDD  FSDARETLSLTGSFSVNNCSVSG
Sbjct: 121 GCESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGSFSVNNCSVSG 180

Query: 181 ISGCNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKK 240
           ISG NGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKK
Sbjct: 181 ISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKK 240

Query: 241 VMSENRRMSPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGNISARGCGLIPN 300
             +ENRR+SPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSG+ISARGCGLIPN
Sbjct: 241 --AENRRISPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGHISARGCGLIPN 300

Query: 301 ICFKNSLGLLNPVPGLRIRTEGPMSVTNKVGGSSRTMHHSHSQKTNKHAWDAAYKQKSEA 360
           ICFKNSLGLLNPVPG+RIRTE PMSVT KVGGSSRT+HHS+ QK NKHAWDA YKQKSEA
Sbjct: 301 ICFKNSLGLLNPVPGMRIRTEAPMSVTKKVGGSSRTLHHSYGQKMNKHAWDATYKQKSEA 360

Query: 361 AVGSPKLPEVKDKWMGESKHFPSSTDPQMRGRSSPFRHSRAASPFRNEASQSPYRKQSLV 420
           AVGSP+L EVKDKW GESKHF SSTD QM+GRSSPFRHSRAASPFRNEAS+SP R+Q  V
Sbjct: 361 AVGSPRLLEVKDKWTGESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEASRSPCRRQPFV 420

Query: 421 VPKEVDIISKSKEDTDFHDTSSIQAT-KHGVDMATTLIEKTLYIDTASVAEANSPFNSTH 480
           VPKEVDIISKSK D D HDT SIQAT K GVDMA  L+EKTLYIDTASVA  N PFNS  
Sbjct: 421 VPKEVDIISKSKGDIDCHDTPSIQATNKDGVDMANILMEKTLYIDTASVAGTNPPFNSAI 480

Query: 481 LDGEKKSDRPSGKNETAFETRVMEESTIVEPSFLEIKCLTLVEEGRLEREAAESKSQYAI 540
            D +KKS+ P+GKNETA E RVMEEST  EPSFLEIKCLT+VEEGRLEREAAESK + AI
Sbjct: 481 FDDKKKSECPNGKNETACEMRVMEESTTEEPSFLEIKCLTMVEEGRLEREAAESKIKDAI 540

Query: 541 DDGPNVGTGLYKEDHTGYANLGTADEEDYSKANYQLVKVEDPASAIVTSVISSQPPPLPK 600
           DD   VG GLY+EDHT Y NLG+ADEEDYSKANYQLVKVEDPAS  VTS ISSQPPPLPK
Sbjct: 541 DDCLKVGHGLYEEDHTEYTNLGSADEEDYSKANYQLVKVEDPASVKVTSAISSQPPPLPK 600

Query: 601 SPSESWLWRTLPSVSSKKLLAGSNLGNKFYQKPQSPRTSVSTKWETIVKSSNLRHDHVRY 660
           SPSESWLWRTLPSVSSKKLLAGSN GNK YQKPQSPRTS STKWETIVKSSNL HDHVRY
Sbjct: 601 SPSESWLWRTLPSVSSKKLLAGSNFGNKLYQKPQSPRTSASTKWETIVKSSNLCHDHVRY 660

Query: 661 SEELIPRVSQHSTTENFK 676
           SEEL+PRVSQHSTTENFK
Sbjct: 661 SEELLPRVSQHSTTENFK 676
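The percentages in each HSP summary line are the match counts divided by the alignment length, which includes gap columns (678 columns here for a 676-residue query). The sketch below recomputes them from a summary line; the helper name `check_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches "<label> = <matches>/<alignment length> (<percent>%)".
# The label is matched generically, so any spelling of the field name works.
HSP_STAT = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def check_hsp_stats(line: str) -> dict:
    """Return {label: (recomputed_percent, reported_percent)} for one line."""
    results = {}
    for label, matches, length, reported in HSP_STAT.findall(line):
        recomputed = round(100 * int(matches) / int(length), 2)
        results[label] = (recomputed, float(reported))
    return results


# Summary line of the first hit (Cucumis sativus) in the report above.
line = "Identity = 585/678 (86.28%), Positives = 614/678 (90.56%), Query Frame = 1"

for label, (recomputed, reported) in check_hsp_stats(line).items():
    print(f"{label}: recomputed {recomputed:.2f}%, reported {reported:.2f}%")
```

The "Query Frame" field carries no fraction, so the pattern skips it.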

BLAST of Cla019665 vs. TrEMBL
Match: A0A061GLK7_THECC (Transcription initiation factor TFIID subunit 11, putative OS=Theobroma cacao GN=TCM_029685 PE=4 SV=1)

HSP 1 Score: 458.0 bits (1177), Expect = 2.0e-125
Identity = 302/727 (41.54%), Positives = 408/727 (56.12%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSLAKVNEKKAENSHFSRRSTFLISRPQFNLDQVTEPV 60
           MEERKLNFNAPL+SVRRFS  ++   +  +K  EN   +RR T        +LDQVTEPV
Sbjct: 1   MEERKLNFNAPLLSVRRFSATSAFSDRDKQKIVENPCPNRRHTLPFYNSDVSLDQVTEPV 60

Query: 61  AVPFHWEQIPGRAKNDSGSASPEVQLPQLPERTCS-TPRISFGRALDVNRYSLEMETCHQ 120
           AVPF WEQIPG+AK   G    E Q    P +  S TPR+  GR LD+ +Y++E E  +Q
Sbjct: 61  AVPFVWEQIPGKAK---GGIEHESQ----PNKEASGTPRLPPGRVLDIMKYTVEKEFENQ 120

Query: 121 N----GCEASSSNAIVVRLESTKASDVRSLASENDEDDDDFSDARETLSLTGSFSVNNCS 180
           N      E  S N  V +L+S+         SE++ DDD +SDA +TLS T S S+N CS
Sbjct: 121 NVVRPQSEIYSLNDNVTKLDSSNKGINEKCISESETDDDAYSDALDTLSPTDSLSMN-CS 180

Query: 181 VSGISGCNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQ 240
           +SG+SG +G + KPSGTF +DPQTRDFMMSRFLPAAKAM LE  +Y+ +K+ VA   PR+
Sbjct: 181 ISGLSGSSGLVAKPSGTFSSDPQTRDFMMSRFLPAAKAMTLEMPQYASRKQSVAPALPRE 240

Query: 241 VKKVMSENRRMSPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGNISARGCGL 300
            KKV+  +R+  P+ + ES ++  Y +D    VD   EE++   D+Y++SGN+S + CGL
Sbjct: 241 DKKVVVGDRK-PPVNQYESVIIPHYNQD----VD--GEETEDEYDDYEDSGNLSRKACGL 300

Query: 301 IPNICFKNSLGLLNPVPGLRIRTEGPMSVTNKVGGSSR-TMHHSHSQKTNKHAWDAAYKQ 360
           +P + FKNSL LLNPVPGL++RT   M  T +V   S+ T   SHSQ   KHAWDA +K 
Sbjct: 301 LPRLSFKNSLCLLNPVPGLKVRTHSSMPSTREVAKPSKATYMKSHSQIIEKHAWDAVHKN 360

Query: 361 KSEA----------------------------AVGSPKLPEVKDKWMGESKHFPSSTDPQ 420
           KS++                             V SP+LPE+  K    S  F +S D Q
Sbjct: 361 KSDSGVQSPQPQENKSDTGVQSPRLPENKLSGGVQSPRLPEIGKKMTCGSNQFTNSGDQQ 420

Query: 421 MRGRSSPFR--HSRAASPFRNEASQSPYRKQS-LVVPKEVD------IISKSKEDTDFHD 480
           +  RS P R   S   SP+R E  QSP+R    L +PKE +      +I  +K + +  +
Sbjct: 421 IVNRSPPKRLPGSARISPYRRERPQSPFRGGGFLGMPKEAEKFNANMLIKYTKSNNNSQE 480

Query: 481 TSSIQATKHGVDMATTLIEKTLYIDTASVAEANSPFNSTHLDGEKKSDRPSGKNETAFET 540
               Q+T+ G    +  +EKTLY+DT + AE  S  NS   D +   D     ++T    
Sbjct: 481 LVPYQSTRQGSGALSPAVEKTLYVDTVNFAEIASS-NSDSSDTKAPMDSMGKHSDTLLVN 540

Query: 541 RVMEESTIVEPSFLEIKCLTLVEE---GRLEREAAESKSQYAIDDGPNVG-----TGLYK 600
           R++EES  VE S  +IKCL L++     + E   +   S+ +  D P++         ++
Sbjct: 541 RMLEESATVESSLQDIKCLNLLDGKDISKYEITGSVYSSRSSFSDKPDLKGQAEMMDCFR 600

Query: 601 EDHTGYANLG----TADEEDYSKANYQLVKVEDPASAIVTSVISSQPPPLPKSPSESWLW 660
           ++     +LG     AD      AN   V+  D       S  S  PPPLPK+PSESWLW
Sbjct: 601 QNGGLNKSLGRIKVRADRSLTLSANGD-VREADQEENNAGSDCSPLPPPLPKTPSESWLW 660

Query: 661 RTLPSVSSKKLLAGSNLGNKFYQKPQSPRTSVS-TKWETIVKSSNLRHDHVRYSEELIPR 672
             LPSV+S+   + S  G +FY K + P+ S + TKWETIVK+S L HDHVRYSEEL+  
Sbjct: 661 CALPSVTSRNSFSQSYNGTRFYPKKEEPKVSATDTKWETIVKTSYLHHDHVRYSEELVTH 710

BLAST of Cla019665 vs. TrEMBL
Match: A0A067JBF4_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21573 PE=4 SV=1)

HSP 1 Score: 433.3 bits (1113), Expect = 5.3e-118
Identity = 289/689 (41.94%), Positives = 392/689 (56.89%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSLAKVNEKKAENSHFSRRSTFLISRPQFNLDQVTEPV 60
           MEERKLNFNAPLMSVRR S A         KK EN+   +R+T    +  FNLDQVTEPV
Sbjct: 1   MEERKLNFNAPLMSVRRSSTATKPSNVTKGKKFENAQLVKRNTLPSYKSDFNLDQVTEPV 60

Query: 61  AVPFHWEQIPGRAKNDSGSASPEVQLPQLPERTCSTPRISFGRALDVNRY----SLEMET 120
           AVPFHWEQIPGR K+ S    P+   P+  E    TPR +  RALDV ++      E + 
Sbjct: 61  AVPFHWEQIPGRRKDGS---KPD---PRGCEEASVTPRFTPRRALDVVKHIEDKKPEDQV 120

Query: 121 CHQNGCEASSSNAIVVRLESTK--ASDVRSLASENDEDDDDFSDARETLSLTGSFSVNNC 180
             +   +++S N I   L+ +K   ++     SEND+DDD +SDAR+TLS   SFSV +C
Sbjct: 121 AFRPQIQSNSFNDIANGLDCSKEGVNEKSDFNSENDDDDDLYSDARDTLSGMDSFSV-DC 180

Query: 181 SVSGISGCNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPR 240
           SVSG+SG +   VKPSGTF  DPQTRDFMMSRFLPAAKAM LE  +Y+ +K+ V+ EQPR
Sbjct: 181 SVSGVSGFDSLAVKPSGTFNADPQTRDFMMSRFLPAAKAMTLEAPQYASRKQPVSGEQPR 240

Query: 241 QVKKVMSENRRMSPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGNISARGCG 300
           Q+ +V+  + R  P+ R ES  +  Y +      D VDEES+   D+Y N G I  +GCG
Sbjct: 241 QIVQVVQRD-RTPPVNRKESFNVPSYHQ------DLVDEESEDECDQYVNYGKIMTKGCG 300

Query: 301 LIPNICFKNSLGLLNPVPGLRIRTEGPMSVTNKVGGSSRTMH-HSHSQKTNKHAWDAAYK 360
           L+P +C KNSL L+NPVPG+++R + PMS    +   +++++  S S   NK A D  +K
Sbjct: 301 LLPLLCVKNSLRLVNPVPGMKVRNQSPMSAARDIKRMTKSVYSRSQSPTINKPAKDPVHK 360

Query: 361 QKSEAAVGSPKLPEVKDKWMGESKHFPSSTDPQMRGRSSPFRHSRAASPFRNEASQSPYR 420
           ++ +  V SP+L  V +K  G S  F  + D QM  R+SPFR S A SP+RNEA QSP+ 
Sbjct: 361 KEPDNEVQSPRLVGVDNKLTGGSNRFTYARDRQMISRTSPFRRSGAISPYRNEAPQSPFP 420

Query: 421 KQS-LVVPKEVD------------IISKSKEDTDFHDTSSIQATKHGVDMATTLIEKTLY 480
               L VPK+++              SKS+E   +H        +HG    +   EKTLY
Sbjct: 421 IGGFLGVPKDLENFKANKLNLYGKCYSKSQELVPYH------GLRHGSRPLSPTTEKTLY 480

Query: 481 IDTASVAEANSPFNSTHLDGEKKSDRPSGKN-ETAFETRVMEESTIVEPSFLEIKCLTLV 540
           +DT +VA      N+   D +K    P+ K+ ++   +R ++E+  +E +  ++  L   
Sbjct: 481 VDTVNVAGLLCS-NAGSSDIKKGGMGPAEKDIKSLLSSREIQETYTIESTSKDVTSLNFP 540

Query: 541 EEGRLEREAAESKSQYAIDDGPNVGTGLYKEDHTGYANLGTADEEDYSKANYQLVKVEDP 600
           E+   + + +         D  + G  L +E       + T  E + +  N Q+  + D 
Sbjct: 541 EQKSGDADLSLLSDMSTHRDQWDTGEDLSQES-LALVCVSTTTEGNLNIENDQISNM-DI 600

Query: 601 ASAIVTSVISSQPPPLPKSPSESWLWRTLPSVSSKKLLAGSNLGNKFYQKPQ-SPRTSVS 660
            +A       S PP LPK+PSESWL RTLP+VSS+   +    G  F  K Q S  TS S
Sbjct: 601 GNAKTGFAQCSLPPSLPKTPSESWLSRTLPTVSSQNPSSHLYRGTNFRSKRQDSKTTSTS 660

Query: 661 TKWETIVKSSNLRHDHVRYSEELIPRVSQ 668
           TKWE IVKSS L +DHVRYSEEL P  SQ
Sbjct: 661 TKWENIVKSSYLHNDHVRYSEELFPHASQ 666

BLAST of Cla019665 vs. TrEMBL
Match: A0A067GDT4_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g006045mg PE=4 SV=1)

HSP 1 Score: 429.5 bits (1103), Expect = 7.6e-117
Identity = 285/681 (41.85%), Positives = 387/681 (56.83%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAA-SSLAKVNEKKAENSHFSRRSTFLISRPQFNLDQVTEP 60
           M+ERKLNFNAPL+SVRR+S  A +S    N K  E S  SRR +    R   NL+QVTEP
Sbjct: 1   MDERKLNFNAPLLSVRRYSTTAVASSDGENGKMVEISASSRRYSIPFYRTDLNLEQVTEP 60

Query: 61  VAVPFHWEQIPGRAKNDSGSASPEVQLPQLPERTCSTPRISFGRALDVNRYSLEMETCHQ 120
            AVPF WEQIPGR K+      PE+Q     E    TPR+   +ALD+ +Y L  E    
Sbjct: 61  AAVPFMWEQIPGRPKD----GGPELQHS---EDAPVTPRLPPLKALDIIKYPLAKEFDDS 120

Query: 121 NGCEASSSNAIVVRLESTKASDVRSLASENDEDDDDFSDARETLSLTGSFSVNNCSVSGI 180
              E+ S N  +  L+S   ++      + D DDD +SDA +TLS T S+S+N CS+SG+
Sbjct: 121 PRVESRSLNENMCTLDSPNEANDWKQQLDTDNDDDVYSDALDTLSSTDSYSIN-CSLSGL 180

Query: 181 SGCNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKKV 240
           SG +G +VK SGTF TDPQTRDFMM RFLPAAKAM LEP +Y+ +K+ V +EQPRQV KV
Sbjct: 181 SGSDGQVVKRSGTFSTDPQTRDFMMRRFLPAAKAMALEPPQYASRKQPVTIEQPRQVIKV 240

Query: 241 MSENRRMSPIKRLESTLLLQYGKD-EVHGVDEVDEESDSVDDEYDNSGNISARGCGLIPN 300
           +SE+RR  P+   +S  +  YG+D E    +E +EE++   DEYD+S N+S + CGL+P 
Sbjct: 241 VSEDRR--PLVN-KSIFIPHYGEDVEEEEEEEEEEETEDEVDEYDDSDNLSGKACGLLPR 300

Query: 301 ICFKNSLGLLNPVPGLRIRTEGPMSVTNKVGGSSRTMH-HSHSQKTNKHAWDAAYKQKSE 360
           +C   SL LLNP+PGL+ RT   +S ++ V    +  +  S +Q   KH  DA YK ++E
Sbjct: 301 LCLNKSLCLLNPMPGLKARTHSSVSSSSDVRNLGKAAYTESRNQTVKKHVRDAVYKHQAE 360

Query: 361 AAVGSPKLPEVKDKWMGESKHFPSSTDPQMRGRSSPFRHSRAASPFRNEASQSPYRKQS- 420
           + V SPKL  +++K    S  F   +D QM GRSSP+R  R  SP+RNE  QSP+R    
Sbjct: 361 SGVQSPKLLGIENKMTCGSNRFACLSDQQMAGRSSPYR--RGISPYRNERPQSPFRGGGF 420

Query: 421 LVVPKEVDIISKSKEDTDFHDTSSIQ------ATKHGVDMATTLIEKTLYIDTASVAEAN 480
           L VPKE + +  +K +      S  Q      + K      +  +EKTLY+DT + ++ +
Sbjct: 421 LGVPKEAENVRANKLNPYNRAGSKSQELFPHHSFKKRFGSLSPAVEKTLYVDTVNFSKIS 480

Query: 481 SPFNSTHLDG-EKKSDRPSGKNETAFETRVMEESTIVEPSFLEIKCLTLVEEGRLEREAA 540
                   +G E+ +   + K+E+  ET+V         S  E K +    +G +E    
Sbjct: 481 DTMGQMKSEGIERTASVDTVKDESRSETKVSASIEASRSSSFE-KIMHPAGQGDME---- 540

Query: 541 ESKSQYAIDDGPNVGTGLYKEDHTGYANLGTADEEDYSKANYQLVKVEDPASAIVTSVIS 600
               Q    DG      L +E  +      TADE   S   ++  + +D       S  S
Sbjct: 541 ----QCLGLDGE-----LNQECKSLVCTNVTADETLNSICQHK-SEADDLGCINSGSEQS 600

Query: 601 SQPPPLPKSPSESWLWRTLPSVSSKKLLAGSNLGNKFYQKPQSPRTSV-STKWETIVKSS 660
             P PLPK P+ESWLWRTLPSVSS+   +  N+G +F  K Q P+T + +TKWETIVK+S
Sbjct: 601 LLPLPLPKKPTESWLWRTLPSVSSRNSFSNPNVGTRFNPKKQDPKTPLTTTKWETIVKTS 653

Query: 661 NLRHDHVRYSEELIPRVSQHS 670
              HDH+RYSEEL    SQ S
Sbjct: 661 YAHHDHIRYSEELTSHFSQQS 653

BLAST of Cla019665 vs. TrEMBL
Match: V4UF07_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10014532mg PE=4 SV=1)

HSP 1 Score: 427.6 bits (1098), Expect = 2.9e-116
Identity = 279/687 (40.61%), Positives = 391/687 (56.91%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAA-SSLAKVNEKKAENSHFSRRSTFLISRPQFNLDQVTEP 60
           M+ERKLNFNAPL+SVRR+S  A +S    N K  E+S  SRR +    R   NL+QVTEP
Sbjct: 1   MDERKLNFNAPLLSVRRYSTTAVASSDGENGKMVESSASSRRYSIPFYRTDLNLEQVTEP 60

Query: 61  VAVPFHWEQIPGRAKNDSGSASPEVQLPQLPERTCSTPRISFGRALDVNRYSLEMETCHQ 120
            AVPF WEQIPGR K+      PE++     E    TPR++  +AL++ +Y L  E    
Sbjct: 61  AAVPFMWEQIPGRPKD----GGPELEHS---EDAPVTPRLTPLKALNIIKYPLAKEFDDL 120

Query: 121 NGCEASSSNAIVVRLESTKASDVRSLASENDEDDDDFSDARETLSLTGSFSVNNCSVSGI 180
              E+ S N  +  L+S   ++      + D DDD +SDA +TLS T S+S+N CS+SG+
Sbjct: 121 PRVESRSLNENMCTLDSP--NEANDWKQQLDTDDDVYSDALDTLSSTDSYSIN-CSLSGL 180

Query: 181 SGCNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKKV 240
           SG +G +VK SGTF TDPQTRDFMM RFLPAAKAM LEP +Y+ +K+ V +EQPRQV KV
Sbjct: 181 SGSDGQVVKRSGTFSTDPQTRDFMMRRFLPAAKAMALEPPQYASRKQPVTIEQPRQVIKV 240

Query: 241 MSENRRMSPIKRLESTLLLQYGKD-------EVHGVDEVDEESDSVDDEYDNSGNISARG 300
           +SE+RR  P+   +S  +  YG+D       E    +E +EE++   DEYD+SGN+S + 
Sbjct: 241 VSEDRR--PLVN-KSIFIPHYGEDVEEEEEEEEEEEEEEEEETEDEVDEYDDSGNLSRKA 300

Query: 301 CGLIPNICFKNSLGLLNPVPGLRIRTEGPMSVTNKVGGSSRTMH-HSHSQKTNKHAWDAA 360
           CGL+P +C   SL LLNP+PGL+ RT   +S ++ V    +  +  S +Q   KH  DA 
Sbjct: 301 CGLLPRLCLNKSLCLLNPMPGLKARTHSSVSSSSDVRNLGKAAYTESRNQTVKKHVRDAV 360

Query: 361 YKQKSEAAVGSPKLPEVKDKWMGESKHFPSSTDPQMRGRSSPFRHSRAASPFRNEASQSP 420
           YK ++E+ V SPKL  +++K    SK F   +D QM GRSSP+R  R  SP+RNE  QSP
Sbjct: 361 YKHQAESGVQSPKLLGIENKMTCGSKQFACLSDQQMAGRSSPYR--RGISPYRNERPQSP 420

Query: 421 YRKQS-LVVPKEVDIISKSKEDTDFHDTSSIQ------ATKHGVDMATTLIEKTLYIDTA 480
           +R    L VPKE + +  +K +      S  Q      + K      +  +EKTLY+DT 
Sbjct: 421 FRGGGFLGVPKEAENVRANKLNPYNRAGSKSQELFPHHSFKKRFGSLSPAVEKTLYVDTV 480

Query: 481 SVAEANSPFNSTHLDG-EKKSDRPSGKNETAFETRVMEESTIVEPSFLEIKCLTLVEEGR 540
           + ++ +        +G E+ +   + K+E+  ET+V           + I+        +
Sbjct: 481 NFSKISDTMGQMESEGRERIASVDTAKDESRSETKVS----------VSIEASRSSSSEK 540

Query: 541 LEREAAESKSQYAIDDGPNVGTGLYKEDHTGYANLGTADEEDYSKANYQLVKVEDPASAI 600
           +   A +   ++ +     +   L +E  +      TADE   S   ++  + +D     
Sbjct: 541 IMHPAGQGDMEHCL----GLHGELNQECKSLVCTNVTADETLNSICQHK-SEADDLGCIN 600

Query: 601 VTSVISSQPPPLPKSPSESWLWRTLPSVSSKKLLAGSNLGNKFYQKPQSPRTSV-STKWE 660
             S  S  P PLPK P+ESWLWRTLPSVSS+   +  N+G +F  K Q P+T + +TKWE
Sbjct: 601 SGSEQSPLPLPLPKKPTESWLWRTLPSVSSRNSFSNPNVGTRFNPKKQDPKTPLTTTKWE 657

Query: 661 TIVKSSNLRHDHVRYSEELIPRVSQHS 670
           TIVK+S   HDH+RYSEEL    SQ S
Sbjct: 661 TIVKTSYAHHDHIRYSEELTSHFSQQS 657

BLAST of Cla019665 vs. NCBI nr
Match: gi|449465006|ref|XP_004150220.1| (PREDICTED: uncharacterized protein LOC101207534 [Cucumis sativus])

HSP 1 Score: 1146.7 bits (2965), Expect = 0.0e+00
Identity = 585/678 (86.28%), Positives = 614/678 (90.56%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSLAKVNEKKAENSHFSRRSTFLISRPQFNLDQVTEPV 60
           MEERKLNFNAPLMSVRRFSKAASS++K NEKK+ENSHFSRRSTF +SRPQFNLDQVTEPV
Sbjct: 1   MEERKLNFNAPLMSVRRFSKAASSISKANEKKSENSHFSRRSTFPVSRPQFNLDQVTEPV 60

Query: 61  AVPFHWEQIPGRAKNDSGSASPEVQLPQLPERTCSTPRISFGRALDVNRYSLEMETCHQN 120
           AVPF+WEQIPGRAKNDSGSASPEV LP  PERTCSTPR+SFG ALD N+YS EME CHQ+
Sbjct: 61  AVPFYWEQIPGRAKNDSGSASPEVHLPHPPERTCSTPRLSFGTALDANKYSSEMEACHQD 120

Query: 121 GCEASSSNAIVVRLESTKASDVRSLASENDEDDDD--FSDARETLSLTGSFSVNNCSVSG 180
           GCE+SSSNAIVVRLES KAS  RSLASEND+DDDD  FSDARETLSLTGSFSVNNCSVSG
Sbjct: 121 GCESSSSNAIVVRLESAKASGGRSLASENDDDDDDDDFSDARETLSLTGSFSVNNCSVSG 180

Query: 181 ISGCNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKK 240
           ISG NGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKK
Sbjct: 181 ISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKK 240

Query: 241 VMSENRRMSPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGNISARGCGLIPN 300
             +ENRR+SPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSG+ISARGCGLIPN
Sbjct: 241 --AENRRISPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGHISARGCGLIPN 300

Query: 301 ICFKNSLGLLNPVPGLRIRTEGPMSVTNKVGGSSRTMHHSHSQKTNKHAWDAAYKQKSEA 360
           ICFKNSLGLLNPVPG+RIRTE PMSVT KVGGSSRT+HHS+ QK NKHAWDA YKQKSEA
Sbjct: 301 ICFKNSLGLLNPVPGMRIRTEAPMSVTKKVGGSSRTLHHSYGQKMNKHAWDATYKQKSEA 360

Query: 361 AVGSPKLPEVKDKWMGESKHFPSSTDPQMRGRSSPFRHSRAASPFRNEASQSPYRKQSLV 420
           AVGSP+L EVKDKW GESKHF SSTD QM+GRSSPFRHSRAASPFRNEAS+SP R+Q  V
Sbjct: 361 AVGSPRLLEVKDKWTGESKHFSSSTDLQMKGRSSPFRHSRAASPFRNEASRSPCRRQPFV 420

Query: 421 VPKEVDIISKSKEDTDFHDTSSIQAT-KHGVDMATTLIEKTLYIDTASVAEANSPFNSTH 480
           VPKEVDIISKSK D D HDT SIQAT K GVDMA  L+EKTLYIDTASVA  N PFNS  
Sbjct: 421 VPKEVDIISKSKGDIDCHDTPSIQATNKDGVDMANILMEKTLYIDTASVAGTNPPFNSAI 480

Query: 481 LDGEKKSDRPSGKNETAFETRVMEESTIVEPSFLEIKCLTLVEEGRLEREAAESKSQYAI 540
            D +KKS+ P+GKNETA E RVMEEST  EPSFLEIKCLT+VEEGRLEREAAESK + AI
Sbjct: 481 FDDKKKSECPNGKNETACEMRVMEESTTEEPSFLEIKCLTMVEEGRLEREAAESKIKDAI 540

Query: 541 DDGPNVGTGLYKEDHTGYANLGTADEEDYSKANYQLVKVEDPASAIVTSVISSQPPPLPK 600
           DD   VG GLY+EDHT Y NLG+ADEEDYSKANYQLVKVEDPAS  VTS ISSQPPPLPK
Sbjct: 541 DDCLKVGHGLYEEDHTEYTNLGSADEEDYSKANYQLVKVEDPASVKVTSAISSQPPPLPK 600

Query: 601 SPSESWLWRTLPSVSSKKLLAGSNLGNKFYQKPQSPRTSVSTKWETIVKSSNLRHDHVRY 660
           SPSESWLWRTLPSVSSKKLLAGSN GNK YQKPQSPRTS STKWETIVKSSNL HDHVRY
Sbjct: 601 SPSESWLWRTLPSVSSKKLLAGSNFGNKLYQKPQSPRTSASTKWETIVKSSNLCHDHVRY 660

Query: 661 SEELIPRVSQHSTTENFK 676
           SEEL+PRVSQHSTTENFK
Sbjct: 661 SEELLPRVSQHSTTENFK 676

BLAST of Cla019665 vs. NCBI nr
Match: gi|659068207|ref|XP_008443305.1| (PREDICTED: uncharacterized protein LOC103486924 [Cucumis melo])

HSP 1 Score: 1141.7 bits (2952), Expect = 0.0e+00
Identity = 584/677 (86.26%), Positives = 607/677 (89.66%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSLAKVNEKKAENSHFSRRSTFLISRPQFNLDQVTEPV 60
           MEERKLNFNAPLMSVRRFSKAASS+ K NEKK+ENSHFSRRSTF +SRPQFNLDQVTEPV
Sbjct: 1   MEERKLNFNAPLMSVRRFSKAASSIYKANEKKSENSHFSRRSTFPVSRPQFNLDQVTEPV 60

Query: 61  AVPFHWEQIPGRAKNDSGSASPEVQLPQLPERTCSTPRISFGRALDVNRYSLEMETCHQN 120
           AVPF+WEQIPGRAKNDSGSASPEVQLP  PERTCSTPR+SFG ALD N+YS EME CHQ+
Sbjct: 61  AVPFYWEQIPGRAKNDSGSASPEVQLPHPPERTCSTPRLSFGTALDANKYSSEMEACHQD 120

Query: 121 GCEASSSNAIVVRLESTKASDVRSLASENDEDDDD--FSDARETLSLTGSFSVNNCSVSG 180
           GCE+SSSNAIVVRLES KAS  RSLASEND+DDDD  FSDARETLSLTGSFSVNNCSVSG
Sbjct: 121 GCESSSSNAIVVRLESAKASGARSLASENDDDDDDDDFSDARETLSLTGSFSVNNCSVSG 180

Query: 181 ISGCNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKK 240
           ISG NGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKK
Sbjct: 181 ISGYNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQVKK 240

Query: 241 VMSENRRMSPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGNISARGCGLIPN 300
           V  ENRRMSPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGNISARGCGLIPN
Sbjct: 241 V--ENRRMSPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGNISARGCGLIPN 300

Query: 301 ICFKNSLGLLNPVPGLRIRTEGPMSVTNKVGGSSRTMHHSHSQKTNKHAWDAAYKQKSEA 360
           ICFKNSLGLLNPVPG+RIRTE PMSVT KVG SSRT+HH + QKTNKHAWDA YKQKSEA
Sbjct: 301 ICFKNSLGLLNPVPGMRIRTEAPMSVTKKVGESSRTLHHPYGQKTNKHAWDATYKQKSEA 360

Query: 361 AVGSPKLPEVKDKWMGESKHFPSSTDPQMRGRSSPFRHSRAASPFRNEASQSPYRKQSLV 420
           AVGS KL EVKDKW GESKHF  STD QM+GRSSPFRHSRAASPFRNEASQSP R+Q  V
Sbjct: 361 AVGSHKLLEVKDKWTGESKHFSFSTDLQMKGRSSPFRHSRAASPFRNEASQSPCRRQPFV 420

Query: 421 VPKEVDIISKSKEDTDFHDTSSIQATKHGVDMATTLIEKTLYIDTASVAEANSPFNSTHL 480
           VPKEVD ISKSK D DFHDT SIQA K GVDMA+ L+EKTLYIDTASVAE N PFN    
Sbjct: 421 VPKEVDTISKSKGDVDFHDTPSIQANKDGVDMASFLMEKTLYIDTASVAETNPPFNPAIS 480

Query: 481 DGEKKSDRPSGKNETAFETRVMEESTIVEPSFLEIKCLTLVEEGRLEREAAESKSQYAID 540
           D +KK +  +GK+ETA E RVMEEST  EPSFLEIKCLT+VEEGRLEREAAESKS+   D
Sbjct: 481 DDKKKLEYHNGKDETACEMRVMEESTTEEPSFLEIKCLTMVEEGRLEREAAESKSKDVTD 540

Query: 541 DGPNVGTGLYKEDHTGYANLGTADEEDYSKANYQLVKVEDPASAIVTSVISSQPPPLPKS 600
             P VG GLY+EDHT Y N GTADEEDYSKANYQLVKVEDPA   VTS ISSQPPPLPKS
Sbjct: 541 YCPIVGHGLYEEDHTEYTNSGTADEEDYSKANYQLVKVEDPAIVKVTSSISSQPPPLPKS 600

Query: 601 PSESWLWRTLPSVSSKKLLAGSNLGNKFYQKPQSPRTSVSTKWETIVKSSNLRHDHVRYS 660
           PSESWLWRTLPSVSSKKLLAGSNLGNK YQKPQSPR S STKWETIVKSSNLRHDHVRYS
Sbjct: 601 PSESWLWRTLPSVSSKKLLAGSNLGNKLYQKPQSPRISASTKWETIVKSSNLRHDHVRYS 660

Query: 661 EELIPRVSQHSTTENFK 676
           EELIPRVSQHSTTENFK
Sbjct: 661 EELIPRVSQHSTTENFK 675

BLAST of Cla019665 vs. NCBI nr
Match: gi|590623585|ref|XP_007025362.1| (Transcription initiation factor TFIID subunit 11, putative [Theobroma cacao])

HSP 1 Score: 458.0 bits (1177), Expect = 2.9e-125
Identity = 302/727 (41.54%), Positives = 408/727 (56.12%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSLAKVNEKKAENSHFSRRSTFLISRPQFNLDQVTEPV 60
           MEERKLNFNAPL+SVRRFS  ++   +  +K  EN   +RR T        +LDQVTEPV
Sbjct: 1   MEERKLNFNAPLLSVRRFSATSAFSDRDKQKIVENPCPNRRHTLPFYNSDVSLDQVTEPV 60

Query: 61  AVPFHWEQIPGRAKNDSGSASPEVQLPQLPERTCS-TPRISFGRALDVNRYSLEMETCHQ 120
           AVPF WEQIPG+AK   G    E Q    P +  S TPR+  GR LD+ +Y++E E  +Q
Sbjct: 61  AVPFVWEQIPGKAK---GGIEHESQ----PNKEASGTPRLPPGRVLDIMKYTVEKEFENQ 120

Query: 121 N----GCEASSSNAIVVRLESTKASDVRSLASENDEDDDDFSDARETLSLTGSFSVNNCS 180
           N      E  S N  V +L+S+         SE++ DDD +SDA +TLS T S S+N CS
Sbjct: 121 NVVRPQSEIYSLNDNVTKLDSSNKGINEKCISESETDDDAYSDALDTLSPTDSLSMN-CS 180

Query: 181 VSGISGCNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPRQ 240
           +SG+SG +G + KPSGTF +DPQTRDFMMSRFLPAAKAM LE  +Y+ +K+ VA   PR+
Sbjct: 181 ISGLSGSSGLVAKPSGTFSSDPQTRDFMMSRFLPAAKAMTLEMPQYASRKQSVAPALPRE 240

Query: 241 VKKVMSENRRMSPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGNISARGCGL 300
            KKV+  +R+  P+ + ES ++  Y +D    VD   EE++   D+Y++SGN+S + CGL
Sbjct: 241 DKKVVVGDRK-PPVNQYESVIIPHYNQD----VD--GEETEDEYDDYEDSGNLSRKACGL 300

Query: 301 IPNICFKNSLGLLNPVPGLRIRTEGPMSVTNKVGGSSR-TMHHSHSQKTNKHAWDAAYKQ 360
           +P + FKNSL LLNPVPGL++RT   M  T +V   S+ T   SHSQ   KHAWDA +K 
Sbjct: 301 LPRLSFKNSLCLLNPVPGLKVRTHSSMPSTREVAKPSKATYMKSHSQIIEKHAWDAVHKN 360

Query: 361 KSEA----------------------------AVGSPKLPEVKDKWMGESKHFPSSTDPQ 420
           KS++                             V SP+LPE+  K    S  F +S D Q
Sbjct: 361 KSDSGVQSPQPQENKSDTGVQSPRLPENKLSGGVQSPRLPEIGKKMTCGSNQFTNSGDQQ 420

Query: 421 MRGRSSPFR--HSRAASPFRNEASQSPYRKQS-LVVPKEVD------IISKSKEDTDFHD 480
           +  RS P R   S   SP+R E  QSP+R    L +PKE +      +I  +K + +  +
Sbjct: 421 IVNRSPPKRLPGSARISPYRRERPQSPFRGGGFLGMPKEAEKFNANMLIKYTKSNNNSQE 480

Query: 481 TSSIQATKHGVDMATTLIEKTLYIDTASVAEANSPFNSTHLDGEKKSDRPSGKNETAFET 540
               Q+T+ G    +  +EKTLY+DT + AE  S  NS   D +   D     ++T    
Sbjct: 481 LVPYQSTRQGSGALSPAVEKTLYVDTVNFAEIASS-NSDSSDTKAPMDSMGKHSDTLLVN 540

Query: 541 RVMEESTIVEPSFLEIKCLTLVEE---GRLEREAAESKSQYAIDDGPNVG-----TGLYK 600
           R++EES  VE S  +IKCL L++     + E   +   S+ +  D P++         ++
Sbjct: 541 RMLEESATVESSLQDIKCLNLLDGKDISKYEITGSVYSSRSSFSDKPDLKGQAEMMDCFR 600

Query: 601 EDHTGYANLG----TADEEDYSKANYQLVKVEDPASAIVTSVISSQPPPLPKSPSESWLW 660
           ++     +LG     AD      AN   V+  D       S  S  PPPLPK+PSESWLW
Sbjct: 601 QNGGLNKSLGRIKVRADRSLTLSANGD-VREADQEENNAGSDCSPLPPPLPKTPSESWLW 660

Query: 661 RTLPSVSSKKLLAGSNLGNKFYQKPQSPRTSVS-TKWETIVKSSNLRHDHVRYSEELIPR 672
             LPSV+S+   + S  G +FY K + P+ S + TKWETIVK+S L HDHVRYSEEL+  
Sbjct: 661 CALPSVTSRNSFSQSYNGTRFYPKKEEPKVSATDTKWETIVKTSYLHHDHVRYSEELVTH 710

BLAST of Cla019665 vs. NCBI nr
Match: gi|1000938093|ref|XP_015583729.1| (PREDICTED: uncharacterized protein LOC8258854 [Ricinus communis])

HSP 1 Score: 443.4 bits (1139), Expect = 7.3e-121
Identity = 296/703 (42.11%), Positives = 396/703 (56.33%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSLAKVNE------KKAENSHFSRRSTFLISRPQFNLD 60
           MEERKLNFN PL+SVRR S    S A          KK +N H  RR T    +P + LD
Sbjct: 1   MEERKLNFNIPLLSVRRSSTPTRSSAPTKSSSGEKGKKNDNFHPDRRRTLPSCKPAYILD 60

Query: 61  QVTEPVAVPFHWEQIPGRAKNDSGSASPEVQLPQLPERTCSTPRISFGRALDVNRYSLEM 120
           QVTEPVAVPF WEQIPGR K+    A P+   PQ  E    TPRI   R LDV ++    
Sbjct: 61  QVTEPVAVPFQWEQIPGRPKD---GAVPD---PQGHEEVSVTPRIPPRRVLDVVKHIDNK 120

Query: 121 ETCHQNGC----EASSSNAIVVRLESTK--ASDVRSLASENDEDDDDFSDARETLSLTGS 180
           +   Q+      EA S   IV RL+ +K    +   +  END+D+D +SDA +TLS T S
Sbjct: 121 KPEDQDALTPQIEAKSFTNIVGRLDCSKEGVDEKAIIILENDDDEDVYSDALDTLSPTDS 180

Query: 181 FSVNNCSVSGISGCNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLV 240
           FSVN CS+SG+SG +   VKPSGTF  D Q +DFMMSRFLPAAKAM LEP +Y+ +K+ V
Sbjct: 181 FSVN-CSLSGVSGFDNLAVKPSGTFSIDQQAQDFMMSRFLPAAKAMTLEPPQYASRKQPV 240

Query: 241 AVEQPRQVKKVMSENRRMSPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGNI 300
           + EQPRQ  K ++ + R  P+ R  S  +  Y +      D+ DEES+   D+Y +SGNI
Sbjct: 241 SGEQPRQTTKAVNRD-RTPPVIRNRSCNIPPYHQ------DKEDEESEDECDDYSDSGNI 300

Query: 301 SARGCGLIPNICFKNSLGLLNPVPGLRIRTEGPMSVTNKVGGSSRTM-HHSHSQKTNKHA 360
           +A+GCG +P +C KNSL LLNPVPG++IRT+  MS T  +   ++ +   S S    K A
Sbjct: 301 TAKGCGFLPRLCIKNSLCLLNPVPGMKIRTQTSMSSTKDIKKLTKAVFSRSQSPTVKKPA 360

Query: 361 WDAAYKQKSEAAVGSPKLPEVKDKWMGESKHFPSSTDPQMRGRSSPFRHSRAASPFRNEA 420
            +A  KQK ++ V SP++  V++K  G S  F  +TD QM  R+SPFR S   SP RNEA
Sbjct: 361 RNAVSKQKQDSEVPSPRMVGVENKLTGGSNRFTYATDRQMISRTSPFRRSGCISPHRNEA 420

Query: 421 SQSPYR-KQSLVVPKEVDIISKSKEDTDFH-------DTSSIQATKHGVDMATTLIEKTL 480
            QSP+R + S  +PK+++ + KS +   F+       +  S    + G   A+  +EKTL
Sbjct: 421 PQSPFRGRGSQGIPKQLENL-KSNQFNSFNRGYSKSQELVSYNGIRRGSRPASPTVEKTL 480

Query: 481 YIDTASVAEANSPFNSTHLDGEKKSDRPSGKNETAFE--TRVMEESTIVEPSFLEIKCLT 540
           Y+DT + A                SD   G  ++A +    + +E  +V+ SF ++KCL 
Sbjct: 481 YVDTVNAA-------GILCSNSGSSDIKKGFVDSAEKDLKSLFQEIAVVKSSFRDMKCLN 540

Query: 541 LV-EEGRLEREAAES---KSQYAIDDGPNVGTGLYKED----HTGYANLGTADEEDYSKA 600
           +   EG+LE +   S   +     D  P+ G     ED          +  A E + +  
Sbjct: 541 VAGGEGKLETKGLRSGGPELPLLSDGSPDKGQAEMTEDLSKESMALVCISAATEGNVNIE 600

Query: 601 NYQLVKVEDPASAIVTSVISSQ--PPPLPKSPSESWLWRTLPSVSSKKLLAGSNLGNKFY 660
           + Q+ K +D  S   + V+     PP LPK+PSESWLWRTLPS+SS+   + S   N F 
Sbjct: 601 SDQISKRDDTGSEKTSLVLVQPPIPPLLPKTPSESWLWRTLPSISSQNQSSNSYRNNSFL 660

Query: 661 QKPQSPRT-SVSTKWETIVKSSNLRHDHVRYSEELIPRVSQHS 670
            K Q  +T S +TKWE IVKSS L HDHVRYSEEL P  SQ S
Sbjct: 661 SKRQDTKTFSATTKWENIVKSSYLHHDHVRYSEELFPHASQQS 681

BLAST of Cla019665 vs. NCBI nr
Match: gi|643704038|gb|KDP21102.1| (hypothetical protein JCGZ_21573 [Jatropha curcas])

HSP 1 Score: 433.3 bits (1113), Expect = 7.6e-118
Identity = 289/689 (41.94%), Positives = 392/689 (56.89%), Query Frame = 1

Query: 1   MEERKLNFNAPLMSVRRFSKAASSLAKVNEKKAENSHFSRRSTFLISRPQFNLDQVTEPV 60
           MEERKLNFNAPLMSVRR S A         KK EN+   +R+T    +  FNLDQVTEPV
Sbjct: 1   MEERKLNFNAPLMSVRRSSTATKPSNVTKGKKFENAQLVKRNTLPSYKSDFNLDQVTEPV 60

Query: 61  AVPFHWEQIPGRAKNDSGSASPEVQLPQLPERTCSTPRISFGRALDVNRY----SLEMET 120
           AVPFHWEQIPGR K+ S    P+   P+  E    TPR +  RALDV ++      E + 
Sbjct: 61  AVPFHWEQIPGRRKDGS---KPD---PRGCEEASVTPRFTPRRALDVVKHIEDKKPEDQV 120

Query: 121 CHQNGCEASSSNAIVVRLESTK--ASDVRSLASENDEDDDDFSDARETLSLTGSFSVNNC 180
             +   +++S N I   L+ +K   ++     SEND+DDD +SDAR+TLS   SFSV +C
Sbjct: 121 AFRPQIQSNSFNDIANGLDCSKEGVNEKSDFNSENDDDDDLYSDARDTLSGMDSFSV-DC 180

Query: 181 SVSGISGCNGPMVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKKLVAVEQPR 240
           SVSG+SG +   VKPSGTF  DPQTRDFMMSRFLPAAKAM LE  +Y+ +K+ V+ EQPR
Sbjct: 181 SVSGVSGFDSLAVKPSGTFNADPQTRDFMMSRFLPAAKAMTLEAPQYASRKQPVSGEQPR 240

Query: 241 QVKKVMSENRRMSPIKRLESTLLLQYGKDEVHGVDEVDEESDSVDDEYDNSGNISARGCG 300
           Q+ +V+  + R  P+ R ES  +  Y +      D VDEES+   D+Y N G I  +GCG
Sbjct: 241 QIVQVVQRD-RTPPVNRKESFNVPSYHQ------DLVDEESEDECDQYVNYGKIMTKGCG 300

Query: 301 LIPNICFKNSLGLLNPVPGLRIRTEGPMSVTNKVGGSSRTMH-HSHSQKTNKHAWDAAYK 360
           L+P +C KNSL L+NPVPG+++R + PMS    +   +++++  S S   NK A D  +K
Sbjct: 301 LLPLLCVKNSLRLVNPVPGMKVRNQSPMSAARDIKRMTKSVYSRSQSPTINKPAKDPVHK 360

Query: 361 QKSEAAVGSPKLPEVKDKWMGESKHFPSSTDPQMRGRSSPFRHSRAASPFRNEASQSPYR 420
           ++ +  V SP+L  V +K  G S  F  + D QM  R+SPFR S A SP+RNEA QSP+ 
Sbjct: 361 KEPDNEVQSPRLVGVDNKLTGGSNRFTYARDRQMISRTSPFRRSGAISPYRNEAPQSPFP 420

Query: 421 KQS-LVVPKEVD------------IISKSKEDTDFHDTSSIQATKHGVDMATTLIEKTLY 480
               L VPK+++              SKS+E   +H        +HG    +   EKTLY
Sbjct: 421 IGGFLGVPKDLENFKANKLNLYGKCYSKSQELVPYH------GLRHGSRPLSPTTEKTLY 480

Query: 481 IDTASVAEANSPFNSTHLDGEKKSDRPSGKN-ETAFETRVMEESTIVEPSFLEIKCLTLV 540
           +DT +VA      N+   D +K    P+ K+ ++   +R ++E+  +E +  ++  L   
Sbjct: 481 VDTVNVAGLLCS-NAGSSDIKKGGMGPAEKDIKSLLSSREIQETYTIESTSKDVTSLNFP 540

Query: 541 EEGRLEREAAESKSQYAIDDGPNVGTGLYKEDHTGYANLGTADEEDYSKANYQLVKVEDP 600
           E+   + + +         D  + G  L +E       + T  E + +  N Q+  + D 
Sbjct: 541 EQKSGDADLSLLSDMSTHRDQWDTGEDLSQES-LALVCVSTTTEGNLNIENDQISNM-DI 600

Query: 601 ASAIVTSVISSQPPPLPKSPSESWLWRTLPSVSSKKLLAGSNLGNKFYQKPQ-SPRTSVS 660
            +A       S PP LPK+PSESWL RTLP+VSS+   +    G  F  K Q S  TS S
Sbjct: 601 GNAKTGFAQCSLPPSLPKTPSESWLSRTLPTVSSQNPSSHLYRGTNFRSKRQDSKTTSTS 660

Query: 661 TKWETIVKSSNLRHDHVRYSEELIPRVSQ 668
           TKWE IVKSS L +DHVRYSEEL P  SQ
Sbjct: 661 TKWENIVKSSYLHNDHVRYSEELFPHASQ 666
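
The percentages in each HSP summary line are the matched column count divided by the alignment length. As a minimal sketch (the function name `check_hsp_line` and the regular expression are illustrative assumptions, not part of the database or of BLAST itself), the reported values can be recomputed from the raw counts like this:

```python
import re

# Hypothetical helper: matches fields such as "Identity = 296/703 (42.11%)".
# The label is captured generically, so any spelling of the field name works.
HSP_FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def check_hsp_line(line):
    """Return {label: (recomputed_percent, reported_percent)} for one HSP summary line."""
    results = {}
    for label, matched, length, reported in HSP_FIELD.findall(line):
        # Percent = matched alignment columns / total alignment length.
        computed = round(100 * int(matched) / int(length), 2)
        results[label] = (computed, float(reported))
    return results


if __name__ == "__main__":
    # Summary line of the Ricinus communis hit, copied from the report above.
    line = "Identity = 296/703 (42.11%), Positives = 396/703 (56.33%), Query Frame = 1"
    for label, (computed, reported) in check_hsp_line(line).items():
        print(label, computed, reported)
```

The denominator is the alignment length including gap columns (703 and 689 for the two hits above), not the length of either protein.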

The following BLAST results are available for this feature:
Match Name        E-value   Identity (%)  Description
A0A0A0LX77_CUCSA  0.0e+00   86.28         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G470450 PE=4 SV=1
A0A061GLK7_THECC  2.0e-125  41.54         Transcription initiation factor TFIID subunit 11, putative OS=Theobroma cacao GN...
A0A067JBF4_JATCU  5.3e-118  41.94         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21573 PE=4 SV=1
A0A067GDT4_CITSI  7.6e-117  41.85         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g006045mg PE=4 SV=1
V4UF07_9ROSI      2.9e-116  40.61         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10014532mg PE=4 SV=1
Match Name                          E-value   Identity (%)  Description
gi|449465006|ref|XP_004150220.1|    0.0e+00   86.28         PREDICTED: uncharacterized protein LOC101207534 [Cucumis sativus]
gi|659068207|ref|XP_008443305.1|    0.0e+00   86.26         PREDICTED: uncharacterized protein LOC103486924 [Cucumis melo]
gi|590623585|ref|XP_007025362.1|    2.9e-125  41.54         Transcription initiation factor TFIID subunit 11, putative [Theobroma cacao]
gi|1000938093|ref|XP_015583729.1|   7.3e-121  42.11         PREDICTED: uncharacterized protein LOC8258854 [Ricinus communis]
gi|643704038|gb|KDP21102.1|         7.6e-118  41.94         hypothetical protein JCGZ_21573 [Jatropha curcas]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR007789  DUF688
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name  Type
Cla019665     Cla019665    gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name  Unique Name          Type
Cla019665     Cla019665.1-protein  polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name      Unique Name       Type
Cla019665.1.cds3  Cla019665.1.cds3  CDS
Cla019665.1.cds2  Cla019665.1.cds2  CDS
Cla019665.1.cds1  Cla019665.1.cds1  CDS


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description                     Source   Source Term    Source Description  Alignment
IPR007789  Protein of unknown function DUF688  PFAM     PF05097        DUF688              coord: 1..474; score: 5.3
None       No IPR available                    PANTHER  PTHR33671      FAMILY NOT NAMED    coord: 1..671; score: 6.9E
None       No IPR available                    PANTHER  PTHR33671:SF3  F28N24.8 PROTEIN    coord: 1..671; score: 6.9E