CsaV3_2G031920.1 (mRNA) Cucumber (Chinese Long) v3

NameCsaV3_2G031920.1
TypemRNA
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
DescriptionUPF0481 protein At3g47200-like
Locationchr2 : 21063429 .. 21065166 (-)
Sequence length1203
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAAATAGTGAGATTGAAATAATGAGTGAGAATATTGATCAAAATGTTCGTGATAATGTTGTGATATCCATTGACAAAATGTTGGAAGGACTACCTCGAGTTAATCCAAAATGTAACACTATCTATCAAGTCCCCAAAGAGCTACGCGAGATTAATGATAAAGCATATGTCCCTCAATTCATTTCCATCGGTCCTTTTCACTATCGTACTCGAAAGGATTTGATAGCCAATGAACATTATAAGCTTCAAGGTTTCTTTAACTTTCTAGGTCGTATAAGTATAATTAATAGCCATATTCAATTATTGGAAGAAAACCAGGTTAATATTAGCTCAAAGGTCCTTGTGGAAAAATCTCATGATTGGGTGAAAGAAGCTTGGAATTGTTATGCAGAACCAATAAAGATGAAAGATGAGGAGTTTATTATAATGATGCTTGTGGATGCTTGTTTCATAGTCGAGTTTTTTCTACTATATTATGGTTCATTCCATGAAGATGGCAAATTATTCAATGCAGAATTATCATTATTCTACTACGGAGTATTCTACGAAATACTTCTTGACTTGATAAAGTTGGAAAATCAAGTCCCTTTCTTTCTTCTTCAAAATCTATTTGACCTGATGCCAAAAGATAAAGTTGATATCTCCTCCATTATTGGAGGTTATAAAGATAGCCCTATCTCCTTGATAGATCTTACATACATGGTTCTTAAGGAGTTTGGGTTCGTGCGTGAATATAAGATCAATAATTTGTATCATAAAAATCCAAAGCACTTGCTTGATTTCTTAAGTTTCTACTTCCTCCCCGTGCCCCCTAATAACTGGAATAGGAAGTTTGATCATGTGAAAATTAGTAAGCAATGGAGGTTGAGTCCTCCAACCACAACTGAGCTCTGCGAGGCTGGTGTCACCATTAAAGTAGCAAAAAGAAAAAACAATTTATGCTTCATGAACATAAGTTTTAAAAATGGGGTTTTAGAAATCCCACCTATAGTTATTGAAGGTACTTTTGAAGTCTTGATACGAAACGTGTTAGCATTTGAGATTTTTCCGGCAGGAAATCAGAAGAAGTATGCAATCCAATATGTGACATTTTTGGATGATTTGATAAGCACAGAGAAAGACTTGTGCTTACTTGTGAAGGCTGGAGTTATAATCAACGATACTGGTGGTAGTGATAAAGAAGTTTCAGAATTGTTTAACAGTCTTACTAAACTTGTCACCACTCCATTGCCTTCCTACTTCGATGATACCAGCAAAGCTTTACGTGTGCATTGCGATGGATCGTGGAACAAGGCTAAAGCTTCACTCAAACATAGCTATTTCAATACGCCATGGGCTATTATCTCATTCTTTGCTGCAACTTTCCTCATTATTCTTACTATCCTTCAAACCATATTCTCCGCTATCTCTGCATTTCCTAACTAAGCTGCTTCAAACCACCTCTCTTATGCATACTGTTAAACAAATGTTATTAGGATTCTATATTATTTATTTTTATTGAATATTATTTTTTTAATTTCTTATAAGATTGTACCATGCATATTAGTGCTTCTATCCCTTATACTTAATTCTAGTATACCTTTATATTCTGTATTTGCTACATTGTTGAATAATGTTAGCTAGAGGAATAAGAGTATACATTTTACATCCATATCGAATATATATCTCACGTTAAATTCTATGTAATAAAAATGAAACATTAGGAAGCAGTGATGATGAGTATTTTGTAGCATAA

mRNA sequence

ATGGAAAATAGTGAGATTGAAATAATGAGTGAGAATATTGATCAAAATGTTCGTGATAATGTTGTGATATCCATTGACAAAATGTTGGAAGGACTACCTCGAGTTAATCCAAAATGTAACACTATCTATCAAGTCCCCAAAGAGCTACGCGAGATTAATGATAAAGCATATGTCCCTCAATTCATTTCCATCGGTCCTTTTCACTATCGTACTCGAAAGGATTTGATAGCCAATGAACATTATAAGCTTCAAGGTTTCTTTAACTTTCTAGGTCGTATAAGTATAATTAATAGCCATATTCAATTATTGGAAGAAAACCAGGTTAATATTAGCTCAAAGGTCCTTGTGGAAAAATCTCATGATTGGGTGAAAGAAGCTTGGAATTGTTATGCAGAACCAATAAAGATGAAAGATGAGGAGTTTATTATAATGATGCTTGTGGATGCTTGTTTCATAGTCGAGTTTTTTCTACTATATTATGGTTCATTCCATGAAGATGGCAAATTATTCAATGCAGAATTATCATTATTCTACTACGGAGTATTCTACGAAATACTTCTTGACTTGATAAAGTTGGAAAATCAAGTCCCTTTCTTTCTTCTTCAAAATCTATTTGACCTGATGCCAAAAGATAAAGTTGATATCTCCTCCATTATTGGAGGTTATAAAGATAGCCCTATCTCCTTGATAGATCTTACATACATGGTTCTTAAGGAGTTTGGGTTCGTGCGTGAATATAAGATCAATAATTTGTATCATAAAAATCCAAAGCACTTGCTTGATTTCTTAAGTTTCTACTTCCTCCCCGTGCCCCCTAATAACTGGAATAGGAAGTTTGATCATGTGAAAATTAGTAAGCAATGGAGGTTGAGTCCTCCAACCACAACTGAGCTCTGCGAGGCTGGTGTCACCATTAAAGTAGCAAAAAGAAAAAACAATTTATGCTTCATGAACATAAGTTTTAAAAATGGGGTTTTAGAAATCCCACCTATAGTTATTGAAGGTACTTTTGAAGTCTTGATACGAAACGTGTTAGCATTTGAGATTTTTCCGGCAGGAAATCAGAAGAAGTATGCAATCCAATATGTGACATTTTTGGATGATTTGATAAGCACAGAGAAAGACTTGTGCTTACTTGTGAAGGCTGGAGTTATAATCAACGATACTGGTGGAAGCAGTGATGATGAGTATTTTGTAGCATAA

Coding sequence (CDS)

ATGGAAAATAGTGAGATTGAAATAATGAGTGAGAATATTGATCAAAATGTTCGTGATAATGTTGTGATATCCATTGACAAAATGTTGGAAGGACTACCTCGAGTTAATCCAAAATGTAACACTATCTATCAAGTCCCCAAAGAGCTACGCGAGATTAATGATAAAGCATATGTCCCTCAATTCATTTCCATCGGTCCTTTTCACTATCGTACTCGAAAGGATTTGATAGCCAATGAACATTATAAGCTTCAAGGTTTCTTTAACTTTCTAGGTCGTATAAGTATAATTAATAGCCATATTCAATTATTGGAAGAAAACCAGGTTAATATTAGCTCAAAGGTCCTTGTGGAAAAATCTCATGATTGGGTGAAAGAAGCTTGGAATTGTTATGCAGAACCAATAAAGATGAAAGATGAGGAGTTTATTATAATGATGCTTGTGGATGCTTGTTTCATAGTCGAGTTTTTTCTACTATATTATGGTTCATTCCATGAAGATGGCAAATTATTCAATGCAGAATTATCATTATTCTACTACGGAGTATTCTACGAAATACTTCTTGACTTGATAAAGTTGGAAAATCAAGTCCCTTTCTTTCTTCTTCAAAATCTATTTGACCTGATGCCAAAAGATAAAGTTGATATCTCCTCCATTATTGGAGGTTATAAAGATAGCCCTATCTCCTTGATAGATCTTACATACATGGTTCTTAAGGAGTTTGGGTTCGTGCGTGAATATAAGATCAATAATTTGTATCATAAAAATCCAAAGCACTTGCTTGATTTCTTAAGTTTCTACTTCCTCCCCGTGCCCCCTAATAACTGGAATAGGAAGTTTGATCATGTGAAAATTAGTAAGCAATGGAGGTTGAGTCCTCCAACCACAACTGAGCTCTGCGAGGCTGGTGTCACCATTAAAGTAGCAAAAAGAAAAAACAATTTATGCTTCATGAACATAAGTTTTAAAAATGGGGTTTTAGAAATCCCACCTATAGTTATTGAAGGTACTTTTGAAGTCTTGATACGAAACGTGTTAGCATTTGAGATTTTTCCGGCAGGAAATCAGAAGAAGTATGCAATCCAATATGTGACATTTTTGGATGATTTGATAAGCACAGAGAAAGACTTGTGCTTACTTGTGAAGGCTGGAGTTATAATCAACGATACTGGTGGAAGCAGTGATGATGAGTATTTTGTAGCATAA

Protein sequence

MENSEIEIMSENIDQNVRDNVVISIDKMLEGLPRVNPKCNTIYQVPKELREINDKAYVPQFISIGPFHYRTRKDLIANEHYKLQGFFNFLGRISIINSHIQLLEENQVNISSKVLVEKSHDWVKEAWNCYAEPIKMKDEEFIIMMLVDACFIVEFFLLYYGSFHEDGKLFNAELSLFYYGVFYEILLDLIKLENQVPFFLLQNLFDLMPKDKVDISSIIGGYKDSPISLIDLTYMVLKEFGFVREYKINNLYHKNPKHLLDFLSFYFLPVPPNNWNRKFDHVKISKQWRLSPPTTTELCEAGVTIKVAKRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFEIFPAGNQKKYAIQYVTFLDDLISTEKDLCLLVKAGVIINDTGGSSDDEYFVA
BLAST of CsaV3_2G031920.1 vs. NCBI nr
Match: KGN62932.1 (hypothetical protein Csa_2G380610 [Cucumis sativus])

HSP 1 Score: 773.9 bits (1997), Expect = 2.7e-220
Identity = 392/395 (99.24%), Postives = 393/395 (99.49%), Query Frame = 0

Query: 1   MENSEIEIMSENIDQNVRDNVVISIDKMLEGLPRVNPKCNTIYQVPKELREINDKAYVPQ 60
           MENSEIEIMSENIDQNVRDNVVISIDKMLEGLPRVNPKCNTIYQVPKELREINDKAYVPQ
Sbjct: 1   MENSEIEIMSENIDQNVRDNVVISIDKMLEGLPRVNPKCNTIYQVPKELREINDKAYVPQ 60

Query: 61  FISIGPFHYRTRKDLIANEHYKLQGFFNFLGRISIINSHIQLLEENQVNISSKVLVEKSH 120
           FISIGPFHYRTRKDLIANEHYKLQGFFNFLGRISIINSHIQLLEENQVNISSKVLVEKSH
Sbjct: 61  FISIGPFHYRTRKDLIANEHYKLQGFFNFLGRISIINSHIQLLEENQVNISSKVLVEKSH 120

Query: 121 DWVKEAWNCYAEPIKMKDEEFIIMMLVDACFIVEFFLLYYGSFHEDGKLFNAELXXXXXX 180
           DWVKEAWNCYAEPIKMKDEEFIIMMLVDACFIVEFFLLYYGSFHEDGKLFNAELXXXXXX
Sbjct: 121 DWVKEAWNCYAEPIKMKDEEFIIMMLVDACFIVEFFLLYYGSFHEDGKLFNAELXXXXXX 180

Query: 181 XXYEILLDLIKLENQVPFFLLQNLFDLMPKDKVDISSIIGGYKDSPISLIDLTYMVLKEF 240
           XXYEILLDLIKLENQVPFFLLQNLFDLMPKDKVDISSIIGGYKDSPISLIDLTYMVLKEF
Sbjct: 181 XXYEILLDLIKLENQVPFFLLQNLFDLMPKDKVDISSIIGGYKDSPISLIDLTYMVLKEF 240

Query: 241 GFVREYKINNLYHKNPKHLLDFLSFYFLPVPPNNWNRKFDHVKISKQWRLSPPTTTELCE 300
           GFVREYKINNLYHKNPKHLLDFLSFYFLPVPPNNWNRKFDHVKISKQWRLSPPTTTELCE
Sbjct: 241 GFVREYKINNLYHKNPKHLLDFLSFYFLPVPPNNWNRKFDHVKISKQWRLSPPTTTELCE 300

Query: 301 AGVTIKVAKRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFEIFPAGNQKKYAI 360
           AGVTIKVAKRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFEIFPAGNQKKYAI
Sbjct: 301 AGVTIKVAKRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFEIFPAGNQKKYAI 360

Query: 361 QYVTFLDDLISTEKDLCLLVKAGVIINDTGGSSDD 396
           QYVTFLDDLISTEKDLCLLVKAGVIINDTGGS  +
Sbjct: 361 QYVTFLDDLISTEKDLCLLVKAGVIINDTGGSDKE 395

BLAST of CsaV3_2G031920.1 vs. NCBI nr
Match: XP_004138863.2 (PREDICTED: UPF0481 protein At3g47200, partial [Cucumis sativus])

HSP 1 Score: 636.0 bits (1639), Expect = 8.9e-179
Identity = 341/443 (76.98%), Postives = 345/443 (77.88%), Query Frame = 0

Query: 44  QVPKELREINDKAYVPQFISIGPFHYRTRKDLIANEHYKLQGFFNF-------------- 103
           QVPKELR++NDKAY PQFISIGPFH+ TR DLIANEHYKLQGF NF              
Sbjct: 1   QVPKELRKMNDKAYTPQFISIGPFHHHTRNDLIANEHYKLQGFNNFLHRIKINYKQIESS 60

Query: 104 ------------------------------------------------------------ 163
                                                                       
Sbjct: 61  KKLVEKCHGNKEMYAILYVLFLDDLINTEHDVHLLVKAGVIINTFGGNDKDISELFNSLS 120

Query: 164 -----------------LGRISIINSHIQLLEENQVNISSKVLVEKSHDWVKEAWNCYAE 223
                            L RISIINSHIQLLEENQVNISSKVLVEKSHDWVKEAWNCYAE
Sbjct: 121 KFVIIPGHSHFDDIIKALRRISIINSHIQLLEENQVNISSKVLVEKSHDWVKEAWNCYAE 180

Query: 224 PIKMKDEEFIIMMLVDACFIVEFFLLYYGSFHEDGKLFNAELXXXXXXXXYEILLDLIKL 283
           PIKMKDEEFIIMMLVDACFIVEFFLLYYGSFHEDGKLFNAELXXXXXXXXYEILLDLIKL
Sbjct: 181 PIKMKDEEFIIMMLVDACFIVEFFLLYYGSFHEDGKLFNAELXXXXXXXXYEILLDLIKL 240

Query: 284 ENQVPFFLLQNLFDLMPKDKVDISSIIGGYKDSPISLIDLTYMVLKEFGFVREYKINNLY 343
           ENQVPFFLLQNLFDLMPKDKVDISSIIGGYKDSPISLIDLTYMVLKEFGFVREYKINNLY
Sbjct: 241 ENQVPFFLLQNLFDLMPKDKVDISSIIGGYKDSPISLIDLTYMVLKEFGFVREYKINNLY 300

Query: 344 HKNPKHLLDFLSFYFLPVPPNNWNRKFDHVKISKQWRLSPPTTTELCEAGVTIKVAKRKN 396
           HKNPKHLLDFLSFYFLPVPPNNWNRKFDHVKISKQWRLSPPTTTELCEAGVTIKVAKRKN
Sbjct: 301 HKNPKHLLDFLSFYFLPVPPNNWNRKFDHVKISKQWRLSPPTTTELCEAGVTIKVAKRKN 360

BLAST of CsaV3_2G031920.1 vs. NCBI nr
Match: XP_008445182.1 (PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo] >XP_008445184.1 PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo] >XP_016899950.1 PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo])

HSP 1 Score: 431.4 bits (1108), Expect = 3.3e-117
Identity = 254/467 (54.39%), Postives = 298/467 (63.81%), Query Frame = 0

Query: 1   MENSEI-----------EIMSENID--QNVRDNVVISIDKMLEGLPRVNPKCNTIYQVPK 60
           MENSEI           + + E I   + V DNVVISIDK+L GLPR+NPKC+ IYQV K
Sbjct: 1   MENSEIIETKVENDICDDELGETISEIEKVCDNVVISIDKILGGLPRINPKCHIIYQVSK 60

Query: 61  ELREINDKAYVPQFISIGPFHYRTRKDLIANEHYKLQGFFNFLGRISIINSHIQLLEENQ 120
           ELRE+NDKAY PQFISIGPFH+RTR DLIANEHYKLQGF NFL R   IN++ Q+     
Sbjct: 61  ELREMNDKAYAPQFISIGPFHHRTRNDLIANEHYKLQGFNNFLHR---INNYEQI----- 120

Query: 121 VNISSKVLVEKSHDWVKEAWNCYAEPIKMKDEEFIIMMLVDACFIVEFFLLYYGSFHEDG 180
              SSK  V+K H WVKEAWNCYAEPI M +EEF++MMLVDACFI+EFF+L     +   
Sbjct: 121 --ESSKEFVKKCHGWVKEAWNCYAEPINMNEEEFVLMMLVDACFILEFFILLIDDHYGGD 180

Query: 181 KLFNAEL---------XXXXXXXXYEILLDLIKLENQVPFFLLQNLFDLMPKDKVDISSI 240
            LF A+                  +EIL+DLIKLENQVPFFLLQNLFDLMPK  V +   
Sbjct: 181 YLFEADQIFQIQDMVDFSFYRGVFFEILIDLIKLENQVPFFLLQNLFDLMPKHDVPMFP- 240

Query: 241 IGGYKDSPISLIDLTYMVLKEFGFVREYKINNLYHKNPKHLLDFLSFYFLPVPPNNWNRK 300
                    SLID+T  +L  FGFV +YKIN+LYHK PKHLLDFLSFYF P+ PN+ + +
Sbjct: 241 ---------SLIDITSEILTWFGFVGKYKINDLYHKKPKHLLDFLSFYFFPLLPNDDHIR 300

Query: 301 F--DHVKISKQ------------------------------------------------W 360
           F  +  K S Q                                                +
Sbjct: 301 FKQNERKNSDQNNNNLLRFFRPLFPAHWLKKNNDSFGVPSLCCFSNKEAETREKDSENYF 360

Query: 361 RLSPPTTTELCEAGVTIKVAKRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFE 396
           RLSPP+ TELCEAGVTIK AKR+ +LCFMNI FKNGVLEIP I I+ TFEV+IRNV+AF+
Sbjct: 361 RLSPPSITELCEAGVTIKAAKRE-DLCFMNIGFKNGVLEIPCIDIDCTFEVVIRNVIAFD 420

BLAST of CsaV3_2G031920.1 vs. NCBI nr
Match: XP_008445187.1 (PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo])

HSP 1 Score: 335.1 bits (858), Expect = 3.3e-88
Identity = 205/395 (51.90%), Postives = 257/395 (65.06%), Query Frame = 0

Query: 9   MSENIDQNVRDNVVISIDKMLEGLPRVNPKCNTIYQVPKELREINDKAYVPQFISIGPFH 68
           +SE  DQ +  NVVI I KML+ LP+VN +C +IYQV KEL EIN KAY+PQ ISIGP H
Sbjct: 4   ISEVDDQKLCGNVVICIGKMLKQLPQVNAEC-SIYQVSKELLEINRKAYIPQLISIGPIH 63

Query: 69  YRTRKDLIANEHYKLQGFFNFLGRISIINSHIQLLEENQVNISSKVLVEKSHDWVKEAWN 128
           + T  DL+AN+ YKLQGF NFL RI+I N  I  +E+     +   LVEK+H WV+EA N
Sbjct: 64  HGTNNDLVANQQYKLQGFINFLRRININNKQILSMEDILQTGTLNTLVEKAHHWVEEARN 123

Query: 129 CY-AEPIKMKD-EEFIIMMLVDACFIVEFFLLYYGSFHEDGKLF----NAELXXXXXXXX 188
           CY + PI   D + F+IMMLVDACFIVEF +L +   H +GK      N ++        
Sbjct: 124 CYTSPPINTIDMDAFVIMMLVDACFIVEFLILKFDYDHPNGKFLQIQDNIDISFYQGMDL 183

Query: 189 YEILLDLIKLENQVPFFLLQNLFDLMPKDKVDISSIIGGYKDSPISLIDLTYMVLKEFGF 248
           + IL DLIKLENQVPFFLLQ LFDL+PK   DIS +I  ++       DLT   LK F  
Sbjct: 184 H-ILYDLIKLENQVPFFLLQYLFDLIPKH--DISMMISSFR-------DLTLRALK-FRL 243

Query: 249 VREYKINNLYHKNPKHLLDFLSFYFLPVPPNNWNRKFDHVK--ISKQWRLSPPTTTELCE 308
           VR Y+IN    K PKH +D L+FYF+P      N +    K  I ++ R  PP+ TEL E
Sbjct: 244 VRTYEIN--LFKEPKHFVDLLTFYFVPSAGQKVNNQHGIFKSTIEEKNRWIPPSITELRE 303

Query: 309 AGVTIKVAKRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFEIFPAGNQKKYAI 368
           AGVTIK A++  +L   +I+FKNGVL IPP+ I   FE+++RN++AFE   A    KY I
Sbjct: 304 AGVTIKKAEKAKHL--TDITFKNGVLRIPPLHIYDEFELVLRNMVAFEQISARKMNKYVI 363

Query: 369 QYVTFLDDLISTEKDLCLLVKAGVIINDTGGSSDD 396
           QYV F+DDLISTEKD+ LLV+AGVIIN  GGS  +
Sbjct: 364 QYVLFMDDLISTEKDVRLLVEAGVIINQIGGSDKE 382

BLAST of CsaV3_2G031920.1 vs. NCBI nr
Match: XP_008445188.1 (PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo])

HSP 1 Score: 334.7 bits (857), Expect = 4.2e-88
Identity = 191/396 (48.23%), Postives = 256/396 (64.65%), Query Frame = 0

Query: 3   NSEIEIMSENIDQNVRDNVVISIDKMLEGLPRVNPKCNTIYQVPKELREINDKAYVPQFI 62
           N+ +EI   +  Q V DNVVISI+KML+ +P  +    +IY+VPK+LRE+N KAY PQ I
Sbjct: 23  NNMVEISVVDQQQLVCDNVVISIEKMLDQVPPTHENQCSIYRVPKQLREMNPKAYAPQLI 82

Query: 63  SIGPFHYRTRKDLIANEHYKLQGFFNFLGRISIINSHIQLLEENQVNISSKVLVEKSHDW 122
           SIGPFHY T K+LIANE YKLQGF N+L R+  + S  QL+    V    + LV+++  W
Sbjct: 83  SIGPFHYHTHKNLIANEQYKLQGFINYLRRVYKMESLEQLVRTKSV----EDLVKRAQSW 142

Query: 123 VKEAWNCYAEPIKMKDEEFIIMMLVDACFIVEFFLLYYGSFHEDGKLFNAELXXXXXXXX 182
           V+EA NCYAE I M DE+FI MMLVD CFIVEFF+L +  ++E  +    ++        
Sbjct: 143 VEEARNCYAETINMNDEDFIKMMLVDGCFIVEFFILDFEEYNESHESLFPQIENNVSMSF 202

Query: 183 Y-----EILLDLIKLENQVPFFLLQNLFDLMPKDKVDISSIIGGYKDSPISLIDLTYMVL 242
           Y     +I  DLIKLENQ+PFF+LQ+LFDL+PK           +KD+P     LTY  L
Sbjct: 203 YKERIPDIDEDLIKLENQLPFFVLQHLFDLIPK-----------HKDAPNCFKQLTYEYL 262

Query: 243 KEFGFVREYKINNLYHKNPKHLLDFLSFYFLPVPPNNWNRKFDHVKISKQWR-LSPPTTT 302
              G++  Y+ +++    PKH +DFLSFY +P      ++K +     ++W  + PP+ T
Sbjct: 263 -TMGWLENYEPSDILSIKPKHFIDFLSFYLVPEHQYEHDQKSND---EEEWNIIIPPSIT 322

Query: 303 ELCEAGVTIKVAKRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFEIFPAGNQK 362
           E+CEAGVTIK A  KN  C +NI F+NG+LEIPP+ I+  FE ++RN+LAFE FP   + 
Sbjct: 323 EICEAGVTIKKAD-KNTKCLLNIRFENGILEIPPLHIDDYFEPMMRNLLAFEHFPVEVKN 382

Query: 363 KYAIQYVTFLDDLISTEKDLCLLVKAGVIINDTGGS 393
            Y I Y+TF+D LI TEKD+ LLVK  +IIND GGS
Sbjct: 383 TYVIPYLTFMDYLIITEKDVNLLVKEKIIINDIGGS 398

BLAST of CsaV3_2G031920.1 vs. TAIR10
Match: AT4G31980.1 (unknown protein)

HSP 1 Score: 159.1 bits (401), Expect = 5.8e-39
Identity = 118/386 (30.57%), Postives = 191/386 (49.48%), Query Frame = 0

Query: 11  ENIDQNVRDNVVISIDKMLEGLPRVNPKCNTIYQVPKELREINDKAYVPQFISIGPFHYR 70
           E ++QN  D +V SI   L  L  ++ KC  IY+VP +LR +N  AY P+ +S GP H R
Sbjct: 265 ERMNQNEGDALVDSIKAKLAFLSSLSTKC-CIYKVPNKLRRLNPDAYTPRLVSFGPLH-R 324

Query: 71  TRKDLIANEHYKLQGFFNFLGRISIINSHIQLLEENQVNISSKVLVEKSHDWVKEAWNCY 130
            +++L A E  K +   +F+ R                N S + LV  +  W + A +CY
Sbjct: 325 GKEELQAMEDQKYRYLLSFIPR---------------TNSSLEDLVRLARTWEQNARSCY 384

Query: 131 AEPIKMKDEEFIIMMLVDACFIVEFFLL--YYGSFHEDGKLFNAELXXXXXXXXYEILLD 190
           AE +K+  +EF+ M++VD  F+VE  L   Y     E+ ++F   +         ++  D
Sbjct: 385 AEDVKLHSDEFVEMLVVDGSFLVELLLRSHYPRLRGENDRIFGNSMMIT------DVCRD 444

Query: 191 LIKLENQVPFFLLQNLFDLMPKDKVDISSIIGGYKDSPISLIDLTYMVLKEFGFVREYKI 250
           +I +ENQ+PFF+++ +F L          ++  Y+    S+I L     + F +      
Sbjct: 445 MILIENQLPFFVVKEIFLL----------LLNYYQQGTPSIIQLAQ---RHFSYFLSRID 504

Query: 251 NNLYHKNPKHLLDFLSFYFLPVPPNNWNRKFDHVKISKQWRLSPPTTTELCEAGVTIKVA 310
           +  +   P+H +D L   +LP  P     ++  VK+      + P  TEL  AGV  K A
Sbjct: 505 DEKFITEPEHFVDLLRSCYLPQFP--IKLEYTTVKVD-----NAPEATELHTAGVRFKPA 564

Query: 311 KRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFEIFPAGNQKKYAIQYVTFLDD 370
           +  +  C ++ISF +GVL+IP IV++   E L +N++ FE     N  K  + Y+  L  
Sbjct: 565 ETSS--CLLDISFADGVLKIPTIVVDDLTESLYKNIIGFEQCRCSN--KNFLDYIMLLGC 603

Query: 371 LISTEKDLCLLVKAGVIINDTGGSSD 395
            I +  D  LL+ +G+I+N  G S D
Sbjct: 625 FIKSPTDADLLIHSGIIVNYLGNSVD 603

BLAST of CsaV3_2G031920.1 vs. TAIR10
Match: AT3G50170.1 (Plant protein of unknown function (DUF247))

HSP 1 Score: 115.2 bits (287), Expect = 9.6e-26
Identity = 112/402 (27.86%), Postives = 181/402 (45.02%), Query Frame = 0

Query: 7   EIMSENIDQNVRDNVVISIDKMLEGLPRVNPKC----NTIYQVPKELREINDKAYVPQFI 66
           E++ E  ++   D+ VISI   LE   R +         IY+VP  L+E + K+Y PQ +
Sbjct: 76  EVVEERPEETTGDSWVISIRDKLEQADRDDDTTIWGKLCIYRVPHYLQENDKKSYFPQTV 135

Query: 67  SIGPFHYRTRKDLIANEHYKLQGFFNFLGRISIINSHIQLLEENQVNISSKVLVEKSHDW 126
           S+GP+H+  +K L   E +K +     L R+           + ++ + +  + E     
Sbjct: 136 SLGPYHH-GKKRLRPMERHKWRALNKVLKRL-----------KQRIEMYTNAMRELE--- 195

Query: 127 VKEAWNCYAEPIKMKDEEFIIMMLVDACFIVEFFLLYYGSFHEDGKLFNAELXXXXXXXX 186
            ++A  CY  PI +   EF  M+++D CF++E F      F E G   N +         
Sbjct: 196 -EKARACYEGPISLSRNEFTEMLVLDGCFVLELFRGTVEGFTEIGYARN-DPVFAMRGLM 255

Query: 187 YEILLDLIKLENQVPFFLLQNLFDLM--PKDKVDISSIIGGYKDSPISLID--LTYMVLK 246
           + I  D+I LENQ+P F+L  L +L    +++  I + +      P+      LT     
Sbjct: 256 HSIQRDMIMLENQLPLFVLDRLLELQLGTQNQTGIVAHVAVKFFDPLMPTGEALTKPDQS 315

Query: 247 EFGFVREYKINNLYHKNPKHLLDFLSFYFLPVPPNNWNRKF------DHVKISKQWRLSP 306
           +     E  ++ L  K   H LD      L   P    R        +   + K+ +   
Sbjct: 316 KLMNWLEKSLDTLGDKGELHCLDVFRRSLLQSSPTPNTRSLLKRLTRNTRVVDKRQQQLV 375

Query: 307 PTTTELCEAGVTIKVAKRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFEIFPA 366
              TEL EAGV  K  KRK +  F +I FKNG LEIP ++I    + L  N++AFE    
Sbjct: 376 HCVTELREAGV--KFRKRKTDR-FWDIEFKNGYLEIPKLLIHDGTKSLFSNLIAFEQCHI 435

Query: 367 GNQKKYAIQYVTFLDDLISTEKDLCLLVKAGVIINDTGGSSD 395
                +   Y+ F+D+LI++ +D+  L   G+I +  G  S+
Sbjct: 436 -ESSNHITSYIIFMDNLINSSEDVSYLHYCGIIEHWLGSDSE 456

BLAST of CsaV3_2G031920.1 vs. TAIR10
Match: AT3G50120.1 (Plant protein of unknown function (DUF247))

HSP 1 Score: 110.9 bits (276), Expect = 1.8e-24
Identity = 106/395 (26.84%), Postives = 174/395 (44.05%), Query Frame = 0

Query: 18  RDNVVISIDKMLEGLPRVNPKC----NTIYQVPKELREINDKAYVPQFISIGPFHYRTRK 77
           RD+ VISI   LE   R +         IY+VP  L+E ++K+Y PQ +S+GP+H+  +K
Sbjct: 77  RDDWVISITDKLEQAHRDDDTTLWGKLCIYRVPYYLQENDNKSYFPQTVSLGPYHH-GKK 136

Query: 78  DLIANEHYKLQGFFNFLGRISIINSHIQLLEENQVNISSKVLVEKSHDWVKEAWNCYAEP 137
            L + + +K +     L R                N   K+ ++   +  ++A  CY  P
Sbjct: 137 RLRSMDRHKWRAVNRVLKR---------------TNQGIKMYIDAMRELEEKARACYEGP 196

Query: 138 IKMKDEEFIIMMLVDACFIVEFFLLYYGSFHEDGKLFNAELXXXXXXXXYEILLDLIKLE 197
           + +   EFI M+++D CF++E F      F E G   N +         + I  D++ LE
Sbjct: 197 LSLSSNEFIEMLVLDGCFVLELFRGAVEGFTELGYARN-DPVFAMRGSMHSIQRDMVMLE 256

Query: 198 NQVPFFLLQNLFDLM--PKDKVDISSIIGGYKDSPISLID----LTYMVLKEFGFVREYK 257
           NQ+P F+L  L +L    +++  + + +      P+   D     +     E    R+  
Sbjct: 257 NQLPLFVLNRLLELQLGTRNQTGLVAQLAIRFFDPLMPTDEPLTKSGQSKLENSLARDKS 316

Query: 258 INNLYHKNPKHLLDFLSFYFLPVPP--------NNWNRKFDHVKISKQWRLSPPTTTELC 317
            +        H LD      L   P          W+R        +Q  +     TEL 
Sbjct: 317 FDPFADMGELHCLDVFRRSLLRSSPKPEPRLTRKRWSRNTRVADKRRQQLIH--CVTELK 376

Query: 318 EAGVTIKVAKRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFEIFPAGNQKKYA 377
           EAG  IK  +RK +  F ++ FKNG LEIP ++I    + L  N++AFE     +     
Sbjct: 377 EAG--IKFRRRKTDR-FWDMQFKNGYLEIPRLLIHDGTKSLFLNLIAFEQCHIDSSNDIT 436

Query: 378 IQYVTFLDDLISTEKDLCLLVKAGVIINDTGGSSD 395
             Y+ F+D+LI + +D+  L   G+I +  G  S+
Sbjct: 437 -SYIIFMDNLIDSHEDVSYLHYCGIIEHWLGSDSE 448

BLAST of CsaV3_2G031920.1 vs. TAIR10
Match: AT5G22550.2 (Plant protein of unknown function (DUF247))

HSP 1 Score: 109.4 bits (272), Expect = 5.3e-24
Identity = 102/399 (25.56%), Postives = 173/399 (43.36%), Query Frame = 0

Query: 42  IYQVPKELREINDKAYVPQFISIGPFHYRTRKD--LIANEHYKLQGFFNFLGRISIINSH 101
           IY++P  L+++NDKAY P+ +SIGP+H+ + K    +  EH K             +   
Sbjct: 46  IYRIPHTLKQVNDKAYAPKIVSIGPYHHSSDKQHLKMIEEHKK-----------RYLEMF 105

Query: 102 IQLLEENQVNISSKVLVEKSHDWVKEAWNCYAEPIKMKDEEFIIMMLVDACFIVEFFLLY 161
           +   +EN V +    LV+      ++  + Y+E ++   ++ I +ML+D CFI+  FL+ 
Sbjct: 106 VSKTKENGVYLIH--LVDLVSGLEQKIRDSYSENLEFSQQKLIKVMLLDGCFILMLFLVV 165

Query: 162 YGSFHEDGKLFNAELXXXXXXXXY---EILLDLIKLENQVPFFLLQNLFDLMPKDKVDIS 221
                   K+    L        +    +  DL+ LENQVP FLL+ L        ++ S
Sbjct: 166 ------SQKIEYTNLKDPIFKLRWILPTLRSDLLLLENQVPLFLLKVL--------LETS 225

Query: 222 SIIGGYKDSPISLIDLTYMVLKEFGFVREYKINNLYHKNPKHLLDFLSFYFLPVPPN--- 281
            +      + ++     Y + K  GF  ++  NNL     KHLLD +   F+P PP    
Sbjct: 226 KLAPSTSLNMLAFKFFDYSIKKPEGFWEKH--NNL---RAKHLLDLIRKTFIPAPPPSTT 285

Query: 282 ---------NWNRKFDHVKISKQWRLSPPTTTELCEAGVT-------------------- 341
                    N  R++   + SK   L   + ++      T                    
Sbjct: 286 PRQCCINIFNGPREYSRTETSKNICLGKISCSKEITGAQTXXXXXXXXXPFLGLIVSARK 345

Query: 342 -----IKVAKRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFEIFPAGNQKKYA 399
                IK  +++N    ++ISFK+G++EIP +V +     L+ N +AFE F      +  
Sbjct: 346 LRLRGIKFMRKENVETPLDISFKSGLVEIPLLVFDDFISNLLINCVAFEQFNMSCSTEIT 405

BLAST of CsaV3_2G031920.1 vs. TAIR10
Match: AT3G50160.1 (Plant protein of unknown function (DUF247))

HSP 1 Score: 109.0 bits (271), Expect = 6.9e-24
Identity = 115/398 (28.89%), Postives = 177/398 (44.47%), Query Frame = 0

Query: 4   SEIEIMSENIDQNVRDNVVISIDKMLEGL---PRVNPKCNTIYQVPKELREINDKAYVPQ 63
           S + I  +N +Q +R+  VIS++  ++ L      +     IY+VP  L+E + K+Y+PQ
Sbjct: 66  SVVSIEDKN-EQKLREIWVISLNDKMKTLGDNATTSWDNLCIYRVPPYLQENDTKSYMPQ 125

Query: 64  FISIGPFHYRTRKDLIANEHYKLQGFFNFLGRISIINSHIQLLEENQVNISSKVLVEKSH 123
            +SIGP+H+   K L+  E +K +                     N V   +K  +E   
Sbjct: 126 IVSIGPYHH-GHKHLMPMERHKWRAV-------------------NMVMARAKHDIEMYI 185

Query: 124 DWVKE----AWNCYAEPIKMKDEEFIIMMLVDACFIVEFFLLYYGSFHEDGKLFNAELXX 183
           D +KE    A  CY  PI M   EFI M+++D  FI+E F      F E G   N +   
Sbjct: 186 DAMKELEEKARACYQGPINMNRNEFIEMLVLDGVFIIEIFKGTSEGFQEIGYAPN-DPVF 245

Query: 184 XXXXXXYEILLDLIKLENQVPFFLLQNLFDLMPKDKVDISSIIGGYKDSPISLIDLTYMV 243
                   I  D++ LENQ+P+ +L+ L  L   D +D  +            + L    
Sbjct: 246 GMRGLMQSIRRDMVMLENQLPWSVLKGLLQLQRPDVLDKVN------------VQLFQPF 305

Query: 244 LKEFGFVREYKINNLYHKNPKHLLDFLSFYFLPVPPNNWNRKFDHVKISKQWRLSPPTTT 303
            +     RE     L  +   H LD L    L    ++     D   ++KQ +      T
Sbjct: 306 FQPLLPTREV----LTEEGGLHCLDVLRRGLL---QSSGTSDEDMSMVNKQPQQLIHCVT 365

Query: 304 ELCEAGVTIKVAKRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFEIFPAGNQK 363
           EL  AGV      RK    F +I FKNG L+IP ++I    + L  N++AFE     + K
Sbjct: 366 ELRNAGVEF---MRKETGHFWDIEFKNGYLKIPKLLIHDGTKSLFLNLIAFEQCHIKSSK 418

Query: 364 KYAIQYVTFLDDLISTEKDLCLLVKAGVIINDTGGSSD 395
           K    Y+ F+D+LI++ +D+  L   G+I N  G  S+
Sbjct: 426 KIT-SYIIFMDNLINSSEDVSYLHHYGIIENWLGSDSE 418

BLAST of CsaV3_2G031920.1 vs. Swiss-Prot
Match: sp|Q9SD53|Y3720_ARATH (UPF0481 protein At3g47200 OS=Arabidopsis thaliana OX=3702 GN=At3g47200 PE=2 SV=1)

HSP 1 Score: 98.2 bits (243), Expect = 2.2e-19
Identity = 95/374 (25.40%), Postives = 160/374 (42.78%), Query Frame = 0

Query: 42  IYQVPKELREINDKAYVPQFISIGPFHYRTRKDLIANEHYK--LQGFFNFLGRISIINSH 101
           I++VP+    +N KAY P+ +SIGP+HY  +   +  +H    LQ F +           
Sbjct: 48  IFRVPESFVALNPKAYKPKVVSIGPYHYGEKHLQMIQQHKPRLLQLFLD----------- 107

Query: 102 IQLLEENQVNISSKVLVEKSHDWVKEAWNCYAEPIKMKDEEFIIMMLVDACFIVEFFLLY 161
               E  + ++   VLV+   D   +    Y+E +K    + + MM++D CFI+  FL+ 
Sbjct: 108 ----EAKKKDVEENVLVKAVVDLEDKIRKSYSEELK-TGHDLMFMMVLDGCFILMVFLIM 167

Query: 162 YGSFHEDGKLFNAELXXXXXXXXYEILLDLIKLENQVPFFLLQNLFDLMPKDKVDISSII 221
            G+        + +           I  DL+ LENQVPFF+LQ L+         + S I
Sbjct: 168 SGNIE-----LSEDPIFSIPWLLSSIQSDLLLLENQVPFFVLQTLY---------VGSKI 227

Query: 222 GGYKD-SPISLIDLTYMVLKEFGFVREYKINNLYHKNPKHLLDFLSFYFLPVPPNNWNRK 281
           G   D + I+       + KE  +  +++     +   KHLLD +   FLP    +    
Sbjct: 228 GVSSDLNRIAFHFFKNPIDKEGSYWEKHR-----NYKAKHLLDLIRETFLPNTSESDKAS 287

Query: 282 FDHVKISKQWRLSPPTTTELCEAGVTIKVAK------------RKNNLCFMNISFKNGVL 341
             HV++      S    +   +A   I  AK            R      +N+  K   L
Sbjct: 288 SPHVQVQLHEGKSGNVPSVDSKAVPLILSAKRLRLQGIKFRLRRSKEDSILNVRLKKNKL 347

Query: 342 EIPPIVIEGTFEVLIRNVLAFEIFPAGNQKKYAIQYVTFLDDLISTEKDLCLLVKAGVII 399
           +IP +  +G       N +AFE F   +  +    Y+ F+  L++ E+D+  L    +II
Sbjct: 348 QIPQLRFDGFISSFFLNCVAFEQFYTDSSNEIT-TYIVFMGCLLNNEEDVTFLRNDKLII 385

BLAST of CsaV3_2G031920.1 vs. TrEMBL
Match: tr|A0A0A0LQ57|A0A0A0LQ57_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G380610 PE=4 SV=1)

HSP 1 Score: 773.9 bits (1997), Expect = 1.8e-220
Identity = 392/395 (99.24%), Postives = 393/395 (99.49%), Query Frame = 0

Query: 1   MENSEIEIMSENIDQNVRDNVVISIDKMLEGLPRVNPKCNTIYQVPKELREINDKAYVPQ 60
           MENSEIEIMSENIDQNVRDNVVISIDKMLEGLPRVNPKCNTIYQVPKELREINDKAYVPQ
Sbjct: 1   MENSEIEIMSENIDQNVRDNVVISIDKMLEGLPRVNPKCNTIYQVPKELREINDKAYVPQ 60

Query: 61  FISIGPFHYRTRKDLIANEHYKLQGFFNFLGRISIINSHIQLLEENQVNISSKVLVEKSH 120
           FISIGPFHYRTRKDLIANEHYKLQGFFNFLGRISIINSHIQLLEENQVNISSKVLVEKSH
Sbjct: 61  FISIGPFHYRTRKDLIANEHYKLQGFFNFLGRISIINSHIQLLEENQVNISSKVLVEKSH 120

Query: 121 DWVKEAWNCYAEPIKMKDEEFIIMMLVDACFIVEFFLLYYGSFHEDGKLFNAELXXXXXX 180
           DWVKEAWNCYAEPIKMKDEEFIIMMLVDACFIVEFFLLYYGSFHEDGKLFNAELXXXXXX
Sbjct: 121 DWVKEAWNCYAEPIKMKDEEFIIMMLVDACFIVEFFLLYYGSFHEDGKLFNAELXXXXXX 180

Query: 181 XXYEILLDLIKLENQVPFFLLQNLFDLMPKDKVDISSIIGGYKDSPISLIDLTYMVLKEF 240
           XXYEILLDLIKLENQVPFFLLQNLFDLMPKDKVDISSIIGGYKDSPISLIDLTYMVLKEF
Sbjct: 181 XXYEILLDLIKLENQVPFFLLQNLFDLMPKDKVDISSIIGGYKDSPISLIDLTYMVLKEF 240

Query: 241 GFVREYKINNLYHKNPKHLLDFLSFYFLPVPPNNWNRKFDHVKISKQWRLSPPTTTELCE 300
           GFVREYKINNLYHKNPKHLLDFLSFYFLPVPPNNWNRKFDHVKISKQWRLSPPTTTELCE
Sbjct: 241 GFVREYKINNLYHKNPKHLLDFLSFYFLPVPPNNWNRKFDHVKISKQWRLSPPTTTELCE 300

Query: 301 AGVTIKVAKRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFEIFPAGNQKKYAI 360
           AGVTIKVAKRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFEIFPAGNQKKYAI
Sbjct: 301 AGVTIKVAKRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFEIFPAGNQKKYAI 360

Query: 361 QYVTFLDDLISTEKDLCLLVKAGVIINDTGGSSDD 396
           QYVTFLDDLISTEKDLCLLVKAGVIINDTGGS  +
Sbjct: 361 QYVTFLDDLISTEKDLCLLVKAGVIINDTGGSDKE 395

BLAST of CsaV3_2G031920.1 vs. TrEMBL
Match: tr|A0A1S4DW65|A0A1S4DW65_CUCME (UPF0481 protein At3g47200-like OS=Cucumis melo OX=3656 GN=LOC103488289 PE=4 SV=1)

HSP 1 Score: 431.4 bits (1108), Expect = 2.2e-117
Identity = 254/467 (54.39%), Postives = 298/467 (63.81%), Query Frame = 0

Query: 1   MENSEI-----------EIMSENID--QNVRDNVVISIDKMLEGLPRVNPKCNTIYQVPK 60
           MENSEI           + + E I   + V DNVVISIDK+L GLPR+NPKC+ IYQV K
Sbjct: 1   MENSEIIETKVENDICDDELGETISEIEKVCDNVVISIDKILGGLPRINPKCHIIYQVSK 60

Query: 61  ELREINDKAYVPQFISIGPFHYRTRKDLIANEHYKLQGFFNFLGRISIINSHIQLLEENQ 120
           ELRE+NDKAY PQFISIGPFH+RTR DLIANEHYKLQGF NFL R   IN++ Q+     
Sbjct: 61  ELREMNDKAYAPQFISIGPFHHRTRNDLIANEHYKLQGFNNFLHR---INNYEQI----- 120

Query: 121 VNISSKVLVEKSHDWVKEAWNCYAEPIKMKDEEFIIMMLVDACFIVEFFLLYYGSFHEDG 180
              SSK  V+K H WVKEAWNCYAEPI M +EEF++MMLVDACFI+EFF+L     +   
Sbjct: 121 --ESSKEFVKKCHGWVKEAWNCYAEPINMNEEEFVLMMLVDACFILEFFILLIDDHYGGD 180

Query: 181 KLFNAEL---------XXXXXXXXYEILLDLIKLENQVPFFLLQNLFDLMPKDKVDISSI 240
            LF A+                  +EIL+DLIKLENQVPFFLLQNLFDLMPK  V +   
Sbjct: 181 YLFEADQIFQIQDMVDFSFYRGVFFEILIDLIKLENQVPFFLLQNLFDLMPKHDVPMFP- 240

Query: 241 IGGYKDSPISLIDLTYMVLKEFGFVREYKINNLYHKNPKHLLDFLSFYFLPVPPNNWNRK 300
                    SLID+T  +L  FGFV +YKIN+LYHK PKHLLDFLSFYF P+ PN+ + +
Sbjct: 241 ---------SLIDITSEILTWFGFVGKYKINDLYHKKPKHLLDFLSFYFFPLLPNDDHIR 300

Query: 301 F--DHVKISKQ------------------------------------------------W 360
           F  +  K S Q                                                +
Sbjct: 301 FKQNERKNSDQNNNNLLRFFRPLFPAHWLKKNNDSFGVPSLCCFSNKEAETREKDSENYF 360

Query: 361 RLSPPTTTELCEAGVTIKVAKRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFE 396
           RLSPP+ TELCEAGVTIK AKR+ +LCFMNI FKNGVLEIP I I+ TFEV+IRNV+AF+
Sbjct: 361 RLSPPSITELCEAGVTIKAAKRE-DLCFMNIGFKNGVLEIPCIDIDCTFEVVIRNVIAFD 420

BLAST of CsaV3_2G031920.1 vs. TrEMBL
Match: tr|A0A1S3BCY5|A0A1S3BCY5_CUCME (UPF0481 protein At3g47200-like OS=Cucumis melo OX=3656 GN=LOC103488292 PE=4 SV=1)

HSP 1 Score: 335.1 bits (858), Expect = 2.2e-88
Identity = 205/395 (51.90%), Postives = 257/395 (65.06%), Query Frame = 0

Query: 9   MSENIDQNVRDNVVISIDKMLEGLPRVNPKCNTIYQVPKELREINDKAYVPQFISIGPFH 68
           +SE  DQ +  NVVI I KML+ LP+VN +C +IYQV KEL EIN KAY+PQ ISIGP H
Sbjct: 4   ISEVDDQKLCGNVVICIGKMLKQLPQVNAEC-SIYQVSKELLEINRKAYIPQLISIGPIH 63

Query: 69  YRTRKDLIANEHYKLQGFFNFLGRISIINSHIQLLEENQVNISSKVLVEKSHDWVKEAWN 128
           + T  DL+AN+ YKLQGF NFL RI+I N  I  +E+     +   LVEK+H WV+EA N
Sbjct: 64  HGTNNDLVANQQYKLQGFINFLRRININNKQILSMEDILQTGTLNTLVEKAHHWVEEARN 123

Query: 129 CY-AEPIKMKD-EEFIIMMLVDACFIVEFFLLYYGSFHEDGKLF----NAELXXXXXXXX 188
           CY + PI   D + F+IMMLVDACFIVEF +L +   H +GK      N ++        
Sbjct: 124 CYTSPPINTIDMDAFVIMMLVDACFIVEFLILKFDYDHPNGKFLQIQDNIDISFYQGMDL 183

Query: 189 YEILLDLIKLENQVPFFLLQNLFDLMPKDKVDISSIIGGYKDSPISLIDLTYMVLKEFGF 248
           + IL DLIKLENQVPFFLLQ LFDL+PK   DIS +I  ++       DLT   LK F  
Sbjct: 184 H-ILYDLIKLENQVPFFLLQYLFDLIPKH--DISMMISSFR-------DLTLRALK-FRL 243

Query: 249 VREYKINNLYHKNPKHLLDFLSFYFLPVPPNNWNRKFDHVK--ISKQWRLSPPTTTELCE 308
           VR Y+IN    K PKH +D L+FYF+P      N +    K  I ++ R  PP+ TEL E
Sbjct: 244 VRTYEIN--LFKEPKHFVDLLTFYFVPSAGQKVNNQHGIFKSTIEEKNRWIPPSITELRE 303

Query: 309 AGVTIKVAKRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFEIFPAGNQKKYAI 368
           AGVTIK A++  +L   +I+FKNGVL IPP+ I   FE+++RN++AFE   A    KY I
Sbjct: 304 AGVTIKKAEKAKHL--TDITFKNGVLRIPPLHIYDEFELVLRNMVAFEQISARKMNKYVI 363

Query: 369 QYVTFLDDLISTEKDLCLLVKAGVIINDTGGSSDD 396
           QYV F+DDLISTEKD+ LLV+AGVIIN  GGS  +
Sbjct: 364 QYVLFMDDLISTEKDVRLLVEAGVIINQIGGSDKE 382

BLAST of CsaV3_2G031920.1 vs. TrEMBL
Match: tr|A0A1S3BBL9|A0A1S3BBL9_CUCME (UPF0481 protein At3g47200-like OS=Cucumis melo OX=3656 GN=LOC103488293 PE=4 SV=1)

HSP 1 Score: 334.7 bits (857), Expect = 2.8e-88
Identity = 191/396 (48.23%), Postives = 256/396 (64.65%), Query Frame = 0

Query: 3   NSEIEIMSENIDQNVRDNVVISIDKMLEGLPRVNPKCNTIYQVPKELREINDKAYVPQFI 62
           N+ +EI   +  Q V DNVVISI+KML+ +P  +    +IY+VPK+LRE+N KAY PQ I
Sbjct: 23  NNMVEISVVDQQQLVCDNVVISIEKMLDQVPPTHENQCSIYRVPKQLREMNPKAYAPQLI 82

Query: 63  SIGPFHYRTRKDLIANEHYKLQGFFNFLGRISIINSHIQLLEENQVNISSKVLVEKSHDW 122
           SIGPFHY T K+LIANE YKLQGF N+L R+  + S  QL+    V    + LV+++  W
Sbjct: 83  SIGPFHYHTHKNLIANEQYKLQGFINYLRRVYKMESLEQLVRTKSV----EDLVKRAQSW 142

Query: 123 VKEAWNCYAEPIKMKDEEFIIMMLVDACFIVEFFLLYYGSFHEDGKLFNAELXXXXXXXX 182
           V+EA NCYAE I M DE+FI MMLVD CFIVEFF+L +  ++E  +    ++        
Sbjct: 143 VEEARNCYAETINMNDEDFIKMMLVDGCFIVEFFILDFEEYNESHESLFPQIENNVSMSF 202

Query: 183 Y-----EILLDLIKLENQVPFFLLQNLFDLMPKDKVDISSIIGGYKDSPISLIDLTYMVL 242
           Y     +I  DLIKLENQ+PFF+LQ+LFDL+PK           +KD+P     LTY  L
Sbjct: 203 YKERIPDIDEDLIKLENQLPFFVLQHLFDLIPK-----------HKDAPNCFKQLTYEYL 262

Query: 243 KEFGFVREYKINNLYHKNPKHLLDFLSFYFLPVPPNNWNRKFDHVKISKQWR-LSPPTTT 302
              G++  Y+ +++    PKH +DFLSFY +P      ++K +     ++W  + PP+ T
Sbjct: 263 -TMGWLENYEPSDILSIKPKHFIDFLSFYLVPEHQYEHDQKSND---EEEWNIIIPPSIT 322

Query: 303 ELCEAGVTIKVAKRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFEIFPAGNQK 362
           E+CEAGVTIK A  KN  C +NI F+NG+LEIPP+ I+  FE ++RN+LAFE FP   + 
Sbjct: 323 EICEAGVTIKKAD-KNTKCLLNIRFENGILEIPPLHIDDYFEPMMRNLLAFEHFPVEVKN 382

Query: 363 KYAIQYVTFLDDLISTEKDLCLLVKAGVIINDTGGS 393
            Y I Y+TF+D LI TEKD+ LLVK  +IIND GGS
Sbjct: 383 TYVIPYLTFMDYLIITEKDVNLLVKEKIIINDIGGS 398

BLAST of CsaV3_2G031920.1 vs. TrEMBL
Match: tr|A0A0A0LLX1|A0A0A0LLX1_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G380590 PE=4 SV=1)

HSP 1 Score: 309.7 bits (792), Expect = 9.7e-81
Identity = 189/399 (47.37%), Postives = 247/399 (61.90%), Query Frame = 0

Query: 3   NSEIEIMSENIDQNVRDNVVISIDKMLEGLPRVNPKCNTIYQVPKELREINDKAYVPQFI 62
           N+ IEI   N  Q + D VVISI  +L  +P       +IY+VPK+ RE+N KAYVPQ I
Sbjct: 18  NNMIEISGVN-QQLICDRVVISIKNLLYQVPPALKNQCSIYRVPKQQREMNPKAYVPQHI 77

Query: 63  SIGPFHYRTRKDLIANEHYKLQGFFNFLGRISIINSHIQLLEENQVNISSKVLVEKSHDW 122
           SIGPF+Y   ++L ANE YK Q   NFL R+  I S  QLL+      S + LV+K+  W
Sbjct: 78  SIGPFYYHADENLRANEQYKFQSVINFLRRVYKIESLEQLLQTR----SLEDLVKKAQSW 137

Query: 123 VKEAWNCYAEPIKMKDEEFIIMMLVDACFIVEFFLLYYGSFHEDGKLFNAELXXXXXXXX 182
           VKEA NCYAE I M DE+FI MML+D CFIVEFF+L Y  +    + F  ++        
Sbjct: 138 VKEARNCYAESIDMNDEDFIKMMLMDGCFIVEFFILDYEEYKMPDESFFPKIENNVSMSF 197

Query: 183 Y-----EILLDLIKLENQVPFFLLQNLFDLMPKDKVDISSIIGGYKDSPISLIDLTYMVL 242
           Y     +I  DLIKLENQ+PFF+LQ+L+DL+PK             D+P    +LT   L
Sbjct: 198 YKERIPDIDDDLIKLENQLPFFILQHLYDLIPKQ-----------DDNPKCFKELTCKYL 257

Query: 243 KEFGFVREYKINNLYHKNPKHLLDFLSFYFLPVPPNNWNRKFDHVKIS---KQWR-LSPP 302
           K  G++  Y+ +++    PKH +DFLSFYF+P      + + +H K S   ++W  + PP
Sbjct: 258 K-MGWLENYEPSDIQSIEPKHFIDFLSFYFVP------HHRCEHDKKSSDLEEWNVIIPP 317

Query: 303 TTTELCEAGVTIKVAKRKNNLCFMNISFKNGVLEIPPIVIEGTFEVLIRNVLAFEIFPAG 362
           + TEL EAGVTIK  K +N  C MNI FKNG+LEIPP+ I+  FE ++R++LAFE FP  
Sbjct: 318 SITELFEAGVTIK--KAENTKCLMNIKFKNGILEIPPLHIDDYFEPMMRDLLAFEHFPIE 377

Query: 363 NQKKYAIQYVTFLDDLISTEKDLCLLVKAGVIINDTGGS 393
            Q  Y I Y+TF+D LISTE D+ LLVK  +IIND GGS
Sbjct: 378 VQNTYVIPYITFMDYLISTENDVNLLVKEKIIINDIGGS 391

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KGN62932.12.7e-22099.24hypothetical protein Csa_2G380610 [Cucumis sativus][more]
XP_004138863.28.9e-17976.98PREDICTED: UPF0481 protein At3g47200, partial [Cucumis sativus][more]
XP_008445182.13.3e-11754.39PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo] >XP_008445184.1 PREDICT... [more]
XP_008445187.13.3e-8851.90PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo][more]
XP_008445188.14.2e-8848.23PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo][more]
Match NameE-valueIdentityDescription
AT4G31980.15.8e-3930.57unknown protein[more]
AT3G50170.19.6e-2627.86Plant protein of unknown function (DUF247)[more]
AT3G50120.11.8e-2426.84Plant protein of unknown function (DUF247)[more]
AT5G22550.25.3e-2425.56Plant protein of unknown function (DUF247)[more]
AT3G50160.16.9e-2428.89Plant protein of unknown function (DUF247)[more]
Match NameE-valueIdentityDescription
sp|Q9SD53|Y3720_ARATH2.2e-1925.40UPF0481 protein At3g47200 OS=Arabidopsis thaliana OX=3702 GN=At3g47200 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
tr|A0A0A0LQ57|A0A0A0LQ57_CUCSA1.8e-22099.24Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G380610 PE=4 SV=1[more]
tr|A0A1S4DW65|A0A1S4DW65_CUCME2.2e-11754.39UPF0481 protein At3g47200-like OS=Cucumis melo OX=3656 GN=LOC103488289 PE=4 SV=1[more]
tr|A0A1S3BCY5|A0A1S3BCY5_CUCME2.2e-8851.90UPF0481 protein At3g47200-like OS=Cucumis melo OX=3656 GN=LOC103488292 PE=4 SV=1[more]
tr|A0A1S3BBL9|A0A1S3BBL9_CUCME2.8e-8848.23UPF0481 protein At3g47200-like OS=Cucumis melo OX=3656 GN=LOC103488293 PE=4 SV=1[more]
tr|A0A0A0LLX1|A0A0A0LLX1_CUCSA9.7e-8147.37Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G380590 PE=4 SV=1[more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR004158DUF247_pln
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CsaV3_2G031920CsaV3_2G031920gene


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsaV3_2G031920.1.cds2CsaV3_2G031920.1.cds2CDS
CsaV3_2G031920.1.cds1CsaV3_2G031920.1.cds1CDS


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsaV3_2G031920.1.exon2CsaV3_2G031920.1.exon2exon
CsaV3_2G031920.1.exon1CsaV3_2G031920.1.exon1exon


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CsaV3_2G031920.1CsaV3_2G031920.1-proteinpolypeptide


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004158Protein of unknown function DUF247, plantPFAMPF03140DUF247coord: 42..392
e-value: 1.4E-72
score: 245.0
NoneNo IPR availablePANTHERPTHR31549FAMILY NOT NAMEDcoord: 37..391