CmUC04G070220 (gene) Watermelon (USVL531) v1

Overview
Name: CmUC04G070220
Type: gene
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Endoribonuclease E-like protein
Location: CmU531Chr04: 8042265 .. 8047602 (+)
RNA-Seq Expression: CmUC04G070220
Synteny: CmUC04G070220
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCTTCACAAATCAGCTTCCCTTTCAGCTCTCCATTTCCTTAACAAAGTCTTTCATCTTCCGCAGCTTTTCCCCAAATCTAAAACCACTCCCATCAATCTACTCTGCTTCACCCTTTAAATCATCATCCAAAATTTCCAAATCCCAGAACCCAACAACAGGTACAATCACAGCCCCATTGCAAGCAATTAAACGCAAGTGAATTCGTTCTCTTTCCTCCTTACCATTTTGTTTTATGTTCAGCTCTTGCCCTTTGACGACAATTAGTTCTTTTGAATTGACAACCGTTTGTTTTTAATCTTTAAAACCCTTTGCAATGTATCCACAATCACACACTCACAAACTTATCTTAAAAGGGAAATCATCGAATAATTGGCTAGCGACGATAATTTGGGCTTCTTGGGCTCTTGCAGAGTTCCAAGGATTCTGCTATTGGGATAGAATCAGAATTACTATTAGGGTATTAAAGGTATATTAGTAATTAGCTTGGGAAGTTGTTCTCTTCATTCTCATTTGTTATGAATGGTGGATATGAGAGGATGAATGTATTCATTGGCTTATTTGGTTTAGGCTCAAGTGATATTACCCAAAAGGAGGGTGGCTCTATGTGCCTCAAGTACTTGACTTATCTTGTAGTTATTCTGTCTTATATTTCAATATCATTTTAGTTGTTTGGATTCTATCAGATTCATGCCAGTTGAGAACAAAAACTTGCTTCTTAGGTGCACGAGCAAATGATGTAGCTACAACTGAAAAGGAAGAGCAAGAAGAGACGGAAGTTGCAGAGGGATATACCATCTCTCAATTTTGTGATAAAATAATTGATATTTTCATGAATGAGAAGCCAAAGACTAAAGAATGGAGGAAGTTTTTGGTATTTAGGGAGGAGTGGAAAAAGTATAGGGAGAGCTTCTACAGTCATTGCCAAAGGCGGGCAGACTGGGAGAGTGATCCAATTATGAAAGAGAAGTTAATTTCACTTAGGAGAAAGGTCAAAAGGGTATGTACTTGTCCTCTTTGTTTTCCTCTCCATTTGTTTTTGAAACTTTCTTGCTATGATTTGTGTTCTTCAATACCATATCTTGTTGCATTGGTTTTTGTTCGCAAGAGTTTGAAAACAAAATTTAGAGGAGAATTTAGGTTGTAATGTTCACTGCTTGTTATATGGTGATTATTTCCGTTGCCTAGAGTCCAATTTGTGAACAATTAAAGCTGCAACACTTGTTTTCTGTTTTTGCATTCGTTGATCATGAGTGAGAAATATACAGATTGATGATGAAATGGAAATCCACAGTGAACTTCTCAAGGAATTACAGGGCAGCCCAACTGACATTAATGCGATAGTTGCAAAGCGGCGCAAAGAGTTCACAGAGGAATTTTTTAAGTTCCTTACTCTGATATCGGAAACCCATGACAGTTTGGAAGATCGTGATGGTAATATTTGAAATCTTTCATTATTAATTTTAGTTTTAACGTGGATAACCAAGTTTCTTTGTTGGATTAAACTGAGACTTATAAATTCAGTTTAGATTTAATCTCAGATTTATTTATTTATTTATTTTTGATAAGATAAGAATTTGTTAATATTTACTGCATCCTTATCTTTGAACCATTTTTACAAGTATTTAGATGAATTTAATCTTTTAGTTTAGGTATGATAATAAATCTACTCCTCTCTAAATCTTGAGGCCCCATAATGCATTTCTTACCTGAGAACTGAGTAATAAAAGTTGTCTTTCTTCATAATTTTTCCCTAAATTAGTTTTGGGATTATTATTTGAAAAGTTTGTATCGTGCATTCACACTGTACCATACATTTCATTATTTTTCGTACTCATCTTTATGATTTAGGATTAAGTAATGGCTTTTGTAATCTATAGTAGTTATCTCAAGAATTGACTAAAGCCCAGATTTGTAACAACATTTTAACTAGCAGATAGAGTTACTCTAATCTGAAGGAAAAAAAAAAAAAAAATGAAATGAAAGGAAAATAAATAG
AAAAAGATTTTATCACTAGTTTTCTCTTGAACTAATTTTTCTATTTGTGCAGCGGTGGCTCGGCTGGCAGCCAGATGTCTGGCTGCAGTTAGTGCTTACGATCGAACATTAGAAAATGTGGAGACATTGGATTCTGCACAGGCCAAATTTGATGATATACTGAATTCTCCCTCATTGGACGTGGCTTGTGAGAAGATTGCAAGTCTTGGAAAGGCAAAGGAACTTGACTCATCGTTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGAATCCACAACCATGAAGAATGAGGTAATGTTTAGATTTATATATGATCTCACTACTTCAACTGCATATTCATTACATTTTAATTCCCAAACTTTCAAATAGTCTGCTTTGCTCCAAGTGGTATGTTTTTGTCTCTCGACTTTGAATAATGCATCTAAAAATTTTGATACCAAATTATTAGATGTGGCTTCAAATACTTGTCTATTTGACATCTATTGTTTTTGTCAAATGAAAAGAGAAAACAAAGTGCATATGTCAGGTTTCTATTTGAAAATTAATGACAAATAATAAAGTAGTTTTTATGCCAAGTTCAAAACTAAAACACAATGTTGGGAAGTTTAGGGAAAAAATTGAATTCACATGAGAGTTTGTGGACACTGAAATAGATTTTCCTCTATCTATATGTGTCATGTCATATGACCCTACAAATCATTCTGGTTTGAAGCCATTATATTATTGTGGATTGTCAATGTCCCTGGCCACAATATTAATTGCTGTTGTTAACTATGAAGTTCTTGAAAGTAACTCCCTGCAACATCCCCAGTTTTGCCCGAGTTGGAAACTTACTCTTATTTTACAATTCATTTCTAAGACGGTCTAACACTTTTTTTTCTTTTCTGAGTAACCGAGCTATCATGTGGAAGAATAAAAAAGTAGGAAAAAATACCCTTTTGGTCCTTAAGTTTTGAGTCTAATTTCTATTTGGTCCCTAGCTTTCAAAATGTAGTCCCCATCCCTCACTTTGATATCGGAGGTATTTTTCTTTGTTTTGCAGGTCCTTTTAAAGTTTTGAGGTTTGTTTCAATTTAGTCCCTAAGTTTAACAATGTCTATGAATTAATTTAAAAGCATTATAGTTAATTAAGTGTCATTATTTTTCATCACTATTAGAATTGAAATTAAAATTTCACTTCATAGTTATTTTAAATCAATTAATGGACATTAATGTTAATGACTGTTAGTCGATATTTAAACTACCACTTTCTTAGGGCCAAATAAAATGAAACTCAAATATCAAGAGCAAAATTGTAACATTTTGAACCCTAAGGACTAAATTGAAATCAATCAAAACTTAAAGATTAAATGTGTAACATTTTGAAATCTAGGGACTAAAGGGAACTAAACCCAAAACTTTGGGTAAAAAAAAAAGGTACTCTTCCAAAAAAAAATGAAAAAAAAAAAAAAAAAAAAAAAACCCTCAGTCTATGGGACTCCAATCGAAAAGGATCAAACCTGATTGCTTCACACATTTAGTGTTGGACATGCTTTTTAGAAAATATTTTAGAGTGAAGCTCTGATTATATTTATATCAGCAATGTCGGAAAAATGTTAATCAAATCATTCCTCTTAAAGCACTTATAGAATTTCTTCCACCACTTTTGATAATAGAAAATTTTAGAAAGTGATTTTAATTGGTTAAAAGCACTCTACTCTTTTTAACATCTTAACCAAGCACTGTCCATTTAAAAAAGAAGAAAAAGAAAATTTGTCATTAATAATCACTTTACAAGGCATGCCAAAATACACTAATACAAAAAAGGGAGTTCTGTTTCTAAATTCTCATTATAATTGATATGTTGGAATTGGATTACAAACTTGATTGAAAGTCAATGAAGAGCTGACGATGAATTTTTATGTCTAAGATTACTTGAATATTGTATGCCCTTGGTGAAAAAATGTTCCATCTTTATTGAGTTTGGCAGTCTAAATGAAATACATTCCAAAGCT
TTCCAATAGTTTCAGATTGAAATATTATGTCACCTTCTCTATCTTTCTGTGTGTGAATGTAGGTGAAAGAAATAATGTATCATTTATACAAAACCACAAAAAGCAGTCTTAGAAGCATGGCCCCTAAAGAAATAAAGCTGTTAAAACATTTGCTGAACATCGTAGATCCTGAAGAACGATTTTCTGCTTTAGCAACAGCCTTCGCCCCAGGTGATGGAAGTGAACAAAAAGATCCAAATGCTTTGTACACGTGAGTAAGCCTCATCATAATTTATACATTTATTCATGATAATTACATACTTATCTTATAGGTTTCATTAGTCGTCAGAGTTTACTTCAAAATATACCTACTACTTGCTTTGAACTTTTCCGTTTAGTTCATAATGCTTGCAGAACTTTTAGAAAGTAACTATTTTAATTATTATCAATTATTTTGGCATCAATTATTTATCGGATCATGTGGCTCGCAGCTATATAGACTAGTTAAGGCAAGTAATGGATGGGGAACACTCCATACATGCCCAAATCGACTATCAACCCTCATATTAGCTAAAACTATATGACTTTATTATGTTGATAAATTACTAGGATACCTCCTAACAAGAACACTCAAGAATAACATAGAAAACTCAAAAATGCTCAAATATGGAAATATATGAAAATATGTTGTAATATTATAAGAAAAGTTACTAGATAACCTAACCCTTTCGAGAGGGTTAATCTCTCCCAAATTCTCCAAAGGAAATCCTACAAAAATCTTTGTACCTTTATCCCAACCTCACTCCCTCTATTTATAACTAAAAGTCCTAACGAACTTACTATCTATTTACTAGTATGCTGTATGCATACTAATAATCTTAATATCTCCCCGACTATGGGTCTTACATAAATTCAAAATCACATTATCCATCTATTAAATAATTGACAACAAAACAACACTAAGGCCCAAAATGATGCCATAGGAACAGAATAACATCTATTTTGAATACTTGACAATGGTATAAAAATGATGTTCAAACTGATTGTTTGGTTAATGTTTAGGAACCATAATAAAATTTAATAGAATGCAAAGTAAGATTTGGTCATGACTATCATTTTGTGTAATTGGGCAGAACTCCGACAGAGCTGCATAAGTGGATAAAGATAATGCTTGATTCATACCATCTGAACCAGGAAGAAACAGACATAAGAGAAGCAAGGAATATGACTCAGCCTGTTGTTATACAAAGGCTATTCATCCTCAAGGATACTATTGAAACTGAGTATTTGGAACAGAATGAGTCTCACAATCCTCAATCCAAACCAAATCATGTTTCTGAAGATGCGGTTTCTATATAG

mRNA sequence

ATGGCCTTCACAAATCAGCTTCCCTTTCAGCTCTCCATTTCCTTAACAAAGTCTTTCATCTTCCGCAGCTTTTCCCCAAATCTAAAACCACTCCCATCAATCTACTCTGCTTCACCCTTTAAATCATCATCCAAAATTTCCAAATCCCAGAACCCAACAACAGAGTTCCAAGGATTCTGCTATTGGGATAGAATCAGAATTACTATTAGGGCTCAAGTGATATTACCCAAAAGGAGGGTGGCTCTATGTGCCTCAAGTGCACGAGCAAATGATGTAGCTACAACTGAAAAGGAAGAGCAAGAAGAGACGGAAGTTGCAGAGGGATATACCATCTCTCAATTTTGTGATAAAATAATTGATATTTTCATGAATGAGAAGCCAAAGACTAAAGAATGGAGGAAGTTTTTGGTATTTAGGGAGGAGTGGAAAAAGTATAGGGAGAGCTTCTACAGTCATTGCCAAAGGCGGGCAGACTGGGAGAGTGATCCAATTATGAAAGAGAAGTTAATTTCACTTAGGAGAAAGGTCAAAAGGATTGATGATGAAATGGAAATCCACAGTGAACTTCTCAAGGAATTACAGGGCAGCCCAACTGACATTAATGCGATAGTTGCAAAGCGGCGCAAAGAGTTCACAGAGGAATTTTTTAAGTTCCTTACTCTGATATCGGAAACCCATGACAGTTTGGAAGATCGTGATGCGGTGGCTCGGCTGGCAGCCAGATGTCTGGCTGCAGTTAGTGCTTACGATCGAACATTAGAAAATGTGGAGACATTGGATTCTGCACAGGCCAAATTTGATGATATACTGAATTCTCCCTCATTGGACGTGGCTTGTGAGAAGATTGCAAGTCTTGGAAAGGCAAAGGAACTTGACTCATCGTTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGAATCCACAACCATGAAGAATGAGCCATTATATTATTGTGGATTGTCAATGTCCCTGGCCACAATATTAATTGCTGTTGTTAACTATGAAGTTCTTGAAAGTAACTCCCTGCAACATCCCCAGTTTTGCCCGAGTTGGAAACTTACTCTTATTTTACAATTCATTTCTAAGACGGTGAAAGAAATAATGTATCATTTATACAAAACCACAAAAAGCAGTCTTAGAAGCATGGCCCCTAAAGAAATAAAGCTGTTAAAACATTTGCTGAACATCGTAGATCCTGAAGAACGATTTTCTGCTTTAGCAACAGCCTTCGCCCCAGGTGATGGAAGTGAACAAAAAGATCCAAATGCTTTGTACACAACTCCGACAGAGCTGCATAAGTGGATAAAGATAATGCTTGATTCATACCATCTGAACCAGGAAGAAACAGACATAAGAGAAGCAAGGAATATGACTCAGCCTGTTGTTATACAAAGGCTATTCATCCTCAAGGATACTATTGAAACTGAGTATTTGGAACAGAATGAGTCTCACAATCCTCAATCCAAACCAAATCATGTTTCTGAAGATGCGGTTTCTATATAG

Coding sequence (CDS)

ATGGCCTTCACAAATCAGCTTCCCTTTCAGCTCTCCATTTCCTTAACAAAGTCTTTCATCTTCCGCAGCTTTTCCCCAAATCTAAAACCACTCCCATCAATCTACTCTGCTTCACCCTTTAAATCATCATCCAAAATTTCCAAATCCCAGAACCCAACAACAGAGTTCCAAGGATTCTGCTATTGGGATAGAATCAGAATTACTATTAGGGCTCAAGTGATATTACCCAAAAGGAGGGTGGCTCTATGTGCCTCAAGTGCACGAGCAAATGATGTAGCTACAACTGAAAAGGAAGAGCAAGAAGAGACGGAAGTTGCAGAGGGATATACCATCTCTCAATTTTGTGATAAAATAATTGATATTTTCATGAATGAGAAGCCAAAGACTAAAGAATGGAGGAAGTTTTTGGTATTTAGGGAGGAGTGGAAAAAGTATAGGGAGAGCTTCTACAGTCATTGCCAAAGGCGGGCAGACTGGGAGAGTGATCCAATTATGAAAGAGAAGTTAATTTCACTTAGGAGAAAGGTCAAAAGGATTGATGATGAAATGGAAATCCACAGTGAACTTCTCAAGGAATTACAGGGCAGCCCAACTGACATTAATGCGATAGTTGCAAAGCGGCGCAAAGAGTTCACAGAGGAATTTTTTAAGTTCCTTACTCTGATATCGGAAACCCATGACAGTTTGGAAGATCGTGATGCGGTGGCTCGGCTGGCAGCCAGATGTCTGGCTGCAGTTAGTGCTTACGATCGAACATTAGAAAATGTGGAGACATTGGATTCTGCACAGGCCAAATTTGATGATATACTGAATTCTCCCTCATTGGACGTGGCTTGTGAGAAGATTGCAAGTCTTGGAAAGGCAAAGGAACTTGACTCATCGTTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGAATCCACAACCATGAAGAATGAGCCATTATATTATTGTGGATTGTCAATGTCCCTGGCCACAATATTAATTGCTGTTGTTAACTATGAAGTTCTTGAAAGTAACTCCCTGCAACATCCCCAGTTTTGCCCGAGTTGGAAACTTACTCTTATTTTACAATTCATTTCTAAGACGGTGAAAGAAATAATGTATCATTTATACAAAACCACAAAAAGCAGTCTTAGAAGCATGGCCCCTAAAGAAATAAAGCTGTTAAAACATTTGCTGAACATCGTAGATCCTGAAGAACGATTTTCTGCTTTAGCAACAGCCTTCGCCCCAGGTGATGGAAGTGAACAAAAAGATCCAAATGCTTTGTACACAACTCCGACAGAGCTGCATAAGTGGATAAAGATAATGCTTGATTCATACCATCTGAACCAGGAAGAAACAGACATAAGAGAAGCAAGGAATATGACTCAGCCTGTTGTTATACAAAGGCTATTCATCCTCAAGGATACTATTGAAACTGAGTATTTGGAACAGAATGAGTCTCACAATCCTCAATCCAAACCAAATCATGTTTCTGAAGATGCGGTTTCTATATAG

Protein sequence

MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQEETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLINSAWASAKESTTMKNEPLYYCGLSMSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKPNHVSEDAVSI
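
The protein sequence above corresponds to a frame-0 translation of the CDS. As an illustrative sketch only (not part of the database record), the Python below translates the first and last codons of the CDS, assuming the standard nuclear genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the CDS uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS listed above.
print(translate("ATGGCCTTCACAAATCAGCTTCCCTTTCAG"))  # MAFTNQLPFQ
print(translate("GCGGTTTCTATATAG"))                 # AVSI
```

Both outputs agree with the start (MAFTNQLPFQ) and end (AVSI) of the protein sequence listed above; the final TAG codon is a stop and is not emitted.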
Homology
BLAST of CmUC04G070220 vs. NCBI nr
Match: XP_038883874.1 (uncharacterized protein At4g37920 isoform X1 [Benincasa hispida])

HSP 1 Score: 714.1 bits (1842), Expect = 8.2e-202
Identity = 396/503 (78.73%), Positives = 402/503 (79.92%), Query Frame = 0

Query: 1   MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFC 60
           MAFTN L FQLSIS TKSFIF SFS  LKPLPSIYSAS FK S +I KS NPT       
Sbjct: 1   MAFTNHLLFQLSISSTKSFIFPSFSATLKPLPSIYSASLFKPSPEIYKSDNPTP------ 60

Query: 61  YWDRIRITIRAQV-ILPKRRVALCASSARANDVATTEKEEQEETEVAEGYTISQFCDKII 120
               + IT   Q  I  + R   C   A  NDVATTEKEE+ E EVAEGYTISQFCDKII
Sbjct: 61  ----VTITTPMQFKIHCELRAKTCFLGALVNDVATTEKEEEAEMEVAEGYTISQFCDKII 120

Query: 121 DIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRI 180
           DIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRI
Sbjct: 121 DIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRI 180

Query: 181 DDEMEIHSELLKELQGSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLA 240
           DDEMEIH ELLKELQ SPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLA
Sbjct: 181 DDEMEIHGELLKELQDSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLA 240

Query: 241 ARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLI 300
           ARCLAAVSAYDRTLENVETLDSAQAKFDDIL SPSLDVACEKIASL KAKELDSSLILLI
Sbjct: 241 ARCLAAVSAYDRTLENVETLDSAQAKFDDILTSPSLDVACEKIASLAKAKELDSSLILLI 300

Query: 301 NSAWASAKESTTMKNEPLYYCGLSMSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLIL 360
           NSAWA+AKESTTMKNE                                            
Sbjct: 301 NSAWAAAKESTTMKNE-------------------------------------------- 360

Query: 361 QFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSE 420
                 VKEIMYHLYK TKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSE
Sbjct: 361 ------VKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSE 420

Query: 421 QKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEY 480
           QKDP ALYTTP ELHKWIKIMLDSYHLNQE+TDIREARNMTQPVVIQRLFILKDTIETEY
Sbjct: 421 QKDPKALYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPVVIQRLFILKDTIETEY 443

Query: 481 LEQNESHNPQSKPNHVSEDAVSI 503
           LEQNE  NPQS PNHVSEDAVSI
Sbjct: 481 LEQNEFQNPQSTPNHVSEDAVSI 443

BLAST of CmUC04G070220 vs. NCBI nr
Match: XP_038883875.1 (uncharacterized protein At4g37920 isoform X2 [Benincasa hispida])

HSP 1 Score: 710.3 bits (1832), Expect = 1.2e-200
Identity = 394/502 (78.49%), Positives = 400/502 (79.68%), Query Frame = 0

Query: 1   MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFC 60
           MAFTN L FQLSIS TKSFIF SFS  LKPLPSIYSAS FK S +I KS NPT       
Sbjct: 1   MAFTNHLLFQLSISSTKSFIFPSFSATLKPLPSIYSASLFKPSPEIYKSDNPTP------ 60

Query: 61  YWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQEETEVAEGYTISQFCDKIID 120
               + IT   Q            +SA  NDVATTEKEE+ E EVAEGYTISQFCDKIID
Sbjct: 61  ----VTITTPMQF----------KASALVNDVATTEKEEEAEMEVAEGYTISQFCDKIID 120

Query: 121 IFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRID 180
           IFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRID
Sbjct: 121 IFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRID 180

Query: 181 DEMEIHSELLKELQGSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAA 240
           DEMEIH ELLKELQ SPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAA
Sbjct: 181 DEMEIHGELLKELQDSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAA 240

Query: 241 RCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN 300
           RCLAAVSAYDRTLENVETLDSAQAKFDDIL SPSLDVACEKIASL KAKELDSSLILLIN
Sbjct: 241 RCLAAVSAYDRTLENVETLDSAQAKFDDILTSPSLDVACEKIASLAKAKELDSSLILLIN 300

Query: 301 SAWASAKESTTMKNEPLYYCGLSMSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQ 360
           SAWA+AKESTTMKNE                                             
Sbjct: 301 SAWAAAKESTTMKNE--------------------------------------------- 360

Query: 361 FISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEQ 420
                VKEIMYHLYK TKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEQ
Sbjct: 361 -----VKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEQ 420

Query: 421 KDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYL 480
           KDP ALYTTP ELHKWIKIMLDSYHLNQE+TDIREARNMTQPVVIQRLFILKDTIETEYL
Sbjct: 421 KDPKALYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPVVIQRLFILKDTIETEYL 432

Query: 481 EQNESHNPQSKPNHVSEDAVSI 503
           EQNE  NPQS PNHVSEDAVSI
Sbjct: 481 EQNEFQNPQSTPNHVSEDAVSI 432

BLAST of CmUC04G070220 vs. NCBI nr
Match: XP_008442081.1 (PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 705.7 bits (1820), Expect = 2.9e-199
Identity = 387/504 (76.79%), Positives = 404/504 (80.16%), Query Frame = 0

Query: 1   MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFC 60
           MAFTN LPFQ  IS TKSFIF +FS  LKPLPSIYSASPFK S K SKS N TT      
Sbjct: 1   MAFTNHLPFQFYISSTKSFIFPNFSTTLKPLPSIYSASPFKPSPKFSKSDNRTT------ 60

Query: 61  YWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQEETEVAEGYTISQFCDKIID 120
               + IT   Q+           +SAR NDVAT+EKEEQ E EVA+GY++SQFCDKIID
Sbjct: 61  ----VTITAPLQIF---------NASARVNDVATSEKEEQAEMEVAKGYSLSQFCDKIID 120

Query: 121 IFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRID 180
           IFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVK+ID
Sbjct: 121 IFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKKID 180

Query: 181 DEMEIHSELLKELQGSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAA 240
           DEMEIHSELLKELQ SPTDINAIVA RRKEFT+EFFKFLTLISETHDSLEDRDAVARLAA
Sbjct: 181 DEMEIHSELLKELQDSPTDINAIVANRRKEFTDEFFKFLTLISETHDSLEDRDAVARLAA 240

Query: 241 RCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN 300
           RCLAAVSAYDRTLENVETLDSAQAKFD+ILNSPSLDVACEKIASL KAKELDSSLILLIN
Sbjct: 241 RCLAAVSAYDRTLENVETLDSAQAKFDNILNSPSLDVACEKIASLAKAKELDSSLILLIN 300

Query: 301 SAWASAKESTTMKNEPLYYCGLSMSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQ 360
           SAWASAKESTTMKNE                                             
Sbjct: 301 SAWASAKESTTMKNE--------------------------------------------- 360

Query: 361 FISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEQ 420
                VKEIMYHLYK TKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAF+PGDGSEQ
Sbjct: 361 -----VKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFSPGDGSEQ 420

Query: 421 KDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYL 480
           KDPNALYTTP ELHKWIKIMLDSYHLNQE+TDIREARNMTQP+VIQRLFILKDTIETEYL
Sbjct: 421 KDPNALYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYL 435

Query: 481 EQNESHNPQSKP--NHVSEDAVSI 503
           EQN+  NPQS+P  NH SEDA+SI
Sbjct: 481 EQNQFQNPQSRPNHNHGSEDAISI 435

BLAST of CmUC04G070220 vs. NCBI nr
Match: XP_004146379.1 (uncharacterized protein At4g37920 isoform X1 [Cucumis sativus] >KGN54831.1 hypothetical protein Csa_012204 [Cucumis sativus])

HSP 1 Score: 694.5 bits (1791), Expect = 6.7e-196
Identity = 380/504 (75.40%), Positives = 400/504 (79.37%), Query Frame = 0

Query: 1   MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFC 60
           MAFTN LPFQ  +S TK FIF SFS  L PLPSIYSASPFK S KISKS N T+      
Sbjct: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTS------ 60

Query: 61  YWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQEETEVAEGYTISQFCDKIID 120
               + IT   Q+           +SAR NDVAT+EKEEQ E EVA+GY++SQFCDKIID
Sbjct: 61  ----VTITAPLQIF---------NASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIID 120

Query: 121 IFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRID 180
           IF+NEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWE DPIMKEKLISLRRKVK+ID
Sbjct: 121 IFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKID 180

Query: 181 DEMEIHSELLKELQGSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAA 240
           DEMEIHSELLKELQ SPTDINAIVAKR KEFT+EFFKFLTLISETHDSLEDRDAVARLAA
Sbjct: 181 DEMEIHSELLKELQDSPTDINAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAA 240

Query: 241 RCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN 300
           RCLAAVSAY+RTLENVETLDSAQ KFD+ILNSPSLDVACEKIASL KAKELDSSLILLIN
Sbjct: 241 RCLAAVSAYNRTLENVETLDSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLIN 300

Query: 301 SAWASAKESTTMKNEPLYYCGLSMSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQ 360
           SAWASAKESTTMKNE                                             
Sbjct: 301 SAWASAKESTTMKNE--------------------------------------------- 360

Query: 361 FISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEQ 420
                VKEIMYHLYK TKSSLRSMAPKEIKLLKHLLNIVDPEERFSALAT F+PGDGSEQ
Sbjct: 361 -----VKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQ 420

Query: 421 KDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYL 480
           KDPNALYTTP ELHKWIKIMLDSYHLNQE+TDIREARNMTQP+VIQRLFILKDTIETEYL
Sbjct: 421 KDPNALYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYL 435

Query: 481 EQNESHNPQSKP--NHVSEDAVSI 503
           EQN+  NPQS+P  NH SEDA+SI
Sbjct: 481 EQNQFQNPQSRPSHNHGSEDAISI 435

BLAST of CmUC04G070220 vs. NCBI nr
Match: KAG6603165.1 (hypothetical protein SDJN03_03774, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 687.6 bits (1773), Expect = 8.2e-194
Identity = 381/502 (75.90%), Positives = 395/502 (78.69%), Query Frame = 0

Query: 1   MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFC 60
           MA TNQL FQLSIS TK+FIFR FS   KPLPSI SA+PFKSS K SKS N  T      
Sbjct: 1   MAITNQLAFQLSISSTKTFIFRRFSAAQKPLPSISSATPFKSSPKNSKSDNRAT------ 60

Query: 61  YWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQEETEVAEGYTISQFCDKIID 120
                     A V  P +  A    SARANDVATTE EEQ E EVAEGYTISQFCDKIID
Sbjct: 61  ----------ATVPTPMQFNA----SARANDVATTEMEEQAEMEVAEGYTISQFCDKIID 120

Query: 121 IFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRID 180
           IFMNEKPKTKEWRK LVFREEWKKYRESFYSHCQRRADWESDPIMKEKL+SL R+VKRID
Sbjct: 121 IFMNEKPKTKEWRKLLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLLSLGRRVKRID 180

Query: 181 DEMEIHSELLKELQGSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAA 240
           DEMEIHSELLKELQ SPTDINAIVAKRR+EFTE+FFKFLTL+SETHDSLED DAVARLAA
Sbjct: 181 DEMEIHSELLKELQDSPTDINAIVAKRRQEFTEDFFKFLTLVSETHDSLEDHDAVARLAA 240

Query: 241 RCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN 300
           RCL+AVSAYDRTLE+VETLDSAQ KFDDILNSPSLDVACEKIASL KAKELDSSLILLIN
Sbjct: 241 RCLSAVSAYDRTLEHVETLDSAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLIN 300

Query: 301 SAWASAKESTTMKNEPLYYCGLSMSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQ 360
           SAWASAKESTTMKNE                                             
Sbjct: 301 SAWASAKESTTMKNE--------------------------------------------- 360

Query: 361 FISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEQ 420
                VKEIMYHLYK TKS LRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSE 
Sbjct: 361 -----VKEIMYHLYKATKSGLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEP 420

Query: 421 KDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYL 480
           KDPNA+YTTP ELHKWIKIMLDSYHLNQE+TDIREAR M QP+VIQRLFILKDTIETEYL
Sbjct: 421 KDPNAIYTTPKELHKWIKIMLDSYHLNQEDTDIREARKMAQPIVIQRLFILKDTIETEYL 432

Query: 481 EQNESHNPQSKPNHVSEDAVSI 503
           EQNES N QSKPNHVS +AVSI
Sbjct: 481 EQNESQNAQSKPNHVSANAVSI 432

BLAST of CmUC04G070220 vs. ExPASy Swiss-Prot
Match: Q84WN0 (Uncharacterized protein At4g37920 OS=Arabidopsis thaliana OX=3702 GN=At4g37920 PE=2 SV=2)

HSP 1 Score: 479.2 bits (1232), Expect = 5.8e-134
Identity = 249/385 (64.68%), Positives = 295/385 (76.62%), Query Frame = 0

Query: 98  EEQEETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRA 157
           E+  E EVAEGYT++QFCDKIID+F+NEKPK K+W+ +LV R+EW KY  +FY  C+ RA
Sbjct: 70  EDPMEVEVAEGYTMAQFCDKIIDLFLNEKPKVKQWKTYLVLRDEWNKYSVNFYKRCRIRA 129

Query: 158 DWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDINAIVAKRRKEFTEEFFK 217
           D E+DPI+K+KL+SL  KVK+ID EME H++LLKE+Q +PTDINAI AKRR++FT EFF+
Sbjct: 130 DTETDPILKQKLVSLESKVKKIDKEMEKHNDLLKEIQENPTDINAIAAKRRRDFTGEFFR 189

Query: 218 FLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDV 277
           ++TL+SET D LEDRDAVARLA RCL+AVSAYD TLE+VETLD+AQAKF+DILNSPS+D 
Sbjct: 190 YVTLLSETLDGLEDRDAVARLATRCLSAVSAYDNTLESVETLDTAQAKFEDILNSPSVDS 249

Query: 278 ACEKIASLGKAKELDSSLILLINSAWASAKESTTMKNEPLYYCGLSMSLATILIAVVNYE 337
           ACEKI SL KAKELDSSLILLINSA+A+AKES T+ NE                      
Sbjct: 250 ACEKIRSLAKAKELDSSLILLINSAYAAAKESQTVTNE---------------------- 309

Query: 338 VLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLN 397
                                        K+IMYHLYK TKSSLRS+ PKEIKLLK+LLN
Sbjct: 310 ----------------------------AKDIMYHLYKATKSSLRSITPKEIKLLKYLLN 369

Query: 398 IVDPEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREAR 457
           I DPEERFSALATAF+PGD  E KDP ALYTTP ELHKWIKIMLD+YHLN+EETDI+EA+
Sbjct: 370 ITDPEERFSALATAFSPGDDHEAKDPKALYTTPKELHKWIKIMLDAYHLNKEETDIKEAK 404

Query: 458 NMTQPVVIQRLFILKDTIETEYLEQ 483
            M+QP+VIQRLFILKDTIE EYL++
Sbjct: 430 QMSQPIVIQRLFILKDTIEDEYLDK 404

BLAST of CmUC04G070220 vs. ExPASy TrEMBL
Match: A0A1S3B4W5 (uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103486045 PE=4 SV=1)

HSP 1 Score: 705.7 bits (1820), Expect = 1.4e-199
Identity = 387/504 (76.79%), Positives = 404/504 (80.16%), Query Frame = 0

Query: 1   MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFC 60
           MAFTN LPFQ  IS TKSFIF +FS  LKPLPSIYSASPFK S K SKS N TT      
Sbjct: 1   MAFTNHLPFQFYISSTKSFIFPNFSTTLKPLPSIYSASPFKPSPKFSKSDNRTT------ 60

Query: 61  YWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQEETEVAEGYTISQFCDKIID 120
               + IT   Q+           +SAR NDVAT+EKEEQ E EVA+GY++SQFCDKIID
Sbjct: 61  ----VTITAPLQIF---------NASARVNDVATSEKEEQAEMEVAKGYSLSQFCDKIID 120

Query: 121 IFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRID 180
           IFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVK+ID
Sbjct: 121 IFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKKID 180

Query: 181 DEMEIHSELLKELQGSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAA 240
           DEMEIHSELLKELQ SPTDINAIVA RRKEFT+EFFKFLTLISETHDSLEDRDAVARLAA
Sbjct: 181 DEMEIHSELLKELQDSPTDINAIVANRRKEFTDEFFKFLTLISETHDSLEDRDAVARLAA 240

Query: 241 RCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN 300
           RCLAAVSAYDRTLENVETLDSAQAKFD+ILNSPSLDVACEKIASL KAKELDSSLILLIN
Sbjct: 241 RCLAAVSAYDRTLENVETLDSAQAKFDNILNSPSLDVACEKIASLAKAKELDSSLILLIN 300

Query: 301 SAWASAKESTTMKNEPLYYCGLSMSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQ 360
           SAWASAKESTTMKNE                                             
Sbjct: 301 SAWASAKESTTMKNE--------------------------------------------- 360

Query: 361 FISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEQ 420
                VKEIMYHLYK TKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAF+PGDGSEQ
Sbjct: 361 -----VKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFSPGDGSEQ 420

Query: 421 KDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYL 480
           KDPNALYTTP ELHKWIKIMLDSYHLNQE+TDIREARNMTQP+VIQRLFILKDTIETEYL
Sbjct: 421 KDPNALYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYL 435

Query: 481 EQNESHNPQSKP--NHVSEDAVSI 503
           EQN+  NPQS+P  NH SEDA+SI
Sbjct: 481 EQNQFQNPQSRPNHNHGSEDAISI 435

BLAST of CmUC04G070220 vs. ExPASy TrEMBL
Match: A0A0A0L3X1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G515540 PE=4 SV=1)

HSP 1 Score: 694.5 bits (1791), Expect = 3.2e-196
Identity = 380/504 (75.40%), Positives = 400/504 (79.37%), Query Frame = 0

Query: 1   MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFC 60
           MAFTN LPFQ  +S TK FIF SFS  L PLPSIYSASPFK S KISKS N T+      
Sbjct: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTS------ 60

Query: 61  YWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQEETEVAEGYTISQFCDKIID 120
               + IT   Q+           +SAR NDVAT+EKEEQ E EVA+GY++SQFCDKIID
Sbjct: 61  ----VTITAPLQIF---------NASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIID 120

Query: 121 IFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRID 180
           IF+NEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWE DPIMKEKLISLRRKVK+ID
Sbjct: 121 IFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKID 180

Query: 181 DEMEIHSELLKELQGSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAA 240
           DEMEIHSELLKELQ SPTDINAIVAKR KEFT+EFFKFLTLISETHDSLEDRDAVARLAA
Sbjct: 181 DEMEIHSELLKELQDSPTDINAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAA 240

Query: 241 RCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN 300
           RCLAAVSAY+RTLENVETLDSAQ KFD+ILNSPSLDVACEKIASL KAKELDSSLILLIN
Sbjct: 241 RCLAAVSAYNRTLENVETLDSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLIN 300

Query: 301 SAWASAKESTTMKNEPLYYCGLSMSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQ 360
           SAWASAKESTTMKNE                                             
Sbjct: 301 SAWASAKESTTMKNE--------------------------------------------- 360

Query: 361 FISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEQ 420
                VKEIMYHLYK TKSSLRSMAPKEIKLLKHLLNIVDPEERFSALAT F+PGDGSEQ
Sbjct: 361 -----VKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQ 420

Query: 421 KDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYL 480
           KDPNALYTTP ELHKWIKIMLDSYHLNQE+TDIREARNMTQP+VIQRLFILKDTIETEYL
Sbjct: 421 KDPNALYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYL 435

Query: 481 EQNESHNPQSKP--NHVSEDAVSI 503
           EQN+  NPQS+P  NH SEDA+SI
Sbjct: 481 EQNQFQNPQSRPSHNHGSEDAISI 435

BLAST of CmUC04G070220 vs. ExPASy TrEMBL
Match: A0A6J1F3Z5 (uncharacterized protein At4g37920 OS=Cucurbita moschata OX=3662 GN=LOC111439867 PE=4 SV=1)

HSP 1 Score: 687.6 bits (1773), Expect = 4.0e-194
Identity = 381/502 (75.90%), Positives = 394/502 (78.49%), Query Frame = 0

Query: 1   MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFC 60
           MA TNQL FQLSIS TK+FIFR FS   KPLPSI SA+PFKSS K SKS N  T      
Sbjct: 1   MAITNQLAFQLSISSTKTFIFRRFSAAQKPLPSISSATPFKSSPKNSKSDNRAT------ 60

Query: 61  YWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQEETEVAEGYTISQFCDKIID 120
                     A V  P +  A    SAR NDVATTE EEQ E EVAEGYTISQFCDKIID
Sbjct: 61  ----------ATVPTPMQFNA----SARTNDVATTEMEEQAEMEVAEGYTISQFCDKIID 120

Query: 121 IFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRID 180
           IFMNEKPKTKEWRK LVFREEWKKYRESFYSHCQRRADWESDPIMKEKL+SL R+VKRID
Sbjct: 121 IFMNEKPKTKEWRKLLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLLSLGRRVKRID 180

Query: 181 DEMEIHSELLKELQGSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAA 240
           DEMEIHSELLKELQ SPTDINAIVAKRRKEFTE+FFKFLTL+SETHDSLED DAVARLAA
Sbjct: 181 DEMEIHSELLKELQDSPTDINAIVAKRRKEFTEDFFKFLTLVSETHDSLEDHDAVARLAA 240

Query: 241 RCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN 300
           RCL+AVSAYDRTLE+VETLDSAQ KFDDILNSPSLDVACEKIASL KAKELDSSLILLIN
Sbjct: 241 RCLSAVSAYDRTLEHVETLDSAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLIN 300

Query: 301 SAWASAKESTTMKNEPLYYCGLSMSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQ 360
           SAWASAKESTTMKNE                                             
Sbjct: 301 SAWASAKESTTMKNE--------------------------------------------- 360

Query: 361 FISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEQ 420
                VKEIMYHLYK TKS LRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSE 
Sbjct: 361 -----VKEIMYHLYKATKSGLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEP 420

Query: 421 KDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYL 480
           KDPNA+YTTP ELHKWIKIMLDSYHLNQE+TDIREAR M QP+VIQRLFILKDTIETEYL
Sbjct: 421 KDPNAIYTTPKELHKWIKIMLDSYHLNQEDTDIREARKMAQPIVIQRLFILKDTIETEYL 432

Query: 481 EQNESHNPQSKPNHVSEDAVSI 503
           EQNES N QSKPNHVS +AVSI
Sbjct: 481 EQNESQNAQSKPNHVSTNAVSI 432

BLAST of CmUC04G070220 vs. ExPASy TrEMBL
Match: A0A6J1HRT8 (uncharacterized protein At4g37920 OS=Cucurbita maxima OX=3661 GN=LOC111467208 PE=4 SV=1)

HSP 1 Score: 678.3 bits (1749), Expect = 2.4e-191
Identity = 376/502 (74.90%), Positives = 391/502 (77.89%), Query Frame = 0

Query: 1   MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFC 60
           MA TNQL FQLSIS T++FIFR FS    PLPSI SA PFK + K SKS N  T      
Sbjct: 1   MAITNQLAFQLSISSTRTFIFRRFSAAQNPLPSISSAIPFKPAPKNSKSDNRAT------ 60

Query: 61  YWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQEETEVAEGYTISQFCDKIID 120
                     A V  P +  A    SARANDVATTE EEQ E EVAEGYTISQFCDKIID
Sbjct: 61  ----------ATVPTPMQFNA----SARANDVATTEMEEQTEMEVAEGYTISQFCDKIID 120

Query: 121 IFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRID 180
           IFMNEKPKTKEWRK LVFREEWKKYRESFYSHCQRRADWESDPIMKEKL+SL R+VKRID
Sbjct: 121 IFMNEKPKTKEWRKLLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLLSLGRRVKRID 180

Query: 181 DEMEIHSELLKELQGSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAA 240
           DEMEIHSELLKELQ SPTDINAIVAKRRKEFTE+FFKFLTL+SETHDSLED DAVARLAA
Sbjct: 181 DEMEIHSELLKELQDSPTDINAIVAKRRKEFTEDFFKFLTLVSETHDSLEDHDAVARLAA 240

Query: 241 RCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN 300
           RCL+AVSAYDRTLE+VETLDSAQ KFDDILNSP+LDVACEKIASL KAKELDSSLILLIN
Sbjct: 241 RCLSAVSAYDRTLEHVETLDSAQVKFDDILNSPTLDVACEKIASLAKAKELDSSLILLIN 300

Query: 301 SAWASAKESTTMKNEPLYYCGLSMSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQ 360
           SAWASAKESTTMKNE                                             
Sbjct: 301 SAWASAKESTTMKNE--------------------------------------------- 360

Query: 361 FISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEQ 420
                VKEIMY LYK TKS LRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSE 
Sbjct: 361 -----VKEIMYRLYKATKSGLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEP 420

Query: 421 KDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYL 480
           KDPNA+YTTP ELHKWIKIMLDSYHLNQE+TDIREAR M QP+VIQRLFILKDTIETEYL
Sbjct: 421 KDPNAIYTTPKELHKWIKIMLDSYHLNQEDTDIREARKMAQPIVIQRLFILKDTIETEYL 432

Query: 481 EQNESHNPQSKPNHVSEDAVSI 503
           EQNE  NPQSKPNHVS +AVSI
Sbjct: 481 EQNELQNPQSKPNHVSANAVSI 432

BLAST of CmUC04G070220 vs. ExPASy TrEMBL
Match: A0A6J1DBT6 (uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Momordica charantia OX=3673 GN=LOC111018874 PE=4 SV=1)

HSP 1 Score: 606.3 bits (1562), Expect = 1.2e-169
Identity = 343/484 (70.87%), Positives = 359/484 (74.17%), Query Frame = 0

Query: 1   MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFC 60
           MA  N LPF LS S  K+ IF    P     P I SA     S K SKS +PTT      
Sbjct: 1   MAMANYLPFHLSSSSPKTSIFPKALPEAPRNPLISSA----LSPKKSKSNHPTT------ 60

Query: 61  YWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQEETEVAEGYTISQFCDKIID 120
               I IT   ++           +S  ANDVAT E E Q E EVAEGYTISQFCDKIID
Sbjct: 61  ----ISITSPTKL--------KATASLGANDVATAEMEAQSEMEVAEGYTISQFCDKIID 120

Query: 121 IFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRID 180
           IF+NEKPKTKEWRK LVFREEWKKYRESFYSHCQRR DWESDP MKE+LISLRRKVKRID
Sbjct: 121 IFLNEKPKTKEWRKLLVFREEWKKYRESFYSHCQRRVDWESDPSMKERLISLRRKVKRID 180

Query: 181 DEMEIHSELLKELQGSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAA 240
           DEMEIHSEL KELQ SPTDINAIVAKRRK+FTEEFF FLTLISETHDSLEDRDAVARLAA
Sbjct: 181 DEMEIHSELFKELQDSPTDINAIVAKRRKDFTEEFFXFLTLISETHDSLEDRDAVARLAA 240

Query: 241 RCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN 300
           RCL+AVSAYDRTLE V+TLD AQAKFDDILNSPSLDVACEKI SL KAKELDSSLILLIN
Sbjct: 241 RCLSAVSAYDRTLEYVDTLDCAQAKFDDILNSPSLDVACEKIESLAKAKELDSSLILLIN 300

Query: 301 SAWASAKESTTMKNEPLYYCGLSMSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQ 360
           SAWASAKESTTMKNE                                             
Sbjct: 301 SAWASAKESTTMKNE--------------------------------------------- 360

Query: 361 FISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEQ 420
                VKEIMY LY+ TKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSE 
Sbjct: 361 -----VKEIMYRLYRATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEA 412

Query: 421 KDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYL 480
           +DPNA+YTTP ELHKWIKIMLDSYHLNQE+T++REARNM QPVVIQRLFILKDTIETEYL
Sbjct: 421 RDPNAMYTTPKELHKWIKIMLDSYHLNQEDTEMREARNMNQPVVIQRLFILKDTIETEYL 412

Query: 481 EQNE 485
           EQ E
Sbjct: 481 EQTE 412

BLAST of CmUC04G070220 vs. TAIR 10
Match: AT4G37920.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast thylakoid membrane, chloroplast, chloroplast envelope; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G36320.1); Has 123 Blast hits to 120 proteins in 40 species: Archae - 2; Bacteria - 11; Metazoa - 8; Fungi - 0; Plants - 85; Viruses - 0; Other Eukaryotes - 17 (source: NCBI BLink). )

HSP 1 Score: 479.2 bits (1232), Expect = 4.1e-135
Identity = 249/385 (64.68%), Positives = 295/385 (76.62%), Query Frame = 0

Query: 98  EEQEETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRA 157
           E+  E EVAEGYT++QFCDKIID+F+NEKPK K+W+ +LV R+EW KY  +FY  C+ RA
Sbjct: 70  EDPMEVEVAEGYTMAQFCDKIIDLFLNEKPKVKQWKTYLVLRDEWNKYSVNFYKRCRIRA 129

Query: 158 DWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDINAIVAKRRKEFTEEFFK 217
           D E+DPI+K+KL+SL  KVK+ID EME H++LLKE+Q +PTDINAI AKRR++FT EFF+
Sbjct: 130 DTETDPILKQKLVSLESKVKKIDKEMEKHNDLLKEIQENPTDINAIAAKRRRDFTGEFFR 189

Query: 218 FLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDV 277
           ++TL+SET D LEDRDAVARLA RCL+AVSAYD TLE+VETLD+AQAKF+DILNSPS+D 
Sbjct: 190 YVTLLSETLDGLEDRDAVARLATRCLSAVSAYDNTLESVETLDTAQAKFEDILNSPSVDS 249

Query: 278 ACEKIASLGKAKELDSSLILLINSAWASAKESTTMKNEPLYYCGLSMSLATILIAVVNYE 337
           ACEKI SL KAKELDSSLILLINSA+A+AKES T+ NE                      
Sbjct: 250 ACEKIRSLAKAKELDSSLILLINSAYAAAKESQTVTNE---------------------- 309

Query: 338 VLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLN 397
                                        K+IMYHLYK TKSSLRS+ PKEIKLLK+LLN
Sbjct: 310 ----------------------------AKDIMYHLYKATKSSLRSITPKEIKLLKYLLN 369

Query: 398 IVDPEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREAR 457
           I DPEERFSALATAF+PGD  E KDP ALYTTP ELHKWIKIMLD+YHLN+EETDI+EA+
Sbjct: 370 ITDPEERFSALATAFSPGDDHEAKDPKALYTTPKELHKWIKIMLDAYHLNKEETDIKEAK 404

Query: 458 NMTQPVVIQRLFILKDTIETEYLEQ 483
            M+QP+VIQRLFILKDTIE EYL++
Sbjct: 430 QMSQPIVIQRLFILKDTIEDEYLDK 404

BLAST of CmUC04G070220 vs. TAIR 10
Match: AT1G36320.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G37920.1); Has 93 Blast hits to 90 proteins in 22 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 85; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 275.8 bits (704), Expect = 6.9e-74
Identity = 148/391 (37.85%), Positives = 235/391 (60.10%), Query Frame = 0

Query: 92  VATTEKEEQEETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYS 151
           VA  EK++  E  V +   + + CDK+I++FM +KP   +WR+ L F +EW   R  FY 
Sbjct: 74  VAKEEKKDGSEEVVVDNQRMIKVCDKLIEVFMVDKPTPSDWRRLLAFSKEWDSIRPHFYK 133

Query: 152 HCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGS-PTDINAIVAKRRKE 211
            CQ RAD E +P MK K+  L RK+K +D++++ H+ELL  ++ + P +I  +VA+RRK+
Sbjct: 134 RCQERADSEDNPEMKHKVHRLARKLKEVDEDIQRHNELLNVIKRTPPAEIGELVARRRKD 193

Query: 212 FTEEFFKFLTLISET-HDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDI 271
           FT EFF+ L  ++E+ +D+ ++++A+A L    +AAV AYD + E+++ L++A+ K  DI
Sbjct: 194 FTNEFFEHLHTVAESYYDNPDEQNALASLGKLSIAAVQAYDTSTESIDALNAAEMKLQDI 253

Query: 272 LNSPSLDVACEKIASLGKAKELDSSLILLINSAWASAKESTTMKNEPLYYCGLSMSLATI 331
           +NSPSLD AC KI SL +  +LDS+L+L+I  AW++AKES  MK E              
Sbjct: 254 INSPSLDAACRKIDSLAEKNQLDSALVLMITKAWSAAKESNMMKEE-------------- 313

Query: 332 LIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEI 391
                                               VK+I+YHLY T + +L+ + PKE+
Sbjct: 314 ------------------------------------VKDILYHLYVTARGNLQRLMPKEV 373

Query: 392 KLLKHLLNIVDPEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQE 451
           ++LK+LL+I DP+E+ SAL  AF PGD  E  D + LYTTP  L   +K +L++YH ++E
Sbjct: 374 RILKYLLSIEDPQEQISALQDAFTPGDELEGTDVDYLYTTPEHLQSLMKTVLEAYHFSRE 414

Query: 452 ETDIREARNMTQPVVIQRLFILKDTIETEYL 481
            + ++EA+++  P +I ++  LK  +E +Y+
Sbjct: 434 GSLVKEAKDLMHPELIAKIEQLKKLVEKKYM 414
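
The Identity and Positives percentages in each HSP header above are the matched-residue counts divided by the alignment length. The following is a minimal Python sketch for reading such a header line and recomputing both percentages, assuming the line format shown on this page; `parse_hsp_stats` is a hypothetical helper written for illustration and is not part of any BLAST tool.

```python
import re

# Hypothetical helper (an assumption, not a BLAST API): parses one HSP
# statistics line as printed in the alignments on this page.
# The pattern accepts "Positives" as well as the spelling "Postives".
STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP header line."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    identical, length, positives, pos_length = (int(g) for g in m.groups())
    return {
        "identical": identical,
        "positives": positives,
        "aligned_length": length,
        # Percentages are recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / pos_length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 148/391 (37.85%), Positives = 235/391 (60.10%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Recomputing the percentages from the counts, rather than trusting the printed values, gives a quick consistency check when the page text has been copied or extracted.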

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038883874.1  8.2e-202  78.73     uncharacterized protein At4g37920 isoform X1 [Benincasa hispida]
XP_038883875.1  1.2e-200  78.49     uncharacterized protein At4g37920 isoform X2 [Benincasa hispida]
XP_008442081.1  2.9e-199  76.79     PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X1 [Cucumis ...
XP_004146379.1  6.7e-196  75.40     uncharacterized protein At4g37920 isoform X1 [Cucumis sativus] >KGN54831.1 hypot...
KAG6603165.1    8.2e-194  75.90     hypothetical protein SDJN03_03774, partial [Cucurbita argyrosperma subsp. sorori...

Match Name      E-value   Identity  Description
Q84WN0          5.8e-134  64.68     Uncharacterized protein At4g37920 OS=Arabidopsis thaliana OX=3702 GN=At4g37920 P...

Match Name      E-value   Identity  Description
A0A1S3B4W5      1.4e-199  76.79     uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Cucumis melo OX=3...
A0A0A0L3X1      3.2e-196  75.40     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G515540 PE=4 SV=1
A0A6J1F3Z5      4.0e-194  75.90     uncharacterized protein At4g37920 OS=Cucurbita moschata OX=3662 GN=LOC111439867 ...
A0A6J1HRT8      2.4e-191  74.90     uncharacterized protein At4g37920 OS=Cucurbita maxima OX=3661 GN=LOC111467208 PE...
A0A6J1DBT6      1.2e-169  70.87     uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Momordica charant...

Match Name      E-value   Identity  Description
AT4G37920.1     4.1e-135  64.68     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G36320.1     6.9e-74   37.85     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR Term   IPR Description                         Source       Source Term    Source Description               Alignment
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction              coord: 482..502
None       No IPR available                        PANTHER      PTHR31755:SF2  ENDORIBONUCLEASE E-LIKE PROTEIN  coord: 74..315
None       No IPR available                        PANTHER      PTHR31755:SF2  ENDORIBONUCLEASE E-LIKE PROTEIN  coord: 363..493
IPR040320  Uncharacterized protein At4g37920-like  PANTHER      PTHR31755      FOLATE RECEPTOR-LIKE             coord: 74..315; coord: 363..493

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CmUC04G070220.1  CmUC04G070220.1  mRNA