It is generally assumed that new genes arise through duplication and/or recombination of existing genes. The probability that a new functional gene could arise out of random non-coding DNA is so far considered to be negligible, as it seems unlikely that such an RNA or protein sequence could have an initial function that influences the fitness of an organism. Here, we have tested this question systematically, by expressing clones with random sequences in Escherichia coli and subjecting them to competitive growth. Contrary to expectations, we find that random sequences with bioactivity are not rare. In our experiments we find that up to 25% of the evaluated clones enhance the growth rate of their cells and up to 52% inhibit growth. Testing of individual clones in competition assays confirms their activity and provides an indication that their activity could be exerted by either the transcribed RNA or the translated peptide. This suggests that transcribed and translated random parts of the genome could indeed have a high potential to become functional. The results also suggest that random sequences may become an effective new source of molecules for studying cellular functions, as well as for pharmacological activity screening.
Results_IPTG_control
Table with read numbers for each clone for experiment plus/minus IPTG control shown in Table 1 of the paper. Further Tabs in the file include the respective DeSeq2 statistics for each sampled timepoint.
Results_Exp1
Table with read numbers for each clone for experiment 1 shown in Table 1 of the paper. Further Tabs in the file include the respective DeSeq2 statistics for each sampled timepoint.
Results_Exp2
Table with read numbers for each clone for experiment 2 shown in Table 1 of the paper. Further Tabs in the file include the respective DeSeq2 statistics for each sampled timepoint.
Results_Exp3
Table with read numbers for each clone for experiment 3 shown in Table 1 of the paper. Further Tabs in the file include the respective DeSeq2 statistics for each sampled timepoint.
Results_Exp4
Table with read numbers for each clone for experiment 4 shown in Table 1 of the paper. Further Tabs in the file include the respective DeSeq2 statistics for each sampled timepoint.
Results_Exp5
Table with read numbers for each clone for experiment 5 shown in Table 1 of the paper. Further Tabs in the file include the respective DeSeq2 statistics for each sampled timepoint.
Results_Exp6
Table with read numbers for each clone for experiment 6 shown in Table 1 of the paper. Further Tabs in the file include the respective DeSeq2 statistics for each sampled timepoint.
Results_Exp7
Table with read numbers for each clone for experiment 7 shown in Table 1 of the paper. Further Tabs in the file include the respective DeSeq2 statistics for each sampled timepoint.