Was
Darwin
Wrong?

What's Wrong with Independent Birth of Organisms?

A review of Periannan Senapathy's book
'Independent Birth of Organisms'

and an investigation into the origin of life

A review by Gert Korthof. updated 8 May 2026 (first published 29 Dec 2002)

Introduction

The origin of life is an age-old and unsolved problem. Independent researcher Periannan Senapathy came up with an extraordinary solution: the independent origin of all organisms, including humans (386, 387) from random DNA in a primordial pond. It is an overambitious theory that seeks to replace everything Charles Darwin and evolutionary biologists have put forward during the last 150 years. It is an anti-evolution theory, however it is a fully naturalistic theory, consequently it also contradicts Creationism, Intelligent Design and Theïstic Evolution.

The theory posits that life started with DNA. However, DNA is the least feasible option for the origin of life. The fundamental flaw of this DNA-centric view of life is that it overlooks the fact that DNA on its own can do nothing. DNA itself has no catalytic properties. DNA is completely dependent on enzymes to get replicated, transcribed, translated and methylated. No amount of STOP codon statistics can change that. The fact that the genome consists of 'small islands of genes scattered in an ocean of meaningless DNA', is not a clue to the origin of life. It is a red herring. It is a false lead.
Of course, enzymes are encoded by genes. But to read and translate those genes, specific enzymes need to be present first. In other words: it is a vicious circle. That's why the origin of life is a hard problem. It can't be solved by assuming that all those enzymes are present in a primordial pond along with a virtually infinite amount of DNA sequences. This would be nothing short of a miracle.

[ Introduction updated: 21 Jul 2025 ]

Periannan Senapathy (1994) 'Independent Birth of Organisms. A New Theory That Distinct Organisms Arose Independently From The Primordial Pond Showing That Evolutionary Theories Are Fundamentally Incorrect'.

Contents of this review

A misleading computer experiment
A test for randomness
Searching for exons, discarding introns
Eukaryotes with intronless genes, prokaryotes with introns
A static versus a dynamic genome
A DNA sequence is not a genome ***
Genome-centered view of life ***
Sexual reproduction is far more complicated than asexual reproduction
Can a male or female arise from haploid cells?
Can a male or female arise from diploid cells?
Did prokaryotes arise from eukaryotes?
All mammals require a mother
- All mammals require a father
Common descent versus independent origin
The role of randomness
The role of natural selection
The role of mutation
The role of adaptation: random perfection?
- Ecology and the food chain
The role of time: the chronological order of life
The role of place: the biogeography of life
The clumpiness of morphospace
The primordial pond is an unlimited resource *** 15 Jun 25
Incompatible requirements for a primordial pond
The final refutation of independent origin
Origin of life
Origin of species 2 Jul 2025
What is life?
PLOS ONE article
Nature Precedings articles
Conclusion. The elephant in the room 21 Aug 2025
- What I learned 3 Sep 2022
Conclusion after 20 years *** 14 Jul 2023
Appendix: Genetics Primer
Humans 23 Jun 2025
** Wikipedia article 31 Jul 2025
Notes
Further Reading

***) with hindsight these are the most important paragraphs.

chicken and egg problem:

Elephant in the room:

A general overview:

Independent origin
and the facts of life

Last Note

Important updates

15 Sep 2021

27 Sep 2021
Updated: 15 Apr 25

04 July 2023

1 A misleading computer experiment

Here is Senapathy's discovery in his own words:

"As I was working with the origin of genes from random genetic sequences, I realized that simple-to-complex gene evolution was quite unnecessary to explain the origin of complex genes found in multicellular creatures. I could demonstrate that the complex genes of multicellular creatures simply existed in very long random genetic sequences" (99)

"the genome would be mostly random DNA sequence with only small 'islands' of genes scattered in an ocean of meaningless DNA. Such an architecture actually exists in the genomes of all living multicellular organisms, with the intergenic sequences termed 'junk DNA'." (caption Figure 8.11)

In this figure from Senapathy's book a computer generated random sequence of the letters of the alphabet is shown:

AVTQMOIBIYUTTYRXBVGHSFRETYPNMKJBZXCVBFGTWRWEDDFALH
OILPMNKJUVBGHYFQSZVDFTRYOPMMJLAJSHJGFRTYQREFFGFBNBMI
ALKEIUQJLJRYTWSDHTRHFMNZBXVCHQYTNVHSKFYWURIOPMCVHY
HDFQIOREUYSKJGHADGLZXMNRBCNVYQNEUCBNRTYVBNFYUIRHJY
NBBNZCXJKWOPIKIUQWYRTOHVBCNMZJSGHFGTWRERUUIOPPMKJH
HGAWRYBCGDFHNXCYRQZCVNBIOVYZNSGHENMBKHJIYQXHFIAGHII
YOPPZNCVJHFGJJMMBJHQOVNMZBXVTRRQEWFHKPLOIQAZSGJHUIO
THKLPMLOKBESICUBNJCGTQRWETRYIIPOCNMKSALIWTYTYHCBZVX
ASPMQIZUXEMCCUVIEASRTTYPOIVLASFGUEYRTHNBCVXVAQWHGK
JFGURYTOPZDFGKHLUITYWRERYVNGFASDFLJIWKERHVNXJWIWZAQ
WSDDSXCMKOPLJHEDRFTGYHUBVFEWSXMUBTCWQAORWAGJLNVX
ZWSAQEDCBHYNMKOLPIUWQERVFCXNBMJHGFTIOLEWQAJUIMJTED
WSAQZXCVBNMKIJHYTEWQSDFHLOPKYQAMOECIMTBUNHFSKOPMQ
PMZALYBECMIQMXNCVHVKITUYQASLKJGNHVMCHFDZCERWTQYTU
IOMBJGPOQWUNXGDFFHSYRTYAZNGHKITYTQRHSDNJHKLJPGKJFJHS
LQEIWUZVNOTIHLOKIAFDJGDFKIUUTWTYERUJGHDNMHKJHKJKJKQ
ZATMLPOKJIMNBVZXUYTRWDKJHGFAASOJHSAWQHJKLYVOFQWED
FGHBVCDEWSQAZXMJIXOLPMYHTGRFEDSWQAZXCFRDIOPNJHGGVJ
JFHJJYGHRTEDWQEXSSDXGFVOPLKMJHJJBHNGHQWSRFGGYUHIOKP
OLMKJNBHVCZSSZWAWDFGOPNGYHYTOFFGVGBHWIYHGUHJQWSC
FVBMNIYUTPIIERJDFHOIURTYIERUQOBMNZBXCBNCGHFTRYGJHJJGH
GFGSOFSKSKHJJKTKMQIOYUTCBTREOPZWSETFBGJGNUJBDFEDLOPT
EGJALJQEPBMNZWAXIBECVBIYOYQWTREHJLMBNQACVIMJXKFGODR

Fig 7.1. The words TO, BE, OR, NOT, TO, BE found in a random string of letters in the right order but with nonsense in between the words (p.226).

This example is a simplification and therefore misleading:

This example symbolizes the sense strand of single-stranded DNA. But DNA is double stranded and always has a Positive-sense and a Negative-sense strand. (see: wikipedia). See my criticism: here.
Contrary to this text, all exons of a gene are organised in triplets. Consequently, STOP codons appear or disappear depending on where you start reading. STOP codons are relative to the reading frame. Furthermore, the 'words' (exons) must always consist of multiples of 3. (517).
it is not sufficient to search for genes, one also needs binding sites in DNA for many different regulatory proteins upstream of the gene (See his Genetics Primer, page 362).
Most importantly: Senapathy assumes a pre-existing language with pre-existing words with a pre-existing meaning. English: TO, BE, OR, NOT. Where do those words and meanings come from? Words have a beginning and an end. If words are genes there must be START and STOP signals in the sequence to delineate the words from their (meaningless) surroundings. Those signals are not in his computer simulation. Furthermore, different Start and Stop signals are required for transcription and translation. added: 17 Apr 2026

Highlighted are the pieces of Shakespeare's phrase "TO BE OR NOT TO BE". Senapathy admits that it is hopeless to search for an uninterrupted simple phrase "TO BE OR NOT TO BE". That is why he allows for the words being separated by strings of an arbitrary number of arbitrary letters. The phrase is present completely by accident, but it occurs with predictable frequency in a random sequence of letters of about 3500 characters long. So far so good: this is uncontroversial. For practical reasons, the example phrase contains only 2 and 3 letter words. If longer words were needed, then the distance between the individual words would be larger and so would the whole string of letters. Furthermore, it would nearly always be possible to find the words in the right order, says Senapathy. Furthermore, any sentence and indeed the complete works of Shakespeare (interrupted by nonsense words) can be found when the random sequence is long enough (22). Senapathy does not tell how long. This is not just a game with words. What is possible with words is possible with genes according to Senapathy. Just substitute words for 'exons' (335), the nonsense between the words with 'introns' (335) and you have Senapathy's theory! Please note that this is a mathematical theory. It is a claim about probabilities. Only when applied to real genomes, it does become an empirical claim.
His central idea is:

split genes (genes with introns) are easy to find in computer generated random DNA sequences. Genes without introns are impossible to find.
so when in the Primordial Pond DNA sequences are randomly assembled from their building blocks, genes with introns will easily be formed by accident (split genes are primordial; they were never in one piece (200), and introns were never inserted)
since split genes are only found in eukaryotes (plants and animals), eukaryotes must have originated first
prokaryotes (which don't have introns) must have evolved from eukaryotes by 'losing introns' (23).

Introns are pieces of DNA in the middle of a gene, which are removed after they are transcibed into mRNA (336) and before the gene is translated (250) in the cytoplasm into protein and so the intron sequence does not end up in the protein. Introns are central to Senapathy's theory because the occurrence of genes in random DNA sequences depends on finding split genes: genes with introns. He concludes that finding the uninterrupted gene sequences of today's prokaryotes in a computer-generated random DNA sequence is extremely unlikely (84). This is no surprise because bacteria have the most compact genomes and the highest gene densities of all species on earth (246, p.233). Therefore, in real life the random origin of current genes of prokaryotes must also be extremely improbable. On the other hand, eukaryotic genomes contain large amounts of DNA and a lot of junk DNA, so they are ideal candidates.

Ignoring everything else (!), the crucial test for Senapathy's theory is: if split genes cannot be found in a random DNA sequence with sufficient probability, then his whole theory breaks down. In figure 1, the words are only 2-3 letters, otherwise we would need pages full of letters to demonstrate the effect. For longer words, we simply need longer sequences of letters. Senapathy does not calculate how long. However, we can see from the figure that the meaningless (135) pieces of letters are far greater than the real parts of the gene. This is exactly what is found in real genes of eukaryotes (organisms with a nucleus in their cells such as all animals and plants). Therefore, it is not very surprising that Senapathy became enthusiastic for the thesis of independent origin of organisms. If we add the fact that more than 98% of the human genome seems to be meaningless junk DNA anyway, the theory seems plausible at first sight (287). Furthermore, the existence of introns is one of the great unsolved puzzles of molecular and evolutionary biology and Senapathy claims to have solved it.

sequence length	possible sequences
1	4
2	16
3	64
4	256
5	1024
6	4096
7	16,384
8	65,536
9	262,144
10	1,048,576
50	1.27x10³⁰
100	1.6x10⁶⁰
200	2.57x10¹²⁰

data from de Duve (270)
codon=triplet; bases=4

Probability of the human genome

Please note: in 1994 Senapathy could not know the complete sequence of the human genome. In 2001 the First Draft of the Human Genome Sequence was Released. So he could not test whether the human genome would pass a test for randomness.

Indeed, given enough time and resources, a computer could generate all the genomes in the world. Therefore, the probability is not zero. However, you do not have achieved anything if you do not calculate the probability of at least one complete genome (90). The probability of one gene does not help. The human genome happens to be very large. The haploid human genome has a length of ∼ 3 billion base pairs and the diploid genome is ∼ 6 billion base pairs (296).
The number of possible sequences of 6 billion bases is not calculated by Senapathy. So he has no idea how many trials he needs to produce a human being. Yes, it is true: the larger a piece of DNA, the higher the probability of finding genes. But at the same time, the larger a piece of DNA, the harder it is to synthesize it abiotically.
Please note, that the diploid genome cannot be reduced to the haploid because of heterozygosity (different alleles) and the X/Y pair.
For illustration, we can calculate the possible sequences up to 200 bases length (see table). Surprisingly, there are more than 1 million possible sequences of only 10 bases (which is very much smaller than the smallest exon). A sequence of 200 bases is still nothing compared to 6 billion bases. The probability of the human sequence of 6 billion bases must be indistinguishable from infinite.
We need to answer some basic but very difficult questions:

How many sequences of 6 billion bases are possible? (random sequences)
How many of those will produce a human being? (human sequences)
How many sequences of arbitrary length will produce a human being? (not considered here)
What is the smallest DNA sequence that produces a human? (not considered here)

Let us assume for the moment that only one sequence of 6 billion bases produces a healthy human being, then the probability is 1/A. Since A is ∞ infinitely large, the probability is 1/∞ or ∼ 0 (zero). However, there are now 7 billion people on earth and since each individual has a unique genome, there are 7 billion unique sequences (B) that produce a human being. But that is not all. If we count all humans beings ever lived on earth, we arrive at an estimated number of 108 billion (266). So, the probability to hit a human genome is 108 billion divided by A, which is still close to ∞ (I guess).
There is still more variability in the human genome we didn't include. For example, there are 38 million (mostly neutral) Single Nucleotide Polymorphisms (SNPs), 1.4 million bi-allelic indels and 14,000 large deletions in the human genome (295). There have been found 1,146,401 autosomal protein-coding SNVs in 15,336 protein-coding genes of 6,515 individuals (304). There are 117,277 mutations in The Human Gene Mutation Database. Furthermore, there are naturally occurring structural genomic variants in the human genome. Further, a survey showed that the average healthy person has about 20 genes knocked out (334). If we add every possible combination of SNP, mutation, indels and structural variants, B. rises above 108 billion (by an unknown amount). In Nov 2014 a new genome sequencing technology detected 22,000 segments of 50 to 5,000 bases in length that have never been reported before (347). More variation means more possible genomes able to create a human.
Another way of estimating the amount of neutral variation –compatible with health– is the following. Evolutionary analyses indicate that natural selection has conserved five times more base pairs that don't code for proteins than ones that do (1.5%) (267), so we arrive at 7.5%. Another estimate is that up to 74% of the human DNA may be transcribed into RNA (232). Others estimate that probably around 60% of the mammalian genome is transcribed (261). Comparative analysis of 29 mammalian genomes reveals a high-resolution map of >3.5 million constrained elements that encompass ∼4% of the human genome and suggest potential functional classes for ∼60% of the constrained bases (268).

Human chromosomes under a scanning electron microscope. ©Nature2017

Above we defined B. as all the sequences that produce a human being. However, in fact we should have added: how many sequences of 6 billion bases partitioned into 46 paired pieces (called 'chromosomes', see picture above) of specified length and contents. This is a far stronger requirement than simply 6 billion bases. Of course, any partition of the total DNA into chromosomes could produce a human, but the independent origin scenario demands the origin of the human genome as it is now. And then we have ignored we need not one human genome, but a female and a male genome! (See: §sex).

Another serious restriction applies: the actual human genome (46 chromosomes) must conform a pattern of sequence and chromosome similarities with all of the 1.9 million species on earth, particularly, but not exclusively, primates and mammals. This significantly reduces the number of acceptable human genome sequences, because not any human genome fits that pattern.

Before discussing introns and exons, it is wise to understand the analysis of John Maynard Smith:

If we imagine the simplest conceivable organism whose hereditary mechanism depends on the processes of nucleic acid replication and protein synthesis, it would have to possess enough DNA to specify all the varieties of tRNA, the protein and RNA components of the ribosomes, the activating enzymes associated with the 20 amino acids, the various enzymes which replicate the DNA and make an RNA transcript of it, and more besides." (93)

So far he only stated the problem that has to be solved. He continues:

"It is impossible that an organism of this degree of complexity should arise by physico-chemical processes, without natural selection." (John Maynard Smith, 93, p.111)

It is very useful to realize that according to Senapathy organisms with the complexity Maynard Smith described above, can originate spontaneously. But that's not all. According to Senapathy, organisms far more complex, so far more improbable than minimal life can arise spontaneously: "The very first cells were highly complex eukaryotic cells with a nucleus" (p.239). Please note, that I am not assuming the theory of evolution, but I am merely pointing out what is highly improbable for Maynard Smith is highly probable for Senapathy. Senapathy's computer simulation is so seductive because it ignores the complexity of minimal life.

A computer simulation is a virtual world

"The basic principle is that if genes were abundantly available in the primordial pond, they could have randomly assembled to form various genomes, each capable of forming an organism." Introduction page 5 (my emphasis)

This is the crucial passage where he makes his most fundamental error: genomes are capable of forming an organism. Even if Senapathy found the complete sequence of the human genome in a computer simulation (he did not), this would not prove that the human genome could originate in the real world. A computer simulation is a virtual world! Nothing can be inferred about the real world outside the computer. Certainly not what happened billions of years ago. For example: there is a lot more in a chromosome than DNA alone! (see: §A DNA sequence is not a genome). A computer simulation is a virtual world and life is chemistry! All life needs energy and molecules! A computer can produce almost instantaneously a virtual DNA sequence of a billion bases at virtually zero energetic costs. In the real world one needs at least Adenine, Thymine, Cytosine, Guanine, phosphate and deoxyribose for DNA synthesis. DNA must be synthesized! Energy is required! Even when these building blocks are present, it is still difficult to assemble nucleotides into chainlike DNA polymers that compose messages and carry out reactions (127, page 53), (525).
If life is ever created artificially, it will be in a test tube, not in a computer! (58). The seductive nature of computer simulations is so strong that it completely obscures the fact that a computer simulation is a virtual world. However, it must be said that Richard Dawkins (193) demonstrated the principle of natural selection with a computer experiment, the Weasel program, and the experiment started with a random sequence of letters!

Random sequence libraries

Recently, in vitro selection experiments show that it is possible that functional RNA's (ribozymes, ligases, polymerases) can arise from random sequence libraries (235). The first and perhaps still the most robust of these ligases were selected by Bartel and Szostak (1993) from a pool that encompassed 220 random sequence positions. This looks like the origin of genomes from random sequences. Please note: (1) these are chemical experiments, not computer experiments, 2) the length of the sequences are a million times smaller than the human genome, 3) this is about RNA, not DNA. DNA cannot self-replicate. 4) no organism is created, certainly not an eukaryotic organism from random sequences. These experiments are in the context of an RNA world, not a DNA-protein world. In 1985 Ballivet and Kauffman (251) created tens of thousands of random DNA sequences, so created what later became combinatorial chemistry. However, their goal was not to create genomes or organisms. Stuart Kauffman thinks it is possible to create minimal life forms from Collectively Autocatalytic Sets (CAS) (251), but these are certainly not complete prokaryotic or eukaryotic genomes. In 2009, Harvard University geneticist George Church unveiled a technique that lets researchers design millions of slightly different versions of a strand of DNA (272).

Random Genome Project 14 June 2023

Sean Eddy proposed a Random Genome Project (448). "He wonders what would happen if you insert a large amount of random DNA sequence into a genome, and he predicts that much of it would be transcribed [into RNA], indicating that pervasive transcription is not an indication of function." (401, ch 8, 10). Pleased note that random DNA would be injected in a cell and that means all cellular machinary would be present. So, this doesn't support the origin of life hypothesis.

The Eigen Limit
The maximum length (informational content) of a nucleic acid sequence is inversely proportional to the error rate of its replication (the Eigen limit). When the error rate exceeds this limit, new errors accumulate in the system, compounding the original error and resulting in eventual randomization - 'mutational meltdown' or 'error-cascade' (173), (192). So, what Senapathy does makes no sense. Of course your computer can generate endless long sequences. The point is however, that it has to be synthesized, copied, and maintained in the real world. It is important that any hypothesis be framed in light of our understanding of the physico-chemical properties of molecules (173). The Eigen limit forbids Senapathy's sequences, even when they 'self-assemble'.
Manfred Eigen (1992) concludes:

"The genes found today cannot have arisen randomly, as it were by the throw of a dice. There must exist a process of optimization that works towards functional efficiency. Even if there are several routes to optimal efficiency, mere trial and error cannot be one of them". (192)

2 A test for randomness

random DNA ©wikibooks.

Senapathy asks an interesting question about the real world: How would the DNA sequence of extant organisms look like, if it originated from random DNA? Is it random DNA? Let's test for randomness.

A test for randomness (improved 1 Mar 2018, extended 17 Mar 2026)

If DNA has directly arisen from random assembly of the building blocks and genomes are immutable, and evolution is not allowed, then all genomes we examine today should show randomness. A prediction is that the distribution of stop codons should be random. Is it? Actually there are two questions: What do we observe? What do we expect? Senapathy does not state this clearly at the beginning. Senapathy produces plots to answer both questions, but they are difficult to read, not well explained and do not seem to support his theory.
In a cell, a stopcodon is the DNA code for the end of the protein. If the distribution of stop codons in extant genomes would match a purely random distribution, it would be strongly suggestive for the random origin of genomes. The frequency of stop codons in a random computer generated DNA string must be calculated on the basis of the fact that 3 out of the 64 codons are stop codons, that is approx. 4,7%. (assuming this holds for the origin of life also). So we must expect that the average length of genes between two stop codons (ignoring START codons) would be 64/3 = 21 codons (= 63 bases) (my calculation, Senapathy does not show this). However, this is far too small for a real gene! The average length of a human gene is approximately 450 codons (1350 bases) (343). The Random Hypothesis fails spectacularly! End of story!
Extrapolating to the human genome of approx. 1 billion triplets in a haploid genome, there should be 4,6875 x 1.000.000.000 = 46.875.000 stop codons (more than 46 million!). This is because junk DNA should also contain stop codons despite the fact that they have no 'meaning'. Similarly, there should be STOP codons in introns with a frequency 3/64. Has this been found?

Even worse, a sequence of bases with only a STOP codon is not a 'gene'. A protein-coding gene must also have a START codon. Senapthy does not include START codons in his simulations.

"Thus, the sequence that exists between two successively occurring stop codons, which are separated by a sequence that is a multiple of 3, is called a reading frame (RF).
NOTES AND REFERENCES Note 16 page 599.
1. The presence of open reading frames (ORFs) alone is not sufficient evidence for a functional gene. The mRNA needs to be translated into proteins (516).
2. the presence of regulatory signals needed for transcription are required. (516)
3. RNA polymerase does not recognize start (AUG) or stop (UAA, UAG, UGA) codons to begin or end transcription. Those codons are instructions for the ribosome during translation, not for RNA polymerase. Instead, RNA polymerase recognizes specific DNA sequences called promoters to start and terminators to stop.
4. a putative gene must have the property that the encoded protein folds in to a functional 3D shape.
  A 'gene' can also be a regulatory gene (regulatory sequence) that is not translated in to a protein. So, they don't need START or STOP codons. But they are crucial for gene expression.
5. Integrating de novo genes into already existing regulatory networks (516)
6. the protein must have a beneficial effect on the organism.
According to Senapathy the expected length of genes in computer generated random sequences would vary from 0 (two stop codons next to each other) up to 600 bases (=200 AA), but "More than 95% of all random genes are shorter than 100 bases" (approx. 33 AA.) (page 234). Indeed! This is the reason that ORF detecting software defines ORF's as a minimum of 100 codons (343).
According to Senapathy 'genes' in organisms are often 9000 bases (=3000 amino acids, AA) long (page 234). So this is far above the length of the genes (200 AA) found in the computer generated sequences. Therefore, one must conclude that typical eukaryotic genes could not be formed directly from random DNA.

Figure 7.4. page 236. See also the figure in PLOS ONE article were he repeats the idea.

Nevertheless, Senapathy is not discouraged by this result. He invokes splicing at the RNA level in the Primodial Pond exactly as splicing out introns and combining exons in to a long ORF in present-day organisms (536). The proof is in the caption of figure 7.4:
"The only way a gene longer than 600 nucleotides could originate was to select some short reading frames and splice them together consecutively (in the primary RNA copy, not shown in the figure), by editing out the intervening regions containing many stop codons." (my emphasis) page 236 Chapter 7.
So, he thinks the test for randomness is a success because he invoked 'processing' and because he thinks that is a permitted step in a Origin-of-Life scenario. But you cannot save your theory by RNA-splicing. RNA-splicing is irrelevant because it leaves DNA intact! Does he really mean that all the processing occurs in the Primordial Pond? Remarkably, exactly this is shown in his Figure 7.4! Amazing! But this 'processing' is nothing other than RNA splicing because it takes place at the RNA level. The DNA stays intact, there is no processing at the DNA level. Introns are not removed at the DNA level. Therefore, the DNA of all extant eukaryotes must still have the signature of random DNA with abundant stop codons. But it does not. This kills the whole idea. This should be the end of the story. But apparently not for Senapathy.

"Figure 7.4 describes how a gene coding for a typical protein could have simply occurred in the long primordial random DNA sequence, with no evolution from shorter coding sequences. We can confidently conclude that in the primordial pond, long random genetic sequences existed, and long proteins were synthesized from them by first linking the short RFs (exons) and making them into a long contiguous coding sequence for a protein." page 235. (my empahsis)
Effectively, he claims that the same processing as in the cells of present-day Eukaryotes happens in the Primordial Pond:
"The RNA-splicing process that we are discussing now is just one such machinery that was the outcome of the random process." (page 239).
By supposing all complex biochemical machinery is present in the Primordial Pond, Senapathy didn't explain anything.

But it gets worse. A second problem with this idea is that he assumes that the key sequences necessary for proper splicing (splice recognition sites) are just there. How convenient! But, splice recognition sites are by defintion non-random. So, it doens't help very much that introns are random sequences. Splice recognition sites are non-random and they are enough to kill his theory (419).
Furthermore, the processing assumes full-blown splicing machinery. This is cheating. One must not only find exons! This is not shown in the computer simulation (Figure 7.1)! So we should not search for: TO BE OR NOT TO BE, but:

wxyzTOabcdefghi ... wxyzBEabcdefghi ... wxyzORabcdefghi ... wxyzNOTabcdefghi ... wxyzTOabcdefghi ... wxyzBEabcdefghi
( wxyz and wxyz represent splice recognition sites (363) and ... represents an intron). How else can the words (exons) TO BE OR NOT BE be recognized? (26). The real number of bases essential for proper splicing is unlikely to be less than 10 and is plausibly as high as 30 (39). There is a 5' splice site, 3' splice site and a branch site in the middle of the intron (Intron splicing). To be correctly processed to proteins, begin and end of exons need to be recognised. However, this would make the task of finding them in a random sequence far more difficult than Senapathy imagined.

There is a third reason why his scenario fails: DNA is 100% useless without cellular machinery, that is specific enzymes and proteins such as transcription factors, RNA-polymerases (538), Splicesome, Ribosome. Furthermore, Senapathy knows that a nucleus with membrane (nuclear envelope) is required to prevent that unspliced mRNA is being translated in to protein. That means that the nuclear envelope must be able to discriminate betwen spliced and unspliced mRNA (Nuclear pore complex) (535). In fact his scenario only works when a complete eukaryotic cell is present in the Primordial Pond! If you have to assume that, you didn't solve the Origin of Life. The bottomline: DNA and mRNA are useless outside a complete eukaryotic cell. In stead of a solving a problem, introns seem to pose his biggest obstacle: removing introns from random DNA requires a living cell.
See: The primordial pond is an unlimited resource; The elephant in the room. (17 Mar 2026).
Structure of an eukaryotic gene, in: Senapathy, Appendix: Genetics Primer, figure 10, p. 555.

Start codons: 25 Sep 2011, 25 Feb 2018 Although, he knows Start codons (see figure above and many others in the Appendix), Senapathy wrongly ignores them in his calculations (537). However, an Open Reading Frame (ORF) is defined as the sequence between a Start codon and a Stop codon. The necessity of Start codons (usually AUG) gives a further restriction of the length of ORFs. Start codons occur 1 in 64 codons (see genetic code table). If we combine Start and Stop codon frequencies, the predicted average ORF length would be reduced from 21 to 17 codons! (362). There is absolutely no room for introns and exons in such a short piece. So, the discussion of introns and exons is totally irrelevant. See further: The elephant in the room.
Transposable Elements 9 Nov 2012
Transposable Elements (transposons) constitute two-thirds of our own genome and 85% of the corn genome (297). They originate by duplication. Senapathy extensively discusses transposons in Chapter 4. He knows the work of Barbara McClintock. As far as I can see he does not make an estimate of what percentage of a genome consists of transposons (in humans 45%). The fact that transposons move means that the genome is dynamic. This contradicts his 'immutable' genome. Senapathy is not worried by such a contradiction.
Long terminal repeats 9 Nov 2012
Long terminal repeats (LTRs) are sequences of DNA that repeat hundreds or thousands of times. They are found in retroviral DNA and in retrotransposons. Because of the repeats they are not random DNA.
Codon usage bias
The genetic code is redundant. The number of codons for 1 amino acid varies from 1 - 6. For those having more than one codon, the different codons are used in unequal frequency. Some are rarely used, others with high frequency. For the amino acid LEU the most frequently used codon is used 140 times more often than the least frequently used codon (33). This is a genome wide bias. Some species such as Thermus thermophilus avoid certain codons almost entirely (306). However, there should be no such huge codon bias when genomes arose by chance from the primordial pond. In the independent birth scenario all codons for an amino acid are expected to occur on average with the same frequency. There is an explanation for codon usage bias: (354).
Non-random stop codons 4 Jan 2013
In the standard genetic code, there are 3 stop codons: TAA , TAG , TGA . Assuming these 3 stop codons, we could state that in a random genome these 3 stop codons are expected to have equal frequency (about 33%) with standard deviation. However, the distribution of stop codons within the genome of an organism is non-random and can correlate with GC-content. For example, the E. coli K-12 genome contains 63% TAA, 29% TGA, and 8% TAG (wiki), so is highly non-random. If one looks at the biochemical properties of the stop codons, then it turns out that, almost by miracle, the stopcodon with the highest frequency in the genome, TAA, is relatively effective, meaning that the machinery almost always stops at the stop sign. TGA is not so good. (447).
However, later findings conclude that "In most eukaryotes and prokaryotes TGA is used at a significantly higher frequency than TAG as termination codon of protein-coding genes." (495). (termination codon = stop codon).
Independent origin cannot explain the correlation between biochemical effectiviness and frequency in the genome, but natural selection can. Humans appear to be the exception: "Most other species overuse the best stop codon, while we overuse the worst." (447). Again a deviation from the statistical expectation.
Furthermore, the stopcodons are clustered in the genetic code table: they all start with T and two start with TA. That is not a random distribution.
Finally, or better: firstly, the whole idea of a 'stop codon' depends on a full standard genetic code! Which in turn depends on a living cell. And in order to function as a stop codon a Release factor is required, which is a protein. Where does that protein come from? See: elephant in the room.
Mononucleotide repeats 18 Nov 11
Are nucleotide sequences actually used by organisms a random sample of all the possible sequences encoding that particular amino acid sequence, or do they deviate from a random choice? Short mononucleotide repeats occur at about the frequency expected by chance, but longer mononucleotide repeats are substantially rarer than predicted by the null model in all three organisms C. elegans, S. cerevisiae, and E. coli. For example, in E. coli, the codon TTT is avoided in favor of TTC at positions immediately followed by a T. This reduces the frequency of runs of four Thymines, and thus indirectly also of longer stretches of Thymine that necessarily contain runs of four. (237). Explanation: as mononucleotide repeats are prone to slippage during transcription and translation, the most parsimonious explanation is selection against error-prone nucleotide composition.
Dinucleotide frequency 12 Nov 12 update

AA AT AC AG

TA TT TC TG

CA CT CC CG

GA GT GC GG

Since the genome of any organism arose randomly, each genome must have identical statistical properties. How could they differ significantly? For example, within the human genome each of the 4x4=16 possible dinucleotides (see table) should be present in equal frequency (100/16=6.25%). However, there is a notable depressed level of the CG dinucleotide (0.99% compared with 9.80% of TT dinucleotde) (108, p. 11). Vertebrates: CpG dinucleotides have long been observed to occur with a much lower frequency in the sequence of vertebrate genomes than would be expected due to random chance. On the other hand, there are CpG islands: genomic regions of several hundred base pairs with a high GC content. See also: Chargaff's GC rule. In contrast, the genome of most invertebrates contains largely unbiased frequencies of CpG dinucleotides (383) These facts alone refute random origin of DNA sequences.
GC content: 24 Sep 25
One of the most striking features of human protein-coding genes is the pattern of GC-content present along its length. In particular, it has been observed that GC-content is highest at the 5' end of genes, and that this decreases as one travels downstream and is lowest at the 3' end of genes. In addition, it has been observed that GC-content is higher in exons than in introns, with it being highest in the first exon and decreasing with every subsequent exon. This is also true in introns, with GC-content highest in the first, and decreasing with every subsequent intron (515). This non-random.
Microsatellites, or short tandem repeats 28 sep 12
Microsatellites consisting of two, three or four nucleotides repeats and can be repeated 3 to 100 times. Especially, the higher repeats very unlikely occur in random DNA. They are non-random.
Chargaff's cluster rule
Apart from the first parity rule (A=T C=G), Erwin Chargaff made three other fundamental observations on the base composition of DNA, which are only now being incorporated into mainstream biology. The first species-invariant observation was that individual bases are clustered to a greater extent than expected on a random basis (197) which contradicts random origin.
"Another consequence of our studies on deoxyribonucleic acids of animal and plant origin is the conclusion that at least 60% of the pyrimidines occur as oligonucleotide tracts [runs] containing three or more pyrimidines in a row; and a corresponding statement must, owing to the equality relationship [between the two strands], apply also to the purines." (197)
Chargaff's second parity rule
The second species-invariant observation was that Chargaff's first parity rule also applies, to a close approximation, to single-stranded DNA. The validity of the rule became clearer when full genome sequences became available. For example, the "top" strand of Vaccinia virus has 63921 A, 63776 T, 32010 C, 32030 G (197). The bacterium Sarcina lutea has an extreme (A+T)/(G+C) ratio of 0.35 (Biopolymer Chemistry). If DNA were random 25% A, 25% T, 25% C, 25% G are predicted. So, again random origin is refuted.
Chargaff's GC rule (GC-content)
The ratio of C + G to the total bases (A+C+G+T) (GC-content) tends to be constant in a particular species, but varies between species (197). The CG-content or AT/CG ratio of genomes is not 1:1. Already in 1952 CG percentages as low as 34,8% have been discovered in the DNA of insect viruses (34). In the bacterial kingdom the CG percentage varies from 25% to 75% (35). Acidianus infernus and Methanococcus jannaschii have 31% GC-content (255). A species with an extremely low GC-content is Plasmodium falciparum: ~20% (wiki). The genomes of extremophile organisms such as Thermus thermophilus are particularly GC-rich (wiki). Although some deviation from the mean of 50% is to be expected in a random sequence of DNA, extreme deviations are highly improbable and contradict random origin.
A test for randomness of phase 0,1,2 introns 16 Mar 11
An intron could start between or within triplet codons. The same is true for the end of an intron. Between codons is called (phase 0), within codons is called phase 1 (after the first base) or phase 2 (after the second base). If introns have the same phase at the beginning and end, they are symmetric, otherwise they are asymmetric. There are three symmetric intron types (0,0), (1,1), and (2,2) and six asymmetric types (0,1), (0,2), (1,2), (1,0), (2,0), and (2,1). The theory of random origin of introns and exons predicts that all 9 intron phases should have approximately the same frequency in random DNA. Already in 1992 Fedorov et al. showed that the proportions of the three intron phases were significantly unequal. In 1993 Green et al. showed excess phase 0 introns and excess symmetric exons in ancient conserved regions (ACRs). Later publications based on GenBank give the same results. The proportions of three intron phases and frequencies of nine associations of introns showed significant nonrandom distribution: 48% phase 0, 28% phase 1, and 24% phase 2 and all symmetric exons (0,0), (1,1), and (2,2) showed significant excess over a random prediction (199). Conclusion: random origin is refuted.
A test for randomness of proteins updated 15 Jul 2025
Ptitsyn (1984) (136) concludes "that primary structures of proteins are basically just the examples of random amino acid sequences which have only been 'edited' during biological evolution".
Pande et al (1994) (137) examine the possible random nature of protein sequences. The hypothesis was that proteins are slightly edited random sequences. They found pronounced deviations from pure randomness.
Keefe & Szostak (138) state that "Functional primordial proteins presumably originated from random sequences".
De Lucrezia (2012) investigated the question 'Are extant proteins the exquisite result of natural selection or are they random sequences slightly edited by evolution?' and concluded: "Altogether, our results suggest that natural proteins are significantly edited from random polypeptides and evolutionary editing can be readily detected analyzing structural features." (497). They use virtual random proteins as a control.
Even if proteins emerge spontaneously prebiologically, this is irrelevant for the origin of DNA-based life, because proteins cannot be translated in to DNA (Crick's central dogma of molecular biology). So, the result is useless for 'Independent Origin of Life'. However, the experiment is interesting in order to get an idea of the extent to which natural selection has acted upon proteins or the extent to which natural proteins deviate from randomness.
Douglas L. Theobald (2010) (130) wrote "Many proteins probably do exist that have independent origins. For instance, in the Metazoa certain protein domains have probably evolved de novo that are not found in either Bacteria or Archaea. However, the independent evolution of unique Metazoan proteins, by itself, is not evidence for or against UCA." (UCA= Universal common ancester).
Interestingly, Olaf Weiss et all (2000) concluded: "Our results show that proteins are fairly close to random sequences" and: "These results confirm the idea that protein sequences can be regarded as slightly edited random strings." (498).
In my opinion, and in accordance with Paul Davies, all natural proteins are necessarily a subset of the total of all protein sequences, in other words: the universe of protein sequence space. The question is: is there a way to distinguish the 'very, very special subset of random sequences' from the set of all protein sequences?
Non-random genome architecture
Are genomes randomly arranged assemblages of genes or is gene order non-random? Genome architecture (that is, the order, spacing and orientation of genes in the genome) can be highly non-random (239).
Low-Complexity Sequences or Repetitive Sequences
Eukaryotic genomes contain vast amounts of repetitive DNA derived from transposable elements (TEs) (269). Low complexity sequences (LCRs) are well known within coding as well as non-coding sequences (496). The more Low-Complexity or Repetitive Sequences occur in eukaryote genomes, the more difficult they are to explain under the random origin scenario. The most difficult to explain are tandem repeats.

Paul Davies about genomes

updated & improved 19 Aug 2025

"If genomes are information-rich, then they have to be random (or almost so). If biological organization is random, its genesis should be easy. [!] The vast majority of possible sequences in a nucleic-acid molecule are random sequences. Only a tiny, tiny fraction of all possible random sequences would be even remotely biologically functional. A functioning genome is a random sequence, but it is not just any random sequence. It belongs to a very, very special subset of random sequences" (40).

These are illuminating and profound statements! If Senapathy would have claimed that, from a mathematical perspective, all existing DNA sequences are a subset of all possible DNA sequences, he would be right, by definition. Unfortunately, that would be rather underwhelming. However, Senapathy claimed that all eukaryotic genomes originated from random DNA sequences in the real world. And that's a huge difference! Without noticing, he switched from mathmatical truths to real-world truths. Senapathy mixed up those two claims. Unfortunately, he completely missed "but it is not just any random sequence. It belongs to a very, very special subset of random sequences". Any Origin of Life hypothesis must explain the origin of that tiny subset of random sequences. Senapathy took Davies' idea literally: if biological organization is random, its genesis should be easy! "It's easy" is exactly what Senapathy claimed in his book. But it isn't easy! The Origin of Life is a hard problem.

Senapathy's claim that all genomes arose from random DNA sequences, is in fact a Theory Of Everything (TOE) in biology, because the TOE 'explains' all real and possible genomes. In that sense, eukaryotic genomes are –just as prokaryotes– a tiny subset of the set of all possible DNA sequences. Introns are DNA sequences. Exons are DNA sequences. The genomes of organisms without introns (prokaryotes) are also DNA sequences. Coding as well as non-coding sequences are subsets of the set of all possible DNA sequences. There is no difference: all genomes are DNA sequences. The opposite, namely all DNA sequences are genomes, is not true. That's Davies' point. Senapathy's TOE explains nothing about the real world.

Laurence D. Hurst about introns 11 April 2025
There are three independent genome size variables: (1) number of introns in genes, (2) intron size in genes that have introns, (3) length of DNA between genes.
"Given that these three measures are independent, there is no logical reason why you should not have, for example, a large gap between genes but small introns within genes. Looking from single-celled organisms to plants, animals, and the like, we find, however, that these three measures are correlated: species that have lots of introns tend also to have bigger introns, and the intergenic DNA tends also to be longer." Chapter 2. (446).
If genomes were random DNA, there should be no correlation between the three variables! Correlation is a deviation from randomness. For example, there should also be genomes with very small random intergenic sequences, but with only a few very large introns in the genes. These considerations are solely for the sake of argument, see: The elephant in the room.
Small genomes are more likley than large genomes
The random origin of genomes scenario predicts that small genomes are more likely than large genomes on pure statistical grounds. The larger the genome, the rarer it must be. For example, single-celled eukaryote species should be more frequent than multi-cellular eukaryotic species.

3 Searching for exons, discarding introns and enhancers

Let's ignore here that the average eukaryote has a genome size 1000 times larger than an average prokaryote (246, page 52), and let's ignore all other problems (see § 6: DNA) and focus on introns and exons. Senapathy searches for exons (protein coding DNA) (549) and ignores the contents of introns (non-protein-coding DNA) in his random genomes. However, if one cannot ignore introns, Senapathy's strategy would fail. Are introns just random noise as he assumes in his computer simulations? It is important to distinguish eukaryotic spliceosomal introns (which are not self-splicing and do not code for proteins) from Group I (which are self-splicing) and Group II introns (which may code for proteins). Recently it has been discovered that introns do sometimes have identifiable functions (41). Also, comparative sequence analysis has revealed that the sequence of some introns is highly conserved, suggesting that functional contraints operate. Some of the observed conservation can be attributed to nonrandom sequences required for RNA splicing (150). In that case Senapathy is not justified in discarding introns and they should be included in his computer simulation of virtual genomes. But then his search for genes (exons+introns) in random DNA certainly would fail, just as it fails for prokaryotic genes.

What if introns are not random? updated 28 Feb 11
Senapathy's analysis is based on the idea that exons are non-random (functional) and introns are random DNA (non-functional). Maybe this is because it was generally assumed that the sequence of any given intron is junk DNA with no biological function. An indication of non-functional random introns is an intron length that is not divisible by three nucleotides (3n). The random character is also supported by Patthy (1999) based on data from Li & Grauer (1991) about substitution rates of spliceosomal introns. More recently, however, this is being disputed. For example, a point mutation in intron 7 of the human gene TPH1 is highly correlated to the development of the psychiatric disorder schizophrenia (wiki). Some introns are known to enhance the expression of the gene that they are contained in by a process known as intron-mediated enhancement (IME). One of the most important roles of introns currently under investigation is the transcription of the introns to small regulatory RNA, such as a type of RNAs called miRNA (microRNA). These small single-stranded RNAs regulate the expression of genes (wiki).
Additionally, introns need to be removed from mRNA, so they require precise recognition sites. The budding yeast Saccharomyces cerevisiae has a highly stringent seven-nucleotide branch-point sequence requirement (163). Nearly all eukaryotic nuclear introns begin with the nucleotide sequence GT, and end with AG. That is a nonrandom sequence.
Introns also encode a variety of untranslated RNAs including microRNAs, small nucleolar RNAs (snoRNAs) and guide RNAs for RNA editing (171). Some introns are known to enhance or be necessary for normal levels of mRNA transcription, processing and transport (171). Introns also contain many highly conserved elements: there are 100 ultraconserved elements shared by human, mouse and rat (171). There are 154 highly conserved intronic sequences in the chimpanzee genome (198). However, this is only a minority if the total number of introns is more than 100,000 as in humans. Nonetheless, those highly conserved intronic sequences are nonrandom sequences just as exons.
So, what if (some) introns are not random? If some introns are not random, they are just as improbable as exons, and cannot be ignored. In fact, the whole idea that one can ignore introns while searching for genes is wrong, because genes have a nonrandom number of introns and a nonrandom position within a gene (relative to the triplets). If human genes have on average 8 introns, plant genes 4 introns, fish and insects 3 introns (165), then Senapathy has to explain that this nonrandom pattern could originate by independent origin of genes from scratch. He failed to do so.

What about small introns?
Senapathy's theory predicts the same average intron length for all animals and plants based on statistics alone. However, the eukaryote Oikopleura genome contains introns which are very small (peak at 47 bp, only 2.4% > 1kb) (132). The unicellular eukaryote ciliate P. tetraurelia: most introns in its genome (>96%) are very short (<34 nucleotides) (239). Strikingly, all introns in intron-poor genomes of unicellular eukaryotes are short, with nearly uniform, apparently tightly controlled lengths and conserved, optimized splice signals at exon-intron junctions (246,page 235). In general, in vertebrates there are relatively long introns and short exons, whereas in lower eukaryotes, introns are short and exons are long (209). See also: insects with short introns: (344). This is a problem for Senapathy, because he needs long introns and short exons in every organism.

Identical intron positions not predicted
The positions of many introns are identical in orthologous genes of animals and plants (259). A small number of identical intron positions could be expected by chance, but too many is very unlikely. A random origin scenario should produce arbitrary intron positions in exons. Any position is acceptable as long as introns are removed precisely and reliably. So, identical intron positions refute the origin of introns-exons from random DNA.

Regulatory sequences

Update 13 June 2023: This paragraph has been moved to (402) and replaced by the following text:

Senapathy knows that the expression of genes require DNA bindingsites for regulatory proteins and promotors (See his Genetics Primer Appendix page 562). Without regulatory sequences nothing happens, the genome is dead. So, he should know that it is not sufficient to search for ORFs (Open Reading Frames) in random DNA, but the search should be a combination of the necessary regulatory sequences followed by the gene (ORF) itself. Furthermore, he should know the exact sequence of the regulatory sequences, because they are not random sequences. One example of many regulatory sequences is: TATA box (first identified in 1978). If he does not take regulatory sequences in to account, his theory fails. Above that, he knows that regulatory proteins are necessary: where do they come from? You need gene products before any DNA sequence can be read: a vicious circle: The elephant in the room!

Genes in the introns of other genes
There are 25 candidates for single-exon genes that are located within an intron of a gene on the same strand (125). Examples are the gene for neurofibromatosis type I (NF1) which contains 3 genes in the opposing strand of the intron; intron 22 of the Factor VIII gene contains 2 other genes; intron 17 of the retinoblastoma susceptibility gene RB1 contains another gene (167).
A substantial fraction of genes in complex eukaryotic genomes is contained within introns of protein-coding genes. In C. elegans only ~2.5% of protein-coding genes are so nested, whereas nearly 50% of non-protein-coding RNA genes found in introns (230).
Chlamydomonas reinhardtii (a unicellular green alga) has at least 70 small nucleolar RNAs (snoRNA) gene clusters within introns of protein-coding genes. This algae has the highest number of intronic snoRNA gene clusters among eukaryotes, and shows the functional importance of introns in a single-celled organism (231).
Conclusion: Senapathy's method is based on the idea that introns are meaningless random DNA. Now, we know this is wrong. So, according to his own logic the eukaryotic genome cannot arise in the primordial pool.

Enhancers in introns
Enhancers are frequently located within the introns of genes. While often found in intergenic regions, tissue-specific enhancers are highly enriched in intronic sequences, particularly the first intron. They can increase transcription regardless of their orientation or position within the gene. (546)

What if there is no absolute difference between introns and exons?
From the point of view of protein coding potential there is no difference between introns and exons. The previous examples of genes (in the opposite strand) in introns of genes (overlapping genes) is a demonstration of the principle. Also, intron splicing sites can be deleted or created de novo quite easily by mutation (179) as long as they are in frame. Also, different cell types of the same individual can interpret the same sequence of a pre-mRNA either as an exon or as an intron (Alternative splicing) (305). Furthermore, the definition of exons and introns is not absolute because there are weak and strong splice sites, and also exonic splicing enhancers and exonic splicing silencers.

Figure from: 195.

The changing paradigm of intron retention: regulation, ramifications and recipes (414)

Exonization is the process through which an intron becomes an exon (179). This is also known as Intron Retention (IR) (414). Intronization is the process through which an exon becomes an intron (195). Human Alu elements located in introns can be exonized (191). In humans introns <100 bp in length are retained in 95% of the genes (186). Intron retention is most common in lower metazoans and is also common in fungi and protozoa (exonization). The prevalence of exon skipping gradually increases further up the eukaryotic tree (209). This means also that there is no absolute, static difference between introns and exons (intronization).
Knowles and McLysaght (2009) demonstrate for the first time that human genes have arisen de novo from noncoding DNA since the divergence of the human and chimpanzee genomes (190). "In humans 2-5 % of the genes have been reported to retain introns" (187). Because of alternative splicing, the distinction between exons and introns is no longer absolute (179). Every sequence could be a useful RNA or protein (if translated). The difference between extant exons and introns is that exons (in combination with splicing machinery and splice sites) have been tested by natural selection for their usefulness in the organism. In 2007 (181) it has been found that almost 10% of alternatively spliced human genes involves the retention of an intron. High levels of intron retention (30 % of alternatively spliced genes) in the plant Arabidopsis thaliana are reported (189).
De novo origin of introns: several examples were found in human genes in which insertion of Alu, a primate-specific retro element, into an exon created a new intron in the 3' UTR (untranslated region) (209). This means that the whole idea that introns just occur in random DNA is wrong. They can arise by insertion.

What if there is no absolute difference between coding and non-coding DNA? new 19 Jan 13
Any sequence of DNA that is (1) transcriptionally active, and (2) has a translatable open reading frame could be a protein coding gene. This looks good for Senapathy. However, 'reading frame' implies a 'reader'. What is the reader? Where does it come from? Pioneering research in 2006 clearly showed that new genes could originate from non-coding sequences in Drosophila. Levine et al. identified five novel genes in Drosophila melanogaster that were derived from non-coding DNA (307). Again, the point is: those novel genes don't do anything unless they are sitting in a cell. So, Senapathy is begging the question: where does that cell come from?

tRNA introns.
Transfer RNA introns that depend upon proteins for removal occur at a specific location within the anticodon loop of unspliced tRNA precursors, and are removed by a tRNA splicing endonuclease. The exons are then linked together by a second protein, the tRNA splicing ligase (wikipedia). So, intron removal requires two nonrandom proteins: where do those proteins come from? See the complete elephant.

Noncoding DNA: junk DNA?

Senapathy is searching for 'protein coding genes' (exons, ORFs) in random DNA and ignores noncoding DNA. The most surprising discovery about the human genome is that the majority of the functional sequence does not encode proteins (271). Because these sequences do not code for proteins, stopcodons have no meaning.
Protein-coding sequences, which comprise only ~1.5% of the genome, are dwarfed by functional conserved non-coding elements (CNEs) which constitute 6% of the human genome (271). Furthermore, about 80% of the cell's DNA showed signs of being transcribed into RNA (134, 287). The transcriptome of the fruit fly Drosophila melanogaster reveals that some 75% of the organism's genome is transcribed at one stage or another – in line with the widespread transcription observed in other species (203). In animals several hundred thousand up to 3 million strongly conserved noncoding regions (CNCs) with a mean length of 28 base pairs are scattered throughout vertebrate genomes (221).
Furthermore, a considerable fraction of the junk DNA could be involved in chromatin structure maintenances and remodeling such as scaffold/matrix attachment regions (SARs/MARs) (117). Furthermore, in non-coding regions important transcription-factor binding sites (TF-binding sites) are found. Much of this has only recently been discovered. Senapathy could not know this. However, it invalidates his theory nonetheless.

Long non-coding RNA does not have START and STOP codons

Long noncoding RNAs are RNAs transcribed from DNA longer than 200 nucleotides. They do not encode proteins; they have no START and STOP codons. So, his search for ORFs would miss all the noncoding RNAs. lncRNAs were largely unknown in 1994.
In 20212 the ENCODE project revealed that 75% of the human genome is transcribed into non-coding RNA, and that there may be between 10,000 and 200,000 long non-coding RNA (lncRNA). Other estimates are more than 100,000 lncRNAs in the human genome. Scientists have shown that these can activate gene expression and silence genes (311). There are about 5000 noncoding RNA genes in total (412).

microRNA does not have START and STOP codons [6 Dec 2024]

microRNA are small, single-stranded, non-coding RNA molecules containing 21–23 nucleotides. Because they are not translated in to protein they have no START and STOP codons. (neither do they have triplets). Humans have about 500 miRNAs. Senapathy would miss all noncoding DNA and RNA.

4 Eukaryotes with intronless genes, prokaryotes with introns

"There were absolutely no contiguous genes (as found today in prokaryotes) in the primordial pond." page 247.

Let's ignore all other problems (see § 6: DNA) and focus on introns and exons. Senapathy claims he can find eukaryotic genes in random DNA because they are interrupted with random non-coding DNA (introns). What if eukaryotes have intronless genes too? Interestingly, many eukaryotic histone and GPCR genes are predominantly 'intronless'. A number of vertebrate 'intronless' genes have been compiled. The human genome report identified 901 single exon genes (source). Recently, intronless genes or single exon genes (SEG) have been discovered in eukaryotes (45). The process by which these genes are produced is making a DNA copy of processed intronless mRNA (called retroposition). The genes are called retrogenes. It is based on reverse transcription of mRNA (see: Steele review). These intronless retrocopies were long thought to be doomed to decay and were routinely classified as processed pseudogenes because of the expected lack of regulatory elements and the presence of deleterious mutations in many copies. Nevertheless, individual functional retrocopies (retrogenes) have been discovered since the late 1980s (118). Retroposition is an important mechanism of gene copying and produced a large number of functional genes in mammalian genomes. Proof is the existence of human intronless retrogenes and their parental intron-containing homologs (274). Retroposition produced approximately 1000 functional intronless genes in humans. Above the functional intronless genes, geneticists have found no less than 644 processed (intronless) pseudogenes on the human X chromosome (57).

Intronless genes

Drosophila melanogaster has
nearly 4500 intronless genes

Intronless genes in eukaryotes are more widespread than previously thought and do not necessarily depend on retroposition. A famous example is the SRY gene (Sex-determining region Y) which has only a single exon of 850 base pairs and no intron. In mammals 6% of the genes are intronless (196). This is more than enough to exclude the independent origin of mammalian genomes. Sakharkar et al (143) have identified 2017 expressed intronless genes in the mouse genome. About 5% of human genes lack introns; there would be at least 602 intronless human genes (182). Examples are: histones, olfactory receptor genes and G protein-coupled receptors (GPCRs) (more than 90% of mammalian GPCRs are intronless 182). According to other researchers, humans have 6229 intronless genes or 16.7% of the total number of genes (253). The spider mite Tetranychus urticae has 2966 intronless genes and the fruit fly Drosophila melanogaster has nearly 4500 intronless genes (241). Evolutionary analysis reveals that 56 intronless genes are conserved among the three domains of life--bacteria, archea and eukaryotes. Jain et al (144) reports the presence of 11,109 (19.9%) and 5,846 (21.7%) intronless genes in rice and Arabidopsis genomes. A total of 301 and 296 intronless genes from rice and Arabidopsis, respectively, are conserved among organisms representing the three major domains of life, i.e., archaea, bacteria, and eukaryotes. The yeast species Saccharomyces cerevisiae and Candida albicans (eukaryotes) are devoid of introns in >90% of their genes. Nearly all the genes (99.5%) of a red alga species (unicellular eukaryote) are intronless (49). Leishmania species (Eukaryota) have a unique genomic organization among eukaryotes; the genes do not have introns (373).
Mammalian G-protein-coupled receptor (GPCR) genes (this protein family is one of the largest in the mammalian genomes) are characterized by a large proportion of intronless genes or a lower density of introns when compared with GPCRs of invertebrates (166). A very small minority of human genes lack introns and are generally very small genes, examples being histone genes, many small RNA genes, various neurotransmitter and hormone receptor genes and autosomal processed copies of intron-containing X-linked genes (167).

Introns are very rare in animal mitochondrial genomes and animal mitochondria lack group II introns (153). The sequence of human mitochondrial genome shows extreme economy in that the genes have none or only a few noncoding bases between them (154). According to the theory of independent origin mitochondrial genomes in animals could not originate by independent origin. But mitochondrial genomes are essential for animals. Mitochondrial genes and nuclear genes cooperate and are both necessary for the organism. Senapathy knows that mitochondria and chloroplasts are present in eukaryotes (chapter 7 page 230) but he does not know that those organelles have very compact intronless DNA.

Conclusion: many eukaryotic genomes cannot be found in a random piece of DNA using Senapathy's search strategy and consequently his theory of independent origin would fail. Of course, Senapathy could not have known the extent of this phenomenon when he wrote his book (46), but it nevertheless refutes his theory of independent origin of many eukaryotes.

Wide variety of intron-density in eukaryotic genomes
Recently, attention has been drawn to eukaryotic genomes with very few introns (142). Unicellular eukaryotes with compact genomes have only a few introns (149). The intron density of annotated eukaryotic genomes varies by more than three orders of magnitude: from 140.000 introns in the human genome (intron density 8.4 introns per gene) to only 15 introns in the microsporidian Encepalitozoon cuniculi (171). Eukaryotes dramatically differ in their intron densities, ranging from only a few introns per genome in many unicellular forms to over 8 introns per gene in vertebrates as well as some invertebrates like the sea anemone (258). Only a single spliceosomal intron has been found in the intestinal parasite Giardia lamblia (164). The average number of introns per gene in most multicellular species is between 4 and 7, whereas the average number for most unicellular eukaryotes is less than 2 (164). Additionally, species with compact genomes also have small introns: the insect Belgica antarctica has a mean intron length of 333 bp. Compare this with: Drosophila melanogaster: 955 bp. and Aedes aegypti: 3728 bp. So, the introns of B. antarctica are more than 10 times smaller (344).
But according to Senapathy, all eukaryotes should necessarily have the same intron statistics because all those genomes are random DNA sequences. There should be no systematic difference between unicellular and multicellular eukaryotes. Also the gene densities (genes/Kb) of unicellular eukaryotes and vertebrates differ by a factor 500 (246, page 233). This is not compatible with random sequences. Furthermore, the intron-poor compact eukaryotic genomes have close to the same low probability for random origin as prokaryotes, and thus could also not originate in the primordial pond for this reason alone.

Prokaryotes with introns
Group I and II introns are both found in some bacterial and organellar genomes (152). Group I introns interrupt rRNA, mRNA and tRNA genes in bacterial genomes (wiki). Group I introns have highly conserved structure and function across all species in which they are found (150). Some group I introns encode homing endonuclease (HEG), which catalyzes intron mobility (151). Group II and III introns are similar and have a conserved secondary structure. Group II introns are ribozymes. Group II introns are now being found in unexpected numbers in bacterial genomes (here). Group II introns are found in ~25% of the sequenced bacterial genomes (165). Group II introns are also known in Archaebacteria. Group II introns are catalytic RNAs (ribozymes) that can self-splice in the absence of protein. A phylogenetic tree can be made for introns. Some introns code for a Reverse Transcriptase (RT) (here). Group II introns are complex elements that encode a large protein containing a reverse transcriptase domain and several accessory domains, and have a (nearly) uniform size of approximately 2.5 kb (149). Remarkably, almost all introns identified so far encode reverse transcriptase ORFs. So they are not random pieces of DNA.
All this means that Senapathy needs to revise his theory. Concerning introns there is no absolute distinction between prokaryotes and eukaryotes. The relevant distinction for his theory is not prokaryote/eukaryote, but number and length of introns of a species. And that results in a gradual scale from intronless, via intron-poor to intron-rich species. His theory must predict a minimum number and length of introns and Senapathy needs to specify that minimum. (See also: § 28).

Introns are spliced in the nucleus 19 Jul 13
Introns are spliced out in the nucleus of the eukaryotic cell. So, without nucleus and cytoplasm there can be no intron splicing. So, Senapathy has to explain the origin of the eukaryotic cell first. Explaining the origin of DNA with introns does not help to solve the origin of life. First, there has to be a cell. Senapathy overlooked an elephant in the room. See the complete elephant.

5 A static versus a dynamic genome

updated
30 Nov 25

"Remember, whenever we refer to immutable genome, we mean that the genome of one organism is not changeable to the genome of another organism with a new gene or a new body structure. The genes and the sequences in the genome can mutate, but the genome itself, speaking functionally, does not mutate." Note 117, page 589.

"In 1947 Barbara McClintock discovered, through a series of genetic studies in the corn plant (maize), that some genes were jumping around the genome." page 107 (chapter 4)
"Conclusion: Transposons cause genetic effects which are incapable of any evolutionary potential; these are passive parasitic processes occurring in immutable genomes" page 117 (chapter 4)

In chapter 3 Senapathy argued that 'The genome of every distinct organism is closed to evolutionary change'. In his theory genomes are born directly from the basic DNA building blocks and they do not change thereafter. His genomes are immutable. "This static view of living things presumes that no transition is possible in time or form between kinds, and that variation is regarded as accidental or inessential noise rather than important information about taxa." (527).
However, in chapter 4 Senapathy argues that the following 9 types of mutation: transposition, gene duplication, exon shuffling, point mutation, chromosomal rearrangements, recombination, crossing over, pleiotropic mutation and polyploidy cannot cause new genes or new species. So he accepts the existence of these 9 genomic mutations. This static view conflicts with the above mentioned 9 types of mutation. How can a genome be static and dynamic at the same time? He confuses the concept 'genomic mutation' with 'evolutionary useful genomic mutations' (417). Clear thinking demands clear concepts.

Transposable elements in the human genome [NATURE]

This static view conflicts with reality. Real genomes show characteristics of change. Real genomes are dynamic. Nearly 45% of the human genome is made up of jumping genes, Transposable Elements (TEs), retrotransposons or mobile elements. Transposable elements are relatively short sequences of DNA which can copy themselves and insert themselves randomly elsewhere in the genome. For example, our genome contains some 1.4 million copies of 300 base pairs (called Alu). Many of these Alu elements are continuing to multiply and insert themselves in new locations in the genome at a rate of about one new insertion per every 100 to 200 human births (59). Recent estimations are 1 Alu insertion for every 20 births in humans (242). Alu elements are the most successful Transposable Elements in the human genome. The human genome includes also more than 500,000 dead copies of L1 Transposable Elements and roughly 100 active L1 copies, that are able to spawn new L1s that jump to new chromosomal locations (206). A few short repetitions can be expected by chance, but not more than a million repetitions. A randomly generated genome does not contain such highly repetitious patterns. Senapathy's random genome model fails to explain this pattern. Furthermore, the active mobile elements are not random sequences, they contain open reading frames with genes which allow to copy, move, and insert themselves.

Human DNA pie chart
Human genome partitioned in different fractions.
Transposons are the dynamic element in the human genome.

Age distribution of Transposable Elements in the human genome,
The graph shows the fraction of the current genome that consists of TEs that inserted at a given time in the past.
Futuyma, Kirkpatrick, (400).

Unexpected variability in human genomes exists. Not only do we carry different copy numbers of parts of our DNA, we also have varying numbers of insertions-deletions (INDELs) and other major rearrangements in our genomes. There are at least 297 places in the genome where different individuals have different forms of these major structural variations (62). More than 800,000 of the small INDELs map to human genes, including 2,123 small INDELs that mapped to the coding exons of these genes, and more than 39,000 INDELs in the promoter regions of genes (205). (315).

Polyploidization: Another dynamic aspect of genomes is polyploidization. Example: about 100 million years ago the genome of a yeast ancestor duplicated, doubling the number of chromosomes from 8 to 16 (222). Most plant species have experienced at least one genome doubling early in their history (223) and about 35% of vascular plant species being recent polyploids ("neopolyploids": having formed since their genus arose) (224). The bread wheat (Triticum aestivum) genome is hexaploid, meaning that it contains 6 sets of chromosomes, which derive from three different diploid genomes (302). The record-holder is Celosia argentea which is a dodecaploid (twelve sets; 12x). Think about it: all polyploids could not originate in the primordial pond. Only the original (unduplicated) genome could (ignoring all other problems with the theory).

All this is evidence of an ever changing genome and it refutes the idea that genomes arise only once in the primordial pond and are static and immutable. Remarkably, when criticizing Darwinism, he knows about the dynamic nature of the genome, but when constructing his theory of immutable genomes Senapathy forgets the dynamic genome completely.

6 A DNA sequence is not a genome

Senapathy writes:

"We shall see how the random combinations of genes in a primordial pond could lead to the assembly of numerous genomes." (page 9).

Please note that random combinations of the bases A, T, C, G in to 'genes', and random combinations of 'genes' in to 'genomes', will result in junk DNA, junk 'genes' and junk 'genomes'. This is partly because of the combinatorial explosion of possibilities when considering more than a few residues. Here we consider the biological context. Senapathy assumes that DNA is the only possible molecule that fits the task. As if there are no alternative molecules. As if DNA is a universal law of nature, a chemical law. He tacitly assumes that the Genetic Code is the only possible genetic code. He assumes that there are only 3 possible STOP codons. As if it were a law of nature. Roughly, we need to distinguish two main levels of analysis: (1) DNA as a (bio)chemical molecule and (2) DNA and chromosomes in the biological context. The primary requirements are biochemical requirements. They precede cells and organisms. The secundary requirements (multicellularity, adaptation, sex, food, ecology) make sense only after the first requirements are fulfilled.

(1) DNA and chromosomes in the biochemical context

I include chromosomes because naked DNA does not occur in eukaryotes. There appears to be a number of variables that together characterise the current genetic system of life.

Why Deoxy-ribo-nucleic acid?

Why a double helix? DNA is a double helix. Why? Is there a natural law forcing DNA to be double-stranded? RNA is single-stranded. The information is only in 1 string. Which one?
Why right-handed DNA? The DNA of every organism on Earth is a right-handed double helix and most proteins are left-handed (467). Why homochirality? (the dominance of one set of chiral molecules in known forms of life). Essentially all known chemical reactions produce even mixtures of right-handed and left-handed molecules. In principle, a DNA or RNA strand made from left-handed nucleotides should work just as well as one made of right-handed bricks (396). All biological structures, functions and even organisms could be recreated in their mirror image. Mirror-image DNA molecules have the same capacity to hold information as their natural-chirality counterparts do (514). If the genomes of organisms arose directly from a primordial pond, and are immutable, then we should expect a random mix of organisms with left-handed and right-handed DNA. Furthermore, the distribution of the left-handed and right-handed organisms should be randomly distributed in the Linnean hierarchy, that is in families, genera and species.
Why only 4 bases forming two pairs, while more than 100 nitrogenous bases are possible? (160). He cannot just assume it because he starts with DNA.
- Why not 6 bases forming 3 pairs with a total of 6x6x6=216 different triplets? (403, 404).
- Why not use 20 different bases to code for 20 amino acids? (111).
- Why are there 2 types of bases: purine and pyrimidine bases?
- Why those 20 Amino Acids while 80 prebiotic AAs are available? [543].
The problem of prebiotic synthesis of the DNA bases is the same for independent origin and evolution, except that life did not start with DNA (but with RNA or something even more simple). However, the fact that the 4 bases in DNA are common to all life is only a problem for independent origin.
Why do the sugars in DNA have five- (pentose) rather than six-membered (hexose) rings? (430).
Why does the DNA backbone universally consist of deoxyribose and phosphate? Why does RNA use ribose instead of deoxyribose and Uracil instead of Thymine? The sugar deoxyribose is harder to make, and in present-day cells it is produced from ribose in a reaction catalyzed by a protein enzyme, suggesting that ribose predates deoxyribose in cells (260). Alternatives for DNA are chemically possible (256). Senapathy a priori excludes any evolution from simpler bases and sugars to the current ones.

Why this genetic code?

"...multitudes of complex organisms could have been born directly from the primordial pond, deriving their basic genetic codes, genes, and cellular machineries from a common universal gene pool." page 219.

"...such as the genetic code and basic cellular machineries, which were available in the primordial pond before the first cells were formed." page 310

"The genetic code and the genetic machineries such as transcription, splicing, and translation systems had been already established in the primordial pond before any living cells were formed." page 311. (my emphasis)

"Remember that the commonness of the genetic code and the presence of sets of the same genes in various organisms is due to the assembly of all the genomes from the same pool of genes, and the same genetic machineries, genetic codes and basic genetic principles." page 449.

Any DNA sequence is not a genome: a genetic code is required to interprete the sequence

For protein coding genes a genetic code is required. Without a genetic code a DNA sequence itself is just a meaningless polymer. Senapathy knows the genetic code table (see Figure below). The genetic code consists of 3-letter codes (triplets). One triplet codes for one of the 20 amino acids. Reading the triplets can only be done in the correct Reading frame, otherwise the sequence will be gibberish. The genetic code specifies how DNA is translated into proteins.

GENETIC CODE:

choice of 2 pairs of bases + triplet codes
choice of 20 amino acids (out of hundreds)
choice of pairings of triplets + amino acids

Given that there are 4 different bases in DNA and that they are read in triplets, it follows that there are 4×4×4 = 64 different triplets. Those 64 triplet codons are translated into 20 amino acids with the help of transfer-RNA (tRNA). Transfer-RNAs are the physical embodiment of the genetic code (see: Inheritance of the code below). The most remarkable fact is that all living creatures on earth have the same genetic code! One of the most profound questions one can ask is: Why is there a nearly (162) universal genetic code when there are millions of possible ways to connect 64 triplet codons with 20 amino acids? To be precise: 1.5 x 10⁸⁴ possible codes (127, p.163). But all these possible code assignments assume 64 triplets of A,T,C,G and the same 20 amino acids. But why assume this? We must distinguish between DNA in general and the genetic code:

The Codon Table, page 548 Appendix Genetics Primer. SENAPATHY.
Please note there is no START codon!

Codon length: Why a codonlength of 3 bases (triplet) resulting in 4x4x4=64 combinations? A codonlength of 4 increases the number of possible codons to 4x4x4x4 = 256 (404). A code with only two bases and a codon length of 4 (quadruplet codons) is also possible (111). A codon length of 6 (Sextuplets) is possible.
Why a commaless triplet code? (126)
Why a non-overlapping code? Each nucleotide is part of only one triplet (three nucleotides specify each amino acid in a protein) (126)
Reading Frame: assuming a triplet code and a linear sequence of bases, where to START reading? How to define the first base of a triplet? With a triplet code there are 6 possible reading frames: 3 in one direction and 3 in the other direction. Since there are 2 strands in DNA, there are in total 12 possible reading frames. Which one is the 'correct' one? Can both strands be used at different locations? etc.
Directionality: updated 3 Oct 25 The genetic code has directionality or polarity. Why? We read the triplets in the Genetic code table from left to right. However, DNA is a linear sequence of letters that could be read in both directions. The codons UUU, CCC, AAA, GGG produce the same amino acids irrespective of reading direction. All other codons, including STOP and START, produce different results when read in both directions. For example, STOP codon UGA will become AGU that codes for the amino acid SER. Codon GAU (ASP) will become STOP codon UAG. The 3 STOP codons become amino acids when read in the opposite direction. The result is a completely different ORF, because ORFs are defined by a START and STOP codon. And that results in a completely different protein if translated. Which is the 'correct' direction? (if there is a correct direction at all).
If all organisms are born independently, why do they all read DNA in the same direction? If this directionality is because RNA polymerase can only add nucleotides to the 3' end of the growing mRNA chain (Transcription), then the entire reading direction depends on this specific RNA polymerase. But since a polymerase (Thg1) exists that is capable of catalyzing nucleotide addition in the 3'–5' direction, why isn't it used for transcription of DNA in general? Please note that the statistics of ORF lengths are the same when a DNA sequence is read in the opposite direction. Finally, where does the standard RNA polymerase come from in the first place? (elephant in the room, vicious circle) (520).
Coding strand and noncoding strand: DNA is double-stranded, but only one strand serves as a template for transcription at any given time. This template strand is called the noncoding strand. The nontemplate strand is referred to as the coding strand because its sequence will be the same as that of the new RNA molecule. In most organisms, the strand of DNA that serves as the template for one gene may be the nontemplate strand for other genes within the same chromosome (!). How is decided which strand is used? Senapathy ignores this. (Nature Scitable, Coding strand).
Why a degenerate code? (redundant coding): why are there 61 meaningful triplet codons? 20 triplets would be enough, so 41 codons are redundant. What is the problem if 41 'codons' are unused? Is it necessary that all 61 codons are used? No, certainly not. Yes, we know now that all 61 codons are used, but the independent scenario must explain this fact.
Stop codons: Why are stop codons universal? (if every organism was born independently!) Why choose 3 'stop' codons (but first see: elephant in the room!)? And why UAA, AUG, UGA? Why not 1, 2 or 4 'stop'-codons? Senapathy does not give a justification for using 3 'stop-codons' and why precisely those 3. At the origin of life anything could happen. At that time there is maximum freedom because there is no burden of the past. One cannot just assume that 3 are somehow 'natural'. Which chemical law is operating here?
Genetic codes with only 2 stop codons do occur naturally in eukaryotes (335) and prokaryotes (338), (204). In some ciliates, like Paramecium, only UGA is a stop codon. This changes stop-codon statistics from 3/64 to 2/64 or 1/64. Quite a difference! This has important consequences for ORF length: in random DNA, ORFs are almost twice as short with 3 STOP codons as with 1 STOP codon (507). So, 1 STOP codon in the Genetic Code table would be most the advantageous solution (for Senapathy).
Furthermore, experimental proof that it is possible to re-engineer the standard 3 stop codon genome into one, is the succesfull engineering of Escherichia coli C321.ΔA with UAA as the sole stop codon (444).
Another question: Why is there a strong bias for 'stop' codon usage (frequency) in different species? For example, in Escherichia coli W3110 the codon UAG has a frequency of 0.2 per thousand compared with CUG has 53.1 per thousand (source).
Start codon: Why only one and nearly universal Start codon (362) (AUG = MET = methionine)? A start codon is necessary because bases must be read in triplets and the correct base must be identified as the first of the triplet. The start codon must hold throughout the whole genome. Misidentifying the first base of a triplet would result –if translated at all– in a different amino acid and consequently in a different protein (Just as in case of a one-base deletion or insertion).
Amino Acids: Life universally uses the same 20 Amino Acids. Why only 20 amino acids while 150 amino acids are possible in nature? (160). Why exactly those 20 amino acids and not others? "The amino acids used by life appeared to be anything but random. (...) Taking random groups of 20 AA from a set of 76 AA, not a single group out of million possible alternatives outperformed the natural set." (542). So, the 20 Amino Acids used in the Genetic Code are highly nonrandom. Furthermore, the universal use of the same 20 amino acids points strongly to common descent of all life. Only 12 of the 20 amino acids can be synthesized prebiotically, the other 8 can only be synthesized by living organisms (208).
Nonrandom redundancy (degeneracy) of the code is not random (very imporant!). The codons for an amino acid are clustered. Why? This is the structure of the genetic code. A redundant code is a logical consequence of 64 triplet-codons coding 20 amino acids and 3 stop codons. So, on average an amino acid is coded by 3 codons, the real number varies from 1 - 6. Mostly, the codons for a particular amino acid have the same first two letters. The third letter varies most. This is not random!
Why does the genetic code minimize the chemical consequences of amino acid mistakes? Optimal code, Steve Freeland (234).
Neighboring triplets in the genetic code tend to specify biochemically similar amino acids, so that single-nucleotide substitutions rarely lead to radical amino acid replacements (238). This is a non-random code!
Why is there a codon usage bias when organisms arise from random DNA? See: codon usage database. Some codons for the same amino acid are used very frequently, others rarely. Codon usage should be random in the independent origin scenario. It is not. If this isn't a falsification of the theory, then what is?
Info: this is not about the structure of the genetic code, but about how often the standard codons are used in a genome. Statistically, one would expect that all codons occur in about the same frequency in DNA. This is not so. For example. there are differences in the frequency of occurrence of synonymous codons (139). For the amino acid LEU the most frequently used codon is used 140 times more often than the least frequently used codon (33). In the bacterial kingdom the C+G percentage varies from 25% to 75% (35).
"The genetic code is degenerate, with multiple synonymous codons encoding the same amino acid, yet these codons are not functionally equivalent. Synonymous codon usage strongly influences mRNA stability and translation efficiency across species. mRNAs enriched in optimal codons are generally stable and efficiently translated, whereas those enriched in nonoptimal codons (nonoptimal mRNAs) are ineffectively translated and rapidly degraded." (544)
Why is Methionine always the first amino acid in protein-coding genes, if DNA arose spontaneously? (Initiation Codons in eukaryotes).

A tremendous freedom of choice! If all these variations are chemically and biologically possible, why is there only one genetic code? Variables: codonlength, number of different bases in DNA, number of different amino acids, how many and which start and stopcodons, distribution of redundant coding for the same amino acid, directionality, right-handed/left-handed DNA, etc. Every Origin of Life theory has to explain this restriction, but:
If organisms originated really independently, there should have been as many variations of the code as independently born organisms. (161)
The reason is: each independent origin is a new trial. The probability is zero that in a few million trials (= the number of species) the outcome would always be the same result, considering the 1.5 x 10⁸⁴ possible genetic codes alone. And that is assuming 64 triplets of A,C,T,G and 20 amino acids. Let alone all the variations above. So, independent origin spectacularly fails.

WHY A UNIVERSAL CODE?

Chemical necessity?
The only possible escape would be that the genetic code would be a chemical necessity. In that case the same genetic code would be produced automatically again and again. Is the code a chemical necessity? Is seems now likely that only 25% of the codons can be explained by a chemical affinity of amino acid and codon-RNA, and 75% of the codons are arbitrary assigned (127, p.174). The 25% follows from the universal laws of chemistry and thus will be the expected outcome on any planet. However, 75% of the genetic code must be explained by common descent. That is an inherited and 'frozen' accident. Independent origin cannot explain universal arbitrary features of the genetic code because there cannot be millions and trillions of trials with exactly the same outcome! Unless one particular 'frozen accident' is inherited by all descendant species. This is called evolution by common descent and modification. Of course the theory of evolution needs to explain how the genetic code originated, but the fact that every species has the same genetic code can only explained by common descent. The frozen accident theory may be not the most elegant theory, but at least it is not ruled out by probability theory! Any talk about 'common pool of genes' destroys a truly independent origin of life. Conclusion: the universal genetic code is the strongest argument against independent origin I can think of (128).
Implementation of the code
Additionally, and crucially, whatever its origin, the genetic code is encoded in DNA itself by Transfer RNAs (tRNAs) and aminoacyl tRNA synthetase (aa-tRNA) which attach the aminoacyl group to its allocated tRNA. This embodies the connection between DNA triplets and amino acids (82). There are at least 20 aminoacyl tRNA synthetases. This is the problem: the DNA sequence of an organism could be random, its translation cannot be random. The translation machinery must be invariable. The translation machinery gives meaning to a meaningless sequence of bases. tRNA and aa-tRNA genes must be exactly identical for all organisms on earth. So, the biggest problem is not producing random DNA sequences, but guarantee that exactly the same 20 tRNAs and aa-tRNAs are incorporated in each newly formed organism the Primordial Pond. This is an extremely non-random process. If even one is missing, it is fatal for the organism. How does a completely random process guarantee that each and every independently born organism (= species!) has exact the same the genetic code? In reality, an endless number of incorrect combinations would arise.
This looks like one of the many problems of Independent Origin. But it appears now that it is the most fundamental and deepest obstacle for an independent origin scenario: reading and interpreting a DNA sequence with the same translation code for every individual in the primordial pond. Producing genomes is pointless when there is no fixed universal genetic code.
Origin of the code
Francis Crick:
"The important point to realize is that in spite of the genetic code being almost universal, the mechanism necessary to embody it is far too complex to have arisen in one blow. It must have evolved from something much simpler. Indeed, the major problem in understanding the origin of life is trying to guess what the simpler system might have been". Francis Crick (1981) Life Itself, p.71 (184).
Is gradual evolution of the genetic code compatible with independent origin? Independent origin of complete eukaryotic genomes must assume the full set of 20 amino acids, because the current genetic code codes for 20 amino acids. Less than 20 will prevent the production of functional proteins. That is fatal. It cannot start with a subset of amino acids and later add additional ones (as in evolutionary scenario's). Evolutionary scenario's make the problem easier. For example, Wong proposed the evolution of the genetic code in two phases. In the first phase 6 - 10 amino acids were assigned to 61 codons. The code then expanded by the addition of phase-2 amino acids (145a). Any gradual evolution of the genetic code is incompatible with Senapathy's scenario. Not only the genetic code but also the genome would have to evolve. Both are (and must be) fixed in Senapathy's scenario. So, the burden to explain the abrupt origin of a full-blown 64/20 genetic code rests on Senapathy's shoulders. He did not address this difficulty (see also § 27: PLOS One article). See for other proposals of the evolution of the genetic code: (158).

The Standard Genetic Code contains 18 fragile codons (red) that can be changed into a STOP codon by a single point-mutation and whose mistranscription can therefore generate nonsense errors. The remaining 43 sense codons are "robust" to such errors.
Six amino acids are encoded exclusively by fragile codons ("fragile amino acids", shaded), ten amino acids are encoded exclusively by robust codons ("robust amino acids", unshaded) and four amino acids can be encoded either by robust or fragile codons ("facultative amino acids", hatched shading).
We observe a 8% depletion of fragile codons in single-exon genes in the human genome that is highly significant.
After Brian P. Cusack et al (240)

Considering all these problems with the spontaneous origin of the genetic code, one wonders why the first genomes did not consist of DNA genomes with only non-coding RNA genes which do not need a genetic code? There are 8,801 small RNAs and 9,640 long non-coding RNAs (lncRNAs) (287) totalling 18,441 RNA genes. One step further is the question:

Why a DNA genome?
Why are the first genomes made of DNA? What's wrong with RNA genes or RNA genomes? Double-stranded RNA viruses do exist. RNA genomes would be simpler because they could consist of rna genes which do not require a genetic code to be present, since no proteins have to be produced.
A sequence constructed with any 4 'letters' is not DNA
A sequence constructed with 4 different 'letters' is not DNA because the letters must be able to chemically pair specifically and reliably. They must consist of pairs. For example: A must pair with T, C with G. So not any 4 letters will do. A restriction applies which cannot be ignored. Furthermore, they must not only be able to pair, they must be able to separate temporarily for DNA replication. This pairing is completely absent from all computer simulations. Strangely, it seems irrelevant for computer simulations. But it cannot be ignored if one wants to solve the origin of life because DNA cannot exist without pairing. "All organisms, from bacteria to humans, face the daunting task of replicating, packaging and segregating up to two metres (about 6 × 10⁹ base pairs) of DNA when each cell divides." (340). See: cell division (below).
A single-stranded linear sequence of 4 letters is not DNA
A sequence constructed with 4 different letters is not DNA because Prokaryotic and Eukaryotic DNA have a double stranded helix. Single-stranded DNA (ssDNA) viruses exist. Double-stranded DNA (dsDNA) creates a problem. In nature usually only one strand of a particular region in DNA (sense strand, or positive sense, coding strand) is translated into proteins. The other non-sense or antisense strand ('non coding strand', 'complementary strand', 'antisense') is not translated. Senapathy's example in figure 1 is a virtual single-stranded DNA sequence and is supposed to be the coding strand. He ignores that both strand could be used as a code. How is it decided what is the coding and what the complementary strand?
In most organisms, the strand of DNA that serves as the coding template for one gene may be noncoding for other genes within the same chromosome (341). Amazingly, there are several protein coding genes encoded in the opposing strand of another gene! (Overlapping genes). These are genes within genes. Example is the gene for neurofibromatosis type I (NF1) which contains 3 genes in the opposing strand of the intron (167). If genomes are random sequences, then Senapathy should predict that genes are located randomly on either strand. If there is a big excess of genes on one strand, the hypothesis is falsified. The problem is that his approach is incapable to incorporate all these complications because he uses only single-stranded sequences.

Overlapping genes: The human genome contains thousands of overlapping genes, where the nucleotide sequence of one gene partially or completely overlaps with another. Studies suggest that approximately 10% to over 25% of human protein-coding genes are involved in overlap events, often arranged in pairs or complex clusters. Overlapping genes occur across all domains of life, including viruses, bacteria, archaea, and eukaryotes.
A linear sequence of A,T,C,G is not a gene
A sequence of the bases A,T,C,G of arbitrary length is not a gene (or exon) because the length must be a multiple of 3. The reason: the code is a triplet code. This objection is related to the genetic code (see above).
Double-stranded DNA does not form spontaneously 1 Feb 2012
The spontaneous formation of double-stranded DNA has never been observed. Double-stranded DNA is always formed by semi-conservative replication on the basis of an pre-existing double-stranded DNA polymere (Meselson-Stahl, 1958). So, of each double-stranded DNA molecule one strand is inherited as it is from the mother DNA molecule and the other strand is newly synthesized.
The bases A, T, C, G must first be synthesized 17 Jul 2025

A vicious circle
DNA contains the purines: Adenine, Guanine and pyrimidines: Thymine, Cytosine. Where do they come from? They must first be synthesized. Adenine and Guanine contain 5 Nitrogen, Cytosine: 3 N, Thymine: 2 N. What is the source of N? The molecules of Nitrogen gas (N₂) are extremely strongly bonded together, making nitrogen unavailable to most organisms. To split N₂ and make nitrogen biologically availabe requires a remarkable biochemical feat –nitrogen fixation– which uses a lot of energy. (442). This is a requirement for any OOL scenario.
Although further investigations established that the purine nucleotides Adenine and Guanine could also form under conditions likely present on primitive Earth (499), (500) it remains to be stablished whether the conditions favorable for synthesis do also allow for the simultaneous 'independent origin of eukaryotes' (animals and plants).
A second dilemma is: purine (A,G) and pyrimidine (T,C) biosynthesis require many enzymes which are encoded in DNA. For example, the de novo purine biosynthesis pathway involves a series of 10 enzymatic reactions in humans, catalyzed by 6 enzymes. These enzymes, along with their corresponding genes, are crucial for building the purine ring from simpler precursors. So, an efficient production of long DNA molecules requires specific genes, which are made of DNA, which consists of A, T, C, G, which must be synthesized from simpler precursors by enzymes, which are coded in DNA, which ... a beautiful vicious circle (see illustration above). Prebiotic non-enzymatic synthesis of purines and pyrimidines is possible, but it will be very slow and inefficient without enzymes. So, DNA molecules will be rare and long DNA molecules will be very rare. If life is to flourish it depends on efficient synthesis of purines and pyrimidines.
A third conundrum is the transition from non-DNA-encoded, non-enzymatic to DNA-encoded enzymatic synthesis.
DNA requires a 'DNA synthesizer'
Even if we assume statistics allows for a complete eukaryotic genome in a random sequence of A,T,C,G, then still those DNA sequences must be synthesized first from chemical building blocks: bases, deoxyribose, phosphate. It turns out that this is extremely difficult abiotically (83), (480),(525). Enzymes are required: Deoxyribose is generated from ribose 5-phosphate by ribonucleotide reductases. Nucleotides, the building blocks of DNA have never been produced in any prebiotic synthesis experiment (365). The enzyme dihydrofolate reductase is required for making nucleotides. At the same time enzyme inhibitors prevent DNA synthesis. This is fatal for de novo DNA synthesis. Finally, DNA polymerase III is well-known for its blazing catalytic speed of ~ 1000 base-pairs per second (360), so replicating DNA without any enzymes would take an eternity.
(Mainstream science has the RNA-world and the Pre-RNA world), but this is unavailabe in Independent Birth of Organisms.) Even in the lab it is difficult to synthesize a small eukaryotic chromosome, let alone a complete genome (329), (330). However, recent experiments show that the building blocks of RNA and DNA could have been produced abiotically starting with Urea (405).
How to build a genome. Lessons from synthetic genomes: the first synthetic genome (a bacterial genome) was created from scratch in the lab by Craig Venter in May 2010.
1. The first lesson: this work didn't create a truly synthetic life form, because the genome was put into an existing cell (129). (the original genome was carefully removed before it received the new genome).
2. The second lesson: it is very difficult to create a 1-million-base genome. Blue Heron (The Gene Synthesis Company) has mastered the art of synthesizing relatively long, entirely accurate sequences, and stringing them together to create gene-sized fragments on the order of hundreds to thousands of bases. However, the human genome is 3000 times bigger! To assemble a one million genome from 1k pieces the process involved, according to Venter, "invention after invention after invention of new ways to do things" and "There were literally thousands of hurdles that had to be overcome" (201).
3. The third lesson: errors! Initially the attempt failed because the artificial genome failed to take control of the cell. The cause was a single-base mistake which delayed the project 3 months (129).
Only a handful of genomes have been synthesized so far, mostly for bacteria. Today (2020) it is still a huge challenge to create fragments of a few thousand bases, let alone to create all the 16 chromosomes of the yeast Saccharomyces cerevisiae (a eukaryote). The longest yeast chromosome under construction is around 1.5 megabases (mega=million). The complete genome is about 12 million base pairs. It does not matter whether these fragments are random or specified sequences, the fragments need to be assembled to produce whole chromosomes (380). Many bright minds and Nobelprizes were needed to discover how DNA can be synthesized in the lab (547).
DNA synthesis costs energy
"Each building block needs to be activated, or chemically charged, before it can be incorporated into a polymer. Activation requires a preexisting source of chemical energy." (256). "It has been known for nearly 20 years that chromatin assembly is an ATP-dependent process" (146). There are two energetic components: the costs of nucleotide synthesis and the polymerization cost needed to make a DNA or mRNA molecule (188). Furthermore, DNA synthesis costs more energy than RNA synthesis. DNA replication is energy consuming process. So without ATP, DNA synthesis could never happen. However, without ATP synthase (an enzyme) ATP will not be produced. And without the gene for ATP synthase, ATP synthase will not be produced. That gene is made of DNA. A vicious circle.
Even the smallest genome is too long 2 Aug 2012
The genome of the urogenital bacterial parasite Mycoplasma genitalium is 582,970 base pairs long (525 genes), making it one of the smallest genomes of any independently dividing cell – for comparison, the gut bacterium Escherichia coli has 4.6 million base pairs and around 4,200 genes (282). Compare with the smallest eukaryotic genome of Encephalitozoon intestinalis: 2.25 million base pairs. Despite this 'small' genome size, the genome of Mycoplasma genitalium is too long to originate spontaneously (apart from all other requirements for a genome).
DNA has to be replicated every cell division updated 15 Jul 2025
In the Primordial Pond there is only naked DNA. DNA requires a cell. DNA replication requires a cell. DNA has to be replicated every cell division. This has nothing to do with ORFs, exons, introns, STOP codons. DNA replication must be performed before cell division and must be precise to ensure each daughter cell receives the full complement of chromosomes, no less, no more. Where does the cell come from? DNA replication requires DNA polymerase enzymes which are proteins. They must be present. Where do they come from? Those proteins require specific DNA sequences called Origin of replication. Those sequences are about 100 basepair long. There may be as many as 100,000 origins of replication in the human genome. Without OoR DNA can not be copied and a cell cannot divide (401). The number of DNA polymerases in eukaryotes is much more than in prokaryotes: 14 are known (DNA Replication in Eukaryotes). DNA ligase (enzyme) is required; where does it come from? Telomerase (a ribonucleoprotein) is required; where does it come from?
Watson-Crick base pairing is by far insufficient for DNA copying fidelity: 8 Jul 2023
Despite its beauty, Watson-Crick complementarity is absolutely insufficient to ensure an acceptable fidelity of replication, even with perfect raw materials. Fidelity of nucleotide attachment depends not on complementarity, but on active involvement of DNA polymerases. (410).
A genome is not a random collection of genes (1) 6 Aug 2012

"Because genes present in the random DNA sequences in the primordial pond were randomly assembled into various genomes" (p.377). added 5 Jul 2023
Even if all eukaryotic genes are produced abiotically, on statistical grounds it can be excluded that random assembly of those genes will produce a viable eukaryotic genome. Let alone an eukaryotic organism. The number of combinations of genes and their interactions that need to be probed (by natural selection!) is infinitely large, so it would take an infinite amount of time to test the whole thing. "It is not feasible to understand evolved organisms by exhaustively cataloging all interactions in a comprehensive, bottom-up manner". For only 10 genes there are already 115,975 possible interactions (283). And then we ignore when (development), where (tissue, organ), at what level genes are expressed and when they are shut down.
Amazingly, Senapathy seems to be aware of the problem:
"Out of a great number of combinations of different kinds of genes, only one genome can produce such a viable organism" (p.295) added 5 Jul 2023
but then he ignores the whole problem and continues.
A genome is not a random collection of genes (2): Gene regulatory networks

"Animals are more than the sum of their genes – it is the regulated expression of genes across space and time that helps to differentiate egg from embryo, leg from wing or bat from fly." (356)
A gene regulatory network is a collection of molecular regulators that interact with each other and with other substances in the cell to govern the gene expression levels of mRNA and proteins which, in turn, determine the function of the cell. (Gene regulatory network).
A genome is not a random collection of chromosomes 29 Sep 2012
Aneuploidy is a deviation from the standard number of chromosomes of a species: one or more chromosomes extra or missing. In humans, loss of any autosome (non sex chromosome) is lethal. Carcinogenesis has been shown to be initiated by random aneuploidy (292). Aneuploid embryos usually die.
RNA genes are ignored 24 Sep 2012
Genes that code for proteins have start en stop codons and are translated into proteins following the Genetic Code table. However, by mid-2009 evidence for at least 6000 human RNA genes had been obtained. RNA genes are difficult to identify using computer programs: there are no open reading frames (ORFs) to screen for (274, p. 262). Senapathy's search for genes depends on Open Reading Frames (sequences without STOP codons). This method fails to find RNA genes. The ENCODE project found 18.441 RNA-genes (290). RNA genes that were known when Senapathy wrote his book are Transfer RNA (tRNA) (mentioned on page 556) and Ribosomal RNA (rRNA) genes (not mentioned).

A DNA sequence is not a chromosome
Animals and plants cells do not contain naked DNA (265). For example, the diploid human genome contains 6 billion base pairs of DNA per cell with a total length of 2 meter packaged into 23 pair of chromosomes. Because each base pair is around 0.34 nanometers long (one-billionth of a meter), each diploid cell therefore contains about 2 meters of DNA. This creates the DNA Packaging Problem: (supercoiled DNA) How is all of that DNA packaged into chromosomes and into the nucleus? (320) and how is it unpackaged and unwinded in order to read genes.

A DNA sequence is not a chromosome

Human chromosomes under a scanning electron microscope. ©Nature2017

"A DNA sequence isn't enough; to understand the workings of the genome, we must study chromosome structure. Far from being the random result of packing 2 metres of DNA into a sphere perhaps 10 micrometres across, the structures vary across cell types and exert an as-yet-mysterious influence on gene expression." (176).

"We usually think of genomes abstractly as one-dimensional entities that are purely defined by their linear DNA sequences. Reality, of course, is far more complex. The DNA helix is folded hierarchically into several layers of higher-order structures that eventually form a chromosome" (177).

"Comparing the length of metaphase chromosomes to that of naked DNA, the packing ratio of DNA in metaphase chromosomes is approximately 10,000:1 (depending on the chromosome). This can be thought of as akin to taking a rope as long as a football field and compacting it down to less than half an inch. This level of compaction is achieved by repeatedly folding chromatin fibers into a hierarchy of multiple loops and coils." (320).

High quality G-banded human kayrotype, male, 46XY

G-banded human kayrotype, male, 46XY. © www.pathology.washington.edu

A DNA sequence is not a chromosome because important chromosomal structures must be present: telomeres, centromeres, nucleosomes, euchromatin and heterochromatin. At least these structures are present in linear chromosomes in eukaryotes, not in the circular chromosomes of most bacteria. Although linear chromosomes probably have advantages, they come with problems: replication of linear chromosomes presents a problem as it leads to gradual loss of the terminal telomeric regions, the telomeres.

heterochromatin and euchromatin: Giemsa staining reveals banding patterns with darkly stained heterochromatin and lightly stained euchromatin bands in chromosomes. Euchromatin is transcriptionally active and heterochromatin is transcriptionally silent. Functional genes are located in lightly stained euchromatin bands. This is a nonrandom pattern. If genes were located in heterochromatin, they would not be transcribed. These facts were known in 1994.

telomeres (white dots), centromeres (red dots)
(Science 22 April 2011)
Telomeres: Telomeres are structures at the ends of chromosomes that contain a series of non-coding DNA repeats, and which become shorter themselves but protect the coding regions from damage. Human telomeres are several kilobases of repeated sequences of DNA bound by specialized protective proteins. A peculiarity of the DNA-replication mechanism causes telomeres to shorten as cells divide. Sometimes the enzyme telomerase can replenish the lost DNA. If telomeres get too short, through aging or because telomere maintenance goes awry, cells can stop dividing. The protection conferred by telomeres is a fundamental biological mechanism present in nearly all animals and plants (119). 'Independently born' organisms (if they exist) have at best telomeres with a random length, simulating cells and chromosomes of random age –from old, average to young. That means many wil not be able to complete replication.
Centromeres are defined as pieces of non-coding highly repetitive DNA with variable and rapidly evolving DNA sequence that function as the site of spindle attachment at cell division (mitosis and meiosis). They are essential for equal chromosome separation during cell division. Remarkably, centromeres are inherited epigenetically (389). That means DNA sequence does not determine the identity of the centromere. Experimental studies indicate that specialized chromatin, rather than the underlying DNA, underpins centromere function in chromosome segregation (485). This implies that ultimately the centromere is inherited from a previous generation. There is no previous generation in the Independent Birth of Organisms hypothesis.
Histones are proteins found in eukaryotic cell nuclei that package and order the DNA into structural units called nucleosomes. Without histones human DNA could not fit in the nucleus because it would be too long. DNA without histones could certainly not enter mitosis. Histones are highly conserved in eukaryotes, so random origin is virtually excluded. Histones have also a function in gene regulation. Along with histones come more than 20 histone modifiers, chaperones and other regulators. Important question: Where do histones come from in the independent origin scenario? It does not help that histones are encoded in the DNA, because how are they supposed to be transcribed into mRNA and translated into protein? (see below)
Nucleosomes consist of DNA wrapped two times around small, globular histone octamer particles. They form the fundamental repeating units of eukaryotic chromatin, which is used together with condensin (359) and topoisomerase II to pack the large eukaryotic genomes into the nucleus while still ensuring appropriate access to it. Already in 1987 a publication appeared pointing out the relation between chromosome structure and gene expression.
"Since 1968, we have learned that DNA wraps around histones, packing ~10² base pairs into the 10^–8m nucleosome. We also know that individual chromosomes occupy distinct subnuclear volumes called chromosome territories which pack ~10⁸ base pairs into 10^–6 m (243).

©Science 28 Jul 2017: ChromEMT: Visualizing 3D chromatin structure
and compaction in interphase and mitotic cells.
sex chromosomes: eukaryotic genomes come in two forms: male and female genomes. Males have a unique chromosome that females don't have: the Y-chromosome. Because males have only one X-chromosome and females two X-chromosomes, a mechanism is necessary to ensure genes on the X-chromosome are expressed in the right levels. Dosage-insensitive genes are those that function perfectly well when present as a single copy. By contrast, two copies of dosage-sensitive genes are required for normal health. (dosage compensation, X-inactivation).
Linear chromosome: Here is the fundamental problem for the theory of independent origin: why should the most complex chromosome type, the linear chromosome, with all its problems, arise spontaneously in stead of the circular chromosome? For example: why don't humans have 46 circular chromosomes? Or why don't humans have one big circular or non-circular chromosome? On the other hand, evolution theory needs to explain the evolution of the linear chromosome (120).

(2) DNA and chromosomes in their biological context

Chicken and egg problem

The Chicken and Egg Problem

Circular Dependency: A needs B to start, but B needs A to exist

To produce a protein from DNA specific proteins are needed. It does not help that those proteins are encoded in DNA. Surely, every protein could be encoded in DNA. The point is that DNA needs to be transcribed and translated (250). That requires enzymes to be present in the first place. And that requires 20 Amino Acids to be present. 'The central dogma of molecular biology' states that genetic information encoded in DNA is transcribed to mRNA (by RNA polymerases (321)), and mRNA is translated to protein (by ribosomes, and 20 different tRNAs which are made by 20 different aminoacyl-tRNA synthetases). Since RNA-polymerase is a protein itself, it needs to be present before it can be produced (275). This is certainly impossible for the first 'independent organism'. This alone is fatal for the theory of independent origin. This is why the origin of any organism from naked DNA is impossible! Try it yourself: place a complete genome in a physiological saline solution and wait to see what happens...
This is also why scientists concluded that there must have been a RNA-world before a DNA-protein world. This is also why even giant viruses with all the genes for mRNA synthesis, etc. still depend on living cells for their reproduction (see: The elephant in the room).
The prebiotic synthesis of proteins is also a problem. "Prebiotic peptide assembly from amino acids is particularly difficult to establish, given that the intricate biological machinery used today to synthesize peptides obscures the origins of the process." (512). The chicken and egg problem is well known in the scientific literature (523).

Here are three examples of this vicious circle:

Transcription:
Transcription: a DNA segment is copied into mRNA by RNA-polymerase. RNA-polymerase is an enzym, which is a protein. That protein is encoded in DNA. The synthesis of RNA-polymerase starts with transcription. For transcription of the RNA-polymerase gene a RNA-polymerase enzym is required: vicious circle. (228). See: Transcription.
Translation:
"Translation is probably the most complex biochemical cellular process, needing more than 120 different molecular elements ranging from messenger RNA to ribosomes and their many protein and RNA accessories. ... even the smallest of cellular organisms (Mycoplasma genitalium) need a minimum of 90 different proteins for translation and about 30 for DNA replication." (185), (250). The eukaryotic ribosome is composed of 79 ribosomal proteins. "More than 200 assembly factors and small RNAs are needed to synthesize ribosomes in the nucleolus. Ribosomes are absolutely essential for life, generating all cellular proteins required for growth. Complete loss of any single ribosomal protein often leads to death of the embryo in mice" (317). (See: Translation).
Ribosomes are assembled from (382):
- 4 distinct ribosomal RNA (rRNA) molecules
- 80 different proteins, which form small and large subunits
- more than 200 assembly factors
Replication:
Every cell division requires DNA replication. DNA does not 'self-replicate'. "Nude DNA does not self-replicate" (394). Replication enzymes (DNA polymerases) are requited. At least 15 DNA polymerases operate in human cells. Human DNA polymerases are 900-1000 amino acids long (2700 - 3000 base pairs). The minimum set of proteins required to initiate DNA replication in eukaryotes (Saccharomyces cerevisiae) is 16 proteins (351). "The work of a generation of biochemists, notably Arthur Kornberg, has shown that it takes dozens of protein complexes, each involving many proteins to accomplish this [replication]. They can be thought of as complex components of several giant molecular machines, which synthesize the new DNA, check it for errors, and pass it on for further interactions which package it in chromosomes." (264). The polymerases responsible for replicating nuclear DNA are at least 100-fold faster and nearly 1,000-fold more accurate than polymerase η which is involved in DNA repair caused by ultraviolet light (278). See: Replication.

A DNA sequence is nothing without proteins

Two hands in the paradoxical act of
drawing one another into existence!
© M.C. Escher (wikipedia)
The most amazing fact is that Senapathy wrote in an endnote:
"Even the simplest cell needs DNA, amino acids, nucleotides, proteins and other molecules to construct the structure of the cell such as cell membranes, and cellular machineries such as ribosomes. The cell needs enzymes to synthesize these molecules and metabolize nutrients. Without such absolutely basic things, no living cell can ever come into existence." (Note 1 chapter 7 page 596.)
These few lines are sufficient to undermine origin of life from DNA. It is a perfect refutation of independent origin from naked DNA. If 'The cell needs enzymes', then where do these enzymes (proteins) come from (if not coded in DNA)? The prebiotic synthesis of proteins is a problem. "Prebiotic peptide assembly from amino acids is particularly difficult to establish, given the intricate biological machinery used today to synthesize peptides." (512). Add to that: where do the required 20 Amino Acids come from? And where do ribosomes come from? And how is the first gene transcribed and translated? Where does the 'genetic code' come from? The universal genetic code is embodied by Transfer-RNAs (tRNAs) which are small non-coding RNAs that deliver amino acids to the ribosome for protein synthesis (468). Transfer-RNAs must be present before any protein can be produced. That is a vicious circle. It is essentially the bootstrapping problem: a self-starting process that is supposed to continue or grow without external input. Or: a situation where a system depends on itself to start (see: chicken and egg problem below). It is a fatal boot problem, because Senapathy starts with naked DNA. Senapathy doesn't see The elephant in the room or worse, he ignores the boot problem. At least, he mistakenly thinks it is only a minor problem. Why? He thinks he has broken the vicious circle. How?
1. the DNA sequence for transcription enzymes are present in random DNA (453);
2. primitive transcription enzymes originated by chance (451), (454);
3. than came a switch from non-encoded to DNA-encoded enzymes (455);
4. "After the DNA-coded enzymes had evolved, the replication of the DNA must have become more efficient ..." (454).
Next problem: in the unlikely event that a transcription enzyme arises spontaneuously in the Primordial Pond, the organism is dependent on good luck again and again and again. Sheer luck can only produce a unique enzyme once, it can not reproducibly produce highly specific enzymes. Only inheritance (DNA) can reproducibly produce very specific transcription enzymes (regulation). All this is absent if transcription enzymes are supposed to arise by sheer accident.
Suppose a spontaneous arisen functional transcription enzyme existed, it certainly does not follow that the code for it can be found in random DNA. These are two completely independent things.
Here is a list of proteins:
1. histones
2. condensins
3. cohesins
4. DNA polymerases
5. replisome
6. primases
7. telomerases
8. topoisomerases
9. helicases
10. DNA gyrase
11. ligases
12. DNA glycosylase
13. deaminases
14. release factors
15. Ribosome Recycling Factor
16. nuclear receptors
17. chromatin remodellers
18. histone demethylases
19. DNA repair enzymes
20. Transcription Factor (TF): There are approximately 1600 TFs in the human genome.
21. repressor
22. activator
23. spliceosomes, splicing regulatory proteins
24. nucleases
25. DNA methyltransferases
26. ribosome
27. RNA polymerases
28. RNA-binding proteins
29. Origin recognition complex (ORC)
30. Chromatin Assembly Factor-1 (CAF-1)
31. Nuclease–helicase DNA2
A DNA sequence is nothing without RNA 18 Feb 12
DNA can do nothing without RNA. In fact, DNA cannot even replicate without the prior formation of an RNA primer (270, p.66).
DNA specifies amino acids but does not synthesize amino acids 1 Feb 12
DNA specifies the sequence of amino acids in proteins but does not synthesize amino acids. So, when there are no amino acids available (for example essential amino acids), protein synthesis is impossible. To synthesize 'non-essential' amino acids the cell needs specific enzymes. Those enzymes can only be produced when the right amino acids are available. This is because enzymes are a sequence of 20 different amino acids.
DNA does nothing without ribosomes 22 Mrt 24
"For if DNA is data then it can't go anywhere, or do anything, without a machine to process it. The ribosome is that machine." (435). Each animal cell can contain millions of ribosomes.
DNA sequence is not sufficient to produce proteins 13 Jul 11
About one quarter to one third of all proteins require metals to carry out their functions (metalloproteins: iron, copper, magnesium, cobalt, zinc, molybdenum, vanadium, manganese, nickel, selenium). For example, iron and copper are present in virtualy all enzymes and in some proteins that interact with oxygen (215). However, metals are not coded by DNA! So, protein specification by DNA is incomplete. Furthermore, most of the Mg2+ in a cell is bound to DNA, to RNA, to the cellular energy carrier ATP or to enzymes, and acts as an essential cofactor for these molecules. Mg2+ is not in the Sequence.
DNA is an archive (1). It must be read to have any effect 5 Jul 19
DNA is like a book. The book does not read itself. It needs a reader. The right piece of DNA must be read (transcription) on the right time in the right quantities in the right cells. The transcript must be translated. There must be a
- Transcription Code: where to start and stop copying the gene?
- Splicing Code: where to start and stop removing introns?
- Translation Code: where to start and stop translating the gene?
- Epigenetic Code: where to mark DNA bases?
DNA is an archive (2). It must be read in the right place and time 4 Jul 22
In a multi-cellular organism (plants and animals) genes must be transcribed in the right place and time especially and crucially in the development of the embryo (391). Also genes must 'know' if they are in a female or male body.
The Transcription Code
The processing of the information of DNA starts with transcription. Humans have approximately 1,600 DNA-binding transcription factors in their genome. Transcription requires:
- each gene has its own promoter element (contains a conserved gene sequence called the TATA box) and enhancer element(s)
- Termination: Specific DNA sequences signal RNA polymerase to stop and release the newly formed RNA.
A Transcription Factor is a protein. Where does that protein come from? The sequence of every protein must first be transcribed from DNA and then translated in to the protein. For that transcription, a Transcription Factor must be present. Furthermore, there is a transcription factor binding code within (!) the protein coding part of a gene (324). See also: 'Sequence basis of transcription initiation in the human genome' (437).
The Translation Code 30 Mar 12
- Kozak consensus sequence is a non random sequence which occurs on eukaryotic mRNA and plays a major role in the initiation of the translation process.
- ribosomal binding site (RBS) is a non random sequence on mRNA that is bound by the ribosome when initiating protein translation.
- Internal ribosome entry site (IRES) is a nucleotide sequence that allows for translation initiation in the middle of a messenger RNA (mRNA) sequence as part of the greater process of protein synthesis.
Without these non-random sequences no proteins can be produced.
The Histone Code 4 Oct 11
The histone code is a hypothesis that the transcription of genetic information encoded in DNA is in part regulated by chemical modifications to histone proteins, primarily on their unstructured ends. Together with similar modifications such as DNA methylation it is part of the epigenetic code. (see Epigenetics below)
The Splicing Code 18 Feb 11, 9 Mar 11
The very existence of introns requires splice site recognition: the border between intron and exon. So, Senapathy simply assumes splicing machinery when talking about introns and exons. This is the splicing code. Senapathy knows that "The sequences around the junctions of exons and introns are highly conserved" (p.549). How is that possible in random DNA? Where does it come from? Why that specific universal splicing code? The splicing sequence is in DNA, but that can only be 'a code' if specific splicing proteins recognize that sequence. The Splicing machinery must match the universal splicing codes in DNA. So, it does not solve the problem of the origin of split genes to search for splicing codes in random DNA. Of course one will find it in random DNA. The point is, however, that the splicing code is not random, so where does it come from? The spliceosome is a very complex ensemble of five snRNAs and about 200 proteins, so cannot arise from scratch. Furthermore, alternative splicing implies different proteins in different cell types. Furthermore, it has been shown that more than 20,000 unique Single Nucleotide Variants, SNVs, likely affect splicing (349). In addition to splicing, eukaryotes possess elaborate mRNA surveillance mechanisms, in particular nonsense-mediated decay (NMD), to assure that only correctly processed mature mRNAs are translated (170).
There are two splicing codes: exonic splicing sites and intronic splicing sites. The exonic splicing sites implies a dual code because a piece of DNA encodes a protein and a splicing signal at the same time (303). There maybe a third function: nucleosome positions bias certain synonymous codons.
All this effectively blocks the origin of splicing from scratch.
The Poly(A) Code 18 Feb 11
The information in DNA is modified before it is used. No equivalent to poly(A) or the caps are in DNA and these are added to the mRNA (Polyadenylation). Therefore, a DNA sequence is not enough. The mRNA is modified. This is crucial for exporting mRNA from the nucleus to the cytoplasm. Alternative polyadenylation can also shorten the coding region, thus making the mRNA code for a different protein. Again: the information in DNA is not enough.

The Epigenetic Code: imprinting James Watson

"The major problem, I think, is chromatin. What determines whether a given piece of DNA along the chromosome is functioning, since it's covered with the histones? What is happening at the level of methylation and epigenetics? You can inherit something beyond the DNA sequence. That's where the real excitement of genetics is now." (James D. Watson: 30).

When the human genome was first fully sequenced, it was often described as the recipe for making a person. In reality, the genome is more like an entire cookbook that can produce hundreds of different cell types depending on which genes are switched on and off. That switching is accomplished using a vast suite of epigenetic marks (284).

DNA-epigenetics (DNA methylation) and RNA-epigenetics (DNA/RNA methylation): is defined as the chemical modification of DNA and RNA that affects gene expression but does not involve changes to the underlying DNA sequence. As the emphasis in biology is switching away from 'The Sequence' and towards the mechanisms by which gene expression is controlled, epigenetics is becoming increasingly important (104). Cell differentiation is associated with selective DNA methylation. RNA marks are used by the cell to determine where, when, and how much of the protein should be generated (377).
In mammals, DNA methylation is essential for normal development (!) and is associated with a number of key processes including genomic imprinting, X-chromosome inactivation, repression of transposable elements, aging. Please note: DNA methylation does not occur in Senapathy's book. If DNA methylation doesn't occur in the Primordial Pond, mammals would then not be able to be born.

"Imprinting reflects competition between a mother's interests and a father's when it comes to gestating the offspring. A mother wants a fetus that doesn't grow too big, so she can survive the pregnancy. A father wants the opposite: a fetus that becomes a strapping baby and, later, a strapping adult who hoards resources and spreads his genes to new progeny. Essentially, imprinting means that in some places along the human genome –about 100 genes in all– the way DNA behaves depends on which parent passes it to the offspring." (357). This is all absent when a genome hypothetically originates de novo from a primary pond. Methylation marks are created and removed in the egg, sperm and fertilized zygote (390).

'Writers' add chemical marks to the DNA or to the histone proteins that DNA wraps around. ©Nature. (285) (modified).

©Nature (click to enlarge)

Epigenetic processes are essential for packaging and interpreting the genome, are fundamental to normal development and are increasingly recognized as being involved in human disease. Epigenetic mechanisms include, among other things, histone modification, positioning of histone variants, nucleosome remodelling, DNA methylation, small and non-coding RNAs.
(Nature, 7 Aug 2008).

Much of a cell's identity is determined by modifications to chromatin, which comprises DNA and the proteins that bind and package it. Epigenetic instructions, in the form of chemical marks that cling to chromatin, tell cells how to interpret the underlying genetic sequence, defining a cell's identity as, say, blood or muscle. The marks serve as instructions that are passed down as cells divide, providing a sort of cellular memory to ensure that skin cells beget other skin cells (285).
Methylated cytosine (5-methylcytosine), often referred to as DNA's fifth base, makes up a subset of nucleotides in the mammalian genome (121). It can regulate tissue-specific gene transcription, without affecting the genetic blueprint. Cytosine methylation may function as a memory module of cell identity and developmental state.

Two epigenetics examples:

DNA methylation is essential for the survival of the embryo. Two studies of mouse embryogenesis now show that transmission of DNA methylation from gametes is predominantly maternal. Mouse embryos need maternal imprints for normal development.
FBHM (Familial Biparental Hydatidiform Mole) is a recessive disorder in humans that results in repeated pregnancy loss due to a failure to establish maternal imprints at multiple loci throughout the genome (227).

DNA of human sperm is highly methylated and that of eggs moderately so. There is a massive loss of DNA methylation from most of the zygote genome immediately after fertilization in human embryos: the erasure of epigenetic memory (337). However, specific DNA methylation appears to be obligatory in plants and vertebrates (eukaryotes!) (298). That's the end of independent origin theory. Furthermore, DNA methylation is used for repressing expression and preventing further expansion of repetitive DNA elements.

A DNA sequence does not survive

"Any theory postulating that genes [!] may have emerged randomly and then waited to be used are fundamentally wrong, especially in a world dominated by the deleterious effects of the second law of thermodynamics. Genes had to have a functional meaning from the very beginning or they would have vanished soon after they emerged." (53).
Please note, this applies to genes. Let alone to genomes. Furthermore, phenotypic effects of DNA are ignored: RNA-editing changes the sequence, and so the phenotypic effect. With no RNA-editing in the primary pond, the DNA sequence would probably not survive for this reason alone.
A DNA sequence is not chemically stable outside a living cell updated 2 Oct 2025

"The stability and the length of DNA molecules in today's living beings indicate that the DNA formed in the primordial pond must likewise have been stable." (page 209)

"Further, the DNA molecule is highly stable even by itself in the test tube, indicating that such a stability is the inherent nature of the DNA molecule." (page 214)

"It is significant that the DNA from mummified bodies has been stable for thousands of years" (page 214)
Unfortunately, mummification is a preservation method, which occurs naturally in very dry or cold environments. So is not applicable to a Primordial Pond, which is a pond (water!).
His most extreme claim about DNA:
"This shows that DNA could have been stable even at the boiling temperatures of the primordial pond." (page 214)
Apparently, the Primoridal Pond is boiling. His evidence for the claim that DNA could have been stable is: "Thermophilus aquaticus lives at 90° C." However, Thermophilus aquaticus shows best growth at 65–70 °C (149–158 °F), but can survive at temperatures of 50–80 °C (wikipedia). He forgets that T. aquaticus has many unique adaptations to high temperatures which are not present in organisms the Primordial pond (481). Furthermore, its DNA is located inside the cell. Not outside a cell in a Primordial pond. That is an important difference. Senapathy does not discuss whether embryos can withstand boiling temperatures of the Primordial Pond.
A huge problem with the idea that DNA can withstand boiling temperatures is: DNA replication. Replication requires both strands to be separated, unzipped and unwound. If DNA were extremely stable, how could the strands be separated?
Even worse: there is a huge problem with DNA in water. "Life's cornerstone molecules break down in water. This is because proteins, and nucleic acids such as DNA and RNA, are vulnerable at their joints. Proteins are made of chains of amino acids, and nucleic acids are chains of nucleotides. If the chains are placed in water, it attacks the links and eventually breaks them. (...) This is the water paradox. If living things keep water controlled, then the implication, say many researchers, is obvious. Life probably formed on land, where water was only intermittently present." (519).
DNA degradation is a process by which DNA breaks down into smaller fragments. Environmental factors such as sunlight, heat, and humidity can increase the rate of degradation. Further, Cytosine deamination takes place (339). So, Senapathy builds his whole theory on very shaky foundations. The abiotic synthesis and stability of DNA is absolutely crucial for his theory but he quickly skips over this hot issue and continues with non-essential details.
A DNA sequence is not chemically stable inside a living cell

"DNA is an extremely stable molecule that can be immensely long" (page 216, chapter 6)
In the early 1970s, scientists believed that DNA was an extremely stable molecule, but Noble prize winner Tomas Lindahl (409) demonstrated that under physiological conditions DNA decays at a rate that ought to have made the development of life on Earth impossible. This insight led him to discover a molecular machinery, base excision repair, which constantly counteracts the collapse of our DNA. Before Lindahl, nobody really considered the idea that DNA requires active engagement by a set of housekeeping processes to keep it in a stable state (353).
A DNA sequence is nothing without a nucleus

In eukaryotes DNA and chromosomes are always found in the cell nucleus. The boundary of the cell nucleus is a nuclear envelope (a double membrane). It is rare to find DNA outside the nucleus, in the cytoplasm (434). How does the Primordial Pond ensure that all the chromomsomes end up in the nucleus? Rarely a cell has more than one nucleus. The defining property that sets eu-karyotic cells apart from pro-karyotic cells is the nucleus. "Genomes are more than linear sequences. We usually think of genomes abstractly as one-dimensional entities that are purely defined by their linear DNA sequences. In addition to the complex arrangement of the genetic information itself, the cellular factors that read, copy, and maintain the genome are organized in sophisticated patterns within the cell nucleus. Specific nuclear processes such as transcription and replication occur at spatially defined locations in the nucleus." (Tom Misteli: 177, 178).
All animals have 2 genomes. All plants have 3 genomes. 31 Aug 2022

A nuclear genome does not function without a mitochondrial genome. They depend on each other. The cell needs mitochondrial genes and mitochondria need nuclear genes. A random mixture of nuclear and mitochondrial genes certainly will fail (393b). Nuclear genes encode mitochondrial proteins. Mitochondrial genes have been transferred to the nuclear genome during evolutionary times (392, 393).
These facts alone falsify the theory of Independent Origin. Above that mitochondrial DNA is inherited strictly from the mother. That means that all hypothetical 'male genomes' are doomed. Hypothetical 'female genomes' are doomed without species-specific mitochondria. A mature oocyte has nearly half a million copies of mitochondrial genes.
A DNA sequence is nothing without a cell

He knows: remarkably, in the Appendix Senapathy knows:
"Usually, an organism starts its growth from a single cell" (p. 536)
But if you start with naked DNA, how do you get from naked DNA to a cell? That is: membranes, mitochondria, ribosomes, nucleus, nuclear membrane, centrosome, ATP (energy). And how do you get from cell to cell division? (See further details: § 7 Genome-centered approach).
- Cell division (mitosis): from the first animal or plant genome a body must be created. That means millions of cell divisions. Chromosomes do not only create an organism, they are duplicated just before each cell division. "Chromosome segregation must be executed with high fidelity so that the mother cell and the daughter cell that arise from division receive precisely the same DNA content". Otherwise aneuploidy will result. Comparing the length of metaphase chromosomes to that of naked DNA, the packing ratio of DNA in metaphase chromosomes is approximately 10,000:1 (Nature). Therefore, central to the problem of segregation is the issue of packaging." (Nature).
  Mitosis is a complicated process in which about 625 genes are involved (Nature, 2010). In 2017 the number of genes having a role in cell division was 1295 genes (361).
  The spindle apparatus is the structure that separates the chromosomes into the daughter cells during cell division. Without spindle no cell division. The kinetochore is a complex machinery composed of more than 100 proteins through which chromosomes attach to the microtubules that form the spindle apparatus, which allows chromosome segregation. (see above: 'A DNA sequence is nothing without proteins'). Importantly, kinetochore are transmitted through a combination of structural inheritance and epigenetic mechanisms. Where does the first cell gets its kinetochores from if it has no parent cell? How does this happen in the Primordial Pond?
  In other words: "To achieve accurate segregation, chromosomes rely on a specialized region known as the centromere. The centromere recruits the kinetochore, a proteinaceous macromolecular structure that forms attachments to the microtubules of the mitotic and meiotic spindles. Together, centromeres and kinetochores are the central players in chromosome segregation." (413). Proteins involved: separase, cohesin, etc.
  Conclusion: many non-random proteins must be present before a cell-division can happen. See Elephant in the room.
- Embryonic Genome Activation (EGA):
  The fertilised egg, at the one cel stage, is comparable with Senapathy's hypothetical random genome in the primordial pond. In both cases the genome must be read for the first time. That means producing RNA transcripts. But DNA can not read itself. In the fertilized egg this vicious circle is solved by the inheritance of maternal transcripts which are able to produce proteins (enzymes) and so on. But independently born organisms do not have a mother. Consequently, cannot initiate transcription (5 Oct 2025).
  
  At fertilization, the animal genome is inactive, and the earliest stages of development are driven by pre-existing proteins and RNA transcripts (mRNA) that have been stored in the cytoplasm of the egg by the mother. (521).
  1. Initially, embryos rely on proteins and mRNA inherited from the mother (maternal legacy) for development;
    The egg contains stored maternal genetic material mRNA which controls embryo development.
  2. transition from maternal to embryonic control of protein synthesis
  3. Transcription of the embryonic genome begins at the one-cell stage, driven by maternal transcription factors.
  4. RNA polymerase II, guided by transcription factors, begins to produce new messenger RNA (mRNA) from the embryonic genes.
  5. The new mRNA is then translated into proteins by ribosomes, initiating protein synthesis from the embryonic genome.
  As the fertilized egg divides into two cells, developmental control is gradually handed over to the embryo. The pre-existing RNA (known as maternally loaded RNA) is degraded, and the embryonic genome (called the zygotic genome) is activated (427). Only on the third day the human embryo, at the 8-cell stage, starts reading its genome (Embryonic Genome Activation). Before that moment transcripts can be detected in the embryo, indicating maternal deposition. But that is impossible for 'independently born organisms' because they do not have a mother and consequently no maternal RNA (352).
  Curiously, in the Appendix Genetics Primer page 560–561 he writes:
  "Some maternal factors (factors from the mother) or maternal genes can in some cases influence development from outside the embryo."
  So, he knows! But, he completely misses the fact that the first 'independetly born organisms' simply have no mothers!
  
  Further Reading: Maternal to zygotic transition also known as Embryonic Genome Activation (EGA).
A DNA sequence is nothing without microbes:

"Animals grow up under the influence of their microbes, not just the blueprints encoded in their genomes. Microbes play a role in development. The bodies and immune systems of animals ranging from tsetse flies to mammals mature properly only after exposure to bacteria. The larvae of some marine worms metamorphose into adults only when they encounter bacterial molecules. (350). 15 Jan 2015
A DNA sequence is nothing without an organism:

parts of the DNA sequence (genes) must be expressed at the right time and place in the right quantities in order to develop an adult from one cell and maintain life als an adult. This called positional information (Lewis Wolper). DNA itself does not contain positional information. To achieve this positional en temporal gene expression, proteins that control such processes have to bind to specific places in the genome. There is evidence that ageing entails a gradual drift towards more random patterns of gene expression, which might cause organ/tissue failure. There are millions of ways to express genes at the wrong time, place, quantity or sex. Furthermore:
- other genes in the genome influence the function of a gene
- the function of the gene product must be in accord with the laws of biochemistry (energy production, protein folding, catalysis, prevention of protein aggregation). Some proteins cannot fold without the help of other proteins, called chaperones, so that means: genes cannot function in isolation, they need other genes.
- DNA must be protected from damage: right from the start DNA must be protected from damage (mutation). Indeed: a very complicated DNA-repair system exists. Furthermore, DNA must be faithfully duplicated (replication) with the help of specialized enzymes. It is highly improbable that they arose by chance.
- a sequence of DNA is meaningless without the correct ecological context, that is biological and nonbiological (the physical conditions of the earth: temperature, atmosphere, climate, gravity, water). See: The genome is blind.
The interaction and interdependency of nuclear and mitochondrial genome:

Interactions between the nuclear genome and mitochondrial DNA are essential for proper cellular functioning, but incompatibilities between the two can lead to compromised development and fitness. Despite having their own genomes, mitochondria don't make many of their own proteins; most are synthesized in the cytosol by cellular equipment encoded in the nucleus. Thus, the interactions of mitochondrial and nuclear DNA are critical to cellular life (318) 26 Sep 2013.
A DNA sequence is not an organism:

an eukaryotic organism contains multiple separate genomes. See: separate page. Explaining the nuclear genome is not enough.
22 Jul 2019
There is no human genome!
There is no such a thing as 'the human genome'. A human genome is either female or male. For humans: 46XX (female), 46XY (male). The Y-chromosome harbors male-specific genes. The genetic difference between men and women is 15 times greater than between two men or two women. (378). A males gets his X-chromosome from his mother. A female from her father and mother. Above the DNA sequence itself, there is different gene expression between the sexes in all tissues and organs in the rest of the chromosomes (autosomes).
20 Nov 2022
A human genome does not guarantee a healthy individual:

There are over 6,000 known genetic disorders in humans (397). Each of those produce a diseased or handicapped person. When produced in the primordial pond they will not survive and reproduce. End of story.
A computer generated DNA string is a virtual thing, see: here.
The limits of a Genome-centered approach: see § 7

Summary of the argument of this paragraph

The problem of the origin of life can be expressed in different ways:

origin of DNA: a chemical problem
origin of the genetic code: a biochemical/evolutionary problem
origin of genomes: a biological problem
origin of species: Darwin's problem

Senapathy merges the problem of the origin of life (an unsolved problem), with the origin of species (solved in principle by Darwin). Senapathy knows that Darwin did not address the question of the origin of life in his Origin of Species (p.199). Following Darwin, evolutionary biologists focus on the origin of species, and leave the origin of life to specialists (chemists, Origin of Life researchers). When evolutionary biologists study the evolution 'from microbe to man', they are not handicapped by their ignorance of the origin of life. However, Senapathy has to solve three problems literally at the same time! He substitutes the 'origin of species' by the 'origin of individuals'. But how could those individuals belong to a species? There are no species in his scenario, only independently born individuals with unique genomes. An individual can only reproduce with another member of the same species. Since there are no individuals of the same species, those individuals cannot reproduce (sexually). So, those individuals die without leaving descendants. All sexual individuals die without leaving descendants. Conclusion: his theory does not explain the origin of species (429).
The origin of life: one cannot ignore the origin of the Genetic Code. The Genetic Code has a nonrandom structure. Senapathy has to analyze what the very problem of the origin of life is (see also paragraph §What is Life?). Today scientists argue that a RNA-world has preceded the DNA-world mainly because DNA does not self-assemble prebiotically.

Senapathy reduces the problem of the origin of life to the origin of a genome. Next he reduces that problem to finding the sequence or 'the origin of biological information'. Next he reduces that problem to a statistical problem. What we see is a stepwise narrowing down of the original problem. At the end he claims having solved the original problem whereas only tried to solve an extremely restricted form of the original problem. Neither the origin of life, nor the origin of genomes have been solved.
The origin of life is a chemical problem and the origin of species is a biological problem. I am afraid that Senapathy thinks that as soon as he has solved finding the Sequence, he has solved all the problems of biology! However, a sequence without cellular machinery is like software without a computer! (333).

2 Jul 23 He knows it!

Again and again Senapathy 'solves' problems by saying 'It's easy'. He did not solve the 26 objections above, but ignored them. For example he writes about the enzyme reverse transcriptase: "It is reasonable that such an enzyme was present in the primordial pond." and importantly: "In selecting the exons of a split gene from a random primordial DNA sequence, whatever machinery that did this should be capable of..." He simply assumes DNA-copying enzymes are present in the primordial pond. Reasonable? An enzyme? machinery? Where do they come from? You need an organism with a genome in the first place to produce 'machinery'! See Elephant in the room. The most amazing discovery is that Senapathy knows about the 26 facts I presented above:

"Even the simplest cell needs DNA, amino acids, nucleotides, proteins and other molecules to construct the structure of the cell such as cell membranes, and cellular machineries such as ribosomes. The cell needs enzymes to synthesize these molecules and metabolize nutrients. Without such absolutely basic things, no living cell can ever come into existence." Note 1 of Chapter 7, in NOTES AND REFERENCES page 596.

"even in the simplest living cell, there must be DNA polymerase, RNA polymerase, ribosomal proteins, many DNA-binding proteins that will serve as gene-regulatory proteins, and so on" (page 427).

He knows it:
ribosomes! but where do they come from? Wishful thinking!

"Some form of translational machinery, far more complex than a single enzymatic function, must have come into existence in the primordial soup to switch from non-DNA-coded to DNA-coded genetic machineries." page 216 Chapter 6.

He knows it:
translational machinery! but where did it come from? Wishful thinking!

"Numerous metabolic enzymes are required for the structures and functions of even the simplest living cell, because these are required for the metabolism of the basic chemicals which are required for the life of any living system." page 427 Chapter 9.

He knows it:
Enzymes! where do they come from? Wishful thinking! His logic appears to be this:

"It is an all-or-none-law that if complete genes existed in the random primordial DNA sequences, then complex machineries and complex cells should be formed;" p.284 Chapter 7.

He thinks that long random DNA automatically produces complex machineries and complex cells! Unfounded claim! Wishful thinking! No evidence. He fails to note that DNA cannot do anything by itself. To synthesize, replicate, transcribe, translate a genome one needs RNA polymerases, ribosomes, etc. etc. etc. (he knows they exist in the GENETICS PRIMER!) but these cellular machineries can only be produced by a functioning genome. Chicken and egg problem. Vicious circle. Elephant in the room.
In fact Senapathy had access to all the necessary scientific literature: he makes extensive use of the textbooks Douglas Futuyma (1986) Evolutionary Biology, E. C. Minkoff (1983) Evolutionary Biology, and all other mainstream evolution and biochemistry literature. There is one important omission: 'epigenetics' and 'methylation' do not occur in his book (416).

He knows it:
Natural selection! Senapathy apparently knows that he needs natural selection:

"Only one out of a very large number of "genomes" assembled into seed cells can become viable. (...) A creature born from a genome should be first physically fit in the environment. If not it dies at birth. If an ecological fit occurs for a physically fit organism, then it survives." (page 313 chapter 8).

"Then useful genes, complete with their split architecture and without any need to be evolved from shorter coding sequences, could simply be selected from among those available genes in the primordial sequences and assembled into genomes." (page 201)

He knows it:
In the Appendix, he knows 'promoter', 'RNA polymerase', 'regulatory sequence', 'Transfer RNAs' (tRNAs), 'Ribosomal RNA' (rRNA). page 547-548. Are these sequences and enzymes present in the primordial pond by accident? Note: the DNA sequences of tRNA and rRNA do not have STOP codons. However, he does NOT know: 'aminoacyl tRNA synthetase', 'aa-tRNA', 'aminoacylation'.

He knows it:
Mitochondria! "It is possible that they also contained other organelles such as mitochondria and chloroplasts, in which they compartmentalized some special functions." page 248.
He knows of the existence of mitochondria. But he just assumes they arise spontaneously! Does he know they contain DNA?

He knows it:
"We have not carried out the further necessary experiments to demonstrate all the aspects of the primordial pond, especially that one can make very long DNA molecules.". (page 215).
So, he knows there is no experimental evidence for his theory.

He knows it:
"Some maternal factors (factors from the mother) or maternal genes can in some cases influence development from outside the embryo." (page 560).
So, he knows! But, he completely misses the fact that the first 'independetly born organisms' have no mothers!

He knows it:
He knows that regulatory DNA sequences are necessary (page 312 and many other pages), but he ignores regulatory DNA sequences in his calculations and restricts himself to ORFs. But without regulatory sequences the genome is dead.

He knows it:
"Let us now compute how many possible protein sequences there can be, given a maximum length of 3000 amino acids for a protein. This is truly immense – 20³⁰⁰⁰. Out of these, we can be sure, only a tiny fraction will have any enzymatic or other biochemical or protein function." page 420.

7 The genome-centered view of life

"The basic principle is that if genes were abundantly available in the primordial pond, they could have randomly assembled to form various genomes, each capable of forming an organism." Introduction page 5 (my emphasis)

"Our foregoing analyses very clearly pictures what must have gone on in the primordial pond to give rise to numerous independent genomes which in turn gave rise to the independent birth of multitudes of organisms." page 430 chapter 9.

Senapathy's thinking is an extreme form of DNA-centrism, gene-centrism, genome-centrism and information-centrism. Gene-centrism means that the most important elements in genomes are protein-coding genes, and the rest of the DNA, including introns, is junk (415). Genome-centrism means that the genome has the power to create and maintain an organism. Information-centered means that the Sequence of 4 symbols A,T,C,G is the most important aspect of life. Here is an example of genome-centered thinking:

“Thus, the genome is the master of the cell and the organism.” (p.551, Appendix - Genetics Primer)

Senapathy is not alone. Recently Richard Dawkins wrote:

“Replicators are the units that survive (or fail to survive) through the generations. Vehicles are the agents that replicators programme as devices to help them survive. Genes are the primary replicators, organisms the obvious vehicles.” (291)

and this can be found on the pages of Nature:

“DNA is famous as the instruction manual of life — the multi-billion-base-pair data tape that directs how a fertilized egg turns into the specific cells, tissues and organs” (294).

Computational genomics researcher Eugene Koonin is also very genome- and sequence-centered (358), but that is the only similarity between them and Senapathy. It all started in 1948 with von Neumann' s striking view that a gene was an "information tape" that could program the organism (374). That's no problem as long as it is recognized as a metaphor. Philip Ball is a fierce critic of the DNA-centred view of life (472).

Why genome-centrism is wrong

The pervasive effect of the discovery by Watson and Crick of the α-helix structure of DNA and the Central Dogma is the genome-centered view of life. However, DNA only codes for protein. DNA does not code for carbohydrates (sugars), fats (lipids), amino acids (9 essential amino acids), energy (ATP), nutrients. Energy and nutrients are external to DNA and the cell. DNA is not alive, a genome is not alive, they are just complex molecules. Only a cell is alive (513). DNA may encode a cell's potential, but the RNA molecules present dictate the activities that define a cell's state at any particular moment (252).
Introns and splicing are central to Senapathy's theory. Splicing occurs at the level of RNA, not DNA. Senapathy's approach is genome-centered and information-centered. Life is first reduced to genomes and then DNA is reduced to information which a computer can handle. The Central Dogma is indeed central to biology, but that by no means does imply that it was involved in the origin of life. DNA does not produce Life. On its own DNA is dead. DNA cannot and does not decide when it is read. DNA has no internal clock. DNA is like a book which can not read itself. DNA needs a reader. Cells rely on precise gene regulation to produce specific proteins exactly when needed. DNA sequences that do not code for proteins, enhancers and promoters, specify the location and intensity of gene expression.
A related problem is genetic determinism. Niche-construction theorists, like developmental biologists, view phenotypes (and hence their environmental modification) as underdetermined by genes (247). A phenotype cannot completely be predicted from a genotype.

The cell
A quick glance at the internal structure of the eukaryotic cell is enough to see the limitations of the genome-centered approach:

Eukaryotic cell (source)
(see wikipedia for explanation)

According to Senapathy, p.552
Please note: except mitochondria, there are no internal structures
in the cell and no indication of DNA in mitochondria.
See: Endosymbiosis theory refutes Senapathy, § 28 and 30.

A few necessary cellular components:

The ribosome is an RNA-protein complex performing protein synthesis in all living cells. The emergence of the ribosome constituted a pivotal step in the evolution of life. This event happened nearly four billion years ago.
The centrosome (wiki, 55,56) is inherited from a mother cell. Upon cell division, each daughter cell receives one centrosome.
The mitochondrion is inherited from the mother(!); it is not coming from the environment (!). There is an interdependency of mitochondrial genome and nuclear genome. Where does the mitochondrion come from in Senapathy's theory? (408). How could the interdependency be explained? Furthermore, mitochondrial DNA has prokaryotic origins and has a circular form. Prokaryotes can not originate from random DNA according to Senapathy himself (see: Endosymbiosis refutes Senapathy).
other organelles: Rough endoplasmic reticulum, Golgi apparatus, etc.

A DNA sequence with introns needs splicing sites, the splicing out of introns requires a spliceosome (229), which uses five small nuclear RNAs and hundreds of proteins.

Senapathy has a gene- and genome-centered approach to explaining life. Probably, that was the common wisdom in 1994 when he wrote his book. Indeed, genes determine most of the differences between humans and other species. But Senapathy does not know its limitations when explaining the origin of life. The big mistake is to believe a naked genome is technically capable of creating a human being. This is succinctly described by the historian Jan Sapp:

"Critics of gene theory continue to emphasize that only a cell can make a cell, and that plant and animals emerge from eggs, not genes" (38)

Of course the cell must be equipped with a genome. Microbiologist Carl Woese wrote that the

"strange claim by some of the world's leading molecular biologists that the human genome is the holy grail of biology is a stunning example of a biology that has no genuine guiding vision"

It is an unremarkable fact of biology that all animals start life as a single cell and that animals and plants have two complete sets of chromosomes (diploid). In this context however, it is a highly significant fact, because that single cell resulted from the fusion of two haploid cells and the two sets of chromosomes directly came from two parents. In Senapathy's theory, there are no parents! William Harvey said: Omne vivum ex ovo: 'All life from eggs'.

Cell membrane

"One of the main arguments I will make in this book is that structures resembling microscopic soap bubbles were an absolute requirement for life to begin, as essential to the process as the assembly of genes and proteins". (David Deamer (2011) 215, p.3.)

Life is cellular. So, any theory trying to explain the origin of life needs to explain the origin of the cell. A cell is defined by a membrane. A membrane is neither made of DNA, nor proteins, but of phospho-lipids. Phospholipids are the result of a long evolutionary process, and their synthesis requires enzymatically catalyzed reactions that were not available for the first forms of cellular life (86). Contemporary phospholipid-based cell membranes are formidable barriers to the uptake of amino acids, metal ions, etc. Modern cells therefore require sophisticated protein channels and pumps to mediate the exchange of molecules with their environment (87). That is a perfect and sufficient reason why modern animals and plants cannot arise out of a primordial pond, not even single celled organisms. A characteristic that places fungi in a different kingdom from plants, bacteria, and some protists is chitin in their cell walls. Senapathy did not give any reason why his primordial pond would not be filled with random DNA sequences until the primordial pond became depleted of DNA building blocks. The process would stop there. All Senapathy has to say is "and the membranes that surround the cell were also available" in the primordial pond (page308). His theory says they were available! That is his 'explanation'. He has no explanation. So, we can forget about introns and exons. The membrane is a crucial argument against independent origin of present-day life.

What is life?
The genome-centric view of life largely ignores thinking about what is life? See: § What is life?

Some criticisms of genome-centered view in the scientific literature

It is true that some genomics researchers still believe that "The human genome encodes the blueprint of life" (290), but most disagree. Here follows a number of illustrative quotes from books and articles expressing the idea that the genome is not enough (please note most appeared after Senapathy's book):

Not by genes alone

Do genes code the organismal form? Not quite, says evolutionary biologist Massimo Pigliucci:

"Genes by themselves do literally nothing. Organisms do not begin with a bunch of genes that generate everything else: they need a set of environmental conditions, as well as the inheritance of materials and extra-genetic information from the previous generation. From the point of view of causal analysis, genes may be said to be a necessary but far from sufficient condition for the development (and evolution) of organisms."
(Original link: life.bio.sunysb.edu/ee/pigliuccilab/bookclub/ does not exist anymore.)

Not by DNA alone

"Today, the view that biological information is transmitted from one generation to the next by the DNA sequence alone appears untenable. There is increasing awareness that non-genetic information can also be inherited across generations."

Étienne Danchin et al (2011) Beyond DNA: integrating inclusive inheritance into an extended theory of evolution, Nature Reviews Genetics 12, 475-486 (July 2011).

A DNA sequence is meaningless

"A DNA sequence, by itself, is meaningless. The information in the double helix is interpreted through its interactions with the rest of the cell."

Barton et al (2007), EVOLUTION, p.381.

A DNA sequence is an archive

"An organism is not a linear script in a DNA language we have learned to read. In fact, such a simplification is a shocking distance form the truth."

"Without RNA, a cell would be all archive and no action".

Michael Yarus (2010) Life from an RNA World, Harvard University Press, p. 97.

A DNA sequence is dead

"Despite its obvious importance to life, biological energy receives far less attention than it deserves. According to molecular biologists, life is all about information. (...) Life without energy is dead"

Nick Lane (2005) Power, Sex, Suicide, p.68.

Genes don't do anything

"Critical to my appreciation of genetics was the understanding that by and large genes don't actually do anything at all."

Lisa Seachrist Chiu (2006) When a Gene makes You Smell Like a Fish and Other Tales about the Genes in Your Body, p.2.

DNA is an inert database

"DNA isn't life. It doesn't even leave the nucleus of the cell. A whole army of proteins is needed to unpack, edit, and execute the information it contains. Without this apparatus, DNA is but an inert database, full of errors and repetitions. To grasp the nature of life, we must move away from our obsession with genes alone."

front flap text of Denis Noble (2006) The Music of Life. Biology Beyond the Genome (216)

Every gene needs an environment

"We now know that there is no such thing as a gene that acts in isolation and that every gene needs an environment --whether the environment is the presence of molecules made by other genes, signals generated internally within the developing nervous system, or electrical activity transduced from the external world. The genes of brain development are impressively environment- and experience-dependent."

Mriganka Sur (2008) NEUROSCIENCE: The Emerging Nature of Nurture, Science 12 December 2008: Vol. 322. no. 5908, p. 1636

Mysteries of the Cell

"We live in the golden age of genetics, but the fundamental unit of biology is still arguably the cell."

John Travis, Mysteries of the Cell, Science 25 November 2011

Processes of Life

"Beyond doubt, Dupré emphasizes, the perpetuation of life from one generation to the next requires much more than simply the passage of DNA. He concludes that genomes do not merely store information. Because of their constant dynamic interaction with other constituents of the cell, their capacities depend not only on their sequence of base pairs. More important, those capacities are determined by the systems of which the DNA molecules are only part."

Review of Processes of Life. Essays in the Philosophy of Biology by John Dupré, Oxford University Press, 2012 in Science 17 Aug 2012.

DNA is not just a database

"It is tempting to regard this famous molecule as just a database containing the algorithm for constructing an organism. But DNA is also a physical object that constantly bends, twists, and interacts with other biomolecules."

Philip C. Nelson: Spare the (Elastic) Rod, Science 31 August 2012

DNA does not 'play itself'

Does this sheet music play itself?

Of course not.
Sheet music is coded music, but it needs an interpreter to become music.
Even a pianola roll does not play itself!
You need a pianola to play the music encoded in the paper roll.
A pianola roll without pianola is dead.

Also, DNA does not 'play itself'.
It needs cell machinery to 'play itself'.
DNA without a cell is dead.

Would a paper roll with random holes encode 'random music'? That is certainly possible, but still a pianola is required to play it. And the pianola itself does not spontaneously arise out of random parts.

Genetical determinism

"Scientists have long since abandoned any concept of biological determinism. It has now been proved beyond doubt that although our genes are fixed, their expression is highly dependent on what our environment throws at us."

Nature editorial, 'Life stresses', 11 Oct 2012

22 Nov 2012

A DNA-centric viewpoint

"After the Second World War, biology in the West moved away from thinking of the cell in physicochemical terms, towards a reductionist molecular-biology approach, with a DNA-centric viewpoint. (...)
Not surprisingly, cold-war divisions led many US scientists to dismiss Oparin. The Nobel laureate Hermann Muller, who thought that life originated as a gene, criticized the poor status of DNA within Oparin's picture of early life."

'In Retrospect: The Origin of Life', Nature Books and Arts, 22 Nov 2012

13 Aug 13
.

The Hologenome

"In this light, the phylosymbiotic microbiome (*) can be understood as an addition to the coadapted genomes of a host organism rather than an arbitrary amalgam. Linking the microbiome and host genome underscores the hologenome as a unit of evolution and blurs the lines between what biologists typically demarcate as the environment and the genotype of a species. Based on the mounting evidence for speciation by symbiosis, it is becoming clearer that a unified theory of evolution that considers the nuclear genome, cytoplasmic organelles, and microbiome as interacting components in the origin of new species is an emerging frontier for biology." (316).

*) the microbial community relationships that recapitulate the phylogeny of their host.

27 Nov 13

The Philosophy of Biology

"There is more in biology than nucleotide sequences, as there is more in language than letter sequences. ...
Development is a complex process of which DNA is an important, but not the only, factor. ...
Biology education should make clear that life requires not only DNA but also a complex cellular machinery." (322).

11 Dec 2013

Tibor Gánti: The Principles of Life

"Consequently, a living system should necessarily comprise at least two systems, of which one is the controlling unit and the other is the controlled part. (...) It is naïve to believe that the genesis of life can be clarified by studying whether a program can be developed by itself. It cannot." (p.15.)

"It has already been mentioned that the cytoplasm and the nucleus can only develop together: no multicellular organism can develop either from a nucleus alone or from the cytoplasm of an ovum without a nucleus. ... [similarly] an embryo cannot develop from the ovum if either its cytoplasm or its nucleus is missing, and no cell can operate without these either." (p. 17) (323).

"This book is a polemic essay. A polemic essay against the onesided idea of biology having the genes in the center. ... Fate brought that I have to oppose again a dogma, against the dogma of omnipotent genes." (Contra Crick, 1989)

30 Jul 2014

The dual nature of life: information and energy

Into the Cool. Energy Flow, Thermodynamics, and Life

"Life is not just a genetic entity. Genes by themselves do nothing more than salt crystals. Life is an open, cycling system organized by the laws of thermodynamics."

Eric D. Schneider and Dorion Sagan (2005), Into the Cool. Energy Flow, Thermodynamics, and Life, p.24 paperback.

9 Aug 2014

Richard Lewontin: a critic of the DNA-centric view

"The trouble with general scheme of explanation contained in the metaphor of development is that it is bad biology. If we had the complete DNA sequence of an organism and unlimited computational power, we could not compute the organism, because the organism does not compute itself from its genes. (...)
Of course it is true that chimps look different from humans because they have different genes. And a satisfactory explanation for the differences need not involve other causal factors."

Richard Lewontin (2000) The Triple Helix. Gene, Organism, and Environment, p.17.

Note: (342)

1 May 2015

A review of: In Search of Cell History by Franklin M Harold

"One theme in the book, which I happen to be partial to, is the implication that biologists have been overly fixated on DNA. We tend to think that, because variation in DNA maps onto variation in phenotype, genes control all aspects of living cells. However, the conversion of information in DNA into cell structure depends on the cell itself reading and interpreting the genetic information. And the cell has aspects of organization, for example membrane-bound structures and long-lived protein complexes, that are passed down from generation to generation without direct encoding in DNA. Genetic software requires cellular hardware (or should it be 'gelware'?). For these reasons, we should not be overly gene-centric when thinking about cell evolution, but should also give due weight to biochemical, energetic, and structural aspects of living cells. For example, I agree with Harold that a simplistic gene-first (RNA-world) model for the origin of cells is flawed. RNA molecules cannot replicate themselves–they need to be embedded in a chemical system that allows their information to be copied. Explaining the origin and perpetuation of this system cannot be laid simply at the feet of the RNA molecules themselves". (reviewed by David Baum)

5 Aug 2022

Life is at root a chemical phenomenon

Transformer: The Deep Chemistry of Life and Death

"For decades, biology has been dominated by information – the power of genes. The importance of genes is unquestionable, yet there is no difference in the information content of a living protozoon and one that died a moment ago. The difference between being alive or dead lies in energy flow, in the ability of cells to continually regenerate themselves from simpler building blocks. ... We do not merely inherit inert information in the form of genes – our inheritance includes this living metabolic network in the egg cell, a flame passed from generation to generation, without pause, right back to the emergence of life. ... Without the flame life is – dead"

Nick Lane (2022) Transformer: The Deep Chemistry of Life and Death (Introduction).

4 Aug 2023

Without a cell, a genome doesn't mean much

The Master Builder: How the New Science of the Cell Is Rewriting the Story of Life

"Nevertheless, the gene-centric view has established a form of tyranny where genes reign supreme over not only our past and present but also our future."

"But the genome is not actually a blueprint for an organism or its architect. Insofar as it contains any design, it is the design for another genome, not for an organism."

"But without a cell, a genome doesn't mean much. For creatures ranging from a virus to a human being, it is cells that give meaning to those sequences of nucleic acids by translating stretches of them into proteins. It is cells that use those proteins to take care of and repair themselves. Most importantly, it is cells that work with other cells to construct an organism. The cell decides which genes are used for what purposes and when, rather than being at the mercy of the genes."

"DNA cannot send orders to cells to move right or left within your body or to place the heart and the liver on opposite sides of your thorax. DNA cannot measure the length of your arms or instruct the placement of your eyes symmetrically across the midline of your face." (NOEMA magazine by Alfonso Martinez Arias)

Alfonso Martinez Arias (2023) The Master Builder: How the New Science of the Cell Is Rewriting the Story of Life. See for example the end of chapter 1 NOT IN THE GENES:

"If you were to put DNA in a test tube and wait for it to make an organism, it would never happen. Even if you were to add all the ingredients that allow the reading and expression of the information in DNA – the transcription factors, plus some amino acids, lipids, sugars, and salts to help catalyze chemical reactions – it would still never happen. DNA needs a cell to transform its content into a tangible form. An organ or a tissue, and most certainly an organism, is no more the result of the activity of a collection of genes than a house is an aggregate of bricks and mortar."
(531).

If the blueprint of the embryo is not in DNA, then where is it? Alfonso Martinez Arias. A very convincing argument for the cell-centric view of life 14 March 2026

16 jun 2025

A Barcode needs a barcode reader

A barcode system consists of a barcode, barcode generator, hardware and software. The hardware is a reader, the software does the processing of the visual information in the barcode and gives a meaning to a meaningless pattern. All parts together form a system, without any of the components, it does not function. A barcode needs a barcode reader. Without a reader a barcode is useless; a meaningless pattern. Substitute 'barcode' with 'DNA' and one has a suitable metaphor for a DNA-centric view.

Image source: Wikipedia.

The logic of the genome-centered view

Why would the primordial pond not produce independent nerve cells, brain cells, muscle cells, kidney cells, heart cells which combine into organisms? Why would the primordial pond produce multicellular organisms by means of single cells developing into multicellular organisms? Why would a genome-centered theory predispose to our well-known embryological development program? Why would a random DNA sequence not produce cancerous cells? aneuploid cells? Would the DNA of a hypothetical genome by accident be in an epigenetic state necessary for the start of embryological development? If the DNA would be in an adult (differentiated) epigenetic state (see: Induced pluripotent stem cell). The problem is urgent because Senapathy claims independent origin of multicellular organisms (eukaryotes), not single-celled organisms (prokaryotes).

Information centered approach

This is what I wrote in a review years ago: "Information is the ultimate explanation of life. Information is the secret of life. This view of life is an oversimplification" (202). Senapathy is victim of this oversimplification because he thinks a computer simulation of statistical properties of DNA is enough to explain the origin of life (eukaryotes). However, Senapathy is not the only one. Famous ultra-Darwinist Richard Dawkins described organisms as information carriers based on the idea that bodies are containers for DNA, which itself is just a code and storage format from an informational point of view.
Another aspect of the information and DNA centered view is the fact that DNA contains the information that is faithfully transcribed and translated into proteins. This view is wrong. One of the main reasons is RNA-editing, which means the non-random replacement of one base by another (for example A-to-G or C-to-T) and ends up in the protein sequence (214). This means that the DNA sequence is not sufficient for producing proteins. Furthermore, RNA-editing does not make sense if the information of DNA originated from random DNA sequences. If the endproduct of RNA-editing is useful for the organism, why would DNA sequences representing the protein sequence not be selected in the first place? That would be the more likely event.
Furthermore, Senapathy ignores DNA methylation which adds extra information to the primary DNA sequence (epigenetics) and it controls transcription factor binding sites and regulatory regions.

23 Nov 2018 Are all genomes and organisms equally likely?
C-value paradox: genome sizes are not randomly distributed over species

There is a mismatch between genome size and organism complexity (see figure C-value paradox). For example flowering plants and amphibians have significant larger genomes than mammals. Senapathy's theory does not predict this fact.
Plants, in comparison to animals, have a very limited set of basic needs, namely water, carbon dioxide, nitrogen, magnesium, phosphorous, potassium plus some trace elements (368). That's a big difference with animals. This causes different survival probabilities. There is mismatch between simple food requirements of plants with huge genomes and mammals with relatively small genomes but complex food requirements. His DNA-based theory cannot distinguish between food requirements of animals and plants. Plants and animals are equally likely to originate and survive from random DNA in his theory.

Conclusion:
Shortcomings of the DNA-centered approach are: it ignores cell and cell membrane, cell arises from fertilization, and genes need to be regulated, and true understanding of gene regulation requires the study of epigenetics and a lot more. An eukaryotic cell contains prokayotic DNA (mitochondria), the type of DNA which cannot originate from random DNA. Furthermore, RNA-editing changes the information transcribed from DNA. Therefore, any DNA-centered theory of the origin of life is incomplete and wrong. Certainly, if eukaryotic life is claimed to be the first form of life.
Even within the genome-centered view two different views are possible: 'the regulatory code' versus 'the protein code'. Senapathy exclusively pays attention to protein-coding sequences (exons). However, many of the observed differences between species likely stem from when and where the products of the genes are made, in other words: 'the regulatory code'.

I collected additional technical reasons from developmental biology, genetics and ecology on the page:
Independent Origin and the facts of life.

Phylogenomics

Ironically, the study of genomes, called genomics, is precisely the research field that enables biologists to construct the tree of life! The use of genomics to construct Darwinian trees of life is called phylogenomics. An example is: 'A Phylogenomic Study of Birds Reveals Their Evolutionary History' published in Science, 27 Jun 2008. According to the theory of Senapathy there can be no true tree of all birds, because there is no common descent of all birds. See also: Elizabeth Pennisi (2008) 'Building the Tree of Life, Genome by Genome', Science 27 June 2008: 1716-1717.

See also: Phylogeography from Wikipedia, the free encyclopedia.

8 Sexual reproduction is far more complicated than asexual reproduction

Senapathy writes:

"The genomes were directly assembled into single seed cells, analogous to the fertilized eggs of sexually reproducing organisms" (page 5)
"male and female 'seed cells' lead to male and female individuals";
"Both male and female seed cells can be assembled";
"One can infer that it is not difficult to segregate the genes for a male or a female into a specific chromosome and in two different sex cells". (page 358)

"Not difficult"! This is an astonishing and outrageously statement. The first quote proves that Senapathy does not distinguish between haploid and diploid cells (18). This is fatal for his theory. Although he knows about X and Y chromosomes in a noot on page 588 (18), in this context he forgets about it. Just look how different the X- and Y-chromosome are! He just flatly states that it is "not difficult" without explanation!

Fig. 10. XY chromosomes

The facts:
Sexual reproduction is the most common form of reproduction in plants and animals. But it is not the most simple form of reproduction. Asexual reproduction is the most simple form of reproduction. It is found in prokaryotes: bacteria and some fungi such as Penicillium. Sexual reproduction is a most important fact against independent origin, because it is far more complicated than asexual reproduction. For a start, two different types of individuals must exist in a species: males and females. Additionally, internal gestation (pregnancy) is more complicated than just laying eggs. Any theory of independent origin should predict asexual haploids because both asexual and haploidy are more 'simple' and therefore more likely to originate. Unfortunately, a normal cell division is complicated enough (many highly specific proteins are involved!), let alone meiosis (the process that produces 4 haploid sex cells).
Furthermore, most sexual species are also eukaryotes, which is another obstacle for the theory (see: Did prokaryotes arise from eukaryotes?). The independent origin of a female and a male of the same species is extremely improbable. The emergence of a hermaphrodite is slightly less improbable than two sexes because it does not involve two different individuals (see: hermaphrodite?). The improbability applies to all sexual reproducing organisms. Asexual reproduction is rare among animals (65). As a starting point, I list ten uncontroversial facts (421). One does not need to be an evolutionist or a Darwinist to accept them. It is known to every biology student.

Ten most important facts about sex

Seed cells [updated 18 Jul 2025]

'Seed cells'? First: seed is a botanical concept, it does not exist in zoology. A serious error. 'Seed cells' are 'analogous' to the fertilized eggs? Analogous? A fertilized egg is diploid and is the result of the fusion of a female and male haploid cell (460). And those cells have a precise non-random composition (mitochondria are exclusively transmitted through the egg cell). How does a 'seed cell' form without males and females? How does a seed cell manage to get the exact right number of 1 pair of homologous chromosomes of each type in its cell? No more, no less. (see karyotype above). It can't rely on meiosis which routinely ensures the right number of homologues chromosomes. Where do homologuous chromosomes in the Primordial Pond come from anyway?
In the case of humans each haploid cell contains 23 matching unique homologous human chromosomes with the additional restriction that only XX or XY combinations are allowed. Any deviations from 46XY/46XX result in serious genetic disorders: Klinefelter syndrome (47XXY), Turner syndrome (45X), 48XXYY syndrome, 47XYY syndrome, and many other forms of sex-chromosome and autosome aneuploidy. (The Primordial Pond must have been should be flooded with an overwhelming number of those genetic disorders). The correct combinations can only be produced with a female 23X and a male 23Y haploid cell. Furthermore, sperm and egg are highly specialized cells. Virtually all random collections of 46 chromosomes produce nothing like a human. Please, calculate the probability of 23 pairs of homologous chromosomes of random origin with exactly matching genes in precise homologous positions will end up in one cell! (355). Thank you. This argument holds for any eukaryote: animal and plant.

cell
A 'seed cell' is apparently a cell. A cell is cytoplasm enclosed within a membrane. A membrane is selectively permeable and is made from a double layer of phospholipids. Where does the cell membrane come from? Where do phospholipids come from? Lipids are not encoded by DNA like proteins. Senapathy assumes what he has to explain. Begging the question of the origin of life.

See the following quote for an entertaining and touching scenario:

"Perhaps among the organisms produced in the primordial pond, some had only secondary sex organs, but no genital organ to copulate; whereas other organisms would have the latter but not the former. Both the above situations may or may not have had the reproductive cycles of sperm/egg production. There could have been many seed cells producing individuals, with wrong combinations of male and female sex organs and secondary sex characteristics. Rarely, some seed cells will process all the three sets of genes for all these three functions - attractiveness by secondary sex features, copulation by genital organs, and reproduction by sperm/egg cycles. This is analogous to many seed cells giving rise to individuals with improper or incomplete organs, which will not survive. Only those individuals with the absolutely right organs will survive. Therefore, only one out of myriads of seed cells may form a viable organism. This may explain why it would have taken geological time for seed cells to be formed with genomes capable of producing viable organisms." (page 358-359.) (my emphasis) (97).

This text could have been written by Greek philosophers Empedocles or Epicurus. There is no science in it.

Senapathy and the Greek Philosophers

In Antiquity basically two solutions for the origin of organisms were developed: design and accident. The creationists Socrates and Plato argued for design. The Atomists Empedocles and Epicurus argued for accident. The atomists needed an infinite universe to explain why accident could produce highly improbable adaptations such as the eye. Darwin improved the 'accident theory' by eliminating the huge improbabilities and replacing them by natural selection. Senapathy's solution is non-creationist and non-Darwinist. But he does not invoke an infinite universe (96), so he bears the full burden of the Boeing-747 argument, and at the same time he dismisses the greatest improvement since Antiquity of the atomist naturalistic theory: full common descent, full gradual evolution, and full natural selection. Therefore, he has all the disadvantages of the 'accident theory' and must do without all the advantages of Darwinian evolution.
Despite DNA, Senapathy ('immutable DNA') is closer to Greek philosophers than he knows: Lucretius believed in the fixity of species, and that all mutations occurred in one burst at the beginning of the world; all species were fully formed by spontaneous generation and did never change (85,p.149). Anaximander believed that all life arose in water (85,p.153). As an afterthougth Senapathy adds 'natural selection' (Fig.7). The Greek philosophers proposed at least two rounds of selection: one for viability, and one for fine adaptations by competition between individual animals.

my review of David Sedley (2007) 'Creationism and its critics in Antiquity'.

9 Can a male or female arise from haploid cells?

X Y
Fig 2. Human sex
chromosomes (2)

In 1692 Richard Bentley asks what the probability would be that a male and a female of the same species should each arise by chance (78). This is exactly Senapathy's question. That is: the question he ought to ask. If the goal is to produce a male one needs an egg that is fertilized by a sperm carrying a Y-chromosome. For a female one needs and an egg that is fertilized by sperm carrying a X-chromosome. So one needs 4 haploid cells to produce one male and one female (3).
What is the probability that those 4 cells arise from the primordial pond? Let us start with the production of one egg cell from random assemblage of genes in the primordial pond. A human egg cell contains approximately 32,000 genes (minus Y-specific genes) distributed over 23 chromosomes. For example chromosome 1 contains an estimated 2700 genes; X chromosome 1600 and Y chromosome 250. I will ignore a lot of complicating factors: a chromosome is more than naked DNA (19), has a centromere, telomeres (74), and an egg cell is more than a bag of genes. All those genes have exact locations on the chromosomes characteristic for the species. For a given species, the same genes are on the same chromosomes and in the same order. Chromosomes themselves have no order (free floating). If one wants to produce a fertile individual that is able to reproduce then a requirement is that all genes on the corresponding chromosomes of the egg and the sperm are in exactly the same locations. Therefore, the probability of the genomes of one egg cell and one sperm cell equals the probability of 32,000 genes ending up in exactly the same positions on 23 chromosomes in two independent trials. Please note that I am not estimating the probability of random assembly of the human genome in one trial. I am estimating the repeatability of the event. In this scenario we need the repeatability because in the end we need 4 (haploid) genomes of the same species. The problem can now be formulated as: how many permutations are there of 32,000 genes? To get an idea of the magnitude of the problem: the number of permutations of only 29 genes exceeds 10³⁰ (which is the number of DNA sequences in Senapathy's primordial pond). It is clear from this that it's impossible that a second cell that matches the first, will be produced with this method, let alone that 4 genomes could be produced in this way.

"each independently-assembled genome is a distinct entity giving rise to a creature that is also distinct and unrelated to others." (page 455).

The problem with the theory of Independent Origin is the fact that the Primordial pond only produces unique individuals, millions of unique individuals, but not species. So, if females and males were produced at all, then they would not belong to the same species. Remarkably, there are no species in the Primordial pond. Remarkably, his theory does not explain The Oigin of Species. He isn't even aware of this fact.

      Degenarate polyploid genomes

The situation is even worse. Many plant species are polyploid, having duplicated genomes. For example: yeast Saccharomyces cerevisiaea is a tetraploid and its genome consists of four roughly identical genomes (the diploid number is 4N). Because of the accumulation of mutations during the history of the species the original identical genomes start to differ. These are called degenerate tetraploids. Is is highly unlikely that degenerate tetraploids originated spontaneously from a pool of random DNA segments (especially all mutations being neutral).

      Virgin birth?

Male bees, wasps, and ants develop from unfertilized haploid eggs, so are haploid. Would that help Senapathy's theory? No, because a diploid female has to produce the egg. Although it is known for an egg to start developing without being fertilised (parthenogenesis) in some insects, snakes (boa constrictor), lizards, and Komodo dragons, and parthenogenesis has been reported in about 70 vertebrate species (roughly 0.1%) and even in sharks (75), but not in mammals. Early mammalian development has a strict requirement for genetic contributions from a male and a female parent (174) and fully parthenogenetic human embryos cannot develop to term (4). The embryos die after a few days. Maternal and paternal genomes are both necessary for normal development in mice, and this is believed to account for the absence of parthenogenesis [=development without fertilization by a father] in mammals (31). An embryo that did not have a sperm involved in its formation cannot make a placenta (the organ that keeps the fetus alive) and so cannot be born (54).
In plants the formation of asexual seeds is called apomixis and it leads to populations that are genetically uniform maternal clones (apomixis is found in more than 400 species of flowering plants). Even if parthenogenesis would work in humans, this only produces females, so would not explain the origin of males. This does apply to all animal species with an XX-female/XY-male sex-chromosome system. There is no Y-chromosome in a female cell, therefore a male cannot be produced parthenogenetically.
    Apart from the Y-chromosome, all sexually reproducing animals, simply have two parents. Asexually reproducing species (making clones of themselves) like bacteria, only have one parent.

      Conclusion

Senapathy's computer experiment shows a single sequence (Fig 1). The analogy with a haploid genome is obvious. (I am ignoring the fact that DNA is a double helix for the moment). I guess he based his theory of independent birth on the idea of a haploid genome and assumed it was no problem to produce diploid organisms. Regrettably, the haploid method fails to produce the first human male and female. The haploid method can only produce asexually reproducing bacteria and other prokaryotes. The funny thing is that Senapathy knows about X and Y chromosomes when he wants to refute neo-Darwinism, but does not realize that they are an insurmountable obstacle for his own theory. Nobody would deny that simple genomes have a higher probability than complex genomes. Therefore, a haploid organism should be the expected outcome of the primordial pond. Multicellular haploid organisms do exist: males of the honeybee are haploid. Senapathy is confronted with the amazing but inevitable question:

why on earth are not all species haploid?

The above problem points to a wider problem: Senapathy does not distinguish between the origin of an individual and a species. How to proceed from the first individual to a population of interbreeding individuals? (Note: Only a few genes can prevent interbreeding of individuals of the same species (incompatibility genes, hybrid sterility, reproductive isolation). Self-incompatibility is of special importance because of obligate outcrossing. Self-pollination in plants requires only one individual, but still is sexual reproduction and few plants actually self pollinate.

Possible objections

I suppose Senapathy could come up with the following objections:
(1) maybe millions of different genomes could produce humans. – This is not relevant. What is relevant that any genome must be produced in a male and a female form. That makes it impossible.
(2) genes do not need to be in same position as the current positions to produce a human, so the probability to produce a human genome from random DNA sequences is very much higher. – Theoretically it seems quite possible that genes which are ordered in a different way on the human chromosomes would still produce a human being. However, paternal and maternal genes (alleles) have identical positions on chromosomes. That situation must be explained.
(3) there are many variations of a gene that produce the same protein and many protein sequence variants produce the same enzymatic function. – This is right but not relevant because I only considered the positions of genes on chromosomes. Those chromosomal positions must match.
(4) sex organs are generated by genes just as all other organs, so it should not be difficult (p.353). – Of course sex organs are generated by genes. That is not the point. The point is: what is the probability that male-specific genes come together on chromosome Y and that the sex-genes are expressed at the right time and the right place in the right sex, and that both genomes are identical apart from sex-specific genes, and both genomes are able to fuse and create a new individual?

10 Can a male or female arise from diploid cells?

17 Oct 2011

A diploid cell (= a pair of each chromosome) must be somehow produced because human body cells are diploid. Instead of the spontaneous origin of 4 suitable haploid gamete-like cells (2 sperm and 2 egg cells), theoretically, replicating each chromosome of a haploid cell and skipping cell division could produce a diploid cell. This doubling method escapes the huge improbability of generating a human genome twice. However, immediately two problems arise. The first is that the 'doubling method' fails to produce a diploid male cell because it does not produce a Y chromosome. An XY-pair can never be produced by doubling. Without a male cell, this scenario fails to produce a male and a female. Therefore, it fails to produce what could be the start of a new species.

The second problem is that even when the 'doubling method' produced a diploid female cell, it would be a 100% homozygote: all pairs of chromosomes would be identical. However, individual human genomes are diploid in nature, with half of the homologous chromosomes being derived from each parent. Therefore, they are different: heterozygote. Normally, in a diploid cell each gene has two versions called alleles. A complete homozygote chromosome pair in which the two chromosomes were identical would be a recipe for trouble, as the effects of any bad gene would be felt to their fullest. This is the same problem that genome researchers encounter if they would try to create the extinct woolly mammoth from its DNA (110). Normal diploid cells are heterozygote in a significant degree, because they originate from two parents (70).

two kinds of cells - two kinds of cell division

mitosis produces:	diploid cells (2n)
meiosis produces:	haploid cells (n)
fertilization produces:	n + n = 2n (diploid)

From a genetic point of view all animals and plants have two different kinds of cells: diploid body cells and haploid sex cells. These cells are created by two fundamentally different processes: mitosis which creates diploid cells and meiosis (91) which creates haploid cells. Both processes are complex because they must guarantee that daughter cells receive the correct set of chromosomes. At the beginning of mitosis, the process of cell division, chromosomes are organized randomly - like jigsaw puzzle pieces spread out on the floor. During the process of mitosis the two halves must be oriented such that they will be pulled in opposite directions into two newly forming daughter cells. Mechanisms must exist to eliminate wrong configurations while selecting the right ones (52).
Further, there is an important difference between mitosis and meiosis: in contrast to mitosis, gametogenesis eliminates age-induced cellular damage and resets life span (212). Meiosis is even more complex and is controlled by meiosis-associated genes. In normal female meiosis in plants and animals, only one of the four products forms an egg nucleus while the other three are discarded into polar bodies. Why? Even the loss of a single chromosome can be lethal and can contribute to birth defects and cancer. Explaining these two highly complex and highly conserved processes with randomness, explains precisely nothing.
Then there is a third process called fertilization which is the fusion of a haploid sperm and a haploid egg. Fertilization creates the first diploid cell of an individual: the zygote. After that, multicellular organisms develop clonally by mitosis from a single cell. Senapathy knows that

"In sexually-reproducing animals, development always begins with a single cell called the zygote." (p.307)

He knows that the zygote is diploid and originates from the fusion of haploid sex cells (p. 307). He next renames 'zygote' to 'seed cell' (a verbal trick) and claims it could arise spontaneously from the primordial pool without sufficient evidence (see elsewhere on this page).
Fertilization is 'simpler' than meiosis, because it is just adding two genomes together. However, things can go wrong too. The fusion must add precisely two and not three or more genomes. Furthermore, it appears that many highly specific proteins are involved in fertilization (71), and these proteins must be encoded by the genome. Furthermore, sperm must deliver a pair of centrioles: "When the nematode C. elegans egg is fertilized the sperm delivers a pair of centrioles. These centrioles will form the centrosomes which will direct the first cell division of the zygote and this will determine its polarity." (226). Furthermore, there are a variety of substances transferred along the sperm into the females on mating (Accessory gland proteins). So the fertilization process is not much simpler than meiosis.

Plants

Plants have a double fertilization: one sperm fertilizes the haploid egg cell, which becomes the diploid embryo, and the other sperm fertilizes the diploid central cell, generating the triploid endosperm. It is extremely unlikely that these complicated processes occur by chance. The endosperm nourishes the embryo during seed development. That is another clue that plant seeds need a mother plant. The endosperm genome is not transmitted to the next generation.

    Evolutionary theory starts with relatively simple haploid cells which reproduce without sex and without meiosis. They are in fact clones. On the other hand diploid sexually reproducing organisms are much more complicated, because they use both mitosis and meiosis. The transition from asexual clones to sexual reproduction is one of the 8 major transitions of life (John Maynard Smith, 32) and Senapathy lets them originate just as easy as single celled organisms by random forces.
    Senapathy claims that his theory predicts eukaryotic genomes. I don't see how his theory could predict diploid organisms in the first place. Given the fact that there are less complicated and therefore more likely ways to reproduce, his theory certainly would not predict a complicated process like meiosis (37). However, meiosis is even more complicated than that. Meiosis produces in the end four haploid germ cells. In males, all four give rise to sperm. In the female however, only one of those four develops into an egg cell, while the other three eventually die. So, additionally, there is also a sex difference in the meiosis process.

      A hermaphrodite?

Could the first organism be a hermaphrodite? In a hermaphrodite species all individuals have both male and female reproductive organs. The possible advantage would be that there is no need to produce two individuals (males and females) which differ in the DNA that determines sexual characteristics, but have otherwise equal genomes. Therefore, the origin of the hermaphrodite species could start with just one self-reproducing individual (snails, plants). Apart from all other objections, if successful, it would only explain hermaphrodite species, not the majority of species with males and females. Furthermore, it would not even explain all hermaphroditic species: many flowering plant species are obligate outcrossers that cannot self-fertilize because of self-incompatibility: they recognize and reject their own pollen.

      A male without a Y chromosome?

Males of grasshoppers and aphids ('plant bugs') do not have a Y chromosome, they are described as XO. Females of those species have two X-chromosomes; they are XX (36). Are females of grasshoppers and aphids perhaps candidates for independent origin? Theoretically they could produce a species. However, apart from the absence of the Y-chromosome problem, the problem of producing a female and male version of the same genome still exists. Furthermore, the production of males requires a rather unusual form of meiosis. Apart from this, if successful, it would only explain XO species, not the majority of species with a Y-chromosome!

    Conclusion: both the haploid and the diploid methods fail to produce the first male and female. It is impossible to produce humans from a pool of genes even if all the necessary human genes were swimming around in duplicate. The funny thing is that Senapathy knows that 'eukaryotic organisms usually contain two of each gene, and a haploid genome contains only one copy of each gene' , but he does not realize that this is fatal to his theory.
Diploidy and sexual reproduction are tightly interconnected. But even when one allows for the independent origin of diploid organisms, than it is still not necessary that they should have sexual reproduction with such a complicated process like meiosis. Indeed, why don't all diploid species use some form of asexual reproduction (common among plants; rare among animals)? Why not produce diploid children directly from a diploid cell, thereby circumventing meiosis? In general: when there are two solutions for a problem in nature, the theory of independent origin should predict the most probable solution, that is the most simple, of the two alternatives.
Postscript: in the Bible, God instructs Noah to take pairs of each kind of animal onto the ark. In those days people knew that one needs a male and a female.

11 Did prokaryotes arise from eukaryotes?

The origin of prokaryotes [15 Aug 2023 / 15 Mar 2026]

"the contiguous genes of prokaryotes could only be derived from eukaryotic split genes by losing introns" (page 232 chapter 7)
"Thus it is possible for the prokaryotic genome to have been derived directly from contiguous genes in the open primordial pond." (page 238 chapter 7)

'Losing introns'! How? As if it were an accident! If 'losing introns' from eukaryotic split genes were so easy, then prokaryote genes could originate directly from the Primordial Pond indeed! But, if that were so easy, why didn't eukaryotes lost their introns too? Why are not all introns lost? Senapathy knows that removing introns requires specific splicing signals and complex cellular machineries (spliceosomes).

'Losing introns' causes a chain reaction of other thoughts (532).
For the sake of argument let's use his language of 'introns':

Apparently, having 'introns' determines that a DNA sequence is eukaryotic and lacking 'introns' determines that a DNA sequence is prokaryotic. Why? *)
If prokayotic genes arose from eukaryotic genes by 'losing introns', then prokaryotic genes are in fact processed eukaryotic genes,
and that means that prokaryotes have processed split genes from insects, crustaceans, mollusks, worms, mammals, birds, fish, reptiles, amphibians and plants!
Furthermore, the Primordial Pond must have a mix of DNA sequences with and without 'introns'
consequently the sharp distinction between Prokaryotes and Eukaryotes completely disappears
But then the whole idea that Eukaryotes arose from random DNA sequences in a Primordial Pond collapses.

*) Why would a DNA sequence with 'introns' belong to an Eukaryote at the time of the origin of life? At the Origin of Life 'introns' are out of place because they presuppose very complex splicing machinery, a nucleus and a cell. Furthermore, as if split genes and introns are necessary for complex life. The function of a DNA sequence determines to what kind of organism it belongs, not the property of having introns. Senapathy wrongly projects the current situation onto the past.

Continue: First things first; Elephant in the room

losing introns

Senapathy knows that in eukaryotes introns are not removed from DNA, they are removed from mRNA. So, if prokayotes have intron-less genes, the introns must have been removed from DNA itself. His solution:

"From the split genes, the introns could be lost by copying back the genes' spliced RNAs (mRNAs) into DNAs by an enzyme called reverse transcriptase – thereby creating intron-less genes." (text from Figure 7.7 page 243.)

Magically, the enzyme reverse transcriptase is available in the Primordial Pond! This enzyme is 671 to 1000 amino acids long! Continue: The primordial pond is an unlimited resource.

Anyway, the whole idea that relatively simple prokaryotes could not arise in the primordial pond, while complex eukaryotes could easily, is a counter-intuitive idea. But his whole theory is counter-intuitive (475).

Did prokaryotes arise from eukaryotes?

"However, molecular biologists now essentially agree that single-celled eukaryotes were the first to originate, and that the prokaryote must have been derived from them." Chapter 12 Conclusion. page 529.

This is an unsubstantiated claim. Senapathy does not give a soruce for that claim. In reality, "The endosymbiosis theory of organogenesis became widely accepted in the early 1980s, after the genetic material of mitochondria and chloroplasts had been found to be significantly different from that of the symbiont's nuclear DNA." (450). See below.

"The importance of all these considerations is not only that they could form complex single-celled organisms, ..." (page 219) (456)

Remarkably, the first life forms originating from the Primordial Pond are single cells:

"I could see that the first cells on earth must have been unicellular eukaryotes (cells with a nucleus) — not simpler bacteria as has been traditionally thought." Introduction page 5.

Senapathy hesitated to start life directly with a multicellular eukaryote. This remarkable, because "The complexity of the genomes is not too different among various multicellular organisms, from worm to human, all of which in turn are not far removed from those of unicellular eukaryotes." (533). Surprisingly, further in the book we read:

"Therefore, if unicellular eukaryotes could probabilistically arise in the primordial pond, then multicellular creatures could arise." page 306 Chapter 8.

Fossil evidence refutes Senapathy: the timing of the 'primordial pond' is crucial

Senapathy's theory states that eukaryotes originated first because genes with introns can be found in computer generated random sequences of bases, whereas prokaryote intronless genes cannot. Did prokaryotes (114) arise from eukaryotes? Please note that this very idea is evolution and not independent origin! Additionally, eukaryotic genomes are immutable! Apart from that, Senapathy still needs to show how this happened. I know of one other publication claiming that "a plausible, albeit controversial, case has even been made that prokaryotic cell architecture is a simplified derivative of that of eukaryotes" (115).
The fossil record shows that the first organisms are 3,500 million years old and are prokaryotes (bacteria). So, this is evidence independent of the theory of evolution. Bare facts. For the next 800 million years life on earth consisted of prokaryotes. Another source states that the first 1.5 billion years, life consisted of aquatic microbes (51). The first indirect evidence of eukaryotes appeared 2,700 million years ago and the first fossil eukaryotes appeared 1,7000 million years ago. Another source states that eukaryotes emerged perhaps as many as 2 billion years later than prokaryotes. Senapathy claims eukaryotes emerged first. It's clear from these data: the fossils say NO!
From the point of view of energy requirements, multicellular eukaryotic animals could not arise before photosynthetic bacteria (prokaryotes!), because animals require enough oxygen in the atmosphere (Great Oxygenation Event). The atmosphere of the early Earth did not contain oxygen. So, the timing of the 'Primordial Pond' is crucial. No fishes, amphibians, reptiles, birds, mammals can arise BEFORE there is enough oxygen in the atmosphere! This is a physical requirement, and does not depend on the theory of evolution. The first oxygen was produced by cyanobacteria (= prokaryotes!) (443). But according to Senapathy's theory, the Origin of Life only depends on the assemblage of random DNA. Why could this not happen a billion years earlier? In stead of in the Cambrian explosion some 540 million years ago? Furthermore, oxygen producing photosynthetic cyanobacteria (=prokaryotes) were preceded by anoxygenic photosynthetic bacteria (=prokaryotes!).

Endosymbiosis refutes Independent Origin (1)

"Thus the very first cells could have been very highly complex. They were comprised of a full complement of split genes, a fully complex splicing machinery, and a nucleus to house all the chromosomes. It is possible that they also contained other organelles such as mitochondria and chloroplasts, in which they compartmentalized some special functions." page 248.

Mitochondria are organelles in all eukaryotic cells; they are crucial for the energy supply of the eukaryotic cell; they multiply independently within eukaryotic cells in a simple asexual fashion (just like bacteria!); their DNA is circular (just like bacteria!); the genes are intronless (just like bacteria!); have little non coding DNA and few intergenic regions between genes (just like bacteria!); are haploid (just like bacteria!); are exclusively inherited from the mother (maternal transmission) and have their own DNA (37 genes in vertebrate animals) which is autonomously copied.
Human mtDNA encodes only 13 proteins, as well as the 22 tRNA and 2 ribosomal RNA genes required for their translation. The mosaic composition of human mitochondria is evident in the organelle's replication and translation machinery, with the ribosome closely resembling its bacterial counterpart and the DNA polymerase resembling that of a viral (bacteriophage) ancestor (299). None of these facts is controversial.
These facts support the hypothesis that mitochondria were once free living single-celled prokaryotes. This hypothesis is called endosymbiosis theory and was proposed by Lynn Margulis in 1970 (Senapathy knows this on pages 231, 598). Initially rejected by biologists as too speculative, the theory was accepted by evolutionary biologist John Maynard Smith as early as 1975 in his The Theory of Evolution. That was 19 years before Senapathy published his book (1994). The entire DNA sequence of the human mitochondrial genome - 16,569 nucleotides - was determined in 1981 (249), 13 years before the publication of Senapathy's book. It was relatively easy to identify the bacterial ancestors of the mitochondria in 1985 (246,p.177). The theory is now the standard view in biology and evolution textbooks (5). Nobel Prize winner Christian de Duve said about the proofs for the bacterial origin of mitochondria: "In the opinion of the vast majority of investigators, these proofs are conclusive." (6). Grauer and Li (2000) in Fundamentals of Molecular Evolution state "the molecular evidence is now overwhelmingly in favor of the endosymbiotic theory". What Senapathy has to say is this:

"Some scientists have suggested that eukaryotes were formed by "endosymbiosis"... Although there exists some resemblance between mitochondrion and bacterial cells, the origin of the nucleus in the eukaryotic cell is still considered to be a total mystery." (p. 231).

Senapathy only devotes two short paragraphs to an issue of crucial importance to his theory. In the quote he sidesteps the problem of mitochondria and in stead discusses the nucleus. He conveniently omits that mitochondria have their own DNA (which is present in John Maynard Smith, 1975). Clearly, he wants to get rid of the theory because it undermines his own theory. The main problem for his theory is that it is extremely unlikely that a dual genetic system originated independently in a million eukaryotic species. Further, he cannot explain the simultaneous origin of a prokaryote (mitochondrion) and a eukaryote, because prokaryotes (mitochondria) have intronless DNA. Additionally, it is even more unlikely that the mitochondrial genomes of all species always contain the genes for the critical electron-transport proteins for respiration, along with the necessary machinery to produce those proteins (13 mRNAs, 22 tRNAs and 2 rRNAs) (64).
Finally, if nuclear genes originated from random DNA, then how it is possible that more than 1,000 nuclear genes encode mitochondrial proteins? Genes from the nuclear and mitochondrial genomes must work in concert to generate a functional oxidative phosphorylation (OXPHOS) system. This system cannot have originated independently. The most plausible explanation is that the dual genetic system arose only once and was inherited from the first eukaryote (63). One of the most irritating facts in Senapathy's book is that he dismisses a theory without a careful examination of the facts. The presence of mitochondria in eukaryotes is not an insignificant fact. It is now recognized that eukaryotic life on earth became the dominant form of life on earth because mitochondria caused gender (5). See further: § 28 (endosymbiosis). Another organelle: peroxisome.

Endosymbiosis refutes Independent Origin (2): gut bacteria

Apart from the role of endosymbiosis in the origin of eukaryotes, there are many symbiont-dependent species such as aphids, humans, corals and cows, in which microbes are abundant and important for host fitness" (445). Microbes are prokaryotes. Endosymbionts live inside other organisms whether that be in their bodies or in cells. The human gut contains approximately thirty-eight trillion microbes (wikipedia). Most bacteria in the human body are actually good for us and help with carrying out necessary life processes. Gut bacteria in humans often aid in the breakdown of foods and synthesize important vitamins that could not be processed by humans alone.
See also: Hologenomics: A hologenome is the whole set of genomes of a holobiont, an organism together with all co-habitating microbes, other life forms, and viruses. And: hologenome on this page.

Energetics refutes Senapathy
    Mitochondria bestowed upon their eukaryotic host 10⁵-10⁶ times more power per gene than a prokaryote. Mitochondria allowed their host to evolve, explore and express 200,000-fold more genes with no energetic penalty. Eukaryotes harbor approximately 12 genes per Mb, bacteria 500-1,000 genes per Mb. The high gene density and small protein size of bacteria can be explained in bioenergetic terms. "Prokaryotes cannot have evolved from eukaryotes, because the energy per gene required to bring forth the complex eukaryotic starting point for prokaryotic evolution under such views requires a prokaryotic endosymbiont to begin with" (131).

Termites refute Senapathy
    Termites mostly feed on dead plant material, generally in the form of wood, leaf litter, soil, or animal dung. This is a problem if termites originated independently in a primary pond. To survive they immediately need other organisms (pants, animals, in other words: an ecosystem). Furthermore, to digest wood (cellulose) termites (eukaryotes) need bacteria (prokaryotes) in their hindguts. A termite never could originate and survive in the primary pond, because prokaryotes could not originate in the primary pond according to Senapathy.

Immune system of eukaryotes 28 jul 11. 18 Apr 25
    The immune system of eukaryotes is a problem for Senapathy. The function of the immune systems is to combat pathogenic bacteria. Bacteria are prokaryotes! However, according to Senapathy eukaryotes arose from the primordial pond and prokaryotes later developed ('evovled') from eukaryotes by an unspecified process. The problem: what's the point of having an elaborate immune system if there are no bacteria? what's the point of having genes that enable detection of bacterial molecules? An elaborate immune system arose out of a random assemblage of DNA in the primordial pond? Individuals without immune system would survive just as well. And since in his theory genomes are fixed, eukaryotes could not 'evolve' the necessary genes (HLA genes) later when encountering pathogenic bacteria. The existence of the immune systems proves that prokaryotes and eukaryotes lived during the same time which contradicts his own theory (449). Remember, prokaryotes could not originate in the primary pond because they have no random DNA.

Single-cell versus multicellular organisms
    The problem of the origin of prokaryotes and eukaryotes is part of a more general problem of simplicity and complexity. In the theory of evolution simple organisms are the most easy to explain and complex organisms are the most difficult to explain. That's why evolution starts with single cells.
Compare single-cell organisms and multicellular organisms: "Unlike a bacterium each cell in an animal requires a position-detection system that causes it to proliferate only when more cells of its type are needed at its particular position in a tissue" (367). Remarkably, in the Appendix Senapathy knows of "a simple bacterial cell [prokaryote], and a more complex cell [multicellular organisms]" (p.559). In Senapathy's theory simple and complex creatures have the same probability to originate from the primordial pond. A single cell is as likely as a whale, elephant or human. Maybe this is caused by the genome-centered view of life. Genomes of simple and complex species differ only in the length of their genomes? or do they? In fact, in contrast to eukaryotes, prokaryotic genomes are usually nearly completely devoid of mobile elements and introns and have genes with very simple regulatory structures, often transcribed into operons with negligible leader and trailer sequences. There are 30 differences between eukaryotes and prokaryotes (17).

Mitochondria contain DNA

"Take the enzyme cytochrome oxidase, for example, which handles the final step of cell respiration. In mammals, the complex is composed of 13 subunits, 3 of which are encoded by mitochondrial DNA, and 10 by nuclear genes. If the subunits of cytochrome oxidase don't work together properly, electrons are not passed to oxygen and respiration fails, triggering the death of the cell."
"The mitochondrial and nuclear genes adapt to each other within a population, and the process must happen quickly because the mutation rate is so high in mitochondrial DNA."
Nick Lane, Nature 19 Nov 2009.

Chloroplasts contain DNA

15 Apr 2012

Several researchers proposed that chloroplasts evolved from bacteria (endosymbiosis) in the late 19th century on the basis of microscopic study of plant cells (246, p.44) and again in 1905 and 1907 (244). In 1962 Hans Ris and Walter Plaut demonstrated DNA in chloroplasts in plants (245). This is not an obscure publication, it has been cited at least 15 times between 1964 and 2011 (it is not known in wikipedia). In 1967 Lynn Margulis under the name L. Sagan concluded that not only chloroplasts, but also mitochondria evolved from endosymbiotic bacteria (246, p.44). In 1970 she published her paradigm-changing book Origin of Eukaryotic Cells. In 1978 Robert Schwartz and Margaret Dayhoff proved the endosymbiosis theory (248). Senapathy could and should have known all this in 1994.
Land plant chloroplast genomes typically contain around 110-120 unique genes. Some algae have retained a large chloroplast genome with more than 200 genes (source).
Second argument against Senapathy: chloroplasts are derived from free living cyanobacteria. That means those cyanobacteria must have existed before eukaryotic plants. This contradicts Senapathy's scenario in which bacteria evolved from eukaryota.

Bacteria and Archaea

To complicate matters further, the Woesian revolution established that that prokaryotes, far from being a homogeneous group, actually consists of two genetically very different groups: Bacteria and Archaea (73, 114). Although Archaea superficially resembled bacteria (being single-celled and lacking a nucleus), Archaea have a distinctively different metabolism, cell wall, and transcription machinery. That means in Senapathy's theory both Bacteria and Archaea are supposed to have originated from eukaryotes. This is very unlikely.

Size matters 19 Jan 2017
Bacteria are small. Because of that there are millions and billions of them. Bacteria multiply fast. Because they are fast and numerous, they evolve fast. That is an advantage for the origin and evolution of life. Evolution can try out millions of solutions for the problem of staying alive. This could be a matter of life or death. The origin of life on earth could have failed. Multicellular life is big and not so numerous. Therefore it evolves more slowly. It is logic that bacteria originated first, not animals and plants. They are too complicated, too big and too slow.

12 All mammals require a mother

"When we consider the case of the independent birth of mammals, it is reasonable to think that a conglomeration of a large number of cells and biochemicals in the primordial pond could have formed an environment akin to that of the placenta and uterus of mammals. There, a seed cell can differentiate into an embryo and a full-grown offspring". (Senapathy, p.309 chapter 8)

© Lennart Nilsson (1990)
Human embryo with umbilical cord. The
umbilical cord is the link between mother
and embryo and symbolizes dependency.

A human baby without a mother? Surely, you're joking, Mr. Senapathy! (313). This is an example of extreme speculation without any evidence. All mammals have internal gestation in contrast to egg-laying animals, like sea urchins, frogs, fish and worms (66), (369). Even an invertebrate like the Red Swamp Crayfish has internal fertilization and the newly hatched stay with their mothers for about two months (370). Could a human baby develop in a primordial pond? Could it survive without placenta, without mother? That would be a miracle (29). There is no constant supply of food and oxygen during nine months. There is no protection against pathogens or predators. The placenta comprises two components: a fetal portion and a maternal portion. So, how could a placenta and umbilical cord form without a mother? How could the first human individual have a navel? (29). Please note that he explains the origin of mammals without excluding humans. Later, this will appear to be crucial (387, 388). If he excludes only humans, does that imply that chimps do arise in the primordial pond?

maternal proteins and RNAs in the zygote 14 oct 2025
An embryo exclusively relies on maternal gene products, RNAs, and proteins for its early development until activation of its own genome. Across animals, RNA transcribed from 40–75% of all genes is deposited into the egg during oogenesis as a maternal contribution required for embryo development. "In all metazoans (animals), the fertilized embryo is provided with proteins and RNAs deposited by the mother during oogenesis. These maternal stores support early embryonic cell divisions before the onset of transcription at zygotic genome activation (ZGA), which occurs after a stereotypical number of cell cycles. (...) maternally provided transcription factors governing the onset of at least some zygotically activated genes." (522).

body temperature
Warm-blooded animals with a constant body temperature require a tenfold increase in energy expenditure above cold-blooded animals (101). Does the primordial pond have a temperature of precisely 37 °C? Birds have a body temparature of 38 - 43 °C. How does the primordial pond handle that?

implantation
Ignoring all the problems discussed above, the very idea that a isolated human genome could produce a human fetus contradicts established knowledge of many reciprocal adaptations of fetus and the pregnant mother. For example: Would the first human genome include genes for implantation, placenta, umbilical cord? What's the point? The process of implantation is occurring during week 2 of development in humans. A synchronized dialog between maternal and embryonic tissues are crucial. Without implantation the embryo cannot survive.
The early (human) embryo creates three distinct types of stem cells: the first creates the body of the fetus. The second group of stem cells create the placenta. The third group of stem cells create a sac in which the baby will grow (433). This is how genomes are programmed. But what is the point of the other two if the embryo arises in a primordial pond? Even a 100% accurate and complete human genome sitting in a cell does not have a higher probability of survival than any random genome. It could only survive when it is self-supporting: when it could get its own food (adult). Until that it really is a kind of parasite. The human genome is simply not designed to be self-supporting in a primordial pond at the one cell stage and many years after that.

viviparity
Viviparity means the embryo develops inside the body of the mother. The best example is placental mammals. Another group is the pouched marsupials like the kangaroos and koalas of Australia. At birth, the baby kangaroo is no larger than a peanut, a blind, pink, hairless fetus-without-a-whomb that must crawl on its own through the mother's fur into the pouch. It drinks milk from a teat (122). However, scorpions, some sharks, some snakes, and velvet worms also are viviparous. Roughly 20% of non-avian reptile species (lizards, snakes) give birth to live offspring (viviparity).

birth
Other problems: if the human embryo is in the water of the primordial pond during nine months, how does the sudden transition to air breathing (usually called: 'birth') happen? Who helps the baby out of the water to prevent death by drowning? After normal birth fetal haemoglobin (a crucial oxygen-carrying protein) drops off and the adult version kicks in. Organs, such as the kidneys and lungs, which do not function in the womb, must all switch on at the same time (called 'birth'). How is all this regulated and coordinated?
Babies have average birth weights around 7.5 pounds. In the primordial pond, the baby could grow to any size before 'birth', because it does not have to pass the birth canal of its mother. Females have wide hips and a large enough pelvic opening that enable babies with big brain sizes to be born. A real genome 'knows' to start the delivery at the right size of the baby. How does a human genome in the primordial pond know about the nine months? Timing of birth in humans is under genetic control (207). Does the mouse genome know that it is sitting in smaller animal and needs to deliver in a shorter time?
The human baby is born premature when compared with chimpanzees. The head of the fetus is still small enough to pass through its mother's birth canal. One of the consequences is that humans at birth are utterly helpless (42). The human brain doubles in size in the first year of life and achieves 95% of its adult size by the age of 5 (although white matter grows at least to age 18).

energy
Reproduction is one of the biggest energy investments that an animal will make. Caring for offspring is as much as 10 times more energy expensive than producing them, and this higher expense is the case not just in mammals (in which costs are the highest), but in other taxa as well. (438). It takes roughly 13 million calories to rear a human baby from birth to nutritional independence at around age 18 or older (168). Big brains are so metabolically expensive that primates must postpone the age of reproduction in order to build them. High fecundity requires at least an extended family with fathers and grandmothers around to help provision and care for the young (109).
"In mammals, which often grow placentas to provide oxygen and nutrients and to remove waste, and which maintain a stable internal body temperature, the researchers found that the indirect costs made up roughly 90% of the total energy costs of reproduction. Just 10% of the total energy is contained in the offspring. In humans, 96% of the 208,000 kilojoules (or nearly 50,000 kilocalories) required for reproduction is taken up by indirect costs." (439).

breastfeeding

breastmilk
Breastmilk can be considered a 'live tissue'. It contains oligosaccharides, lactose and lipids. In addition to macronutrients and micronutrients essential for child survival, breastmilk contains other myriad bioactive components, including cells and microbes (381). Not only mammals, but many invertebrates produce milk-like substances to nourish offspring (375).
The first woman 'was born' in the primordial pond complete with breasts to nourish her future babies. However, that woman herself did not have a mother to breastfeed her. How did she develop and survive without any maternal care?

food
The first teeth of the human baby typically appear between six and nine months. It can take several years for all 20 teeth to complete the tooth eruption. How does the independently born baby get food without teeth and without a mother?
An essential amino acid or indispensable amino acid is an amino acid that cannot be synthesized de novo by the organism (usually referring to humans), and therefore must be supplied in the diet. Non-essential amino acids can be synthesized by the organism –provided the necessary cellular machinery is present. Also, omega-3 phospholipids (essentially for the brain) is primarily obtained from diet. Nearly all animals must obtain vitamins, carotenoids (metabolically expensive chemicals!), etc. through their diet because their genomes have no genes for producing them.
Fish and frogs have large amounts of yolk, whereas mammals have extremely small amounts.

hormones
Amazingly, in order to be a healthy individual, in Senapathy's scenario the first female genome would not need hormones for ovulation, menstruation, womb, pregnancy, and lactation. If no hormones are necessary, then genes for hormones are unnecessary. The genomes would be different.

immune system
Genetically, a fetus is half mother, half father. Why isn't the fetus rejected during pregnancy? Why is the immune system tolerating the fetus, which contains antigens that the maternal immune system recognizes as foreign because they are the products of genes inherited from the father? During pregnancy the foreign antigens of the developing fetus and the placenta come into direct contact with cells of the maternal immune system, but fail to evoke the typical tissue rejection response seen with organ transplants. The cause is the silencing of chemokine genes in the decidua, the specialized structure that encases the fetus and placenta (276). A hypothetical 'independently born fetus' has no father and mother, so does not have to solve these problems, but as soon it tries to reproduce it will have to. But it does not have the genes for it, because it was independently born. DNA has no foresight.

sex organs
The fetus has internal and external sex organs which are useless in the womb. Furthermore, sex organs would be unnecessary for survival of the first individual. There is no reason to expect that the primordial pond would produce complete male and female genomes. Sex organs simply do not contribute to the health and survival of the first individual.

parental care
Why should the first female have a pair of breasts which grow considerably during puberty? A pair of breasts does not contribute in any way to the health and survival of the individual possessing them. They are a burden and a risk (breast cancer). Additionally, how does Senapathy explain that only 50% of the individuals have breasts? The first individual needs food, needs to escape disease and predation to survive, not sex. So why is the primordial pond not producing sexless individuals forever? (25).
Why are parents (especially mothers) motivated to care for their young? It certainly does not help the survival of the parents themselves.

complete genomes
Returning to the issue of complete genomes: what is a complete genome? If spontaneous generation of genomes were nature's method for producing animals and plants, then a healthy sexless individual is viable and complete. As an illustration: only one missing gene can make healthy male or female mice sterile (27). On the other hand one needs a few hundred genes, I guess, to add maleness and femaleness. To evaluate 'independent birth' we need to eliminate our deeply rooted prejudices about the necessity of sexual reproduction. Senapathy should take his primordial pond serious and reason from the point of view of the primordial pond, and resist relying on the benefit of hindsight.

altricial species
In general, in bird and mammal biology altricial species ("requiring nourishment") are those whose newly hatched or born young are relatively immobile, have closed eyes, lack hair or down (naked), and must be cared for by the adults. Altricial young are born helpless and require care for a comparatively long time. Among birds, these include, for example, herons, hawks, woodpeckers, parrots, owls and most passerines. Rodents and marsupials are altricial, as are cat, dog, fox, lion (they are carnivores) and humans. Altricial species usually have relatively short gestation periods. Altricial individuals, if ever 'born independently', don't survive.

mouthbrooder
Mouth brooding is taking fertilized eggs in the mouth until they have developed to live independently. It is usually done by fishes (cichlids). The hypothetical first individual (embryo) of a mouthbrooding species does not have a mother or father, so cannot survive.

imprinting
The best known form of imprinting is filial imprinting, in which a young animal learns the characteristics of its parent.

imprinting
©Antal Festetics, 1983

It is most obvious in nidifugous birds, who imprint on their parents and then follow them around. Konrad Lorenz demonstrated how incubator-hatched geese would imprint on the first suitable moving stimulus they saw within what he called a "critical period" of about 36 hours shortly after hatching. Most famously, the goslings would imprint on Lorenz himself (wiki). In Senapathy's theory there are no mothers. In the absence of a mother the hatchling would follow the first creature in sight: a crocodile, a bat, or a Boa constrictor. In other words: the hatchling is doomed to die.

12b All mammals require a father

In mammals we have never seen in the wild a baby born that didn't have a father. We see fatherless babies in birds, fish, amphibians, reptiles, etc., just not in mammals. They aren't common in birds, but in some fish and reptiles species, such asexual reproduction is the normal mode of reproduction. We think we know why mammals are odd. We have a strange way we transcribe about 100-200 genes in early embryos called genomic imprinting, not seen in these other species. While we have two copies of all our genes on the 22 non-sex chromosomes, for some we use only the one we inherited from our mother or only the one we inherited from our father. For example, the gene for insulin-like growth factor 2, is transcribed exclusively off the DNA from your father. If any of the genes transcribed just from paternal DNA code for vital proteins, then embryos without father's copies will be unable to develop. Hence, we never see babies without fathers (446).

13 Common descent versus independent origin

Just as creationsits, Senapathy points out that there are no intermediates between groups of animals. For example, there are no intermediates between reptiles and birds and the lack of fossil ancestors for fishes, etc. So far, I did not use common descent as an argument against Independent Origin. I restricted myself to the facts that make Independent Origin implausible, improbable, and impossible. This is more than enough to refute Independent Origin. However, contrasting both theories is helpful for understanding the origin of species.
Rejecting common descent comes at a huge cost: it equals reinventing the wheel a million times! All the combined adaptations that produce successful flight must be reinvented for each potential bird (there are 11,000 bird species). All the combined adaptations that enable survival in the sea must be reinvented for each fish. It just seems crazy to reinvent a dog-like type repeatedly to explain wolf, fox and coyote. Small modifications of a basic dog-like type would suffice. Creationists and other critics of common descent must have suspected this problem and proposed a limited form of common descent for similar organisms. For example, Senapathy wrote: "The reality is that numerous creatures were independently born in a common primordial pond" (page 490) and: "Pieces of already successful genomes can become part of newly assembled genomes in the primordial pond" (page 321).
Compare the two diagrams below. The first is from Senapathy and the second from intelligent design creationist Paul Nelson. Remarkably, both accept a limited form of common descent ('microevolution'). A dog-like 'basic type' produces the dog, hyena, fox, wolf, and coyote species.

similar species of a distinct creature
Species according to Senapathy

millions of distinct independently-born creatures

Fig 3. Senapathy, p.462

pheasants ducks dogs cats horses
Species according to Nelson

creation of basic types

Fig 4. Paul Nelson (7)

Senapathy:

"Our new theory does not dispute the occasional connections among different families within an order, or the more common connections among genera within a family or species within a genus." (page 454 chapter 10)

"many species within a genus usually connectable by evolution and many families within an order are sometimes connectable by evolution" (page 461 chapter 10)

'Connectable by evolution' means Darwinian evolution and natural selection! Firstly, this implies all the evolutionary mechanisms such as mutation, natural selection and the origin of species. Later I found this statement in Chapter 12 Conclusion:

...and from each distinct creature came many varieties and similar species by a number of mechanisms – natural selection, genetic drift, mutations in trivial genes ..." page 533 (483).

Unexpectedly, Senapathy's theory is not really a theory of independent origin contrary to the title of his book "Showing That Evolutionary Theories Are Fundamentally Incorrect". In the Introduction he writes "This enabled me to propose an entirely new theory on the origin of diverse creatures on earth, without involving organismal evolution at all" (page 5). Not true. He claims that Evolutionary Theories Are Fundamentally Incorrect, but in chapter 10 he unceremoniously introduces the origin of species by mutation and natural selection without apologizing. That is cheating. That is dishonest. At least inconsistent and confusing. Contradicting himself. First overstating his claim, and in the end backing down and revoking the claim.
Secondly, further evidence comes from the primordial pond. Numerous 'creatures' originated from "a common pool of genes in the same primordial pond" (p.455). He introduces the concepts "the universal sequence pool" (USP) and "the universal gene pool" (UGP) on page 202. A common pool of genes contradicts the independent origin of genes (490). We now have a violation of independent origin at three levels: (1) common pool of genes, (2) microevolution (Fig 3), and (3) prokaryotes evolved from eukaryotes. Therefore, it is misleading to label his theory as 'independent origin'. That's cheating! Even worse: to claim simultaneously "That Evolutionary Theories Are Fundamentally Incorrect" (book title) is simply dishonest. Apart from the label, the amount of common genes is left unspecified. Probably because he has no theoretical reasons for their existence. I am afraid that random origin of DNA sequences predicts unique sequences, not multiple occurrences of the same sequence.
There is a practical implication of the hypothetical "common pool of genes": a common pool requires that there is only one pool on the earth. Where was it located? How big was it? How long ago? How long did exist? (25). Was it fresh or salt water? Was it hot or cold? Could it be at the bottom of the ocean? or must it be at the surface? All unanswered questions!
The most revealing remark is:

"Many ponds may have produced life during that fertile period of the earth eons ago, but the life from only one pond survived until today." (464)

But the life from only one pond survived until today? Why? Why only one pond, when there are many? Is there anything in his theory of independent origin that predicts this? On the contrary! It is against the idea that independent origin creates unique creatures (465). Why would he claim such a thing? That statement is conceptually similar if not identical to Darwin's Descent with Modification, or Common Descent! (Senapathy does not use the phrase 'Common Descent' but 'descent with modification'). The universal genetic code is a beautiful confirmation of common descent of all life (so for the theory of evolution). Senapathy seems be aware that he has to incorporate common descent of all life in his theory.

"...multitudes of complex organisms could have been born directly from the primordial pond, deriving their basic genetic codes, genes, and cellular machineries from a common universal gene pool." (page 219).

So, his motives are clear. The problem is: how does one derive the genetic code from the primordial pool? He doesn't elaborate. He gives no details. The genetic code is the fixed pairing of 64 base-triplets with 20 amino acids (see here). If amino acids were randomly assigned to triplet codons, there would be 1.5 x 10⁸⁴ possible genetic codes. How does Senapathy ensure that only the unique standard genetic code will be produced? and not a mixture of millions of different genetic codes? Senapathy doesn't tell us. Maybe he didn't notice the problem? He just assumes all this. But exactly the origin of the genetic code is one of the hard problems for any theory of the origin of life!

Furthermore, both mechanisms (independent and dependent) can be arbitrarily invoked to explain any pattern of similarities and dissimilarities in nature. Similarly, it could 'predict' any pattern. Far from being an advantage of the theory, it is actually a disadvantage. It is an ad hoc 'explanation'.
The scientific value of his theory becomes still worse (but still more comfortable for Senapathy), when he allows for arbitrary genome mixing:

"slightly changed creatures could also be produced in the primordial pond by mechanisms of genome mixing and genome alteration and or restructuring" (p.455)

Also:

"We stated in the new theory that an already successful genomes could be used, in part or full, in the construction of the genomes of later born organisms." (p.419).

In other words: stealth common descent! Stealth evolutionary reasoning! Please note: "in part or full"! My objections are:

This is again contradicting independent origin;
If 'genome mixing' completely mimics common descent, then there is no observational difference of his theory and common descent
'genome mixing' and 'in part or full' is arbitrary, too vague, too 'cheap', too 'easy' giving a maximum of freedom
it does not make sense that anything goes at the moment that genomes originate and millions of years thereafter all genomes are frozen (immutable)
It is impossible to refute such a theory. When showing evidence that refutes independence, Senapathy always can claim 'my theory can explain this by a common pool of genes and genome mixing'. We do not learn anything new about nature.

Finally, let us not forget that in Senapathy's theory prokaryotes derived from eukaryotes, thereby contradicting independent origin again. It is an odd aspect of Senapathy's theory that bacteria, the most simple living organisms, did not arise directly from the primordial pond, but from more complex organisms!

See also: The final refutation of independent origin

On the other hand, in a sense evolutionary theory involves 'independent origin' of, for example, eyes and sex chromosomes (332). However, this is not independent origin of organisms from scratch, but of parts of organisms within the context of common descent.

14 The role of randomness and improbability

"When the number of random events are large enough, the unbelievable will certainly happen"

(Senapathy, p. 332). (124)

"When the number of random events are large enough, the unbelievable will certainly happen" (p.332).

"the universal sequence pool of random DNA sequences totalling approximately to 10^30 nucleotides could have existed in the primordial ponds." (page 251)

Randomness is the single most important explanatory principle in Senapathy's theory. This is because his theory is based on random genomes. Furthermore, his Primordial Pond is endowed with everything he needs: unlimited amount DNA sequences of infinite length, and every enzym that is necessary to boot up the system. And there is 'random perfection' of organisms.
Ironically, randomness is very important for some evolutionists too: "the probability of even an extremely unlikely event happening is actually quite high" (100). The difference is that for evolutionists natural selection is an addition to unlikely events, while for Senapathy unlikely events are the only explanation for life on earth. Because he relies solely on random events, the available time is crucial. The available time is not infinite, but finite. The universe exists for 13.8 billion year and the earth exists for 4.54 billion years. So, not every possible event happens. Above that, major mass extinctions destroyed a lot what has been achieved and reset life to an earlier phase.
For the moment I will ignore that Senapathy introduces natural selection and micro-evolution in disguise through the back door. In the next sections I will discuss what the effect is of rejecting natural selection, mutation, adaptation, and time.

15 The role of natural selection

See also: Adaptation, Ecology.

"Each independently-originating creature (...) also gave rise to many related similar species by many mechanisms such as natural selection and mutation..." Chapter 10, page 488.

dandelion seed
©Dries Buytaert

Intro:
Consider the seed of a dandelion (Taraxacum officinale) in the picture above. Its physical properties are marvelously adapted to the physical properties of air to enable dispersal of the seed by air (67). Consider the number of hairs: too many or too few, too short or too long, too thin or too thick would fail to make it air born. The properties of the hairs ultimately depends on the density of air. The density of dry air at sea level is approximately 1/800th the density of water, but as altitude increases, the density drops dramatically. The density further depends on temperature and humidity. The density of air ultimately depends on its composition: roughly 78% nitrogen, 21% oxygen and 1% argon. So where does the perfect match of number of hairs, total weight of a seed and the density of air come from? Could this be a lucky accident following from the random assembly of DNA nucleotides in the primordial pond? (see also: The genome is blind).

Senapathy apparently knows that he needs natural selection:

"Only one out of a very large number of "genomes" assembled into seed cells can become viable. (...) A creature born from a genome should be first physically fit in the environment. If not it dies at birth. If an ecological fit occurs for a physically fit organism, then it survives." (page 313 chapter 8)

"Only those organisms that were suited to the fundamental physical and chemical environments on earth – gravity, temperature, land, water, air, light – could be fit to live. Therefore, for every organism that emerged as a viable life form, there were many, many multicellular masses that perished." (page 334).

Amazingly, this is a description of Darwinian natural selection without using the words 'natural selection'! (422). This is written in a book with the title "...Showing That Evolutionary Theories Are Fundamentally Incorrect". Natural Selection was and is the most important part of Darwins theory of evolution: On the Origin of Species by Means of Natural Selection. Senapathy sometimes substitutes 'natural selection' with "The sieving effect" (page 335).

The probability of a human genome arising out of the 'primordial pond' is vanishing small. There is nothing in Senapathy's theory that tells us how many genome trials are needed to produce a human genome. One? a hundred? thousand? million? billion? trillion? If selection is a negligible factor, then the origin of (human) life could be a matter of hours! The fundamental question here is how easy is it to produce a genome? (24). The point of Fred Hoyle's Boeing-747 story (14) is that building blocks are not enough to produce complex systems. Essentially, Senapathy believes that a tornado in a junkyard produces a Boeing-747. Creationists and Darwinists reject the possibility that a complex system can arise by chance in one trial. According to evolutionary biologists, numerous selection steps are needed. According to creationists, 'intelligence' is needed.

Fig 5. Probability: 1 in 1.000.000.000

Fig 6. Probability: 1 in 10

Strickberger (15) compared the very low chance of getting the word 'EVOLUTION' in one trial (figure 5) with the high probability of getting it in successive small steps (figure 6). Although some details of Strickbergers illustrations are confusing, it is clear that the method of figure 5 is essentially Senapathy's method. Therefore, Senapathy chooses the most difficult method (20). Senapathy's mechanism is a whole-genome-test. The Darwinian mechanism is a test of a small modification of a genome. Another important difference between the Senapathy type of selection and Darwinian selection is that Senapathy's selection applies to unique genomes, while Darwinian selection applies to individuals of a species. The death of a Senapathy genome equals extinction, while Darwinian selection means that very similar genomes of the same species survive and can be improved. The power of selection comes from endlessly repeated cycles of magnification of the successful genomes in populations of very similar individuals. Lucky accidents are magnified. This is crucial feature is completely absent in the theory of Independent Birth.
Suppose a healthy human 'male' with a perfect human genome originated from the primordial pond, but missing just one gene: the SRY-gen, which makes the unlucky individual completely infertile. It would not qualify as a male anyway. That means no descendants and 100% selection against that individual. It means that this extremely rare and nearly perfect genome is extinct forever. The odds that the same genome with an intact SRY-gene will arise for the second time are astronomically low. Compare this with an endlessly repeated cycle of small improvements based upon successful individuals of previous generations. It will become clear that the intensity of selection in Senapathy's scenario is huge when compared to selection in the Darwinian scenario. The power of common descent is the accumulation of inventions and the power of natural selection is selection of small variations of proven successful individuals. I only realized the powerful advantages of common descent and natural selection when I compared them with independent origin.
Can we test whether genomes have a random origin? Of course. Senapathy should have given statistical tests of randomness of real genomes (they were available already in 1952, see box). For example, the frequencies of A,T,C and G should be equal if genomes have a random origin. Any deviation from randomness can only be explained by mutation and selection.
As far as Senapathy is concerned, a genome could have originated yesterday. His genomes are timeless fixed creations. Senapathy genomes do not contain any history.
Finally, any amount of selection after the creation of a genome destroys the whole idea of organisms arising directly and simultaneously from the primordial pond.
Common descent and Natural Selection are both central theories of Darwinism. Senapathy smuggles in downgraded versions of both and at the same time triumphantly claims that Darwin's theory is 'fundamentally incorrect'.

Fig. 7. Later Senapathy produced this figure on the internet, demonstrating the extensive involvement of natural selection (although he disingenuously used different names such as 'failed trials', 'filter', 'window', 'pinhole'). He also uses the word 'natural selection', contradicting his book title "...Showing That Evolutionary Theories Are Fundamentally Incorrect."

Earth, Ecology and Climate

Natural Selection and ecology are connected issues. Example: The disappearance of vast tracts of tropical forest some 305 million years ago led to an explosion in the global diversity of reptiles and amphibians, thanks to the emergence of many new, fragmented habitats. During that period, climate change dried up equatorial rain forests in the land mass that later became Europe and North America. Many of the species that lived across these forests became extinct, and were replaced by a wealth of different types of reptile and amphibian that were particular to isolated habitats. Amphibians, which depend on aquatic environments, fared less well than reptiles, which were able to adapt to a drier world (Nature, 9 dec 2010). This cannot be predicted from the properties of spontaneously arising genomes because those properties derive from the laws of chemistry and did not change 305 million years ago. The only thing that changed was selection pressure.

Another counter example: the exploitation of the newly arisen angiosperms approximately 66 – 86 million years ago by early mammals triggered the diversification of mammals and a shift towards increased herbivory (273). Herbivores, frugivores, granivores, root- and bark-eaters, egg-eaters, insectivores, carnivores and omnivores all point to the dependence on plants, insects, reptiles. The primary pond is completely blind to these ecological factors.

Circadian cycle: gene activity shows a circadian cycle: a 24 hours clock (364).

See also: Adaptation,

Adaptation and Ecology: how to start an ecosystem which discusses the importance of ecology.

16 The role of mutation: exploring DNA sequence space

Ironically, in evolutionary theory mutations are random and in Senapathy's theory genes and genomes arise from random combinations of the bases A, T, C, G. Despite this he writes:

"Mutations in a genome can only lead to normal individual variations, or to genetic defects, which are absolutely (28) useless for organismal evolutionary change"(p.46)

and having said something about the effects of mutations, he goes on to declare the immutability of organisms:

"the genome of every independently born creature is unique and unchangeable into that of another unique creature, and therefore is essentially immutable." (p.6)
"a snail can give rise to many different snail varieties, but never to a crab or a sea star." (p.46).

Senapathy's use of the concept 'im-mutability' is very confusing. He does not clearly distinguish between (im)mutability of an individual genome and that of its descendants; and between an individual and the population as a whole. Evolutionary changes happen during million of years, not during the lifetime of an individual. Neutral mutations are the stepping stones towards useful mutations. Of course an individual has a unique genome. The cause of this uniqueness is mutation. Recently, the DNA sequence of James D. Watson revealed 3.3 million single nucleotide mutations, of which 10,654 cause amino-acid substitution. In addition, 2-40,000 base pair (bp) insertions and deletions as well as copy number variation resulting in the large-scale gain and loss of chromosomal segments ranging from 26,000 to 1.5 million base pairs were detected (77). This means that there is a huge reservoir of genetic variation in a population of individuals. That is the material for natural selection to act upon.

Exploring DNA sequence space

Exploring DNA sequence space. Nature 16 dec 2010

Exploring DNA sequence space. Nature 16 dec 2010

    His 'immutability' concept introduces two more serious difficulties for his own theory. Mutations are steps in genome space. If mutations only cause defects and diseases, then steps in genome space are harmful or lethal. This makes individuals isolated islands in genome space. If genomes are essentially fixed, and cannot use mutations to explore genome space, then how does the primordial pond find those rare viable genomes? How does it avoid those unsuccessful genomes? This is impossible. Furthermore, if there are no viable intermediates in genome space and viable genomes are rare, then this is a problem for both independent origin and gradual Darwinian evolution. There is only one world, therefore both theories have to deal with the same genome space. Potentially, 'Independent Origin' has the advantage that it does not need to explore genome space by a limited number of trajectories through genome space, but can hit isolated sequences that are inaccessible for an evolutionary step-by-step process.
    However, Senapathy needs either a huge amount of luck or a huge amount of selection. A huge amount of luck is unsatisfactory and a huge amount of selection contradicts his own claim that selection is unimportant. Senapathy postulates a very resourceful primordial pond ("The number of genes in it must have been several times more than that contained in all creatures that ever have lived on earth", p. 312). Where is the evidence?
    His claim that "mutations are absolutely useless for organismal evolutionary change" is in conflict with his statement: "many species within a genus usually connectable by evolution and many families within an order are sometimes connectable by evolution" (see: Common descent versus independent origin). If one accepts common descent up to the level of families and orders, how could this be achieved without advantageous mutations??? One cannot have common descent on such a large scale based exclusively on harmful mutations! One cannot create even one new species based on harmful mutations. Senapathy did not think very thoroughly about this problem.

17 Adaptation. Not random perfection

"At the time of the birth of organisms, "random perfection" of organisms filtered the meaningful organisms from among the myriad mostly meaningless independently-born organisms. Those creatures that fit well with the physical environment survived while others perished. Among the physically fit immutable organisms, ecological fitness occurred by chance." (p. 204) (81).

Fig. 8. Is the match between the extreme long spur of the orchid and the extreme long tongue of the moth an accident?
© image Sinauer 2005 (60)

One of the main functions of flowers is to attract animals. Why? How did this happen? Even Goethe felt compelled to explain the origins of floral structures. For Senapathy the answer is: Random Perfection! Why would anybody opt for such a desperate 'explanation'? As can be concluded from the above quote, Random Perfection is based upon 'filtering'. That's another word for 'selection'! That's Darwin! It is dishonest to smuggle in natural selection using another name. Above that, if all adaptations are the direct result of randomly assembled genomes, then we can not ask any further questions about those adaptations. We can not make any progress in our understanding of adaptation. 'Random perfection' caused by random genomes is the final answer. Don't ask any further questions. In fact, every property of an organism must be explained by random genomes according to Senapathy's theory, since mutation and natural selection are excluded (81). This implies that we never will understand the big questions in biology: the origin of adaptations like the brain, eye, ear, nose, hart, lung, digestion, photosynthesis, meiosis, respiration, blood circulation, warm-bloodedness, sexual dimorphism, parental care and bird migration, let alone the interrelations between them (61). This is an unacceptable drawback for a professional biologist. Senapathy is forced to accept 'random perfection'. He has no alternative. He has no choice.
Senapathy misses a number of crucial points here: a few trials are not enough to determine if an individual is 'ecological fit'. Genomes cannot be tested in isolation from other species, because species are each other's environments! (See this page). Senapathy's theory reduces organisms to isolated individuals. We need a theory that let species originate, evolve and adapt to their local environments including other species. Additionally, the origin of species is completely unconnected to the geological context (geographical differences, continental drift, ice ages, meteorite impacts, climate changes, etc). In Darwinism the environment is an important external causal factor (externalism). Furthermore, genomes cannot be tested at one point in time only, because that leaves unexplained how species are able to adapt to ever changing environments.

Fig. 9. Which genome originated first:
the humming bird genome or the flower genome?
How could they survive and reproduce without each other?
Is this mutual adaptation merely a genome accident?
© image Sinauer 2005 (60)

©Scientific American
Timema poppensis partially camouflaged on its host,
coast redwood Sequoia sempervirens, California.
Could both the genomes of these organisms originate
randomly and yet the organisms look very similar?

Senapathy describes in several pages the complexities and the diversity of the eye in the animal world and claims Darwinism can not explain this. He ignores that his theory implies that the eye has to be reinvented a thousand times in mammals which all have the same type of eye. According to his theory, the eye has been independently produced by the genomes of the rabbit, squirrel, mouse, bat, tiger, lion, leopard, deer, bear, giraffe, buffalo, dolphin, rhinoceros, elephant, monkey, ape, human, etc, etc.
Creationists frequently claim that evolution relies exclusively on randomness, but in fact randomness is an adequate characterization of Senapathy-genomes. For Senapathy, life is a 'genome accident'.

The genome is blind

"The genome is blind and cannot visualize the existing niches and environments. Therefore, millions of bizarre phenotypes must be produced in a species for the selection of one useful structure. (p. 89, see also p.75. my emphasis).

This looks like a devastating argument against Independent Origin. Surprisingly, these words are written by Senapathy himself. He is perfectly aware of the problem that genomes are blind. Remarkable for someone rejecting natural selection, he uses the word 'selection' in the above quote! He writes:

"the genome of the reptile or the wingless invertebrate did not "know" that there was a medium called air in which the animal could fly if it developed a wing for its host" (page 89).

"To the genome of an animal that lacks a wing, the new genes that code for the feathers have no meaning." (page75).

Surprisingly, he turns this into a problem for Darwinism by stating that an almost infinite number of random mutations should occur in order to arrive at a wing. 'Infinite' is exaggerated, but in principle 'Independent Origin' has the hypothetical advantage compared to Darwinism, and that is because it sidesteps the need to modify existing structures because all organisms originate de novo. Viewed from the genomic perspective: the potential advantage of 'Independent Origin' is that genomes are not restricted by paths or trajectories in genome space, while evolution is strongly constrained by accessible evolutionary paths.

Consider for example the origin of land animals. The vertebrate transition from water to land requires changes in a variety of functional systems including feeding, respiration, support and locomotion. The key question is: is it easier to produce an organism from chemicals or to modify an existing organism? How many random genomes are needed to produce a land animal or a flying animal from scratch compared with modifying aquatic animals or non-flying animals? That must be billions of times more. In the theory of evolution the origin of flight starts with a fully functional reptile or insect (evolution is cumulative). Senapathy has to produce a fully functional animal plus a pair of fully functional wings from a random genome. Which of the two is the most difficult task? He is also fully aware of the fact that birds have additional unique properties. Again: the probability of producing all those features from scratch must be very much lower than adding them one by one to an existing animal. Even if Darwinists would not have any idea about the genes and mutations involved, still the probability of adding features to an existing design must be orders of magnitude easier than developing a complete animal from scratch. Birds inherit all their features (metabolism, anatomy, cell structure, behavior, reproduction) from reptiles, tetrapods, multicellular organisms, eukaryotes, and single celled ancestors. The primordial pond has to reinvent everything as many times as there are species. Rejecting common descent comes at a huge cost: it equals reinventing the wheel a billion times!

See also:

Common descent versus independent origin
The role of natural selection).

Another ADAPTATION example:
Magnetic compass: directional information, which enables an animal to maintain a consistent heading, for example towards the North or South and 'magnetic map' (a few animals can also derive positional information from Earth's field). Magnetic sensitivity is phylogenetically widespread; it exists in all major groups of vertebrate animals, as well as in some molluscs, crustaceans and insects. The list includes groups such as flies, chickens and mole rats, none of which migrate (Nature). The molecular and genetic basis are cryptochromes that, in migratory birds, are thought to enable sensing of Earth's magnetic field. What does a blind genome know about the earth's magnetic field?

17a Ecology and the food chain

Marine food chain
source

Ecology and food chains make independent origin of animals very difficult if not impossible. The basis of the foodchain are the primary producers, the autotrophs which are organisms that can make their own food using inorganic materials and photosynthesis. In the above example phytoplankton (algae). Remarkable, Senapathy is aware of this 'problem' (463), but simply states in a footnote that plants "must have originated in the primordial pond at least at around the same time as, if not before, the animals." That's it! Problem solved!
"For most of Earth's history, the food chain rested on the tiny backs of cyanobacteria. Some researchers believe complex animals couldn't arise until big, nutritious green algae finally dethroned bacteria as the world's foremost photosynthesizers." (501).

See all about ecology and food chains:

Independent origin and the facts of life (page on my website).

18 The role of time: the chronological order of life

"Primordial chemical reactions on earth, approximately several hundred million years ago, produced a primordial pond with enormously large amounts of long DNA sequences," (page 202)

Time does not play a big role in Senapathy's theory. Only 3 periods in earth's history are distinguished:

Figure 11.1. The chronology and time table of the independent birth of organisms. Chapter 11: 'A New Look at The Fossil Record', page 497.

Current view of the Cambrian Explosion (541 million to 515 million years ago, that is a 26 million years period) indicated by red box. © Science (319)
The figure shows that the Cambrian Explosion was preceded by the Ediacaran Biota (EB) and that a huge number of Classes, Orders, Families, Genera and Species originated after the Cambrian Explosion.

Senapathy is careful to exclude absolute dates in the figure. However, from the legend of the figure it appears that the primordial pond existed from 600 - 595 million years ago. The start of the chemical evolution is dated at 4 billion years ago, which agrees with orthodox science. Furthermore, in the text of the chapter he claims that the beginning of the primordial pond coincides with the Cambrian explosion starting at 533 million years ago and lasting 5 - 10 million years (p.496) (94). In the figure he indicates 'the end of the birthing activity' but again no date is given. In the text he writes that the primordial pond existed for a few tens of millions of years (page 504). On page 505 he writes that the fertile period in the history of the earth lasted 50 - 100 million years. On page 204 he writes that the primordial pond 'became barren millions of years ago' which does not help very much in pinning down the precise date. Ignoring contradictions in his data, the period of the existence of his primordial pond is from about 600 to 500 million years ago. Thereafter, no new organisms are born: "only extinctions". The funny thing is that the human species, and all mammals, must have been born during the Cambrian Explosion! Needless to say this contradicts mainstream science. According to mainstream science the following species appeared in the fossil record after the primordial pond became barren (98):

Senapathy's drawing of the origin of life (cover)
with modern dates of origin added (66).

"All living creatures suddenly erupted from that pond, and simply walked, swam, flew or flowered away to fill the earth with the awesome power and beauty of organic Nature." (464)

Dates according to mainstream science:

first humans did appear 6-7 million years ago
Old World Monkeys: 25 -33 million years ago
first bats, dogs, weasels, elephants: 30 - 40 million years ago
first placental mammals (rabbits, whales, rodents): 60 - 65 million years ago
Brasilodon quadrangularis is the oldest mammal sofar: 225 million years ago
first mosquitoes, honeybees: 65 - 70 million years ago
first turtles: 65 - 190 million years ago
first butterflies (Lepidoptera): 150 million years ago
first birds: 155 - 165 million years ago
first frogs, crabs: 135 - 190 million years ago
first flowering plants appeared about 135 million years ago; grasses appeared around 94 million years ago
first sex chromosomes (XY) appeared some 200 million - 300 million years ago
first amniotes (land-living vertebrates): 310 million years ago
first amphibians: 360 million years ago
first tetrapods: 375 - 363 million years ago
first insect fossil appeared 412 million years ago
first land plants appeared 465 - 470 million years ago; plant with leaves appear with a delay of 40 million years; trees: 475 million years ago.
fish with jaws appeared: 416 - 359 million years ago
jawless fish appeared: 500 - 435 million years ago
first multicellular animals: 635 million years ago
eukaryotes: 1,000-1,300 million years ago
cyanobacteria: 2.15 billion years ago
stromatolites: 3.4 billion years ago
putative fossilized microorganisms that are at least 3,770 million and possibly 4,280 million years old
Potentially biogenic carbon preserved in a 4.1 billion-year-old zircon

The reader is advised to have a look at: TimeTree: The Timescale of Life or: Deep Time. Interactive Infographic or: GSA Geologic Time Scale or: Geological time scale (wikipedia). There is detailed chronology of the first appearance of life forms in: Stearns and Hoekstra (2005) p.416-418.

The reader will search in vain for human fossils in chapter 11 A New Look at The Fossil Record. Senapathy forgot to discuss human fossils. That's a pity, because I would like to learn why there are no human fossils known from the Cambrian period and the whole primordial pond period. If a human fossil was found in the Cambrian period the theory of evolution would be falsified and it could be compatible with the theory of independent origin.

source

Furthermore, many forms of life appeared in the fossil record before the start of Senapathy's primordial pond (600 Mya). There is solid evidence that life was present on the earth more than 3 billion years ago. Conclusion: life appeared 2400 million years before the start, and 500 million years after the end of Senapathy's primordial pond. Secondly: the first fossil animals found in the oldest layers were creatures that lived in the sea (trilobites, brachiopods), only later animals and plants living on land are found. Why? The first animals on land were amphibians and reptiles. Reptiles dominated the earth before mammals appeared. Mammals appeared much later. This was known before Darwin (1859). There was little overlap between "the age of reptiles" and "the age of mammals". Traces of humanity did not appear until the very end of the record. This is chronology. This paleontology.

Anyone proposing a non-evolutionary explanation for the origin of life and species must start with explaining the fossil record as known in 1859 and everything that has been learned since then. Senapathy did not do this. This fossil record cannot be explained by a primordial pond that produces every species at the same time.

Chronology of life is connected with chronology of earth

Around 2.4 billion years ago, a pivotal moment in Earth's history took place: The Great Oxidation Event. During this period, a significant amount of oxygen accumulated in the atmosphere. This surge in oxygen production led to a dramatic shift in the composition of the atmosphere, altering the chemistry of the planet. The event marked a turning point as oxygen levels rose, enabling the development of more complex multicellular life forms and fundamentally reshaping Earth's ecosystems (428).
In contrast, the origin of genomes in the 'Independent Birth scenario' is an autonomous process disconnected from the earth. According to Senapathy complex eukaryotic organisms and single-celled eukaryotes were produced at the same time. But, how can it be that only after 3 billion years, about 600 million years ago, organisms emerged from the microscopic world, became larger, and shortly thereafter developed skeletons and shells? (159).

Oxygen and body size. Modified after Kump

Life got big after nearly 3 billion years of microbial evolution. Soft-bodied organisms of centimeter- to meter-scale first appeared 579 million years ago (180). Why the delay if every genome (mono-cellular and multi-cellular) was produced?

We humans are only here because of a remarkable series of revolutions in Earth history, each revolution built on the previous ones. The Earth was not ready for 'higher' forms of life. By a process of niche construction the Earth prepared itself for land animals and plants. The first forms of life derived their energy from simple chemicals coming from the Earth ('chemo-litho-autotrophs'), and not from sunlight or by consuming organic material (325).

Chronology fact: plants

The first plants on land were small and leafless. Plants with leaves appeared 40 million years later (72). Why don't leafless and leaved plants appear randomly in the fossil record? Why were plants (green algae) present in oceans 500 million years before plants colonized the land? Why do only land plants (especially trees) possess lignin? (277). What would be the use of enzymatic lignin decomposition gene in a genome of an organism born in the primordial pond? Why the chronological order of appearance? In Senapathy's genome-centered view the chronological and environmental context is absent. There is no possibility to answer questions why certain fossils are where they are.

All organisms are born in the Primordial Pond and move out (see cover illustration of his book). But what about plants? Rooted and unable to flee from the Pond, plants drown and die. How could seedlings survive in the primordial pond? Could seeds germinate in a pond at all? And how do get from naked DNA to a seed anyway?
Example: the desert-dwelling tobacco species Nicotiana attenuata has an array of mechanisms to survive severe environmental stresses including fire, herbivores and drought. However, could it survive in a pond? How does it move from the Primordial Pond to the desert?

Chronology fact: photosynthesis

There ares two photosynthesis methods: C₃ and C₄ (used by 7500 plant species, mostly subtropical grasses, maize, sugarcane). The C₃ method is optimal for high atmospheric carbon dioxide levels (100x today) and the C₄ is optimal for low carbon dioxide levels. However, there is only one atmosphere and one primordial pond in Senapathy's scenario. Data support the view that the C₃ system arose more than 2800 million years ago under high carbon dioxide levels, the C₄ system arose as an adaptation to low carbon dioxide levels about 30 million years ago (72). If both systems were produced by the primordial pond, then the primordial pond must have existed from 3 billion years ago up to 30 million years ago. Unlikely as it is, it still does not explain why it produced the systems in that chronological order and why with such a huge time interval.

See also: § 22 Incompatible primordial ponds.

Chronology fact: Great Oxygenation Event

Great Oxidation Event (appearance of free oxygen O₂ in Earth's atmosphere around 2.4 billion years ago) is believed to have followed the development of oxygenic photosynthesis by ancestors of modern cyanobacteria. Cyanobacteria have played a central role in the evolution of life on Earth, both by producing oxygen as a photosynthetic byproduct and by generating organic carbon, the major ecological energy commodity.
DNA sequences from extant organisms bear an imprint of the GOE. Enzymes that bind molecular oxygen are more likely to appear in organisms that emerged after the Great Oxidation Event (155).
However, it takes time to fill the earth's atmosphere with oxygen. To be precise: nearly 2 billion years! It was not until oxygen levels rose even higher, around half a billion years ago, that the oceans could support large multicellular eukaryotes that got their energy by burning food. In Senapathy's genome-centered view the chronological and environmental context is absent.

See: Great Oxygenetion Event (wikipedia)

Chronology fact: Nitrogen cycle: bacteria first

Proteins and DNA contain Nitrogen (N). The atmosphere of the earth contains enough nitrogen (78%), but remarkably, animals and plants can not use it. Only nitrogen-fixing bacteria (prokaryotes!) can use nitrogen from the air. (Some fixation occurs in lightning strikes). This imposes a chronological order: first bacteria, then plants, then animals. So, independent origin of animal and plant genomes is impossible. In Senapathy's genome-centered view the chronological and ecological context is absent. 2 Jun 11

See: Nitrogen cycle (wikipedia)

Chronology fact: chronology of sex

Why on earth was there no sexual reproduction in the first half of the history of the earth? These are facts from the geological record. One does not need to be an evolutionist to accept them. On the theory of independent origin such groups of organisms should appear randomly in the history of the earth or why not all at the same time?

Extinctions and recoveries

The independent birth theory does not predict a specific order of appearance or extinction in time because genomes are randomly generated in time. However, the fossil records shows for example that "the extinction of dinosaurs at the Cretaceous/Paleogene boundary was the seminal event that opened the door for the subsequent diversification of terrestrial mammals" (133). For the first 140 million years of their evolutionary history, mammals were small (up to 15 kg). Such a pattern should not exist according to the theory of independent birth.

How does the theory of independent origin deal with the big extinctions and recoveries? The figure shows 5 major mass extinctions. Each is followed by recovery of the number of species. Overall, there is a clear increase in the number of species from 600 million years ago to today. This would require that primordial pond(s) must have been active continuously from 600 million years ago up to today.

Figure: Nicholas Barton et al (2007) Evolution, p. 283.

Relation between macro evolution and regulatory innovation:

Macroevolutionary trends in faunal diversity and gene regulation.

Macroevolutionary trends in animal diversity and gene regulation show that there is a chronological order in the appearance of animal groups and regulatory sequences contradicting expectations of random origin in a primordial pond. Source: (221).

Conclusion: It is OK to reject neo-Darwinism. However, the first appearances of animal and plant groups in the fossil record are the raw data which any theory of the origin of species has to explain. Both Senapathy's theory and neo-Darwinism have to explain exactly the same data of the fossil record. It does not help to deny the existence of 'missing links': the chronology of fossils are facts. Introns and exons do not change the facts of the fossil record.

19 The role of place: the biogeography of the origin of species

source

According to Senapathy, life originated in the primordial pond. Where on earth is the primordial pond located? Senapathy does not tell us. Biogeography reminds us that the word "origin" denotes both a process and a place—that the great variety of life did not just arise in some indistinct and misty nowhere. Instead location matters. When we study distributions we begin to associate the evolution of plants and animals with a particular setting, thus providing a tangible background to the birth and development of species. The Earth is not merely the cradle of life; it is its whomb.
Imagine life evolving on a planet covered by a single, uniform ocean—of constant depth, stable temperature, and few currents, and you have imagined a planet where life would very likely remain simple and relatively homogeneous (123).

Endemic species
Endemism: a species is unique to a defined geographic location, such as islands (Hawaii, Galápagos Islands, Socotra, Tasmania), isolated areas such as the highlands of Ethiopia, or large bodies of water like Lake Baikal. So, every island its own primordial pool?

Transition from sea to land
There was a time that there was not much land (see illustration above). There was a time when there were no animals or plants on land. Animals and plants originated in water and colonized land when land became available. Hemichordates are exclusively marine animals. The green alga Chlamydomonas reinhardtii represents early photosynthesizers, confined to water and never expanding beyond a simple single cell. Bryophytes (mosses, liverworts, and hornworts), which evolved 450 million years ago, were among the first plants to colonize land. For the moss Physcomitrella patens, the climb to shore required new genes for surviving dry spells and temperature swings, and that resulted in a more complex genome, with expanded families of genes (210). Later came woody plants. For animals a similar story can be told (see: Neil Shubin). Keywords: place and time.

See also: Biogeography.

20 The clumpiness of morphospace

"What can be more curious than that the hand of a man, formed for grasping, that of a mole for digging, the leg of the horse, the paddle of the porpoise, and the wing of the bat, should all be constructed on the same pattern, and should contain the same bones in the same relative proportions?" (300).

Most organisms are well adapted to their immediate environments, but also built on anatomical ground plans that transcend any particular circumstance. Why should structures adapted for particular ends, root their basic structure in homologies that do not have a common function? Why should this be so, if all organisms arose independently?

C-value paradox: genome sizes are not randomly distributed over species.

Why are genomesizes not randomly distributed over all living species? Why are they clumped? Why do genomesizes of birds and mammals not overlap?
Why do all backboned animals have four fins or limbs, one pair in front and one pair behind? Why are all land vertebrates 'tetrapods' (4 legs), while none have six or eight legs?
With a few exceptions, all mammals and birds are warm-blooded, and all reptiles, insects, arachnids, amphibians and fish are cold-blooded (218). Why is this, if they are independently born?
Why do birds (of prey) have no teeth? It would be advantageous for larger pieces of food which cannot be swallowed in one piece. Why don't birds have a third pair of arms to handle their food? (since they can't use their wings for that task). Humans, apes, squirrels can use their hands for handling food. Why aren't there birds with internal gestation like mammals? Why are the chicks of all passerines (songbirds) altricial (blind, featherless, and helpless when hatched from their eggs)? Why are most passerines smaller than typical members of other avian orders? Why has the foot of all passerines 3 toes directed forward and one toe directed backwards? Why do birds embryologically start with paired oviducts, but one or the other side fails to develop (together with the corresponding ovary), and only one functional oviduct develops?
What is the matter with genomes: birds have remarkably small genomes, averaging 1/2 to 1/3 of the size of typical mammalian genomes. Why should that be when all eukaryotic genomes arose independently? Small genome size is intriguingly correlated with flight. Bats, compared to other mammals, have small genomes, and flightless birds, compared to other birds, have larger genomes. (220). Why are introns and intergenic regions of birds half the size of mammals? (348).
Why do only birds and insects fly, not frogs and mammals (except bats)? Why does not one of the almost 40,000 species of spiders fly? Why do all spiders have eight legs? Why do all spiders produce silk? Why are all spiders predatory and not herbivorous?
What is the matter with genomes that there are twenty thousand species of birds and only twenty species of crocodile? (Mike Benton)
What is the matter with genomes that insects are more numerous than any other type of animal, accounting for 80% of species?
What is the matter with genomes that, unlike most animals (females: XX, males: XY), female birds are the heterogametic sex, having the equivalent of a human Y chromosome, called the W chromosome (females: ZW, males: ZZ) except ostrich and emu. Why is this property not randomly distributed over animals and birds?
Why does the domain of mammalian carnivores contain a large cluster of cats, another of dogs, a third of bears, leaving so much unoccupied morphological space between? This is not expected on the random genome origin scenario.
Why do each of the more than 1,000 species among one group of centipedes have an odd number of leg-bearing segments?

This feature of life on earth is called 'the clumpiness of morphospace': the inhomogeneous occupation of all possible forms of extant or extinct animals. This clumpiness must be explained. In the theory of evolution, the cluster of cats exists primarily as a consequence of homology and historical constraint. All cat-like animals (lion, tiger, puma, leopard) share a basic morphology because they arose from the common ancestor of all cat-like animals.
In a world of independent origin, a world without history, where all features of organisms express their initially created state, why does homology exist at all? If organisms arose independently, they would show more structural variation, and not be morphologically clustered as varied manifestations of 'archetypes' (47). Senapathy cannot use historical developmental constraints, because Independent Origin is an unhistorical or even anti-historical theory. Senapathy can not use limitations of genome production either, because if genomes are random, then any genome is possible.

21 The primordial pond is an unlimited resource

updated 15 June 2025 / 28 Mar 2026

infinite power

"I then came to realize that, given a sufficiently large pool of genetic sequences in the primordial pond, almost any gene could have occurred in it. If this had in fact been the case, then complete genomes – for unicellular and multicellular organisms alike – could have formed by the random assembly of these genes." (page 200 Chapter 5).

When providing the Primordial Pond with unlimited resources, the hard problem of the Origin of Life is 'solved' instantly. In his book Senapathy uses the word 'myriad' no less than 649 times.

"In the new theory, I will show that myriad complete genes with the right exon-intron organization could exist in the primordial pond." (page 150). (my bold)
"... that a myriad of creatures arose separately and directly from the primordial pond." (page 295). (my bold)

"In English, myriad is an adjective used to mean that a group of things has indefinitely large quantity." Here are a few quotes containing 'vastness', 'immense', 'enormous', 'abundant', 'unlimited', 'extremely':

" ... produced a primordial pond with enormously large amounts of long DNA sequences" (page 202)
"Even before DNA-coded machineries arose, immense amounts of DNA molecules could have been synthesized in the primordial ponds" (page 211).
"millions of small and large ponds must have existed on the primitive earth" (page 214).
"It is extremely important that we should not underestimate the potential of the primordial broth, which must have contained millions of highly reactive organic chemicals of different types, sizes, and structures, brewing with all kinds of molecular catalysts." (page 215).
"In the vast, total universal sequence pool, however, an extremely large number of genes and regulatory sequences would be present." (page 217)
"The size of the Universal Sequence Pool (USP) we estimated to be available in a typical primordial pond is 10³⁰ – 10³⁵ nucleotides". (page 288, the number is also on page 273).
"The gene pool was immense enough to contain all the billions of genes necessary to give rise to a multitude of creatures." (page 320)
"UGP, which, let us not forget, contained trillions of genes" (pae 346)
"It is important to remind ourselves that the primordial pond had become extremely complex by the time the first cells were constructed." (page 299).
"The vastness of the universal gene pool was such that the number of genes in it must have been several times more than that contained in all the creatures that have ever lived on earth." (page 312).
"In other words, when the number of random events are large enough, the unbelievable will certainly happen." (page 332)
"The power of the primordial pond's molecular mechanisms was such that they could produce, by the independent assembly of genomes, a wide repertoire of organisms..." (page 334).
"then an almost unlimited number of unique genes are probable in the same amount of random DNA sequence. (page 371)
"almost an unlimited supply of distinct genes for multitudes of unique biochemical functions would occur in the USP" (469).
"THE ABUNDANT OCCURRENCE OF GENES IS INEVITABLE IN THE PRIMORDIAL POND" (title chapter 7).
"it must have contained almost all the catalytic activities that we can think of."(!) (page 213) (452)
"The primordial pond could have been productive for a very long geological time" (page 345).

So, everything is possible. The resources are unlimited. If one pond is not enough, just introduce many independent pools at distinct locations and at different geological times! (491). That will do the trick. There are one or two passages where Senapathy mentions limitations, but immediately thereafter he suggests that there is no real limitation (461). Another intriguing example is that the Primordial Pond at some time will be depleted of DNA, but fortunately that occurs only after all organisms are produced, so that is not a problem at all (492). The problem with an all-powerful mechanism in a theory, is that you have in fact destroyed the explanatory power of the theory. You didn't explain anything. In fact you have introduced a miracle (431). A miracle is supernatural. Natural resources are never unlimited. Population genetic theory predicts that natural selection doesn't have unlimited power (518), (530), and even better, population genetic theory can calculate the limits of natural selection (539), (540).

In the Origin of Life research the Holy Grail is finding out how the first DNA-encoded enzymes originated. DNA requires enzymes, enzymes require DNA. It is the famous chicken-and-egg problem: What came first, DNA or protein? (246 page 360). How has Senapathy solved this vicious circle? Very simple and straightforward:

"genes for all these enzymes must have been available in the primordial pond's genetic sequences" (p.427)

Here you have the fundamental conceptual error: what's the point of having genes if they can't produce proteins? There is a crucial difference between "DNA sequences for enzymes" and the enzymes themselves, but Senapathy mindlessly uses them interchangeably. As if the presence of "DNA sequences for enzymes" automatically implies the presence of those enzymes. This is one of his biggest and fatal conceptual errors of his book. It couldn't be more devastating than this.

"it is quite possible that transcriptional activity, a far simpler function (of RNA polymerase) could have been present in the primordial protein mixture." page 216, Chapter 6.

Apart from the presence of all the building blocks for DNA, the Primordial Pond magically has enzymes such as DNA polymerase, RNA-polymerase, reverse-transcriptase. Where did they come from? According to Senapathy from "prebiotically synthesized primitive RNA polymerase" (p.216). In Senapathy's own words: "biotic pertains to biochemical processes occurring in living cells after living cells were first formed in the primordial pond. Thus, prebiotic means the chemical syntheses that took place before any living cell was formed." (p.594). So, prebiotic means spontaneous synthesis from building blocks without the help of DNA, without mRNA, without tRNA, without ribosomes, without splicing machinery, without a nucleus, without a cell, without ATP. So, a living cell isn't necesssary for all biochemical reactions? The Primordial Pond has unlimited powers.

The primordial pond is a free lunch

There is only one primordial pond (see: flap text and here) (462). The primordial pond is the birthplace of all species. It must have been a very busy place with millions of 'species'. Water is the natural home of fishes. Predatory fishes (for example tuna) prey on other fishes. As soon as a predatory fish originates in the primordial pond, it starts eating. It swallows everything it can get. Therefore, predatory fish easily cause the extinction of every 'species' it can swallow, because the method of 'independent birth' does not produce species but single unique individuals. Thus before a single unique individual can multiply and become a population, it has been swallowed by a predatory fish. Likewise, plankton feeders will exterminate all plankton. The primordial pond is a free lunch until the predators die of starvation when all prey has been eaten. Likewise, pathogenic bacteria, viruses (213) and fungi (286) responsible for infectious diseases, will kill their hosts. (fungi are eukaryotes, bacteria and viruses are not eukaryotes and are supposed not to originate in the primordial pond). That is why Darwin, Oparin and Haldane already argued that life could have emerged only on a sterile, lifeless planet (50).

On a molecular level, chemical inhibitors will prevent DNA synthesis, replication, transcription and translation or any other enzymatic reaction. For example, the peptide alpha-Amanitin is an inhibitor of RNA polymerase II. That is one of the reasons why the Origin of Life is a hard problem.

22 Incompatible requirements for a primordial pond

The primordial pond is the 'birthplace' of all species. But organisms have incompatible environmental demands: some require oxygen, others require anoxic (anaerobic) conditions. Some bacteria require hydrogen sulfide as a source of energy. Some require high temperatures (thermophilic), others require low or very cold temperatures. Some are acid-loving, others are acido-phobe. Some plants and animals require salt water (sea), freshwater flora and fauna requires freshwater (lakes, rivers). (The most famous halophilic algae, Dunaliella salina, survives up to 23% salt and the extremely halophilic archaeon Haloquadratum walsbyi). Reproduction of fishes in the tropics (no seasons) is during the whole year, while in temperate regions they reproduce at the end of spring. All those conditions cannot be combined in one and the same primordial pond.
Furthermore, how could the primordial pond produce anything else but organisms that can live in water and extract oxygen from the water? (fishes, mollusks) ignoring for the moment creatures that live in the dark deep sea below 200 meters. A pond is deadly for air-breathing animals except those that live near the surface (whales, crocodiles, otters, etc) or live only partly in water (sea lions, seals, etc).

Furthermore, if born, why and how do organisms migrate from the primordial pond to all those different locations? Some organisms only survive at deep sea, hydrothermal vent communities are found at depths ranging from 1,500 to 3,200 m. Giant tube worms (chemoautotrophs), in the absence of sunlight, subsist on hydrogen sulfide found in the warm waters surrounding vent communities. There is evidence for living prokaryotic cells in 1626 meters below the sea floor sediments that are 111 My old and at 60° to 100°C (79). The bacterium D. audaxviator lives at 2.8 kilometer depth in a South African gold mine and is lacking a complete system for oxygen resistance, suggesting the long-term isolation from O₂. That means it is damaged by oxygen (102).
Emperor penguins live in probably the most extreme conditions endured by any warm-blooded animal on earth. They even breed in the depths of the Antarctic winter at temperatures of -30°C (-22°F). They have so many cold adaptations that in warmer weather, overheating can be a problem. So, if the primary pond is located in moderate or tropical regions they will die from overheating.

Eggs incompatible with water

In the primordial pond organisms develop from 'egg cells'. Please, have a look at the cover illustration again: not accidentally, there is no bird or mammal creeping out of the primordial pond. Bird eggs do not survive in water (apart from the requirement of incubation). Saltwater crocodiles, marine iguanas and sea turtles, although marine, lay eggs on land. Please have a look at the cover illustration: a turtle is coming out of the primordial pond! However, all turtles lay eggs on land. Also, reptiles cannot successfully lay eggs under water because gas exchange across the eggshell is much slower in water than in air.

On the other extreme are bar-headed geese (Anser indicus)- the world's highest-altitude migrants - fly from their winter feeding grounds in the lowlands of India, sometimes even directly above Mount Everest (29,000 feet or 8,800 meters), on their way to their nesting grounds on the Tibetan plateau (only a third of the oxygen available at sea level). They have a special type of hemoglobin that absorbs oxygen very quickly when the birds are at high altitudes; as a result, they can extract more oxygen from each breath of rarefied air than other birds can. The most plausible explanation for this migratory behaviour is the geological history of the region (80). How do these geese, if born in the primordial pond anywhere in the world, -ignoring all other problems-, know to fly to the Tibetan plateau?

If one wishes to escape from incompatible requirements, why not propose thousands or millions of primordial ponds in stead off one? It could make independent origin much and much more easier! For example, it would make it easier to explain the unusual breathing system in some dinosaurs, a group called Saurischian dinosaurs who lived at a time when the oxygen level at the surface of the earth was only 10 percent (103). Alternatively, why not propose a primordial pond that lasts millions of years? (see: §18 The role of time: the chronological order of life). The secret could be that Senapathy needs "a common pool of genes in the same primordial pond"! (see: §13 Common descent versus independent origin).

How big is the primordial pond?
Would a 90-tonne blue whale (Balaenoptera musculus) fit in the primordial pond? or only a juvenile? Would there be enough food? Just asking...

What about trees?
Do trees originate in the primordial pond? Water is no problem for aquatic plants (marine algae, water lily), but what about landplants?

23 The final refutation of independent origin

Her response was: "Do you really think that an insect or a rat simply came about as it is?"I simply answered "Yes, I do!" (1)

Conserved chromosome segments between human and mouse are the final refutation of independent origin. If all genomes arose independently from the primordial pond and if the distribution of genes over chromosomes were random, then genes of related species should not have the same linear order on their chromosomes. However, if a great number of genes appear in the same order in different species, this cannot be explained by pure chance. This is exactly what has been found when geneticists recently compared the genome of mouse and man (8,9,10). A segment of roughly 90,5 million bases on human chromosome 4 is similar to mouse chromosome 5. (11). Almost all human genes on chromosome 17 are found on mouse chromosome 11 (12) and human chromosome 20 appears to be entirely orthologous to the bottom half of mouse chromosome 2, apparently in a single segment (13). That means that thousands of genes are in the same order in mouse and man. A few genes might be expected to be in the same order by pure chance, but not thousands. This can only be explained by common descent of mouse and man. If all species were independently born, then the probability of finding similarities in a human-mouse comparison should equal the probability of finding it in, say, a human-turtle, a human-fish or a human-mushroom comparison. Of course, Senapathy could not have known all these facts in 1994, but conserved chromosome segments are now the most impressive refutation of independent origin. This evidence alone is sufficient to refute independent origin. No theory of independent origin can survive this evidence. The above argument is only about genomics. Anatomy also has a story to tell: Neil Shubin (2008) Your Inner Fish: A Journey into the 3.5-Billion-Year History of the Human Body.
Today, I would point out that the most crucial and unequivocal fact against Senapathy's theory of independent origin is the fact that DNA cannot spontaneously originate from building blocks, or the universality of the genetic code, or the fact that DNA needs proteins to be transcribed or translated.

The formal refutation of independent origin was the publication 'A formal test of the theory of universal common ancestry' by evolutionary biologist Douglas L. Theobald (130).

For my final refutation see: Wikipedia.

24 The origin of life

new:
3 Sep 13

Spontaneous generation

Senapathy's theory is a modern version of the theory of spontaneous generation. The Greek philosopher Aristotle believed in spontaneous generation of life. As late as the seventeenth century philosophers believed that mice, frogs, and eels could emerge from garbage, mud, and river water. According to Alec Panchen "Lamarck rejected common descent. Lamarck's theory was of continuous events of spontaneous generation with descent from generated organisms of innumerable parallel evolutionary lines. I know of no 20th-century evolutionist who accepts this view." (69). Indeed, Senapathy is no evolutionist. Louis Pasteur gave the final deathblow to Spontaneous Generation of bacteria. A very useful history of Spontaneous Generation is given by Iris Fry in The emergence of life on Earth (chapters 2,3,4).

Senapathy:

"Such primitive proteins in the primordial ponds might have catalyzed a great number of reactions, albeit with poor efficiency. (...) This strongly supports the conclusion that the primordial pond must have had a great deal of catalytic activities."
"There can be no doubt that proteinoid-like catalytic activities could aid in the process of building long DNA molecules in the primordial pond.". (page 209 chapter 6).

The words 'primitive' and 'poor efficiency' suggest an evolutionary process from primitive to complex. He also uses 'primitive genetic machineries', 'primitive ribosomes', 'primitive RNA polymerase'. But senapathy rejects evolution! (424). Above that it contradicts 'Random Perfection'.
Furthermore, we learn from this quote that he starts with an idea, then quickly turns it into an evidence based hypothesis, and immediately after that transforms it in to a well confirmed theory which cannot be doubted. Senapathy derives his evidence and optimism from Cyril Ponnamperuma (502), but Senapathy goes way beyond the conclusions of Prof. Ponnamperuma.

He knows:

"Therefore, the fact that long oligonucleotides have not been so far demonstrated in chemical evolution experiments does not mean that they are not possible." (page 215)

This is fatal for any theory based on the spontaneous synthesis of (random) DNA with millions and billions of nucleotides. But Senapathy is not discouraged.

In modern science the Origin Of Life field has grown into a separate research field with strong connections to astrobiology, organic chemistry and geochemistry. According to the most recent textbook of Evolution (89) there are seven critical steps in the origin of life:

the generation of simple organic molecules from inorganic molecules
chemical "evolution" to produce more complex organic molecules and primitive metabolic networks
the origin of self-replication and the creation of "genotypes"
compartmentalization and the creation of cells
the linking of genotype and phenotype
the origin of the genetic code
the takeover of early replication systems by one involving DNA

Please note that DNA is the final step! All Origin of life scenarios do without DNA. Senapathy starts with DNA and ignores or denies the necessity of all previous steps! But even starting with DNA there are 3 separate problems: 1) origin of the double helix (4 bases internal, phosphate-sugar backbone external), 2) origin of the sequence, 3) the origin of the genetic code (translation problem, mRNA, tRNA, ribosome). Senapathy wants to solve these 3 problems at the same time. No organic chemist (263) or evolutionary biologist (477) believes that is possible that DNA bases or sugars (ribose) spontaneously form and assemble into DNA in a prebiotic world . But Senapathy proposes a theory of the origin of life! He thinks all (eukaryotic) genes are assembled from scratch and so nothing is inherited from a previous phase (RNA-world). So, he also ignores or rejects the RNA-world hypothesis:

Genes–First versus Metabolism–First
Another way to look at the origin of life is two competing models: "genes first" ("replication first") versus "metabolism first". In the "genes first" model replicating DNA or RNA sequences arise first, in the "metabolism first" inorganic catalyzers convert simple and abundant inorganic compounds, such as carbon dioxide, into more complex organic molecules. Obviously, Senapathy represents the "genes first", or more precise: "DNA first" model.

The RNA World
The discovery that RNA molecules can act as catalysts provides a possible solution to a long-standing 'chicken and egg' dilemma:

DNA encodes the genetic information of proteins
DNA replication and transcription requires proteins
proteins cannot self-replicate (except prions)
proteins cannot encode the information in DNA (Cricks 'Central Dogma')

In other words: the interdependent world of nucleic acids and proteins which forms the basis of all modern life (345). If RNA can serve both as:

a repository of information (in its sequence of nucleotides)
a catalyst

then the dilemma is solved. This provides the basis for the hypothesis that life began as RNA – the so-called RNA World (183): an RNA-based genetic and catalytic system. The unexpected observation that deoxyribonucleotides are synthesized from ribonucleotides in cellular pathways supports the notion that DNA arose later in cellular evolution than RNA (254). Further evidence: Michael Yarus points out that the evidence for the RNA-world is actually scattered throughout modern-day biochemistry and cell biology ('the ancestor within': mRNA, tRNA, rRNA, microRNA, ). Senapathy completely overlooked the DNA-protein dilemma, and why the RNA-world is the solution to that dilemma. And there must have been a Pre-RNA-world, proto-RNA-world (346). Consequently, his theory about random DNA is pointless. Intriguingly, in Figure 11.1. (see above § 18) at the basis is a pointer to 'Start of chemical evolution'! What would that mean?

Multiple origins of life hypothesis
Raup and Valentine (140) propose multiple origins of life: "The probability of survival of life is low unless there are multiple origins, and given survival of life and given as many as 10 independent origins of life, the odds are that all but one would have gone extinct, yielding the monophyletic biota we have now." This mainstream hypothesis does not contradict common descent of all life. It only proposes that there must have been many origins, but they did not survive except one which formed the universal tree of life. Senapathy proposes independent origin of all species.

See also: §25: What is life? in which I argue that one must first know what life is, before one can start to explain its origin.
See also website: Exploring Life's Origins. (animations!)

25 Origin of species

2 Jul 2025

Only recently I realized that I did not have a separate paragraph in this review about the origin of species. The subject deserves its own paragraph! Senapathy wrote:

"Among the most significant of our observations is the fact that while there are many similar species that are essentially the variants of a single organism, there are numerous organisms that are unique and distinct." (page 2).

"each independently-assembled genome is a distinct entity giving rise to a creature that is also distinct and unrelated to others." (page 455).

First, the concepts 'organism', 'creature', 'variant' are used here in a confusing and unscientific way. In biology every individual belongs to a species. There are no individuals that don't belong to a species. In general, a species is a group of individuals that can interbreed and are reproductively isolated from other such groups. The problem with the theory of Independent Origin is the fact that the Primordial Pond only produces genetically unique individuals. Those individuals are by definition genetically and reproductively isolated from every other individual in the Primordial Pond. So, they can't reproduce. When they die, they leave no descendants.
In other words: a species cannot be established from one unique individual because those individuals are unique and genetically incompatible. How could it, if genomes are produced by random assemblage of random genes? Senapathy has no choice. The Primordial Pond does not and can not produce a species. To interbreed successfully, the genomes of individuals must be genetically compatible and that means they must be similar to a high degree and in a specific way. Their chromosomes must match. In practice, this means that their combined genomes must be able to produce a viable embryo. In a Primordial Pond (as the author suggests) only genetically unique individuals are produced by random processes and they cannot interbreed because their genomes do not match. That's a dead end. The majority of species have sexual reproduction. A further restriction is that most sexually reproducing species have an even chromosome copy number. So, if females and males were produced at all in the Primordial Pond, they would not belong to the same species. Remarkably, there are no species in the Primordial pond. Remarkably, his theory does not explain The Oigin of Species. He doesn't even try to do so given the title of his book (429):

"Independent Birth of Organisms. A New Theory That Distinct Organisms Arose Independently From The Primordial Pond...".

An 'organism' is any living thing that functions as an individual. Again, his theory is about the origin of individuals. Nothing meaningful is added with 'distinct individuals'. Individuals are always distinct, otherwise they would not be individuals. An individual is one that exists as a distinct entity. Any theory of Independent Origin/Creation has the same problem.

26 What is life?

L I F E

1. Chemical motor
system

2. Chemical boundary system

3. Chemical
information system

Senapathy claims to explain the origin of life. But, what is life? If one has a wrong idea of what life is, then the theory to explain 'life' is useless. So, what is life? According to chemical engineer Tibor Ganti (43) life consists of 3 subsystems (see figure):

a chemical motor (metabolism) that supplies energy to synthesize compounds necessary for the other 2 subsystems and is stable
a membrane which keeps the other 2 subsystems together, protects against dilution and is itself stable
an information-carrying subsystem (for example DNA) which enables reproduction of the 3 subsystems

Together these 3 subsystems are a living system. Senapathy's theory is concerned with a subsystem of a subsystem: the information-carrying subsystem (DNA) only. So he has a mistaken view of what life is. Therefore, his theory is useless. Furthermore, he got the order of origin of the 3 subsystems wrong. Several scientists believe that metabolism originated first, and that the information carrying subsystem arose later (as a by-product). The reason is that whereas the abiotic synthesis of amino acids is easy, the abiotic synthesis of nucleotides is difficult (44). Whatever the order of appearance, the point is that according to Gánti the genetic code and the reading machine can only function together, they originate and function together. If there is an order, it is the machine first, because a machine can exist without program control, but the program cannot exist without machine (43, p. 16).

According to John Maynard Smith (92) "entities are alive if they have the properties of multiplication, variation, and heredity". Senapathy's theory does not say a word why independently born organisms should have the property of multiplication (reproduction). Since organisms could be produced by the primordial pond indefinitely, why did the primordial pond not produce organisms lacking the power to reproduce themselves? Why such an improbably complex feature as reproduction? DNA is necessary for building a body and keeping that body alive, reproduction is an extra addition. Why does the primordial pond stop producing organisms anyway?

Energy. Without energy no life. Energy is the chemical motor and is the first subsystem of life (Ganti). We consume carbohydrates and fats, combining them with oxygen that we inhale, to keep ourselves alive. Microorganisms are more versatile and can use minerals in place of the food or the oxygen. In either case, the transformations that are involved are called redox reactions. They entail the transfer of electrons from an electron-rich (or reduced) substance to an electron-poor (or oxidized) one (68). By defining the origin and the evolution of life as the same problem, Senapathy has to show that the origin of organisms that consume carbohydrates and fats (which are of biological origin) is as plausible as the origin of organisms requiring only minerals.
Furthermore, multicellular animals need an order of magnitude more energy and so must use aerobic respiration (76). But Oxygen levels on Earth rose gradually to the current level. How do primary ponds know when to produce small mono-cellular or large multi-cellular animals?

27 PLOS ONE article

new:
3 Nov 08

update:
5 Nov 08

On 21 October 2008 I received a very kind email from Senapathy to notify me that he published an article in PLOS ONE (105). It is a huge article with many data (graphs, tables) to support his theory of independent origin (now called ROSG model). It is best viewed in pdf version. "This project is purely an academic project, fulfilling the academic interest of the corresponding author". Indeed, one could justly say that it is a lifelong interest.

I tried to decipher the logic of his argument in the PLOS article. Despite I studied his argument for years, it is not easy. I guess, he does not want to defend the idea that the present-day human genome is random with respect to the frequency of stop codons. No scientist would want to do that. An arbitrary random genome could not produce a human being. On the other hand, nobody can argue against the claim that "The presence of three stop codons for every 64 codons limits the average ORF [Open Reading Frame] length to about 60 bases in random DNA" ('random DNA' is a computer generated random string of four different symbols). What he seems to argue is that, despite the predominant non-random nature of present-day genomes, exon length still has a random signature.

ROSG model A
Figure 7A. The ROSG model. Origin of an eukaryotic gene from primordial random DNA.
ORF = Open Reading Frame, is DNA sequence between two stop codons.
Red: coding sequence is an exon. stop codons occurred too
frequently to allow functional proteins to be encoded in random DNA.
That's why processing is necessary. Please note there are no startcodons.

The best evidence of what he is really after is:

"It is remarkable that all the characteristics of random DNA are still essentially present in the split genes of present day intron-dense large genomes such as those in the human."

This is his goal and conclusion. Please note: 'essentially' and 'present day'. I sifted through the article several times to find the most clear example of the logic of his reasoning. This is the most succinct example:

"The average exon length from the intron-rich genomes is about 170 bases whereas that expected from random ORF lengths is 60 bases. This may indicate that there has been a selection for longer exons within the allowed maximum ORF length of 600 bases for optimizing the frequency of suitable exon lengths."

Obviously, when your model predicts 60 bases and you find 170 bases (211), your model is wrong. If that is not enough, his statement "small minority (~2%) of exons were >750 bases" should refute his model. He sees the contradiction, because he suggests "there has been a selection for longer exons". Selection! If there has been natural selection, then the original signal is destroyed. The difficulty is, that Senapathy has no independent evidence of the first random DNA sequences and independent evidence of subsequent processing. If you allow for any amount of processing and selection, then any exon length can be 'explained' simply because his model does not specify restrictions on the amount of processing. (However, there is one escape: long noncoding RNA (ncRNA) could be closer to randomness because they are not translated into proteins and therefore stopcodon statistics, indeed any codon could by random. Even the necessity of triplets is absent).

When?
Senapathy does not distinguish clearly between the timing of the different events: 1) the origin of the very first DNA sequences, 2) the processing of those sequences, 3) subsequent genome evolution during 2 billion years. But prebiotic environments are completely different from those of a living organism. Irrespective of when (106) this processing occurred, after 'splicing together short coding pieces', exons lengths, gene lengths and genomes are not random anymore. Any selective processing of random DNA makes it non-random. Any non-random removal of stop codons change the statistical properties of the sequence. If present-day exons are a combination of shorter pieces joined together, than by definition current exon lengths are not random.

The mechanism
His figure shows the combination of 4 small exons into one large exon, but any exon length can be explained in this way. An arbitrary number of exons with arbitrary lengths can be joined to an arbitrary number of new exons with arbitrary lengths. Also, an arbitrary number of exons may stay unmodified. Exon length distribution is the main thing Senapathy wants to explain and he introduces a mechanism that changes them in an unspecified way. Why do introns still exist? Why are there still so many introns in our genes? Why did his mechanism not eliminate all introns? Senapathy observes present-day exon lengths and postulates a hypothetical mechanism that produces exactly the exon sizes of today. That does not add anything to our knowledge.
My own suggestion would be: it is true that stop codons would limit gene length, but a far more simpler solution would be elimination of stop codons by a one-base mutation of the stopcodon in the context of living organisms. That is a far more simple because it does not require complicated splicing machinery. Certainly under prebiotic conditions where no functional enzymes are present. He did not give evidence that this complicated processing is possible in prebiotic chemistry. He stated that functional proteins cannot be encoded in genes with average ORF length of about 60 bases.

Facts
In addition to the contradiction of his own model with his own data (called 'non-confirming' data by Senapathy), two types of evidence are contradicting his hypothesis: too many short exons and too many long exons. Humans have 170 exons of length up to 25 bp (107) which is significantly lower than the expected size of 60 and exon sizes up to 2087 bp exist (108) which is much outside the predicted maximum. The length of an average human exon is 126 bp which is more than twice the expected 60 bp.
Possibly, his ideas and data about the origin of splice signals (Figure 7B, not shown) are interesting. I would suggest submitting it to scientific journals such as Genomics.
Finally, the idea that the very first DNA sequence must have been random, is plausible only if that idea is part of a plausible 'DNA-first' theory of the origin of life.

28 The Nature Precedings articles

5 jan 11

23 Jan 11

On 13 December 2010 Senapathy posted 3 articles on the Nature Precedings website (141). This is a permanent, citable archive for non-peer-reviewed pre-publication research and preliminary findings and is run by the publishers of the famous Nature journal. The article 'Origin of biological information' appears to contain more modest claims than his previous writings. For example, this article does not contain the words 'Darwin' or 'Darwinism', which means that Senapathy does not openly attack Darwinism anymore (subtitle of his 1994 book!). Some of my criticisms are addressed (so it seems), but errors I pointed out are repeated. New is the calculation that genes + regulatory + splicing sequences "can occur within one milligram (~10¹⁹ bases) or so of pre-biotic random DNA"! (p. 8). For comparison: it takes the largest Gene Synthesis Supplier in the USA a year to synthesize 54 million (=10⁶) base pairs and it will cost you $18 million (156).

History
The Conclusion of the article starts with: "This work does not claim to provide historical details of early evolution." (p. 9). You can focus your Origin of Life theory on any aspect you are interested in. Maybe you are not interested in the history of life. That's OK. However, the moment you want to test your theory, you cannot ignore history. Simply because your theory could conflict with the fossil record. And it does. Eukaryotes include vertebrates like birds, whales, elephants, horses and humans. They easily fossilize. If they are produced in the primordial pond, why are those fossils not found during the 'Cambrian explosion of multicellular organisms'? Where are the fossils? See: §18: the chronological order of life.

Self-assembly?
Some of his claims seem to be more modest than earlier claims: "Our findings demonstrate that complete split genes encoding complex proteins could have arisen within a minute amount of pre-biotic random DNA, explaining the origin of biological information and serving as the basis for the evolution of the very first genome." (p. 2. my emphasis). And in the conclusion he writes: "that these genes could have been used in the self-assembly process to create countless eukaryotic genomes" (p.9 my emphasis). So, Senapathy does not explain genomes (strong claim), but genes (weak claim). The word 'evolution' in a non-evolutionary theory? Self-assembly of genomes? This is pure magic! He provides no details, no mechanism, no probabilities, no evidence for self-assembly of genomes. That means that the most important part of his theory is left unspecified. Again: explaining genes does not explain genomes. Genomes are not random collections of arbitrary genes (See: § 6). What determines whether a specific collection of genes is a genome? Sooner or later one must invoke selection of viable collection of genes, and death of inviable collections of genes. So, the crucial step to genome formation in the independent origin scenario would invoke the Darwinian principle of natural selection! Question: could it be that if genome formation depends on self-assembly of isolated genes, that the larger the genome, the lower the probability of successful self-assembly of that genome? It must be.

Genetic code
The stopcodon statistics story, which was an important part of the PLOS article, is not defended explicitly in this article, although he refers to it. The stopcodon issue is part of the genetic code problem (see 'The elephant in the room'). The genetic code problem, the main ingredient of the origin of life problem, is stated and dismissed in a shallow way in a two-sentence paragraph. This is very disappointing from a scientific point of view. (See: § 6)

Figure 1. The common ancestor of eukaryotes, bacteria, and archaea may
have been a community of organisms.
+M = endosymbiosis of mitochondrial ancestor.
Kurland et al, ©Science

Endosymbiosis
Senapathy's genome-centered view of life explains life by reducing life to genomes, and reducing genomes to the statistics of 4 symbols (1/4 x 1/4 x 1/4 etc). The existence of organelles in the eukaryotic cell (see: § 7) is troublesome and annoying for the genome-centered view of life, because organelles (such as mitochondria) cannot be explained by calculating probabilities in the same way (157). It is disappointing that Senapathy tries to dismiss the endosymbiosis theory in a single paragraph (p. 8): "Even after decades of research, no consensus framework for the evolution of a eukaryotic cell from bacterium-like cells has emerged" (p. 8). He refers among others to Kurland et al (147). Although Kurland et al suggest that "eukaryotes are a unique primordial lineage", they certainly do not claim that prokaryotes descended from eukaryotes, nor that eukaryotes were the first forms of life. Furthermore, in Fig. 1 Kurland et al clearly show endosymbiosis of a mitochondrial ancestor (+M) from Bacteria to Eukarya. Whatever the origin of the mitochondrion, it is by definition present in eukaryotes! That is a fact of life which Senapathy ignores. Controversy about the evolution of the eukaryotic cell does not deny that all eukaryotic cells harbor mitochondria. If eukaryotes arose directly from random primordial DNA, then Senapathy has to explain where mitochondria came from; why mitochondria have a circular chromosome with 37 genes; and why mitochondrial DNA lacks introns, as is the case in the human mitochondrial genome (wikipedia). Remember, according to his own theory, the absence of introns makes independent origin impossible because intronless genes cannot be found in reasonable amounts of random DNA. That's why Senapathy is forced to conclude that Prokaryotes descended from Eukaryotes (172). Furthermore, figure 1 of Kurland et al shows that cellular life predated the common ancestor of Bacteria, Eukarya and Archaea, contradicting again the idea that Eukarya were the first forms of life. An overview of possible relationships:

1	— P —> E	Eukaryotes descended from Prokaryotes	mainstream
2	— E —> P	Prokaryotes descended from Eukaryotes	148
3	E <— —> P	Eukaryotes and Prokaryotes had a common ancestor	Kurland, Darnell
4	E —> P	Eukaryotes = first life, Prokaryotes from Eukaryotes	Senapathy

Senapathy also refers to a mainstream publication (148) proposing the hypothesis that 'prokaryotes might be derived from eukaryotes'. This is a remarkable proposal, and seems to support his own theory, but it has not been established, is proposed for quite different reasons, and won't explain the origin of eukaryotes themselves.

There is an even more remarkable mainstream publication of J.E. Darnell (1978), not cited by Senapathy as far as I know, which claims "that eukaryotes evolved independently of prokaryotes" for exactly the same reason as Senapathy: "noncontiguous sequences in eukaryotic DNA" and that eukaryotic DNA "may reflect an ancient, rather than a new, distribution of information in DNA" (169). Martin and Koonin (170) say about this: "James Darnell submitted similar ideas at a time when the issue in early evolution was how to generate long coding sequences from scratch" (my emphasis). From scratch? (This is exactly Senapathy!). At a time? Was it a mainstream issue at the time? But has become irrelevant? Because of what? Mysterious! Whatever, Darnell did not claim that complex multicellular eukaryotes could have originated from prebiotic random DNA.

Anyway, it is a logical mistake to think that gaps in knowledge of eukaryotic origins is evidence for any alternative theory. Senapathy needs positive evidence for his theory. (See also: § 11).
Senapathy uses references to the literature to support his views, which do not support his views at all. For example: "Even after the knowledge that eukaryotic split genes may have been the very first genes became widespread, ..." (p.7, my emphasis). This is a serious misrepresentation of the literature. What the publication (153) says: "the proposal that introns arose before the origin of genetically encoded proteins and DNA," (my emphasis), which refers to a RNA world (self-splicing RNA's). Nobody in the literature claims that complex eukaryotic genomes (like the human genome!) were produced at the time of the origin of life. There is simply no evidence in the fossil record of vertebrates in the Cambrian fossil record. Nobody claims that the "last eukaryotic common ancestor had an intron-dense genome" means that these were the first forms of life.
Senapathy's interpretation of the relevant scientific literature appears to be idiosyncratic, unorthodox, and often erroneous. His references are so numerous, that it is a huge undertaking to check all his references.

Introns-Early — Introns-Late
Senapathy's theory is beyond the Introns-Early (IE) versus Introns-Late (IL) controversy. It would not be correct to label his theory as 'Intron-Early'. If anything, it could be called 'Intron-First' (IF), (but the context of IF is the RNA world). Introns-Late is certainly incompatible with his theory. In Senapathy's theory introns and exons simply do occur in random DNA. It is a static phenomenon. Introns do not have biological causes. Introns did not invade genomes. The mainstream view, whether Introns-Early or Introns-Late, is that introns are a dynamic phenomenon: they invade existing genomes like parasites. In Senapathy's theory it makes no sense to talk about 'mechanisms of intron loss and gain'. They were just there from the beginning.
Mainstream science agrees with Senapathy in one point and that is that introns have evolved extremely early: "introns that currently reside in eukaryotic genes, after all, do derive, through an uninterrupted lineage of selfish elements, from primordial genetic elements." and "introns have evolved extremely early, very likely, earlier than cells themselves." (Koonin 165). Although Koonin uses the suggestive phrase "primordial pool of genetic elements" (233), big differences are that "the primordial genetic pool is believed to have evolved from a pure RNA world to a RNA-protein system to the modern world of the Central Dogma (DNA-RNA-protein)" (165), and that the method was descent with variation (tree of life). Ford Doolittle: "Really we know nothing about how genes arose, and to suppose that they sprang full blown and full length from noncoding polynucleotides seems to me more of a stretch than to imagine that they were cobbled together from smaller oligopeptide-encoding modules" (165).

Intron — genetic code
A fundamental problem with Senapathy's view of introns is that the very word 'intron' defined as "intervening random sequences" (p. 10) has no meaning without 'exon', defined as "split coding sequences corresponding to the split protein sequences" (p. 10). But 'coding for proteins' implies the genetic code. Senapathy is confronted with the genetic code problem again. The issue is not statistics. From the point of view of coding capacity there is no fundamental difference between introns and exons, because intron splicing sites can be deleted or created de novo quite easily by mutation (179). (see also: § 3, 4).

29 Conclusion

The elephant in the room

20 Apr 2026 small improvement of text

For a long time I did not see clearly that even the statistics of stop codons are not pure statistics, but tacitly assume something very important. I got distracted by the details of exons and introns and the subtle way in which Senapathy –unintentionally– confuses the reader and himself. He puts the reader and himself on a wrong track.

What I did not see clearly is that even these 'pure' statistical predictions about the frequency of stop codons and ORF lengths simply assume the presence of a full-blown canonical genetic code with all the necessary biochemical and cellular machinery. In other words: a living cell. The conceptual error here is assuming that a DNA sequence has an intrinsic, absolute, universal, timeless meaning. However, a DNA sequence is just a meaningless string of bases. It is not even a 'code'. DNA is not a code before there is somebody/something to read it. Who or what is going to read the code at the origin of life? DNA only gains meaning in its context: a specific Genetic Code (table). The meaning of a DNA sequence is relative to a specific Genetic Code. Change the Genetic Code, and the meaning of the sequence changes (541). Without a specific Genetic Code, a DNA sequence is just a meaningless polymer. At the time of the origin of life, there is no difference between the sequences of 'exons' and 'introns'. 'Exons' and 'introns' must be recognized by somebody/something. In fact, 'introns' and 'exons' did not exist at the origin of life. Both are just random DNA sequences.
For example, the concept 'stop codon' is meaningless without the full-blown canonical genetic code (314). The words 'DNA', 'amino acid', 'protein', 'lipid' denote molecules (no problem with that), but a 'stop codon' only means something if a DNA sequence is 'translated' into a protein. The word 'codon' itself implies that something is somehow encoded. So, the concepts 'genome', 'genetic code', 'stop codon', 'start codon', 'exon', 'intron', 'triplet', 'codon degeneracy', 'messenger-RNA', 'translation', 'transfer-RNA', 'Reading Frame', 'Open Reading Frame' make only sense in the context of a living cell. Before the origin of the first cell, only chemistry existed. At prebiotic times no 'genomes' existed. So, all these words are forbidden in prebiotic chemistry.
The concept 'Reading Frame' implies that DNA is organized in triplet codons. But triplets are biochemically invisible. A triplet is an arbitrary grouping of units of a continuous DNA molecule. You can't see where a triplet starts just by looking at DNA. In the prebiotic world DNA is a meaningless macromolecule just like proteins are. Proteins are not organized in triplets either and don't encode something. These concepts are not justified in the context of the origin of life. 'Reading Frame' depends critically on the presence of the complete transcription, splicing and translation machinery which cannot simply be assumed at the origin of life. It is a conceptual error. It is the difference between life and non-life. That's the elephant in the room!
Viewed from the point of information theory: there is no difference between 'message' and 'noise' without an 'interpreter' and a specific language (327). So, even the concept 'noise' is inapplicable to a random DNA sequence (326).

See the following paragraphs for an explanation:

He knows it!
DNA is nothing without proteins
the chicken and egg problem
§6 A DNA sequence is not a genome
§7 The genome-centered and information-centered approach

Summary

updated 21 Aug 2025

    In 1994 Senapathy wrote an ambitious 600-page book "Independent Birth of Organisms. A New Theory That Distinct Organisms Arose Independently From The Primordial Pond Showing That Evolutionary Theories Are Fundamentally Incorrect" to refute evolution and to argue that all life forms except bacteria and viruses, but including humans (chapters 5–11) (366), arose independently from random DNA in a single primordial pond. He put a lot of energy into the project.
    Senapathy started with the correct observation that –unlike prokaryotes– most of the DNA in an eukaryotic genome is junk; and that genes exist only 'as small islands in large oceans of meaningless DNA' (471). He claimed that split genes –genes with introns and exons– originated from random DNA. Split genes only occur in eukaryotes. All eukaryotes have split genes. Consequently, he is forced to explain the origin of all eukaryotes. But this idea cannot be applied to the origin of prokaryotes, because prokaryotes don't have split genes. So, he is forced to find another explanation for the origin of prokaryotes. His solution: prokaryotes originated (evolved?) from eukaryotes by losing introns. His theory does not allow for the origin of eukaryotes from prokaryotes, because his theory already explained the origin of eukaryotes from random DNA. Since eukaryotes arose from random DNA, they have no ancestors. As a consequence random DNA must have originated from chemical building blocks. But that is nothing less than the orgin of life! So, he ended up –whether he wanted to or not– explaining the origin of life, eukaryotes and prokaryotes. That is no small matter. That is nothing less than explaining the origin of all living creatures.

    Initially I was excited and curious because his theory is a non-religious naturalistic alternative for evolution (112). A naturalistic alternative for evolution is rare. The only other scientist I know who defends naturalistic independent origin is biochemist Christian Schwabe (16). I am unaware of any cooperation between the two. An important difference is that Schwabe is a biochemist and that he approached the origin of life as a biochemical problem. The difference with creationists is that Senapathy almost certainly has no religious motives. Nonetheless there is strong similarity with Christian authors such as Walter Remine (545).
In science, as in life, it is difficult to have a truly original idea (509). In essence his theory states: "complexity first, simplicity later" (475), which is precisely the opposite of evolution (476). This idea is so alien, so counter-intuitive, so different from all existing theories, that it takes time to understand it. Moreover, there are several contradictory claims in his book which makes it even more difficult to understand. He summarises his theory in 13 statements (pages 202–204).
   In my opinion Neo-Darwinism and the mainstream theories of the origin of life are not necessarily true (88). Independent origin is not false simply because 'everybody knows that evolution is true'. Unsolved problems still exist in evolutionary theory (479). Unfortunately, Senapathy is unaware of and also disregards crucial biological facts, despite the fact that he had access to the mainstream literature of that time (mainly genetics and genomics). It is a very naive view of the biological world, a layman's view of the world, especially of the origin of life. That explains why he is so confident (508). He does not know what he does not know. Despite of this, or maybe because of this, he wants to create 'a theory of everything' in biology: the origin of life, and the origin of DNA and genomes, and the origin of the genetic code, and the origin of species, and the origin of eukaryotes, and the origin of prokaryotes and the origin of development of multicellular organisms. No knowledgeable biologist ever dared to do that. Darwin refrained from explaining the origin of life. Senapathy wants to explain too much (510). Real science has not solved and cannot solve everything. Real science knows its limitations (457). Moreover, he lacks empirical evidence to support his theory (470), (471).
    Remember that Senapathy is trying to solve the Origin Of Life. In other words: Prebiotic chemistry (or Abiogenesis). Prebiotic chemistry studies the natural origin of the building blocks of life (503). But Senapathy started with DNA. So, he assumed all the building blocks are present in the Primordial Pond and that DNA is prebiotically synthesized. But DNA is both chemically and biochemically very difficult to make (480). So, he skipped the hardest problems. He should have discussed DNA synthesis before discussing STOP codon statistics. The first chapter of the book should be Prebiotic Chemistry. Without a solid basis in prebiotic chemistry his theory is a non-starter.

Senapathy's theory of the origin of life suffers from Reductionism at 10 levels (reduction means simplifying):

Reduction of life to eukaryotes: ignoring the origin of prokaryotes ( Did prokaryotes arise from eukaryotes? )
Reduction of males and females to a sexless genome ( Sexual reproduction is far more complicated than asexual reproduction )
Reduction of diploid eukaryotic species to haploid DNA sequences
Reduction of genomes to nuclear genomes, ignoring mitochondrial genomes (animals, plants), chloroplast genomes (plants) and hologenomes (endosymbiotic bacteria). ( Endosymbiosis refutes Independent Origin )
Reduction of genomes to a random collection of genes, ignoring the probability of a complete viable genome
Reduction of genomes to protein-coding genes ('protein-centric view'): ignoring everything else (non-coding DNA, an abundance of short and long noncoding RNAs, regulatory sequences, promoter regions, enhancer regions, transcription factor-binding sites, intergenic regions, histone modification, DNA methylation, chromosome-interacting regions, transposons, repetitive DNA)
Reduction of genes to ORFs ignoring translation (mRNA must be transported out of the nucleus in to the cytoplasm and attach to ribosomes).
Reduction of genes to exons: discarding introns and necessary splice signals
Reduction of genes to a Sequence of 4 'symbols' A,T,C,G: reduction of biology to statistics, ignoring chemistry.
Reduction of an organism to 'The Sequence' ( The genome-centered and information-centered approach ).

Three major errors of independent origin

The first major error: the argument assumes that if a gene (genome) can be generated by a computer, it can also be generated naturally (prebiotically) and it assumes highly specific enzymes. The origin of life is a chemical problem. It is true that (random) polymers must have been synthesized prebiotically. But assuming that the polymer must have been DNA and that it could form spontaneously a length of billions of bases, and then building your whole theory on it is very risky. Very risky indeed, because he published his book one year before the first complete eukaryotic genome was sequenced in 1995 (301), so he could not calculate the probability of a whole genome sequence. Yes, it is true that DNA is crucial for life, but it does not necessarily imply DNA was involved in the origin of life. DNA does not form spontaneously.

virus: DNA + capsid

The second major error (I became aware of this only after nearly 10 years) is that even a 100% accurate human DNA sequence without a cellular context has no more meaning than a random DNA sequence of the same length.
Even if a correct human diploid sequence of 6 billion base pairs, including all introns, exons, splice sites, regulatory sequences, promoters, enhancers, would be found in a random sequence, the sequence would still be as dead as a doornail!
Even if that sequence would be present in a diploid heterozygote state and correctly distributed over 46 units (simulating human chromosomes; 2n=46, XY or XX pair), the sequence would still be as dead as a doornail!
Even if all sequence characteristics of the chromosomes were present (telomeres, centromeres), including 'junk-DNA', the sequence would once more still be as dead as a doornail!
That is the elephant in the room! The 'human genome sequence' could not produce a human. The sequence would not even be 'living'. The reason is the same as why a DNA virus is unable to reproduce outside a living cell (331): a naked DNA sequence is unable to do anything outside a living cell. It needs a cell to reproduce. Even a DNA-virus itself does not consist solely of DNA but also of a protein coat (capsid).

There is a profound reason for this error: the DNA- and genome-centered view of life ('genetic determinism' 219):

THE SEQUENCE IS NECESSARY AND SUFFICIENT TO CREATE AN ORGANISM.

This is wrong if applied to the origin of a multi-cellular individual from a zygote. A zygote is a eukaryotic cell formed by a fertilization event between a male and female gamete. The zygote's genome is a combination of the DNA in each gamete (216). Importantly, the zygote contains a lot of cellular information that is not present in the DNA sequence of the genome (528). But Senapathy got carried away by the idea, took it too literally, and transformed it into:

THE SEQUENCE IS NECESSARY AND SUFFICIENT TO EXPLAIN THE ORIGIN OF LIFE.

spermium

The idea that The Sequence is necessary and sufficient to create an organism is an absolute requirement for independent origin. For, if other factors than The Sequence would be required, The Sequence itself could never explain the origin of life. The next step is: let the Sequence be created abiotically by the Primordial Pond and the origin of life would be explained. Both scenarios (1) and (2) fail for the same reason as why a virus could not be the explanation for the origin of life: the Sequence depends on other factors to do anything. Both naked DNA and a virus need a living cell to do anything. A virus, despite containing genetic information, cannot be called 'living' because it does not and cannot do anything itself (see:

Ganti). A virus is a parasite. Even the notion of 'genetic information' in a virus completely depends on the cell's interpretation machinery. Without a cell we would not be justified to claim that a virus carries genetic information. Just like naked DNA. Just like a sperm, despite the fact that a sperm has a complete human genome. A sperm itself cannot produce a human. It needs an egg cell. An egg cell is more than DNA.

The third major error:

A DNA SEQUENCE HAS AN ABSOLUTE MEANING 4 Oct 2025 / 22 Mar 2026

A DNA sequence on its own has no meaning. The meaning of a DNA sequence is relative to a specific Genetic Code and the biochemical implementation of that Code. For example, by convention the Genetic code table is read from left to right. But it could just as easily be read from right to left. That would also give meaning to a DNA sequence. Assuming the current genetic code for the origin of life is an unjustified assumption. There is no evidence that the current genetic code is biochemically necessary, and that it is the only possible genetic code. The fact the the current genetic code on earth is nearly universal is no proof that is is biochemically necessary. It could be highly arbitrary or completely random. There can literally be thousands of codes that result in the same proteins. Because there is no absolute meaning of a DNA sequence, it makes no sense to ask the question: Can we find a human gene in a random DNA sequence in the Primordial Pond? Certainly not at the Origin of Life. A more meaningful question would be: how did the current Genetic Code originate? Which features can be predicted?

These objections are so fundamental that no future developments can change that situation. I now see clearly that to start the origin of life with DNA is hopelessly wrong. This has been known for decades (395). DNA is the most unlucky choice anybody can make.

Possibly, there is a role for randomness in the origin of life. Possibly, the origin of catalytic RNA molecules from random RNA sequences (217) plays a role in the origin of life. But nobody demonstrated the spontaneous origin of DNA sequences of billions of bases long. Even a bacterium like E. coli is too complex to originate in a primordial pond. Origin of Life researchers aim at something 1000 times less complex than E. coli, while Senapathy aims at something a thousand times more complex than a bacterium. Possibly, there is a role for random DNA sequences in the origin of new genes in existing organisms (279, 280). Possibly, there is a role for computer simulations of random networks (see:

Stuart Kauffman), but all computer simulations require empirical support.
The origin of life requires a bottom-up approach starting from chemical building blocks and using laws of chemistry. Senapathy assumes the complete cellular transcription and translation machinery of a cell (454). Importantly, since eukaryotic introns are not self-splicing, one needs the complex eukaryotic splicing machinery. The origin of life is not solved by simply assuming all these things.

Surprisingly, and contrary to his claims, his theory is not truly independent birth of organisms, because it involves a lot of micro-evolution, mutation and natural selection. Even more clearly contradicting independent origin is the idea that prokaryotes are not independently born, but somehow evolved from eukaryotes. This contradicts his revolutionary claim "That Evolutionary Theories Are Fundamentally Incorrect" (subtitle of book!). Significantly, the only non-supernatural alternative to Darwinism invokes (micro-)evolution and a lot of selection.

Methodology: theory and data

His theory tries to solve problems in the theory of evolution, such as 'missing links' between supposedly related organisms; the 'Cambrian explosion'; split genes; junk DNA and the C-value paradox (526). Senapathy made several methodological errors. He overestimates the success of his own theory and underestimates the problems. At the same time he is exaggerating the problems of the theory of evolution, and ignores the successes of the theory of evolution. His theory creates many more problems than it solves, if it solves any problem at all. Anyway, the origin of life is the most difficult problem in biology (for an introduction, see wiki article). The best scientists in the world have not yet solved it, but Senapathy states that "the new theory is able to explain the origin and diversity of complex creatures" (p.8).
Apart from computer simulations and statistical analyses, he did a lot of primordial pond story telling (466),(524), filling gaps in knowledge by making extensive use of imaginary scenarios (425), some passages look very much like science-fiction (here), wishful thinking to an extreme degree (420), personal beliefs (473) and 'solid' conclusions based on shaky foundations (474, 534). A recurrent and serious methodological problem is that he doesn't clearly distinguish between possible state of affairs ("it is possible", "it is conceivable") and factual state of affairs ('is') (493, 493a, 420). He claims that his theory makes predictions that are corroborated by the data, but that is questionable (494). He knows things without doing the experiments or calculations (419). He simply didn't have the necessary data to support his theory (470). His computer simulations of ORFs seem to be tweaked to give the desired result (537).
There are unresolved contradictions in his theory (478). For example, mutations are allowed in 'immutable genomes' (417). This fuzziness is unacceptable in a scientific theory. When necessary, he makes exceptions to his own theory (387). He conveniently provided his Primordial pond with unlimited powers (431), (452). Unique features of organisms are explained by the production of unique genomes in the Primordial Pond, and common features of organisms are explained by reusing genes from successful organisms (486). There is no way this theory could fail. In this way, any arbitrary combination of unique and common features in species can be explained. He also included an arbitrary combination of 'independent origin' and evolution by natural selection in his theory (487). Adding 'natural selection' to his theory contradics the title of his book (529).
He assumes so much that in fact he solved no problem at all. He did not address the hard problems of the origin of life (482). Remember: Senapathy not only tried to replace Darwin's theory with his own theory, but also wanted to invent a completely new Origin of Life theory. That's an unimaginably heavy burden for one person. Remember, Darwin did not try to solve the origin of life. It seems Senapathy isn't fully aware of the hard problems at all. Furthermore, to design and evaluate a new theory that encompasses the whole of biology, one must be able to synthesize a huge amount of data from all biological subdisciplines. This is almost impossible for one person to manage. The book doesn't have co-authors.
Furthermore, his theory is a convoluted theory with several inconsistencies. He does not think deeply enough about his own theory. In order to be able to do that, one has to have professional knowledge of all areas of biology (to name a few: ecology, foodwebs, parasitism, symbiosis), not just one particular field (STOP codon statistics). But his general biological knowledge is often at the level of a layman. This is evident from vague layman concepts such as 'creatures', 'organism', 'seed cell', 'born', 'primordial pond', 'eons', 'myriads' (484). Furthermore, he invents his own concepts such as 'seed cell', 'DG pathways', 'Universal Gene Pool' (UGP), 'random perfection'.
Scientific theories are tentative, but his theory is not tentative at all. He does not acknowledge uncertainties or limitations. He uses words like 'it is highly probable' and 'it is certain'. Scientists are usually very cautious in scientific publications, but Senapathy is absolutely certain that his theory is correct (488). In fact, he claims he has proven that his theory is true (489). Unfortunately, in the field Origin Of Life, there are no certainties. Potentially contradictory/falsifying data (endosymbiosis) are not taken seriously and are unceremoniously dismissed. He underestimates the profoundness and complexity of the origin of life problem, and he overestimates the power of his own theory to an extreme degree. He never doubts. He is overconfident (Dunning–Kruger effect). At the same, time he is much more critical of the theory of evolution, than of his own theory. In contrast to Darwin, Senapathy did not include a chapter 'Difficulties of the theory'. On the contrary. There is not a single critical remark on his Wikipedia page (506).

In this review I used a lot of information not available to Senapathy in 1994. However, Senapathy is unaware of too many biological facts that were known in 1994 and he continues the same approach in recent publications (141).

Periannan Senapathy:
thinking too far
outside the box
and too independent

Senapathy is a baffling personality, an outside-the-box thinker with an unhealthy dose of infallibility. I conclude that he is a stop codon specialist, but not a biologist. On the one hand he had a job at the National Institutes of Health in Bethesda, he is the director of a software company, is very ambitious, published already 4 articles in scientific journals before he published his book (Science, Genomics, Bioinformatics, PNAS (194), PLOS ONE (406)) and was cited in books (175), in the New Scientist (236) and other journals (195), developed the Shapiro Senapathy algorithm (407), is present in Wikipedia (423), quotes from mainstream evolution literature (C. Darwin, S.J. Gould, D. J. Futuyma, E. C. Minkoff, G. G. Simpson, R. Dawkins, T. Dobzhansky, M. Kimura), but on the other hand he doesn't know basic biological facts, and often behaves like a crank (372). He is not a biologist (426). His theory is still mentioned in his Linkedin profile (379). However, my review of his book has not been written to attack the person. I was curious how a non-evolutionary, non-religious theory explains the existence of life on Earth.

What I learned

Studying this and religious alternatives for evolution, convinced me that it is extremely hard to develop a non-evolutionary but naturalistic alternative for the history of life on earth which (a) does not contradict the facts of life and (b) is internally consistent and (c) is an improvement. At the same time the theory of independent origin stimulates thinking far more than creationism does. It proved to be a useful framework to organize your thoughts about what life is and the origin of life. Further, I learned that evolution theory is certainly not a shallow idea that is dogmatically and mindlessly defended by evolutionary biologists. I am impressed and a little bit surprised that evolution theory escapes so many of the traps Periannan Senapathy fell into. I am equally surprised how thousands of seemingly neutral facts turn out to be evidence against 'Independent Origin'. This endless stream of facts continues to inspire and give me deep insights into the fundamental properties of life on Earth and the theory of evolution. All this has a direct relation to the central questions of biology, evolutionary biology, the Origin of Life and the origin of introns.
The availability of two opposing theories of life turns biological facts into arbiters. And that makes this project so rewarding.
The best scientists have made important discoveries and, at the same time, admit that they do not understand everything, that they cannot solve every problem; and that their favourite theory is not perfect and does not explain everything. The best scientists listen carefully to scientists who disagree with them, and try to understand their best arguments and seek out information they have that you don't. Therefore, it is always very unwise and unscientific to say: my theory easily explains everything! And that is also true for the theory of evolution. Francis Crick wrote:

"Theorists in biology should realize that it is extremely unlikely that they will produce a useful theory (...) just by having a bright idea distantly related to what they imagine to be the facts. Even more unlikely is that they will produce a good theory at their first attempt. It is amateurs who have one big, bright, beautiful idea that they can never abandon. Professionals know that they have to produce theory after theory before they are likely to hit the jackpot. The very process of abandoning one theory for another gives them a degree of critical detachment that is almost essential if they are to succeed." (Francis Crick, 398).

As an independent researcher, you may be encouraged by discovering 'errors' in a mainstream scientific theory, but don't make the mistake of thinking that your own theory is error-free. That makes you overconfident and blind to your own errors (508). Discuss your theory with colleagues before publication. You could have misidentified errors (spurious errors) and have overlooked the real errors in the theory you are attacking. Your task is to find out what is wrong and what is right. It is never easy to improve (let alone replace) an extremely wide-ranging and all-encompassing theory such as the theory of evolution, especially if you do it all by yourself. If you want to publish a revolutionary theory that overthrows everything biologists know, think twice. Foremost, make sure you have a thorough knowledge of all aspects of biology.

"Biology is always more complex than you'd like it to be." (393c).

30 Conclusion after 20 years

added: 14-18 Jul 2023

The amazing thing I discovered recently is that Senapathy was aware of important facts that should cause fatal problems for his theory. He must have some pretty good idea that these facts cause serious problems for his scenario. This is my summary of the problem:

In each and every living thing on earth DNA is always synthesized on the basis of another DNA template (semi-conservative replication) and never de novo from building blocks.
Gene expression is controlled by transcription factors, which are proteins that control the rate of transcription.
DNA is always first transcribed into mRNA, STOP codons do not matter at transcription. RNA-polymerase is required for transcription. That is an enzyme.
splicing machinery (spliceosome) is required which consists of ribonucleoproteins (enzymes)
translation requires ribosomes (translational apparatus) consisting of ribosomal proteins
translation requires transferRNA (tRNA) and aminoacyl-tRNA-synthetases, the latter are enzymes.

This is what he writes:

"It should be remembered that these proteins and RNAs of the primitive ribosomes (and other complex machineries) were not DNA coded. They were the random polymers of amino acids and nucleotides, chemically synthesized in the primordial pond. Rare associations of proteins and nucleic acids into "nucleoproteins" could carry out flashes of activities such as duplicating DNA (DNA replication), copying DNA into RNA (transcription), editing and splicing RNA (RNA splicing) and decoding RNA into protein (translation) in the primordial soup." page 210 (bold added) (418).

Senapathy must have been aware that this is a chicken and egg problem: which was first: enzymes? DNA? Since enzymes are specified by DNA sequences and produced with the help of enzymes, one cannot start with DNA alone. DNA alone can do nothing. But one cannot start with enzymes either, because they are coded by DNA. Those enzymes have high specificity and cannot originate randomly. That specificity is stored in the sequence of DNA. That's the whole point of DNA.
His 'solution' was astounding: he assumed that those highly specific enzymes were present in the primordial pond (albeit in a primitive form!). The necessary enzymes are not coded by DNA, he writes. They are prebiotically produced enzymes. They miraculously originated spontaneously in the Primordial pond! Clearly, he underestimated the specificity and the complexity of these large enzymes. The introduction of 'primitive' ribosomes and spliceosomes' in the primordial pond (p.210) suggest an evolutionary process and that contradicts the theory of independent origin of eukaryotic organisms. The concept 'primitive' shows that he feels uncomfortable with the idea that highly complex enzymes originated in the primordial pond. By introducing enzymes, he implicitly admits that naked DNA (however complex or how many protein coding sequences it may contain) could never form life.
Please note, I do not use today's knowledge. The above quote shows Senapathy knew that enzymes are necessary. I listed more examples above.
Many years later I discovered that I could have improved the quality of the review by double-checking what the author wrote on specific topics throughout the text. The pdf of his book with full-text search proved to be an invaluable asset. No printed book can compete with that. As a result I included specific illuminating quotes from the book.

First things first! 26 Jul 2025 – 10 May 2026

After finding a thousand objections to the theory of independent origin, I suddenly realized that there is a hierarchy in the biological facts that make independent origin impossible. For example, geology, climate, ecology, sex, maternal care, the paleontological record are all valid objections, but they are secundary or tertiary. The origin of life starts with the most important precondition (1) a bio-friendly prebiotic environment, then (2) the origin of DNA, Genetic Code, cells and cel division, and finally (3) the rest: all animals and plants. The prebiotic conditions must carry the most weight in the critique, simply because no plant and animal can exist without fully functional chromosomes, cells and cell division. That is where it all begins (according to Senapathy eukaryotes originate in the primordial pond). If DNA, chromosomes and cells cannot originate in a primordial pond out of simple molecules, then you might as well forget about the rest.
Among the problems of the origin of chromosomes, cells, and cell division, the abiotic chemical synthesis of DNA itself and the factors that give DNA its meaning have the highest priority. The factors that give DNA its meaning are a Genetic Code which interprets the DNA sequence and the biochemical machinery that physically performs the task of interpreting DNA. Statistics of STOP codons is of the least concern. See:

a DNA sequence is not a genome.
this causes a chain reaction.

SuperSummary 1 Mar 2026

Senapathy's theory 'Independent Birth of Organisms' is unable to explain the origin of a single organism.
All the supposed problems in the theory of evolution that Senapathy raises in his book, fail to provide evidence for his theory.

31 Appendix: Genetics Primer

Eukaryotic cell according to Senapathy
(Fig 7 Appendix)

Senapathy's book contains a large 30-page 'Appendix: Genetics Primer'. With this section we can compare 1) What did Senapathy know in 1994? 2) What was known by scientists in 1994?

In the section 'The Genome' (p.543) he describes the structure of 'the genome' in general, but this appears to be only the eukaryotic or human genome. As if prokayrotes do not exist. Only on page 551 are prokaryotes introduced. The information is correct, but he omits a crucial fact: eukaryotic mitochondria contain DNA with prokaryote-like characteristics. It is also not indicated in figure 7 of the Appendix (p. 552). He could have known it, because the human mitochondrial genome was published in 1981 (548).
There is no mention of the endosymbiosis theory (see here). Chloroplasts are not mentioned. Only on the last page ('Some anecdotes', p. 566) we learn that human DNA is about 600 times larger than the bacterium E. coli. The concept 'zygote' is mentioned on page 559 but it is not explained that a zygote originates by fusion of a haploid egg cell and a halpoid sperm cell. 'Transfer RNA' and 'Ribosome' is mentioned (page 556) but not ribosomal genes. The Genetic Code table is present on page 548 with a very short description but nothing about its orgin. The start codon (AUG) is absent from the table.
The important concepts 'heterochromatin' and 'euchromatin' are completely absent in the book. He does describe 'chromosomes': "Histones are a set of proteins that "package" the DNA and lead to an ordered chromosomal structure called chromatin..." (p.545), but does not explain where histones (which are proteins!) come from (see the elephant in the room). He also mentions 'spliceosome'.
He also knows that the expression of genes is regulated by the binding of multiple proteins to regulatory regions in DNA before the start of a protein-coding gene (p.554-555). However, he doesn't see that this adds enormously to the complexity of the genome and that there is more to a genome than stop-codon statistics.

Assessment: only somebody who read the appendix would notice that his theory implies the very counter-intuitive idea that a simple bacterium cannot arise spontaneously in the primordial pond, whereas the human genome, which is according to his own data 600 times bigger, easily does! Furthermore, in the context of gene regulation he mentions "a simple bacterial cell [prokaryote], and a more complex cell [multicellular organisms]" (p.559). Additionally, eukaryotes mainly have sexual reproduction which is more complex than asexual reproduction (prokaryotes such as bacteria).
It certainly is intriguing that vertebrate genomes have low gene- and information densities and high intron densities compared to bacteria (246, p.233), but at the end of the day it does not help in the slightest degree. (In contrast, the theory of Kimura and Ohta explain why bacterial genomes are information dense and those of multicellular species have low information density). By –conscious or unconscious– omitting several important facts which cause trouble or are fatal for his theory, his Genetics Primer is a biased presentation of Genetics. Even considering what was known in 1994.

32 Humans

added: 23 Jun 2025

On 6 Sep 2021 from Periannan Senapathy requested to remove 'humans' from the statement "Independent researcher Periannan Senapathy came up with an extraordinary solution: the independent origin of all organisms, including humans." (which is the opening sentence of my review). I blogged about this (387) and (388). For completeness, I collect here relevant quotes about humans or implying humans.

"When such a high level of extreme order as found even in the genome of a worm can arise from the chaos of random genetic sequences, it is not comparatively more difficult to create the order in the genomes of organisms such as the complex human." Note 62 on page 619 of his book.
"In my opinion, without this fundamental finding about the genes which are central to life, it would be impossible to show that multicellular animals and plants could have directly originated in the primordial pond." page 250.
"In this sense, humans, elephants, rats, earthworms and even the very small microscopic insects and invertebrates all have nearly the same genomic complexity in terms of the structure and function of the genome." page 302.
"When we consider the case of the independent birth of mammals, it is reasonable to think that a conglomeration of a large number of cells and biochemicals in the primordial pond could have formed an environment akin to that of the placenta and uterus of mammals. There, a seed cell can differentiate into an embryo and a full-grown offspring". (page 309 chapter 8).
"it is clear that all organisms were born independently, and were – and are – also immutable. This is the secret of life, and of its origin and history." page 373
"It is remarkable that all the characteristics of random DNA are still essentially present in the split genes of present day intron-dense large genomes such as those in the human." (105)
"When we construct a curve for an actual gene, the human globin gene, we see that it is quite similar to that of the random sequence."
"This work also explains why the genomes are very large, for example, the human genome with three billion bases, and why only a very small fraction of the human genome (~2%) codes for the proteins and other regulatory elements." from: https://en.wikipedia.org/wiki/Periannan_Senapathy.

Please note that the structure of the human genome with split genes, exons and introns is fundamentally the same as in all animals and plants. This follows from his theory. To make an exception for humans is clearly against the theory of Independent Origin (of eukaryotes).

33 Wikipedia article 'Periannan Senapathy'

added: 29 Jul 2025. Updated: 31 Jul 2025

In his Wikipedia article Periannan Senapathy (last edited 16 July 2025), paragraph 'Origin of split genes from random DNA sequences', he explains why eukaryotic genes have a split structure or a (exon – intron structure. His explanation is: split genes originated from random DNA sequences.
The problem with this explanation is: if a hypothetical gene has a random sequence than everything from begin to end is random including the supposed 'exon-intron' boundaries. But introns have to be recognized somehow in order to get removed. The recognition is achieved by special splice site recognition sequences which are located at the beginning and end of each intron. In fact those sites define introns. Without those sites, exons and introns cannot be distinguished (both exons and introns are just random sequences anyway). Because these boundaries are specific they are non-random (504), (505). Consequently, there is no way that these splice sites occur in random DNA. Think about this: human genes contain on average 8 introns, that is 16 splice sites per gene. Since humans have about 25,000 genes, there are about 400,000 splice sites. A few could occur by chance, but not 400,000.
The whole argument can be summarized in one sentence:

Either a sequence is random, and then there can be no well defined splice recognition sites, and thus no exons and introns
or there are splice recognition sites, and exons and introns do exist, but then the sequence is not random anymore.

Consequently, the exon-intron structure of eukaryotic genes cannot arise from random DNA. The inescapable and final conclusion is that the origin of eukaryotes cannot be explained by random DNA. There is no escape from this conclusion.
Senapathy failed to explain the origin of life. His Primordial pond will neither be the birthplace of eukaryotes nor of prokaryotes. Under the most favourable conditions his Primordial pond will contain dead meaningless random DNA sequences.

In his wikipedia article Shapiro–Senapathy algorithm there is additional evidence that splice sites are not random:

"the likelihood that a given sequence functions as a donor or acceptor splice site."
"The majority of disease-causing mutations in the human genome are located in splice sites."
" the consequences of splice site mutations including exon skipping and intron retention."

Since mutations in splice sites disrupt normal splicing, splice sites are not random sequences. They are specific. They have a certain signature. This disproves the theory that eukaryotic genes and genomes arose from random DNA.

34 Notes & References

Senapathy: "When I initially tried to explain my theory to my wife, I said, All the organisms could have come about just as they are, independently from the primordial pond. Her response..." (p.295). Please note, she forgot to ask about humans.
Source of chromosomes. These chromosomes are not from Senapathy's book. Although the word 'chromosomes' does occur frequently in his book, the 'origin of chromosomes' is not discussed in his book. That is the point.
Only male honeybees hatch from unfertilized eggs [are haploid], but female honeybees hatch from fertilized eggs [are diploid], therefore that won't help independent origin theory very much. See: Olivia Judson(2002) Dr. Tatiana's sex advice to all creation, p.18 . Hermaphrodites (organism with both female and male sex organs) usually need another individual to reproduce.
Helen Pearson: Human genetics: "Dual identities", news feature, Nature 417, 10-11 (2002), 2 May 2002.
See for a full exposition for the non-specialist Mark Ridley (2000) Mendel's Demon. Gene Justice and the Complexity of Life (review), Chapter 6 "Darwinian merger and acquisition" is about the far reaching implications of having mitochondria in the cell.
Christian de Duve (2002) Life Evolving, Oxford Univeristy Press, p. 141.
Robert Pennock (2001) Intelligent Design Creationism and its Critics, p. 685.
S.G. Gregory et al (2002) A Physical map of the mouse genome. Nature AOP, published online 4 August 2002.
Carina Dennis and Richard Gallagher (2001), The Human Genome, p.120: "The largest apparently contiguous conserved segment in the human genome is on chromosome 4, including roughly 90,5 Mb of human DNA that is orthologous to mouse chromosome 5."
Similarities found in mouse genes and human's. Nicholas Wade, NewYork Times Science, 5 Dec 2002.
Comparision of Human chromosome 4 and Mouse chromosome 5.
Comparision of Human chromosome 17 and Mouse chromosome 11.
Comparision of Human chromosome 20 and Mouse chromosome 2.
A memorable misunderstanding on this site.
Evolution (Third Edition) on this site.
A Chemist's View of Life: Ultimate Reductionism & Dissent on this site.
Ernst Mayr (2001) What Evolution is, p.46. See also: Lynn Margulis(1998) Five Kingdoms. An illustrated guide to the phyla of life on earth, third edition. p.12.
Senapathy has a short paragraph What is a "seed cell"? (p.307), in which he uses 'haploid' and 'diploid', but there he neither explains what a seed cell is, nor how a diploid cell arises out of the primordial pond.
In noot 107–109 of Chapter 4 he knows "Thus when the sperm of a male and the egg of a female unites to produce an offspring" and: "Almost in all multicellular organisms, each chromosome is present in two copies, one from the father and one from the mother. The same chromosome from the father and the mother are called homologous. For instance, there are 22 pairs of chromosomes and a female (X) and a male (Y) chromosome in the human." (page 588). This is not quite correct; females have XX, males XY and the complete genome is: female 46XX, males 46XY.
Portrait of a molecule by Philip Ball. This is a good article for those who think that a genome is just naked DNA. Nature 421, 421 - 422 (2003) (free). Have a look at the beautiful diagram of the 3-D structure of the chromosome showing that a genome is more than just the sequence of the bases! Looking at this image it is clear that Senapathy's discovery about split genes in random DNA is almost irrelevant. He did not explain the massive amounts of highly specialised proteins (histones), which form the complex 3-D structure of the eukaryotic chromosome.
Richard Dawkins used the now famous weasel computer experiment to demonstrate the difference between one-step and cumulative selection in The Blind Watchmaker, chapter 3. See also my Spetner review.
David Foster attributed this argument to Thomas Huxley (see review of Foster's book).
Senapathy states (p.222) that the occurrence of the uninterrupted text of Shakespeare is improbable.
Senapathy easily contradicts his own theory: "Thus it is possible for the prokaryotic genome to have been derived directly from contiguous genes in the open primordial pond". (p.238)
In fact, this question is wrong. There is no such a thing is "the human genome". Only female and male human genomes exist.
"the primordial pond could have been productive for a very long geological time" (p.345).
"indentifiying exon-intron borders is a notoriously difficult task", Antoine Danchin (2002) The Delphic Boat p.238.
Charles Spruck (2003) Requirement of Cks2 for the first metaphase/anaphase transition of mammalian meiosis. Science 300 (5619):647. 25 Apr 2003.
Whenever Senapathy is uncertain, he says "absolutely". The word occurs 138 times in his book!
Even Jesus, the son of God, had a mother. Significantly, this is claimed by people who otherwise accept miracles. However, Adam and Eve did not have a mother and father, but were created as adults. Senapathy's creatures also did not have a father and mother. The biblical creation story and Senapathy's scenario both are examples of independent origin. Even an orphan had a father and a mother.
A Conversation with James D. Watson, Scientific American, March/April 2003.
David Haig (2002) Genomic imprinting and kinship, Rutgers University Press, p.11. A further reason for the absence of parthenogenesis in animals is that the sperm also contributes the centrosome to the egg which is essential for initial divisions of the fertilized egg (Christiane Nüsslein-Volhard, 2006, p.15).
John Maynard Smith & Eörs Szathmáry (1999) The Origins of Life. From the Birth of Life to the Origin of Language. Furthermore, the members of higher levels are composed out of members of lower levels.
Graur and Li (2000) Fundamentals of Molecular Evolution. Second edition. p. 136.
Donald Forsdyke (2001) The Origin of Species Revisited, p. 103. (seereview on this site).
Syozo Osawa (1995) Evolution of the Genetic Code, p. 45.
Michael Majerus (2003) Sex wars. - Genes, bacteria, and biased sex ratios, p.63,66.
Actually, meiosis is more complex. In males, the products of meiosis are four sperm, each sex chromosome in the original diploid cell being present in two of the products. In females of most species, however, only one egg is produced for each parent cell that undergoes meiosis, the other three haploid products together giving rise to the yolk of the ensuing egg. See also review of Mendel's Demon (Unexpected predictions and explanations).
Jan Sapp (2003) Genesis. The Evolution of Biology, Oxford University Press, paperback, p.x (Prefeace). This is elaborated in the chapter "Beyond the Genome".
M. Lynch (2002) 'Intron evolution as a population-genetic process' Proc. Natl. Acad. Sci. U.S.A. 99, 6118 (2002)
Paul Davies (1999) The Fifth Miracle. The Search for the Origin and Meaning of Life. p.119. Very important book!
What is known about the function of introns? , Scientific American, Ask the experts/Biology, 1999. Of course Senapathy could not have known this in 1994.
Louis Berman (2003) The Puzzle. Exploring the evolutionary puzzle of male homosexuality, p.478
Tibor Ganti (2003) The Principles of Life, Oxford University Press. (my review). Furthermore, Ganti writes: "A living organism can never be developed from genetic material alone", "An egg, a seed, or a spore must always contain the substances of the cytoplasm". p.126.
Freeman Dyson (1999) "Origins of Life", second edition. p.18.
J.J. Emerson et al (2004) "Extensive Gene Traffic on the Mammalian X Chromosome", Science 303, nr 5657, 23 Jan 2004, pp. 537-540.
However, retrogenes are known for a long time. Examples of intronless retrogenes are: PGK (1987), calmodulin gene (1987), globin gene (1987), actin gene (1985). See Wen-Hsiung Li (1997), p.347.
S. J. Gould (2002) The Structure of Evolutionary Theory, pp 252-253, 325, 527-528, 1174 (slightly adapted).
Solving the origin of life without the origin of species is difficult enough. However, even the origin of life itself is difficult enough because it commonly includes the origin of the genetic code. Hungarian chemist Gánti simplified the question by distinguishing between the origin of life an the origin of the genetic code.
Motomichi Matsuzaki et al (2004) 'Genome sequence of the ultrasmall unicellular red alga Cyanidioschyzon merolae 10D', Nature 428, 653-657. 08 Apr 2004. This species has 5331 genes, only 26 genes have introns.
Iris Fry (2000) The emergence of life on earth, p.56, 170.
Paul G. Falkowski and Colomban de Vargas (2004) Shotgun Sequencing in the Sea: A Blast from the Past? Science, 304, 58-60, 2 Apr 2004.
Iain Cheeseman and Arshad Desai (2004) Cell division: Feeling tense enough?, Nature, 428, 32-33, 4 March 2004.
Radu Popa (2004) "Between Necessity and Probability: Searching for the Definition and Origin of Life", p. 95-96. [ 18 June 2004 ]
David Bainbridge (2000) Making babies. The science of pregnancy, page 35-36.
Philip Ball (2004) "Synthetic Biology: starting from scratch", Nature, 431, 624-626 (7 Oct 2004). "Bacterial genomes are within the range of current DNA-synthesis technology" says John Mulligan, president of the DBA-synthesizing company Blue Heron Technology. But bacterial genomes must be embedded within a cell and its attendant biochemical machinery, making them much harder to synthesize than viruses.". [ 9 Oct 2004 ]
Why are stem-cells so important? Stem-cell biology is the second pillar of twenty-first-century biology. If a genome were enough, why are stemm cells so important for medicine? See: Ann Parson (2004) The Proteus Effect: Stem Cells and their Promise for Medicine. [ 24 Oct 2004 ]
Mark T. Ross et al (2005) "The DNA sequence of the human X chromosome.", Nature, 434, 17 Mar 2005, 325-337.
Christian de Duve (2002) Life Evolving, p.38.
Gil Ast (2005) The Alternative Genome, Scientific American March/April 2005 pp 40-47.
Douglas Futuyma (2005) Evolution, Sinauer Associates, page 53 and 440. (figures adapted for the web).
This is similar to explaining everything by saying 'God created the perfect fit between organism and environment'.
Large genomic differences explain our little quirks, Nature, 19 May 2005, 252.
Patrick Forterre (France) argues "that bacteria probably evolved more recently, and that LUCA was in fact a eukaryote", NewScientist 3 Sept 2005, p.28. (LUCA=Last Universal Common Ancestor). So Forterre seems to suggest that bacteria evolved from eukaryotes, but the difference with Senapathy is that Forterre does not claim that all eukaryotes arose independently. So although Forterre's view is highly unorthodox and implausible, Senapathy's view is a million times more unlikely.
Nick Lane (2005) Power, Sex, Suicide. Mitochondria and the Meaning of Life, p.143.
Asexual reproduction in animals is rare: the freshwater polyp Hydra reproduces by budding and some insects like aphids show life phases of quick multiplication through diploid eggs that form large, genetically identical clones. But in difficult times, even these animals reproduce sexually. (Christiane Nüsslein-Volhard, 2006, p.21).
Please note that the primordial pond illustration on the cover shows a turtle, a frog, a crab, a butterfly, a worm, a fern, but no mammal and no human being. Please note that the illustration suggest frogs develop directly from DNA, but in reality they develop from tadpoles. Similarly, butterflies develop from larva; pupa (not in water!).
Blowing In The Wind. Seeds & Fruits Dispersed By Wind. (a beautifully illustrated page about all kinds of seed dispersal).
Robert Shapiro (2007) 'A simpler Origin for Life', Scientific American, june 2007, page 28.
Alec Panchen (1993) Evolution, p.175.
Reduced fitness in individuals due to homozygous deleterious alleles is known as "inbreeding depression". See: Scott Freeman and Jon Herron (2007) 'Evolutionary Analysis', page 270. See also: "the analysis found more than 4 million variants between Venter's maternal and paternal chromosomes. This suggests that humans differ by 0.5%, not 0.1%, as suggested by earlier estimates." Jon Cohen (2007) Venter's Genome Sheds New Light on Human Variation, Science, 7 Sep 2007. On the other hand inbreeding (of dogs) can result in extremely long stretches of identical DNA common in different individuals of the same breed -millions of bases long compared to the typical tens of thousands of bases in humans. This is artificial selection with probably high costs (Science 21 September 2007). It is no accident that DNA of dog breeds are now investigated for genes for 18 diseases including four cancers, four inflammatory disorders, and three heart diseases.
Catherine Jessus & Olivier Haccard (2007) 'Fertilization: Calcium's double punch', Nature 449, 297-298 (20 September 2007).
David Beerling (2007) The Emerald Planet: How Plants Changed Earth's History, pp.180-183. There are regions on earth (Athi Plains in Kenya) were C₃ and C₄ plants coexist, so that would be the place for Senapathy's primordial pond!
See my review: Carl Woese.
Catherine Brady (2007) Elizabeth Blackburn and the Story of Telomeres: Deciphering the Ends of DNA, The MIT Press.
Erika Check Hayden (2008) 'Evolution: Scandal! Sex-starved and still surviving', Nature, 10 april 2008. " "Bdelloid rotifers reproduce entirely without males: females package a complete copy of their DNA into eggs that develop, sans fertilization, into the next generation. Asexual reproduction certainly isn't unheard of in the animal world: parasitic bacteria force some insects to reproduce without males and female sharks kept alone in captivity have surprised their keepers by giving birth to baby sharks."
Vaclav Smil Energy in Nature and Society, MIT Press: 2008. 512 pp. reviewed in Nature, 10 april 2008.
David A. Wheeler et al (2008) 'The complete genome of an individual by massively parallel DNA sequencing', Nature, 452, 872-876 (17 April 2008)
Elliott Sober (2008) Evidence and Evolution. The logic behind the science, p.116.
Erwan G. Roussel et al (2008) 'Extending the Sub-Sea-Floor Biosphere', Science, 23 May 2008.
For this amazing story see: Audubon Magazine. For a map see here. If the primordial pond is at sea level, do these geese survive at that level? How does the bizar migratory behavior originate? Random? That's a helpful explanation! Senapathy never tells us when, and where the primordial pond existed!
In fact when one looks closely to the quote from page 204, Senapathy is describing random mutation followed by natural selection!!!
See for explanation 'Vicious circle' box in my review of Hubert Yockey. This is a devastating obstacle for Independent Origin for the same reason that any mutation in the tRNA genes is lethal. See also: 'Does life look unlike evolution?' in my review of Walter Remine.
"RNA nucleotides have never been synthesized from scratch, in spite of decades of focused effort" (Robert M. Hazen (2005) Genesis. The scientific quest for life's origin, p.219. Also: "Nucleotides, the building blocks of DNA have never been produced in any prebiotic synthesis experiment" (Barton, p.95). Senapathy can never solve this problem by supposing that nucleotides where synthesised under genome control, because without nucleotides no genomes!
But not impossible. It is quite funny that Senapathy did not claim that with enough time prokaryote genomes could originate.
Gordon Campbell in: M.R. Wright (2000) 'Reason and necessity'.
Pier Luigi Luisi (2006) The Emergence of Life. From chemcial origins to synthetic biology, page 208.
Sheref S. Mansy et al (2008) 'Template-directed synthesis of a genetic polymer in a model protocell', Nature, 3 Jul 2008.
"Neo-Darwinism is in fact falsifiable, for there are many empirically testable claims made, for example within modern genetics which currently explains the core principle of inheritance. However, if it were to be falsified a new theory would have to replace it, in order to explain design in a non-theological fashion, and this would have very many features in common with neo-Darwinism simply because of the explanatory burden such a theory would have to carry." Thomas E. Dickins in: Evolutionary Psychology, 2005. 3: 79-84. This is very interesting: not only any alternative to evolution needs to explain the same set of facts, it also is expected to have much in common with neo-Darwinism. This is exactly what Senapathy is doing: he copies natural selection and common descent into his theory!!! What he does not do is use the same set of data as neo-Darwinism. Furthermore, Intelligent Design theorist Michael Behe incorporates natural selection and common descent into his ID theory!
Nicholas H. Barton, Derek E.G. Briggs, Jonathan A. Eisen, David B. Goldstein, Nipam H. Patel (2007) Evolution, Cold Spring Harbor Laboratory Press, hardback 833 pp. (my review).
See for the probability of the spontaneous origin of a well-designed body: Richard Dawkins (1991) The Blind Watchmaker, Penguin books 1991 paperback edition, page 146.
Additionally, meiotic recombination shuffles the genome, so each generation inherits a new combination of parental traits. How does Senapathy's theory explain the origin and continued existence of such a widespread, complex and costly proces as meiotic recombination? Meiotic recombination contradicts the idea that genomes are essentially fixed. What is the purpose of a recombination of parental genomes?
John Maynard Smith (1999) The Origin of Life, page 3.
John Maynard Smith (1995,1997) The Theory of Evolution, Cambridge University Press paperback. p.110.
I overlooked this figure and his claim that the primordial pond coincides with the Cambrian explosion. However, it only makes my argument stronger and clearer.
Senapathy appears to know that chromosomes occur in pairs. On pag 588 in note 109 he mentions homologous chromosomes. He even knows that one chromosome of a homologous pair of chromosomes is from the father, the other from the mother.
However, he seems to adopt an infinite universe of DNA sequences which does the trick for him.
Of the myriad problems of this scenario I mention a simple error: "Only those individuals with the absolutely right organs will survive" is not correct for reproductive organs! The situation is not analogous. One does not need sex organs to survive, while one does need hart, lungs, kidney, liver, mouth, teeth, stomach, intestines, and anus to survive. Infertile people don't die. See: 'The female and male genome' on this page.
A good overview is: chapter 17 in: Stephen Stearns and Rolf Hoekstra (2005), second edition.
page 8. This is a revealing and charming description of his naive way of thinking.
Daniel Fairbanks (2007) Relics of Eden. The powerful evidence of evolution in human DNA, p.154.
John Allman (2000) Evolving Brains, Scientific American Library, page 86
Dylan Chivian et al (2008) 'Environmental Genomics Reveals a Single-Species Ecosystem Deep Within Earth', Science 10 October 2008.
Science Daily (2003) Ultra-low Oxygen Could Have Triggered Die-offs, Spurred Bird Breathing System, Oct. 31, 2003.
Helen Pearson (2008) 'Outcry at scale of inheritance project', Nature, 10 October 2008
Periannan Senapathy et al (2008) 'Origination of the Split Structure of Spliceosomal Genes from Random Genetic Sequences, Plos One, October 20, 2008.
When did this processing occur? Some evidence suggests that Senapathy means prebiotic processing, because he writes: "Stop codons occurred too frequently to allow functional proteins to be encoded in random DNA" (my emphasis). Life could not have started with too short genes and proteins, that would be incompatible with life. Life requires functional proteins. On the other hand, Senapathy also provides evidence that a substantial amount of processing occurred after the origin of random DNA: "The average exon length from the intron-rich genomes is about 170 bases whereas that expected from random ORF lengths is 60 bases. This may indicate that there has been a selection for longer exons within the allowed maximum ORF length of 600 bases for optimizing the frequency of suitable exon lengths." (my emphasis). Further evidence supports this interpretation: "mRNA splicing evolved to overcome the problem of the frequent occurrence of stop codons in primordial random DNA"; "RNA splicing evolved to circumvent the problem of short ORFs"; "According to the ROSG model, mRNA splicing evolved to overcome the problem of the frequent occurrence of stop codons in primordial random DNA that severely restricted ORF lengths." This is vague. Senapathy is not clear about it. A good scientific theory should be clear.
Computational Discovery of Internal Micro-Exons
Stewart Scherer (2008) A Short Guide to the Human Genome, p.32.
Ann Gibbons (2008) 'The Birth of Childhood', Science 14 November 2008.
Henry Nicholls (2008) 'Darwin 200: Let's make a mammoth', Nature 456, 310-314 (2008). Let's make a mammoth from its DNA is a perfect analogy with Senapathy's project to create an animal from its genome! All the problems and obstacles to create a mammoth from its DNA appear in Senapathy's scenario!
updated 13 Mar 2013. It is chemically possible to have 6 bases (3 base pairs), 8 bases (4 base pairs), 12 bases (6 base pairs) (Nicholas Barton et al (2007) Evolution, page 561) or maybe even 20 bases (10 pairs). In that case each base codes for one amino acid and the length of DNA would be reduced to a third of the original length. Shorter DNA molecules would be more efficient: it costs less energy and less building blocks to synthetize and replicate. However, the difficulty would be to find bases that are able to form pairs with a constant diameter to enable a stable, regular double helix structure. The disadvantage would be that 20 different bases must be synthesized, coded for and maintained.
On the other hand, organisms could do with only CG pairs or only AT pairs. Information in DNA can still be coded with one base pair. Each strand would have a sequence such as GCCGGGGC..., and each amino acid could be coded by four or five bases instead of three. Replication with only one base pair would be most accurate because only G or C (say) would need to be distinguished. The disadvantage would be that the length of DNA to encode proteins would much longer, almost double. Example of using the four-base codon CGGG in a cell-free translation system: FRET analysis of protein conformational change through position-specific incorporation of fluorescent amino acids.
In an interview Jerry Coyne says: "I don't know of any challenge to evolution that's ever come from a non-religious person. Personally I've never experienced one.". Senapathy is such a non-religious critic of evolution.
Noncoding DNA
Actually Carl Woese showed that 'Prokaryotes' must be replaced with 'Archaea' and 'Bacteria' or alternatively with 'microorganisms'. See also: Norman R. Pace (2009) 'It's time retire the prokaryote', Microbiology Today, May 2009, 85-87.
Kurland CG, Collins LJ, Penny D (2006) Science 312:1011-1014. Quoted by Lynch
John S. Mattick and Igor V. Makunin (2006) 'Non-coding RNA', Human Molecular Genetics 2006 15.
Eugene V. Koonin (2009) Darwinian evolution in the light of genomics, Nucleic Acids Research, 2009, Vol. 37, No. 4 1011-1034. free full text.
Henrik Kaessmann (2009) More Than Just a Copy, Science.
Carol Greider, Elizabeth Blackburn and Jack Szostak received the Nobel prize 2009 for their work on telemeres. Nature
Fuyuki Ishikawa and Taku Naito (1999) 'Why do we have linear chromosomes? A matter of Adam and Eve', Mutation Research/DNA Repair Volume 434, Issue 2, 23 June 1999, Pages 99-107: "Bacterial circular chromosomes have sporadically become linearised during prokaryote evolution".
Dirk Schübeler (2009) 'Epigenomics: Methylation matters' Nature 462, 296-297 (19 November 2009)
Dennis McCarthy (2009) Here be dragons. How the study of animal and plants distributions revolutionzied our views of life and Earth, p.100.
Dennis McCarthy (2009) 'Here be dragons', p.186 and 191.
See also: "As Simpson himself pointed out, -any event that is not absolutely impossible ... becomes probable if enough time elapses" (Nature, 4 Feb 2010) about Madagascar species. What we need is quantification.
Stewart Scherer (2008) A Short Guide to the Human Genome, p.41.
Explanation of basic principles: Reading the Genetic Code (Nature Education).
Michael Yarus (2010) Life from an RNA world, Harvard University Press.
This very profound objection to independent origin occured to me only 7 years after I started this review.
Elizabeth Pennisi (2010) 'Synthetic Genome Brings New Life to Bacterium', Science, 21 May 2010.
Douglas L. Theobald (2010) 'A formal test of the theory of universal common ancestry', Nature 465 219-222 (13 May 2010). But see: The common ancestry of life.
Nick Lane & William Martin (2010) 'The energetics of genome complexity', Nature, 467 929-934 21 October 2010
France Denoeud (2010) 'Plasticity of Animal Genome Architecture Unmasked by Rapid Evolution of a Pelagic Tunicate', Science, Published Online 18 November 2010.
Felisa A. Smith et al (2010) 'The Evolution of Maximum Body Size of Terrestrial Mammals', Science, 26 November 2010.
Elizabeth Pennisi (2010) Shining a Light on the Genome's 'Dark Matter', Science 17 dec 2010
'meaningless/meaningful': is relative to the language. This also holds for the genetic code language! Senapathy would have found genes from an arbitrary genetic language in a random piece of computer DNA, because meaningful genes can be produced by any genetic language. The point is however, that in a computer experiment it's easy to use the same language during the experiment, while in natural experiments there is nothing that ensures the same genetic language during the production of even one genome, let alone all genomes.
O.B. Ptitsyn (1984) 'Random sequences and protein folding', Journal of Molecular Structure: THEOCHEM Volume 24, Issues 1-2, July 1985
V. S. Pande et al (1994) 'Nonrandomness in protein sequences: Evidence for a physically driven stage of evolution?' PNAS Vol. 91, pp. 12972-12975, December 1994.
Anthony D. Keefe & Jack W. Szostak (2001) 'Functional proteins from a random-sequence library', Nature 410, 715-718.
Codon usage in E. coli and: codon usage.
D M Raup and J W Valentine Multiple origins of life PNAS May 1, 1983 vol. 80 no. 10 2981-2984
Senapathy uploaded 3 papers to Nature Precedings (Documents on Nature Precedings are not peer-reviewed, but any visitor can comment):
- Origin of biological information: Inherent occurrence of intron-rich split genes, coding for complex extant proteins, within pre-biotic random genetic sequences 13 December 2010 07:04
- The inherent occurrence of complex intron-rich spliceosomal split genes, including regulatory and splicing elements, within pre-biotic random genetic sequences 13 December 2010 07:27
- Parallel genome assembly from pre-biotic split-genes: A solution for the mosaic genome conundrum 13 December 2010 07:53
I submitted 3 comments to the first document, but only one was posted. Remarkably, that comments are blocked while simultaneous submitting 3 large articles is no problem at all! Only later the second one appeared. And on 28 February 2011 Senapathy replied.
T. Mourier and D. C. Jeffares (2003) 'Eukaryotic Intron Loss', Science p.1393, 30 May 2003.
Sakharkar KR (2006) 'Functional and evolutionary analyses on expressed intronless genes in the mouse genome', FEBS Lett. 2006 Feb 20;580(5):1472-8. Epub 2006 Jan 31.
Jain M et al (2008) 'Genome-wide analysis of intronless genes in rice and Arabidopsis', Funct Integr Genomics. 2008 Feb;8(1):69-78. Epub 2007 Jun 20.
Syozo Osawa (1995) Evolution of the Genetic Code, Oxford University Press. page 150.
Dmitry V. Fyodorov & James T. Kadonaga (2002) 'Dynamics of ATP-dependent chromatin assembly by ACF', Nature 418, 896-900 (22 August 2002)
C. G. Kurland, L. J. Collins, D. Penny (2006) Genomics and the Irreducible Nature of Eukaryote Cells, Science 19 May 2006
T. M. Embley, W. Martin (2006) 'Eukaryotic evolution, changes and challenges', Nature 440, 623 (Mar 30, 2006):
"In recent years, even that has been called into question, as some phylogenies have suggested that prokaryotes might be derived from eukaryotes".
Eugene V. Koonin (2009) 'Intron-Dominated Genomes of Early Ancestors of Eukaryotes', J Hered (2009) 100 (5): 618-623.
N. H. Barton et al (2007) Evolution, Cold Spring Harbor Laboratory Press, p.220.
Group I catalytic intron in wikipedia. Homing endonuclease recognition sequences are long enough to occur randomly only with a very low probability: approximately once every 7×10¹⁰ bp.
Scott William Roy, Walter Gilbert (2006) The evolution of spliceosomal introns: patterns, puzzles and progress, Nature Reviews Genetics Volume 7 March 2006 211
Francisco Rodríguez-Trelles, Rosa Tarrío, Francisco J. Ayala (2006) 'Origins and Evolution of Spliceosomal Introns', Annu. Rev. Genet. 2006. 40:47-76. Very imporant article (introns-first hypothesis).
Sequence and organization of the human mitochondrial genome, Nature (1981) 290: 457-65. Also: The mouse mitochondrial genome displays exceptional economy of organization, protein-coding genes with zero or few noncoding nucleotides between coding sequences .
Lawrence A. David, Eric J. Alm (2011) 'Rapid evolutionary innovation during an Archaean genetic expansion', Nature, 469, 93-96. 06 January 2011
GenScript (accessed 6 Jan 2011)
"Our findings thus show that any split gene that can encode complex proteins which form the structure of the spliceosome or any other eukaryotic cellular organelle," (p.8). This equals to the claim that organelles were designed.
Brian Hall, Benedikt Hallgrimsson (2008) Strickberger's Evolution, Fourth edition, page 139.
Lee R. Kump (2010) 'Earth's Second Wind', Science 10 December 2010
Radu Popa (2004) Between Necessity and Probability: Searching for the Definition and Origin of Life, Springer, p. 67-68.
In previous versions I stated only that every OOL theory needs to explain the restricted choice of the genetic code in the face of the huge freedom of choice, but later I realised that precisely this freedom of choice forces any theory of independent origin to predict a huge diversity of genetic codes (11 Jan 11).
"The non-universal genetic codes are not produced randomly, but are derived from the universal genetic code as the result of a series of non-disruptive changes" (Syozo Osawa (1995) 'Evolution of the Genetic Code', p.171). Senapathy's theory would predict that non-universal codes are (1) random, (2) ubiquitous, (3) originated in the beginning. All predictions are wrong.
Michael Lynch (2002) 'Intron evolution as a population-genetic process', Proc Natl Acad Sci U S A. 2002 April 30.
Michael Lynch, John S. Conery (2003) 'The Origins of Genome Complexity', Science 302 21 Nov 2003.
Eugene V Koonin (2006) 'The origin of introns and their role in eukaryogenesis: a compromise solution to the introns-early versus introns-late debate?', Biology Direct. Koonin seems to be a little bit genome-centered too! (with response from Ford Doolittle).
D. Fridmanis et al (2007) 'Formation of new genes explains lower intron density in mammalian Rhodopsin G protein-coupled receptors', Mol Phylogenet Evol. 2007 Jun; 43(3):864-80.
Tom Strachan, Andrew Read (2000) Human Molecular Genetics Second Edition, p.150-152.
Sarah Blaffer Hrdy (2009) Mothers and Others, p. 101.
JE Darnell, Jr (1978) Implications of RNA-RNA splicing in evolution of eukaryotic cells, Science 22 December 1978:
Abstract: "The differences in the biochemistry of messenger RNA formation in eukaryotes compared to prokaryotes are so profound as to suggest that sequential prokaryotic to eukaryotic cell evolution seems unlikely. The recently discovered noncontiguous sequences in eukaryotic DNA that encode messenger RNA may reflect an ancient, rather than a new, distribution of information in DNA and that eukaryotes evolved independently of prokaryotes."
Martin and Koonin (2006) Introns and the origin of nucleus-cytosol compartmentalization, Nature 440, 41-45 (2 March 2006)
Daniel C. Jeffares, Tobias Mourier, David Penny (2006) 'The biology of intron gain and loss', TRENDS in Genetics Vol.22 No.1 January 2006
In theory Senapathy could argue that prokaryotes originated independently and had as many introns as eukaryotes but lost them completely, since several publications argue that clear instances of massive intron loss make the case for complete intron loss in prokaryotes plausible. This implies strong Darwinian selection.
Anthony Poole, Daniel Jeffares, David Penny (1999) Early evolution: prokaryotes, the new kids on the block, BioEssays 21:880-889, 1999.
Morgan Ryan (2011) Unauthorized reproduction not prohibited, American Scientist, Vol 99, p. 30.
M.B. Shapiro, P. Senapathy (1987) "RNA splice junctions of different classes of eukaryotes" is cited in: László Patthy (1999) Protein Evolution, p. 148.
Monya Baker (2011) 'Genomics: Genomes in three dimensions', Nature, 470, 289-294 (10 February 2011)
Tom Misteli (2007) Beyond the Sequence: Cellular Organization of Genome Function, Cell 128, 787-800, February 23, 2007
Tom Misteli (2011) The Inner Life of the Genome Scientific American Feb 2011.
Dan Graur, W H Li (2000) Fundamentals of Molecular Evolution, p.289 and 296.
Guy M. Narbonne (2011) Evolutionary biology: When life got big, Nature 470, 17 February 2011
Eddo Kim (2007) Different levels of alternative splicing among eukaryotes, Nucleic Acids Res. 2007 January; 35(1): 125-131.
Aubrey E. Hill, Eric J. Sorscher (2006) The non-random distribution of intronless human genes across molecular function categories, FEBS Letters.
Ribozymes.
Francis Crick (1981) Life Itself. It's Origin and Nature, Simon and Schuster.
Monroe Strickberger (2000) Evolution, Third edition, p.143.
Michael IP (2005) Intron retention: a common splicing event within the human kallikrein gene family. Clin Chem. 2005 Mar;51(3):506-15.
Gene Splicing Overview & Techniques.
Andreas Wagner (2005) Energy Constraints on the Evolution of Gene Expression, Molecular Biology and Evolution, 22, 6 Pp. 1365-1374
Ner-Gaon H (2004) Intron retention is a major phenomenon in alternative splicing in Arabidopsis, Plant J. 2004 Sep;39(6):877-85.
Knowles DG, 2. McLysaght A (2009) Recent de novo origin of human protein-coding genes. Genome Research
Maren Krull, Jürgen Brosius and Jürgen Schmitz (2005) Alu-SINE Exonization: En Route to Protein-Coding Function, Molecular Biology and Evolution 22, 8 1702-1711
Manfred Eigen (1992) Steps towards Life. A perspective on Evolution, Oxford University Press. (paperback edition: 1996) See also wikipedia article Error threshold.
Richard Dawkins (1986) The Blind Watchmaker, chapter 3.
P Senapathy (1986) Origin of eukaryotic introns: a hypothesis, based on codon distribution statistics in genes, and its implications, PNAS April 1, 1986 vol. 83 no. 7 2133-2137. The abstract is a beautiful succinct summary (written 8 years before his book) of his whole theory!
Francesco Catania, Xiang Gao and Douglas G. Scofield Endogenous Mechanisms for the Origins of Spliceosomal Introns, J Hered (2009) 100 (5): 591-596. Quote: "Spliceosomal introns have also been suggested to directly arise from random primordial and canonical ancestral gene sequences (the 'split-gene model' and the 'proto-splice site model', respectively). In particular, Senapathy (1986)". Further Senapathy (1986) is quoted in: Masaru Tomita et al (1996) 'Introns and Reading Frames: Correlation Between Splicing Sites and Their Codon Positions', Mol. Biol. Evol. 13(9):1219-1223.
Lewin's GENES X, 2011, p. 86.
Donald R. Forsdyke & James R. Mortimer (2000) Chargaff's legacy, Gene (2000) 261, 127-137
Cory Y. McLean et al (2011) Human-specific loss of regulatory DNA and the evolution of human-specific traits, Nature 10 Mar 2011
Manyuan Long and Carl Rosenberg (2000) Testing the 'Proto-splice Sites' Model of Intron Origin: Evidence from Analysis of Intron Phase Correlations, Mol Biol Evol (2000) 17 (12): 1789-1796. Another way of measuring intron phases is expressing exon length as multiples of 3, 3N+1, 3N+2.
Compare with W F Doolittle: 'Genes in pieces: Were they ever together?' Nature 1978, 272:581-582.
Jeffrey M. Perkel (2011) Synthetic Genomes: Building a better Bacterium, Science, 25 Mar 2011
See my review of Information theory and molecular biology by Hubert Yockey.
Eileen E. M. Furlong (2011) Molecular biology: A fly in the face of genomics, Nature 471, 458-459 24 March 2011
S Ragsdale (2011) Biochemistry: How two amino acids become one, Nature, 31 Mar 2011 shows that the stopcodon UAG from methanogenic Archaea encodes the new amino acid Pyrrolysine. So it only has two stop codons. Tetrahymena species recognize only UGA as a stop codon, while Euplotes species recognize only UAA and UAG as stop codons (Joe Salas-Marco et al, 2006).
Ryan E. Mills et al (2011) 'Natural genetic variation caused by small insertions and deletions in the human genome', Genome Research April 1, 2011
Gretchen Vogel (2011) 'Do Jumping Genes Spawn Diversity?, Science, 15 April 2011.
Jevon Plunkett et al (2011) An Evolutionary Genomic Approach to Identify Genes Involved in Human Birth Timing, PLoS Genetics April 2011
Kevin Plaxco and Michael Gross (2006) Astrobiology. A Brief Introduction, page 71: "These amino acids were almost certainly introduced by biochemistry after the origins of life".
Hadas Keren, Galit Lev-Maor & Gil Ast (2010) Alternative splicing and evolution: diversification, exon definition and function, Nature Review Genetics, 11, 345-355 (May 2010)
Elizabeth Pennisi (2011) 'Green Genomes', Science 17 Jun 2011
"It is worth noting that the average length of metazoan exons (125 - 165 bp) is similar to the length of DNA that wraps around a nucleosome (147 bp), which suggests that nucleosome occupancy might confer purifying selection on exon length. However, the length of an average human exon is only 126 bp." From ref 209.
Elçin Ünal et al (2011) Gametogenesis Eliminates Age-Induced Cellular Damage and Resets Life Span in Yeast, Science, 24 Jun 2011.
"The oceans are teeming with viruses — typically, there are 100 billion viral particles per litre of water in the top 50 metres of most marine ecosystems. With an average of ten viruses for each bacterial cell, these parasites impose a tight control over the composition of marine microbial communities. The 'arms race' hypothesis holds that the selective pressure exerted by viruses continuously triggers adaptive mutations in the bacterial genomes, with counteracting genetic adaptations occurring at a similar pace in the parasites." Science 30 jun 2011.
Mingyao Li et al (2011) Widespread RNA and DNA Sequence Differences in the Human Transcriptome, Science, 1 Jul 2011 :"We have uncovered thousands of exonic sites where the RNA sequences do not match those of the DNA sequences, including transitions [changes a purine nucleotide to another purine] and transversions [substitution of a purine for a pyrimidine or vice versa]".
David Deamer (2011) First Life. Discovering the Connections between Stars, cells, and How Life Began. University of California Press, p. 183.
Denis Noble (2006) The Music of Life. Biology Beyond the Genome, Oxford University Press. See my short description on the Introduction page. The first four chapters are the most important, they explain what is wrong with genetic determinism.
See: David Deamer (note 215) page 214: "Can genetic information really appear out of nowhere, by chance?" where he reports the experiments of Bartel and Szostak (1993) "Isolation of new ribozymes from a large pool of random sequences', who began by synthetizing many trillions of different random RNA molecules 300 nucleotides long and found RNAs with catalytic activity. Furthermore, selection and amplification are involved, which are absent from Senapathy's scenario. Deamer concludes "The inescapable conclusion is that genetic information can appear out of random mixtures, as long as there are populations containing large numbers of polymeric molecules with variable sequences of monomers and a way to select and amplify specific property" (p.216). The conditions in this claim are extremely important! The problem here is: how do you get long polymeres in the first place? Long enough to be of catalytic value?
Warm and Cold-Blooded
The Central Dogma of molecular biology certainly reinforces genetic determinism because the direction of the flow of information is from DNA to RNA to proteins. This strongly suggests DNA is in control and there is no feedback.
Which came first, the bird or the smaller genome? 30 August 2007
Craig B. Lowe et al (2011) Three Periods of Regulatory Innovation During Vertebrate Evolution, Science, 19 August 2011.
A story of chromosome number, Nature 477, 9 1 September 2011
Y. Jiao et al (2011) Ancestral polyploidy in seed plants and angiosperms. Nature 473, 97 (2011).
T. E. Wood et al (2009) The frequency of polyploid speciation in vascular plants. Proc. Natl. Acad. Sci. U.S.A. 106, 13875 (2009).
Steven A. Frank (2009) Somatic evolutionary genomics: Mutations during development cause highly variable genetic mosaicism with risk of cancer and neurodegeneration, PNAS January 26, 2010 vol. 107
Centrosome (24 Sep 2011)
Kyle Vogan (2011) Maternal imprinting defect, Nature Genetics 43, 928
James Darnell (2011) RNA. Life's Indispensable molecule, p. 243.
Senapathy knows about the existence of the spliceosome in the Genetics Primer (p. 547, 555), although it does not occur in the index.
Evolution of nested gene arrangements, the ruvinsky lab.
Chun-Long Chen et al (2008) Genomewide Analysis of Box C/D and Box H/ACA snoRNAs in Chlamydomonas reinhardtii Reveals an Extensive Organization Into Intronic Gene Clusters, Genetics May 2008 vol. 179 no. 1 21-30.
Michael B. Clark et al (2011) The Reality of Pervasive Transcription PLoS Biology July 2011.
Eugene V Koonin, Tatiana G Senkevich, Valerian V Dolja (2006) 'The ancient Virus World and evolution of cells', Biology Direct 2006, 1:29. The authors use the words:
- "the primordial pool of primitive genetic elements"
- "the primordial genetic pool"
- "The primordial gene pool"
- "existence of a complex, precellular, compartmentalized but extensively mixing and recombining pool of genes"
- "the primordial, pre-cellular gene pool"
- "viral origin from the primordial genetic pool"
However, "In this pool, RNA viruses would evolve first, followed by retroid elements, and DNA viruses.". So that is different from Senapathy: he starts with complete eukaryotic genomes. It seems that Koonin proposes that there are selfish genetic elements before there are cells! But how can 'selfish' genetic elements exist before cells? How can a virus be selfish if there is nothing to parasitize on? If every virus depends on another virus or cell to replicate, the virus world could not come in to existence. (I need to investigate this important publication! It seems that the authors make the mistake of 'protein-coding genes' (for example reverse transcriptase, capsid proteins) without transcription and translation machinery.)
Stephen J. Freeland, et al (1999) Early Fixation of an Optimal Genetic Code, Mol Biol Evol (2000) 17 (4): 511-518.
A. D. Ellington (2009) Evolutionary origins and directed evolution of RNA, Int J Biochem Cell Biol. 2009 Feb;41(2):254-65. (See also Stuart A. Kauffman about random DNA libraries)
"A long explanation for introns", New Scientist, June 26, 1986 (see google books); "Exon, introns, and evolution", New Scientist, March 31, 1988 (see google books).
Martin Ackermann, Lin Chao (2006) DNA Sequences Shaped by Selection for Stability, PLoS Genetics, February 2006.
Woese, C. R. On the evolution of the genetic code. Proc. Natl Acad. Sci. USA 54, 1546–1552 (1965).
Tobias Warnecke, Laurence D. Hurst (2011) Error prevention and mitigation as forces in the evolution of genes and genomes, Nature Reviews Genetics 12, 875-881 (December 2011)
Brian P. Cusack, Peter F. Arndt, Laurent Duret, Hugues Roest Crollius (2011) Preventing Dangerous Nonsense: Selection for Robustness to Transcriptional Error in Human Genes, PLoS Genetics October 2011
Miodrag Grbic et al (2011) The genome of Tetranychus urticae reveals herbivorous pest adaptations, Nature 479, 487–492 (24 November 2011) Supplementary information Figure S2.4.2.
Richard Cordaux, Mark A. Batze (2009) The impact of retrotransposons on human genome evolution, Nature Reviews Genetics 10, 691-703 (October 2009) is a very usefull overview.
Erez Lieberman Aiden (2011) Zoom! Science 2 December 2011
William A. Wells (2005) There's DNA in those organelles, The Journal of Cell Biology, March 14 2005
Hans Ris and Walter Plaut (1962) Ultrastructure Of DNA-Containing Areas In The Chloroplast Of Chlamydomonas, The Journal of Cell Biology, 1 Jun 1962.
Eugene V. Koonin (2011) The Logic of Chance. Pearson Education, hardback.
Kevin N. Laland, Kim Sterelny, John Odling-Smee, William Hoppitt, Tobias Uller (2011) 'Cause and Effect in Biology Revisited: Is Mayr's Proximate-Ultimate Dichotomy Still Useful?', Science 16 Dec 2011.
RM Schwartz and MO Dayhoff (1978) 'Origins of prokaryotes, eukaryotes, mitochondria, and chloroplasts', Science 27 January 1978: 395-403.
Structure of the Mitochondrial Genome, Genetic Origins website.
"Translation in Escherichia requires the coordinated and complex interactions of at least 100 gene products." from: Ravi Jain, Maria C. Rivera, and James A. Lake (1999) Horizontal gene transfer among genomes: The complexity hypothesis, Proc. Natl. Acad. Sci. USA Vol. 96, pp. 3801–3806, March 1999
Stuart A. Kauffman (2011) Approaches to the Origin of Life on Earth, Life 2011, 1, 34-48 (Open Access)
Joan A. Steitz (2012) RNA Rejoice! Review of RNA Life's Indispensable Molecule by James Darnell, Science 6 January 2012
Ming Zou, Baocheng Guo, Shunping He (2011) 'The Roles and Evolutionary Patterns of Intronless Genes in Deuterostomes', Comparative and Functional Genomics, Volume 2011.
Brian K. Hall, Benedict Hallgrimsson (2008) Strickberger's Evolution Fourth Edition, p. 134.
Dennis W. Grogan (2002) Hyperthermophiles and the problem of DNA instability, Molecular Microbiology.
Scott Freeman and Jon Herron (2007) 'Evolutionary Analysis', page 657.
Martin Egli (2006) Uncovering DNA's 'sweet' secret. One particular curiosity: how did DNA and RNA come to incorporate five-carbon sugars into their "backbone" when six-carbon sugars, like glucose, may have been more common?
Miklos Csuros, Igor B. Rogozin, Eugene V. Koonin (2011) A Detailed History of Intron-rich Eukaryotic Ancestors Inferred from a Global Survey of 100 Complete Genomes, PLoS Computational Biology September 2011.
Igor B. Rogozin, Yuri I. Wolf, Alexander V. Sorokin, Boris G. Mirkin, and Eugene V. Koonin, (2003) Remarkable Interkingdom Conservation of Intron Positions and Massive, Lineage-Specific Intron Loss and Gain in Eukaryotic Evolution, Current Biology, Vol. 13, 1512–1517, September 2, 2003.
NCBI The RNA World and the Origins of Life.
Lindberg J, Lundeberg J. (2009) The plasticity of the mammalian transcriptome, Genomics 2010 Jan;95(1):1-6.
Furthermore, E. Koonin notes that the emergence of spurious (weak) transcription initiation sites in random DNA sequences is relatively easy (given the existence of Transcription factors of course) (p. 240, Note 246).
Aaron E. Engelhart and Nicholas V. Hud (2010) Primitive Genetic Polymers, Cold Spring Harbor Perspectives in Biology, 12 May 2010.
Aaron Klug (2004) 'The Discovery of the DNA Double Helix', Journal of Molecular Biology, Volume 335, Issue 1, 2 January 2004, Pages 3-26 (available as: klug-DNA.pdf)
The standard laboratory technique to isolate DNA includes digestion with proteinase which removes all proteins (histones).
How Many People Have Ever Lived On Earth? Population reference Bureau, assessed 9 Feb 2012.
Monya Baker (2012) Functional genomics: The changes that count, Nature 482, 257–262 09 February 2012
Kerstin Lindblad-Toh et al (2011) A high-resolution map of human evolutionary constraint using 29 mammals, Nature 478, 476–482 (27 October 2011)
Jurka J, et al (2007) Repetitive sequences in complex genomes: structure and evolution, Annu Rev Genomics Hum Genet. 2007;8:241-59.
Christian de Duve (2005,2006) Singularities. Landmarks on the Pathways of Life, p.81.
Eric S. Lander (2011) Initial impact of the sequencing of the human genome, Nature, 470, 187–197 (10 February 2011)
James Collins (2012) Synthetic Biology: Bits and pieces come to life, Nature 483, S8–S10 (01 March 2012)
Gregory P. Wilson, et al (2012) Adaptive radiation of multituberculate mammals before the extinction of dinosaurs, Nature 483, 457–460 (22 March 2012)
Tom Strachan, Andrew Read (2011) Human Molecular Genetics. Fourth Edition, on p. 274 is a table with 7 examples. (Info).
In theory, a protein or enzyme could originate spontaneously from amino acids. However, non-essential amino acids, such as Glycine, must be synthesized from precursors (in this case from Serine) by enzymes (in this case by serine hydroxymethyltransferase, SHMT). Even if all 20 amino acids were present, the precise order of the amino acids would be too unlikely to originate by chance. One needs genes for that.
Pregnancy: Why Mother's Immune System Does Not Reject Developing Fetus as Foreign Tissue, Sciencedaily, 7 Jun 2012.
Chris Todd Hittinger (2012): "As plants invaded land, lignin provided the rigidity necessary for vascular plants to grow above their rivals and move water and nutrients over long distances. Lignin is a dizzying web of polymerized phenylalanine derivatives with dozens of combinations of modifications and cross-links that make wood structurally sound ", Science 29 June 2012
Kenneth A. Johnson (2012) 'Biochemistry: DNA replication caught in the act', Nature 487, 177–178 (12 July 2012)
Anne-Ruxandra Carvunis et al (2012) Proto-genes and de novo gene birth, Natue 19 Jul 2012.
Dong-Dong Wu et al (2011) De Novo Origin of Human Protein-Coding Genes, PLOS Genetics, November 10, 2011. "Here we identify 60 new protein-coding genes that originated de novo on the human lineage since divergence from the chimpanzee, supported by both transcriptional and proteomic evidence."
William A. Shear (2012) Palaeontology: An insect to fill the gap, Nature, 488, 34–35 (02 August 2012)
Mark Isalan (2012) Systems biology: A cell in a computer, Nature, 2 Aug 2012
Christof Koch (2012) Modular Biological Complexity, Science 3 August 2012. Koch discusses the complexity of living systems which are characterized by large numbers of highly heterogeneous components, be they genes, proteins, or cells. He applies his analysis to the human brain, but it can equally applied to the origin of an eukaryotic organism.
Alla Katsnelson (2010) Epigenome effort makes its mark, Nature 467, 646 (2010) 6 October 2010
Amy Maxmen (2012) Cancer research: Open ambition, Nature News feature 8 August 2012. (a story about cancer drug discoverer Jay Bradner). "Such control systems generally involve three types of protein: 'writers', 'readers' and 'erasers'. Writers attach chemical marks, such as methyl groups (to DNA) or acetyl groups (to the histone proteins that DNA wraps around); readers bind to these marks and influence gene expression; erasers remove the marks".
Kai Kupferschmidt (2012) Attack of the Clones, Science 10 August 2012: "Fungi have long been seen as the least interesting pathogens, but two catastrophes in the animal world have changed that view." "In just the past 5 years, scientists have discovered fungi affecting rattlesnakes, land crabs, avocado trees, cultured abalone, and the eggs of sea turtles".
The ENCODE Project Consortium (2012) An integrated encyclopedia of DNA elements in the human genome, Nature 489, 57–74 (06 September 2012): "These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions.". One of the more remarkable findings is that 80% of the genome contains elements linked to biochemical functions, dispatching the widely held view that the human genome is mostly 'junk DNA' (Genomics: ENCODE explained). "Comparative genomic studies suggest that 3–8% of bases are under purifying (negative) selection and therefore may be functional". 13 Jun 2023: Warning: 80% soon appeared to be totally wrong! See Note 401.
Elizabeth Pennisi (2012) 'ENCODE Project Writes Eulogy for Junk DNA', Science 7 September 2012: "the Encyclopedia of DNA Elements (ENCODE), has found that 80% of the human genome serves some purpose". "The latest protein-coding gene count is 20,687, with hints of about 50 more, the consortium reports in Nature. Those genes account for about 3% of the human genome [including introns], less if one counts only their coding regions. So, if introns are excluded the percentage of DNA consisting of genes probably is about 0.3% of the total DNA in the human genome. "As a result of ENCODE, Gingeras and others argue that the fundamental unit of the genome and the basic unit of heredity should be the transcript–the piece of RNA decoded from DNA–and not the gene". 13 Jun 2023: Warning: 80% soon appeared to be totally wrong! Note 401.
Laurence Moran 'The Random Genome Project' and the original source is: Sean Eddy (8 Sep 2012): "The experiment that I'd like to see is the Random Genome Project. Synthesize a hundred million base chromosome of entirely random DNA, and do an ENCODE project on that DNA. Place your bets: will it be transcribed? bound by DNA-binding proteins? chromatin marked? Of course it will." Sunday, September 09, 2012. Addition: a random sequence may bind a transcription-factor, but that may not result in transcription.
The ENCODE Project Consortium An integrated encyclopedia of DNA elements in the human genome, Nature 489, 06 September 2012. 13 Jun 2023: Warning: 80% soon appeared to be totally wrong! See Note 401.
Richard Dawkins (2012) The descent of Edward Wilson, Prospect, May 24, 2012.
Maurizio Zanetti, Navin R. Mahadevan (2012) Immune Surveillance from Chromosomal Chaos? Science 28 September 2012
Tim R. Mercer et al (2009) Long non-coding RNAs: insights into functions, Nature Reviews Genetics 10, 155-159 (March 2009)
Vivien Marx (2012) 'Epigenetics: Reading the second genomic code', Nature 1 Nov 2012.
"But DNA works with many partners, including 'epigenetic' factors, which influence gene expression in ways that don't involve changes to the underlying sequence" and: "... to control the activity of particular genes". Genes do not control gene expression? Who does the controlling? “This development has heightened awareness about the good technologies needed to study how the genetic code is put into action says Adam Petterson ! ... BET proteins belong to a class of epigenetic reader that targets histones, recruits multi-protein complexes to the spot where they attach and instructs cellular processes involved in reading genetic information ... So far, 96 histone methyltransferases have been identified in humans...”
The 1000 Genomes Project Consortium (2012) An integrated map of genetic variation from 1,092 human genomes, Nature 1 Nov 2012.
We now know that the haploid human genome has 3,080 (male) or 3,022 (female) million base pairs.
Nina V. Fedoroff (2012) Transposable Elements, Epigenetics, and Genome Evolution, Science 9 Nov 2012 (free access)
Dirk Schübeler (2012) Epigenetic Islands in a Genetic Ocean, Science 9 November 2012
Scott B. Vafai, Vamsi K. Mootha (2012) Mitochondrial disorders as windows into an ancient organelle, Nature 491, 374–383 (15 November 2012)
Charles Robert Darwin (1809–1882). Origin of Species. XIV. Mutual Affinities of Organic Beings: Morphology–Embryology–Rudimentary Organs.
See section genome sequencing and mapping in wikipedia. In 1995 the first eukaryotic genome, the budding yeast Saccharomyces cerevisiae, was completed.
Peter Langridge (2012) Genomics: Decoding our daily bread, Nature, 491, 678–680 (29 November 2012)
Warnecke T, Weber CC, Hurst LD. Why there is more to protein evolution than protein function: splicing, nucleosomes and dual-coding sequence, Biochem Soc Trans. 2009 Aug;37(Pt 4):756–61. (very useful, educational powerpoint presentation!).
Wenqing Fu et al (2012) Analysis of 6,515 exomes reveals the recent origin of most human protein-coding variants, Nature, Published online 28 November 2012
Panagiotis Papasaikas, Juan Valcárcel (2012) Splicing in 4D, science, 21 Dec 2012
Joshua B. Plotkin, Grzegorz Kudla (2011) 'Synonymous but not the same: the causes and consequences of codon bias', Nature Reviews Genetics 12, 32-42 (January 2011)
Dong-Dong Wu, David M. Irwin, Ya-Ping Zhang (2011) De Novo Origin of Human Protein-Coding Genes, PLoS Genetics 7(11)
David G. Knowles and Aoife McLysaght (2009) Recent de novo origin of human protein-coding genes, Genome Res. 2009. 19: "This is the first evidence for entirely novel human-specific protein-coding genes originating from ancestrally noncoding sequences. We estimate that 0.075% of human genes may have originated through this mechanism leading to a total expectation of 18 such cases in a genome of 24,000 protein-coding genes."
Dan Graur et al (2013) On the immortality of television sets: "function" in the human genome according to the evolution-free gospel of ENCODE, Genome Biology and Evolution , 20 Feb 2013.
The difference between prokaryotes and eukaryotes correlates well with effective population sizes, genome size and the ability of natural selection to remove "junk DNA". Large genomes belonging to species with small effective population sizes should contain considerable amounts of junk DNA. See: Note 309. So, Senapathy found "junk DNA" in eukaryotes and not in prokaryotes and concluded that eukaryotes looked more like random DNA than prokaryotic DNA. In fact the cause of the difference is population size.
Amy Maxmen (2013) RNA: The genome's rising stars, Nature 04 April 2013. 13 Jun 2023: Warning: "three-quarters of the human genome is transcribed into non-coding RNA" appeared to be totally wrong! See Note 401.
Gert Korthof (2012) New origin of life model fatally requires a nonrandom protein, 24 Dec 2012. (blogpost)
Is a variation on: "Surely You're Joking, Mr. Feynman!": Adventures of a Curious Character. Amazingly, Senapathy knows full well that "When the embryo is developed in the uterus, the embryo is attached to and supported by the placenta which transfers nourishment from the mother to the developing embryo.". (page 307). He knows!
The idea of a 'stop codon' implies a release factor which is a protein that allows for the termination of translation by recognizing the stop codon in an mRNA. So, to produce one molecule of the release factor (from DNA), the release factor must already be present. That's a vicious circle. The human release factor eRF1 gene is 50.768 basepairs long and the protein is 437 Amino Acids long and has a specific, non-random, sequence. So, it cannot arise spontaneously. Mathematically speaking, every sequence can arise in a serie of trials long enough. But that is not the point. The abiotic synthesis of those specific DNA- or protein-sequences is the problem to solve.
Somatic Genome Mosaicism: But even within an individual, genomic variation exists. When individual cells of an organism have mutations this is called 'Genome Mosaicism'. For example: Somatic mosaicism can be caused by L1 transposition during embryogenesis. James R. Lupski (2013) 'Genome Mosaicism–One Human, Multiple Genomes', Science 26 July 2013.
Robert M. Brucker, Seth R. Bordenstein (2013) 'The Hologenomic Basis of Speciation: Gut Bacteria Cause Hybrid Lethality in the Genus Nasonia', Science 9 August 2013.
Kathleen L. McCann (2013) Mysterious Ribosomopathies Science 23 August 2013
Mitochondria Versus Nucleus, The Scientist, February 15, 2013
M. Paul Smith, David A. T. Harper (2013) Causes of the Cambrian Explosion, Science, 20 Sep 2013.
DNA Packaging: Nucleosomes and Chromatin, Nature Scitable.
"RNA polymerases are intricate molecular machines that transcribe DNA into RNA, combining RNA synthesis with the precise movement of a DNA template across their active site. Eukaryotic cells (those of animals, plants and fungi) have several RNA polymerases, each dedicated to the production of specific RNAs. RNA polymerase I (Pol I) synthesizes the ribosomal RNA component of the cell's protein-producing factories and so is crucial for cell survival, growth and proliferation; malfunction of Pol I can cause cell death or support the unrestrained proliferation characteristic of cancer cells". Nature 31 Oct 2013.
Kostas Kampourakis (Editor) (2013) The Philosophy of Biology: A Companion for Educators (History, Philosophy and Theory of the Life Sciences), Springer, Introduction, p. 3.
Tibor Gánti (2003) The Principles of Life, Oxford University Press, hardback. page 17. These are very beautiful insights from Gánti decades ago!
Robert J. Weatheritt, M. Madan Babu (2013) The Hidden Codes That Shape Protein Evolution, Science 13 Dec 13: "The authors determined that ~14% of the codons within 86.9% of human genes are occupied by transcription factors. Such regions, called "duons", therefore encode two types of information: one that is interpreted by the genetic code to make proteins and the other, by the transcription factor-binding regulatory code to influence gene expression. This requirement for transcription factors to bind within protein-coding regions of the genome has led to a considerable bias in codon usage and choice of amino acids, in a manner that is constrained by the binding motif of each transcription factor."
Tim Lenton, Andrew Watson (2011) Revolutions That Made the Earth, Oxford University Press. 448 pp. (Reissue edition: paperback 2013). (Chapter 4 and 4.6 Summary: the whole system view.)
Another fundamental problem is that not every ORF produces a functional product (protein) due to biochemical noise. They are spurious ORFs and are not real genes . A random ORF could really be a random meaningless DNA sequence. See: Michele Clamp et al (2007) Distinguishing protein-coding and noncoding genes in the human genome PNAS December 4, 2007. A random GC-rich sequence (50% GC) of 2 kb has a ≈50% chance of harboring an ORF ≈400 bases long. See Supporting Information Figure 4 for the graph of ORF length statistics.
Kepa Ruiz-Mirazo and Alvaro Moreno (2006) 'On the Origins of Information and Its Relevance for Biological Complexity', Biological Theory 1(3) 2006, 227–229. However, single-stranded RNA could contain 'information' because it can fold. But, Senapathy is only concerned with DNA, not RNA.
Li Zhao (2014) Origin and Spread of de Novo Genes in Drosophila melanogaster Populations, Science 14 February 2014
Narayana Annaluru et al (2014) Total Synthesis of a Functional Designer Eukaryotic Chromosome, Science 4 April 2014: "Here, we report the synthesis of a functional 272,871 base pair designer eukaryotic chromosome, synIII, which is based on the 316,617 base pair native Saccharomyces cerevisiae chromosome III.". Other researchers have synthesized a bacterium's full genome, but the yeast job is orders of magnitude more complex. Furthermore, compare a yeast chromosome with human chromosome 21: 48 million nucleotides!
Elizabeth Pennisi (2014) Building the Ultimate Yeast Genome, Science 4 April 2014: To increase the genome's stability, they took out mobile DNA elements, such as retrotransposons, introns and other noncoding DNA. It took Codon Devices more than eleven months to deliver a 90,000-base circular chromosome.
Viruses are intracellular parasites and contain no ribosome, produce no energy and do not divide. See also: Pandoraviruses: Amoeba Viruses with Genomes Up to 2.5 Mb Reaching That of Parasitic Eukaryotes. Pandoraviruses contain more than 1000 protein-genes including 54 DNA-processing proteins and seven virus-encoded amino acid–transfer RNA (tRNA) ligases. The largest known viral genome Pandoravirus salinus contains 2541 protein-genes. The big number of genes does not make them alive; they still depend on living cells for reproduction.
Andrew G. Clark (2014) Genetics: The vital Y chromosome, Nature 508, 463-465 (24 April 2014): "Most noteworthy is their observation that the sex chromosomes of placental mammals, birds and monotremes had essentially independent origins, which means that patterns of gene loss and of specific retention of classes of genes on their Y (or W) chromosomes can be compared."
Compare with Daniel G. Gibson and J. Craig Venter (2014): "A biological cell is much like a computer – the genome can be thought of as the software that encodes the cell's instructions, and the cellular machinery as the hardware that interprets and runs the software." Nature 8 May 2014 (Synthetic biology: Construction of a yeast chromosome)
Jocelyn Kaiser (2014) The Hunt for Missing Genes, Science 16 May 2014. "the average person carries about 100 incapacitated genes–and in 20 of those cases, both the maternal and paternal copies of a gene are missing, creating a complete knockout."
Already at this point in the story the canonical genetic code is assumed. Or: rather some genetic code, because at that point in the origin of life any genetic code is possible. See: The elephant in the room. As an example: "reassignment of all three stop codons was found" in: Natalia N. Ivanova (2014) Stop codon reassignments in the wild, Science 23 May 2014. The effect of a stopcodon reassignment to an amino acid is a longer protein. A stop codon reassignment could be in the nuclear or mitochondrial DNA. [23 May 2014]
It is easy to forget that introns never get spliced out in the DNA, but only in the mRNA! Remark inserted 25 Jun 2014. Furthermore, catalytic RNAs play a dominant role in this processing, which represents a major involvement of ribozymes.
Wolf Reik & Gavin Kelsey (2014) Epigenetics: Cellular memory erased in human embryos, Nature 31 Jul 2014.
Mycoplasma: the codon UGA encodes the amino acid tryptophan instead of the usual stop codon. 31 Jul 2014
Matt Kaplan (2012) DNA has a 521-year half-life, Nature, 10 Oct 2012 1 Aug 2014
Bloom K, Joglekar A. (2010) 'Towards building a chromosome segregation machine', Nature, 2010 Jan 28
Suzanne Clancy DNA Transcription, Scitable, Nature Education.
However, if 'no other causal factor' than DNA is required to produce a human being, then 'the organism does –in a sense– compute itself from its genes'! So, what's the problem? He contradicts himself. For Senapathy the problem is quite different and serious: he has no cellular machinery for reading DNA. 9 Aug 2014
Genomes. 2nd edition. Chapter 7 Understanding a Genome Sequence. Stewart Scherer (2008) A short guide to the human genome states 16.995 to 21.461 nucleotides (bases). (p. 25)
Joanna L. Kelley, et al (2014) Compact genome of the Antarctic midge is likely an adaptation to an extreme environment, Nature Communications, 12 aug 2014. See Supplementary tables.
Addy Pross (2012) What is Life? : "All modern life forms depend critically on this interdependence. DNA, the nucleic acid in which all heritable information is coded, cannot replicate without the elaborate involvement of protein enzymes, and those proteins cannot be generated without the prior existence of the DNA molecule, which codes for those enzymes. ... The RNA-world hypothesis appears to resolve this dilemma... " Chapter 5. 26 Aug 2014.
Emily Singer (2014) Chemists Seek Possible Precursor to RNA, Quanta magazine, Feb 5 2014.
Thousands of never-before-seen human genome variations uncovered, Science Daily, 10 Nov 2014.
Guojie Zhang et al (2014) 'Comparative genomics reveals insights into avian genome evolution and adaptation', Science 12 December 2014
Hui Y. Xiong et al (2015) 'The human splicing code reveals new insights into the genetic determinants of disease ' and editorial: Roderic Guigo, Juan Valcarcel (2015) Prescribing splicing, Science 9 Jan 2015. "These [SNVs] include synonymous changes within protein-coding sequences, generally assumed to be functionally neutral, as well as missense or nonsense changes whose effects on protein expression may be more dramatic than anticipated because of their impact on the splicing process."
Ed Yong (2015) Microbiology: Here's looking at you, squid, Nature, 14 January 2015
Michael Weinreich (2015) Molecular biology: DNA replication reconstructed, Nature, News & Views, 26 March 2015. The core replication machinery has been conserved throughout evolution, from yeast to mammals.
Edward J. Grow et al (2015) Intrinsic retroviral reactivation in human preimplantation embryos and pluripotent cells, Nature, 11 Jun 2015. "transcripts of HERVH, and of its regulatory element LTR7, were detected before EGA ". (Embryonic Genome Activation)
The cells' toolbox for DNA repair. Nobel Prize Org Press Release 7 October 2015. See also: Popular Information: DNA repair – providing chemical stability for life (pdf), and "Before his (Lindahl) work, "I don't think anybody really considered the idea that DNA requires active engagement by a set of housekeeping processes to keep it in a stable state," says Keith Caldecott" in Nature, 7 Oct 2015.
Robert J. Weatheritt, M. Madan Babu (2013) The Hidden Codes That Shape Protein Evolution, Science 13 December 2013
By the way, successful human fertilization requires an interaction between two unique proteins: Karsten Melcher (2016) When sperm meets egg, Nature 15 Jun 2016. Two proteins are: Izumo1, which is produced by sperm, and Juno, its receptor on eggs.
David S. Booth & Nicole King (2016) Evolution: Gene regulation in transition, Nature 23 Jun 2016
Fateful imprints, Science 13 Jan 2017
Suzan Mazur (2017) Eugene Koonin: The New Evolutionary Biology, Huffington Post, 02/03/2017. [my criticism: Koonin's genomes evolve in a vacuum, they do not live on a planet, and are not influenced by down-to-earth factors as physical, geochemical and climatological factors.]
Yasutaka Kakui, Frank Uhlmann (2017) Building chromosomes without bricks, Science 23 Jun 2017
Jeremy L. England (2013) Statistical physics of self-replication, The Journal of Chemical Physics, Volume 139, Issue 12 2013.
Tejas Dharmaraj, Katherine L. Wilson (2017) How chromosomes unite, Nature NEWS AND VIEWS 29 November 2017
Senapathy knows the existence of a Startcodon, he shows a Start codon in figure 10 on page 555 of the Genetics primer. A Start codon determines together with a Stop codon the Open Reading Frame (ORF). Calculation: on the basis of the frequency of stop codons, there are on average 3 ORFs of 21 codons in every sequence of random DNA of length 64 codons. Adding Start codons with frequency of 1 in 64, 1 of the 3 ORFs is truncated at position 10 on average. Taking together: 21 + 21 + 10 = 52 divided by 3 = 17 codons. So, the predicted average length of ORFs is 17 codons. This is smaller than predicted on the basis of Stop codons alone. 25 Feb 2018
According to Senapathy the left splice site is 9 bases long and the right one is 4 bases long (page 243). 1 Mar 2018
Circadian organization of the genome, Science 16 Mar 2018 ("The clock protein Rev-erbα regulates genome folding to establish circadian gene repression.")
However, according to Karo Michaelian (2017), photochemical pathways to nucleobases exist. (Thermodynamic Dissipation Theory of the Origin and Evolution of Life, p.124.).
"except bacteria and viruses, but including humans": please note that the genome of the MS2 bacteriophage virus is about as simple as a genome can get: 3600 bp. But according to Senapathy's theory the genome of this virus cannot, and a human genome of 3.2 billion bp can arise spontaneously. 28 Jun 2018
Bruce Alberts (2010) Model Organisms and Human Health, Science 24 Dec 2010.
K.J. Willis, J.C. McElwain (2002) The Evolution of Plants. , p.249. (8.3 Why no mass extinction in the plant fossil record?) 23 Nov 2018
It has been found that an ant-mimicking jumping spider that secretes a nutritious milk-like substance on which its young offspring are entirely dependent. The spider also continues to care for the spiderlings as they mature and become independent. Thus, this type of maternal care may be more widespread than has been assumed. Zhanqi Chen et al (2018) Prolonged milk provisioning in a jumping spider, Science 30 Nov 2018 30 Nov 2018
Procambarus clarkii (belongs to the Crustacea), the newly hatched stay attached to their mother. 26 Dec 2018
Samantha R. Edwards, Tracy L. Johnson (2019) 'Intron RNA sequences help yeast cells to survive starvation', Nature News and views, 16 Jan 2019: "Approximately 5% of protein-coding genes in yeast contain introns, and only nine contain more than one. By contrast, 90% of genes in mammals contain introns, with an average of eight introns per gene."
See also: 'Indian scientists protest against unscientific claims made at conference', Nature News 9 Jan 2019.
B. Kazemi (2011) Genomic Organization of Leishmania Species.
Matthew Cobb (2013) 1953: When Genes Became "Information", Cell, Volume 153, Issue 3, 25 April 2013, Pages 503-506.
Joshua B. Benoit, Mathias Kölliker, Geoffrey M. Attardo (2019) Putting invertebrate lactation in context, Science 8 Feb 2019.
Erich Bornberg-Bauer, Brennen Heames (2019) Becoming a de novo gene, Nature Ecology & Evolution 3, 524–525 (2019). 4 Jun 2019
Ken Garber (2019) Epigenetics comes to RNA. Science 5 Jul 2019. 5 Jul 2019
Human Genetics Revolution Tells Us That Men and Women Are Not the Same, Oct 24 2013. 22 Jul 2019
Linkedin profle Periannan Senapathy 1-1-2020
Michael Eisenstein (2020) How to build a genome, Nature, 24 February 2020. "At present, the only solution is to let living cells do the hard work of assembling fragments: Saccharomyces cerevisiae! ... that is not possible in the Senapathy scenario!
Lars Bode (2020) Understanding the mother-breastmilk-infant 'triad', Science, 6 Mar 2020.
Alan J. Warren (2020) DNA-repair enzyme turns to translation, Nature 26 Feb 2020
Jelke J. Fros (2021) An adaptive compromise - Conflicting evolutionary pressures on arthropod-borne Zika virus dinucleotide composition in mammalian hosts and mosquito vectors.
Paul Davies (1999) Life Force, New Scientist, pp 27-30. 18 September 1999. Please note Senapathy's book was published 1994. See also his "The FIfth Miracle" (1999) chapter 4.
Avihu H. Yona, Eric J. Alm, Jeff Gore (2018) Random sequences rapidly evolve into de novo promoters, Nature. Of course, as we know by now, this works only in a fully functioning genome in a fully functioning living organism. 5 Aug 2021
This is the email I received 6 Sep 2021 from Periannan Senapathy:
- "Dear Gert,
  
  For business reasons, I had to search for my name on Google today. When I did, I observed that your website was the third to be displayed on the results page, and the caption under the website title stated the following, As you know, life in the animal kingdom is classified into the successive ranks of Phylum, Class, Order, Family, Genus, and Species. The core prediction of my theory is that fundamentally distinct organisms with unique body plans originated independently. By fundamentally distinct, I refer to the organisms that biologists had to separate into the highest-level taxa such as phylum and class. Humans are not classified in a high-level taxon, but the lowest rank of a species.
  
  For the reason outlined above, I would appreciate it if you could remove the statement "including humans" at the end of your description. It is a very prominent statement, which diminishes the core principle of my theory. What I had intended to stress in my book was that any fundamentally unique body plan, from the 'simplest' invertebrate to the most 'complex' vertebrate, must have originated independently. Nevertheless, organismal evolution would have worked on these body plans over millions of years, giving rise to numerous low-level taxa such as families, genera, and species, including humans. I hope this clarifies my view.
  
  Best regards,
  
  Sena".
However, in his book there is no explicit statement that humans did not originate from the primordial pond. And why. This review is about his 1994 book, and not about new interpretations decades later. Removing those two words would imply that around 8.7 million species of plants and animals have been created by evolution and are not independently born. This is in contradiction with the statement "Evolutionary Theories Are Fundamentally Incorrect" on the cover of his book and the first 4 chapters of his book. His 1994 theory would be changed beyond recognition. I will blog about this.
Senapathy's request to remove humans is a blogpost about Senapathy's request. 19 Sep 2021.
Are humans produced in Senapathy's Primordial Pond? Yes and No!, blog 27 sep 2021.
Chromosome centromeres are inherited epigenetically, Sciencedaily, Nov 9, 2011.
Tet methylcytosine dioxygenase 3 is an example of a protein that plays a role in DNA demethylation. Especially right after an egg and sperm have come together to form a zygote. 26 May 2022
"...in this case, find out what the smallest unit of the genome is necessary for a cell to know where it is in the body... how Hox genes help cells to learn and remember where they are. ... Different species have different structures and shapes, a lot of which depends on how Hox clusters get expressed.". Source: Scientists engineer synthetic DNA to study 'architect' genes, ScienceDaily, June 30, 2022
Rebecca Muir, Alan Diot, Joanna Poulton (2016) Mitochondrial content is central to nuclear gene expression: Profound implications for human health. Bioessays. 31 Aug 2022
(a) Nick Lane (2022) 'Transformer: The Deep Chemistry of Life and Death', Chapter 6. "Mitochondria are now wholly integrated into their host cells, and their function is central to everything that cells do; not only energy, but also biosynthesis" (Chapter 6). The mitochondrion is much, much more important for the falsification of the theory of independent origin than I ever imagined. It was not apparent because references to mitochondrion and the mitochondrial genome are scattered all over this page.
(b) "Even if the complete mammoth genome could be reconstituted successfully, letter by letter, it wouldn't work properly in elephant cells unless the mitochondrial genes were also replaced." (Chapter 6).
(c) Quote from note 4 of chapter 6. page 208. (pdf) 3 Sep 2022
Stuart Kauffman (1995) At Home In The Universe, page 38 bottom. 6 Sep 2022
Gert Korthof (2022) Did Nick Lane solve the origin of life? 5 Sep 2022. In this blog I point out that "Solving the origin of life is showing that non-enzymatic chemical reactions can create the basic chemicals of life. There can be no enzymes involved at that phase because enzymes are proteins. ... and they are coded by DNA." Ganti wrote already: "a living orgaism can never be developed from genetic material alone, i.e. only from chromosomes or the nucleus." (THE PRINCIPLES OF LIFE, 2003, hardback, OXFORD UNIVERSITY PRESS, p.126)
Charlie Wood (2020) Cosmic Rays May Explain Life's Bias for Right-Handed DNA, Quanta Magazine, June 29, 2020.
List of genetic disorders (wikipedia). 20 Nov 2022
Francis Crick (1988) What Mad Pursuit, Basic Books, p. 142. Jan 1, 2023
A Kondrashov (2017) Crumbling Genome, chapter 5.1 Fidelity of DNA replication. Jan 8, 2023
Futuyma, Kirkpatrick, Evolution, 5th ed, 2023 p.279. Figure 10.19. Feb 9, 2023
Laurence A. Moran (2023) 'What's in Your Genome? 90% of Your Genome Is Junk', University of Toronto Press, Release Date: May 16, 2023 chapter 5 (309/943) Jun 6, 2023
The reason for the update is that I discovered (1) part of it was based on wrong information of John S. Mattick and others, see: Note 401, and (2) Senapathy discussed 'Regulation of gene expression' in his 'Genetics Primer' on page559–563. (3) the paragraph likely contained information not avaible in 1994. This is the original text of the paragraph Regulatory sequences:

"Genes are useless if they are not expressed and if the expression is not regulated. Even if one has found all 20,000 human genes in random DNA, that still would be useless.
It has been found that introns account for at least 30% of the human genome and may be a significant, perhaps major, source of regulatory noncoding RNAs (116). An experiment concerning the relationship between introns and coded proteins provided evidence that some non-coding DNA is just as important as coding DNA. This experiment consisted of damaging a portion of noncoding DNA in a plant which resulted in a significant change in the leaf structure because structural proteins depended on information contained in introns (113).
Genomes don't just encode proteins. They also give rise to various groups of noncoding RNAs that can regulate gene expression. Short RNAs that form from enhancer sequences might be one such class of regulatory RNA. The concomitant increase in non-coding content of the genome with organismal complexity supports the proposition that evolutionary innovations and expansion of regulatory RNAs were fundamental to the genetic programming of complex eukaryotes (293). As far as I know Senapathy ignored promotors (near to a gene's transcription start site), enhancers (far away) and transcription factor binding sites. Without regulatory sequences a DNA sequence is not a genome."
Denis A. Malyshev, et al (2014) A semi-synthetic organism with an expanded genetic alphabet, Nature, 7 May 2014. New base pair: pair formed between d5SICS and dNaM (d5SICS–dNaM).
Diana Kwon (2023) How scientists are hacking the genetic code to give proteins new powers, Nature.
How urea may have been the gateway to life, Sciencedaily, June 28, 2023.
PLOS ONE is an Open Access Peer-reviewed online journal. See paragraph 27 on this page: PLOS ONE article. Added: 30 Jun 2023
Wikipedia pages: Shapiro Senapathy algorithm (this page has several issues), Split gene theory (his book is mentioned), Life (his book is mentioned). Added: 30 Jun 2023
Senapathy writes: "When a very highly complex cell with a nucleus could be formed by a large genome containing an extremely large number of genes from the USP, it is quite probable to form an organelle [mitochondrion] with a far smaller set of genes directly from the primordial pond's genetic sequences. I am convinced that this is what had happened." Chapter 7 p.248. That's all! It's so simple! Added: 3 Jul 2023
Tomas Lindahl (1993) Instability and decay of the primary structure of DNA, Nature, 22 April 1993. Note: this is one year before Senapathy published his book. Added: 5 Jul 2023
A Kondrashov (2017) Crumbling Genome. Chapter 5 'Struggle for Fidelity'. (blog)
De novo genes are mentioned in Futuyma, Kirkpatrick (2023) Evolution fifth ed. page 267 with references. Added: 18 Jul 2023
Larry Moran (2023) 'What's in Your Genome? 90% of Your Genome Is Junk', Chapter 5 The big picture. This is a conservative estimate, other sources have much higher estimates. Added: 23 Jul 2023
Kara L. McKinley, Iain M. Cheeseman (2015) The molecular basis for centromere identity and function. Added: 3 Aug 2023
Geoffray Monteuuis et al (2019) The changing paradigm of intron retention: regulation, ramifications and recipes, Nucleic Acids Research. Added: 4 Aug 2023
In the Appendix Senapathy writes: "Genes constitute only a small proportion, typically less than five percent of the whole genome. These are only rough estimates. The rest is considered 'junk DNA', whose origin has been so far unclear." (Genetics Primer, page 543). Added: 11 Aug 2023
The concepts do not occur in Minkoff (1983). Minkoff uses 'epigenotype' in a quite different meaning. "Although many researchers proposed that DNA methylation might regulate gene expression, it was not until the 1980s that several studies demonstrated that DNA methylation was involved in gene regulation and cell differentiation" (Nature). Added: 11 Aug 2023.
As an afterthought Senapathy writes in note 117: "Remember, whenever we refer to immutable genome, we mean that the genome of one organism is not changeable to the genome of another organism with a new gene or a new body structure. The genes and the sequences in the genome can mutate, but the genome itself, speaking functionally, does not mutate." Added: 13 Aug 2023
Senapathy really thinks that biochemical machinery is present in the primordial pond to read DNA seqeunces: "These prebiotic chemical processes could also have led to the primitive but complex molecular machineries made up of prebiotically synthesized RNAs and proteins, not coded by DNA, capable of reading the messages contained in DNA sequences..." (page 206 chapter 6). Added: 16 Aug 2023
Amazingly, Senapathy is aware that he needs splicing sequences: "In our experiments in which we illustrated that we can find the genes of today's living animals and plants in random sequences, we have not included the splice junction sequences at the junctions of the exons and introns. We can certainly consider an experiment that includes the splice junction sequences. Although we have not done such an experiment, it is certain to yield similar results." (page 282 chapter 7). So, he knows things without doing the experiment! Added: 16 Aug 2023. Senapathy algorithm undermines his own theory of independent birth of organisms in the primordial pond blogpost Added: 11 Aug 2025
"It is quite conceivable that the genomes of sexually reproducing multicellular organisms were also assembled as eukaryotic cells." (page 300 chapter 8) Senapthy uses "conceivable" 9 times in his book. See also Note 466. In the pdf of his book hyphenated words at the end of a line will not be found! So, a search should be repeated for different hyphenations of that word. Added: 17 Aug 2023
Apparently, surprisingly, Senapathy knows: "In sexually-reproducing animals, development always begins with a single cell called the zygote. In most cases, the male (sperm) and female (egg) sex cells, called gametes, each containing a single (haploid) set of chromosomes, unite to produce the diploid zygote, which develops into the embryo that forms the offspring." (page 307, chapter 8) Added: 17 Aug 2023
But natural selection in this form means only to survive or not to survive. But this not enough. Survivors must reproduce otherwise they do not contribute to the next generation. That means in the majority of cases sexual reproduction. Added: 18 Aug 2023
On the Wikipedia page Split Gene Theory the name "Senapathy" occurs 47 times! This is extreme! Added: 22 Aug 2023
Senapathy rejects evolution: "Biologists have always been taught to believe that evolution takes a path from simple to complex systems. Because of the frame of mind and the world view about evolution and the fossil record, scientists were forced to take the simple → complex evolutionary route to explain how complex life evolved starting from the 'primitive' prokaryotes.' (page 517). Added: 23 Aug 2023
"but because of Darwin's subtly convoluted way of laying out his arguments and facts, filling gaps in knowledge by making extensive use of imaginary scenarios" is a quote from Richard G. Delisle, James Tierney (2022) Rereading Darwin's Origin of Species: The Hesitations of an Evolutionist, first chapter. In the same way as Darwin was not fully an evolutionist and incorporated many anti-evolutionary ideas into his work, Senapathy incorporated many evolutionary ideas into his anti-evolutionary theory. Added: 27 Aug 2023
He is not a biologist. Maybe his lack of a typical education in biology was, in part, what enabled him to arrive at his distinctive views on the subject. I think if he had had a more conventional education, if he'd come up through the ranks and had taken the standard biology courses and so on, maybe he would have done less interesting work. See: David L. Chandler (2023) Could the Universe be a giant quantum computer?, Nature 25 August 2023 (is about computer scientist and physicist Edward Fredkin). Added: 2 Sep 2023
Sleeping embryonic genomes are awoken by OBOX proteins, Nature 17 July 2023. Added: 2 Sep 2023
Tiny mineral inclusions picture the chemical exchange between Earth's mantle and atmosphere, Science Daily Aug 31, 2023. Added: 3 Sep 2023
If one searches for 'the origin of species', the results are found only in his discussion of Darwin and in the Notes. Not in his own theory! Revealingly, the title of his book is: 'Independent Birth of Organisms', not: 'Independent Birth of Species'! It is easy to overlook this! Added: 4 Sep 2023
Albert Eschenmoser (1925–2023), Science 14 Sep 2023. He investigated alternatives for DNA, and concluded that "hat the evolutionary choice of RNA and DNA was made from a diversity of constitutionally related alternatives on the basis of functional criteria.". Added: 15 Sep 2023
He conveniently provided his Primordial pond with unlimited powers: Senapathy's primordial pond has no limits, it has unlimited power, it is all-powerful. There are no limiting environmental factors. This is not a good property for a theory. A good scientific theory describes limits. Natural selection is not all-powerful: it can be overpowered by random genetic drift, especially with small effective population sizes. Natural selection cannot produce perfect adaptations. Natural selection cannot eliminate (all) junk DNA. Added: 20 Sep 2023 See also 457.
The SRY gene: suppose the SRY gene would end up not in the Y-chromosome but in one of the 22 autosomes! Possibly, all individuals would be males. Because of the effect of SRY, there could be no females. End of a possible species. Added: 6 Nov 2023
Magdalena Zernicka-Goetz, Roger Highfield (2021) 'The Dance of Life: Symmetry, Cells and How We Become Human', WH Allen. Added: 12 Nov 2023
Silvia Monticelli, Petr Cejka (2024) DNA sensing and repair systems unexpectedly team up against cancer", Nature 10 Jan 2024. 18 Jan 2024
review of 'Gene Machine The Race to Decipher the Secrets of the Ribosome' on NHBS website.
Brendan R. Camellato et al (2024) Synthetic reversed sequences reveal default genomic states, Nature 6 March 2024. 11 Apr 2024
Kseniia Dudnyk et al (2024) Sequence basis of transcription initiation in the human genome, Science, 26 Apr 2024. Perspective: How DNA encodes the start of transcription. Transcription is much more complex than Senapathy imagined. "Furthermore, by comparing human and mouse data and sequence conservation across 241 mammalian species, we show that the transcription initiation rules are conserved across mammalian species." This points to common descent (of mammals) contrary to independent origin. 26 Apr 2024
Samuel C. Ginther et al (2024) Metabolic loads and the costs of metazoan reproduction, Science, 16 May 2024. 16 May 2024
Carissa Wong (2024) How much energy does it take to make a baby? Researchers are rethinking what they know. Nature, Open access. "...When you switch to live-bearing, there's a significant increase in the indirect costs — that's the cost of holding the baby inside you longer and moving around and carrying it,...". "A lack of women in the field might have led researchers to pay less attention to the indirect costs of reproduction, Marshall suggests."!!! 25 Oct 2024
Elizabeth Pennisi (2024) 'Dark proteome' survey reveals thousands of new human genes, Science 29 Nov 2024: Database confirms that overlooked segments of the genome code for a multitude of tiny proteins.
Mitch Leslie (2019) New universe of miniproteins is upending cell biology and genetics. Science 17 Oct 2019: Tiny proteins help power muscles and provide the toxic punch to many venoms. "to avoid a data deluge, past researchers typically excluded any ORF that would yield a protein smaller than 100 amino acids in eukaryotes or 50 amino acids in bacteria.
- Fruit flies rely on a microprotein with 11 amino acids to grow normal legs.
- yeast could make more than 260,000 molecules with between 2 and 99 amino acids
- finding more than 600,000 short ORFs in the fruit fly genome
- microproteins suggest protogenes can form when mutations create new START and STOP signals in a noncoding portion of the genome.
GK: a 'data deluge' because there apparently is a huge number of random small ORFs! 29 Nov 2024
Tim Lenton (2016) Earth System Science: A Very Short Introduction. (The nitrogen cycle).
Donald E. Canfield (2014) Oxygen: A Four Billion Year History, Princeton University Press. 23 Dec 2024
Michael W. Grome et al (2025) Engineering a genomically recoded organism with one stop codon, Nature, 5 February 2025.
Tobin J Hammer, Jon G Sanders, Noah Fierer (2019) Not all animals need a microbiome. 1 Apr 2025.
Laurence D. Hurst (2025) The Evolution of Imperfection: The Science of Why We Aren't and Can't Be Perfect, Princeton University Press.
Mammals, however, are odd. We overuse and conserve the least good one (TGA). Why? from: Laurence Hurst (note 446).
Sean R. Eddy (2013) The ENCODE project: Missteps overshadowing a success, Current Biology, 8 April 2013.
- "Suppose we put a few million bases of entirely random synthetic DNA into a human cell, and do an ENCODE project on it. Will it be reproducibly transcribed into mRNA-like transcripts, reproducibly bound by DNA-binding proteins, and reproducibly wrapped around histones marked by specific chromatin modifications? I think yes."
Frank C. Schroeder (2025) How bacteria subvert plant immunity, Science. To prevent their detection, bacteria inhibit plant enzymes with a small molecule. 18 Apr 2025
- Senpathy discusses the immune system in Chapter 9. On page 388 he writes: "These are the cells that attack and destroy any invading organisms, viruses or bacteria" and "Some of these creatures have antibacterial proteins and genes specific to them." without noting that –in his theory– bacteria appeared only after eukaryotes."
Wikipedia article: Lynn Margulis. [19 Apr 2025]
This quote from chapter 7 of Senapathy's book clearly shows how he solves problems:
"By simple logic, when a complex system such as that of splicing could come into being, then it is at least equally probable for the origin of the simpler systems of transcription and translation in the primordial soup. All these basic systems must have come into being before the organization of the first living cells. Without these machineries a living cell cannot be assembled." (page 248).
So, having accepted one unproven hypothesis, he simply states that another wildly improbable hypothesis is equally likely and accepts it also as true! [25 Apr 2025]
For example, in living cells, naked DNA needs to be stabilized by proteins (histones). He simply assumes that those highly specific proteins are present in the Primordial Pond.
"One might doubt whether all these enzymatic activities, discussed above, could have occurred in the primordial pond before the DNA-coded proteins were synthesized. However, we must remember that when the primordial soup attained highly complex chemical activities, it must have contained almost all the catalytic activities that we can think of." (page 213)
About the stability of DNA he writes:
"Further, the DNA molecule is highly stable even by itself in the test tube, indicating that such a stability is the inherent nature of the DNA molecule. Also, protective mechanisms similar to those existing in the chromosomes of living cells could have existed in the primordial pond. One would logically expect that DNA must have been abundant and very stable before complex cells were developed." (page 214).
This is wrong. DNA appears to be stable because of DNA-repair mechanisms. Just one year before Senapathy published his book, Nobel prize winner Tomas Lindahl published his groundbreaking article: Instability and decay of the primary structure of DNA in Nature. See my blog. See also: note 353. [6 May 2025]
"It is plausible that one or a few of the myriad DNA sequences in the primordial pond, by chance, might have had the message for coding a protein with DNA polymerase activity. (...) Similarly, the messages for the transcription enzymes and the more complex translation machinery could have been present in random DNA." (page 211)
But that doesn't help! It is the vicious circle! To transcribe a protein-coding DNA sequence, one must first have transcription enzymes! And it doesn't help in the slightest degree that transcription enzymes are encoded in DNA! It is the vicious circle! [7 May 2025]
"All these might have been expressed from DNA sequences by the primitive non-DNA-coded enzymes and nucleoprotein complexes of the primordial pond." (page 211).
(my bold). Please note the word 'primitive'!
"After the DNA-coded enzymes had evolved, the replication of the DNA must have become more efficient and produced an abundance of the authentic DNA-polymerizing enzymes." (page 212)
(my bold). So, he solves the origin of DNA-based life by assuming that all the necessary enzymes are simply present in the Primordial Pond! He blatantly uses an evolutionary process ('primitive', 'evolved')! [7 May 2025]
"In fact, such a switch from non-DNA-coded machineries to DNA-coded machineries must have been inevitable, over a period of time, even though the non-DNA-coded machineries were primitive and slow." (page 211).
Please note: "over a period of time", "primitive and slow": that is evolutionary thinking! Please note "inevitable": how so? [7 May 2025]
"The importance of all these considerations is not only that they could form complex single-celled organisms, but that if appropriate genomes were organized in such cells, these cells could directly give rise to multicellular organisms." (page 219)
(my bold) [7 May 2025]
Creation science and Intelligent Design also want to explain everything with one simple principle ('GOD'). Evolutionary biology knows its limits. For example, there is a chapter 'The Limitations of Evolution Theory' in John Maynard Smith (1992) DID DARWIN GET IT RIGHT? Essays on Games, Sex and Evolution, and a chapter 'Michael Ruse: Is there a limit to our knowledge of evolution?' in But is it Science?. In chapter 20 of Stephen Stearns, Rolf Hoekstra (2005) Evolution, an introduction is a discussion of some unsolved problems in evolution. Kostas Kampourakis (2020) Understanding Evolution has a chapter about the limits of science and questions not answered by evolutionary theory (chapter 7). Douglas Futuyma, Mark Kirkpatrick (2023) Evolution. Fifth Edition: most chapters have a 'What we don't know' section. [24 May 2025]
Haiqing Xu, et al (2023) Chance promoter activities illuminate the origins of eukaryotic intergenic transcriptions, Nature 01 April 2023. [28 May 2025]
However, he did know that "The genome of a human being contains 3 x 10⁹ nucleotides." (page 566) and "The human genome is believed to express approximately 50,000 genes" (page 544). Remarkably, the expression 'human genome' occurs only twice in his book! (one in the Appendix and a second one in Notes and References). Also the expression 'human DNA sequence' occurs twice in the book (Genetics Primer, which is an Appendix). I could not find the fact that humans have 46 chromosomes, nor 23 pair of chromosomes, nor '46XY', '46XX'. On page 167 he mentions Turner's syndrome (45XO) and 'Kleinfelter syndrome' (47XXY) (this must be: 'Klinefelter'). So, from his information it can be derived that a healthy human female must have a 46XX karyotype, but he does not mention it explicitly. [30 May 2025]
He knows: "Take for example a chicken. When an egg is laid ..." (my italic) (Appendix, page 536). But, then you need a mother first! Where does she come from? An egg? And where does that egg come from? [31 May 2025]
"The set of all possible reactions among the chemicals must occur randomly, the only limitation being the availability of the chemicals and the inherent reaction rate of each reaction. Because the primordial pond must have been abundant in these elements, there was no dearth of reactants." page 208 Chapter 6. Again, on page 222 he mentions "Enormous quantities of DNA material must therefore have come into existence in the primordial ponds, although the total amount possible in a typical pond is within a finite limit (approximately 10³⁵ nucleotides). It seems that this limit is in fact indistinguishable from infinite. Please note he uses the plural: primordial ponds. [3 June 2025]
On the other hand, he uses the phrase "primordial ponds" 27 times in his book. [3 June 2025]
"However, the autotrophs – cells and organisms (mainly plants) that are capable of directly converting the earth's chemicals to food (by photosynthesis) without eating other organisms – must have originated in the primordial pond at least at around the same time as, if not before, the animals. Once autotrophs originated in the primordial pond by the same principles of the independent birth of animals we have alluded to, then the heterotrophs, animals that eat other living forms for food, can become viable." footnote 7 of Chapter 8 page 604. [4 June 2025]
"Many ponds may have produced life during that fertile period of the earth eons ago, but the life from only one pond survived until today. All living creatures suddenly erupted from that pond, and simply walked, swam, flew or flowered away to fill the earth with the awesome power and beauty of organic Nature." (page 534 chapter 12) [7 June 2025]
The subtitle of his book reads "A New Theory That Distinct Organisms Arose Independently From the Primordial Pond". And in the book: "According to the new theory, different sets of largely the same genes could be assembled into entirely different DG pathways leading to unique organisms that are immutable." (page 348). [9 June 2025]
"It is easily conceivable that the rich broth of the primordial pond could be the common source of many such basic functions of life." page 312. Further, he uses "it is quite possible" 6 times. "Independent assembly of genomes in the open-ended primordial pond leads easily to highly complex organs in organisms". (page 348). [10 June 2025]
Ewen Callaway (2025) Rare 'ambidextrous' protein breaks rules of handedness, Nature 29 May 2025.
- Most proteins are left-handed, but scientists have found an ancient molecule that works in both mirror-image forms.
  "Many chemicals have a handedness, or chirality, and can exist in two mirror-image forms. But the building blocks of life tend to stick to one or the other. Sugars in nucleic acids such as DNA are right-handed – causing the DNA double helix to twist to the right, if you were looking down its axis – whereas the amino acids that build proteins are left-handed."
- "There's also a crazy explanation," Longo says. The motif could have evolved when 'mirror life' – containing left-handed nucleic acids and right-handed proteins – existed on Earth. "That would completely change what we think about the ecology of the pre-LUCA times. That's kind of a wild thing," he adds. If ambidexterity is a common feature of other ancient DNA- and RNA-binding proteins, it could add support to the idea." [12 June 2025]
Transfer RNAs (tRNAs) are key components of the translation machinery. They read codons on messenger RNAs (mRNAs) and deliver the appropriate amino acid to the ribosome for protein synthesis. The human genome encodes more than 500 tRNA genes. They are transcribed by RNA Polymerase III (Pol III) and go through a series of maturation steps and posttranscriptional modifications to become fully active". From: Adrian Gabriel Torres (2019) Enjoy the Silence: Nearly Half of Human tRNA Genes Are Silent. [14 June 2025]
"One of the most important principles that we should note here is that if one typical gene could probabilistically occur in the USP, then almost any gene for any particular biochemical function – almost an unlimited supply of distinct genes for multitudes of unique biochemical functions – would occur in the USP." (page 289). Please note: "biochemical functions"! suggesting once you have a gene, you have a biochemical function! DNA is all you need! [15 June 2025]
By the end of the 1970s scientists concluded on the basis of experiments that most of the human genome must be junk. Senapathy used this fact as the basis of his theory. His theory could not have been invented if most scientists at the time thought that most of our DNA was functional. In the 1990s the genomics area began. In 1992 The complete DNA sequence of yeast Saccharomyces cerevisiae chromosome III was published. Sequencing of the entire yeast nuclear genome was then completed by early 1996 through a massive, collaborative international effort. The first multicellular eukaryote, and animal, to have its whole genome sequenced was the nematode worm Caenorhabditis elegans in 1998. The genome of the lab mouse Mus musculus was published in 2002. In 2001 the draft human genome sequence was published. When Senapathy published his book in 1994, no complete genome sequences of any eukaryote including humans was known (459). At the time only gene sequences (GenBank) and protein sequences were known. [Note 470 added 17 June 2025]
The section Agreements was originally included in the Summary, but it distracts too much from my main argument against independent origin. I moved it to Note 471.
Uncontroversial facts accepted by mainstream science:
- Science should seek a natural explanation of the origin of life
- prokaryotes very rarely have introns, have high gene densities, low-entropy, compact, relatively small genomes (246, p.53, p.229).
- eukaryotes in general have many large introns, have low gene densities, have high-entropy, noisy, large genomes (246, p.53, p.229), 310, although the yeast genome is highly streamlined in comparison with those of most other eukaryotes (371), (287), (288).
- According to Laurence Moran (2023) (412) 3% of the sequence of an eukaryotic gene consists of exons and 97% of introns. That is roughly in agreement with what Senapathy estimated.
- it is mathematically 100% certain to find the sequence of any eukaryotic gene in computer generated random strings of A,T,C,G if the random sequence is long enough (262), (326)
- theoretically stopcodon statistics in random computer DNA logically follow from the premiss that there are 3 stop codons in 64 codons
- As pointed out by Paul Davies in an important article (384), in order to contain information, a DNA genome cannot have repeating patterns, that is it cannot contain much compressible information, it must be highly random in a information-theoretical sense.
- Experiments with random DNA:
  - Any random DNA sequence of sufficient length will contain transcription-factor binding sites (309) and Random sequences rapidly evolve into de novo promoters (385).
  - a large proportion of random sequences have promoter activities in the eukaryote Saccharomyces cerevisiae (458).
  - Deciphering eukaryotic gene-regulatory logic with 100 million random promoters (511)
- DNA can code for anything (that's the beauty of DNA!), any protein large or small. Small proteins are the most easy to find in random DNA. It is estimated that there are some 3000 miniproteins (microproeins) in the human genome (440),(441).
- "Over the last decade, it has become apparent that new genes may emerge de novo from parts of eukaryotic genomes that were previously non-coding, such as intergenic regions" (376). It has been found empirically in Drosophila melanogaster that 59.9% of random 800-bp intergenic sequences were associated with a ≥ 150-bp single-exon Open Reading Frame (ORF) (328) which could theoretically produce a 50 amino acid long peptide.
- Recently it has been claimed by a genome researcher that a hundred million base chromosome of entirely random DNA will be transcribed, bound by DNA-binding proteins and the chromatin marked (289).
- Recently, it has been demonstrated for the first time that three human genes have arisen de novo from noncoding DNA (308), (411). but the human genome is not in a primordial pond! And 3 genes do not equal a genome!.
- some mainstream scientists use the concept "primordial gene pool" (233) (so what?)
- Last but not least: random mutations and random genetic drift play an important role in the modern theory of evolution.
  [Note 471 added 17 June 2025]
What's wrong with a DNA-centric view? Philip Ball (2024) How Life works blog 12 February 2024 [18 June 2025]
Senapathy: "I am convinced that this is what had happened." page 248. [19 June 2025]
Senapathy: "Looking at the fact that all living entities – even the simplest bacteria or the bacterial viruses – have a majority of proteins longer than 200 amino acids, in fact as long as 3000 amino acids, it is obvious that splicing must have originated in the primordial soup, before the very first cell could ever start to live." (page 248). But this conclusion is only warranted if the truth of his theory (abiotic synthesis of DNA, etc. etc. etc.) was already established. Which is not the case. So, what he does is: assuming the truth of his theory, and then using that as an argument that splicing machinery must have been present in the Primordial Pond! The abiotic origin of splicing must be produced experimentally in the lab! Such a fact cannot be derived by 'logic'. [20 June 2025]
Senapathy: "Let us shed our traditional beliefs of always going from simplicity to complexity. Let us open our minds to the rationale of complexity first and simplicity next." (page 249). It seems he is using this as a natural law, subsequently deriving facts from it. It seems that what he is saying amounts to: the complexity of the writings of Shakespeare is the same as all Twitter posts because they all use the same words but in a different order. [20 June 2025]
Senapathy: "Biologists have always been taught to believe that evolution takes a path from simple to complex systems. ..." page 517. [20 June 2025]
Eugene V Koonin (2007) The cosmological model of eternal inflation and the transition from chance to biological evolution in the history of life, Biology Direct May 31 2007.
- Conclusion: "Despite considerable experimental and theoretical effort, no compelling scenarios currently exist for the origin of replication and translation, the key processes that together comprise the core of biological systems and the apparent pre-requisite of biological evolution. The RNA World concept might offer the best chance for the resolution of this conundrum but so far cannot adequately account for the emergence of an efficient RNA replicase or the translation system." [21 June 2025]
Senapathy: "We have said consistently that an independently-born organism is immutable. However, it is possible that an organism may loose some characteristics and still be viable. For instance, the legs of an animal can be lost due to some mutations" (page 364). If mutations do occur in DNA, genomes are not immutable. Since mutations are the result of the biochemistry of the nucleic acids, mutation is a fundamental property of life. One cannot construct mutations as a special case, as an exception to the rule. Secondly, mutations contradict the stability of DNA (see here). Thirdly, humans did not originate in the Primordial Pond (387, 388), so they must have evolved from an ancestor, and that involves a lot of mutations. Contradicting immutability again. Similarly, prokaryotes did not originate in the Primordial Pond, implying that they must have evolved from eukaryotes and that requires a lot of mutations. [26 June 2025]
A serious contradiction is that the title of his book "...That Evolutionary Theories Are Fundamentally Incorrect" contradicts with statements in his book such as: "we now know that evolution explains only a small portion of what it purports to explain. While evolution can account for only some aspects of life..." Chapter 12 Conclusion page 521. [24 Nov 2025]
Problems common to independent origin and evolution: The human genome encodes at least 16 different DNA polymerases, and at least 31 proteins that detach the two DNA strands form each other (helicases) and many other DNA-handling proteins (399). Evolution must explain how complex organisms with large genomes could evolve from simpler ones without or with less repair/proofreading enzymes and higher mutation rates. Independent origin must be able to calculate what the probability is for the origin of a random genome that accidently codes for a complex organism and the right repair enzymes.
A. G. Cairns-Smith (1985) Seven clues to the origin of life (paperback) is a popular popular-science book about the origin of life. It was published 9 years before Senapathy published his book, but it is absent from his book. A very important conclusion: "Biochemically as well as chemically these [DNA,RNA] are evidently difficult moleucles to make: it takes many steps to manufacture even just their nucleotide units from the simpler central moleucles of biochemistry." (page 114). "Nucleotide units do not join together on their own, even with the help of heat agitation. To get them join up they have to be primed ..." (page 23), etc. [29 June 2025]
Unique heat adaptations: The DNA of some hyperthermophiles is made more heat stable by (1) adjusting the base composition (2) supercoiling of DNA (3) heat stable molecules and membranes, (4) production of 2,3-diphosphoglycerate, (5) chaperones. David A. Wharton (2002) Life at the limits. Organisms in extreme environments, page 148–149. [1 Jul 2025]
The hard problem: see the Koonin threshold and the Eigen paradox is one of the most intractable puzzles in the study of the origins of life. Eigen published in 1971, so Senapathy could have known the paper. He quotes Eigen (1976). [1 Jul 2025]
Senapathy: "Out of a billion species that have so far appeared on earth (including those that have become extinct), several million creatures may have originated independently in the primordial pond, and from each distinct creature came many varieties and similar species by a number of mechanisms – natural selection, genetic drift, mutations in trivial genes such as those that affect the coat thickness, color, or body size. It is possible that some "new" creatures can originate from the independently-born creatures by losing body parts or functions that are not crucial to the life of the organism. Slight modifications of the body parts or functions should also have been possible. Thus, evolution has played only a minor role in the origin of organisms on earth, and it is the independent birth of organisms that has played the major and most important role." page 533 Chapter 12 Conclusion. (my bold). [2 Jul 2025]
'Eons': 'an indefinite and very long period of time', or: 'a period of time that is so long that it cannot be measured'. [2 Jul 2025]
Chromosome-specific centromeric patterns define the centeny map of the human genome, Science, 3 Jul 2025.
Senapathy: "Organisms born first in the primordial pond will have unique features; those born later will inevitably be constructed with some basic features of the organisms born earlier"; "Pieces of already successful genomes can become part of newly assembled genomes in the primordial pond"; "However, once functional genomes become available, then it becomes unavoidable that these shall be used in pieces in the construction of additional new genomes" (page 321). [7 Jul 2025]
Senapathy: "many species within a genus usually connectable by evolution and many families within an order are sometimes connectable by evolution" (page 461 chapter 10). This also contradicts the title of his book. [7 Jul 2025]
Senapathy:
- "There were absolutely no contiguous genes (as found today in prokaryotes) in the primordial pond." (page 247).
- "The analyses of the distribution of all the codons including stop codons prove that the DNA sequences in the genomes of eukaryotes are random." (page 250).
- "...which unequivocally proves that numerous genomes were indeed constructed independently from a common pool of genes." (page 376)
- "I began investigating DNA and proteins, using the computer to simulate random sequences, to prove that genes could in fact occur in the primordial pond." (page 4)
Nobody can know with certainty how first life originated, because biochemistry is not fossilized. At most, scientists can construct reasonable hypotheses. A big mistake: no one can prove with computers what happened in the 'primordial pond' billions of years ago! [7 Jul 2025]
Senapathy: "Our new theory of the independent birth of organisms rests on absolutely sound scientific principles and corroborating facts, and is a perfect fit with all of what we know about life on earth, both past and present." (page 373) and: "But now we have the minimum amount of such information with which we can prove the theory, and this is all we will ever need to show what happened in the primordial pond eons ago." (page 532-533). [7 Jul 2025]
Senapathy knows HOX genes on page 411 and on page 412 he states: "that these systems independently originated from the common pool of homeobox-containing genes in the open primordial pond.". A contradiction in one sentence: "independent" and "common"! Several key publications about HOX genes appeared in 1984 (ten years before the publication of his book) and the discovery of the features of HOX gene organization in clusters were published in 1989 (5 years before his book). (See references in Sean B. Carroll ( 2005) Endless Forms most beautiful, page 313). Please note that the HOX clusters are not in random order. Senapathy overlooked this important fact. [8 Jul 2025]
Senapathy: "Thus life seems to have originated in many ponds on the primitive earth, when the conditions were right – possibly at distinct locations and at different geological times. In each pond, life originated independently of life in other ponds. And in every pond, numerous unique creatures were born, each independently of others in the pond." (page 502). [10 Jul 2025]
Senapathy: "Thus, eventually the primordial pond will be depleted and the birth of creatures will cease. However, by then a large repertoire of creatures would be stably living in all the nooks and corners of the earth." (page 331). [10 Jul 2025]
[12 Jul 2025] For example statements with 'could'/'should'/'would'/'must have been'/'probably did contain' and contrary to facts. Examples of 'facts' frequently appear in headings:
- "The biochemical complexity of the primordial pond increased tremendously over geological time. The DNA, RNA, and protein molecules, among others, were prebiotically synthesized by random physicochemical reactions among elements and molecules of the primordial soup". (page 219)
- "The very first cells were highly complex eukaryotic cells! (page 239)
- "Countless different genes existed in the very large universal sequence pool: The universal gene pool" (page 298)
- "Unicellular eukaryotes arose directly from the primordial pond" (page 299)
- "Each new creature is established from one or more male/female pairs directly born from the seed cells assembled in the primordial pond" (page 321)
- "Primordial chemical reactions on earth, approximately several hundred million years ago, produced a primordial pond with enormously large amounts of long DNA sequences," (page 202)
- 493a "The first genes were split genes and the first cells were eukaryotic cells. Computer analysis of DNA sequences reveals that the very first genes in the primordial pond were split into coding (exon) and intervening (intron) sequences". Paragraph heading chapter 7 page 230.
  These are factual statements! It is impossible to say anything factual about the primordial pond. A scientist can only put forward hypotheses. Computer calculations can never result in factual statements about the empirical world, let alone about events 4 billion years ago. Senapathy knows very well the difference between 'fact' and 'theory' when he describes symbiosis: "This was theorized by some scientists two decades ago, notably by Lynn Margulis." (1970) The origin of eukaryotic cells. Note 13 on page 598. Here, he describes competing ideas as 'theories' and his own theory as a 'fact'. Compare this with creationists: 'It's only a theory". [24 aug 2025].
When a theory is constructed to explain certain facts, is it correct to say that the theory predicts those same facts? I am afraid not. Of course the theory 'predicts' those facts, because the theory was specifically designed to explain those facts. A really interesting theory predicts new facts which were not known at the time the theory was constructed. This matter must be studied carefully. For example, does the theory predict 3 universal stopcodons? etc. [13 Jul 2025]
Mária Trexler et al (2023) Evolution of termination codons of proteins and the TAG-TGA paradox, Nature, Published: 31 August 2023. For the stop codon frequencies the authors refer to publications between 2012 and 2022. [14 Jul 2025]
Johanna M Enright et al (2023) Low Complexity Regions in Proteins and DNA are Poorly Correlated, Molecular Biology and Evolution, Volume 40, Issue 4, April 2023 Open Access. [14 Jul 2025]
- "LCRs can present as periodic repeats, ambiguous cryptic repeats, or can contain no apparent pattern at all, but simply deviate from a randomized composition. LCRs contain low information and have a low entropy" (...) "mononucleic codons such as AAA have an entropy of zero, dinucleic codons such as AGA have an entropy of 0.918, and trinucleic codons such as AGC have an entropy 1.58." This is highly relevant for the assessment of the randomness of genomes.
Davide De Lucrezia et al (2012), Do natural proteins differ from random sequences polypeptides? Natural vs. random proteins classification using an evolutionary neural network, PLoS One 2012. Open Access Peer-reviewed. [15 Jul 2025]
Olaf Weis et al (2000) Information Content of Protein Sequences, Journal of Theoretical Biology. [15 Jul 2025]
Wikipedia: Prebiotic synthesis of purine ribonucleosides: "Nam et al. (2018) demonstrated the direct condensation of purine and pyrimidine nucleobases with ribose to give ribonucleosides in aqueous microdroplets, a key step leading to RNA formation. Also, a plausible prebiotic process for synthesizing purine ribonucleosides was presented by Becker et al. in 2016." [16 Jul 2025]
Sidney Becker et al (2016) A high-yielding, strictly regioselective prebiotic purine nucleoside formation pathway Science 13 May 2016.
Elise Cutts (2025) Molecular fossils offer first glimpse of how life survived Snowball Earth, Science, 15 Jul 2025.
Robert Shapiro (1986) Origins. A Skeptic's Guide to the Creation of Life on Earth, on Cyril Ponnamperuma: "As we have noted, his approach to the area is pervaded by an optimism and a sense of cosmic purpose that seems to come from some inner faith." (hardback page 187). Please read pages 186–189 and 108. Senapathy met Ponnamperuma in person. [18 Jul 2025]
I described what it means to explain the origin of life in a review of Nick Lane's Transformer: The Deep Chemistry of Life and Death. [21 Jul 2025]
"A splice site defines the boundary between a coding exon and a non-coding intron in eukaryotic genes." quote from wikipedia article: Shapiro–Senapathy algorithm (This page was last edited on 28 July 2025, at 14:35 (UTC).) So, the mere fact that the Shapiro–Senapathy algorithm exists, proves that there are non-random patterns in the genes and consequently are not random DNA, and disproves the theory of the origin of eukaryotes from random DNA. [29 Jul 2025]
In the original publication M. B. Shapiro, P. Senapathy (1987) RNA splice junctions of different classes of eukaryotes: sequence statistics and functional implications in gene expression, Nucleic Acids Res 1987, they write: "A sequence of eight nucleotides is highly conserved at the boundary between an exon and an intron ... The boundary between an intron and an exon also exhibits a highly conserved sequence of 4 nucleotides, preceded by a pyrimidine-rich region." [1 Aug 2025]
Wikipedia page Periannan Senapathy (accessed 18 Jul 2025). There is not a single hint on this page that the theory is at odds with mainstream science, that it may be a controversial theory. On the contrary: (mainstream) scientists are quoted who would accept (part of?) his theory. Furthermore, a layman would not discover that his theory is anti evolution. The page is a defense of the theory from start to finish. The Wikipedia page is purely for self-promotion purposes. There is not a single critical note on the page. It violates Wikipedia's neutral point of view (NPOV) rule. Even worse: on the Web archive can be verified that there was a link to a critical review (my review), but as of 2021 the only critical note had disappeared. This is against the five pillars of wikipedia: Wikipedia is written from a neutral point of view, We avoid advocacy, etc. [4 Aug 2025]
Personal communication Rolie Barth 09-08-2025 13:02. [11 Aug 2025]
Richard Feynman: "The first principle is that you must not fool yourself and you are the easiest person to fool." (WikiQuote). Feynman emphasized the importance of intellectual honesty and avoiding self-deception, especially in scientific endeavors. He argued that it is crucial to be honest with oneself when evaluating evidence and theories, as it's easy to fall into the trap of believing what one wants to be true. [18 Aug 2025]
Quote from John Archibald (2014) One Plus One Equals One: Symbiosis and the Evolution of Complex Life. page 47. [20 Aug 2025]
With hindsight, it is understandable why he was forced to explain everything. He started with explaining the origin of split genes –genes with introns and exons– from random DNA. Split genes only occur in eukaryotes. All eukaryotes have split genes. Consequently, he is forced to explain the origin of all eukaryotes. But this idea cannot be applied to the origin of prokaryotes, because prokaryotes don't have split genes. So, he is forced to come up with an alternative explanation for prokaryotes: they originated from eukaryotes by losing introns. His theory does not allow for the origin of eukaryotes from prokaryotes because the theory already explained the origin of eukaryotes from random DNA. Since eukaryotes arose from random DNA, they have no ancestors. As a consequence random DNA must have originated from chemical building blocks. But that is nothing less than the orgin of life! So, he ended up –whether he wanted to or not– explaining the origin of life. [20 Aug 2025]
Carl G. de Boer (2019) Deciphering eukaryotic gene-regulatory logic with 100 million random promoters Nature Biotechnology. The authors tested some 100 million random sequences, each of which were 80 nucleotides in length for their ability to drive expression of a fluorescent protein in yeast (Saccharomyces cerevisiae). [22 Aug 2025]
Origins of life: the molecules that could have unlocked peptide synthesis, Nature 27 Aug 2025.
"This is for this same reason that genes themselves should not be considered as alive, but that cells and organisms should be, as the former are not able to self-replicate in isolation." from: Daniel J.M. Crouch, Walter F. Bodmer (2024) Evolution by natural selection is a scientific law and not just a theory, Academia Biology. February 08, 2024.
Ting Zhu (2025) Mirror of the unknown: should research on mirror-image molecular biology be stopped?, Nature 15 Sep 25. "Hundreds to thousands of cellular components – including proteins, nucleic acids, membranes, metabolites and complex carbohydrates called glycans – would need to be synthesized chemically or enzymatically in their chirally inverted forms. Some of these are encoded directly by DNA. But many are synthesized or modified by other complex biological machinery, meaning their compositions and structures cannot simply be derived from DNA sequences."
Yi Qiu et al (2024) The GC-content at the 5' ends of human protein-coding genes is undergoing mutational decay Genome Biology 13 August 2024. 24 Sep 2025
Christian Schlötterer (2015) Genes from scratch – the evolutionary fate of de novo genes, Trends in genetics April 2015. 27 Sep 2025
Frameshift mutation. A frameshift mutation is the deletion or insertion of 1 or 2 bases. All codons translated downstream of a frameshift mutation will be misread, (that is produce different amino acids) and frequently an out-of-frame stop codon will prematurely terminate translation. 27 Sep 2025
Sean R. Eddy The ENCODE project: Missteps overshadowing a success, Current Biology 2013. 28 Sep 2025
Michael Marshall (2020) How the first life on Earth survived its biggest threat – water, Nature 9 Dec 2020. "Living things depend on water, but it breaks down DNA and other key molecules. So how did the earliest cells deal with the water paradox?" Important article: keep reading. 2 Oct 2025
updated 3 Oct 25
Patricia Heyn (2014) The Earliest Transcribed Zygotic Genes Are Short, Newly Evolved, and Different across Species: "In all metazoans, the fertilized embryo is provided with proteins and RNAs deposited by the mother during oogenesis. These maternal stores support early embryonic cell divisions before the onset of transcription at zygotic genome activation (ZGA)." 5 Oct 25
Patricia Heyn et al (2014) The Earliest Transcribed Zygotic Genes Are Short, Newly Evolved, and Different across Species, Cell Reports. 14 Oct 25
Alexandra Kühnlein, Simon A Lanzmich, Dieter Braun (2021) tRNA sequences can assemble into a replicator, Computational and Systems Biology, Structural Biology and Molecular Biophysics. 15 Oct 25
The ilustration on page 323 is a prefect example of story telling and wishfull thinking: "Using pieces of genomes from first-born organisms to construct new genomes of later-born organisms."
- "Each of the broken DNA pieces from this genome, because it most likely contained some genes required for the construction of a living organism, had a far greater value than an equally-sized random DNA sequence in forming a genome for a multicellular organism." (page 323). How could it be, when all organims arose directly from random DNA? How could DNA from broken seed cells have a greater surival value than random DNA sequences? It implies natural selection, common descent and contradicts independent origin! 21 Oct 25
"Thus there are reports of ... genomic sequences corresponding to 12 of the 14 steps involved in de novo purine nucleotide synthesis". (source). The 14 enzymes must be available before nucleotides and DNA can be synthesized. So it does not help that the 14 genes are present in DNA. Without those 14 enzymes there is no DNA. That is the famous vicious circle or chicken and egg problem. 23 Oct 25
"I never doubted the validity of Darwin's theory. However, the number of problems unsolved by the theory, such as the 'missing links' between supposedly related organisms, were puzzling to me. Furthermore, there are many questions concerning the origin of life itself still unanswered by scientific research." Introduction, page 3. 24 Nov 25
John S. Wilkins (2013) Essentialism in Biology, chapter, The Philosophy of Biology, online 1 Jan 2013. 30 Nov 25
"Perhaps there are proteins or metabolites that are inherited by the embryo from the sperm and egg, and those contain necessary information. All of the various cloning and genome transplant experiments to date require an intact cell, oocyte, or embryo to work, and I don't think there is a single example where naked DNA and a chemically defined in vitro solution has led to a living organism. As such, it would be very interesting to have a precise understanding of what and how much information is encoded in the genome sequence alone and what is stored by other chemical means in the cell, whether by epigenetic coding of chromatin or via other biological molecules." From: Stephen R. Quake (2024) The cellular dogma, Cell, 14 Nov 2024. Free access. 23 Dec 25
"Each independently-originating creature (whether invertebrate or vertebrate) also gave rise to many related similar species by many mechanisms such as natural selection and mutation, that is, by means of change through organismal descent with modification." Chapter 10, page 488. 20 Jan 26
"A central question in evolutionary biology is understanding what limits natural selection's ability to perfect organisms." Source: Michael Lynch, Pioneer of Evolutionary Cell Biology, to Deliver 2026 Darwin Day Lecture. 12 Feb 26
Unfortunately, this destroys the Theory of Independent Origin of Organisms, because the theory is based on this naive assumption. Added 10 Mar 26
Added 14 Mar 26
"The complexity of the genomes is not too different among various multicellular organisms, from worm to human, all of which in turn are not far removed from those of unicellular eukaryotes. Therefore the probabilities for the assembly of the genomes for these organisms are not widely varied." page 203 A PRELUDE TO THE NEW THEORY Chapter 5. 15 Mar 26
Here is another example of assuming his theory is basically right, and then deriving a factual conclusion from it: "When a unicellular eukaryote could originate from the primordial pond's UGP, it inherently means that the UGP was certainly vast enough to contain genes for almost any protein required to construct multicellular organisms." p 306. 15 Mar 26
Amandine Bonnet, Benoit Palancade (2015) Intron or no intron: a matter for nuclear pore complexes, Nucleus:
- "In particular, mRNAs expressed from intron-containing genes are surveyed by a specific NPC-dependent quality control pathway ensuring that unspliced mRNAs are retained within the nucleus." 17 Mar 26
This paragraph has been improved by highlighting the text in red and adding extra quotes. 19 Mar 26
For example, he omits START codons and splicing recognition sites in his computer experiments of introns and exons. This looks like tweaking his experimetnal design. "Tweaking is the deliberate choice of reserach design and model specification to produce specific empirical results in the interest of the researcher." According to this definition, tweakers do not directly manipulate or fabricate data. Rather, they modify their research designs or models to produce misleading, even technically true, data.". Self-serving "tweaks' hurt science, Science 19 Mar 26
"In eukaryotes, three RNA polymerases (RNAPI, RNAPII, and RNAPIII) and their associated factors drive transcription. Using live-cell single-molecule tracking in yeast, Ling et al. measured the kinetics of 58 proteins involved in this process." from Live-cell single-molecule dynamics of eukaryotic RNA polymerase machineries, Science 5 Feb 2026
A. Robertson (1960) A theory of limits in artificial selection. 28 Mar 2026
N Barton, L Partridge (2000) Limits to natural selection. 28 Mar 2026
Senapathy is "Searching for the occurrence of the genes of today's animals and plants in the universal sequence pool" (page 273). Today's genes are read with today's Genetic Code table. That means that the Genetic Code of today's animals and plants must arise in the primordial pond. Consequently, if a different Genetic Code were to emerge in the Primordial Pond, his scenario would fail because genes of today's organisms obey the same Genetic Code table as those in the Primordial Pond (according to his theory,). This is a risky assumption. How likely is it, that at the origin of life, exactly the same Genetic Code would arise in one go? without the possibility for any optimization process? I did not fully realize the magnitude of this problem until now. Senapathy is not concerning himself with this problem. He is not worried. See for the details:
A DNA sequence is not a genome. 21 Apr 2026
Charles S. Cockell (2018) 'The Equations of Life: How Physics Shapes Evolution', Chapter 7 THE CODE OF LIFE. Source: Gayle K Philip, Stephen J Freeland (2011) Did evolution select a nonrandom "alphabet" of amino acids?. 27 Apr 2026
"more than 80 amino acids have been found in meteorites" from: On the Evolutionary History of the Twenty Encoded Amino Acids. 5 May 2026
Human DHX29 detects nonoptimal codon usage to regulate mRNA stability, Science, 19 Mar 2026. 8 May 2026
Walter Remine: "Directly created organisms have no ancestor, they are created by the direct action of a designer." (p510). "directly created organisms are not related by common descent". "Numerous life forms were separately created". My review. 8 May 2026.
Enhancers with tissue-specific activity are enriched in intronic regions, Genome Res 2021. Intronic enhancers, which are regulatory DNA sequences located within the introns of genes, were first discovered in 1983 in the mouse immunoglobulin heavy chain gene, so Senapathy could have known this. 9 May 2026
Adrian Woolfson (2026) 'On the Future of Species: Authoring Life by Means of Artificial Biological Intelligence', chapter 3: Constructing Synthetic Genomes. 11 May 2026
Sequence and organization of the human mitochondrial genome, Nature 9 April 1981. 11 May 2026
The term "exon" is often mistakenly used as a direct synonym for "protein-coding". In reality, the definition refers to what is retained, not what is translated. A subtle but important difference! 16 May 2026

35 Further Reading

Periannan Senapathy's company Genome Technologies. Note: this seems to be no longer Senapathy's homepage, but of a commerical firm (noted on 16 Jun 2013). Here is a snapshot of his website genome.com taken in 2001 by the WayBackMachine. Please note the self-assured language Senapathy uses about himself and his achievements.
Senapathy's new home site ("The page you requested was not found" on 16 Jun 2013).
Senapathy's immodest autobiography. ("The page you requested was not found" on 16 Jun 2013).
book: Independent Birth of Organisms - A New Theory That Distinct Organisms Arose Independenlty From the Primordial Pond Showing That Evolutionary Theories Are Fundamentally Incorrect (1994). Genome Press, Madison, 635 pages. The book is free available as an Adobe pdf file (3 MB!). Fast internet connection recommended. The pdf file is full-text searchable, which is extremely handy for research purposes (recommended for any book! Each book should also be published on CDROM!) [Today: ebooks]. Error: ("The page you requested was not found" on 16 Jun 2013). Luckily, I have saved the pdf on my PC.
Wikipedia page Periannan Senapathy (accessed 18 Jul 2025). There is not a single hint on this page that the theory is at odds with mainstream science, that it may be a controversial theory. On the contrary: (mainstream) scientists are quoted who would accept (part of?) his theory. Furthermore, a layman would not discover that his theory is anti evolution. The page is a defense of the theory from start to finish. This page is purely for self-promotion purposes. There is not a single critical note on the page. It violates Wikipedia's neutral point of view (NPOV) rule. Even worse: on the Web archive can be verified that there was a link to a critical review (my review), but as of 2021 the only critical note had disappeared. This is against the five pillars of wikipedia: Wikipedia is written from a neutral point of view, We avoid advocacy, etc.
Here is a list of publications of Senapathy P (Periannan) from the BioInfoBank Library. More on p.598 of his book. Please note that peer-reviewed journals are included in the list, but as far as I can see from the abstracts those publications do not mention his theory of independent origin of organisms!
PRWEB Genome Data Proves False the Theory of Evolution, New Theory Shows Complex Animals and Plants Originated from Prebiotic Chemistry, December 15, 2010
A layman's summary of the new theory by Jeffrey Mattox. With many links (some dead links). Mattox is an electrical engineer who 'discovered' Senapathy. The page is no longer maintained.
Double helix: 50 years of DNA (from Nature), containing a collection of overviews celebrating the historical, scientific and cultural impacts of the discovery of the double helix. All content is free.
Gert Korthof: A Chemist's View of Life: Ultimate Reductionism & Dissent. A review of Schwabe's book.
Gert Korthof: Independent Origin and the facts of life. Reasons from developmental biology, genetics and ecology (please note that I skipped reasons from evolution biology!).
Gert Korthof: a review of The principles of Life by Tibor Gánti. (the origin of life)
Gert Korthof: a review of Lynn Margulis and Dorion Sagan (2002) Acquiring Genomes. A theory of the origin of species. Recommended reading. If anything contradicts independent origin, then it is Margulis' now well established symbiosis theory.
Gert Korthof: The Feathered Onion is a review of Clive Trotman's book. Summary: Is the time span for the origin of life on earth too short?
How Many New Genes Are There? Science Vol 311 24 March 2006 1709. This article uses computer generated random (intron-free) cDNA sequences of 2000 bases in length and concluded that by chance 1247 of 20,000 (6,2%) contain Open Reading Frames which could produce proteins of 119 or more amino acids.
Robert M. Hazen, Patrick L. Griffin, James M. Carothers, and Jack W. Szostak (2007) 'Functional information and the emergence of biocomplexity', PNAS published online May 9, 2007.
- "Here we explore the functional information of randomly generated populations of Avida organisms." That is random genomes are generated! This would be a modest but careful way to explore genomespace and the probability of random generation of a funtional genome.
Periannan Senapathy et al (2008) 'Origination of the Split Structure of Spliceosomal Genes from Random Genetic Sequences', Plos One, October 20, 2008. Open Access. ("and that a machinery was required for removing the genetic waste.": the concept 'genetic waste' has only meaning if a functional genome exist. Similarly, 'stop codon' only has meaning if the genetic code is already established.) Please note that the book reviewed on this page is present in the PLOS article as note 36, but the subtitle 'A New Theory That Distinct Organisms Arose Independently From The Primordial Pond Showing That Evolutionary Theories Are Fundamentally Incorrect' is omitted.
T. Ryan Gregory and Niles Eldredge (2008) Spore biology. Surprisingly, this page is usefull for Senapathy supporters, because Senapathy's theory is like Spore Biology (need to elaborate this).
Periannan Senapathy (2013) 'Theory of the origin of complex eukaryotic genomes from pre-biotic random genetic sequences', 14 Mar 2013 - 12:00pm. Lecture in the Evolution Seminar Series (ESS) of the J.F. Crow Institute for the Study of Evolution, University of Wisconsin-Madison. (reported to me by Tyler Valkoun from University of Wisconsin-Madison).
Gert Korthof (2021) Senapathy's request to remove humans is a blogpost about Senapathy's request. 19 Sep 2021.
Gert Korthof (2021) Are humans produced in Senapathy's Primordial Pond? Yes and No! internal inconsistency in his theory, blog 27 Sep 2021.
Gert Korthof (2022) Did Nick Lane solve the origin of life? blog 5 Sep 2022
Gert Korthof (2023) Periannan Senapathy (1994) claimed that the human genome consists of more than 90% junk DNA, blog 04 July 2023.
Gert Korthof (2025) Senapathy algorithm undermines his own theory of independent birth of organisms from the primordial pond, 2 Aug 2025.

Nederlands

Gert Korthof (2011) Senapathy publiceert in Nature Precedings on korthof.blogspot.com 7 Jan 2011.
Gerdien de Jong (2019) Open Leesramen in Drosophila melanogaster DNA maandag 17 juni 2019 (Open Leesramen = ORFs)
Gerdien de Jong (2019) Ruis, rommel (= noise, junk) zondag 26 mei 2019

Top

Korthof blogspot	home: Towards the Third Evolutionary Synthesis	https://wasdarwinwrong.com/korthof58.htm
Copyright ©G.Korthof 2002	First published: 29 Dec 2002	Updated: 16 May 2026 Notes: 16 May 2026