Famous old images of racehorse and jockey in stop action frames

Credit: NIH/National Institute of Mental Health

Scientists funded by the National Institutes of Health have succeeded in a study that could have remarkable implications of our understanding and perhaps re-scripting of how the human body behaves.  The study, which will appear July 12, 2017, in the journal Nature, show for the first time that DNA in living cells can be used to encode not just genetic knowledge, but any random sequential data into a genome.

Neuroscientist Seth Shipman, Ph.D. , a post-doctoral fellow at Harvard Medical School, Harvard’s Drs. George Church, Jeffrey Macklis and Jeff Nivalapreviously, first demonstrated that they could use CRISPR to store sequences of DNA in bacteria. Clustered regularly interspaced short palindromic repeats (CRISPR) is a genome editing technology that allows permanent modification of genes within organisms.  The team then used CRISPR to encode and rebuild frames from a classic 1870s race horse in motion  sequence of photos – an early forerunner of moving pictures.

The research is heralded as a important jump ahead in the goal to developing a “molecular recorder” that in the future could make it possible to obtain read-outs, for example, of the transitioning internal states of neurons as they develop.  The Harvard team sees many possible futures uses for this technology like recording molecular events such as changes in gene expression over time.  The researchers eventually wish to use the technology to further explore the brain.

“We want to use neurons to record a molecular history of the brain through development,” said Seth Shipman. “Such a molecular recorder will allow us to eventually collect data from every cell in the brain at once, without the need to gain access, to observe the cells directly, or disrupt the system to extract genetic material or proteins.”

Transcript of Video Below:

Nearly a century and a half ago, the big news
0:07
about encoding and playing back sequential information – the forerunner of movies – was
0:12
all about those horses.
0:14
Over the years, the technology advance to motion picture film, then to tinier and tinier
0:22
videotapes and flash media.
0:23
And we thought that was gee whiz stuff.
0:26
Well, think again.
0:28
Today’s latest news about encoding and playing back sequential information is again about
0:34
those horses – except these movies were encoded in and played back from DNA in living
0:42
bacterial cells.
0:43
Seth Shipman: To be totally honest, the point is not to
0:46
store videos in bacteria.
0:48
Laugh.
0:49
We use the video because it’s a good example of a complex piece of information that has
0:57
both many parts to it – that is many pixel values and a time component.
1:02
So its organized over time.
1:03
And so it was a good way for us to test whether the CRISPR adaptation system that we’re
1:10
using could actually acquire enough information that we could go in and sequence the bacteria
1:17
after we had encoded it and reconstruct the movie.
1:22
So its kind of a way for us to push the technology forward and understand how we will build and
1:32
analyze data as we are scaling up, hopefully to eventually do biological recording.
1:38
My background is actually in neuroscience.
1:39
And so, I was trying to understand how different cells in the developing brain would become
1:46
very distinct types of neurons.
1:48
So they are start as a cell with the same genetic material.
1:54
But over time, they acquire these very distinct cell states.
1:58
So there are different types of neurons.
2:01
And it’s very difficult, from a technological standpoint, to understand how that happens
2:08
in the developing brain.
2:10
Because its happening for many many many cells at once.
2:14
And all kind of locked away within the brain.
2:17
And so to get access to the biological events is quite difficult.
2:24
So what I imagined was if we had a way or recording biological events and storing them
2:30
in the genome, then I wouldn’t have to – as an experimenter – go in and either be watching
2:35
the cells or be kind of disrupting the brain in order to get information out, which then
2:40
disrupts the entire developmental process.
2:42
The overall idea is to use these kind of molecular recorders to gain access to biological phenomena
2:51
that we just can’t see in real time.
2:54
I was making reference to the development of the brain.
2:59
That’s one place where its something that happens both over time and in a biological
3:06
system that’s very difficult to get access to.
3:08
To actually watch the cell biology occur.
3:13
So if we were able to implement a molecular recorder to catalog biological Events over
3:19
time, that’s the type of a system that really would fundamentally change the type of data
3:27
we would pull out of the system.
3:30
Right now, we’re really piloting the idea of encoding information into the genome as
3:38
a way of storing information in life cells.
3:41
An so, it’s in bacteria.
3:46
We are currently supplying the information.
3:49
And over time we really would want to unlock the technology.
3:53
Be able to put it into any cell type.
3:56
And to hook it up to biology.
3:58
It is now functioning in bacteria.
4:00
So there are other things you could think about, like having living bacteria that are
4:07
actually sensing things in their environment and recording them into their own genome.
4:11
So they could be these biological recording devices, sensing recording devices.
4:17
And so they could be out in the environment or just recording things that are happening
4:23
within the cell – and capturing them, storing them in their genomes.
4:26
At any point, we can go back and recover that information.
4:30
This technology definitely has applications for experimental biology.
4:35
So it can be used as a tool for experimental biologists to gather data from cells.
4:40
In the future, it also potentially has applications in the medical field, where a cell has privileged
4:49
access to certain components of your body.
4:52
And if a cell could be gathering information and recording it into its own genome, then
4:57
that cell could be perhaps living within your body and getting that information over time.
5:03
And a clinician could go in and sequence out the genomic locus that has that information
5:11
and see what’s been going on previously.
5:14
So we’re looking at an image of a human hand, which shows a human hand, because it
5:21
was one of the first images that people put onto the natural world.
5:25
And so since we’re encoding images here in the natural world in a different way, we
5:29
decided to recapitulate one of those images.
5:33
And what you’re seeing is the source image – what we started with.
5:39
We then encoded all of the pixel values into the nucleotides of DNA, distributed them over
5:45
a number of strands of DNA, and then put them into bacteria, where the bacteria acquired
5:50
those sequences and captured them into their own genomes.
5:54
After the cells grew for awhile, we then sequenced their genomes and were able to reconstruct
6:01
the image based on the sequencing of the nucleotides that were inserted.
6:06
So then what you’re looking at is also the reconstructed image from the bacteria after
6:12
they were sequenced.
6:14
Here we are looking at a movie of a horse running.
6:19
This was originally taken by Edweard Muybridge.
6:22
And, again, it was one of the first moving images that was made.
6:27
And so we thought that it was an appropriate thing to encode.
6:31
And what we did here was, again, for each frame, break down the pixel values and encode
6:39
them in the nucleotides of DNA across many different strands – about 100 strands for
6:45
each frame of the image.
6:47
And then in this case, we delivered those strands to living bacteria over time.
6:52
So each frame was delivered on a different day over the course of five days.
6:57
And then we let those cells grow for awhile.
7:00
And then we sequenced their genomes.
7:01
Dnd we were able to reconstruct not only each of the frames, but the order of each of the
7:06
frames.
7:07
Because the molecular recording system that we’re using captures the timing information
7:13
of different molecular events.
7:16
The next step would be to hook it up to actual biology.
7:20
So, right now, we were supplying information in the form of synthesized DNA.
7:27
In the future, we want that information to come from biological events that are happening
7:31
within the cell.
7:33
So that’s the next step.