Human Genome Data
SB has access to the largest collection of genomic data (DNA sequences, protein sequences, etc.). These data
have never been available anywhere before, and SCC is devising ways to extract as much useful and important
information from them as possible. To that end, we have invented novel measures of data quality, defined new
quantitative attributes, and are implementing ways to extrapolate the future behavior of the experiments generating
the data. The data are still incomplete, but more are obtained weekly as further DNA is sequenced in the labs.
Some of these ideas were conceived during our probabilistic simulations of genomic sampling, in which we
assumed various distributions of outcomes for the genome experiments, then experimented with and analyzed the
inputs and outputs.
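
As a rough illustration of what a probabilistic simulation of genomic sampling might look like, the sketch below places fixed-length reads at uniformly random positions along a genome and reports the fraction of positions covered at least once. The uniform distribution, the function name simulate_coverage, and every parameter value are assumptions chosen for illustration only; they are not the distributions or methods used in the work described above.

```python
import random


def simulate_coverage(genome_length, read_length, num_reads, seed=0):
    """Return the fraction of genome positions covered by at least one read.

    Assumption (illustrative only): read start positions are drawn
    uniformly at random, and every read has the same length.
    """
    rng = random.Random(seed)  # seeded so repeated runs give the same result
    covered = [False] * genome_length
    for _ in range(num_reads):
        # Pick a start so that the whole read fits inside the genome.
        start = rng.randrange(genome_length - read_length + 1)
        for pos in range(start, start + read_length):
            covered[pos] = True
    return sum(covered) / genome_length


if __name__ == "__main__":
    # Vary the input (number of reads) and observe the output (coverage).
    for n in (50, 100, 200, 400):
        print(n, round(simulate_coverage(10_000, 100, n), 3))
```

Swapping the uniform draw for another assumed distribution and comparing the resulting coverage curves is one simple way to experiment with the inputs and outputs of such a simulation.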