I’m not lazy
December 16, 2009 at 2:57 pm | Posted in plan | Leave a commentTags: thesis
Why haven’t I posted anything in the last few weeks?
As I told my friend, blog posting frequency is inversely related to thesis progression.
I’ve been writing several sections to the traditional literature review that will form part of the preamble to my thesis and to my transfer report next year (transfer to full PhD).
Thus far I have made notes on the history of the Gene Ontology, what an ontology is, classification in biology, formal ontology in biology and science and technology studies (STS) theories relevant to ontology usage.
I’ll post the documents as they stand thus far up before Christmas.
Thesis questions for October 2009
October 22, 2009 at 8:39 am | Posted in plan, thesis | Leave a commentI will most likely have to prepare my transfer report (to full PhD) in the new year, so I am working on a final plan for my schedule of practical work for 2010.
The work is shaping up to a mixture of epistemology, philosophy of science, information history and classification theory. The methods will be philosophical arguments, a case study of the Gene Ontology and (hopefully) some user behaviour studies to find out how biologists use and understand GO terms.
- How can an objective knowledge be constructed by social co-operation?Philosophical argument using Popper’s 3 Worlds
Science is the revealing of ‘objective knowledge’ in the sense used by Karl Popper
The 3 Worlds epistemology – the material, the mental, the objective – is applicable to biology
Modern biology is the construction of objective knowledge by social means facilitated by computers - Is the knowledge represented by the Gene Ontology ‘objective knowledge’?
Case study of the Gene Ontology
The Gene Ontology (GO) is a controlled vocabulary and a knowledge representation for molecular biology
Formal relations between concepts in GO permit automated annotation and inferencing using computers
The Ontology with its terms, relations and notes represents a World 3 - How does the Gene Ontology change over time?
Information history approach to the development of the Gene Ontology
The Gene Ontology changes over time
The community participates in suggesting new terms to include
There are other ways GO could be developed
GO is being used in new ways eg, for information retrieval - Should knowledge representations in biology be constructed by logic, consensus or with impunity?
Interviews with experts, classification test and literature analysis
Ontologies are logical
Biologists want to contribute what they think are appropriate new terms
Very new knowledge does not have a place in an ontology, so how can it be incorporated? - How can the Gene Ontology be used to retrieve information?
User relevance study with GO term-driven search results
GO terms can be used to annotate documents in biology
They may represent a better way to find new knowledge and relations than simple NLP
Thesis questions for August 2009
August 13, 2009 at 12:36 pm | Posted in plan, thesis | Leave a commentTags: thesis
- How true is a biomedical ontology?
- How can we index a new idea?
- What is a document in biology?
- Are the elements of life a hierarchical classification?
- Can a machine be a biologist?
Academic authors to watch
June 16, 2009 at 9:42 am | Posted in plan | Leave a commentTags: planning
See below for a list of some academics (and links to their publications) whom I think are particularly relevant to my PhD:
This month's thesis
May 21, 2009 at 2:47 pm | Posted in plan, thesis | Leave a commentTags: thesis
- Biology grows new knowledge
- Informatics is now key to how biology grows new knowledge
- Theoretical invention is fundamental to biology
- Can biology grow new knowledge and invent new theories using only informatic techniques?
- Can a machine be a biologist?
The telephone in Lusaka
March 26, 2009 at 8:44 am | Posted in plan | Leave a commentTags: information theory, shannon
Has information been communicated between two objects or events, the information transfer causing the one to effect a change in the other?
Has a message been communicated, or has there been a statistically unlikely co-occurrence which we interpret as being linked by an information event?
Imagine if every time I threw a tennis ball against the wall in my office, a telephone in an office in Lusaka rang twice. I throw the ball ten times, a hundred times, and the phone rings consistently rings twice with each throw.
(Every time a child says “I don’t believe in fairies,” there’s a little fairy somewhere that drops down dead.)
Is the truth-content of the information in a signal related to the scientist’s implicit trust in the inductive method?
Akin to the mind-body problem, how can immaterial information effect changes in the physical?
How do two identical signals transferring the same information to different receivers effect non-identical changes? Is the information the signals carry different, or the structure of the receiver? Can causally-linked events co-occur and not involve an information transfer?
Popper and e-Science infrastructure problems
February 13, 2009 at 1:16 pm | Posted in plan | Leave a commentTags: e-science, popper
I’m reading about Popper’s 3 Worlds and want to link his model to some e-science infrastructure problems in biology.
For example, I think experimental data storage issues are primarily a World 1 problem (where is data physically located, what format is used, how is it networked).
Systems evaluation is a World 2 problem (does a scientist like a system, does he/she want to use it).
World 3 therefore becomes an important ontological category – World 3 is where biologists aspire to contribute new theories and test biological models.
- World 1 = practical / physical design problem area
- World 2 = sociological / psychological problem area
- World 3 = theoretical / philosophical problem area
Do ontologies for biology make the grade?
November 7, 2008 at 1:24 pm | Posted in article, plan | Leave a commentTags: ontology, planning
Soldatova, L. N. and King, R. D. Are the current ontologies in biology good ontologies? Nature Biotechnology, 23(9), pp. 1095-1098
How can I resolve my different ideas and readings into one topic? When I consider what I have read over the last six weeks, several areas are prominent:
- the biomedical science as an information domain
- biomedical researchers as actors with information needs
- information retrieval in the biomedical sciences
- information management in the biomedical sciences
- knowledge discovery in the biomedical sciences
- innovation in the biomedical sciences
- the philosophy of science
As I was reading the paper from Soldatova and King, which I will come to in a moment, I felt that really what I have been thinking about is knowledge representation in the biomedical sciences. How are entities named? How are the related to one another? In research, how are observations and theories connected, stored and retrieved? Where do new theories originate from?
Perhaps the term ‘knowledge representation’ best describes what I am thinking about. This captures the high-order conceptual and theoretical relationships that are my primary interest, rather than the data description and retrieval tasks which are bread-and-butter for bioinformaticians. ‘Representation’ appeals to me because I am interested in how names symbolise entities in biology. Names have meaning for biologists, but that meaning might not be as tight and logical as ontologies might want them to be, or as biologists might really use them.
This paper from the Computational Biology research group at the University of Wales, Aberystwyth, highlights conceptual inconsistencies in the design of the MGED Ontology for describing microarray experiments and results. My Masters dissertation investigated the sharing and citation of microarray data through the Arrayexpress data repository, therefore it is interesting to read these criticisms of the community-developed ontology standard which describes this data-type.
Standards have been developed for creating ontologies, and these place careful constraints on ontology construction, based on formal logic. Intrinsic properties, class definitions and ‘Is-a’ relationships are strictly controlled. Failure to observe the standards will create logical inconsistencies both within the ontology, and in interoperability between ontologies.
The MGED Ontology breaks numerous rules and creates confusion for the paper authors with mixed classes, odd properties and errors in the ontology structure. Have these criticisms been addressed? I’m not sure, but the MGED Ontology is but one example of myriad ontologies in development to support biomedical data annotation and interpretation. Take a look at the Open Biomedical Ontologies and National Center for Biomedical Ontology’s BioPortal websites for many, many more.
Do all these ontologies meet the standards for logic for consistent application in machine processing or learning? I wonder if biologists themselves recognise the logical classes and properties built into these knowledge representations. More importantly, can they or would they apply these ontologies during the normal research process? Would these ontologies be built into instruments? Would annotation take place post-data acquisition, and who would do this? It strikes me that there would be an enormous investment of time and effort to describe even a simple cell biology experiment.
With all this effort going in to develop ontologies, how will they be used?
I did not meet my spirit animal
November 6, 2008 at 10:07 am | Posted in plan, work | Leave a commentTags: planning
Craft and Creativity workshop, Part 1.
A City University research student course
Yesterday I attended the first of a two-part course run by the University entitled ‘Craft and creativity’.
The first part, ‘Planning, Structuring, Researching, and Managing Your PhD’ did exactly what it said on the tin. I found the technique of creating questions and sub-questions that will form thesis chapters to be particularly useful.
My notes are below:
The afternoon was dedicated to the creativity part. Rather than dropping acid and talking a walk through our subconscious, the workshop facilitator, John Hands, guided us through a simple meditation technique designed to release the creative tiger within.
It wasn’t quite my bag. The product of my meditative adventure is depicted below. It makes next to no sense.
My original PhD research proposal
November 5, 2008 at 8:38 am | Posted in plan, work | Leave a commentTags: planning
I wrote the original proposal for this project back in February 2008, under the lengthy and accommodating title, ‘Information in the biomedical sciences : organising data, transferring knowledge, and innovating from bench to bedside in the postgenomic era’.
For the full text, see the link below:
Original research proposal (PDF, 92, kB)
Blog at WordPress.com. | Theme: Pool by Borja Fernandez.
Entries and comments feeds.



