This is part of the methodology tutorial (see its table of contents).
In short, qualitative data analysis usually implies to related and iterative steps. See Methodology tutorial - theory-finding research designs for the general principle.
(1) Data needs to be coded and indexed so that you can find it for data analysis. More particularly:
(2) You then can do visualizations, matrices, grammars, etc.
(3) Interpret these
(1) Write memos (conservation of your thoughts). It is useful to write short memos (vignettes) when an interesting idea pops up, when you looked at something and want to remember your thoughts
(2) Create contact sheets that allow you remember your field work. After each contact (telephone, interviews, observations, etc.), make a short data sheet that should include:
(3) Index your interview notes:
(4) Do not trust your hard-disk !
First step in qualitative data analysis is coding.
A code is a “label” to tag a variable (concept) and/or a value found in a "text"
CE-CLIM(+)
instead of:
external_context -climate (positive)
Assigning a code to a "text" segment is not always obvious and coding similar passages exactly the same way even less. In other words, we have a reliability problem.
There are two ways of improving reliability:
There exist several formula to compute intercoder (inter-rater) reliability. The most simple one is:
reliability = number of agreements (same coding) / total codes (agreements plus disagreements)
Read this for a very good introduction.
The list of variables (and their codes), is defined through theoretical reasoning, e.g.
Example from an innovation study (about 100 codes):
Categories | Codes | Theoretical references | |
---|---|---|---|
properties of the innovation | PI |
....(fill for your own code book)..... | |
external context | EC | ||
demography | CE-D | ||
support for the reform | CE-S | ||
internal context | IC | ||
adoption processes | AP | ||
official chronology | PA-OC | ||
dynamics of the studied site | DS | ||
external and internal assistance | EIA | ||
causal links | CL |
Before you start think about your own code book, you really should go through the relevant literature and try to find existing code books (that you then may adapt). E.g. below is an an example of codes use to analyze types of job-related problems of Turkish computer teachers (Deryakulu & Olkun, 2007).
Grounded theory (Glaser, Strauss) refers to a set of approaches that that focus on interpretation and theory building, i.e. it is a fully inductive approach. The researcher starts by coding a small data set and then increases the sample in function of emerging theoretical questions. Categories (codes) can be revised at any time.
“Grounded theory begins with a research situation. Within that situation, your task as researcher is to understand what is happening there, and how the players manage their roles. You will mostly do this through observation, conversation and interview. After each bout of data collection you note down the key issues: this I have labelled "note-taking".
Constant comparison is the heart of the process. At first you compare interview (or other data) to interview (or other data). Theory emerges quickly. When it has begun to emerge you compare data to theory.
The results of this comparison are written in the margin of the note-taking as coding. Your task is to identify categories (roughly equivalent to themes or variables) and their properties (in effect their sub-categories). Grounded theory: a thumbnail sketch, retrieved 13:40, 15 October 2008 (UTC))”
Typically, you'd both code phenomena in isolation and relations (so-called axial coding). On starting point for axial coding could be these big abstract observation categories:
Read more in [Introduction to Grounded Theory].
To use this approach you really should document yourself, as beginner you'd likely fall into various traps, in particular selection and confirmation biases, i.e. you only look at things that will interest you for one or another reason.
Instead of initially creating a code book from variables found in your research questions or "inductive" coding à la grounded theory, you also start by creating or using a vocabulary for a given domain. This strategy is a compromise between “grounded theory” and “theory driven” approaches.
Below is a table that lists things you could observe in an organization (Bogdan and Biklen, cited by Miles & Huberman:1994 61)
Types | Explanation |
---|---|
Context/Situation | information on the context |
Definition of the situation | interpretation of the analyzed situation by people |
Perspectives | global views of the situation |
Ways to look at people and objects | detailed perceptions of certain elements |
Processes | sequences of events, flow, transitions, turning points, etc. |
Activities | structures of regular behaviors |
Events | specific activities (non regular ones) |
Strategies | ways of tackling a problem (strategies, methods, techniques) |
Relations and social structure | informal links |
Methods | comments (annotations) of the researcher |
In the literature, you may find several other such "accounting schemes". In educational technology, for example, there is a range of relatively simple code books for conversation and asynchronous discussion groups (forum) analysis. De Wever et al. (2006) provide a good overview. Some coding schemes are fairly simple. E.g. Cobos and Pifarré (2008) analyzed "collaborative knowledge construction in the web" with the following coding scheme:
Explanation | Asks for clarifying some parts of the document | “The following link, which appears in your document, doesn't work now, but it worked a week ago” |
---|---|---|
Support | Express explicit agreement with the document's ideas or information organisation | “In my opinion this document is very useful and is easy to read” |
Addition | Suggests additions to the document: ideas, opinions or information organisation | “I think that an index of the sections of the article should be added” |
Delete | Suggests deletions from the document: ideas, opinions or information organisation | “In the summary there are some examples, which perhaps were not necessary” |
Correction | Suggests changes to the document. They refer to ideas, opinions or information organisation | “I think that there is an error in the first paragraph in the conclusion section, where it says 'motor' it should be 'motivation'” |
Pena and Nichols (2004) used the following categories:
There exist more complex code books: As an example we produce a summary of students' messages code book by Eilon and Kliachko (2004).
"Level A" Group | These categories indicate knowledge construction and a significant contribution to peer learning |
---|---|
Comprehension | Shows written evidence about the comprehension of the subjects studied by the following categories: |
Reproducing- 1 | Reproduces the main points, ideas, arguments or messages found in the incoming information with reference to its source and with critical evaluation. |
Directing | Directs others to relevant sources for the subjects studied (both printed and online). |
Clarifying by questions | Locates ambiguous, difficult or problematic areas in the new material. Describes the context of the question or the reason for asking it. |
Clarifying by answers | Gives correct, relevant and comprehensive explanations as answers. Bases the answers on retrieved information while citing its origin. |
Reflection | Shows written evidence of metacognitive processes that the learner applies when studying new subjects by the following categories: |
Linking/extending | Links the new information with his or her own previous knowledge. Extends the new knowledge to other domains, especially to STS issues. |
Critical evaluation | Evaluates the new information critically. |
Transformation | Applies the new information in an original and creative way, draws inferences, and gives original examples. |
"Level B" Group | These categories indicate a probable contribution to peer learning, by the following categories: |
Documenting | Documents l earning exper iences or indivi dual contributions to the group. |
Reproducing- 2 | Reproduces the main points, ideas, arguments or messages found in the incoming information without any evaluation or original input to it. |
Learning outcomes | Presents group and individual learning outcomes. |
Technical questions/answers | Questions or remarks about any subject that does not relate directly to understanding the subjects studied. |
Personal knowledge | Presents personal knowledge or daily life experiences. |
"Level C" Group | These categories indicate no contr ibution (or adverse contribution) to peer learning, by the following categories: |
Irrelevant/unexplained questions | Poses questions without giving the context or reason for asking them. |
Casual quotations | Includes quotations without their context and without further explanation. |
Irrelevant/incorrect answers | Offers Irrelevant or incorrect answers to questions sent by other students. |
Emotional/personal comments | Includes personal comments, which should have been sent by e-mail as instructed by the teacher . |
Some researchers also code patterns (relationships). Simple encoding (above) breaks data down to atoms, categories), whereas pattern coding identifies relationships between atoms. Pattern coding is also one the steps in the inductive grounded theory approach.
The ultimate goal is to detect (and code) regularities, but also variations and singularities.
Some suggested operations:
Attention: a co-presence does not prove causality
Qualitative analysis attempts to put structure to data (as in exploratory quantitative techniques)
In short: Analysis = visualization
There exist 2 popular types of analysis:
In this tutorial we can not cover all possible types of analysis, but just provide a few examples of what can be done. Before you start doing any sort of analysis, think about what you need to answer your research questions !
This technique allows to visualize relations and information flows between rôles and groups
There exist codified "languages" for this type of analysis, e.g. UML or OSSAD
Once you clearly identified and clarified formal relations, you can use the graph to make annotations (like below)
Check lists allow to make detailed summary for an analysis of an important variable.
Example: "external support is important for succeeding a reform project"
Examples for external support | At counselor level | At teacher level |
---|---|---|
Analysis of deficiencies | Fill in each cell as below | |
Teaching training | ||
Change monitoring | ||
Incentives | ||
Group dynamics | adequate: “we have met an organizer 3 times and it has helped us” (ENT-12:10) | not adequate: “we just have informed” (ENT-13:20) |
etc. .. |
Such a table displays various dimensions of and important variable (external support). E.g. in the example above the values of the variable "external support" are listed in the left column
In the other columns we insert summarized facts as reported by different roles.
Review Question: Imagine how you would build such a grid to summarize teacher’s, student’s and assistant’s opinion about technical support for an e-learning platform
Example: Task assignments for a blended project-oriented class
Activity |
Date |
imposed tools (products) | |
1 |
Get familiar with the subject |
21-NOV-2002 |
links, wiki, blog |
2 |
project ideas, Q&R |
29-NOV-2002 |
classroom |
3 |
Students formulate project ideas |
02-DEC-2002 |
news engine, blog |
4 |
Start project definition |
05-DEC-2002 |
ePBL, blog |
5 |
Finish provisional research plan |
06-DEC-2002 |
ePBL, blog |
6 |
Finish research plan |
11-DEC-2002 |
ePBL, blog |
7 |
Sharing |
17-DEC-2002 |
links, blog, annotation |
8 |
audit |
20-DEC-2002 |
ePBL, blog |
9 |
audit |
10-JAN-2003 |
ePBL, blog |
10 |
Finish paper and product |
16-JAN-2003 |
ePBL, blog |
11 |
Presentation of work |
16-JAN-2003 |
classroom |
Miles & Huberman (1994:124)
The abstract principle can be summarized as follows (see below for an example):
roles | persons | variable 1 | variable 2 | variable 3 |
---|---|---|---|---|
role 1 | person 1 | cells are filled in with values (pointing to the source) | ||
person 2 | ||||
..... | ||||
rôle 2 | person 9 | |||
person 10 | ||||
..... | ..... | |||
role n | person n | |||
..... |
role 1 | ... | role 3 | |
---|---|---|---|
role 1 | fill in all sorts of informations about interactions | ||
... | |||
role 3 |
Example: Evaluation of the implementation of a help desk software
Actor | Evaluation | assistance provided | Assistance received | Immediate effects | Long term effects | Explanation of the researcher |
---|---|---|---|---|---|---|
Manager | - | - | - | demotivating | threatened the program | Felt threatened by new procedures |
Consultant | + | help choosing the right soft. involved himself | - | contributed to the start of the experiment | - | .... |
“Help-desk worker” | +/- | debugging of machines, little help with software | better job satisfaction because of the tool | slight improvement of throughput | is still overloaded with work | |
Users | +/- | A few users provided help to peers with the tool | debugging of machines, little help with software | Were made aware of the high amount of unanswered questions | slight improvement of work performance | .... |
Crossing between roles to visualize relations:
role 1 | trainers | role 3 | |
---|---|---|---|
rôle 1 | |||
trainers | “don’t coordinate very much” (1) | dosn’t receive all the information (2) | |
rôle 3 |
Often qualitative analysis stops with simple descriptive analysis (Daniel K. Schneider believes that this is the case with popular forum analysis). However, you also may use qualitative data to do some kind of "correlational analysis" as you typically do in quantitative data analysis.
expressed needs for training for a new collaborative platform (data from teachers’s interviews)?
case | var 1 | need for support | need for training | need for directives |
---|---|---|---|---|
case 1 | important | important | important | |
case 2 | not important | not important | not important | |
case 3 | important | important | important | |
case 4 | yyy | not important | not important | not important |
case 5 | ..... | important | important | important |
case 6.... | important | not important | not important |
e.g. cases 1,3,5 have association of "important", cases 2 and 4 have association of "not important".
training needs * support needs | need for support | ||
---|---|---|---|
yes | no | ||
need for training |
yes | 3 | 1 |
no | 1 | 2 |
We can observer a correlation here: "blue cells" (symmetry) is stronger than "magenta"!
You should check the data above to see if we did this right ...
Type 1: "anxious" |
Type 2: "dependent" |
Type 3: "bureaucrats" |
Type 4: "autonomists" | |
---|---|---|---|---|
case 1 | X | |||
case 2 | X | |||
case 3 | X | |||
case 4 | X | |||
case 5 | X | |||
case 6 | X | |||
Total individuals per type |
3 | 1 | 0 | 2 |
We can observe emergence of 3 types to which we assign "labels"
Note: for more than 3 variables use a cluster analysis program
The table shows co-occurrence between values of 2 variables. The idea is to find out what effect different types of pressure have on ICT strategies adopted by a school.
Strategies of a school | |||||
---|---|---|---|---|---|
Type of pressure | strategy 1:no reaction | strategy 2:a task force is created | strategy 3:internal training programs are created | strategy 4:resources are reallocated | strategy 5: ..... |
Letters written by parents | (N=4)(p=0.8) | (N=1)(p=0.2) | |||
Letters written by supervisory boards | (N=2)(p=0.4) | (N=3)(p=0.6) | |||
newspaper articles | (N=1)(p=100%) | ||||
type ... | ..... | .... |
See also: Methodology tutorial - quantitative data analysis (Cross-tabulation)
We would like to estimate the probability that a given value of the independent (explaining) variable entails a given value of the dependent (explained) variable.
Variable y to explain = Strategies of action | ||||
---|---|---|---|---|
Explaining variable x |
do nothing | send a mail | write a short tutorial | Total |
Students making indirect suggestion | 4 (80%) | 1 (20%) | 5 (100 %) | |
Students explicitly complaining |
2 (40%) | 3 (60%) | 5 (100%) |
Interpretation: “... if students explicitly complain, the tutor will react more strongly and engage in more helpful activities.”
There are no limits of what you can draw. Basically such analysis just use a more or less precise language to draw concept maps.
Below we just show two examples.
Example: Perception of a new program by different implementation agencies (e.g. schools) and its actors (e.g. teachers)
Message structure can be rendered with "message maps" (e.g. Pena-Shaff and Nicholls, 2004).
Of course, you may use another language, such as UML activity diagrams to draw such maps.
A simple causality graphs relates variables (concepts) with directed arrows.
There exist many variants. One older method is “operational coding” (Axelrod, 1976) and is somewhat popular in political science. It allows to compute outcomes of reasoning chains
Example: Teacher talking about active pedagogies, ICT connections, Forums
Depending on your discipline of reference, you may be familiar with different software families that help drawing graphs:
We also can recommend a good general purpose free diagram software:
Finally, for people who hate to draw, there exist useful free visualization software, in particular: