Chapter
Three: Developing the Coding Scheme
and Selecting the Pool
The Importance of
Theory:
No coding scheme can
provide a reliable content analysis unless it is grounded in theory. James
Potter and Devine Leviine-Donnerstein argue that “the challenge of
designing a content analysis can only be adequately met if researchers begin by
making decisions about the nature of the content they want to analyze and the
role of theory in their study” (259).
Thus, the first step
in developing a coding scheme for this project was to identify the content that
I was seeking to code and discover if there was adequate theory to guide the
search for that content. This step was accomplished in earlier chapters that
outlined the theoretical direction this study was taking and which revealed
that there is an adequate understanding of the discourse factors that
constitute a successful writing conference. Because Laurel Johnson Black, Don Murray, Carolyn Walker and
David Elias clearly identify the successful writing conference as one that is
student and paper centered rather than teacher centered, my code needed to
identify content that illuminated the focus of the conference as it progressed.
Potter and
Levine-Donnerstein also emphasize that reliability can not be counted on unless
the researchers and coders concentrate where possible on “latent
pattern,” which describes the broad characteristics and commonly shared
meaning of the “patterns embedded in the content to be analyzed”
(260). Coding rules that seek latent pattern avoid the “subjective
interpretations” which occur when coders are faced with the difficult and
complex task of differentiating among the minute nuances of a micro level
“projective” coding scheme. According to Potter and Levine-Donnerstein,
a coding scheme that looks at the pattern of content characteristics is more
reliable than a projective coding scheme that calls for a technical set of
definitions that must be interpreted item by item. Coders looking for latent
pattern are not as likely to be forced to “bring their own schema into
play” as they are asked to make judgments. “There is an increase in
the confidence with which we can construct rules that will guide coders in
capturing the essence of that which is being coded,” they report (262).
Theory is the basis for capturing this essence, they stress.
“Utterances”
as a Unit of Content
Once the theory for
this study was specified and understood, it became relatively easy to identify
the latent patterns of content that Mikhail Bakhtin terms
“utterances.” The process of creating the code and defining the
utterances was illuminated by theory, which for the purpose of this study is
best articulated by Johnson Black and by Don Murray, who links student-centered
conversation with the role that the affective dimension—those subtle and
often visual cues which keep a conversation going in a preferred
direction—plays in determining the success of a conference. Murray
defines the ideal conferences as one that is more reflective and less
“deductive” in nature, with the deductive conference being one in
which the student expects the teacher the teacher to fill his head with
knowledge (Teacher 163), as
opposed to the “engaged pedagogy” envisioned by bell hooks which “call[s] on students to be
active participants, to link awareness with practice” (14).
Looking at Johnson
Black and Murray, I determined there were two very broad categories of
utterances in a typical conference: those that fell within Johnson
Black’s affective dimension and those that were more deductive in nature.
For Johnson black, the affective dimension is so critical because it opens the
conversation to the personal, the student’s insights about her own
‘feelings” about the process. Talking about writing, without
talking about feelings, “abstract[s] a set of skills and a string of
words from what has been a personal process, a human connection,” she
warns (123). Alongside the affective topics, then are Murray’s deductive
topics, which may lend themselves to a more passive position on the part of the
student because they are product oriented, primarily about writing issues.
Johnson black calls these “discourse topics” and warns that
teachers tend to love them because they are “safe” topics. She
writes:
It is common for teachers to take a student’s
affective topic and transform it into a discourse topic. We submerge the
feelings in something safe, something more clearly about writing or reading or
skills and move away from feelings. (132)
Students are lured
by discourse topics too, notes bell hooks, because they provide safe ground
without the risk of the conflict or tension that topics expressing feelings or
which call for critical thinking engender. “The presence of tension—and at times even
conflict—often meant that students did not enjoy my classes or love me,
their professor, as I secretly wanted them to do,” hooks observes (42).
Affective topics and
discourse topics are not binary categories, it should be noted, but sorting the
utterances in this fashion seemed useful to the goal of determining difference
between traditional conferences and conferences in cyberspace, and because
Potter Levine-Donnerstein so closely link theory and reliability, it became
essential to clearly articulate the goal of this project. The purpose of
recording and creating transcripts was to determine if the affective dimension
that Johnson Black and Murray identify as fundamental to the success of the
one-to-one writers conference is transportable to a distance learning
environment. Because cyberspace is
lacking the visual cues—“body language” (Teacher 163)—that for Murray help create the affective
dimension of a conference, then the coding scheme clearly needed to identify
utterances that served specifically to keep conference power within the
student’s control as well as utterances that reflect Murray’s
conversational style (148) of conferencing. It also needed to differentiate
between utterances that relate to feeling and critical thinking about the
process and between those that are more product oriented.
Although earlier I
speculated that there might be some unexpected and new forms of discourse
between the two modes of conferencing that might create (or destroy) a new
manifestation of the affective dimension, a cursory examination of conference
transcripts revealed there were no unexpected forms of utterances. Therefore
the coding scheme needed to reflect the more subtle differences among
utterances, those that could be counted and sorted to measure the
“success” of a conference and which could be compared between face-to-face
conferences and chatroom conferences. To accomplish this, each category of
utterance was broken down into four subcategories to reveal the issues
identified by theory as key to a successful conference.
Why
“Utterances”?
A basic element,
then, of the coding scheme, was to categorize “utterances.” Mikhail
Bakhtin defines this term as meaning a unit of speech that is determined by a
change of speaking subjects (71). It is a particularly useful unit in the
context of conferencing, because of what Bakhtin describes as its responsive
nature:
Any utterance—from a short (single-word)
rejoinder in everyday dialogue to the large novel or treatise—has, so to
speak, an absolute beginning and an absolute end: its beginning is preceded by
the utterances of others, and its end is followed by the responsive utterances
of others (or, although it may be silent, others’ active responsive
understanding, or finally, a responsive action based on this understanding).
(71)
Thus the utterance
can be used not only to mark a change of speakers, but also to identify a type
of response. Theory shows that this responsive nature of dialog, identified by
Bakhtin, can be used to judge the success of the writing conference by
measuring how responsive the teacher is to the student and her paper.
Johnson Black notes
that Bakhtin uses this term in a context larger than just the immediate dialog.
“Any concrete utterance is a link in the chain of speech communication of
a particular sphere,” writes Bakhtin (91), and hence they are filled with
“echoes and reverberations” of preceding utterances from within
that sphere. Johnson Black observes that teachers participate in a chain of
utterances that becomes a fundamental aspect of a student’s own history
of exposure to teachers. Anything a teacher says is heard by the student in the
context of all the teacher utterances that have gone before. “That chain
of utterances seems to have injured more people than I can count,”
reports Johnson Black (54), alluding to the power differential that takes place
within a conference. She explains, “It is this power, where even if I
have no history with a student, she brings one with her and attaches it to me,
that invades conferences.”
Power and the
Successful Conference:
Quantifying this
invasion of power takes on new urgency as one moves into the cyberconference,
where the affective dimension is diminished and the outcome of a conference
less clear. Because pedagogy is ultimately about power, and because conferences
that put the student more in control of the dialog and of their own paper are
identified by Johnson Black and Murray as the most effective, it seemed
important to quantify the power differential as a means of objectively defining
the success of these conferences. Thus it was clear the coding scheme needed to
be able to indicate what Johnson Black calls “control of conference
talk” (52).
Developing the
Code:
In developing the coding scheme, the following were
taken into account:
1.
ability of the code to specify and
quantify a variety of utterances in a conference;
2.
ability of the code to describe the
affective dimension;
3.
ability of the code to determine control
of conference talk;
4.
ease of use for the coder;
5.
reliability.
Quantifying a conference, of course, is no easy
trick, but after conducting some 600 conferences over an 18 month period, I
feel they break down into a combination of teacher initiated comments, student
responses and student questions and comments that replicate themselves from
conference to conference in Bakhtin’s “chain of utterances.” My coding scheme reflects, but does not
rank or weight, what I’ve experienced to be the dominant component
utterances of a conference. The numbering scheme does not indicate any type of
order or occurrence or ranking of importance but rather serves as a means of
identifying for sorting and counting purposes the types of utterances I
routinely experience in a conference.
Number Codes:
However, there are some distinctions among the
utterances that allowed for broad groupings. Johnson Black notes that affective topics—which are
the subject of this study—are primarily about feelings, and discourse
topics are deductive topics primarily about writing (124). Thus I have numbered
and ordered the code to allow it to discover and distinguish between the two.
Hence, code items 1 through 6 are designed to quantify the affective dimension,
code items 7 through 10 reflect Don Murray’s deductive utterances, and
code 11 speaks directly to power
issues. Altogether I identified 11 types of utterances which I felt were
typical to my conferences. Combined with the four letters codes discussed my
coding scheme provides up to 44 separate codes, a number far fewer than the 146
codes used in a 1984 study by Sarah Warshauer Freedman who tracked the topics
of conversation in conferences to determine differentials between low achieving
and high achieving students and to examine gender and racial issues. Because my
study is looking strictly at the differences between what Stuart Blythe
describes as “scenes” (25), the number of codes did not seem as
important as the coding scheme’s ability to distinguish among different
styles of utterances. Blythe’s metaphor of scenes, from Burke’s
Pentad, stresses “interpretation” of “actions,” not
precise definitions, thus my code is more interpretative in nature—along
the lines of Potter and Levine-Donnerstein’s search for latent
pattern—rather than minutely descriptive.
The 11 utterances that I describe in chart 3.1 below
break a conference down into the components necessary to identify and interpret
the differences between a face-to-face conference and an online conference,
with the four letter codes per each utterance serving to define the origins and
directions of the utterances and to name their source of power.
Letter Codes:
Each code number identifying a type of utterance is
followed by a code letter to indicate who is making the utterance and, equally
as important, to indicate the “flow” of the control of the
conference. Developed along
parallel lines to facilitate ease of use, the code reveals this flow of control
with code letters that show who made the utterance and describe whether it was
an active or passive utterance.
Qualifying utterances as active or passive provides a
success indicator of the conference, because passive students, those who do not
fully encounter their own papers, are not actively participating in their own
learning. As Muriel Harris puts it, “Students who sit passively in a
conference are not likely to do a turnaround and actively engage in any
substantive revision” (40). hooks, Fife and O’Neill also clearly
articulate this need for active engagement by the student if learning is to
occur.
Looking at the code horizontally, a and b codes are those of teacher utterances and c and d
codes are those of student utterances. All a codes indicate utterances in which the teacher is
controlling the dialog by demanding information from the student, by initiating
and dominating a topic, or, in the case of code 6, by pushing the conversation along with verbal
listening cues. Codes with the letter suffix b indicate a more passive stance in which the teacher
is responding to the student. All c
codes indicate the student has shown initiative and is actively engaged in the
dialog, and d codes reflect
passive responses by the student.
Additional Coding:
To further clarify the power issue, which in turn
helps clarify the affective dimension and measures “success” of the conference by quantifying how
much control students have over their own conference and hence over their own
papers, code 11 specifically
defines points where the momentum of the conference has been interrupted and
forcibly changed to a new direction. Hence, code 11a indicates the teacher interrupted and regained
control of the conversation, and 11c indicates the student changed the flow. The b and d
codes in this case indicate the power shift went unchallenged.
Summary of the Code:
The coding scheme can be summarized as follows:
·
Numbers indicate type
of utterance.
·
Letters indicate who
made the utterance.
·
Codes 1 through 6 refer specifically to the affective dimension.
·
Codes 7 through 10 are deductive topics.
·
Code 11 relates specifically to transfer of power.
·
All a and b
codes are teacher utterances.
·
Add c and d
codes are student utterances.
·
All a and c
codes are active in nature.
·
All b and d
codes are passive in nature.
·
All a and c
codes are soliciting information or initiating a topic.
·
All b and d
codes are responses.
·
All a codes indicate the power balance is in favor of the
teacher.
·
All c codes indicate the power balance is in favor of the
student.
Word Count:
Another important
indicator of the power issue, of course, is simple word count, which is not
part of the coding scheme but is discussed here because the coding scheme is
designed to illuminate the distribution of the word count. Johnson Black
concedes that a conference in which the teacher does most of the talking may
provide the student with a lot of knowledge and give the teacher a certain
sense of satisfaction, but she doubts the effectiveness of such a conference:
In real time conferencing, we may sense that this
isn’t the close conversation we wanted, but at least we have an
interested audience and a bit of give and a lot of take with a student. And so
we often settle for that, hoping that at least someone learned something, and
we move on. The written product may be better, ultimately, but whether the
student is a better writer is debatable. (53)
Johnson Black notes
that the issue here is not how much knowledge the student was exposed to or how
the student feels about the conference. A student is likely, she notes, to be
“impressed that the teacher took the time to talk with her and considered
her paper so thoroughly” (53). Rather, the issue points to the
student’s ability to utilize that knowledge effectively when it was delivered
in such a one-way fashion. “How much will she remember when she has
played so passive a role,” Johnson Black asks (53). Johnson Black devotes
a section of the appendix of her book to words counts, revealing that in the 14
conferences she studied, teacher talk comprised between 59.6 and 97.6 percent
of the conference totals. That means that in some cases students in her study
did as little as 2.3 percent of the talking and never more than 40.2 percent.
The average student participation given as a share of the total word count
among her 14 conferences was 20.2 percent. We will return to this figure in a
later chapter.
Word count, of
course, was the simplest form of quantification of the conferences. Microsoft
Word was used to record all transcripts. After determining a total word count,
all teacher utterances were eliminated from the document and another count was
made, providing a student percentage of total words.
Other considerations:
In addition to describing and qualifying the
utterances, it was also important to ensure that the coding scheme leant itself
easily to quantifiable analysis. Thus, the numbering and lettering scheme was
designed to be sortable, findable and countable in Excel spreadsheets. An
appropriate recording template was also developed in Excel.
In order to assess
the reliability of the codes, a draft coding scheme was subjected to an initial
run-through of a 279-line transcript from a face to face conference to ensure:
• that the
majority of the lines could be coded;
• that the
goal of the code—to illuminate power issues and measure the affective
dimension—was reflected in the process;
• that the
coding could proceed without undo subjective judgment.
With the above three
issues in mind, the code was revised and then applied to a similarly lengthy
online conference with a similar student. More revisions were made.
After the coding
scheme was finalized, four new transcripts were selected for the final coding
(see Design of the Study below).
A larger, more
scientific project would have included more transcripts, more coders and
neutral testing of the codes by two or more coders.
Graphic 3.1 below shows the coding scheme after
initial testing.
|
code |
|
a |
b |
c |
d |
|
|
utterance
topic |
instructor… |
instructor… |
student… |
student… |
|
1 |
personal/ice-breaking |
active |
passive |
active |
passive |
|
2 |
discussion
about process |
active |
passive |
active |
passive |
|
3 |
evaluation
of conference |
active |
passive |
active |
passive |
|
4 |
critical
thinking about topic |
active |
passive |
active |
passive |
|
5 |
encouraging
comments |
active |
passive |
active |
passive |
|
6 |
listening
cue |
providing |
|
providing |
|
|
7 |
discussion
of writing issues |
active |
passive |
active |
passive |
|
8 |
evaluative
comments |
active |
passive |
active |
passive |
|
9 |
corrective
suggestions |
active |
passive |
active |
passive |
|
10 |
procedural
issues |
active |
passive |
active |
passive |
|
11 |
control
shift |
active |
passive |
active |
passive |
Graphic 3.1 – coding
scheme
Graphic 3.2 presents
the results template, which is an Excel spreadsheet that can be counted and
sorted. Each line of the transcript was numbered and coded. In the rare event
that an utterance ended and a new one began on the same line, the new utterance
was used to determine the line code.
|
line# |
number
code |
letter
code |
|
1 |
|
|
|
2 |
|
|
|
3 |
|
|
|
4 |
|
|
|
5 |
|
|
|
6 |
|
|
|
7 |
|
|
|
8 |
|
|
|
9 |
|
|
|
10 |
|
|
|
11 |
|
|
|
12 |
|
|
|
13 |
|
|
|
14 |
|
|
|
15 |
|
|
|
16 |
|
|
|
17 |
|
|
|
18 |
|
|
|
19 |
|
|
Graphic 3.2 – Results Template
Discussion of the Initial Testing:
Roughly two to two and one-half hours were required
for coding the initial transcript with the draft code. It quickly became
apparent that there were several types of utterances not included in the draft
code, including generic comments about procedure and classroom issues, and at
first I was tempted to overlook them. However, utterances in which the teacher
specifically called for questions and the student either responded or failed to
respond seemed impossible to overlook because they clearly reflected the power
issue. They were added to the end of the code and not included in the codes for
the affective dimension because they measure deductive topics—mechanics
of the paper.
It was also apparent that the horizontal
codes—the letter codes—were more easily applied that the vertical
codes. The active or passive positions of the student and teacher were always
easily identifiable, whereas the nuances between some of the number codes were
not as easily delineated.
At first this
concerned me, but focusing on the goal of the project—to measure the
affective dimension and to illuminate power issues—I realized I was not
trying to actually dissect a conference into its component parts. Hence, as
long as the codes clearly fell within the first half of the code (the affective
dimension) or within the second half (deductive and power topics) the results
were measurable with validity for this project.
With the above in mind, I compressed the draft code
after some utterances did not appear in the initial round of testing and after
I realized it was not necessary to make fine distinctions within the groupings
of utterances. I was finally forced to add a code in the discourse grouping for
procedural issues when I discovered that the chatroom conferences were often
focused on procedure. Because a personal goal of this study has been to improve
my own conferencing techniques, I also added a code in the affective section
after discovering the relationship expressed by bell hooks between critical
thinking and classroom pedagogy, hoping to discover just how much critical
pedagogy occurred in my own conferences.
I also had to
rethink the horizontal terminology of the code. Looking back on the
conversational theory articulated by Fife, O’Neill and Straub, I realized
after my initial testing that “inviting” a student into the dialog,
and then allowing that student to run with the invitation, was not at all a
passive response on the part of the student as my draft coding scheme, with its
emphasis on descriptive terms such as “soliciting” (in the active
column) and “responding” (in the passive column) seemed to imply.
To reflect this new understanding of the theory, I had to allow the code to
show where the student was in charge of the conversation, even if her response
came as acceptance of my own initiative. This issue was particularly evident in
the trial coding of an online student, who often simply responded to my invitation
to dialogue with a simple “okay,” or “yes.” These
one-word responses were clearly passive, whereas if a student took off with my
invitation and re-entered Straub’s “chaos of revision,” as
the test seemed to reveal was more often the case in a face-to-face conference,
it was indeed an active position on the part of the student. With this new
realization in mind, I ended up clarifying the horizontal terminology to simply
reflect who made the utterance and whether the nature of the utterance was active
or passive.
Design of the
Study:
For the purposes of this study I tape-recorded 31
face-to-face conferences late in the semester. Other conferences were recorded
earlier in the semester, but they were not studied because I felt my own
conferencing mode matured as the semester progressed, as the students got more
comfortable with conferencing, and most importantly, as both the student and I
got more comfortable with the tape recorder—earlier I had discovered that
initially the tape recorder seemed quite intrusive, but by the end of the
semester neither I nor the students seemed aware of the recorder. Of those 31
conferences, 19 students gave explicit permission so that I could study the
tapes and report on my findings.
From my online class, I conducted 14 conferences late
in the semester. Eight of those students provided their consent for me to study
the text transcripts that were automatically (and unobtrusively) recorded in
our one-to-one chatroom conference sessions.
In order to determine how many transcripts to study
and what type of selection criteria should be used, I turned to other research
in the field to see if there were models I could follow. Johnson Black admits,
without apology, that her own research is somewhat informal. “Certainly
it was a small study of a homogenous group,” she reports (9). “But
like most teacher research, it grew from an immediate context,” she adds.
Johnson Black initially studied conferences with six
of her own students, all of whom were traditional first-year composition
students, and half of whom were male and the other half female. She made no
other attempt to differentiate among the students beyond gender. She later
expanded her research to include other teachers and other students, studying an
additional 14 conferences, but she offered no methodology or justification for
arriving at this number.
Jean Ketter and Jonelle Pool, in their recent study
of the impact of high-stakes testing in the high school composition classroom,
took a more scientific and rigorous approach. Of the 23 students available to
them, they chose five students, “identified by the classroom teachers as
representing a range of abilities in writing” (356). That figure
represents 22 percent of their available pool. A similar percentage of my
largest pool, that of my face-to-face students, would be four students.
Including four students from my online classes would increase the total beyond
Johnson Black’s initial study and would amount to 29.6 percent of my available
pool.
Sondra Perl’s famous 1978 study examining the
composing process of low-level college writers dealt with just five students
(as did the Ketter and Pool study), so in comparison with existing studies, my
sample of eight students—nearly 30 percent of my available
pool—seems both adequate and generous (although certainly not as generous
as Sarah Warshauer Freedman’s 1984 study which coded and analyzed 48
writing conferences).
Because of the nature of the course sequence I teach
and the nature of the University, which is a commuter-campus drawing a wide
range of non-traditional college students, I sought less to rely on perceived
writing ability to determine my sample and more on the nature of the students
themselves and their stance towards writing. From my face-to-face students I
picked an older student who was a serious writer but who displayed a lot of
concern with correctness, a traditional teenaged student who wanted to get a
good grade but showed no real interest in writing and who felt out of place in
college, a twenty-something returning student who considered herself a very
poor writer, and a traditional teenaged student who considered herself an
excellent writer and who liked to write.
No considerations of gender or race were made, for several
reasons:
·
these issues have been
thoroughly covered in other studies (particularly by Freedman);
·
I was not aware of the
race of my online students;
·
gender stereotypes did
not match with my experiences in the classroom, particularly in the online setting;
·
in trying to determine
control factors, categorizing students in terms of their feelings about their
writing and in terms of their level of self confidence as a student seemed more
important to me than seeking some kind of equity based on social markers;
·
this study does not
attempt to draw any conclusions based on gender or race.
My selection criteria certainly doesn’t exhaust
all the possibilities, but it is one that transferred easily to the online
classroom: I was able to pick four students from my online class who roughly
matched the criteria established for the face-to-face conference analysis. This
ability to match students across the divide of the classroom setting seemed
important so that the analysis could compare face-to-face conferencing with
online conferencing in an equitable fashion.
In the analysis that follows, coded labels were
applied to the student subjects to provide:
·
Anonymity to the
student.
·
An indication of the
classroom setting of the conference.
o
F2F means the
conference was in a face-to-face setting
o
OL means the conference
was in an online chat room
·
An indication of the
type of student.
o
student1 is an older,
active student anxious about writing “correctness”;
o
student2 is a young,
passive student concerned only with grades;
o
student3 is an older,
passive student with low expectations;
o
student4 is a an young,
active student with high expectations.
For example, if a chart or references indicates F2Fstudent1, this label is referring to a face-to-face, older
student who is actively concerned about writing issues. Hence, OLstudent1 would refer to his/her counterpart in my online
class.
Because of the deadlines and constrictions of a
Master’s Thesis project, this study was limited in the end to the coding
and analysis of four transcripts, two from the face to face pool and two from
similar students in the online pool. As described above, students 2 and 3 were
picked from each pool.
The following two chapters present the results of the
coding, with Chapter Four providing discussion about the analysis of the coding
and presenting tables quantifying the coded data. Chapter Five interprets the
results.