00197
1
2
3
4 VOLUME II
5 (Afternoon Session - November 15, 1999)
6
7
8
9
10 HUMAN TUMOR ASSAY SYSTEMS
11
12 HEALTH CARE FINANCING ADMINISTRATION
13 Medicare Coverage Advisory Committee
14 Laboratory & Diagnostic Services Panel
15
16
17
18
19
20 November 15 and 16, 1999
21
22 Sheraton Inner Harbor Hotel
23 Baltimore, Maryland
24
25
00198
1 Panelists
2 Chairperson
John H. Ferguson, M.D.
3
Vice-Chairperson
4 Robert L. Murray, M.D.
5 Voting Members
David N. Sundwall, M.D.
6 George G. Klee, M.D., Ph.D.
Paul D. Mintz, M.D.
7 Richard J. Hausner, M.D.
Mary E. Kass, M.D.
8 Cheryl J. Kraft, M.S.
Neysa R. Simmers, M.B.A.
9 John J.S. Brooks, M.D.
Paul M. Fischer, M.D.
10
Temporary Voting Member
11 Kathy Helzlsouer, M.D.
12 Consumer Representative
Kathryn A. Snow, M.H.A.
13
Industry Representative
14 James (Rod) Barnes, M.B.A.
15 Carrier Medical Director
Bryan Loy, M.D., M.B.A.
16
Director of Coverage, HCFA
17 Grant Bagley, M.D.
18 Executive Secretary
Katherine Tillman, R.N., M.S.
19
20
21
22
23
24
25
00199
1 TABLE OF CONTENTS
Page
2 Welcome and Conflict of Interest Statement
Katherine Tillman, R.N., M.A. 5
3
Opening Remarks & Overview
4 Grant Bagley, M.D. 10
5 Chairman's Remarks
John H. Ferguson, M.D. 28
6
Brian E. Harvey, M.D., Ph.D. 30
7
Open Public Comments & Scheduled Commentaries
8 Frank J. Kiesner, J.D. 48
Larry Weisenthal, M.D. 57
9 Randy Stein 92
Richard H. Nalick, M.D. 99
10 William R. Grace, M.D. 108
John P. Fruehauf, M.D., Ph.D. 110
11 James Orr, M.D. 127
Robert M. Hoffman, Ph.D. 131
12 Andrew G. Bosanquet, Ph.D. 136
David Alberts, M.D. 142
13 Robert Nagourney, M.D. 147
David Kern, M.D. 159
14 Daniel F. Hayes, M.D. 168
Bryan Loy, M.D. 178
15
LUNCH 196
16
VOLUME II
17
Open Public Comments & Scheduled Commentaries
18 Edward Sausville, M.D. 201
Harry Handelsman, D.O. 227
19 Harry Burke, M.D., Ph.D. 234
Mitchell I. Burken, M.D. 262
20
Open Committee Discussion 304
21
Day One Adjournment 330
22
23
24
25
00200
1 TABLE OF CONTENTS (Continued)
2 VOLUME III
3 Opening Remarks - Introduction 336
4 Open Committee Discussion 337
5 Motions, Discussions and
Recommendations 425
6
Adjournment 487
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
00201
1 PANEL PROCEEDINGS
2 (The meeting was called to order at
3 1:16 p.m., Monday, November 15, 1999.
4 DR. FERGUSON: Dr. Sausville?
5 DR. SAUSVILLE: Good afternoon, all.
6 And if I could have the first overhead, this says
7 who I am, and the general topic that I hope
8 you're interested in hearing about this
9 afternoon. Anyway, my task this afternoon is to
10 provide an overview, at least from the
11 perspective of the preclinical therapeutics
12 development program of NCI of antitumor drug
13 sensitivity testing. And I will approach this,
14 therefore, from the standpoint of one who uses
15 tests like this, and indeed, in some cases
16 actually tests that have been used for this
17 purpose for the preclinical selection of drugs
18 for more detailed evaluation, as well as from the
19 perspective of an oncologist who has occasionally
20 thought about using these tests in the treatment
21 of patients. Next.
22 So the basis for this issue in cancer
23 derives directly from the infectious diseases
24 experience, wherein a number of different disease
25 categories, such as tuberculosis, where it's well
00202
1 established that one has to establish that a
2 particular patient's infected bacillus is
3 sensitive to the agents, and a number of
4 non-tuberculosis indications, which would
5 include, for example, pyelonephritis or
6 endocarditis, where it is well established from
7 the standpoint of standard medical practice that
8 such sensitivity tests are that valuable. Next.
9 The assays as applied to cancer ideally
10 would have 95 percent sensitivity and
11 specificity, and short of that goal, would
12 hopefully be better in predicting outcome than
13 the empirical choice of the physician. And the
14 essence of the question from an oncological
15 standpoint, therefore, is whether a particular
16 test conveys information over and above what is
17 implicit in the histologic diagnosis of a
18 patient's tumor. Ideally the test would be
19 biased in favor of detecting sensitivity rather
20 than resisting, for this reason, and ultimately,
21 these tests should be able to demonstrate an
22 impact on ultimate outcome, as opposed to simply
23 response, since in oncology, good outcome begins
24 with the response, it does not end with a
25 response. One ultimately has to have evidence of
00203
1 tangible clinical benefit that changes outcome.
2 Next.
3 So among the specific assays that
4 through the years have been utilized include the
5 by now classical Hamburger Salmon clonogenic
6 assay, wherein tumors that were biopsied for
7 example, were disaggregated, plated in agarose or
8 other solid media after relatively brief
9 exposures to drug, and ultimately colonies
10 counted in 14 days. There have been
11 modifications to this, most notably the capillary
12 tube modification used by Von Hoff and
13 colleagues, and it seems to increase the number
14 of patients for which valuable data are
15 obtained. Modifications of this also include
16 radionuclide based assays, in which radioactive
17 thymidine is added after three days and thus,
18 although it is a soft agar base, one can obtain
19 information after shorter periods of time. And
20 there are also non-agar based assays assessing
21 radionuclide uptake in mass culture. Next.
22 Technical problems with clonogenic
23 assays include a number of artifacts intrinsic to
24 the practice of the assay, including clumping of
25 tumor cells, the potential of growth perturbation
00204
1 from manipulation of potential clonogenic cells,
2 reduced nutrient uptake from nonclonogenic cells,
3 with increase in the size of colonies that grow
4 out in the treated cells. Counting evaluations
5 with a potential large coefficient of variation,
6 and poor cloning efficiencies. And a major
7 limitation in the widespread use of this
8 technique relates to the fact that in many
9 instances, the majority of the specimens are not
10 actually valuable, and there is the inability of
11 this type of assay to score small numbers of
12 resistant cells, which in a clinical scenario are
13 thought to translate new ultimately resistance to
14 therapy, of the sort that is manifest by the
15 subsequent relapse of a patient with drug
16 resistant tumor. Next.
17 In various reviews, actually extending
18 from the initial use of this technique into the
19 early '80s, the cumulative experience is that a
20 relatively small fraction of patients actually
21 have colony growth. And the data that is
22 tabulated here is contained in the references
23 that were indicated. But also, there is the
24 finding that the tests are clearly better at
25 predicting negative or resistant assays, than
00205
1 sensitive assays, such that for example, if one
2 looks at those specimens that were sensitive in
3 vitro as opposed to sensitive in vivo, we have a
4 60 percent true positive, with a range of 47 to
5 71 percent. In contrast to those specimens that
6 were resistant in vitro and resistant in vivo,
7 where there was, as you can see, a 97 percent
8 true negative information. Next.
9 This led to a so-called perspective
10 evaluation of chemotherapy selection utilizing a
11 clonogenic assay, as opposed to the choice of a
12 clinician. And again, this was published by von
13 Hoff and colleagues in 1990 in the Journal of the
14 National Cancer Institute. And in the 133
15 patients randomized in a single agent therapy, of
16 those where the therapy was assigned by a
17 clinician, one had one partial response, and in
18 19 of 68 that were possible to have an assay
19 directed assignment, there were four partial
20 responses. Certainly there was no evidence that
21 this was statistically different and one
22 concluded, or this article concluded, that what
23 one might conceive of potentially a somewhat
24 improved response rate, did not translate into
25 any noticeable effect on survival. And again,
00206
1 approximately a third of the tests could not be
2 evaluated, and there were clearly no evidence of
3 survival in patients either treated according to
4 that which was recommended by the physician, or
5 all patients that were compared versus the test
6 population. Next.
7 Other specific assays which have come
8 to the fore in an effort to meet some of the
9 clear difficulties in the widespread use of the
10 clonogenic assay include the so-called
11 differential standing cytoxicity assay, or DiSC
12 assay, pioneered by Weisenthal and colleagues.
13 And here, one is essentially assessing the effect
14 on whether or not cells remain alive after short
15 periods of culture after exposure to a drug.
16 Thus, either marrow, buffy coat or a tumor
17 suspension after disaggregation, can be treated
18 with drug for anywhere from one hour to four
19 days. Interestingly, the quantification was
20 aided by the addition of so-called duck red blood
21 cells, which are easily distinguishable
22 microscopically, a dye added, and then after a
23 cytospin, one can either assess the dead cells
24 per duck red blood cells, or live cells per duck
25 red blood cells, based on the differential
00207
1 staining of live and dead cells with either
2 fast-green, which stains dead cells, or HD, which
3 stains live cells.
4 Over variants of this approach include
5 the so-called MTT assay, which is a dye that
6 depends for its coloration properties as to
7 whether or not it is reduced by living
8 mitochondria, or a fluorescein assay, where live
9 cells take up a dye, hydrolyze to it in a point
10 that is detected by a change in fluorescence.
11 But all of these techniques, again, don't then
12 depend on the growth out of clonogenic cells, but
13 rather allow a relatively short term exposure to
14 the drug to define whether there is an effect on
15 the viability of the cells. Next.
16 When this assay was, and again, this is
17 in reference to the DiSC assay, was applied
18 initially to hematologic neoplasms, there was
19 clear evidence that there was increased cell
20 survival, that is to say resistance in patients
21 who ultimately were not responsive to
22 chemotherapy that was assigned on the basis of a
23 knowledge of the tests. So in that respect, the
24 assay was certainly suggestive that it might
25 eventually correlate with clinical outcome. And
00208
1 in addition, there was a fairly good
2 correspondence, again, with delineation of true
3 positives and true negatives by this assay.
4 Next.
5 When this assay was applied to the
6 somewhat more difficult clinical category of
7 patients with lung cancer, here in an initial
8 study with non-small cell assay, the DiSC assay
9 was performed assessing sensitivity to ten drugs,
10 treating with a regimen that ultimately
11 incorporated the three most sensitive agents. In
12 this series of 25 patients, there was a 36
13 percent partial response rate with a median
14 duration of 6.5 months and with the, if you want
15 to read, it looks to be responders and a median,
16 or I should say a median survival of about seven
17 months, with an overall of about 12 months.
18 There was clearly a threefold lower assay
19 survival. That is to say, people with greater
20 cell kill in responders versus non-responders.
21 However, these authors concluded that outcome as
22 measured by response rate and survival is within
23 the range reported by the literature, that is to
24 say, even though you can detect this difference,
25 the issue of whether or not it ultimately caused
00209
1 a different outcome that might be afforded by
2 treating with drugs that would be available from
3 the literature and without knowing the patient's
4 histologic diagnosis was not apparent. In
5 addition, some drugs clearly had a much greater
6 discordance in the predictive value of the test.
7 Thus for example, 5 fluorouracil did
8 not seem to have any ultimate value in its
9 performance, and on the other hand, etoposide,
10 behavior to etoposide, was essentially predictive
11 of the behavior of all of the of the drugs. And
12 actually from a scientific perspective, we now
13 recognize that since many of these agents act by
14 inducing apoptosis, this actual result
15 retrospectively, is not that surprising.
16 Interestingly, this paper also
17 introduced the concept of so-called extreme drug
18 resistance. That is to say, you can define
19 patients who had greater than one or more
20 standard deviations resistance than the median in
21 the population, and these patients essentially
22 had zero percent response to any of the agents.
23 Next.
24 This assay was also applied in a study
25 that was recently published from the NIH, and
00210
1 attempted to individualize chemotherapy for
2 patients with non-small cell lung cancer. And
3 from a population of 165 study patients, 21
4 received DiSC based regimens, and these had a 9
5 percent partial response rate. Whereas, 69
6 patients received empiric treatment with
7 etoposide and cisplatin; these had a 14 percent
8 partial response rate. And ultimately, the
9 survival of in vitro best regimen was comparable
10 to what one would have expected from the
11 empirically chosen chemotherapy.
12 Interestingly, this study also revealed
13 an issue that also has to come up in any test in
14 which there is a second or subsequent procedure
15 to obtain tissue, in that the survival of
16 patients who had any in vitro test was actually
17 worse than those without, and this implies
18 potentially that those people that had a
19 sufficient volume of tumor to have the tests had
20 an intrinsically less survival than those that
21 did not. Next.
22 And the last clinical study that I'll
23 touch on also emanated from the NCI and was
24 published in 1997. This attempted to use the
25 DiSC assay in limited stage small cell, and here
00211
1 we turn the somewhat, and consider the use of the
2 test in what may be considered in its most
3 favorable scenario, because this disease which is
4 traditionally, and now actually standardly
5 treated with the combination of radiation therapy
6 and chemotherapy, would potentially treat
7 empirically with a regimen known to produce a
8 high level of response, and then come back after
9 finishing consolidation with radiation
10 sensitivity with either a chosen regimen based on
11 the in vitro sensitivity or a standard approach
12 using an additional three drugs that the patient
13 had not seen previously that would be regarded as
14 standard or part of the standard care of patients
15 with small cell lung cancer.
16 And in this study, there was actually a
17 trend towards somewhat improved survival in
18 patients who could actually receive the in vitro
19 best regimen, but it certainly was just a trend.
20 And most interestingly, of the 54 patients that
21 were entered, the minority of the patients could
22 actually be successfully biopsied in this very,
23 shall we say well coordinated, well resourced
24 clinical trials scenario. Next.
25 So in terms of summarizing what I list
00212
1 here as my own disinterested perspective on
2 whether or not chemosensitivity testing is what
3 one would might consider to be ready to prime
4 time in widespread use, I would offer that from
5 my perspective, no method has emerged as a
6 quote-unquote gold standard, owing to
7 methodologic variation and the definition of what
8 constitutes resistance or sensitive tests. The
9 unfortunate fact that one cannot get reliable
10 data from most if not many patients. And in the
11 few completed prospective or randomized trials,
12 there is little assurance that ultimately there
13 is a difference effected by the test.
14 What we ultimately need if tests of
15 this nature are to be potentially useful, is
16 probably better drugs, because in point of fact,
17 since most of the drugs are unfortunately
18 inactive in many of the diseases in which these
19 tests would be used, knowing that they won't work
20 is not actually terribly valuable.
21 We need a method that is applicable to
22 all specimens obtained in real time with the
23 diagnostic specimen; that is to say, to require a
24 second test, or second procedure, in order to
25 obtain the specimen, inevitably indicating or
00213
1 introduces potential biases in studies related to
2 those patients who could withstand or undergo
3 these procedures, as well as of course, making
4 the test, the performance of the test more costly
5 than one might potentially desire.
6 But on the other hand, I think the
7 future holds potentially with newer approaches,
8 including gene expression arrays, serial analysis
9 of gene expression, there may be better, and
10 hopefully more useful techniques to assess this
11 in the future. But whatever the test, be it some
12 permutation of a currently available test, or one
13 of the newer methodologies here, its ultimate
14 value should be established in prospective
15 randomized trials where one uses the
16 diagnostically guided as opposed to the empirical
17 treatment before assessing whether or not it is
18 openly valuable.
19 And I thank you for your attention.
20 DR. FERGUSON: Thank you, Dr.
21 Sausville. I think you've gone in shorter time
22 than even I asked for, and so I'll open for a
23 question or comment. Yes, Dr. Hoffman?
24 DR. HOFFMAN: Yes. I would like to ask
25 Dr. Sausville his opinion about the assays that
00214
1 were discussed this morning, based on three
2 dimensional culture and other new third
3 generation techniques that address these problems
4 and have shown to be able to assess greater than
5 95 percent of the patients' specimens, have shown
6 survival benefit, have shown very high
7 correlation to response. I would like Dr.
8 Sausville's comments on this morning's talks.
9 DR. SAUSVILLE: Again, I wasn't here
10 this morning, and indeed, my brief was not to
11 comment on specific assays from this morning's
12 activities, but to offer an overview of problems
13 in the field in general. And I would certainly
14 say that if the tests that were proposed this
15 morning seem of interest, the real question is
16 have they been evaluated in prospective
17 randomized studies. Because unless they have
18 not, or I should say until they have, one, and
19 since as far as I'm aware, they have not, it
20 would be, I think premature to conclude that they
21 are, therefore, of widespread general use.
22 DR. FERGUSON: Dr. Weisenthal?
23 DR. WEISENTHAL: Now would be as good a
24 time as any to address the issue of the
25 requirement of prospective randomized trials for
00215
1 acceptance of this technology. I think it's a
2 very important issue, several speakers have
3 raised it, and the issue is this: Should these
4 tests be used in clinical medicine until it has
5 been established in prospective randomized trials
6 that patients treated on the basis of assay
7 result have a superior therapeutic outcome to
8 patients treated without the assay result? The
9 cop-out way to answer this, which I'm not, this
10 is not my answer to it, but what I could say if I
11 wanted to cop out, and it's perfectly valid, is
12 that never has the bar been raised so high for
13 any diagnostic test in history.
14 Dr. Sausville began his talk by
15 pointing out bacterial cultures done in
16 sensitivity testing, including one of his
17 examples was serum bactericidal testing. Serum
18 bactericidal testing, for those of you who may
19 know it, is something that Medicare does
20 reimburse for. It's very controversial, it's
21 much more controversial actually than cell
22 culture drug resistance testing. The performance
23 characteristics are certainly inferior based on
24 sensitivity and specificity. And furthermore,
25 there has certainly never been a prospective
00216
1 randomized trial showing survival advantage or
2 therapeutic outcome, you know, higher cure rate
3 or anything, whether you use serum bactericidal
4 testing or not, or any other form of antibiotic
5 sensitivity test.
6 We're talking about laboratory tests,
7 not a therapeutic agent, and I think that one
8 would be advised, at least first of all, to judge
9 them on the basis of the way that other
10 laboratory tests have been judged, and that is,
11 do they have acceptable accuracy, sensitivity and
12 specificity?
13 However, moving on to the question of
14 the prospective randomized trial, all of us, no
15 one more than those of us who have been working
16 in this field for 20 years, would love to see
17 prospective randomized trials, physician's choice
18 therapy versus assay directed therapy. This has
19 been the Holy Grail. I hope before I die, I will
20 be able to participate in such a trial. I
21 mentioned earlier, the fact is that there have
22 been a lot of energetic, very talented people,
23 that have devoted their careers to this, and the
24 best example is Dr. Dan von Hoff, who is the most
25 energetic. He and I were clinical oncology
00217
1 trainees together at the National Cancer
2 Institute. We both started working in this field
3 in the same lab at the same time. And Dan had --
4 you know, my CV lists about 50 publications;
5 Dan's CV is probably closing in on 2,000. And
6 he's organized more prospective randomized trials
7 and things like that. He was unable to
8 successfully get a study initiated and patients
9 accrued, and completed. I have devoted enormous
10 amounts of effort to getting those trial done,
11 and for one reason or another, they didn't accrue
12 patients, and things like this.
13 I want to point out that in Medicare,
14 Medicare has a problem, and the problem is not in
15 the year 2000, the years between 2010 and 2015.
16 The budgetary crunch in Medicare is going to come
17 in 2010 and 2015 when those of use who are now 52
18 years old are going to be 60, 65, 75 years old,
19 and we're going to be getting cancer. What's
20 going to happen over the next ten years is that
21 there's going to be an ever increasing array of
22 partially effective and very expensive cancer
23 treatments. We're seeing that now. Drugs are
24 being approved at a very rapid pace. We don't
25 have a clue how to use them.
00218
1 We brought up the idea about using the
2 test as a litmus test, like should you pay for
3 the therapy. Well, the only way that you're
4 going to be able to ever use the test as a litmus
5 test is if you do the prospective randomized.
6 And I would submit to you that the way to get the
7 prospective randomized trials done is as
8 follows: Look at the data that you heard about
9 this morning. Surely, you must be convinced that
10 there is a germ of truth in this. You know,
11 there is a consistent, overwhelming, and I think
12 that study after study is showing that these
13 tests do predict, they can identify the
14 difference between good treatments and bad
15 treatments. So it is not much of a leap of faith
16 to say that if only someone could do the trials,
17 then there's a good chance that they would turn
18 out to be positive, and if they do turn out to be
19 positive, by the year 2010, we will have a
20 wonderful tool to triage therapy, to triage
21 patients, right at the time when Medicare most
22 needs it, when the budgetary crunch comes, when
23 we've got all these expensive cancer therapies.
24 You know, I gave you the example of the five
25 patients treated with the bone marrow transplants
00219
1 at $200,000 a patient, who did not benefit from
2 that, who then got an assay and had a great
3 result. What if they had gotten the assay? It
4 has the potential to be enormously cost
5 effective. But the only way that it will be used
6 in that way is if you do the trials, but the only
7 way -- it's a catch 22 -- the only trials will
8 ever get done -- I personally believe that if
9 Medicare approves this, it will be the shot heard
10 round the world, Swann, ECOG, CLGB, they will be
11 lining up to do trials. You guys, you know, come
12 back and maybe approve it conditionally, come
13 back in five years and see what's happening.
14 DR. BAGLEY: Well, you know, today --
15 it brings up an interesting point, and I think
16 you bring up the comment that, you know, never
17 has the bar been this high. Well, I would take
18 exception to that. I think the bar is not any
19 higher for this than for anything else that we
20 are currently looking at. And it is not
21 something that we are not used to hearing for
22 other things too, and that is, gee, we're paying
23 for things that were never subjected to any
24 scrutiny, so why should we subject it to
25 scrutiny. I mean, that's -- we hear it all the
00220
1 time, and that's just not going to work in this
2 day of evidence based medicine. And I'll tell
3 you because, you know, how much it costs isn't an
4 issue that we're really here to talk about today.
5 Because, you know, two years HCFA reorganized
6 and changed the whole focus of coverage, moved
7 the coverage office away from that part of HCFA
8 that pays for things and looks at program
9 integrity, and moved it into the place at HCFA
10 that looks at quality and clinical standards.
11 And that's exactly the focus we ought
12 to be doing because, you know, what it boils down
13 to is, it's not just why not pay for it, it
14 doesn't cost that much, or it might save a little
15 money. But it's let's pay for it because it's
16 the right thing to do, and represents quality
17 medicine. And when that happens, you know, we
18 shouldn't just pay for it, we should pay for it,
19 we should promote it, and perhaps, if the
20 evidence is there, we ought to insist on it. I
21 don't think the clinical community or the
22 beneficiary community would tolerate us insisting
23 upon a pattern of behavior, or even promoting a
24 pattern of behavior, without evidence, and so why
25 should we pay for it without evidence? And
00221
1 that's the change we're trying to make, that's
2 been the whole point of changing the coverage
3 process, putting together advisory committees
4 like this, is to say, let's look at evidence and
5 let's make decisions about what we pay for based
6 on quality, and once we know what quality is,
7 let's not just pay for it, let's not stop there,
8 but let's pay for things that we are willing to
9 promote and perhaps even insist upon. And so,
10 that is the reason for the focus on evidence, and
11 it's going to be there. And the fact that we may
12 not have subjected past technologies to the same
13 evidence, doesn't mean we can't go back and look
14 at them, time willing, but it doesn't mean we
15 should lower the bar for new technologies.
16 DR. FERGUSON: Do you want to respond a
17 minute, Dr. Sausville?
18 DR. SAUSVILLE: Yes, I do wish to
19 respond to that. And I want to thank you for
20 that perspective, because clearly, there is
21 nothing that ever, that doesn't lack for good
22 intentions. Clearly, the desire to convey useful
23 patient benefit goes without question. And the
24 efforts that were cited over the past two decades
25 have really been enormous efforts in that regard.
00222
1 But one distinction that I must point out is
2 that when one considers the bacteriologic
3 analogy, the diagnostic specimen, that is to say
4 the bacteria growing in a bottle, equals the test
5 specimen. So that is one intrinsic difference.
6 In many cases, cancer related
7 sensitivity testing requires additional efforts
8 to get and process tissue different than the
9 routine. So it is a point where the analogy is
10 not exactly apt, I think. And you quoted the
11 endocarditis issue, and you're right. It is
12 controversial as to whether or not ultimately
13 sensitivity testing is beneficial, because among
14 the lethal consequences of endocarditis are a
15 series of almost anatomical problems, valve
16 problems, thrombi, et cetera, that are not in any
17 way predicted or dealt with by the sensitivity
18 testing. So again, it's -- I think that the two
19 are, recall each other, but have important
20 differences in thinking about the ultimate value
21 of the tests.
22 DR. FERGUSON: Dr. Sundwall?
23 DR. SUNDWALL: Just a quick question.
24 Dr. Sausville, I am a family physician, not an
25 oncologist, but I was very perplexed by your
00223
1 statement. If I heard it correctly, you said,
2 knowing what drugs won't work is not all that
3 helpful. I don't understand that, given the
4 morbidity and the difficulties with
5 chemotherapeutic agents. I've had many patients
6 suffer terribly from chemotherapy, and how can
7 you say not knowing what won't work isn't that
8 helpful?
9 DR. SAUSVILLE: Because the context in
10 which -- and I respect your point, and I don't
11 certainly mean to in any way imply a lot of sole
12 searching on both the part of physicians and
13 patients that goes into the decision to entertain
14 therapy. But in oncology, frequently the
15 treatment is driven by the histologic diagnosis,
16 so if for example the initial diagnosis of small
17 cell lung cancer, if one could have a pattern of
18 drugs that have more or less a susceptibility, I
19 am not aware that such tests would be considered
20 definitive in saying, well, because you happen to
21 have a resistant small cell lung cancer, you
22 should not receive any therapy. So in that case
23 the therapy, or choice of therapy, is ultimately
24 driven by the histologic diagnosis that's
25 apparent. Consider the opposite point. Somebody
00224
1 with a chemotherapy refractory neoplasm,
2 manifested, such as pancreatic or renal, which
3 are problems which as far as I'm aware, are not
4 considered responsive to any set of agents
5 routinely.
6 Again, the information of whether the
7 patient has that dire situation is implicit in
8 the histology. It's not clear that any tests
9 that can be done ultimately defines a drug that
10 can change the outcome that is at the present
11 time ordained by the histology. So I take your
12 point, that being able to reliably choose drugs
13 that convey a useful clinical benefit is very
14 worthwhile and a goal that should be pursued. I
15 am not sure that the current tests actually allow
16 the clear delineation of such agents.
17 And in that regard, you can tell a
18 patient who has the unfortunate diagnosis of
19 pancreatic cancer, that they are likely not going
20 to respond to a medicine chosen on that basis, or
21 chosen after having gone through an additional
22 test to obtain tissue and then tested for assay
23 resistance.
24 MR. KIESNER: I think Dr. Sundwall
25 asked a very interesting question, and I think
00225
1 there are at least two clinical strategies for
2 using this type of information. I think on one
3 hand you can say, we're going to select a drug,
4 and another, there may be a different clinical
5 setting, and I will give you two examples. I
6 think this is very important.
7 Dr. Alberts spoke this morning about a
8 clinical situation where he would be referred a
9 patient from another hospital, and that patient
10 may not, may be unaffected by the primary care,
11 he has relapsed, the tumor is growing, and they
12 send him, they send the patient to him. Doctor,
13 what can you do to help me? In that situation,
14 there may be three or four or five different
15 drugs, single agents, none of which have been
16 determined to have a significant clinical benefit
17 over the other drug in that situation. If I am a
18 patient and if any physician can tell me of the
19 five drugs, Frank, two of those drugs you're
20 resistant to, what has he told me? He's said,
21 I'm not going to use those two drugs, I've saved
22 you from the possibility that you're going to get
23 those two drugs and not benefit from it. It's
24 very, very well documented that these tests are
25 able to identify resistance, and if I'm a patient
00226
1 and if my physician in that setting can identify
2 the resistance, I believe he has done me a real
3 service.
4 The second situation is one which I
5 experienced personally. And I'm not mentioning
6 it because it's personal, I'm mentioning it
7 because it's exemplary of the position that a lot
8 of families can be in relation to elderly
9 Medicare patients. In -- I'm from Minneapolis.
10 My father was in St. Mary's Hospital. He was ten
11 years past Medicare age and was being treated for
12 cancer. We saw what the drug was doing to him.
13 If his physician could have come to me and said,
14 Frank, I have two or three other drugs, two or
15 three other choices I could try, and I have done
16 a test and I could see that they are all
17 resistant, I don't think we should go any
18 further. From the family situation, it was a
19 very difficult situation to make, do you go
20 further. We made the situation not to. But to
21 this date, if I would have had an assay that
22 would have told me the drugs that the physician
23 was considering will not work, I would feel I
24 would have been served, our family would have
25 been served, and my father would have been
00227
1 served. Elimination of drugs, identification of
2 drugs in those types of clinical settings that
3 don't help, or help you stop therapy, I think is
4 something worth considering.
5 DR. FERGUSON: Thank you. Very
6 briefly.
7 DR. SAUSVILLE: So my response to that
8 is the essence of the issue, and it also pertains
9 to the question before, is whether or not one
10 could have reached the conclusion that drugs
11 would not have benefitted your relative by the
12 diagnosis itself, and not have ultimately had to
13 rely on a test. And here the performance
14 properties of the -- the unfortunate performance
15 properties that when a drug is predicted to be
16 sensitive by these tests, the outcome is
17 unfortunately not any different in many cases,
18 than when things -- and in fact, in all cases
19 that I'm aware -- of when drugs are seen as
20 resistant, is the essence of why we are in a
21 quandary about how to appropriately use this.
22 DR. FERGUSON: Thank you. Harry.
23 Dr. Handelsman?
24 DR. HANDELSMAN: I'm Harry Handelsman.
25 I'm at the Center for Practice and Technology
00228
1 Assessment, Agency for Health Care Policy and
2 Research, and our office was asked by HCFA to
3 review the 1990 article by Kern and Weisenthal on
4 the use of suprapharmacologic drug exposures.
5 And I'm going to briefly synthesize what I think
6 was the essence of that article and then give my
7 personal critique. Unfortunately, some of this
8 is going to be repeating some of the data that
9 you heard earlier today, and that's unavoidable.
10 Bayes' theorem suggests that drug
11 sensitivity testing in vitro will be accurate in
12 predicting clinical drug resistance in tumors
13 with high overall response rates only if the
14 assays have a specificity of greater than 98
15 percent for drug resistance. A 1989 review of
16 the literature by the authors indicated that a 30
17 to 50 percent false positive rate, and a false
18 negative rate as high as 15 percent.
19 This reported assay, which was
20 developed by Kern, uses a soft agar culture with
21 products of concentration times time higher than
22 those which can be achieved clinically and used
23 drug exposures 100-fold higher than other
24 contemporaneous studies. Response assessments
25 were made by retrospective and blinded chart
00229
1 reviews. The authors reviewed 450 correlations
2 between assay results and clinical response over
3 an eight-year period. The assay was calibrated
4 to produce extremely high specificity for drug
5 resistance. Two assay end points were used,
6 colony formation and thymidine incorporation.
7 Overall response rates were 28 percent
8 using the colony formation end point, and 34
9 percent using the thymidine incorporation end
10 point. At the assay lower cutoff value, the
11 assay was 99 percent specific in identifying
12 non-responders, fulfilling the Bayes prediction.
13 Patients with drug resistant tumors could be
14 accurately identified in otherwise highly
15 responsive patient cohorts. The demonstration
16 that the post-test response probabilities of
17 patients varied according to assay results in
18 pretest response probabilities allowed the
19 construction of a nomogram for predicting
20 probability of response.
21 In 1976, it appeared that no method of
22 predictive testing had gained general acceptance,
23 and during the subsequent decade, high false
24 positive and false negative rates continued to
25 plague the field of in vitro testing.
00230
1 The clinical advantages of developing a
2 highly specific drug resistant assay include:
3 The avoidance of the use of inactive agents in
4 treating responsive tumors; the avoidance of drug
5 related morbidity of inactive agents; the
6 identification of drug resistant tumors for
7 timely consideration of alternative therapies;
8 and obviously, the cost savings of avoiding the
9 use of ineffective agents. Alternative assay
10 methods are available. However, the use of cell
11 culture assay had the advantage of measuring the
12 net effect of both known and unknown mechanisms
13 involved in drug resistance.
14 It is indeed possible to estimate the
15 post-test response probability for specific drugs
16 in specific tumors and patients. This can be
17 achieved through the determination of assay
18 results and the application of a constructed
19 nomogram for assay predicted probability of
20 response.
21 In general, efficacy studies in both in
22 vitro and in vivo tumor models provide an
23 opportunity to obtain data on both efficacy and
24 toxicity, and to refine dose and schedule
25 information for clinical trials. In vitro
00231
1 testing has been extensively applied to determine
2 the potential efficacy of individual drugs and
3 remains an attractive alternative to testing
4 empiric regimens in phase I and phase II clinical
5 trials. In vitro testing can differentiate
6 active and inactive agents, but cannot serve as a
7 substitute for in vivo studies, despite providing
8 elements of both positive and negative predictive
9 reliability.
10 Combinations of agents, which are the
11 most widely applied treatment strategy, are best
12 evaluated using in vivo models, where both
13 toxicity and pharmacokinetics can be adequately
14 studied. Although in vitro assays can provide
15 primary drug resistance date, the most relevant
16 outcome from such assays is improved patient
17 survival, and there have been no clinical trials
18 demonstrating such a result. In addition, it
19 remains to be determined if in vitro testing will
20 be found to have direct clinical applications for
21 disease or patient specific therapies. There
22 have been encouraging reports of survival
23 advantage of patients treated with in vitro
24 directed therapies, but these require
25 confirmation from larger numbers of patients and
00232
1 variety of tumors.
2 Both randomized and non-randomized
3 studies comparing tumor responses to chemotherapy
4 selected by in vitro testing with empirical
5 chemotherapy have produced conflicting results.
6 Response rates appear to be better with in vivo
7 selective agents. However, the impact on
8 survival has not been adequately addressed.
9 Ideally, in vitro assays should be correlated
10 with both response and survival data. The most
11 significant issue in the realm of cancer
12 chemotherapy is that of the resistant
13 mechanisms. The ability to identify ineffective
14 agents in these assays, albeit potentially
15 important, does little to elucidate the
16 mechanisms problem. The assay described in this
17 article can perform its intended task of
18 identifying resistant tumors, and determining a
19 probability of response, but its clinical utility
20 has not been established.
21 DR. FERGUSON: Thank you. I think we
22 have time for a few questions. Panel, or
23 others? Comments? Yes?
24 DR. FRUEHAUF: I think that was a nice
25 summary of the paper and I think the issue of
00233
1 survival was addressed this morning.
2 DR. HANDELSMAN: Excuse me, if I can
3 interrupt. The issue of survival was predicted,
4 but it wasn't on a comparison with alternative
5 therapies.
6 DR. FRUEHAUF: That's true. It was
7 survival in a blinded prospective way, looking at
8 people just getting empirical therapy and asking
9 the question, if you're not going to respond,
10 will you have inferior survival? And we've
11 addressed the issue of, do you want to get a
12 drug, as a person who has cancer, that won't work
13 and won't benefit your survival? And I think
14 that's the important point that this paper is
15 establishing, the utility of knowing that a drug
16 will not be of benefit to a cancer patient.
17 And I have dealt with neuropathies that
18 are incapacitating to professional tennis
19 players. I have had to deal with all sorts of
20 toxicities, and many of these people progress
21 through therapy and die of their disease, and the
22 quality of their life during that progression was
23 significantly adversely affected by getting
24 ineffective therapy. So the clinical utility
25 question to me as a practicing physician, is to
00234
1 not harm people with ineffective therapy that
2 will not, which has been demonstrated not to
3 benefit their survival. And I think most people
4 understand, if you don't respond to the therapy,
5 you're not going to live longer. And we're not
6 trying to say that the test will predict a drug
7 that will help people live longer, for drug
8 resistance assays. We are trying to say, and you
9 stated, and Dr. Sausville stated, that these
10 assays accurately predict drugs that will not be
11 effective. And then the question is, so what? I
12 think the answer to the question so what is, so I
13 don't want to give those drugs to my patient.
14 DR. FERGUSON: Thank you. Okay. I
15 guess we can go on. Harry, thank you very much.
16 Dr. Burke?
17 DR. BURKE: I think we're going to have
18 a lot of fun this afternoon. My name is Harry
19 Burke. I'm a consultant to HCFA. I'm an
20 internist. I'm a methodologist. I'm only here
21 for today.
22 The first couple slides that I am going
23 to present are not HCFA's position, they're my
24 personal review on the subject, and shouldn't be
25 considered HCFA's policy. I am going to address
00235
1 three issues today. First, the levels of
2 evidence, which has been raised several times by
3 various speakers. I'm going to talk about test
4 accuracy. And then I am going to talk about the
5 Kern and Weisenthal article that Dr. Handelsman
6 just gave us an introduction to.
7 But before that, I would like to make a
8 couple comments. First, the extreme drug
9 resistance is really a therapy specific
10 prognostic factor. It really has to be looked at
11 in the context of other therapy specific
12 prognostic factors. Dan Hayes was right. It's
13 like ER and PR, and these other factors. And
14 there's a scientific rationale underlying therapy
15 specific prognostic factors that must be dealt
16 with. The utility of the test depends on the
17 characteristics of the test; that's clear. But
18 it also depends on the efficacy of the treatments
19 if it's a therapy specific prognostic factor,
20 because they're inextricably linked together, and
21 you can't separate the two. And it depends on
22 the prevalence of the disease or the resistance
23 in the population under study. So it's really
24 those three factors together that must be taken
25 into consideration when looking at something like
00236
1 this.
2 Let me make another point, and that is,
3 when we talk about the utility of this test, we
4 can't be talking about the utility for individual
5 patients. We have to be talking about the
6 utility of the test for a population of
7 patients. So we can't switch back and forth
8 between the two, because we're really mixing
9 apples and oranges when we do that.
10 I'd like to make a couple comments
11 about what has been said earlier. Fruehauf,
12 Weisenthal and others have suggested consistent
13 findings across studies, and they've made a claim
14 that that proves something. And I would like to
15 suggest that, yes, consistent findings across
16 studies can be due to robustness of the
17 underlying phenomenon. But it can also be due to
18 consistent biases across the studies. And so if
19 you are going to make a claim for consistency of
20 35 studies, that all suggest the same thing, and
21 that's a robustness claim, you'd better be
22 prepared to tell me why it isn't due to biases in
23 the 35 studies themselves. So you really have to
24 look at each of the 35 studies and you have to
25 ask the question, are these really consistent?
00237
1 You can't just wave a hand and point to 35
2 studies. You have to rule out the alternative
3 hypothesis.
4 Secondly, I'm a little confused.
5 Kiesner pointed out that we could use this task
6 at the bedside at the discretion of the patient,
7 or I mean of the physician, while Kern suggested
8 that it could be used to deny a particular drug.
9 And I need to know, which is it? Is it that the
10 evidence is so convincing that it can be used to
11 deny a particular therapy, or that it's not that
12 convincing, and it's just one of an armamentarium
13 of tests that are available to the physician.
14 First, let me just do a little
15 background. Comparative clinical benefit is what
16 I'm looking at. This is my gloss on reasonable
17 and necessary. It could be defined as the test
18 or treatment providing a measurable improvement
19 over all the current relevant tests and
20 treatments at a cost commensurate with the
21 measured improvement. I also suggest that FDA
22 approval is prima facie evidence of safety and
23 efficacy, but if that isn't there, I think safety
24 and efficacy must also be demonstrated. And a
25 comparative clinical benefit study of a
00238
1 prospective, or of the test or treatment, must
2 compare itself to the other tests or treatments.
3 It doesn't stand alone. And so when you say
4 well, this test or treatment is really good, you
5 have to say what the other tests and treatments
6 are, what you're comparing it to.
7 Now I would -- I am not totally a
8 believer in randomized clinical trials for
9 everything. My suggestion is that there may be
10 three levels of evidence that can be adduced: A
11 strong evidence, which is either a large
12 prospective randomized clinical trial, or two
13 large retrospective studies where one study
14 independently replicates the other study. I
15 think that's good science as well. Or two medium
16 size randomized prospective trials. I think all
17 of those would be strong evidence. If I saw a
18 really large retrospective study that was
19 independently replicated by independent
20 investigators, independent institutions, I would
21 take that as fairly strong evidence.
22 Moderate strength are medium sized
23 prospective trials, a large well designed
24 retrospective study that hasn't been replicated,
25 or two medium randomized prospective clinical
00239
1 trials, medium size.
2 Weak evidence. Small properly designed
3 and implemented prospective randomized trials, I
4 think are weak evidence, and I think are well
5 recognized as that, and I think Pito and others
6 have suggested meta-analysis to overcome the
7 weaknesses of small randomized clinical trials.
8 Alternatively, two medium sized retrospective
9 studies that were done by independent
10 investigators might be good evidence.
11 But insufficient evidence, small
12 systematic studies, I consider them really
13 exploratory rather than evidence. Case series, I
14 think are well considered as anecdotal. And any
15 study that's not properly designed, implemented
16 or analyzed must be considered fatally flawed.
17 Large is 500 patients, medium sized,
18 250, small is less than 250. You know.
19 Okay. Test accuracy. What is a
20 properly designed, implemented and analyzed
21 study? Well, test accuracy of course is an
22 association between each patient's predictions
23 based on the test, and each patient's true
24 outcome. That's test accuracy. The factors that
25 affect test accuracy include the study
00240
1 population, were the patients who were selected
2 easy to predict. Because you can select patient
3 populations, and we'll get into that later, that
4 are very easy to predict by just about any test.
5 The test characteristics: Was the test assessed
6 in the clinical setting which it's intended to be
7 used for? The reproducibility: Does the
8 prediction variability increase across
9 laboratories and reagents? And finally, the
10 method of measuring the accuracy: Was the
11 correct method used?
12 And I'm going to focus on two of these,
13 the first and fourth, which are the most
14 problematic.
15 Okay. Sample size, or study sample
16 characteristics. The composition of the study,
17 the study population, makes a difference in the
18 observed accuracy. A sample with only extreme
19 cases, i.e., the predictors are extreme values of
20 their range, will be easier to predict than a
21 sample with many intermediate cases, the
22 predictions are mostly in the middle of their
23 range. For example, for women with breast cancer
24 who have many positive lymph nodes, their
25 outcomes are fairly easy to predict. Women with
00241
1 metastatic disease, their outcomes are pretty
2 easy to predict. It isn't a hard task to do.
3 What is hard to do is to predict the women with
4 small tumors and with no lymph node involvement
5 or metastatic involvement, that's really tough to
6 do. So, if you just pick an extreme population,
7 it turns out those are pretty easy predictions to
8 make, but it turns out that most patients aren't
9 in the extreme, so it's relatively unuseful.
10 Okay? Thus, the sample must be representative of
11 the real world in which the test is to be used.
12 Measurement of test accuracy. There
13 are several ways to assess test accuracy. The
14 correctness of the accuracy assessment method
15 depends on, so when you select a method of test
16 accuracy, whether there is a preexisting
17 threshold, in other words, is there something out
18 there that says everybody above this should be
19 positive, everybody below should be negative,
20 does that already exist, or do you have to
21 construct it? The number of tests to be
22 assessed. And whether the assessment is
23 performed on one population or more than one
24 population.
25 And just very briefly, this is really
00242
1 hard to read. I can't get the lines on tables to
2 work out for me, so this is lineless. But it
3 turns out that the sensitivity and specificity
4 pairs are really, have one threshold, they do one
5 test, and its one population. Okay? Positive
6 and negative predicted value, there is one
7 threshold, one test, and two or more populations,
8 because really, the positive and negative
9 predictive value we're talking about are
10 different prevalences, therefore, different
11 populations. And the area under the receiver
12 operating characteristic includes all thresholds,
13 two or more tests are assessed, and one
14 population. So in other words, we use the ROC as
15 a best unbiased measure of test accuracy.
16 In terms of the measures of accuracy
17 discussed above, without changing the test
18 itself, there are only two ways to change the
19 accuracy of the test. One way, of course, is to
20 change the threshold of the predictions, and then
21 your sensitivity and specificity would change.
22 And the other way is to change the prevalence of
23 the disease in the population, because then your
24 negative and predicted -- positive and negative
25 predicted values will change. Okay?
00243
1 Prevalence's effect on accuracy. The
2 optimal prevalence for assessing the accuracy of
3 the test is to use a population composed of 50
4 percent disease, 50 percent unaffected. In this
5 situation, the prevalence itself provides no
6 advantage to the test. As the prevalence departs
7 from 50-50, the impact of predicting the
8 prevalence becomes more prominent. In other
9 words, if the test acted as a naive Bayesian
10 classifier, then for each patient it would always
11 predict the most frequent outcome, in other
12 words, it would predict the prevalence. So for
13 example, if there was a 90 percent prevalence in
14 a diseased population, then the naive Bayesian
15 classifier would say disease every single time
16 for every single patient, and you would be right
17 90 percent of the time. That's pretty good.
18 Okay? That's a pretty accurate approach. As the
19 proportion of patients with or without the event
20 moves, either toward a hundred percent or zero,
21 the naive Bayesian approach becomes more
22 effective, more efficient in its predictions. So
23 it's only at 50-50 for binary outcome, that you
24 neutralize the naive Bayesian classifier
25 approach. In other words, if the true prevalence
00244
1 of the disease in a population is close to a
2 hundred percent, it's almost possible for a test
3 to add predictive information. Okay? That's
4 really an important idea. So, as you get towards
5 high prevalences, almost no test will be helpful
6 anymore. Okay?
7 Changes in the prevalence of the
8 disease in a population, as reflected by
9 corresponding changes in the test's positive and
10 negative predictive values. If one were allowed
11 to report the positive predictive value, or the
12 negative predictive value of tests just by
13 itself, then one might be tempted to create or
14 select a high prevalence population for
15 assessment of the test, because the test would
16 appear to possess a high predictive accuracy,
17 okay? Until of course, it was compared to the
18 naive Bayesian classifier, at which point it
19 would cease. Thus, both the positive and
20 negative predictive values of the test must be
21 assessed. Then, if the prevalence is not 50
22 percent, the test must be compared to the naive
23 Bayesian classifier. Further, both the
24 sensitivity and specificity of the test must be
25 assessed in terms of the cutoff that was
00245
1 selected, and the prevalence, because it turns
2 out that although it's commonly thought that
3 prevalence doesn't affect sensitivity and
4 specificity, it certainly does, and there are a
5 number of papers that demonstrate that.
6 So, a better way to assess the accuracy
7 of the test is to use the ROC. This measure of
8 accuracy is impervious to changes in prevalence
9 and reflects the characteristics of the test
10 across all sensitivity and specificity pairs.
11 Well, okay. So now, I was asked to
12 take a peek at Kern and Weisenthal's paper as
13 well, and it turns out that they really are very
14 sophisticated in their use of data and results.
15 It's probably one of the most sophisticated
16 papers I've ever read, and I have read quite a
17 few. I'm going to talk about those areas of the
18 paper.
19 Overview of the study. Kern and
20 Weisenthal used two in vitro tests, which have
21 been mentioned, as surrogate outcomes for
22 response to chemotherapy in patients with
23 different types of cancer. If a patient's tumor
24 demonstrated drug resistance in a test, i.e.,
25 after the patient's tumor cells were exposed to
00246
1 the drugs for a certain period of time, and the
2 cells did not achieve a threshold inhibition, the
3 test was interpreted as predicting that the
4 patient would not clinically benefit from
5 receiving the drug.
6 So, we go back to our levels of
7 evidence, and we can ask overall about this
8 study, where it would lie in our levels of
9 evidence? Well, the colony formation test is
10 really Level III, it's weak evidence. The
11 thymidine incorporation is really Level IV,
12 insufficient evidence.
13 But, not letting that bother us too
14 much, let's talk about the study itself. It's a
15 retrospective chart review, subject to several
16 biases, including therapy selection bias, who
17 received the therapy, and study selection bias,
18 which patients were included in the study. And
19 the study was not validated on an independent
20 population, but it was done on the same
21 population. It was done from 1980 to 1987 in the
22 United States.
23 The study characteristics. Initial
24 population was 5,059 patients. From that, they
25 winnowed it down to 450 patients that they
00247
1 actually studied, about 9 percent of the initial
2 population. They looked at eight different types
3 of cancer. They had 332 colony formation
4 patients, 116 thymidine incorporation patients.
5 And the non-respondent prevalence was 71 percent
6 of the population.
7 One thing that the study wasn't very
8 clear about, it said that virtually all patients
9 were treated with standard chemotherapy, but then
10 later on it said, most of the patients whose
11 specimens were analyzed did not receive
12 chemotherapy because they underwent curative
13 surgical procedures. And I didn't understand
14 that distinction.
15 We'll assume that all 450 patients in
16 the study received chemotherapy. The percentage
17 of patients who receive chemotherapy today may
18 actually be much higher. The criteria to decide
19 which patients received chemotherapy is not
20 reliable. This is really not a very acceptable
21 approach to a study. If in fact you're going to
22 predict who's going to respond to chemotherapy, I
23 think you really have to say how chemotherapy was
24 selected, what the selection criteria were.
25 Also not provided were the patient
00248
1 characteristics of the study population, and this
2 is really critical information. For example, if
3 the population was composed of patients who had
4 already received primary chemotherapy, had
5 incurable disease, and were undergoing salvage
6 treatment, then this study would not be
7 applicable today, and in addition, the results
8 would be biased. So we really need to know what
9 the chemotherapy selection criteria were, and
10 what the patient population characteristics were,
11 neither of which are provided to us. There is no
12 basis from which to understand the results that
13 we are seeing.
14 Now, the function of the test is to
15 predict clinical non-response to chemotherapy
16 using suprapharmacologic drug doses. Now, we're
17 interested in the non-response rate per drug per
18 cancer type per test type. That's what we're
19 interested in. So there were eight drugs. Now
20 I'm not going to talk about combination therapy
21 because that's a whole other subject. There are
22 eight drugs, eight cancer types, that means there
23 were 64 bins, okay? So that means per cancer,
24 per treatment, so there were 64 of those
25 combinations for each of the two tests, for a
00249
1 total of 128 accuracy assessments. And excuse,
2 the lines aren't there, but you see 64 bins. And
3 for each bin, you would want to know
4 prospectively, hopefully randomized, you would
5 want to know, for disease one, treatment one,
6 what does the test say, okay, about this
7 population? In that one cell. And then you
8 would want to follow that population over time
9 and see what actually happened to those people.
10 So for breast cancer and a particular
11 chemotherapeutic agent, you would like to see,
12 did the test predict for that chemotherapeutic
13 agent for breast cancer, successfully. And you'd
14 want to do that for each of the 64 cells. And in
15 fact, you must do it for each of the 64 cells.
16 If there were the same number of
17 patients per cancer type, then the 118 patients
18 tested for thymidine incorporation would be at
19 1.8 patients per bin, for this study. And for
20 the 332 patients tested for colony formation,
21 there would be 5.2 patients per bin. These
22 frequencies would be too low to be meaningful.
23 Now, out of the eight drugs tested and
24 reported, the only drugs to use today, and are
25 they not used in combination, the efficacy, the
00250
1 efficiency of these tests must be demonstrated
2 with each chemotherapeutic agent in use today,
3 and for each combination of agents, each type of
4 cancer.
5 Now, just a couple final points. It's
6 unclear why this study provided two sets of
7 thresholds instead of one. Further, although two
8 thresholds were tested for significance, three
9 were presented in the text, shown in the tables
10 and figures. The first threshold is 45 to 75,
11 and the second one was 15 to 40. In this study,
12 the thresholds that were selected to assess on
13 the same population that were used to determine
14 the optimal thresholds. This elementary mistake,
15 reporting the results from the population used to
16 create the threshold, rather than the results of
17 an independent population, always results in the
18 overestimation of test accuracy.
19 The outcome was standard response
20 criteria. We are never given a definition of
21 what standard response criteria are. We don't
22 know who got the chemo and why. We don't know
23 the study population. We don't even know the
24 outcome. We are never given definitions of any
25 of those three. It's absolutely critical that
00251
1 the specific response criteria employed by the
2 investigators be revealed if that is their
3 outcome.
4 Now of course, Rich Simon, who many of
5 you know at the NCI, and others, have pointed out
6 that response is an unreliable outcome and should
7 be avoided if at all possible. So, okay. So
8 rather than the 64 sets of results that we were
9 looking for, two sets of results were presented,
10 one for each of the two thresholds. Each of the
11 results is across all eight cancers, all eight
12 therapies, and the tests, and the results are
13 there for the first threshold, 60 something
14 percent sensitivity, 87 specificity, 43 and 99.
15 Clearly, the sensitivity goes down as the
16 specificity goes up. Neither sensitivity or
17 specificity pair is very high. Combining all
18 results into one conglomeration provides no
19 information regarding the utility of the test for
20 each drug in terms of each cancer type. The
21 study should have reported the area of the ROC
22 curve, both tests, for the 64 sets of results.
23 Thank you.
24 DR. FERGUSON: Thank you very much.
25 We're actually at our time for a break.
00252
1 Perhaps -- it is almost 2:30. If we take a
2 15-minute break, I think, yes, would you please
3 come back up, because there may be a couple of
4 questions for you.
5 (Recess from 2:25 p.m. to 2:45 p.m.)
6 DR. FERGUSON: I wonder if there are
7 any in the audience, or panel for that matter,
8 who would like to ask Dr. Burke some questions
9 related to his presentation? And also, Dr. Burke
10 has promised to give us the last few slides. Are
11 there questions for Dr. Burke from members of the
12 audience or from the panel?
13 Dr. Weisenthal, did you have a question
14 that you wanted to ask, or a comment?
15 DR. WEISENTHAL: I want to thank
16 Dr. Burke. He started off by paying me
17 compliment, and he said of Dr. Kern and I's
18 paper, that this is one of the most sophisticated
19 papers that he's ever read. I've also been
20 talking to critics of these technologies for 20
21 years, and that's the most sophisticated
22 criticism that I've ever had, so I want to
23 congratulate you on that.
24 There are several points that were
25 raised in your talk which should be addressed.
00253
1 Just to begin with, the study by Kern and
2 Weisenthal that you spent the bulk of your time
3 reviewing, just to begin with that, you brought
4 up several methodologic criticisms and raised
5 questions about patient selection and so forth.
6 I want to remind everyone here that that was
7 published in the Journal of the National Cancer
8 Institute. I assure you it underwent rigorous
9 peer review. When we submitted our first draft
10 of the manuscript, the reviewers there had
11 certain problems with it and they had certain
12 things they wanted clarification of.
13 DR. BURKE: But that's an appeal to
14 authority.
15 DR. WEISENTHAL: No, no, no. Dr.
16 Burke, had you been one of the reviewers, no
17 doubt you would have raised those issues at the
18 time and we would have responded to those. And
19 I'd like to ask Dr. Kern now if he can respond,
20 so we're in consideration of that you were one of
21 the -- you know, you can't blame us because you
22 were not the reviewer of our paper. Had you been
23 there and helping us to get the essential
24 information out there, I'm sure it would have
25 been a better paper. But we'd like to address
00254
1 those issues that you raised at this time, if
2 that's okay.
3 DR. BURKE: Absolutely.
4 DR. KERN: One of the points was the
5 selection bias. How could you end up with 450
6 correlations out of 5,000 patients in the study?
7 Well, the 5,000 patients was an overview of all
8 the tests that we had done in the laboratory. It
9 wasn't meant to imply that the clinical study was
10 based on 5,000 patients. And in fact, at the
11 Department of Surgery, UCLA, where I was, most of
12 the patients were treated with surgery or
13 radiation, not with chemotherapy.
14 Secondly, many of the patients that
15 received chemotherapy received adjuvant
16 chemotherapy. So the inclusion criteria of the
17 study to get to 400 patients included, first,
18 patients had to have advanced disease; second,
19 they all had to have objectively measurable
20 disease, either by CT scan, x-rays or so on.
21 Okay?
22 Now, as far as another comment that you
23 made about one study, but it's not been
24 independently validated, I think I may ask Dr.
25 Bosanquet to address that issue, because he
00255
1 published an article in Lancet a couple of years
2 after our paper.
3 DR. BURKE: Did you want to address any
4 of the other issues that I brought up?
5 (Inaudible response from audience.)
6 DR. BURKE: I mean, this is not an
7 opportunity for us to get into whether the study
8 has been validated or not at this time. That was
9 just an issue that I raised, and perhaps at
10 another forum that can be addressed further. I
11 think we have time limitations.
12 DR. KERN: Well, I will try to answer.
13 DR. BURKE: So keep going. There were
14 a lot of issues.
15 DR. KERN: Bring up a couple of the
16 issues, remind me of them. Let me see what you
17 consider a serious objection.
18 DR. BURKE: Well, the selection, the
19 patient characteristics, the criteria for who got
20 what treatment.
21 DR. KERN: Let's go one at a time. Who
22 got what treatment was determined independently,
23 not by the assay, but by the disease type. The
24 patients went on standard protocols. Most of the
25 patients who ended up at being UCLA, an academic
00256
1 center, were all on some sort of clinical trial,
2 randomized trial protocol.
3 DR. BURKE: What were the standard
4 protocols? What was the response criteria that
5 you used?
6 DR. KERN: The response criteria were
7 the ECROG criteria of partial response and
8 complete response.
9 DR. BURKE: And what percentage was
10 each in terms of your study?
11 DR. KERN: I'm sorry, I don't
12 understand.
13 DR. BURKE: In other words, in terms of
14 response, global response measured, and what
15 percentage of these patients were partial
16 responses, what were complete, and then at that
17 time, how were those defined in your study
18 population?
19 DR. KERN: Okay. The responses were,
20 again, just by objective measurements. It was
21 retrospective, but scans, x-rays. And the
22 complete response, obviously, complete
23 disappearance of the disease. Partial response
24 was by the criteria of two dimensions and the
25 shrinkage of at least half in two dimensions.
00257
1 Standard criteria.
2 DR. BURKE: But this was a
3 retrospective study where you went back to the
4 charts. We all know about the paucity of
5 information and the error of information, and in
6 follow-up information not being in the charts.
7 How did you manage those issues in your
8 retrospective study?
9 DR. KERN: Well, the follow-up --
10 obviously, there are problems, and I'm not trying
11 to say there's not biases in it. We all know the
12 disadvantages of retrospective chart reviews.
13 The only thing I can tell you is what actually
14 was done, two oncologists reviewed the charts and
15 made their best decisions of what the responses
16 were, based on measurable criteria.
17 DR. BURKE: What did they do when they
18 disagreed? What did they do when information
19 wasn't there? What did they do make sure it was
20 accurate information? Do you want to continue
21 with this?
22 DR. KERN: No, I cannot say that I can
23 answer every question. I mean, I'm not an expert
24 in your field.
25 DR. FERGUSON: One brief, and then
00258
1 we'll let Dr. Bosanquet speak.
2 DR. WEISENTHAL: This is really
3 important, okay? You know, you talked about an
4 eight-by-eight table, and we only have one
5 point. You know, a study like this is never
6 going to be done again in the history of the
7 world. Never again are you going to have 330
8 patients treated with single agents. The
9 important thing about it was that this was an
10 honest blinded study in the following fashion,
11 and that is that the clinical results were
12 determined independent of knowledge of assay
13 results. The clinical results were reported to
14 the Department of Biomathematics at UCLA; they
15 were like the stakeholder in this, they had the
16 clinical assessments. Likewise, they received
17 independently from the laboratory the laboratory
18 assessments, and then the correlations were made
19 as stated.
20 DR. BURKE: Let's just deal with that
21 issue for a moment, because Dr. Kern sat down and
22 you stood up. So we've got the 64 bin table,
23 right.
24 DR. WEISENTHAL: Yes.
25 DR. BURKE: And the issue is, how do we
00259
1 know the utility of this test for a chemotherapy
2 in a disease?
3 DR. WEISENTHAL: Okay. You're making
4 the same criticism as Maury Markman has made.
5 What Maury Markman says is as follows, and he
6 says that he notes that there have been no
7 prospective randomized trials.
8 DR. BURKE: That's not my question.
9 DR. WEISENTHAL: Wait a second. But
10 it's the same thing. He says that even if some
11 day there were to be a prospective randomized
12 study, that that would only apply to that one
13 particular situation, and it would not tell you
14 anything about all the other situations.
15 You know, the sort of information that
16 you're asking for in the real world will not be
17 available for 20 to 50 years, if ever.
18 DR. BURKE: No, no. I understand the
19 mitigating circumstances. But the question is,
20 if you want this test to predict a particular
21 chemotherapeutic regimen in a particular disease,
22 then I want that information, and I don't have it
23 in your study.
24 DR. FERGUSON: Okay. I am going to ask
25 for Dr. Bosanquet to give his response, and then
00260
1 we are going to go ahead. We will try to have a
2 little more time at the end.
3 DR. WEISENTHAL: There's an extremely
4 important point. Basically he started out his
5 talk denigrating -- in other words, I made the
6 point that we have 35 studies consistently
7 showing the same thing, and he denigrated that,
8 and he said, oh, that's just due to consistent
9 bias. And I would like to prove to you that that
10 is not true.
11 DR. BURKE: Excuse me. I didn't. I
12 posed an alternative hypothesis. I said there
13 are two hypotheses for the 35 consistent studies.
14 Assuming that they are consistent, which we
15 have no evidence of, but assuming that they are
16 consistent, it could be due to two things. It
17 could be due to the fact that there is a
18 phenomenon there, or it could be due to
19 consistent study bias. And until you eliminate
20 the alternative hypothesis, you haven't done
21 science.
22 DR. WEISENTHAL: I would like to then
23 eliminate the alternative hypothesis and prove to
24 Dr. Burke that we have indeed done science in
25 this setting.
00261
1 DR. FERGUSON: Can you do that in the
2 final hour?
3 DR. WEISENTHAL: Okay.
4 DR. BOSANQUET: It was stated that
5 there was no independent validation of this. We
6 actually took the data that we published in the
7 Lancet the following year. This paper that we're
8 discussing is 1990; we published this work in
9 1991 in the Lancet, using CLL patients. And we
10 also looked at extreme drug resistance in these
11 patients.
12 We got this. We found 22 of 119
13 patients had extreme drug resistance in vitro,
14 and none of these patients responded. So here is
15 one of the things that we would speak to, which
16 was independent validation in a completely
17 different set of circumstances in a different
18 laboratory, and we find exactly the same thing,
19 extreme drug resistance, no response.
20 DR. FERGUSON: Using the same cutoff
21 points that were determined by Kern Weisenthal, I
22 guess.
23 DR. BURKE: Just to respond briefly to
24 that, two points. One, that is not a replication
25 of the 1990 paper.
00262
1 And number two, I do suspect that
2 that's exactly correct, that it is disease and
3 treatment specific. And that's exactly my
4 point. That's exactly my point. You have to
5 talk a specific disease, a specific treatment,
6 how does the test do? Not a conglomeration of
7 diseases and treatments together. That's exactly
8 my point. Thank you.
9 DR. FERGUSON: Thank you. Dr. Burken.
10 DR. BURKEN: Hi, everybody. Can
11 everybody hear me okay? I am Dr. Mitch Burken, a
12 medical officer with the coverage and analysis
13 group at HCFA. What I'd like to do is try to tie
14 together some of the presentations from earlier
15 in the day. There will be a lot of material in
16 here that you've seen before, but what I want to
17 do is try to wrap it up, and wrap it up in a way
18 that's consistent with Dr. Bagley's opening
19 remarks around 8:00 this morning, looking at the
20 broad sweep of the evidence, not spending as much
21 time on specific papers as much as trying to see
22 the bigger picture, cutting across many assay
23 formats.
24 Well, as I said, for the first several
25 minutes I want to be as conceptual as possible,
00263
1 and then we'll get more into the bulk of the
2 evidence itself.
3 But let's think about why we would
4 order any type of lab test, okay? A lab test has
5 its maximum clinical utility when the disease
6 probability is most uncertain. In other words,
7 we heard a little bit about the 50-50 point, and
8 the naive Bayes condition, and so forth, but let
9 me just try to restate that in a slightly
10 different way. If we have any type of lab test
11 we're looking at, and exploring questions of
12 clinical utility, okay? What's the probability
13 that the patient has a disease? If a patient is
14 very unlikely to have the disease, then what kind
15 of information do you have when you get a lab
16 test result? It's certainly not very very high.
17 And the reciprocal situation, where we
18 have a very very high probability or prevalence
19 of disease, and then the lab test doesn't really
20 add a whole lot, because we're almost positive
21 the patient has the disease. It's when we're
22 unsure of ourselves, and when we are at that
23 50-50 point, that's when a lab result can really
24 begin to add value.
25 Well, where are we in the Medicare
00264
1 program? The panel is charged with trying to
2 demarcate what's reasonable and necessary with
3 respect to human tumor assay systems, okay? And
4 we need to find a spot in this, or we need to
5 kind of bracket an area of this graph where lab
6 testing -- and again, we'll talk about the HTAS
7 in a second. But where is lab testing most
8 reasonable and necessary? Where does it add
9 information?
10 Let's talk about now about applying
11 this more generic situation to human tumor assay
12 systems. Well, let's talk about the
13 chemosensitivity scenario. We talked all day
14 about how this testing can assist clinicians in
15 selecting effective single agents. Okay?
16 Conversely, the chemoresistance scenario is where
17 this assay, or where these assay systems can
18 avoid ineffective agents. And what's our
19 reference here? The reference is data from
20 published clinical trials; maybe they're in peer
21 reviewed journals, maybe unfortunately they're
22 just in abstracts that are available at ASCO
23 meetings. But again, there is information from
24 clinical trials that does provide a backdrop
25 against which one can look at this lab testing
00265
1 and the added value thereof.
2 So let's go back to our graph again.
3 In vitro testing has the greatest clinical
4 utility when the presumed sensitivity or
5 resistance, because remember, they're really
6 reciprocal functions of each other, is most
7 uncertain. And going to our X axis here, what's
8 the real question? The question is, is tumor X
9 sensitive or resistant to drug Y in patient Z?
10 Therefore, we need to be specific as to what
11 questions we're posing.
12 Well, let's talk some more about
13 clinical utility, because as I said, what we want
14 to do is look at the broad sweep of the
15 evidence. We've talked about different outcome
16 measures today; we've talked about clinical
17 response; survival; we've talked even a little
18 bit about quality of life, although in the packet
19 of materials there is really not a lot of quality
20 of life literature to discuss, so we won't really
21 get into that.
22 And in looking at clinical responses
23 and outcome, we need to identify robust
24 two-by-two data using a valid gold standard, and
25 from there we can look at different performance
00266
1 measures. In this case, I think it's valuable to
2 look at positive predictive value as a marker for
3 chemosensitivity, and negative predictive value
4 as a marker for chemoresistance. One could also
5 talk about the sensitivity or specificity, but
6 let's try to keep it just a little bit simpler,
7 let's focus in on some concepts, and not worry so
8 much about the math. There are others in the
9 room who may be more expert, but let's try to
10 keep it simple, and not get too wrapped up in the
11 numbers, but let's try to get wrapped up until
12 the themes and the concepts.
13 As we discussed earlier this morning,
14 as Dr. Bagley emphasized, we have to insure that
15 the biases, such as insufficient sample sizes,
16 don't substantially influence our results.
17 Going back to our graph now, in 2-D
18 rather than 3-D, let me emphasize a point that
19 I've said already, but let me reemphasize it
20 again. That if you are at the extremes of this
21 utility function, okay, where the lab test,
22 whether it's human tumor assay system, or a serum
23 sodium, or whatever it is, or a chest x-ray, any
24 type of diagnostic test, if you're at the extreme
25 regions of this utility function, it doesn't
00267
1 really matter what your predictive values are.
2 If the predictive value is high, it can be offset
3 by the fact that you are in an extreme region of
4 the utility function where those numbers don't
5 really mean as much. Okay? And we'll talk more
6 about that. Okay?
7 Well, what kinds of measures do we need
8 to evaluate test accuracy? The ones that I
9 talked about below, predictive values, but there
10 are also sensitivity, specificity, area under the
11 ROC curve, but let's talk about something else.
12 What about some of the physician concerns. In
13 the lab, what kinds of things can a physician
14 tell his or her patient when a particular tumor
15 can or cannot be assayed by the lab?
16 On the right side of the slide are what
17 I would call the quality control measures, and
18 we're not really going to spend time on those in
19 this particular, at least my particular
20 presentation, but you've heard from FDA earlier.
21 So let's just kind of stay on the left-hand side
22 of the slide for now.
23 Well, just to get back to a couple of
24 those issues that really cut to the heart what
25 physician concerns might be, you know, is there
00268
1 sufficient assessability or evaluability of the
2 tumor cells from the submitted specimens? And we
3 found out that some of the earlier clonogenic
4 assays had very very -- had relatively low
5 assessability or evaluability rates. But let me
6 pose another question.
7 Even if a particular assay format is
8 evaluable 90 percent of the time, it still might
9 mean that 10 percent of the time the physician
10 speaks with his or her patient, and they really
11 just can't get an adequate result. So I think
12 that's an issue. You know, even if it's 90 or 95
13 percent, there is still some percentage of the
14 time when you don't have a result and you come
15 back. Okay?
16 There are other issues that come into
17 play. What's the effect of tumor heterogeneity?
18 We talked a little bit this morning about tumor
19 heterogeneity, but there's another type of tumor
20 heterogeneity as well, and that's the type that
21 can occur within the same patient, so that a
22 primary tumor and its metastatic lesions have
23 different in vitro patterns. And again, that is
24 a consideration to keep in mind when we're
25 thinking about this type of testing.
00269
1 So as I mentioned, in vitro results for
2 solid tumors from one site may not always provide
3 the same result as other sites. However, there
4 is a paper back in 1986, we'll get to it a little
5 bit later, I'll touch on it again, but it shows
6 that this problem may not be quite as pronounced
7 in clonic lesions such as leukemias.
8 Well, there is a whole host of in vitro
9 assay formats and we can, as I say, just kind of
10 go through those. But I think it's important to
11 mention at the end here that we will not in this
12 presentation be going through the clonogenic
13 literature. When we reviewed this material at
14 HCFA, we didn't feel there would be a lot of
15 value in discussing the older technologies that
16 had the lower evaluability rates, and just felt
17 it would be better to present it to the panel
18 this way.
19 Well, what kinds of criteria do we use
20 to evaluate the literature? Again, our goal here
21 is to be broad based. We looked at peer review
22 journals in English. There were some manuscripts
23 pending publication that were necessary for panel
24 discussion. There were a couple of the assay
25 formats that were relatively recently developed
00270
1 that we felt we would not be fair to the
2 requesters if we excluded some of the
3 manuscripts. There was, for example, Bartels
4 chemoresponse assay was FDA approved back in 1996
5 and package inserted in the summary of the safety
6 evaluation data was included as a way of
7 evaluating that. And we did not look at abstract
8 data.
9 Based on that, what types of additional
10 search methods? Well, we -- again, we looked at
11 articles submitted to HCFA prior to November 1st,
12 1999.
13 DR. FERGUSON: Mitch, can you speak
14 into the microphone?
15 DR. BURKEN: Right. When we started
16 reviewing this sometime, sometime before the
17 panel itself, we found that the Fruehauf and
18 Bosanquet review article from the 1993 PPO
19 updates, crystallized many of the issues. And
20 what we did, based on it, there were some summary
21 tables that were actually presented this morning,
22 where they looked at groups of studies. And I
23 again refer you to summary table seven and eight
24 from the 1993 PPO. And as a result, we really
25 focused our efforts, our literature efforts on
00271
1 the EDR, as well as also some of the other
2 thymidine assays, because there were some other
3 thymidine uridine incorporation assays pertinent
4 to bring to the panel. And then we also did a
5 lot of sampling of DiSC and MTT, using a MEDLINE
6 search, and we did not have any time limit on our
7 studies.
8 And when we went through and did our
9 literature search, then we had to figure out what
10 we would want to present to the panel. And since
11 clinical response was one of the outcomes we
12 looked at, as well as survival, we needed to have
13 confidence in the viability of our two-by-two
14 tables. So as a result, any study that lacked
15 the clinical criteria -- either the -- the
16 clinical criteria either had to be documented or
17 referenced, you know, for clinical response. We
18 only looked at adult patients.
19 And just for the record here, in the
20 rather extensive handout which has been provided
21 for this session, we do list the pediatric
22 studies that have not been summarized in this
23 presentation, but there is a notebook of all the
24 studies that are being presented in this
25 presentation, are available. I know it's kind of
00272
1 hard to read the whole notebook tonight, but as a
2 supplement to the materials you already received,
3 there are papers in here such that anybody that
4 has any questions about any of the bullet items
5 from this afternoon's presentation can go back to
6 this, as well as your other materials.
7 We included both prospective and
8 retrospective two-by-two data designs. The only
9 thing we did exclude for this, again, for this
10 panel presentation, were descriptive type studies
11 that didn't use any quantitative summaries.
12 There were some studies that went beyond
13 two-by-two tables, used regression analysis and
14 some other techniques, and those were included.
15 Now talking about all these studies,
16 you know, how can we present these studies to the
17 impact panel? Can we group them or pool them, or
18 do we need to go through them individually? It
19 was something that we really had to spend some
20 time thinking about. And we came to the very,
21 very strong conclusion that data from the
22 individual studies should not be pooled. The
23 reason being is that they're, the studies are so
24 heterogeneous, they use different cutoff points,
25 different tumor drug combinations, different
00273
1 clinical response criteria, that we just felt
2 very uncomfortable about doing a meta-analysis
3 for the purposes of presenting data to the panel,
4 okay? So therefore, each study must be presented
5 on its own merit, and I think that's a
6 fundamental approach to presenting data this
7 afternoon, and probably lengthens the
8 presentation a little bit, but we feel it's
9 important.
10 So now, let's just go through the
11 evidence. Let me just walk through the handout
12 with you. I don't -- there is a lot of bulk on
13 my slides, but again, it's in the handout and it
14 is really set up to be a reference guide to
15 trying to put it all together.
16 The assay formats I start with are not
17 based on cell death versus cell proliferation.
18 It's not done that way. I went from the assay
19 formats that we concentrated on, as in the EDR,
20 the DiSC and the MTT, where we really had the
21 most literature, and then towards the end I have
22 some of the other formats where there was a
23 little less literature that came up, based on the
24 criteria that were described in previous slides.
25 The Kern and Weisenthal article from
00274
1 1990 is a complex article that was referred over
2 to Dr. Burke and Dr. Handelsman for separate
3 review. Again, a central piece of evidence, but
4 highly complex.
5 But let me go through some of the other
6 articles that pertain to EDR as well as some of
7 the other thymidine uridine incorporation
8 formats.
9 Eltabakkh, '98, shows, you know, some
10 PPVs and NPVs. There were some confidence
11 intervals that are reported. As you can see, the
12 NPVs in this study is actually fairly low. Let
13 me start, as I said, rather than to go through
14 all the bullets, let me try to highlight what are
15 some of the themes. As I said, there were a
16 hundred new patients with ovarian cancers. We
17 find out in this study, all the patients were
18 recruited prior to chemotherapy, which is
19 important when we think about selection bias.
20 And we found 75 evaluable patients, so we went
21 from about a hundred down to 75, which is really
22 pretty good.
23 Fernandez-Trigo is a study, again, that
24 also has some case loss of about roughly 25
25 percent. But in this case, there was a very rare
00275
1 site cancer that was selected, so I would just
2 keep that in mind.
3 Moving on to some of the other
4 thymidine uridine formats, you know, I talked
5 about the CRA, the Bartels CRA, and it turns out
6 that there was a study by Elledge back in '95
7 that enumerates the findings of clinical trial.
8 And one little, kind of I suppose warning to the
9 panel when evaluating this paper, I mentioned
10 that NPV is a marker for chemoresistance and PPV
11 being a marker for chemosensitivity. Well, in
12 this particular article, not the article but the
13 package insert, and the summary safety and
14 evaluation data, it's flip-flopped, so you have
15 to be a little bit careful there. It turns out
16 you have to reverse that, so you have to really
17 be wide awake when you read these two-by-two
18 tables. And I mention that down here, that the
19 two-by-two table design differs from the other
20 studies presented, even differs from the Elledge
21 paper, which is -- the Elledge paper comes out
22 before the FDA submission data. And this was a
23 prospective blind enrollment of 60 relapsed
24 breast cancer patients. The interesting thing
25 about this particular assay format is that it was
00276
1 very specific for breast cancer and 5-FU.
2 Well, what about some of these earlier
3 five-day thymidine uptake assays? Sondak in '84
4 had a series of 142 patients with successful
5 assays out of a pool of 219, with 33 clinical
6 correlations. Quite a bit of case loss here,
7 even though, again, you know -- well, the numbers
8 are small, but the NPV, again, is high. You
9 know. But one, again, has to be concerned about
10 possible selection bias.
11 Sondak in '85, 819 mixed solid tumors,
12 again, if you use different cutoffs, you're going
13 to have different PPVs and NPVs.
14 Sanfilippo, in '81, there were several
15 studies on three-hour incubation, rather than the
16 five days, you know, and there were -- as I said,
17 you can see the numbers here. I think the
18 interesting thing about this particular study in
19 '81 was the use of subsets for high
20 proliferative and low proliferative non-Hodgkins
21 lymphoma cases.
22 And in '86, Sanfilippo, the same group,
23 went ahead and studied 169 patients with various
24 types of germ cell testicular tumors, but only 29
25 cases were available for clinical correlation.
00277
1 Again, we didn't really know how many people were
2 previously treated or untreated, and that could
3 inject some bias.
4 More three-hour uptake assays, two
5 studies by Silvestrini in 1985 and Daidone in
6 1985 from the same institution as Silvestrini.
7 Different tumor types.
8 Well, let's kind of move along, and we
9 finished up with the thymidine/uridine
10 incorporation assays, and let's move on to the
11 DiSC assay. And as I say, there were several
12 papers that were reviewed in the Cortazar and
13 Johnson article, which is the review article in
14 1999, which did a MEDLINE search, and targeted 12
15 studies, and four of those 12 studies were DiSC
16 assay approaches in solid tumors, three of the
17 four studies being small cell lung carcinoma, the
18 other study being a non-small cell lung
19 carcinoma. And I think, you know, in each of the
20 studies, the test groups did at least as well as
21 the control groups. The survival data was not
22 particularly convincing. In the Gazdar study,
23 the survival rates were similar; in the Wilbur
24 study, the survival rate comparisons were not
25 am. Again, that is survival rate of assay
00278
1 directed versus, you know, empiric therapy
2 groups. In both the Shaw and the Cortazar
3 articles, their survival rates were really not --
4 there was really not enough of a difference to
5 really hold much discussion.
6 But I think where we find a lot more
7 evidence, again, based on our structured review,
8 is looking at this, and hematologic tumors. And
9 we start with Dr. Weisenthal's study in '86 where
10 there is 70 cases. What we did was we subtracted
11 out the 29 cases of ALL. Again, it's just a
12 judgment case one makes as to how you want to
13 treat the pediatric tumors. I can tell you that
14 the pediatric performance table was just about
15 the same as for adults, and there weren't
16 significant differences, so I think what one
17 could --
18 DR. FERGUSON: Mitch, closer to the
19 microphone please, or maybe you should hold it.
20 DR. BURKEN: Yeah. As I said, one can
21 scan through some of the pediatric studies
22 quickly, and I think get a flavor for that. But
23 moving on, looking at the adult data, we had PPVs
24 and NPVs that were over 80 percent.
25 Dr. Bosanquet in 1999 had a fairly
00279
1 elegant study, and it was reviewed earlier
2 today. I will leave the details to the group.
3 Another study that was also discussed
4 earlier today was a study by Mason that did some
5 modeling. In this particular case, not only was
6 there some clinical response data and survival
7 data that was looked at, but there was some
8 modeling done using regressions, where if you --
9 I know that the print is a little small at the
10 bottom, but I just wanted to mention that if you
11 look at the life years gained per assay, the
12 modeling here said that if you had a simulated
13 50-year old with stage C chronic lymphocytic
14 leukemia, there would be a life years gain would
15 be about six months, and about three weeks if it
16 was a simulated age 70 stage AB female. Again,
17 these are all simulations that are based on the
18 assumptions in the regression modeling. But it's
19 a little bit more of a sophisticated approach, as
20 I said.
21 And continuing on, there's been, as I
22 said, a fair amount of work in hematology.
23 Tidefelt in 1989, with more than 90 percent of
24 the patients not being previously treated. There
25 is a complex predictive value studies in this
00280
1 paper with varying anthracycline concentrations
2 and different treatment regimens. But if you
3 flush out all 40, actually 40 out of 53 patients
4 were available for clinical correlations, and you
5 can see the PPVs and the NPVs.
6 To go on now, to continue more on the
7 hematologic DiSC studies, Bird in '86 was a small
8 study. That's the one I quoted earlier because
9 there was peripheral blood and bone marrow that
10 were used as two separate sites. And there
11 seemed to be reasonably good concordance between
12 in vitro testing, between peripheral blood and
13 bone marrow. But again, I caution, the sample
14 sizes are fairly small here. Bird in '88, again,
15 another small sample study.
16 I'm going back just some more. More
17 small sample studies. Dr. Bosanquet in 1983.
18 Dr. Beksac in '88. I think the interesting thing
19 about this study was it's kind of a mixed
20 retrospective prospective approach, so it was a
21 somewhat complicated study even though there were
22 only 16 patients.
23 Dr. Bosanquet's 1991 study that was
24 actually just up on the screen a few minutes ago,
25 showed 67 patients with CLL where there was a
00281
1 survival benefit. But just before we leave DiSC
2 and the hematologic applications, you know, there
3 were some articles that didn't have documented
4 clinical criteria, and you see down, Dr.
5 Bosanquet's article. Again, we certainly looked
6 at the survival data, but we didn't feel that the
7 clinical criteria were adequately specified in
8 this particular paper even though, as I said, it
9 showed some survival data.
10 And Kirkpatrick from 1990 was a paper,
11 again, all in the backup book here that has some
12 pediatric data.
13 Moving on to MTT, several articles did
14 not have documented clinical criteria so that for
15 the purposes of this panel presentation, we
16 didn't feel it would be useful to present
17 clinical response data. And it's interesting
18 that three -- we talked about the 12 studies from
19 Cortazar and Johnson. Three of them were Yamaue,
20 1991, 1992 and 1996, but unfortunately, none of
21 those articles had, you know, adequately
22 described criteria, and we just didn't feel we
23 could construct good enough two-by-two tables.
24 And the survival rates in these studies do not