00197

1

2

3

4 VOLUME II

5 (Afternoon Session - November 15, 1999)

6

7

8

9

10 HUMAN TUMOR ASSAY SYSTEMS

11

12 HEALTH CARE FINANCING ADMINISTRATION

13 Medicare Coverage Advisory Committee

14 Laboratory & Diagnostic Services Panel

15

16

17

18

19

20 November 15 and 16, 1999

21

22 Sheraton Inner Harbor Hotel

23 Baltimore, Maryland

24

25

00198

1 Panelists

2 Chairperson

John H. Ferguson, M.D.

3

Vice-Chairperson

4 Robert L. Murray, M.D.

5 Voting Members

David N. Sundwall, M.D.

6 George G. Klee, M.D., Ph.D.

Paul D. Mintz, M.D.

7 Richard J. Hausner, M.D.

Mary E. Kass, M.D.

8 Cheryl J. Kraft, M.S.

Neysa R. Simmers, M.B.A.

9 John J.S. Brooks, M.D.

Paul M. Fischer, M.D.

10

Temporary Voting Member

11 Kathy Helzlsouer, M.D.

12 Consumer Representative

Kathryn A. Snow, M.H.A.

13

Industry Representative

14 James (Rod) Barnes, M.B.A.

15 Carrier Medical Director

Bryan Loy, M.D., M.B.A.

16

Director of Coverage, HCFA

17 Grant Bagley, M.D.

18 Executive Secretary

Katherine Tillman, R.N., M.S.

19

20

21

22

23

24

25

00199

1 TABLE OF CONTENTS

Page

2 Welcome and Conflict of Interest Statement

Katherine Tillman, R.N., M.A. 5

3

Opening Remarks & Overview

4 Grant Bagley, M.D. 10

5 Chairman's Remarks

John H. Ferguson, M.D. 28

6

Brian E. Harvey, M.D., Ph.D. 30

7

Open Public Comments & Scheduled Commentaries

8 Frank J. Kiesner, J.D. 48

Larry Weisenthal, M.D. 57

9 Randy Stein 92

Richard H. Nalick, M.D. 99

10 William R. Grace, M.D. 108

John P. Fruehauf, M.D., Ph.D. 110

11 James Orr, M.D. 127

Robert M. Hoffman, Ph.D. 131

12 Andrew G. Bosanquet, Ph.D. 136

David Alberts, M.D. 142

13 Robert Nagourney, M.D. 147

David Kern, M.D. 159

14 Daniel F. Hayes, M.D. 168

Bryan Loy, M.D. 178

15

LUNCH 196

16

VOLUME II

17

Open Public Comments & Scheduled Commentaries

18 Edward Sausville, M.D. 201

Harry Handelsman, D.O. 227

19 Harry Burke, M.D., Ph.D. 234

Mitchell I. Burken, M.D. 262

20

Open Committee Discussion 304

21

Day One Adjournment 330

22

23

24

25

00200

1 TABLE OF CONTENTS (Continued)

2 VOLUME III

3 Opening Remarks - Introduction 336

4 Open Committee Discussion 337

5 Motions, Discussions and

Recommendations 425

6

Adjournment 487

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

00201

1 PANEL PROCEEDINGS

2 (The meeting was called to order at

3 1:16 p.m., Monday, November 15, 1999.

4 DR. FERGUSON: Dr. Sausville?

5 DR. SAUSVILLE: Good afternoon, all.

6 And if I could have the first overhead, this says

7 who I am, and the general topic that I hope

8 you're interested in hearing about this

9 afternoon. Anyway, my task this afternoon is to

10 provide an overview, at least from the

11 perspective of the preclinical therapeutics

12 development program of NCI of antitumor drug

13 sensitivity testing. And I will approach this,

14 therefore, from the standpoint of one who uses

15 tests like this, and indeed, in some cases

16 actually tests that have been used for this

17 purpose for the preclinical selection of drugs

18 for more detailed evaluation, as well as from the

19 perspective of an oncologist who has occasionally

20 thought about using these tests in the treatment

21 of patients. Next.

22 So the basis for this issue in cancer

23 derives directly from the infectious diseases

24 experience, wherein a number of different disease

25 categories, such as tuberculosis, where it's well

00202

1 established that one has to establish that a

2 particular patient's infected bacillus is

3 sensitive to the agents, and a number of

4 non-tuberculosis indications, which would

5 include, for example, pyelonephritis or

6 endocarditis, where it is well established from

7 the standpoint of standard medical practice that

8 such sensitivity tests are that valuable. Next.

9 The assays as applied to cancer ideally

10 would have 95 percent sensitivity and

11 specificity, and short of that goal, would

12 hopefully be better in predicting outcome than

13 the empirical choice of the physician. And the

14 essence of the question from an oncological

15 standpoint, therefore, is whether a particular

16 test conveys information over and above what is

17 implicit in the histologic diagnosis of a

18 patient's tumor. Ideally the test would be

19 biased in favor of detecting sensitivity rather

20 than resisting, for this reason, and ultimately,

21 these tests should be able to demonstrate an

22 impact on ultimate outcome, as opposed to simply

23 response, since in oncology, good outcome begins

24 with the response, it does not end with a

25 response. One ultimately has to have evidence of

00203

1 tangible clinical benefit that changes outcome.

2 Next.

3 So among the specific assays that

4 through the years have been utilized include the

5 by now classical Hamburger Salmon clonogenic

6 assay, wherein tumors that were biopsied for

7 example, were disaggregated, plated in agarose or

8 other solid media after relatively brief

9 exposures to drug, and ultimately colonies

10 counted in 14 days. There have been

11 modifications to this, most notably the capillary

12 tube modification used by Von Hoff and

13 colleagues, and it seems to increase the number

14 of patients for which valuable data are

15 obtained. Modifications of this also include

16 radionuclide based assays, in which radioactive

17 thymidine is added after three days and thus,

18 although it is a soft agar base, one can obtain

19 information after shorter periods of time. And

20 there are also non-agar based assays assessing

21 radionuclide uptake in mass culture. Next.

22 Technical problems with clonogenic

23 assays include a number of artifacts intrinsic to

24 the practice of the assay, including clumping of

25 tumor cells, the potential of growth perturbation

00204

1 from manipulation of potential clonogenic cells,

2 reduced nutrient uptake from nonclonogenic cells,

3 with increase in the size of colonies that grow

4 out in the treated cells. Counting evaluations

5 with a potential large coefficient of variation,

6 and poor cloning efficiencies. And a major

7 limitation in the widespread use of this

8 technique relates to the fact that in many

9 instances, the majority of the specimens are not

10 actually valuable, and there is the inability of

11 this type of assay to score small numbers of

12 resistant cells, which in a clinical scenario are

13 thought to translate new ultimately resistance to

14 therapy, of the sort that is manifest by the

15 subsequent relapse of a patient with drug

16 resistant tumor. Next.

17 In various reviews, actually extending

18 from the initial use of this technique into the

19 early '80s, the cumulative experience is that a

20 relatively small fraction of patients actually

21 have colony growth. And the data that is

22 tabulated here is contained in the references

23 that were indicated. But also, there is the

24 finding that the tests are clearly better at

25 predicting negative or resistant assays, than

00205

1 sensitive assays, such that for example, if one

2 looks at those specimens that were sensitive in

3 vitro as opposed to sensitive in vivo, we have a

4 60 percent true positive, with a range of 47 to

5 71 percent. In contrast to those specimens that

6 were resistant in vitro and resistant in vivo,

7 where there was, as you can see, a 97 percent

8 true negative information. Next.

9 This led to a so-called perspective

10 evaluation of chemotherapy selection utilizing a

11 clonogenic assay, as opposed to the choice of a

12 clinician. And again, this was published by von

13 Hoff and colleagues in 1990 in the Journal of the

14 National Cancer Institute. And in the 133

15 patients randomized in a single agent therapy, of

16 those where the therapy was assigned by a

17 clinician, one had one partial response, and in

18 19 of 68 that were possible to have an assay

19 directed assignment, there were four partial

20 responses. Certainly there was no evidence that

21 this was statistically different and one

22 concluded, or this article concluded, that what

23 one might conceive of potentially a somewhat

24 improved response rate, did not translate into

25 any noticeable effect on survival. And again,

00206

1 approximately a third of the tests could not be

2 evaluated, and there were clearly no evidence of

3 survival in patients either treated according to

4 that which was recommended by the physician, or

5 all patients that were compared versus the test

6 population. Next.

7 Other specific assays which have come

8 to the fore in an effort to meet some of the

9 clear difficulties in the widespread use of the

10 clonogenic assay include the so-called

11 differential standing cytoxicity assay, or DiSC

12 assay, pioneered by Weisenthal and colleagues.

13 And here, one is essentially assessing the effect

14 on whether or not cells remain alive after short

15 periods of culture after exposure to a drug.

16 Thus, either marrow, buffy coat or a tumor

17 suspension after disaggregation, can be treated

18 with drug for anywhere from one hour to four

19 days. Interestingly, the quantification was

20 aided by the addition of so-called duck red blood

21 cells, which are easily distinguishable

22 microscopically, a dye added, and then after a

23 cytospin, one can either assess the dead cells

24 per duck red blood cells, or live cells per duck

25 red blood cells, based on the differential

00207

1 staining of live and dead cells with either

2 fast-green, which stains dead cells, or HD, which

3 stains live cells.

4 Over variants of this approach include

5 the so-called MTT assay, which is a dye that

6 depends for its coloration properties as to

7 whether or not it is reduced by living

8 mitochondria, or a fluorescein assay, where live

9 cells take up a dye, hydrolyze to it in a point

10 that is detected by a change in fluorescence.

11 But all of these techniques, again, don't then

12 depend on the growth out of clonogenic cells, but

13 rather allow a relatively short term exposure to

14 the drug to define whether there is an effect on

15 the viability of the cells. Next.

16 When this assay was, and again, this is

17 in reference to the DiSC assay, was applied

18 initially to hematologic neoplasms, there was

19 clear evidence that there was increased cell

20 survival, that is to say resistance in patients

21 who ultimately were not responsive to

22 chemotherapy that was assigned on the basis of a

23 knowledge of the tests. So in that respect, the

24 assay was certainly suggestive that it might

25 eventually correlate with clinical outcome. And

00208

1 in addition, there was a fairly good

2 correspondence, again, with delineation of true

3 positives and true negatives by this assay.

4 Next.

5 When this assay was applied to the

6 somewhat more difficult clinical category of

7 patients with lung cancer, here in an initial

8 study with non-small cell assay, the DiSC assay

9 was performed assessing sensitivity to ten drugs,

10 treating with a regimen that ultimately

11 incorporated the three most sensitive agents. In

12 this series of 25 patients, there was a 36

13 percent partial response rate with a median

14 duration of 6.5 months and with the, if you want

15 to read, it looks to be responders and a median,

16 or I should say a median survival of about seven

17 months, with an overall of about 12 months.

18 There was clearly a threefold lower assay

19 survival. That is to say, people with greater

20 cell kill in responders versus non-responders.

21 However, these authors concluded that outcome as

22 measured by response rate and survival is within

23 the range reported by the literature, that is to

24 say, even though you can detect this difference,

25 the issue of whether or not it ultimately caused

00209

1 a different outcome that might be afforded by

2 treating with drugs that would be available from

3 the literature and without knowing the patient's

4 histologic diagnosis was not apparent. In

5 addition, some drugs clearly had a much greater

6 discordance in the predictive value of the test.

7 Thus for example, 5 fluorouracil did

8 not seem to have any ultimate value in its

9 performance, and on the other hand, etoposide,

10 behavior to etoposide, was essentially predictive

11 of the behavior of all of the of the drugs. And

12 actually from a scientific perspective, we now

13 recognize that since many of these agents act by

14 inducing apoptosis, this actual result

15 retrospectively, is not that surprising.

16 Interestingly, this paper also

17 introduced the concept of so-called extreme drug

18 resistance. That is to say, you can define

19 patients who had greater than one or more

20 standard deviations resistance than the median in

21 the population, and these patients essentially

22 had zero percent response to any of the agents.

23 Next.

24 This assay was also applied in a study

25 that was recently published from the NIH, and

00210

1 attempted to individualize chemotherapy for

2 patients with non-small cell lung cancer. And

3 from a population of 165 study patients, 21

4 received DiSC based regimens, and these had a 9

5 percent partial response rate. Whereas, 69

6 patients received empiric treatment with

7 etoposide and cisplatin; these had a 14 percent

8 partial response rate. And ultimately, the

9 survival of in vitro best regimen was comparable

10 to what one would have expected from the

11 empirically chosen chemotherapy.

12 Interestingly, this study also revealed

13 an issue that also has to come up in any test in

14 which there is a second or subsequent procedure

15 to obtain tissue, in that the survival of

16 patients who had any in vitro test was actually

17 worse than those without, and this implies

18 potentially that those people that had a

19 sufficient volume of tumor to have the tests had

20 an intrinsically less survival than those that

21 did not. Next.

22 And the last clinical study that I'll

23 touch on also emanated from the NCI and was

24 published in 1997. This attempted to use the

25 DiSC assay in limited stage small cell, and here

00211

1 we turn the somewhat, and consider the use of the

2 test in what may be considered in its most

3 favorable scenario, because this disease which is

4 traditionally, and now actually standardly

5 treated with the combination of radiation therapy

6 and chemotherapy, would potentially treat

7 empirically with a regimen known to produce a

8 high level of response, and then come back after

9 finishing consolidation with radiation

10 sensitivity with either a chosen regimen based on

11 the in vitro sensitivity or a standard approach

12 using an additional three drugs that the patient

13 had not seen previously that would be regarded as

14 standard or part of the standard care of patients

15 with small cell lung cancer.

16 And in this study, there was actually a

17 trend towards somewhat improved survival in

18 patients who could actually receive the in vitro

19 best regimen, but it certainly was just a trend.

20 And most interestingly, of the 54 patients that

21 were entered, the minority of the patients could

22 actually be successfully biopsied in this very,

23 shall we say well coordinated, well resourced

24 clinical trials scenario. Next.

25 So in terms of summarizing what I list

00212

1 here as my own disinterested perspective on

2 whether or not chemosensitivity testing is what

3 one would might consider to be ready to prime

4 time in widespread use, I would offer that from

5 my perspective, no method has emerged as a

6 quote-unquote gold standard, owing to

7 methodologic variation and the definition of what

8 constitutes resistance or sensitive tests. The

9 unfortunate fact that one cannot get reliable

10 data from most if not many patients. And in the

11 few completed prospective or randomized trials,

12 there is little assurance that ultimately there

13 is a difference effected by the test.

14 What we ultimately need if tests of

15 this nature are to be potentially useful, is

16 probably better drugs, because in point of fact,

17 since most of the drugs are unfortunately

18 inactive in many of the diseases in which these

19 tests would be used, knowing that they won't work

20 is not actually terribly valuable.

21 We need a method that is applicable to

22 all specimens obtained in real time with the

23 diagnostic specimen; that is to say, to require a

24 second test, or second procedure, in order to

25 obtain the specimen, inevitably indicating or

00213

1 introduces potential biases in studies related to

2 those patients who could withstand or undergo

3 these procedures, as well as of course, making

4 the test, the performance of the test more costly

5 than one might potentially desire.

6 But on the other hand, I think the

7 future holds potentially with newer approaches,

8 including gene expression arrays, serial analysis

9 of gene expression, there may be better, and

10 hopefully more useful techniques to assess this

11 in the future. But whatever the test, be it some

12 permutation of a currently available test, or one

13 of the newer methodologies here, its ultimate

14 value should be established in prospective

15 randomized trials where one uses the

16 diagnostically guided as opposed to the empirical

17 treatment before assessing whether or not it is

18 openly valuable.

19 And I thank you for your attention.

20 DR. FERGUSON: Thank you, Dr.

21 Sausville. I think you've gone in shorter time

22 than even I asked for, and so I'll open for a

23 question or comment. Yes, Dr. Hoffman?

24 DR. HOFFMAN: Yes. I would like to ask

25 Dr. Sausville his opinion about the assays that

00214

1 were discussed this morning, based on three

2 dimensional culture and other new third

3 generation techniques that address these problems

4 and have shown to be able to assess greater than

5 95 percent of the patients' specimens, have shown

6 survival benefit, have shown very high

7 correlation to response. I would like Dr.

8 Sausville's comments on this morning's talks.

9 DR. SAUSVILLE: Again, I wasn't here

10 this morning, and indeed, my brief was not to

11 comment on specific assays from this morning's

12 activities, but to offer an overview of problems

13 in the field in general. And I would certainly

14 say that if the tests that were proposed this

15 morning seem of interest, the real question is

16 have they been evaluated in prospective

17 randomized studies. Because unless they have

18 not, or I should say until they have, one, and

19 since as far as I'm aware, they have not, it

20 would be, I think premature to conclude that they

21 are, therefore, of widespread general use.

22 DR. FERGUSON: Dr. Weisenthal?

23 DR. WEISENTHAL: Now would be as good a

24 time as any to address the issue of the

25 requirement of prospective randomized trials for

00215

1 acceptance of this technology. I think it's a

2 very important issue, several speakers have

3 raised it, and the issue is this: Should these

4 tests be used in clinical medicine until it has

5 been established in prospective randomized trials

6 that patients treated on the basis of assay

7 result have a superior therapeutic outcome to

8 patients treated without the assay result? The

9 cop-out way to answer this, which I'm not, this

10 is not my answer to it, but what I could say if I

11 wanted to cop out, and it's perfectly valid, is

12 that never has the bar been raised so high for

13 any diagnostic test in history.

14 Dr. Sausville began his talk by

15 pointing out bacterial cultures done in

16 sensitivity testing, including one of his

17 examples was serum bactericidal testing. Serum

18 bactericidal testing, for those of you who may

19 know it, is something that Medicare does

20 reimburse for. It's very controversial, it's

21 much more controversial actually than cell

22 culture drug resistance testing. The performance

23 characteristics are certainly inferior based on

24 sensitivity and specificity. And furthermore,

25 there has certainly never been a prospective

00216

1 randomized trial showing survival advantage or

2 therapeutic outcome, you know, higher cure rate

3 or anything, whether you use serum bactericidal

4 testing or not, or any other form of antibiotic

5 sensitivity test.

6 We're talking about laboratory tests,

7 not a therapeutic agent, and I think that one

8 would be advised, at least first of all, to judge

9 them on the basis of the way that other

10 laboratory tests have been judged, and that is,

11 do they have acceptable accuracy, sensitivity and

12 specificity?

13 However, moving on to the question of

14 the prospective randomized trial, all of us, no

15 one more than those of us who have been working

16 in this field for 20 years, would love to see

17 prospective randomized trials, physician's choice

18 therapy versus assay directed therapy. This has

19 been the Holy Grail. I hope before I die, I will

20 be able to participate in such a trial. I

21 mentioned earlier, the fact is that there have

22 been a lot of energetic, very talented people,

23 that have devoted their careers to this, and the

24 best example is Dr. Dan von Hoff, who is the most

25 energetic. He and I were clinical oncology

00217

1 trainees together at the National Cancer

2 Institute. We both started working in this field

3 in the same lab at the same time. And Dan had --

4 you know, my CV lists about 50 publications;

5 Dan's CV is probably closing in on 2,000. And

6 he's organized more prospective randomized trials

7 and things like that. He was unable to

8 successfully get a study initiated and patients

9 accrued, and completed. I have devoted enormous

10 amounts of effort to getting those trial done,

11 and for one reason or another, they didn't accrue

12 patients, and things like this.

13 I want to point out that in Medicare,

14 Medicare has a problem, and the problem is not in

15 the year 2000, the years between 2010 and 2015.

16 The budgetary crunch in Medicare is going to come

17 in 2010 and 2015 when those of use who are now 52

18 years old are going to be 60, 65, 75 years old,

19 and we're going to be getting cancer. What's

20 going to happen over the next ten years is that

21 there's going to be an ever increasing array of

22 partially effective and very expensive cancer

23 treatments. We're seeing that now. Drugs are

24 being approved at a very rapid pace. We don't

25 have a clue how to use them.

00218

1 We brought up the idea about using the

2 test as a litmus test, like should you pay for

3 the therapy. Well, the only way that you're

4 going to be able to ever use the test as a litmus

5 test is if you do the prospective randomized.

6 And I would submit to you that the way to get the

7 prospective randomized trials done is as

8 follows: Look at the data that you heard about

9 this morning. Surely, you must be convinced that

10 there is a germ of truth in this. You know,

11 there is a consistent, overwhelming, and I think

12 that study after study is showing that these

13 tests do predict, they can identify the

14 difference between good treatments and bad

15 treatments. So it is not much of a leap of faith

16 to say that if only someone could do the trials,

17 then there's a good chance that they would turn

18 out to be positive, and if they do turn out to be

19 positive, by the year 2010, we will have a

20 wonderful tool to triage therapy, to triage

21 patients, right at the time when Medicare most

22 needs it, when the budgetary crunch comes, when

23 we've got all these expensive cancer therapies.

24 You know, I gave you the example of the five

25 patients treated with the bone marrow transplants

00219

1 at $200,000 a patient, who did not benefit from

2 that, who then got an assay and had a great

3 result. What if they had gotten the assay? It

4 has the potential to be enormously cost

5 effective. But the only way that it will be used

6 in that way is if you do the trials, but the only

7 way -- it's a catch 22 -- the only trials will

8 ever get done -- I personally believe that if

9 Medicare approves this, it will be the shot heard

10 round the world, Swann, ECOG, CLGB, they will be

11 lining up to do trials. You guys, you know, come

12 back and maybe approve it conditionally, come

13 back in five years and see what's happening.

14 DR. BAGLEY: Well, you know, today --

15 it brings up an interesting point, and I think

16 you bring up the comment that, you know, never

17 has the bar been this high. Well, I would take

18 exception to that. I think the bar is not any

19 higher for this than for anything else that we

20 are currently looking at. And it is not

21 something that we are not used to hearing for

22 other things too, and that is, gee, we're paying

23 for things that were never subjected to any

24 scrutiny, so why should we subject it to

25 scrutiny. I mean, that's -- we hear it all the

00220

1 time, and that's just not going to work in this

2 day of evidence based medicine. And I'll tell

3 you because, you know, how much it costs isn't an

4 issue that we're really here to talk about today.

5 Because, you know, two years HCFA reorganized

6 and changed the whole focus of coverage, moved

7 the coverage office away from that part of HCFA

8 that pays for things and looks at program

9 integrity, and moved it into the place at HCFA

10 that looks at quality and clinical standards.

11 And that's exactly the focus we ought

12 to be doing because, you know, what it boils down

13 to is, it's not just why not pay for it, it

14 doesn't cost that much, or it might save a little

15 money. But it's let's pay for it because it's

16 the right thing to do, and represents quality

17 medicine. And when that happens, you know, we

18 shouldn't just pay for it, we should pay for it,

19 we should promote it, and perhaps, if the

20 evidence is there, we ought to insist on it. I

21 don't think the clinical community or the

22 beneficiary community would tolerate us insisting

23 upon a pattern of behavior, or even promoting a

24 pattern of behavior, without evidence, and so why

25 should we pay for it without evidence? And

00221

1 that's the change we're trying to make, that's

2 been the whole point of changing the coverage

3 process, putting together advisory committees

4 like this, is to say, let's look at evidence and

5 let's make decisions about what we pay for based

6 on quality, and once we know what quality is,

7 let's not just pay for it, let's not stop there,

8 but let's pay for things that we are willing to

9 promote and perhaps even insist upon. And so,

10 that is the reason for the focus on evidence, and

11 it's going to be there. And the fact that we may

12 not have subjected past technologies to the same

13 evidence, doesn't mean we can't go back and look

14 at them, time willing, but it doesn't mean we

15 should lower the bar for new technologies.

16 DR. FERGUSON: Do you want to respond a

17 minute, Dr. Sausville?

18 DR. SAUSVILLE: Yes, I do wish to

19 respond to that. And I want to thank you for

20 that perspective, because clearly, there is

21 nothing that ever, that doesn't lack for good

22 intentions. Clearly, the desire to convey useful

23 patient benefit goes without question. And the

24 efforts that were cited over the past two decades

25 have really been enormous efforts in that regard.

00222

1 But one distinction that I must point out is

2 that when one considers the bacteriologic

3 analogy, the diagnostic specimen, that is to say

4 the bacteria growing in a bottle, equals the test

5 specimen. So that is one intrinsic difference.

6 In many cases, cancer related

7 sensitivity testing requires additional efforts

8 to get and process tissue different than the

9 routine. So it is a point where the analogy is

10 not exactly apt, I think. And you quoted the

11 endocarditis issue, and you're right. It is

12 controversial as to whether or not ultimately

13 sensitivity testing is beneficial, because among

14 the lethal consequences of endocarditis are a

15 series of almost anatomical problems, valve

16 problems, thrombi, et cetera, that are not in any

17 way predicted or dealt with by the sensitivity

18 testing. So again, it's -- I think that the two

19 are, recall each other, but have important

20 differences in thinking about the ultimate value

21 of the tests.

22 DR. FERGUSON: Dr. Sundwall?

23 DR. SUNDWALL: Just a quick question.

24 Dr. Sausville, I am a family physician, not an

25 oncologist, but I was very perplexed by your

00223

1 statement. If I heard it correctly, you said,

2 knowing what drugs won't work is not all that

3 helpful. I don't understand that, given the

4 morbidity and the difficulties with

5 chemotherapeutic agents. I've had many patients

6 suffer terribly from chemotherapy, and how can

7 you say not knowing what won't work isn't that

8 helpful?

9 DR. SAUSVILLE: Because the context in

10 which -- and I respect your point, and I don't

11 certainly mean to in any way imply a lot of sole

12 searching on both the part of physicians and

13 patients that goes into the decision to entertain

14 therapy. But in oncology, frequently the

15 treatment is driven by the histologic diagnosis,

16 so if for example the initial diagnosis of small

17 cell lung cancer, if one could have a pattern of

18 drugs that have more or less a susceptibility, I

19 am not aware that such tests would be considered

20 definitive in saying, well, because you happen to

21 have a resistant small cell lung cancer, you

22 should not receive any therapy. So in that case

23 the therapy, or choice of therapy, is ultimately

24 driven by the histologic diagnosis that's

25 apparent. Consider the opposite point. Somebody

00224

1 with a chemotherapy refractory neoplasm,

2 manifested, such as pancreatic or renal, which

3 are problems which as far as I'm aware, are not

4 considered responsive to any set of agents

5 routinely.

6 Again, the information of whether the

7 patient has that dire situation is implicit in

8 the histology. It's not clear that any tests

9 that can be done ultimately defines a drug that

10 can change the outcome that is at the present

11 time ordained by the histology. So I take your

12 point, that being able to reliably choose drugs

13 that convey a useful clinical benefit is very

14 worthwhile and a goal that should be pursued. I

15 am not sure that the current tests actually allow

16 the clear delineation of such agents.

17 And in that regard, you can tell a

18 patient who has the unfortunate diagnosis of

19 pancreatic cancer, that they are likely not going

20 to respond to a medicine chosen on that basis, or

21 chosen after having gone through an additional

22 test to obtain tissue and then tested for assay

23 resistance.

24 MR. KIESNER: I think Dr. Sundwall

25 asked a very interesting question, and I think

00225

1 there are at least two clinical strategies for

2 using this type of information. I think on one

3 hand you can say, we're going to select a drug,

4 and another, there may be a different clinical

5 setting, and I will give you two examples. I

6 think this is very important.

7 Dr. Alberts spoke this morning about a

8 clinical situation where he would be referred a

9 patient from another hospital, and that patient

10 may not, may be unaffected by the primary care,

11 he has relapsed, the tumor is growing, and they

12 send him, they send the patient to him. Doctor,

13 what can you do to help me? In that situation,

14 there may be three or four or five different

15 drugs, single agents, none of which have been

16 determined to have a significant clinical benefit

17 over the other drug in that situation. If I am a

18 patient and if any physician can tell me of the

19 five drugs, Frank, two of those drugs you're

20 resistant to, what has he told me? He's said,

21 I'm not going to use those two drugs, I've saved

22 you from the possibility that you're going to get

23 those two drugs and not benefit from it. It's

24 very, very well documented that these tests are

25 able to identify resistance, and if I'm a patient

00226

1 and if my physician in that setting can identify

2 the resistance, I believe he has done me a real

3 service.

4 The second situation is one which I

5 experienced personally. And I'm not mentioning

6 it because it's personal, I'm mentioning it

7 because it's exemplary of the position that a lot

8 of families can be in relation to elderly

9 Medicare patients. In -- I'm from Minneapolis.

10 My father was in St. Mary's Hospital. He was ten

11 years past Medicare age and was being treated for

12 cancer. We saw what the drug was doing to him.

13 If his physician could have come to me and said,

14 Frank, I have two or three other drugs, two or

15 three other choices I could try, and I have done

16 a test and I could see that they are all

17 resistant, I don't think we should go any

18 further. From the family situation, it was a

19 very difficult situation to make, do you go

20 further. We made the situation not to. But to

21 this date, if I would have had an assay that

22 would have told me the drugs that the physician

23 was considering will not work, I would feel I

24 would have been served, our family would have

25 been served, and my father would have been

00227

1 served. Elimination of drugs, identification of

2 drugs in those types of clinical settings that

3 don't help, or help you stop therapy, I think is

4 something worth considering.

5 DR. FERGUSON: Thank you. Very

6 briefly.

7 DR. SAUSVILLE: So my response to that

8 is the essence of the issue, and it also pertains

9 to the question before, is whether or not one

10 could have reached the conclusion that drugs

11 would not have benefitted your relative by the

12 diagnosis itself, and not have ultimately had to

13 rely on a test. And here the performance

14 properties of the -- the unfortunate performance

15 properties that when a drug is predicted to be

16 sensitive by these tests, the outcome is

17 unfortunately not any different in many cases,

18 than when things -- and in fact, in all cases

19 that I'm aware -- of when drugs are seen as

20 resistant, is the essence of why we are in a

21 quandary about how to appropriately use this.

22 DR. FERGUSON: Thank you. Harry.

23 Dr. Handelsman?

24 DR. HANDELSMAN: I'm Harry Handelsman.

25 I'm at the Center for Practice and Technology

00228

1 Assessment, Agency for Health Care Policy and

2 Research, and our office was asked by HCFA to

3 review the 1990 article by Kern and Weisenthal on

4 the use of suprapharmacologic drug exposures.

5 And I'm going to briefly synthesize what I think

6 was the essence of that article and then give my

7 personal critique. Unfortunately, some of this

8 is going to be repeating some of the data that

9 you heard earlier today, and that's unavoidable.

10 Bayes' theorem suggests that drug

11 sensitivity testing in vitro will be accurate in

12 predicting clinical drug resistance in tumors

13 with high overall response rates only if the

14 assays have a specificity of greater than 98

15 percent for drug resistance. A 1989 review of

16 the literature by the authors indicated that a 30

17 to 50 percent false positive rate, and a false

18 negative rate as high as 15 percent.

19 This reported assay, which was

20 developed by Kern, uses a soft agar culture with

21 products of concentration times time higher than

22 those which can be achieved clinically and used

23 drug exposures 100-fold higher than other

24 contemporaneous studies. Response assessments

25 were made by retrospective and blinded chart

00229

1 reviews. The authors reviewed 450 correlations

2 between assay results and clinical response over

3 an eight-year period. The assay was calibrated

4 to produce extremely high specificity for drug

5 resistance. Two assay end points were used,

6 colony formation and thymidine incorporation.

7 Overall response rates were 28 percent

8 using the colony formation end point, and 34

9 percent using the thymidine incorporation end

10 point. At the assay lower cutoff value, the

11 assay was 99 percent specific in identifying

12 non-responders, fulfilling the Bayes prediction.

13 Patients with drug resistant tumors could be

14 accurately identified in otherwise highly

15 responsive patient cohorts. The demonstration

16 that the post-test response probabilities of

17 patients varied according to assay results in

18 pretest response probabilities allowed the

19 construction of a nomogram for predicting

20 probability of response.

21 In 1976, it appeared that no method of

22 predictive testing had gained general acceptance,

23 and during the subsequent decade, high false

24 positive and false negative rates continued to

25 plague the field of in vitro testing.

00230

1 The clinical advantages of developing a

2 highly specific drug resistant assay include:

3 The avoidance of the use of inactive agents in

4 treating responsive tumors; the avoidance of drug

5 related morbidity of inactive agents; the

6 identification of drug resistant tumors for

7 timely consideration of alternative therapies;

8 and obviously, the cost savings of avoiding the

9 use of ineffective agents. Alternative assay

10 methods are available. However, the use of cell

11 culture assay had the advantage of measuring the

12 net effect of both known and unknown mechanisms

13 involved in drug resistance.

14 It is indeed possible to estimate the

15 post-test response probability for specific drugs

16 in specific tumors and patients. This can be

17 achieved through the determination of assay

18 results and the application of a constructed

19 nomogram for assay predicted probability of

20 response.

21 In general, efficacy studies in both in

22 vitro and in vivo tumor models provide an

23 opportunity to obtain data on both efficacy and

24 toxicity, and to refine dose and schedule

25 information for clinical trials. In vitro

00231

1 testing has been extensively applied to determine

2 the potential efficacy of individual drugs and

3 remains an attractive alternative to testing

4 empiric regimens in phase I and phase II clinical

5 trials. In vitro testing can differentiate

6 active and inactive agents, but cannot serve as a

7 substitute for in vivo studies, despite providing

8 elements of both positive and negative predictive

9 reliability.

10 Combinations of agents, which are the

11 most widely applied treatment strategy, are best

12 evaluated using in vivo models, where both

13 toxicity and pharmacokinetics can be adequately

14 studied. Although in vitro assays can provide

15 primary drug resistance date, the most relevant

16 outcome from such assays is improved patient

17 survival, and there have been no clinical trials

18 demonstrating such a result. In addition, it

19 remains to be determined if in vitro testing will

20 be found to have direct clinical applications for

21 disease or patient specific therapies. There

22 have been encouraging reports of survival

23 advantage of patients treated with in vitro

24 directed therapies, but these require

25 confirmation from larger numbers of patients and

00232

1 variety of tumors.

2 Both randomized and non-randomized

3 studies comparing tumor responses to chemotherapy

4 selected by in vitro testing with empirical

5 chemotherapy have produced conflicting results.

6 Response rates appear to be better with in vivo

7 selective agents. However, the impact on

8 survival has not been adequately addressed.

9 Ideally, in vitro assays should be correlated

10 with both response and survival data. The most

11 significant issue in the realm of cancer

12 chemotherapy is that of the resistant

13 mechanisms. The ability to identify ineffective

14 agents in these assays, albeit potentially

15 important, does little to elucidate the

16 mechanisms problem. The assay described in this

17 article can perform its intended task of

18 identifying resistant tumors, and determining a

19 probability of response, but its clinical utility

20 has not been established.

21 DR. FERGUSON: Thank you. I think we

22 have time for a few questions. Panel, or

23 others? Comments? Yes?

24 DR. FRUEHAUF: I think that was a nice

25 summary of the paper and I think the issue of

00233

1 survival was addressed this morning.

2 DR. HANDELSMAN: Excuse me, if I can

3 interrupt. The issue of survival was predicted,

4 but it wasn't on a comparison with alternative

5 therapies.

6 DR. FRUEHAUF: That's true. It was

7 survival in a blinded prospective way, looking at

8 people just getting empirical therapy and asking

9 the question, if you're not going to respond,

10 will you have inferior survival? And we've

11 addressed the issue of, do you want to get a

12 drug, as a person who has cancer, that won't work

13 and won't benefit your survival? And I think

14 that's the important point that this paper is

15 establishing, the utility of knowing that a drug

16 will not be of benefit to a cancer patient.

17 And I have dealt with neuropathies that

18 are incapacitating to professional tennis

19 players. I have had to deal with all sorts of

20 toxicities, and many of these people progress

21 through therapy and die of their disease, and the

22 quality of their life during that progression was

23 significantly adversely affected by getting

24 ineffective therapy. So the clinical utility

25 question to me as a practicing physician, is to

00234

1 not harm people with ineffective therapy that

2 will not, which has been demonstrated not to

3 benefit their survival. And I think most people

4 understand, if you don't respond to the therapy,

5 you're not going to live longer. And we're not

6 trying to say that the test will predict a drug

7 that will help people live longer, for drug

8 resistance assays. We are trying to say, and you

9 stated, and Dr. Sausville stated, that these

10 assays accurately predict drugs that will not be

11 effective. And then the question is, so what? I

12 think the answer to the question so what is, so I

13 don't want to give those drugs to my patient.

14 DR. FERGUSON: Thank you. Okay. I

15 guess we can go on. Harry, thank you very much.

16 Dr. Burke?

17 DR. BURKE: I think we're going to have

18 a lot of fun this afternoon. My name is Harry

19 Burke. I'm a consultant to HCFA. I'm an

20 internist. I'm a methodologist. I'm only here

21 for today.

22 The first couple slides that I am going

23 to present are not HCFA's position, they're my

24 personal review on the subject, and shouldn't be

25 considered HCFA's policy. I am going to address

00235

1 three issues today. First, the levels of

2 evidence, which has been raised several times by

3 various speakers. I'm going to talk about test

4 accuracy. And then I am going to talk about the

5 Kern and Weisenthal article that Dr. Handelsman

6 just gave us an introduction to.

7 But before that, I would like to make a

8 couple comments. First, the extreme drug

9 resistance is really a therapy specific

10 prognostic factor. It really has to be looked at

11 in the context of other therapy specific

12 prognostic factors. Dan Hayes was right. It's

13 like ER and PR, and these other factors. And

14 there's a scientific rationale underlying therapy

15 specific prognostic factors that must be dealt

16 with. The utility of the test depends on the

17 characteristics of the test; that's clear. But

18 it also depends on the efficacy of the treatments

19 if it's a therapy specific prognostic factor,

20 because they're inextricably linked together, and

21 you can't separate the two. And it depends on

22 the prevalence of the disease or the resistance

23 in the population under study. So it's really

24 those three factors together that must be taken

25 into consideration when looking at something like

00236

1 this.

2 Let me make another point, and that is,

3 when we talk about the utility of this test, we

4 can't be talking about the utility for individual

5 patients. We have to be talking about the

6 utility of the test for a population of

7 patients. So we can't switch back and forth

8 between the two, because we're really mixing

9 apples and oranges when we do that.

10 I'd like to make a couple comments

11 about what has been said earlier. Fruehauf,

12 Weisenthal and others have suggested consistent

13 findings across studies, and they've made a claim

14 that that proves something. And I would like to

15 suggest that, yes, consistent findings across

16 studies can be due to robustness of the

17 underlying phenomenon. But it can also be due to

18 consistent biases across the studies. And so if

19 you are going to make a claim for consistency of

20 35 studies, that all suggest the same thing, and

21 that's a robustness claim, you'd better be

22 prepared to tell me why it isn't due to biases in

23 the 35 studies themselves. So you really have to

24 look at each of the 35 studies and you have to

25 ask the question, are these really consistent?

00237

1 You can't just wave a hand and point to 35

2 studies. You have to rule out the alternative

3 hypothesis.

4 Secondly, I'm a little confused.

5 Kiesner pointed out that we could use this task

6 at the bedside at the discretion of the patient,

7 or I mean of the physician, while Kern suggested

8 that it could be used to deny a particular drug.

9 And I need to know, which is it? Is it that the

10 evidence is so convincing that it can be used to

11 deny a particular therapy, or that it's not that

12 convincing, and it's just one of an armamentarium

13 of tests that are available to the physician.

14 First, let me just do a little

15 background. Comparative clinical benefit is what

16 I'm looking at. This is my gloss on reasonable

17 and necessary. It could be defined as the test

18 or treatment providing a measurable improvement

19 over all the current relevant tests and

20 treatments at a cost commensurate with the

21 measured improvement. I also suggest that FDA

22 approval is prima facie evidence of safety and

23 efficacy, but if that isn't there, I think safety

24 and efficacy must also be demonstrated. And a

25 comparative clinical benefit study of a

00238

1 prospective, or of the test or treatment, must

2 compare itself to the other tests or treatments.

3 It doesn't stand alone. And so when you say

4 well, this test or treatment is really good, you

5 have to say what the other tests and treatments

6 are, what you're comparing it to.

7 Now I would -- I am not totally a

8 believer in randomized clinical trials for

9 everything. My suggestion is that there may be

10 three levels of evidence that can be adduced: A

11 strong evidence, which is either a large

12 prospective randomized clinical trial, or two

13 large retrospective studies where one study

14 independently replicates the other study. I

15 think that's good science as well. Or two medium

16 size randomized prospective trials. I think all

17 of those would be strong evidence. If I saw a

18 really large retrospective study that was

19 independently replicated by independent

20 investigators, independent institutions, I would

21 take that as fairly strong evidence.

22 Moderate strength are medium sized

23 prospective trials, a large well designed

24 retrospective study that hasn't been replicated,

25 or two medium randomized prospective clinical

00239

1 trials, medium size.

2 Weak evidence. Small properly designed

3 and implemented prospective randomized trials, I

4 think are weak evidence, and I think are well

5 recognized as that, and I think Pito and others

6 have suggested meta-analysis to overcome the

7 weaknesses of small randomized clinical trials.

8 Alternatively, two medium sized retrospective

9 studies that were done by independent

10 investigators might be good evidence.

11 But insufficient evidence, small

12 systematic studies, I consider them really

13 exploratory rather than evidence. Case series, I

14 think are well considered as anecdotal. And any

15 study that's not properly designed, implemented

16 or analyzed must be considered fatally flawed.

17 Large is 500 patients, medium sized,

18 250, small is less than 250. You know.

19 Okay. Test accuracy. What is a

20 properly designed, implemented and analyzed

21 study? Well, test accuracy of course is an

22 association between each patient's predictions

23 based on the test, and each patient's true

24 outcome. That's test accuracy. The factors that

25 affect test accuracy include the study

00240

1 population, were the patients who were selected

2 easy to predict. Because you can select patient

3 populations, and we'll get into that later, that

4 are very easy to predict by just about any test.

5 The test characteristics: Was the test assessed

6 in the clinical setting which it's intended to be

7 used for? The reproducibility: Does the

8 prediction variability increase across

9 laboratories and reagents? And finally, the

10 method of measuring the accuracy: Was the

11 correct method used?

12 And I'm going to focus on two of these,

13 the first and fourth, which are the most

14 problematic.

15 Okay. Sample size, or study sample

16 characteristics. The composition of the study,

17 the study population, makes a difference in the

18 observed accuracy. A sample with only extreme

19 cases, i.e., the predictors are extreme values of

20 their range, will be easier to predict than a

21 sample with many intermediate cases, the

22 predictions are mostly in the middle of their

23 range. For example, for women with breast cancer

24 who have many positive lymph nodes, their

25 outcomes are fairly easy to predict. Women with

00241

1 metastatic disease, their outcomes are pretty

2 easy to predict. It isn't a hard task to do.

3 What is hard to do is to predict the women with

4 small tumors and with no lymph node involvement

5 or metastatic involvement, that's really tough to

6 do. So, if you just pick an extreme population,

7 it turns out those are pretty easy predictions to

8 make, but it turns out that most patients aren't

9 in the extreme, so it's relatively unuseful.

10 Okay? Thus, the sample must be representative of

11 the real world in which the test is to be used.

12 Measurement of test accuracy. There

13 are several ways to assess test accuracy. The

14 correctness of the accuracy assessment method

15 depends on, so when you select a method of test

16 accuracy, whether there is a preexisting

17 threshold, in other words, is there something out

18 there that says everybody above this should be

19 positive, everybody below should be negative,

20 does that already exist, or do you have to

21 construct it? The number of tests to be

22 assessed. And whether the assessment is

23 performed on one population or more than one

24 population.

25 And just very briefly, this is really

00242

1 hard to read. I can't get the lines on tables to

2 work out for me, so this is lineless. But it

3 turns out that the sensitivity and specificity

4 pairs are really, have one threshold, they do one

5 test, and its one population. Okay? Positive

6 and negative predicted value, there is one

7 threshold, one test, and two or more populations,

8 because really, the positive and negative

9 predictive value we're talking about are

10 different prevalences, therefore, different

11 populations. And the area under the receiver

12 operating characteristic includes all thresholds,

13 two or more tests are assessed, and one

14 population. So in other words, we use the ROC as

15 a best unbiased measure of test accuracy.

16 In terms of the measures of accuracy

17 discussed above, without changing the test

18 itself, there are only two ways to change the

19 accuracy of the test. One way, of course, is to

20 change the threshold of the predictions, and then

21 your sensitivity and specificity would change.

22 And the other way is to change the prevalence of

23 the disease in the population, because then your

24 negative and predicted -- positive and negative

25 predicted values will change. Okay?

00243

1 Prevalence's effect on accuracy. The

2 optimal prevalence for assessing the accuracy of

3 the test is to use a population composed of 50

4 percent disease, 50 percent unaffected. In this

5 situation, the prevalence itself provides no

6 advantage to the test. As the prevalence departs

7 from 50-50, the impact of predicting the

8 prevalence becomes more prominent. In other

9 words, if the test acted as a naive Bayesian

10 classifier, then for each patient it would always

11 predict the most frequent outcome, in other

12 words, it would predict the prevalence. So for

13 example, if there was a 90 percent prevalence in

14 a diseased population, then the naive Bayesian

15 classifier would say disease every single time

16 for every single patient, and you would be right

17 90 percent of the time. That's pretty good.

18 Okay? That's a pretty accurate approach. As the

19 proportion of patients with or without the event

20 moves, either toward a hundred percent or zero,

21 the naive Bayesian approach becomes more

22 effective, more efficient in its predictions. So

23 it's only at 50-50 for binary outcome, that you

24 neutralize the naive Bayesian classifier

25 approach. In other words, if the true prevalence

00244

1 of the disease in a population is close to a

2 hundred percent, it's almost possible for a test

3 to add predictive information. Okay? That's

4 really an important idea. So, as you get towards

5 high prevalences, almost no test will be helpful

6 anymore. Okay?

7 Changes in the prevalence of the

8 disease in a population, as reflected by

9 corresponding changes in the test's positive and

10 negative predictive values. If one were allowed

11 to report the positive predictive value, or the

12 negative predictive value of tests just by

13 itself, then one might be tempted to create or

14 select a high prevalence population for

15 assessment of the test, because the test would

16 appear to possess a high predictive accuracy,

17 okay? Until of course, it was compared to the

18 naive Bayesian classifier, at which point it

19 would cease. Thus, both the positive and

20 negative predictive values of the test must be

21 assessed. Then, if the prevalence is not 50

22 percent, the test must be compared to the naive

23 Bayesian classifier. Further, both the

24 sensitivity and specificity of the test must be

25 assessed in terms of the cutoff that was

00245

1 selected, and the prevalence, because it turns

2 out that although it's commonly thought that

3 prevalence doesn't affect sensitivity and

4 specificity, it certainly does, and there are a

5 number of papers that demonstrate that.

6 So, a better way to assess the accuracy

7 of the test is to use the ROC. This measure of

8 accuracy is impervious to changes in prevalence

9 and reflects the characteristics of the test

10 across all sensitivity and specificity pairs.

11 Well, okay. So now, I was asked to

12 take a peek at Kern and Weisenthal's paper as

13 well, and it turns out that they really are very

14 sophisticated in their use of data and results.

15 It's probably one of the most sophisticated

16 papers I've ever read, and I have read quite a

17 few. I'm going to talk about those areas of the

18 paper.

19 Overview of the study. Kern and

20 Weisenthal used two in vitro tests, which have

21 been mentioned, as surrogate outcomes for

22 response to chemotherapy in patients with

23 different types of cancer. If a patient's tumor

24 demonstrated drug resistance in a test, i.e.,

25 after the patient's tumor cells were exposed to

00246

1 the drugs for a certain period of time, and the

2 cells did not achieve a threshold inhibition, the

3 test was interpreted as predicting that the

4 patient would not clinically benefit from

5 receiving the drug.

6 So, we go back to our levels of

7 evidence, and we can ask overall about this

8 study, where it would lie in our levels of

9 evidence? Well, the colony formation test is

10 really Level III, it's weak evidence. The

11 thymidine incorporation is really Level IV,

12 insufficient evidence.

13 But, not letting that bother us too

14 much, let's talk about the study itself. It's a

15 retrospective chart review, subject to several

16 biases, including therapy selection bias, who

17 received the therapy, and study selection bias,

18 which patients were included in the study. And

19 the study was not validated on an independent

20 population, but it was done on the same

21 population. It was done from 1980 to 1987 in the

22 United States.

23 The study characteristics. Initial

24 population was 5,059 patients. From that, they

25 winnowed it down to 450 patients that they

00247

1 actually studied, about 9 percent of the initial

2 population. They looked at eight different types

3 of cancer. They had 332 colony formation

4 patients, 116 thymidine incorporation patients.

5 And the non-respondent prevalence was 71 percent

6 of the population.

7 One thing that the study wasn't very

8 clear about, it said that virtually all patients

9 were treated with standard chemotherapy, but then

10 later on it said, most of the patients whose

11 specimens were analyzed did not receive

12 chemotherapy because they underwent curative

13 surgical procedures. And I didn't understand

14 that distinction.

15 We'll assume that all 450 patients in

16 the study received chemotherapy. The percentage

17 of patients who receive chemotherapy today may

18 actually be much higher. The criteria to decide

19 which patients received chemotherapy is not

20 reliable. This is really not a very acceptable

21 approach to a study. If in fact you're going to

22 predict who's going to respond to chemotherapy, I

23 think you really have to say how chemotherapy was

24 selected, what the selection criteria were.

25 Also not provided were the patient

00248

1 characteristics of the study population, and this

2 is really critical information. For example, if

3 the population was composed of patients who had

4 already received primary chemotherapy, had

5 incurable disease, and were undergoing salvage

6 treatment, then this study would not be

7 applicable today, and in addition, the results

8 would be biased. So we really need to know what

9 the chemotherapy selection criteria were, and

10 what the patient population characteristics were,

11 neither of which are provided to us. There is no

12 basis from which to understand the results that

13 we are seeing.

14 Now, the function of the test is to

15 predict clinical non-response to chemotherapy

16 using suprapharmacologic drug doses. Now, we're

17 interested in the non-response rate per drug per

18 cancer type per test type. That's what we're

19 interested in. So there were eight drugs. Now

20 I'm not going to talk about combination therapy

21 because that's a whole other subject. There are

22 eight drugs, eight cancer types, that means there

23 were 64 bins, okay? So that means per cancer,

24 per treatment, so there were 64 of those

25 combinations for each of the two tests, for a

00249

1 total of 128 accuracy assessments. And excuse,

2 the lines aren't there, but you see 64 bins. And

3 for each bin, you would want to know

4 prospectively, hopefully randomized, you would

5 want to know, for disease one, treatment one,

6 what does the test say, okay, about this

7 population? In that one cell. And then you

8 would want to follow that population over time

9 and see what actually happened to those people.

10 So for breast cancer and a particular

11 chemotherapeutic agent, you would like to see,

12 did the test predict for that chemotherapeutic

13 agent for breast cancer, successfully. And you'd

14 want to do that for each of the 64 cells. And in

15 fact, you must do it for each of the 64 cells.

16 If there were the same number of

17 patients per cancer type, then the 118 patients

18 tested for thymidine incorporation would be at

19 1.8 patients per bin, for this study. And for

20 the 332 patients tested for colony formation,

21 there would be 5.2 patients per bin. These

22 frequencies would be too low to be meaningful.

23 Now, out of the eight drugs tested and

24 reported, the only drugs to use today, and are

25 they not used in combination, the efficacy, the

00250

1 efficiency of these tests must be demonstrated

2 with each chemotherapeutic agent in use today,

3 and for each combination of agents, each type of

4 cancer.

5 Now, just a couple final points. It's

6 unclear why this study provided two sets of

7 thresholds instead of one. Further, although two

8 thresholds were tested for significance, three

9 were presented in the text, shown in the tables

10 and figures. The first threshold is 45 to 75,

11 and the second one was 15 to 40. In this study,

12 the thresholds that were selected to assess on

13 the same population that were used to determine

14 the optimal thresholds. This elementary mistake,

15 reporting the results from the population used to

16 create the threshold, rather than the results of

17 an independent population, always results in the

18 overestimation of test accuracy.

19 The outcome was standard response

20 criteria. We are never given a definition of

21 what standard response criteria are. We don't

22 know who got the chemo and why. We don't know

23 the study population. We don't even know the

24 outcome. We are never given definitions of any

25 of those three. It's absolutely critical that

00251

1 the specific response criteria employed by the

2 investigators be revealed if that is their

3 outcome.

4 Now of course, Rich Simon, who many of

5 you know at the NCI, and others, have pointed out

6 that response is an unreliable outcome and should

7 be avoided if at all possible. So, okay. So

8 rather than the 64 sets of results that we were

9 looking for, two sets of results were presented,

10 one for each of the two thresholds. Each of the

11 results is across all eight cancers, all eight

12 therapies, and the tests, and the results are

13 there for the first threshold, 60 something

14 percent sensitivity, 87 specificity, 43 and 99.

15 Clearly, the sensitivity goes down as the

16 specificity goes up. Neither sensitivity or

17 specificity pair is very high. Combining all

18 results into one conglomeration provides no

19 information regarding the utility of the test for

20 each drug in terms of each cancer type. The

21 study should have reported the area of the ROC

22 curve, both tests, for the 64 sets of results.

23 Thank you.

24 DR. FERGUSON: Thank you very much.

25 We're actually at our time for a break.

00252

1 Perhaps -- it is almost 2:30. If we take a

2 15-minute break, I think, yes, would you please

3 come back up, because there may be a couple of

4 questions for you.

5 (Recess from 2:25 p.m. to 2:45 p.m.)

6 DR. FERGUSON: I wonder if there are

7 any in the audience, or panel for that matter,

8 who would like to ask Dr. Burke some questions

9 related to his presentation? And also, Dr. Burke

10 has promised to give us the last few slides. Are

11 there questions for Dr. Burke from members of the

12 audience or from the panel?

13 Dr. Weisenthal, did you have a question

14 that you wanted to ask, or a comment?

15 DR. WEISENTHAL: I want to thank

16 Dr. Burke. He started off by paying me

17 compliment, and he said of Dr. Kern and I's

18 paper, that this is one of the most sophisticated

19 papers that he's ever read. I've also been

20 talking to critics of these technologies for 20

21 years, and that's the most sophisticated

22 criticism that I've ever had, so I want to

23 congratulate you on that.

24 There are several points that were

25 raised in your talk which should be addressed.

00253

1 Just to begin with, the study by Kern and

2 Weisenthal that you spent the bulk of your time

3 reviewing, just to begin with that, you brought

4 up several methodologic criticisms and raised

5 questions about patient selection and so forth.

6 I want to remind everyone here that that was

7 published in the Journal of the National Cancer

8 Institute. I assure you it underwent rigorous

9 peer review. When we submitted our first draft

10 of the manuscript, the reviewers there had

11 certain problems with it and they had certain

12 things they wanted clarification of.

13 DR. BURKE: But that's an appeal to

14 authority.

15 DR. WEISENTHAL: No, no, no. Dr.

16 Burke, had you been one of the reviewers, no

17 doubt you would have raised those issues at the

18 time and we would have responded to those. And

19 I'd like to ask Dr. Kern now if he can respond,

20 so we're in consideration of that you were one of

21 the -- you know, you can't blame us because you

22 were not the reviewer of our paper. Had you been

23 there and helping us to get the essential

24 information out there, I'm sure it would have

25 been a better paper. But we'd like to address

00254

1 those issues that you raised at this time, if

2 that's okay.

3 DR. BURKE: Absolutely.

4 DR. KERN: One of the points was the

5 selection bias. How could you end up with 450

6 correlations out of 5,000 patients in the study?

7 Well, the 5,000 patients was an overview of all

8 the tests that we had done in the laboratory. It

9 wasn't meant to imply that the clinical study was

10 based on 5,000 patients. And in fact, at the

11 Department of Surgery, UCLA, where I was, most of

12 the patients were treated with surgery or

13 radiation, not with chemotherapy.

14 Secondly, many of the patients that

15 received chemotherapy received adjuvant

16 chemotherapy. So the inclusion criteria of the

17 study to get to 400 patients included, first,

18 patients had to have advanced disease; second,

19 they all had to have objectively measurable

20 disease, either by CT scan, x-rays or so on.

21 Okay?

22 Now, as far as another comment that you

23 made about one study, but it's not been

24 independently validated, I think I may ask Dr.

25 Bosanquet to address that issue, because he

00255

1 published an article in Lancet a couple of years

2 after our paper.

3 DR. BURKE: Did you want to address any

4 of the other issues that I brought up?

5 (Inaudible response from audience.)

6 DR. BURKE: I mean, this is not an

7 opportunity for us to get into whether the study

8 has been validated or not at this time. That was

9 just an issue that I raised, and perhaps at

10 another forum that can be addressed further. I

11 think we have time limitations.

12 DR. KERN: Well, I will try to answer.

13 DR. BURKE: So keep going. There were

14 a lot of issues.

15 DR. KERN: Bring up a couple of the

16 issues, remind me of them. Let me see what you

17 consider a serious objection.

18 DR. BURKE: Well, the selection, the

19 patient characteristics, the criteria for who got

20 what treatment.

21 DR. KERN: Let's go one at a time. Who

22 got what treatment was determined independently,

23 not by the assay, but by the disease type. The

24 patients went on standard protocols. Most of the

25 patients who ended up at being UCLA, an academic

00256

1 center, were all on some sort of clinical trial,

2 randomized trial protocol.

3 DR. BURKE: What were the standard

4 protocols? What was the response criteria that

5 you used?

6 DR. KERN: The response criteria were

7 the ECROG criteria of partial response and

8 complete response.

9 DR. BURKE: And what percentage was

10 each in terms of your study?

11 DR. KERN: I'm sorry, I don't

12 understand.

13 DR. BURKE: In other words, in terms of

14 response, global response measured, and what

15 percentage of these patients were partial

16 responses, what were complete, and then at that

17 time, how were those defined in your study

18 population?

19 DR. KERN: Okay. The responses were,

20 again, just by objective measurements. It was

21 retrospective, but scans, x-rays. And the

22 complete response, obviously, complete

23 disappearance of the disease. Partial response

24 was by the criteria of two dimensions and the

25 shrinkage of at least half in two dimensions.

00257

1 Standard criteria.

2 DR. BURKE: But this was a

3 retrospective study where you went back to the

4 charts. We all know about the paucity of

5 information and the error of information, and in

6 follow-up information not being in the charts.

7 How did you manage those issues in your

8 retrospective study?

9 DR. KERN: Well, the follow-up --

10 obviously, there are problems, and I'm not trying

11 to say there's not biases in it. We all know the

12 disadvantages of retrospective chart reviews.

13 The only thing I can tell you is what actually

14 was done, two oncologists reviewed the charts and

15 made their best decisions of what the responses

16 were, based on measurable criteria.

17 DR. BURKE: What did they do when they

18 disagreed? What did they do when information

19 wasn't there? What did they do make sure it was

20 accurate information? Do you want to continue

21 with this?

22 DR. KERN: No, I cannot say that I can

23 answer every question. I mean, I'm not an expert

24 in your field.

25 DR. FERGUSON: One brief, and then

00258

1 we'll let Dr. Bosanquet speak.

2 DR. WEISENTHAL: This is really

3 important, okay? You know, you talked about an

4 eight-by-eight table, and we only have one

5 point. You know, a study like this is never

6 going to be done again in the history of the

7 world. Never again are you going to have 330

8 patients treated with single agents. The

9 important thing about it was that this was an

10 honest blinded study in the following fashion,

11 and that is that the clinical results were

12 determined independent of knowledge of assay

13 results. The clinical results were reported to

14 the Department of Biomathematics at UCLA; they

15 were like the stakeholder in this, they had the

16 clinical assessments. Likewise, they received

17 independently from the laboratory the laboratory

18 assessments, and then the correlations were made

19 as stated.

20 DR. BURKE: Let's just deal with that

21 issue for a moment, because Dr. Kern sat down and

22 you stood up. So we've got the 64 bin table,

23 right.

24 DR. WEISENTHAL: Yes.

25 DR. BURKE: And the issue is, how do we

00259

1 know the utility of this test for a chemotherapy

2 in a disease?

3 DR. WEISENTHAL: Okay. You're making

4 the same criticism as Maury Markman has made.

5 What Maury Markman says is as follows, and he

6 says that he notes that there have been no

7 prospective randomized trials.

8 DR. BURKE: That's not my question.

9 DR. WEISENTHAL: Wait a second. But

10 it's the same thing. He says that even if some

11 day there were to be a prospective randomized

12 study, that that would only apply to that one

13 particular situation, and it would not tell you

14 anything about all the other situations.

15 You know, the sort of information that

16 you're asking for in the real world will not be

17 available for 20 to 50 years, if ever.

18 DR. BURKE: No, no. I understand the

19 mitigating circumstances. But the question is,

20 if you want this test to predict a particular

21 chemotherapeutic regimen in a particular disease,

22 then I want that information, and I don't have it

23 in your study.

24 DR. FERGUSON: Okay. I am going to ask

25 for Dr. Bosanquet to give his response, and then

00260

1 we are going to go ahead. We will try to have a

2 little more time at the end.

3 DR. WEISENTHAL: There's an extremely

4 important point. Basically he started out his

5 talk denigrating -- in other words, I made the

6 point that we have 35 studies consistently

7 showing the same thing, and he denigrated that,

8 and he said, oh, that's just due to consistent

9 bias. And I would like to prove to you that that

10 is not true.

11 DR. BURKE: Excuse me. I didn't. I

12 posed an alternative hypothesis. I said there

13 are two hypotheses for the 35 consistent studies.

14 Assuming that they are consistent, which we

15 have no evidence of, but assuming that they are

16 consistent, it could be due to two things. It

17 could be due to the fact that there is a

18 phenomenon there, or it could be due to

19 consistent study bias. And until you eliminate

20 the alternative hypothesis, you haven't done

21 science.

22 DR. WEISENTHAL: I would like to then

23 eliminate the alternative hypothesis and prove to

24 Dr. Burke that we have indeed done science in

25 this setting.

00261

1 DR. FERGUSON: Can you do that in the

2 final hour?

3 DR. WEISENTHAL: Okay.

4 DR. BOSANQUET: It was stated that

5 there was no independent validation of this. We

6 actually took the data that we published in the

7 Lancet the following year. This paper that we're

8 discussing is 1990; we published this work in

9 1991 in the Lancet, using CLL patients. And we

10 also looked at extreme drug resistance in these

11 patients.

12 We got this. We found 22 of 119

13 patients had extreme drug resistance in vitro,

14 and none of these patients responded. So here is

15 one of the things that we would speak to, which

16 was independent validation in a completely

17 different set of circumstances in a different

18 laboratory, and we find exactly the same thing,

19 extreme drug resistance, no response.

20 DR. FERGUSON: Using the same cutoff

21 points that were determined by Kern Weisenthal, I

22 guess.

23 DR. BURKE: Just to respond briefly to

24 that, two points. One, that is not a replication

25 of the 1990 paper.

00262

1 And number two, I do suspect that

2 that's exactly correct, that it is disease and

3 treatment specific. And that's exactly my

4 point. That's exactly my point. You have to

5 talk a specific disease, a specific treatment,

6 how does the test do? Not a conglomeration of

7 diseases and treatments together. That's exactly

8 my point. Thank you.

9 DR. FERGUSON: Thank you. Dr. Burken.

10 DR. BURKEN: Hi, everybody. Can

11 everybody hear me okay? I am Dr. Mitch Burken, a

12 medical officer with the coverage and analysis

13 group at HCFA. What I'd like to do is try to tie

14 together some of the presentations from earlier

15 in the day. There will be a lot of material in

16 here that you've seen before, but what I want to

17 do is try to wrap it up, and wrap it up in a way

18 that's consistent with Dr. Bagley's opening

19 remarks around 8:00 this morning, looking at the

20 broad sweep of the evidence, not spending as much

21 time on specific papers as much as trying to see

22 the bigger picture, cutting across many assay

23 formats.

24 Well, as I said, for the first several

25 minutes I want to be as conceptual as possible,

00263

1 and then we'll get more into the bulk of the

2 evidence itself.

3 But let's think about why we would

4 order any type of lab test, okay? A lab test has

5 its maximum clinical utility when the disease

6 probability is most uncertain. In other words,

7 we heard a little bit about the 50-50 point, and

8 the naive Bayes condition, and so forth, but let

9 me just try to restate that in a slightly

10 different way. If we have any type of lab test

11 we're looking at, and exploring questions of

12 clinical utility, okay? What's the probability

13 that the patient has a disease? If a patient is

14 very unlikely to have the disease, then what kind

15 of information do you have when you get a lab

16 test result? It's certainly not very very high.

17 And the reciprocal situation, where we

18 have a very very high probability or prevalence

19 of disease, and then the lab test doesn't really

20 add a whole lot, because we're almost positive

21 the patient has the disease. It's when we're

22 unsure of ourselves, and when we are at that

23 50-50 point, that's when a lab result can really

24 begin to add value.

25 Well, where are we in the Medicare

00264

1 program? The panel is charged with trying to

2 demarcate what's reasonable and necessary with

3 respect to human tumor assay systems, okay? And

4 we need to find a spot in this, or we need to

5 kind of bracket an area of this graph where lab

6 testing -- and again, we'll talk about the HTAS

7 in a second. But where is lab testing most

8 reasonable and necessary? Where does it add

9 information?

10 Let's talk about now about applying

11 this more generic situation to human tumor assay

12 systems. Well, let's talk about the

13 chemosensitivity scenario. We talked all day

14 about how this testing can assist clinicians in

15 selecting effective single agents. Okay?

16 Conversely, the chemoresistance scenario is where

17 this assay, or where these assay systems can

18 avoid ineffective agents. And what's our

19 reference here? The reference is data from

20 published clinical trials; maybe they're in peer

21 reviewed journals, maybe unfortunately they're

22 just in abstracts that are available at ASCO

23 meetings. But again, there is information from

24 clinical trials that does provide a backdrop

25 against which one can look at this lab testing

00265

1 and the added value thereof.

2 So let's go back to our graph again.

3 In vitro testing has the greatest clinical

4 utility when the presumed sensitivity or

5 resistance, because remember, they're really

6 reciprocal functions of each other, is most

7 uncertain. And going to our X axis here, what's

8 the real question? The question is, is tumor X

9 sensitive or resistant to drug Y in patient Z?

10 Therefore, we need to be specific as to what

11 questions we're posing.

12 Well, let's talk some more about

13 clinical utility, because as I said, what we want

14 to do is look at the broad sweep of the

15 evidence. We've talked about different outcome

16 measures today; we've talked about clinical

17 response; survival; we've talked even a little

18 bit about quality of life, although in the packet

19 of materials there is really not a lot of quality

20 of life literature to discuss, so we won't really

21 get into that.

22 And in looking at clinical responses

23 and outcome, we need to identify robust

24 two-by-two data using a valid gold standard, and

25 from there we can look at different performance

00266

1 measures. In this case, I think it's valuable to

2 look at positive predictive value as a marker for

3 chemosensitivity, and negative predictive value

4 as a marker for chemoresistance. One could also

5 talk about the sensitivity or specificity, but

6 let's try to keep it just a little bit simpler,

7 let's focus in on some concepts, and not worry so

8 much about the math. There are others in the

9 room who may be more expert, but let's try to

10 keep it simple, and not get too wrapped up in the

11 numbers, but let's try to get wrapped up until

12 the themes and the concepts.

13 As we discussed earlier this morning,

14 as Dr. Bagley emphasized, we have to insure that

15 the biases, such as insufficient sample sizes,

16 don't substantially influence our results.

17 Going back to our graph now, in 2-D

18 rather than 3-D, let me emphasize a point that

19 I've said already, but let me reemphasize it

20 again. That if you are at the extremes of this

21 utility function, okay, where the lab test,

22 whether it's human tumor assay system, or a serum

23 sodium, or whatever it is, or a chest x-ray, any

24 type of diagnostic test, if you're at the extreme

25 regions of this utility function, it doesn't

00267

1 really matter what your predictive values are.

2 If the predictive value is high, it can be offset

3 by the fact that you are in an extreme region of

4 the utility function where those numbers don't

5 really mean as much. Okay? And we'll talk more

6 about that. Okay?

7 Well, what kinds of measures do we need

8 to evaluate test accuracy? The ones that I

9 talked about below, predictive values, but there

10 are also sensitivity, specificity, area under the

11 ROC curve, but let's talk about something else.

12 What about some of the physician concerns. In

13 the lab, what kinds of things can a physician

14 tell his or her patient when a particular tumor

15 can or cannot be assayed by the lab?

16 On the right side of the slide are what

17 I would call the quality control measures, and

18 we're not really going to spend time on those in

19 this particular, at least my particular

20 presentation, but you've heard from FDA earlier.

21 So let's just kind of stay on the left-hand side

22 of the slide for now.

23 Well, just to get back to a couple of

24 those issues that really cut to the heart what

25 physician concerns might be, you know, is there

00268

1 sufficient assessability or evaluability of the

2 tumor cells from the submitted specimens? And we

3 found out that some of the earlier clonogenic

4 assays had very very -- had relatively low

5 assessability or evaluability rates. But let me

6 pose another question.

7 Even if a particular assay format is

8 evaluable 90 percent of the time, it still might

9 mean that 10 percent of the time the physician

10 speaks with his or her patient, and they really

11 just can't get an adequate result. So I think

12 that's an issue. You know, even if it's 90 or 95

13 percent, there is still some percentage of the

14 time when you don't have a result and you come

15 back. Okay?

16 There are other issues that come into

17 play. What's the effect of tumor heterogeneity?

18 We talked a little bit this morning about tumor

19 heterogeneity, but there's another type of tumor

20 heterogeneity as well, and that's the type that

21 can occur within the same patient, so that a

22 primary tumor and its metastatic lesions have

23 different in vitro patterns. And again, that is

24 a consideration to keep in mind when we're

25 thinking about this type of testing.

00269

1 So as I mentioned, in vitro results for

2 solid tumors from one site may not always provide

3 the same result as other sites. However, there

4 is a paper back in 1986, we'll get to it a little

5 bit later, I'll touch on it again, but it shows

6 that this problem may not be quite as pronounced

7 in clonic lesions such as leukemias.

8 Well, there is a whole host of in vitro

9 assay formats and we can, as I say, just kind of

10 go through those. But I think it's important to

11 mention at the end here that we will not in this

12 presentation be going through the clonogenic

13 literature. When we reviewed this material at

14 HCFA, we didn't feel there would be a lot of

15 value in discussing the older technologies that

16 had the lower evaluability rates, and just felt

17 it would be better to present it to the panel

18 this way.

19 Well, what kinds of criteria do we use

20 to evaluate the literature? Again, our goal here

21 is to be broad based. We looked at peer review

22 journals in English. There were some manuscripts

23 pending publication that were necessary for panel

24 discussion. There were a couple of the assay

25 formats that were relatively recently developed

00270

1 that we felt we would not be fair to the

2 requesters if we excluded some of the

3 manuscripts. There was, for example, Bartels

4 chemoresponse assay was FDA approved back in 1996

5 and package inserted in the summary of the safety

6 evaluation data was included as a way of

7 evaluating that. And we did not look at abstract

8 data.

9 Based on that, what types of additional

10 search methods? Well, we -- again, we looked at

11 articles submitted to HCFA prior to November 1st,

12 1999.

13 DR. FERGUSON: Mitch, can you speak

14 into the microphone?

15 DR. BURKEN: Right. When we started

16 reviewing this sometime, sometime before the

17 panel itself, we found that the Fruehauf and

18 Bosanquet review article from the 1993 PPO

19 updates, crystallized many of the issues. And

20 what we did, based on it, there were some summary

21 tables that were actually presented this morning,

22 where they looked at groups of studies. And I

23 again refer you to summary table seven and eight

24 from the 1993 PPO. And as a result, we really

25 focused our efforts, our literature efforts on

00271

1 the EDR, as well as also some of the other

2 thymidine assays, because there were some other

3 thymidine uridine incorporation assays pertinent

4 to bring to the panel. And then we also did a

5 lot of sampling of DiSC and MTT, using a MEDLINE

6 search, and we did not have any time limit on our

7 studies.

8 And when we went through and did our

9 literature search, then we had to figure out what

10 we would want to present to the panel. And since

11 clinical response was one of the outcomes we

12 looked at, as well as survival, we needed to have

13 confidence in the viability of our two-by-two

14 tables. So as a result, any study that lacked

15 the clinical criteria -- either the -- the

16 clinical criteria either had to be documented or

17 referenced, you know, for clinical response. We

18 only looked at adult patients.

19 And just for the record here, in the

20 rather extensive handout which has been provided

21 for this session, we do list the pediatric

22 studies that have not been summarized in this

23 presentation, but there is a notebook of all the

24 studies that are being presented in this

25 presentation, are available. I know it's kind of

00272

1 hard to read the whole notebook tonight, but as a

2 supplement to the materials you already received,

3 there are papers in here such that anybody that

4 has any questions about any of the bullet items

5 from this afternoon's presentation can go back to

6 this, as well as your other materials.

7 We included both prospective and

8 retrospective two-by-two data designs. The only

9 thing we did exclude for this, again, for this

10 panel presentation, were descriptive type studies

11 that didn't use any quantitative summaries.

12 There were some studies that went beyond

13 two-by-two tables, used regression analysis and

14 some other techniques, and those were included.

15 Now talking about all these studies,

16 you know, how can we present these studies to the

17 impact panel? Can we group them or pool them, or

18 do we need to go through them individually? It

19 was something that we really had to spend some

20 time thinking about. And we came to the very,

21 very strong conclusion that data from the

22 individual studies should not be pooled. The

23 reason being is that they're, the studies are so

24 heterogeneous, they use different cutoff points,

25 different tumor drug combinations, different

00273

1 clinical response criteria, that we just felt

2 very uncomfortable about doing a meta-analysis

3 for the purposes of presenting data to the panel,

4 okay? So therefore, each study must be presented

5 on its own merit, and I think that's a

6 fundamental approach to presenting data this

7 afternoon, and probably lengthens the

8 presentation a little bit, but we feel it's

9 important.

10 So now, let's just go through the

11 evidence. Let me just walk through the handout

12 with you. I don't -- there is a lot of bulk on

13 my slides, but again, it's in the handout and it

14 is really set up to be a reference guide to

15 trying to put it all together.

16 The assay formats I start with are not

17 based on cell death versus cell proliferation.

18 It's not done that way. I went from the assay

19 formats that we concentrated on, as in the EDR,

20 the DiSC and the MTT, where we really had the

21 most literature, and then towards the end I have

22 some of the other formats where there was a

23 little less literature that came up, based on the

24 criteria that were described in previous slides.

25 The Kern and Weisenthal article from

00274

1 1990 is a complex article that was referred over

2 to Dr. Burke and Dr. Handelsman for separate

3 review. Again, a central piece of evidence, but

4 highly complex.

5 But let me go through some of the other

6 articles that pertain to EDR as well as some of

7 the other thymidine uridine incorporation

8 formats.

9 Eltabakkh, '98, shows, you know, some

10 PPVs and NPVs. There were some confidence

11 intervals that are reported. As you can see, the

12 NPVs in this study is actually fairly low. Let

13 me start, as I said, rather than to go through

14 all the bullets, let me try to highlight what are

15 some of the themes. As I said, there were a

16 hundred new patients with ovarian cancers. We

17 find out in this study, all the patients were

18 recruited prior to chemotherapy, which is

19 important when we think about selection bias.

20 And we found 75 evaluable patients, so we went

21 from about a hundred down to 75, which is really

22 pretty good.

23 Fernandez-Trigo is a study, again, that

24 also has some case loss of about roughly 25

25 percent. But in this case, there was a very rare

00275

1 site cancer that was selected, so I would just

2 keep that in mind.

3 Moving on to some of the other

4 thymidine uridine formats, you know, I talked

5 about the CRA, the Bartels CRA, and it turns out

6 that there was a study by Elledge back in '95

7 that enumerates the findings of clinical trial.

8 And one little, kind of I suppose warning to the

9 panel when evaluating this paper, I mentioned

10 that NPV is a marker for chemoresistance and PPV

11 being a marker for chemosensitivity. Well, in

12 this particular article, not the article but the

13 package insert, and the summary safety and

14 evaluation data, it's flip-flopped, so you have

15 to be a little bit careful there. It turns out

16 you have to reverse that, so you have to really

17 be wide awake when you read these two-by-two

18 tables. And I mention that down here, that the

19 two-by-two table design differs from the other

20 studies presented, even differs from the Elledge

21 paper, which is -- the Elledge paper comes out

22 before the FDA submission data. And this was a

23 prospective blind enrollment of 60 relapsed

24 breast cancer patients. The interesting thing

25 about this particular assay format is that it was

00276

1 very specific for breast cancer and 5-FU.

2 Well, what about some of these earlier

3 five-day thymidine uptake assays? Sondak in '84

4 had a series of 142 patients with successful

5 assays out of a pool of 219, with 33 clinical

6 correlations. Quite a bit of case loss here,

7 even though, again, you know -- well, the numbers

8 are small, but the NPV, again, is high. You

9 know. But one, again, has to be concerned about

10 possible selection bias.

11 Sondak in '85, 819 mixed solid tumors,

12 again, if you use different cutoffs, you're going

13 to have different PPVs and NPVs.

14 Sanfilippo, in '81, there were several

15 studies on three-hour incubation, rather than the

16 five days, you know, and there were -- as I said,

17 you can see the numbers here. I think the

18 interesting thing about this particular study in

19 '81 was the use of subsets for high

20 proliferative and low proliferative non-Hodgkins

21 lymphoma cases.

22 And in '86, Sanfilippo, the same group,

23 went ahead and studied 169 patients with various

24 types of germ cell testicular tumors, but only 29

25 cases were available for clinical correlation.

00277

1 Again, we didn't really know how many people were

2 previously treated or untreated, and that could

3 inject some bias.

4 More three-hour uptake assays, two

5 studies by Silvestrini in 1985 and Daidone in

6 1985 from the same institution as Silvestrini.

7 Different tumor types.

8 Well, let's kind of move along, and we

9 finished up with the thymidine/uridine

10 incorporation assays, and let's move on to the

11 DiSC assay. And as I say, there were several

12 papers that were reviewed in the Cortazar and

13 Johnson article, which is the review article in

14 1999, which did a MEDLINE search, and targeted 12

15 studies, and four of those 12 studies were DiSC

16 assay approaches in solid tumors, three of the

17 four studies being small cell lung carcinoma, the

18 other study being a non-small cell lung

19 carcinoma. And I think, you know, in each of the

20 studies, the test groups did at least as well as

21 the control groups. The survival data was not

22 particularly convincing. In the Gazdar study,

23 the survival rates were similar; in the Wilbur

24 study, the survival rate comparisons were not

25 am. Again, that is survival rate of assay

00278

1 directed versus, you know, empiric therapy

2 groups. In both the Shaw and the Cortazar

3 articles, their survival rates were really not --

4 there was really not enough of a difference to

5 really hold much discussion.

6 But I think where we find a lot more

7 evidence, again, based on our structured review,

8 is looking at this, and hematologic tumors. And

9 we start with Dr. Weisenthal's study in '86 where

10 there is 70 cases. What we did was we subtracted

11 out the 29 cases of ALL. Again, it's just a

12 judgment case one makes as to how you want to

13 treat the pediatric tumors. I can tell you that

14 the pediatric performance table was just about

15 the same as for adults, and there weren't

16 significant differences, so I think what one

17 could --

18 DR. FERGUSON: Mitch, closer to the

19 microphone please, or maybe you should hold it.

20 DR. BURKEN: Yeah. As I said, one can

21 scan through some of the pediatric studies

22 quickly, and I think get a flavor for that. But

23 moving on, looking at the adult data, we had PPVs

24 and NPVs that were over 80 percent.

25 Dr. Bosanquet in 1999 had a fairly

00279

1 elegant study, and it was reviewed earlier

2 today. I will leave the details to the group.

3 Another study that was also discussed

4 earlier today was a study by Mason that did some

5 modeling. In this particular case, not only was

6 there some clinical response data and survival

7 data that was looked at, but there was some

8 modeling done using regressions, where if you --

9 I know that the print is a little small at the

10 bottom, but I just wanted to mention that if you

11 look at the life years gained per assay, the

12 modeling here said that if you had a simulated

13 50-year old with stage C chronic lymphocytic

14 leukemia, there would be a life years gain would

15 be about six months, and about three weeks if it

16 was a simulated age 70 stage AB female. Again,

17 these are all simulations that are based on the

18 assumptions in the regression modeling. But it's

19 a little bit more of a sophisticated approach, as

20 I said.

21 And continuing on, there's been, as I

22 said, a fair amount of work in hematology.

23 Tidefelt in 1989, with more than 90 percent of

24 the patients not being previously treated. There

25 is a complex predictive value studies in this

00280

1 paper with varying anthracycline concentrations

2 and different treatment regimens. But if you

3 flush out all 40, actually 40 out of 53 patients

4 were available for clinical correlations, and you

5 can see the PPVs and the NPVs.

6 To go on now, to continue more on the

7 hematologic DiSC studies, Bird in '86 was a small

8 study. That's the one I quoted earlier because

9 there was peripheral blood and bone marrow that

10 were used as two separate sites. And there

11 seemed to be reasonably good concordance between

12 in vitro testing, between peripheral blood and

13 bone marrow. But again, I caution, the sample

14 sizes are fairly small here. Bird in '88, again,

15 another small sample study.

16 I'm going back just some more. More

17 small sample studies. Dr. Bosanquet in 1983.

18 Dr. Beksac in '88. I think the interesting thing

19 about this study was it's kind of a mixed

20 retrospective prospective approach, so it was a

21 somewhat complicated study even though there were

22 only 16 patients.

23 Dr. Bosanquet's 1991 study that was

24 actually just up on the screen a few minutes ago,

25 showed 67 patients with CLL where there was a

00281

1 survival benefit. But just before we leave DiSC

2 and the hematologic applications, you know, there

3 were some articles that didn't have documented

4 clinical criteria, and you see down, Dr.

5 Bosanquet's article. Again, we certainly looked

6 at the survival data, but we didn't feel that the

7 clinical criteria were adequately specified in

8 this particular paper even though, as I said, it

9 showed some survival data.

10 And Kirkpatrick from 1990 was a paper,

11 again, all in the backup book here that has some

12 pediatric data.

13 Moving on to MTT, several articles did

14 not have documented clinical criteria so that for

15 the purposes of this panel presentation, we

16 didn't feel it would be useful to present

17 clinical response data. And it's interesting

18 that three -- we talked about the 12 studies from

19 Cortazar and Johnson. Three of them were Yamaue,

20 1991, 1992 and 1996, but unfortunately, none of

21 those articles had, you know, adequately

22 described criteria, and we just didn't feel we

23 could construct good enough two-by-two tables.

24 And the survival rates in these studies do not