STATISTICAL PRINCIPLES FOR CLINICAL TRIALS 臨床試験 - nifty

STATISTICAL PRINCIPLES FOR CLINICAL TRIALS
臨床試験のための統計的原則
I. INTRODUCTION
I. はじめに
1.1 Background and Purpose
1.1 背景と目的
The efficacy and safety of medicinal products should be
医薬品の有効性及び安全性は、1996年5月1日にICHが
demonstrated by clinical trials which follow the guidance
採択した「医薬品の臨床試験の実施に関する基準のガ
in 'Good Clinical Practice: Consolidated Guideline' (ICH
イドライン（ICH E6）」に基づき、日本における臨床
E6) adopted by the ICH, 1 May 1996. The role of
試験の実施基準として定められた「医薬品の臨床試験
statistics in clinical trial design and analysis is
の実施の基準に関する省令」(平成9年3月27日厚生省令
acknowledged as essential in that ICH guideline. The
第28号)に従って実施される臨床試験によって示され
proliferation of statistical research in the area of clinical
るべきである。上記のICHガイドライン及び日本にお
trials coupled with the critical role of clinical research in
ける関連通知において、臨床試験の計画と解析におけ
the drug approval process and health care in general
る統計学の役割は欠くことのできないものと認められ
necessitate a succinct document on statistical issues
ている。医薬品承認の過程及びヘルスケア一般におい
related to clinical trials. This guidance is written
て臨床研究が重要な役割を果たしていることにより、
primarily to attempt to harmonise the principles of
臨床試験領域での統計的研究が増大していることか
statistical methodology applied to clinical trials for
ら、臨床試験に関連した統計的な問題に関する簡潔な
marketing applications submitted in Europe, Japan and
ガイドラインが必要とされている。本ガイドラインは、
the United States.
日米EUの3極間で、主として、承認申請のための臨床
試験（治験）に適用する統計的方法論の原則の調和を
進めるために書かれている。
As a starting point, this guideline utilised the CPMP
本ガイドラインは出発点として、CPMP(欧州医薬品委
(Committee for Proprietary Medicinal Products) Note for
員会)ガイダンス文書'Biostatistical
Guidance entitled 'Biostatistical Methodology in Clinical
Methodology in Clinical Trials in Applications for
Trials in Applications for Marketing Authorisations for
Marketing Authorisations for Medicinal Products’
Medicinal Products' (December, 1994). It was also
「医薬品市販認可のための申請に用いる臨床試験にお
influenced by 'Guidelines on the Statistical Analysis of
ける生物統計方法論（1994 年 12 月）」を利用した。
Clinical Studies' (March, 1992) from the Japanese
また、厚生省(日本)の「臨床試験の統計解析に関する
Ministry of Health and Welfare and the U.S. Food and
ガイドライン（1992 年 3 月）」、U.S. Food and Drug
Drug Administration document entitled 'Guideline for
Administration (米国食品医薬品庁)の'Guideline for
the Format and Content of the Clinical and Statistical
the Format and Content of the Clinical and
Sections of a New Drug Application' (July, 1988). Some
Statistical Sections of a New Drug Application'「新
topics related to statistical principles and methodology
医薬品申請書臨床の部及び統計の部の書式と内容に関
are
ICH guidelines,
するガイドライン（1988 年 7 月）」も参考にした。統
particularly those listed below. The specific guidance that
計的原則と方法論に関連したトピックは、他の ICH ガ
contains related text will be identified in various sections
イドライン及びそれに基づいて作成された日本でのガ
of this document.
イドライン等(作成中のものを含む)、特に以下のもの
also
embedded
within
other
にも記載されている。関連する内容を含むガイドライ
ンを、本文の各節で引用する。
E1A: The Extent of Population Exposure to Assess
E1A:「致命的でない疾患に対し長期間の投与が想定さ
Clinical Safety
れる新医薬品の治験段階において安全性を評価するた
めに必要な症例数と投与期間について(平成7年5月24
日薬審第592号)」(以下引用に際しては、ICHのトピッ
クに合わせ「ICH E1A」という。)
E2A: Clinical Safety Data Management: Definitions and
E2A: 「治験中に得られる安全性情報の取り扱いにつ
Standards for Expedited Reporting
いて(平成7年3月20日薬審第227号)」(以下「ICH E2A」
という。)
E2B: Clinical Safety Data Management: Data Elements
E2B: 「個別症例安全性報告の伝達のためのデータ項
for Transmission of Individual Case Safety Reports
目(以下「ICH E2B」という。)」
E2C: Clinical Safety Data Management: Periodic Safety
E2C: 「市販医薬品に関する定期的安全性最新報告
Update Reports for Marketed
(PSUR)(平成9年3月27日薬安第32号)」 (以下「ICH
Drugs
E2C」という。)
E3: Structure and Content of Clinical Study Reports
E3: 「治験の総括報告書の構成と内容に関するガイド
ライン(平成8年5月1日薬審第335号)」(以下「ICH E3」
という。)
E4: Dose-Response Information to Support Drug
E4: 「新医薬品の承認に必要な用量-反応関係の検討の
Registration
ための指針(平成6年7月25日薬審第494号)」(以下「ICH
E4」という。)
E5: Ethnic Factors in the Acceptability of Foreign
E5: 「外国臨床データを受け入れる際に考慮すべき民
Clinical Data
族的要因についての指針(平成10年8月11日医薬審第
672号)」(以下「ICH E5」という。)
E6: Good Clinical Practice: Consolidated Guideline
E6: 「医薬品の臨床試験の実施の基準に関する省令(平
成9年3月27日厚生省令第28号)」、及び「医薬品の臨
床試験の実施の基準に関する省令の施行について(平
成9年3月27日薬発第430号薬務局長通知) 」(以下「ICH
E6」という。)
E7: Studies in Support of Special Populations: Geriatrics
E7: 「高齢者に使用される医薬品の臨床評価法に関す
るガイドライン(平成5年12月2日薬新薬発第104号) 」
(以下「ICH E7」という。)
E8: General Considerations for Clinical Trials
E8: 「臨床試験の一般指針(平成10年4月21日医薬審第
380号)」(以下「ICH E8」という。)
E10:Choice of Control Group in Clinical Trials
E10: 「臨床試験における対照群の選択」(以下「ICH
E10」という。)
M1: Standardisation of Medical Terminology for
M1:「規制目的のための医学用語の標準化」(以下「ICH
Regulatory Purposes
M1」という。)
M3: Non-Clinical Safety Studies for the Conduct of
M3: 「医薬品の臨床試験のための非臨床安全性試験の
Human Clinical Trials for Pharmaceuticals.
実施時期についてのガイドライン (平成10年11月13
日医薬審第1019号)」(以下「ICH M3」という。)
This guidance is intended to give direction to sponsors in
本ガイドラインは、臨床開発のあらゆる場面で、治験
the design, conduct, analysis, and evaluation of clinical
依頼者が被験薬に関する臨床試験の計画、実施、解析
trials of an investigational product in the context of its
及び評価を行う場合の方向づけを目的としている。ま
overall clinical development. The document will also
た、本ガイドラインは、主として開発の後期の相の臨
assist scientific experts charged with preparing
床試験について、承認申請書添付資料概要(以下「資料
application summaries or assessing evidence of efficacy
概要」という。)の作成を任せられた専門家、又は有効
and safety, principally from clinical trials in later phases
性及び安全性の証拠を評価することを任せられた専門
of development.
家にも役立つであろう。
1.2 Scope and Direction
1.2 適用範囲と方向性
The focus of this guidance is on statistical principles. It
本ガイドラインは、統計的原則に焦点を合わせており、
does not address the use of specific statistical procedures
個々の統計的な手続き又は手法の使い方を扱うもので
or methods. Specific procedural steps to ensure that
はない。原則が正しく適用されていることを保証する
principles are implemented properly are the
ために個々の手続きを積み重ねることは、治験依頼者
responsibility of the sponsor. Integration of data across
の責任である。複数の臨床試験にわたるデータの統合
clinical trials is discussed, but is not a primary focus of
も論じるが、本ガイドラインの主要な点ではない。デ
this guidance. Selected principles and procedures related
ータマネジメント又は臨床試験のモニタリング活動に
to data management or clinical trial monitoring activities
関連した原則と手続きのうち、他のICHガイドライン
are covered in other ICH guidelines and are not
で扱っているものはここでは触れない。
addressed here.
This guidance should be of interest to individuals from a
本ガイドラインは、科学の広い分野の人々から関心を
broad range of scientific disciplines. However, it is
持たれるべきものである。しかし、臨床試験に関連し
assumed that the actual responsibility for all statistical
たすべての統計的業務に対する実際の責任は、適切な
work associated with clinical trials will lie with an
資格と経験のある統計家が果たすことが前提となって
appropriately qualified and experienced statistician, as
おり、そのことは ICH E6にも示されている。試験統
indicated in ICH E6. The role and
計家（用語集参照）の役割と責任は、医薬品開発を支
responsibility of the trial statistician (see Glossary), in
える臨床試験に統計的原則が適切に適用されているこ
collaboration with other clinical trial professionals, is to
とを、他の臨床試験専門家と共同して保証することで
ensure that statistical principles are applied
ある。したがって、試験統計家は本ガイドラインに明
appropriately in clinical trials supporting drug
確に述べられた原則を実行するために十分な理論又は
development. Thus, the trial statistician should have a
実地の教育及び経験を併せ持つべきである。
combination of education/training and experience
sufficient to implement the principles articulated in this
guidance.
For each clinical trial contributing to a marketing
承認申請に含まれる個々の臨床試験の計画と実施に関
application, all important details of its design and
するすべての重要事項についての詳細及び臨床試験に
conduct and the principal features of its proposed
おいて使用する統計解析の主要な特徴は、試験開始前
statistical analysis should be clearly specified in a
に作成された治験実施計画書（プロトコル）に明記す
protocol written before the trial begins. The extent to
べきである。治験実施計画書中の手続きの遵守状況及
which the procedures in the protocol are followed and the
び主要解析の事前での計画状況がどの程度であったか
primary analysis is planned a priori will contribute to the
が試験の最終結果と結論の信頼性に寄与することにな
degree of confidence in the final results and conclusions
る。治験実施計画書及びその作成後の改訂は、試験統
of the trial. The protocol and subsequent amendments
計家を含む責任者全員から承認を受けるべきである。
should be approved by the responsible personnel,
試験統計家は、治験実施計画書及びそのいかなる修正
including the trial statistician. The trial statistician
もが、すべての重要な統計的問題を、必要ならば専門
should ensure that the protocol and any amendments
用語を用いて、明確かつ正確に扱っていることを保証
cover all relevant statistical issues clearly and
すべきである。
accurately, using technical terminology as appropriate.
The principles outlined in this guidance are primarily
本ガイドラインにまとめられている原則は、主として、
relevant to clinical trials conducted in the later phases of
有効性の検証的試験として実施されることの多い開発
development, many of which are confirmatory trials of
の後期の相の臨床試験に当てはまるものである。検証
efficacy. In addition to efficacy, confirmatory trials may
的試験では、主要変数として有効性以外にも安全性に
have as their primary variable a safety variable (e.g. an
関する変数（例えば、有害事象、臨床検査変数又は心
adverse event, a clinical laboratory variable or an
電図の読み取り結果）、薬力学変数又は薬物動態変数
electrocardiographic measure), a pharmacodynamic or a
（検証的な生物学的同等性試験の場合）を採用しても
pharmacokinetic variable (as in a confirmatory
よい。更に、検証的な知見の一部は複数の試験を統合
bioequivalence trial). Furthermore, some confirmatory
したデータから導かれることもあり、このような場合
findings may be derived from data integrated across
でも本ガイドライン中の原則の一部は適用できる。最
trials, and selected principles in this guidance are
後に、医薬品開発の初期の相は主として探索的な性質
applicable in this situation. Finally, although the early
の臨床試験からなるが、統計的原則はこれらの臨床試
phases of drug development consist mainly of clinical
験にも当てはまる。したがって、本ガイドラインの趣
trials that are exploratory in nature, statistical principles
旨は、可能な限り臨床開発のすべての相において適用
are also relevant to these clinical trials. Hence, the
されるべきである。
substance of this document should be applied as far as
possible to all phases of clinical development.
Many of the principles delineated in this guidance deal
本ガイドラインに述べられている原則の多くは、偏り
with minimising bias (see Glossary) and maximising
（用語集参照）を最小にし、精度を最大にすることを
precision. As used in this guidance, the term 'bias'
目的としている。本ガイドラインでは、「偏り（バイ
describes the systematic tendency of any factors
アス）」という用語を、「臨床試験の計画、実施、解
associated with the design, conduct, analysis and
析及び結果の解釈と関連した因子の影響により、試
interpretation of the results of clinical trials to make the
験治療の効果（用語集参照）の推定値と真の値に系統
estimate of a treatment effect (see Glossary) deviate from
的な差が生じること」という意味で用いる。偏りを低
its true value. It is important to identify potential sources
く抑えるためには、偏りの潜在的な原因を可能な限り
of bias as completely as possible so that attempts to limit
明らかにすることが重要である。偏りの存在により、
such bias may be made. The presence of bias may
臨床試験から妥当性のある結論を導くことが困難にな
seriously compromise the ability to draw valid
るおそれがある。
conclusions from clinical trials.
Some sources of bias arise from the design of the trial, for
偏りの原因が治験実施計画にある場合がある。例えば
example an assignment of treatments such that subjects
試験治療の割付が不適切で、リスクの低い患者が一方
at lower risk are systematically assigned to one
の試験治療に系統的に割付けられる場合がそうであ
treatment. Other sources of bias arise during the conduct
る。偏りの原因は、臨床試験の実施や解析の際に生じ
and analysis of a clinical trial. For example, protocol
ることもある。例えば、治験実施計画書違反及び個々
violations and exclusion of subjects from analysis based
の被験者の結果を知った後で被験者を解析から除外す
upon knowledge of subject outcomes are possible sources
ることは偏りの原因となり得るものであり、試験治療
of bias that may affect the accurate assessment of the
効果の正確な評価に影響を及ぼすおそれがある。偏り
treatment effect. Because bias can occur in subtle or
が起こる理由は明確にはとらえられない場合があり、
unknown ways and its effect is not measurable directly,
偏りの影響は直接測定できないため、試験の結果と主
it is important to evaluate the robustness of the results
要な結論の安定性を評価することは重要である。安定
and primary conclusions of the trial. Robustness is a
性（ロバストネス）とは、データ、仮定及び解析方針
concept that refers to the sensitivity of the overall
についての様々な制限に対して全体の結論がどの程度
conclusions to various limitations of the data,
変わり易いかに関連した概念である。安定性は、異な
assumptions, and analytic approaches to data analysis.
る仮定又は異なる解析方針に基づいて解析を行った場
Robustness implies that the treatment effect and
合でも、試験治療の効果と試験の主要な結論は大きく
primary conclusions of the trial are not substantially
影響されないことを意味している。試験治療の効果と
affected when analyses are carried out based on
試験治療の比較における不確実さに関する統計的指標
alternative assumptions or analytic approaches. The
の解釈には、p値、信頼区間又は推測に偏りが与えうる
interpretation of statistical measures of uncertainty of
影響を考慮に入れるべきである。
the treatment effect and treatment comparisons
should involve consideration of the potential contribution
of bias to the p-value, confidence interval, or inference.
Because the predominant approaches to the design and
臨床試験の計画と解析においては、頻度論的立場から
analysis of clinical trials have been based on frequentist
の統計手法に基づく方法が主流になっていることか
statistical methods, the guidance largely refers to the use
ら、本ガイドラインは仮説検定や信頼区間を議論する
of frequentist methods (see Glossary) when discussing
場合、主として頻度論的手法（用語集参照）を念頭に
hypothesis testing and/or confidence intervals. This
置いている。これは、他の方法が適切でないと主張す
should not be taken to imply that other approaches are
るものではない。ベイズ流の手法や他の手法の使用も、
not appropriate: the use of Bayesian (see Glossary) and
それらの使用の理由が明らかであり、異なる仮定の下
other approaches may be considered when the reasons for
でも結果として得られる結論が十分に安定している場
their use are clear and when the resulting conclusions
合には検討することができる。
are sufficiently robust.
II. CONSIDERATIONS FOR OVERALL CLINICAL
II. 臨床開発全体を通して考慮すべきこと
DEVELOPMENT
2.1 Trial Context
2.1 試験の性格
2.1.1 Development Plan
2.1.1 開発計画
The broad aim of the process of clinical development of a
新しい医薬品を臨床開発する過程全体を通しての目的
new drug is to find out whether there is a dose range and
は、臨床上の利益との兼ね合いでリスクが許容できる
schedule at which the drug can be shown to be
限度において、医薬品が安全性と有効性を兼ね備えて
simultaneously safe and effective, to the extent that the
いる用量範囲と使用スケジュールが存在するかどうか
risk-benefit relationship is acceptable. The particular
を知ることにある。医薬品から利益を受ける対象とな
subjects who may benefit from the drug, and the specific
る集団を明確にし、医薬品の適応症も定める必要があ
indications for its use, also need to be defined.
る。
Satisfying these broad aims usually requires an ordered
これら臨床開発全体としての目的を達成するために
programme of clinical trials, each with its own specific
は、通常それぞれが特定の目的を持った臨床試験の順
objectives (see ICH E8). This should be specified in a
序だったプログラムが必要である（ICH E8 参照）。
clinical plan, or a series of plans, with appropriate
このプログラムは、判断をくだす適切な時点と、知識
decision points and flexibility to allow modification as
の蓄積に伴う修正を認める柔軟性を持つ、一つ又は一
knowledge accumulates. A marketing application should
連の臨床開発計画として明示されるべきである。承認
clearly describe the main content of such plans, and the
申請では、臨床開発計画の趣旨及び個々の試験がどの
contribution made by each trial. Interpretation and
ように寄与するかについて明確に述べるべきである。
assessment of the evidence from the total programme of
試験プログラム全体から得られる証拠の解釈と評価
trials involves synthesis of the evidence from the
は、個々の試験からの証拠を総合する過程を伴うもの
individual trials (see Section 7.2). This is facilitated by
である（7.2節参照）。これは、医学用語の辞書、主要
ensuring that common standards are adopted for a
な測定の定義と時期、治験実施計画書からの逸脱の取
number of features of the trials such as dictionaries of
り扱い、といった試験のいくつかの特徴について、共
medical terms, definition and timing of the main
通の基準を採用することで容易になる。医学上の問題
measurements, handling of protocol deviations and so on.
に複数の試験によって対応する場合、統計的要約、総
A statistical summary, overview or meta-analysis (see
括(overview)又はメタアナリシス（用語集参照）が有
Glossary) may be informative when medical questions
益であろう。可能であれば、このように対応すること
are addressed in more than one trial. Where possible this
は臨床開発計画の中で明確にすべきであり、そうする
should be envisaged in the plan so that the relevant trials
ことによって関連する試験が把握され、試験間で共通
are clearly identified and any necessary common features
にすべき計画上の特徴を前もって特定することができ
of their designs are specified in advance. Other major
る。共通の臨床開発計画によって実施される複数の試
statistical issues (if any) that are expected to affect a
験に影響することが予測されるその他の主要な統計的
number of trials in a common plan should be addressed
問題があるならば、臨床開発計画中に述べるべきであ
in that plan.
る。
2.1.2 Confirmatory Trial
2.1.2 検証的試験
A confirmatory trial is an adequately controlled trial in
検証的試験とは、事前に定められた仮説を評価するた
which the hypotheses are stated in advance and
めの、適切に計画・実施された比較試験である。一般
evaluated. As a rule, confirmatory trials are necessary to
に、検証的試験は有効性又は安全性の確固たる証拠を
provide firm evidence of efficacy or safety. In such trials
提示するために必要である。検証的試験では、関心の
the key hypothesis of interest follows directly from the
ある重要な仮説は試験の主要な目的から直接導かれ、
trial’s primary objective, is always pre-defined, and is the
常に事前に設定され、試験完了後に検証される。検証
hypothesis that is subsequently tested when the trial is
的試験では、関心のある試験治療に由来する効果の大
complete. In a confirmatory trial it is equally important
きさを十分な精度で推定すること、及びそれらの効果
to estimate with due precision the size of the effects
を臨床的な意義と関連付けることも同様に重要であ
attributable to the treatment of interest and to relate
る。
these effects to their clinical significance.
Confirmatory trials are intended to provide firm evidence
検証的試験は、承認に関わる主張の裏付けとなる確固
in support of claims and hence adherence to protocols and
たる証拠を提示することを目的としているため、治験
standard operating procedures is particularly important;
実施計画書と標準業務手順書に従うことは特に重要で
unavoidable changes should be explained and
ある。このため、やむをえない変更については説明を
documented, and their effect examined. A justification of
文書として残すべきであり、変更の影響も検討してお
the design of each such trial, and of other important
くべきである。個々の検証的試験の計画及び予定して
statistical aspects such as the principal features of the
いる解析の主要な特徴等の重要な統計的側面につい
planned analysis, should be set out in the protocol. Each
て、それを正当とする理由は、治験実施計画書に記述
trial should address only a limited number of questions.
すべきである。個々の試験は限られた少数の問題のみ
を扱うべきである。
Firm evidence in support of claims requires that the
承認に関わる主張の裏付けとなる確固たる証拠として
results of the confirmatory trials demonstrate that the
は、被験薬が臨床上の利益を持つことを、検証的試験
investigational product under test has clinical benefits.
の結果で示す必要がある。したがって検証的試験は、
The confirmatory trials should therefore be sufficient to
有効性又は安全性の主張に関連した個々の主要な臨床
answer each key clinical question relevant to the efficacy
的問題に、明確かつ確定的な回答を与えるのに十分な
or safety claim clearly and definitively. In addition, it is
ものであるべきである。更に、意図している患者集団
important that the basis for generalisation (see Glossary)
への一般化（用語集参照）の根拠が理解でき、説明で
to the intended patient population is understood and
きることは重要である。このことによっても、必要な
explained; this may also influence the number and type
施設（センター）の数と型式（例えば、専門医か一般
(e.g. specialist or general practitioner) of centres and/or
臨床医か）及び試験の数と型式が影響を受けるであろ
trials needed. The results of the confirmatory trial(s)
う。検証的試験の結果は、条件によらず安定している
should be robust. In some circumstances the weight of
べきである。場合によっては、一つの検証的試験から
evidence from a single confirmatory trial may be
の証拠だけで十分であることもありうる。
sufficient.
2.1.3 Exploratory Trial
2.1.3 探索的試験
The rationale and design of confirmatory trials nearly
検証的試験の正当性の根拠と計画は、事前に実施され
always rests on earlier clinical work carried out in a
た一連の探索的研究の臨床成績にほとんど常に依存し
series of exploratory studies. Like all clinical trials, these
ている。すべての臨床試験と同様に、これらの探索的
exploratory studies should have clear and precise
研究も明確で精密な目的を持つべきである。しかし、
objectives. However, in contrast to confirmatory trials,
検証的試験とは対照的に探索的試験の目的は、必ず
their objectives may not always lead to simple tests of
しも事前に設定した仮説の単純な検定に帰着するわけ
pre-defined hypotheses. In addition, exploratory trials
ではない。更に、探索的試験では、結果の蓄積に応じ
may sometimes require a more flexible approach to
て変更が可能となるような、より柔軟な方法を必要と
design so that changes can be made in response to
するときもある。その解析は探索的なデータ解析を伴
accumulating results. Their analysis may entail data
ってもよい。すなわち仮説検定を行う場合、その仮説
exploration; tests of hypothesis may be carried out, but
の選択は得られたデータに基づいて行うことが有り得
the choice of hypothesis may be data dependent. Such
る。このような試験は、有効性を証明する証拠全体に
trials cannot be the basis of the formal proof of efficacy,
は貢献するといってよいが、その正式な証明の根拠に
although they may contribute to the total body of
はならない。
relevant evidence.
Any individual trial may have both confirmatory and
個々のいかなる試験も、検証的側面と探索的側面の両
exploratory aspects. For example, in most confirmatory
方を持つものであるといってよい。例えば、検証的試
trials the data are also subjected to exploratory analyses
験であってもほとんどの場合、結果を説明する又は裏
which serve as a basis for explaining or supporting their
付けする根拠を与えるため、また後の研究の仮説を引
findings and for suggesting further hypotheses for later
き出すためにデータの探索的解析も行われる。治験実
research. The protocol should make a clear distinction
施計画書には、各試験について検証的な証明として用
between the aspects of a trial which will be used for
いられる側面と、探索的解析のためにデータを提供す
confirmatory proof and the aspects which will provide
る側面とを、明確に区別しておくべきである。
data for exploratory analysis.
2.2 Scope of Trial
2.2 試験で扱う範囲
2.2.1 Population
2.2.1 対象集団
In the earlier phases of drug development the choice of
医薬品開発の初期の相では、臨床試験の被験者の選択
subjects for a clinical trial may be heavily influenced by
は、関心のある特定の臨床効果が観察できる見込みを
the wish to maximise the chance of observing specific
最大にしたいという要求に大きく影響されるであろ
clinical effects of interest, and hence they may come from
う。したがって、被験者は、最終的にその薬が使用さ
a very narrow subgroup of the total patient population
れる全患者集団のうちごく限られた部分集団から選ば
for which the drug may eventually be indicated. However
れる可能性がある。しかし、検証的試験が実施される
by the time the confirmatory trials are undertaken, the
時までには、被験者を目標集団に十分近いものにして
subjects in the trials should more closely mirror the
おくべきである。したがって、検証的試験では一般に、
target population. Hence, in these trials it is generally
試験治療の効果を精度よく推定するために十分な均質
helpful to relax the inclusion and exclusion criteria as
性を維持する一方、目標集団の範囲内でできるだけ広
much as possible within the target population, while
い患者を対象とするような選択基準及び除外基準を定
maintaining sufficient homogeneity to permit precise
めることが有益である。単一の臨床試験の被験者をも
estimation of treatment effects. No individual clinical
って、将来の使用者を完全に代表させることは期待で
trial can be expected to be totally representative of future
きない。なぜならば、地理的な位置、試験が実施され
users, because of the possible influences of geographical
た時期、個々の治験責任医師や診療所で行われる実地
location, the time when it is conducted, the medical
の医療内容などが影響する可能性があるからである。
practices of the particular investigator(s) and clinics, and
しかし、これらの要因の影響は可能な限り小さくすべ
so on. However the influence of such factors should be
きであり、その上で試験の結果を解釈する際にその影
reduced wherever possible, and subsequently discussed
響を議論しておくべきである。
during the interpretation of the trial results.
2.2.2 Primary and Secondary Variables
2.2.2 主要変数と副次変数
The primary variable (‘target’ variable, primary
主要変数（「目標」変数、主要評価項目ともいう）は、
endpoint) should be the variable capable of providing the
試験の主要な目的に直結した臨床的に最も適切で説得
most clinically relevant and convincing evidence directly
力のある証拠を与えうる変数であるべきである。主要
related to the primary objective of the trial. There should
変数は通常ただ一つにすべきである。ほとんどの場合、
generally be only one primary variable. This will usually
検証的試験の主要な目的は有効性に関して科学的に説
be an efficacy variable, because the primary objective of
得力のある証拠を提示することにあるため、主要変数
most confirmatory trials is to provide strong scientific
は、通常有効性に関する変数となる。安全性及び忍容
evidence regarding efficacy. Safety/tolerability may
性は常に重要な問題であり、ときには主要変数となり
sometimes be the primary variable, and will always be an
うるものである。生活の質（QOL）及び保健経済に関
important consideration. Measurements relating to
する測定値も、主要変数となる可能性がある。主要変
quality of life and health economics are further potential
数の選択には、開発に関連した研究領域で一般に認め
primary variables. The selection of the primary variable
られている規範と基準を反映させるべきである。先行
should reflect the accepted norms and standards in the
研究又は公表論文で使用された実績のある、信頼性及
relevant field of research. The use of a reliable and
び妥当性の確立した変数を使用することが薦められ
validated variable with which experience has been
る。主要変数は、選択基準と除外基準によって規定さ
gained either in earlier studies or in published literature
れる患者集団において、臨床的に適切で重要な治療上
is recommended. There should be sufficient evidence that
の利益に関する妥当で信頼のおける指標であることが
the primary variable can provide a valid and reliable
十分に証拠づけられているべきである。被験者数の見
measure of some clinically relevant and important
積もりに用いる変数は、通常は主要変数であるべきで
treatment benefit in the patient population described by
ある（3.5 節参照）。
the inclusion and exclusion criteria. The primary
variable should generally be the one used when
estimating the sample size (see section 3.5).
In many cases, the approach to assessing subject outcome
多くの場合、被験者の結果を評価する方法はあまり単
may not be straightforward and should be carefully
純ではないであろうから、主要変
defined. For example, it is inadequate to specify
数は慎重に定義する必要がある。例えば、主要変数を
mortality as a primary variable without further
明確に定めないでただ死亡とするのでは不十分であ
clarification; mortality may be assessed by comparing
る。死亡の評価にも、定められた時点における生存割
proportions alive at fixed points in time, or by comparing
合の比較、又は特定期間の生存時間分布全体の比較が
overall distributions of survival times over a specified
ある。別の例として、事象が繰り返し起こる場合が挙
interval. Another common example is a recurring event;
げられる。この場合試験治療の効果の指標としては、
the measure of treatment effect may again be a simple
単純な二分類の変数（一定期間に一度でも起きたかど
dichotomous variable (any occurrence during a specified
うか）、初発までの時間、生起率（単位観察時間あた
interval), time to first occurrence, rate of occurrence
りの事象数）などがある。慢性疾患のための治療の研
(events per time units of observation), etc. The
究で、経時的に機能の状態を評価する場合も、主要変
assessment of functional status over time in studying
数の選択に関して別の問題が生じる。可能な対処法と
treatment for chronic disease presents other challenges
しては、観察期間の最初と最後になされた評価の比較、
in selection of the primary variable. There are many
全期間を通じたすべての評価から求めた傾きの比較、
possible approaches, such as comparisons of the
定めた閾値を超える若しくは下回る被験者の割合の比
assessments done at the beginning and end of the
較、又は繰り返し測定データのための方法に基づいた
interval of observation, comparisons of slopes calculated
比較といった多くのものがある。事後的に定義するこ
from all assessments throughout the interval,
とから生じる多重性の問題を回避するために、統計解
comparisons of the proportions of subjects exceeding or
析で用いる主要変数の正確な定義を治験実施計画書に
declining beyond a specified threshold, or comparisons
明記することは重要である。更に、選択した特定の主
based on methods for repeated measures data. To avoid
要変数の臨床的な適切さ及びその測定手順の妥当性
multiplicity concerns arising from post hoc definitions, it
は、通常治験実施計画書に記載し正当性を示すことが
is critical to specify in the protocol the precise definition
必要である。
of the primary variable as it will be used in the statistical
analysis. In addition, the clinical relevance of the specific
primary variable selected and the validity of the
associated measurement procedures will generally need
to be addressed and justified in the protocol.
The primary variable should be specified in the protocol,
主要変数は、その変数を選択した理由とともに治験実
along with the rationale for its selection. Redefinition of
施計画書に明記すべきである。割付が明らかになった
the primary variable after unblinding will almost always
後に主要変数を定義し直すことは、ほとんどの場合許
be unacceptable, since the biases this introduces are
容できない。なぜならば、それによって生じる偏りの
difficult to assess. When the clinical effect defined by the
評価が難しいからである。主要な目的として定めた臨
primary objective is to be measured in more than one
床効果を二通り以上の方法で測定しようとしている場
way, the protocol should identify one of the
合、臨床的な適切さ、重要性、客観性、その他関連す
measurements as the primary variable on the basis of
る特徴に基づいて、治験実施計画書にはできる限り一
clinical relevance, importance, objectivity, and/or other
つの測定値を主要変数として指定すべきである。
relevant characteristics, whenever such selection is
feasible.
Secondary variables are either supportive measurements
副次変数は、主要な目的に関連した補足的な測定値又
related to the primary objective or measurements of
は副次目的に関連した効果の測定値のどちらかであ
effects related to the secondary objectives. Their
る。治験実施計画書において副次変数を事前に定義し、
pre-definition in the protocol is also important, as well as
試験結果の解釈の際に副次変数が果たす相対的な重要
an explanation of their relative importance and roles in
性と役割を説明することも重要である。副次変数の数
interpretation of trial results. The number of secondary
は、試験で答えるべき限られた少数の問題と関連して
variables should be limited and should be related to the
制限すべきである。
limited number of questions to be answered in the trial.
2.2.3 Composite Variables
2.2.3 合成変数
If a single primary variable cannot be selected from
主要な目的に関する複数の測定値の中から、主要変数
multiple measurements associated with the primary
として一つを選ぶことができない場合、それに代わる
objective, another useful strategy is to integrate or
有用な戦略は、事前に定められたアルゴリズムを用い
combine the multiple measurements into a single or
て、複数の測定値を単一の変数、つまり「合成」変数
'composite' variable, using a pre-defined algorithm.
に統合又は結合することである。実際、主要変数を
Indeed, the primary variable sometimes arises as a
複数の臨床測定値の組み合わせとして定める例がみら
combination of multiple clinical measurements (e.g. the
れる（例えば、関節炎、精神障害その他で用いられる
rating scales used in arthritis, psychiatric disorders and
評価尺度）。この方法は、多重性の問題に対処するに
elsewhere). This approach addresses the multiplicity
当たり、第一種の過誤の調整を必要としない。複数の
problem without requiring adjustment to the type I error.
測定値を結合する方法は、治験実施計画書に明記すべ
The method of combining the multiple measurements
きであり、得られた尺度について、臨床的な利益の大
should be specified in the protocol, and an interpretation
きさを適切にあらわしているかどうかという観点から
of the resulting scale should be provided in terms of the
説明がなされるべきである。合成変数が主要変数とし
size of a clinically relevant benefit. When a composite
て用いられる場合、合成変数の成分に臨床的意義があ
variable is used as a primary variable, the components of
りかつ妥当性が示されているならば、その成分を個別
this variable may sometimes be analysed separately,
に解析することがある。評価尺度を主要変数として用
where clinically meaningful and validated. When a
いる場合、内容的妥当性（用語集参照）、評価者内信
rating scale is used as a primary variable, it is especially
頼性（用語集参照）及び評価者間信頼性（用語集参照）
important to address such factors as content validity (see
並びに疾病重症度の変化を検出するための反応性とい
Glossary), inter- and intra-rater reliability (see Glossary)
った特性を説明しておくことは、特に重要である。
and responsiveness for detecting changes in the severity
of disease.
2.2.4 Global Assessment Variables
2.2.4 総合評価変数
In some cases, 'global assessment' variables (see
場合によっては、試験治療の全体的な安全性、全体的
Glossary) are developed to measure the overall safety,
な有効性、全体的な有用性を測定するために、「総合
overall efficacy, and/or overall usefulness of a treatment.
評価」変数（用語集参照）が作られることもある。こ
This type of variable integrates objective variables and
の種の変数は、被験者の状態又はその変化についての
the investigator’s overall impression about the state or
客観的変数と治験責任(分担)医師の全体的な印象を統
change in the state of the subject, and is usually a scale of
合した、通常は順序カテゴリの評価尺度となる。全体
ordered categorical ratings. Global assessments of overall
的な有効性の総合評価は、神経科、精神科など一部の
efficacy are well established in some therapeutic areas,
治療領域で確立されている。
such as neurology and psychiatry.
Global assessment variables generally have a subjective
総合評価変数は、一般的に主観的な要素を併せ持って
component. When a global assessment variable is used as
いる。総合評価変数を主要変数又
a primary or secondary variable, fuller details of the scale
は副次変数として使用する際は、尺度に関する以下の
should be included in the protocol with respect to:
事項についての詳細を治験実施計画書に記述する必要
がある:
1) the relevance of the scale to the primary objective of
1) 試験の主要な目的に対する尺度の適切さ
the trial;
2) the basis for the validity and reliability of the scale;
2) 尺度の妥当性及び信頼性の根拠
3) how to utilise the data collected on an individual
3) 個々の被験者を尺度のカテゴリの内の一つに判定
subject to assign him/her to a unique category of the
するための、その被験者から集められたデータの利用
scale;
法
4) how to assign subjects with missing data to a unique
4) 欠測データがある被験者を尺度のカテゴリの内の
category of the scale, or otherwise evaluate them.
一つに判定するための方法又は被験者を評価する方法
If objective variables are considered by the investigator
総合評価を実施する際、治験責任医師によって考慮さ
when making a global assessment, then those objective
れている客観的な変数があれば、それらの客観的変数
variables should be considered as additional primary, or
は主要変数又は少なくとも重要な副次変数として追加
at least important secondary, variables.
することを検討すべきである。
Global assessment of usefulness integrates components
有用性の総合評価は、医薬品の使用による利益とリス
of both benefit and risk and reflects the decision making
クの両方の要素を統合するもので、医薬品の使用を決
process of the treating physician, who must weigh benefit
定するために、その使用による利益とリスクの比較検
and risk in making product use decisions. A problem with
討をしなければならない臨床医の意思決定過程を反映
global usefulness variables is that their use could in some
したものである。有用性の総合評価尺度の問題点の一
cases lead to the result of two products being declared
つは、二つの試験治療が有益な効果と有害な作用に関
equivalent despite having very different profiles of
する全く異なるプロファイルを持っているにもかかわ
beneficial and adverse effects. For example, judging the
らず、有用性の総合評価を用いることで同等と示す結
global usefulness of a treatment as equivalent or superior
果を導く場合があることである。例えば、試験治療の
to an alternative may mask the fact that it has little or no
有用性総合評価が他方の試験治療に対して同等であ
efficacy but fewer adverse effects. Therefore it is not
る、又は優っているとの結果であっても、それは有害
advisable to use a global usefulness variable as a primary
な作用が少ないだけで、試験治療には有効性がほとん
variable. If global usefulness is specified as primary, it is
ど又は全くないという事実を意味しているおそれがあ
important to consider specific efficacy and safety
る。したがって、有用性の総合評価を主要変数とする
outcomes separately as additional primary variables.
ことは薦められない。有用性の総合評価を主要変数と
する場合には、有用性の総合評価に用いた特定の有効
性及び安全性の結果を主要変数として別個に追加して
考慮することが重要である。
2.2.5 Multiple Primary Variables
2.2.5 複数の主要変数
It may sometimes be desirable to use more than one
治療の及ぼす効果の範囲を一つ一つの変数（又は変数
primary variable, each of which (or a subset of which)
の組)が包含するような、複数の主要変数を用いること
could be sufficient to cover the range of effects of the
が望ましい場合がある。この型式の証拠を解釈する方
therapies. The planned manner of interpretation of this
法は、事前に慎重に説明しておくべきである。試験の
type of evidence should be carefully spelled out. It should
目的を達成するために必要と考えられるのは、複数の
be clear whether an impact on any of the variables, some
変数のいずれかに対する効果なのか、ある一定数の変
minimum number of them, or all of them, would be
数に対する効果なのか、又は変数すべてに対する効果
considered necessary to achieve the trial objectives. The
なのかを明らかにすべきである。一つ又は複数の主要
primary hypothesis or hypotheses and parameters of
な仮説と関心のあるパラメータ（例えば、平均、割合、
interest (e.g. mean, percentage, distribution) should be
分布の状況）は、指定した複数の主要変数に関して明
clearly stated with respect to the primary variables
確に決められているべきであり、また統計的推測の方
identified, and the approach to statistical inference
法も明確に述べるべきである。このとき、多重性の問
described. The effect on the type I error should be
題が起こり得るため、治験実施計画書では、第一種の
explained because of the potential for multiplicity
過誤に与える影響を説明し（5.6 節参照）、それを制
problems (see Section 5.6); the method of controlling type
御する方法を定めるべきである。第一種の過誤への影
I error should be given in the protocol. The extent of
響を評価するためには、提案した複数の主要変数間の
intercorrelation among the proposed primary variables
内部相関の程度を考慮する必要があろう。指定した主
may be considered in evaluating the impact on type I
要変数のすべてにおいて有効性を示すことが試験の目
error. If the purpose of the trial is to demonstrate effects
的である場合、第一種の過誤を調整する必要はないが、
on all of the designated primary variables, then there is
第二種の過誤及び必要な被験者数への影響は慎重に考
no need for adjustment of the type I error, but the impact
慮すべきである。
on type II error and sample size should be carefully
considered.
2.2.6 Surrogate Variables
2.2.6 代替変数
When direct assessment of the clinical benefit to the
実際の臨床的有効性を観察することにより被験者の臨
subject through observing actual clinical efficacy is not
床的な利益を直接評価することが実際的でない場合に
practical, indirect criteria (surrogate variables - see
は、間接的な基準（代替変数、用語集参照）を考慮す
Glossary) may be considered. Commonly accepted
ることができる。代替変数は、それが臨床的利益の信
surrogate variables are used in a number of indications
頼できる予測因子であると信じられている多くの領域
where they are believed to be reliable predictors of
において、一般的に容認されたものとして用いられて
clinical benefit. There are two principal concerns with the
いる。代替変数を提案し導入する際には、大きな問題
introduction of any proposed surrogate variable. First, it
が二つある。一つめは、代替変数が関心のある臨床結
may not be a true predictor of the clinical outcome of
果の真の予測因子ではないおそれがあることである。
interest. For example it may measure treatment activity
例えば、代替変数はある特定の薬理作用と関連した試
associated with one specific pharmacological mechanism,
験治療の作用を測定しているだけで、肯定的であろう
but may not provide full information on the range of
と否定的であろうと、試験治療の作用範囲と最終的な
actions and ultimate effects of the treatment, whether
効果の範囲に関する完全な情報はもたらさないおそれ
positive or negative. There have been many instances
がある。提案された代替変数においては非常に有効で
where treatments showing a highly positive effect on a
あることを示している試験治療が、結局は被験者にと
proposed surrogate have ultimately been shown to be
って臨床上有害であると示された例は数多い。それと
detrimental to the subjects' clinical outcome; conversely,
は逆に、提案された代替変数には何の影響もないが、
there are cases of treatments conferring clinical benefit
臨床的には利益をもたらした試験治療の例もある。二
without measurable impact on
つめは、提案された代替変数が、有害作用に対して直
proposed surrogates. Secondly, proposed surrogate
接比較考量することのできる臨床的利益の定量的な指
variables may not yield a quantitative measure of clinical
標とは必ずしもならないことである。代替変数の妥当
benefit that can be weighed directly against adverse
性を確認する統計的基準は提案されているが、その基
effects. Statistical criteria for validating
準を使用した経験は限られている。実際には、代替性
surrogate variables have been proposed but the
の証拠の強さは、(i) 代替変数と臨床的結果の関連の生
experience with their use is relatively limited. In
物学的合理性、(ii) 代替変数が臨床的結果の予後を予
practice, the strength of the evidence for surrogacy
測する上で有益であると疫学研究によって示されてい
depends upon (i) the biological plausibility of the
ること及び (iii) 試験治療の代替変数に対する効果が
relationship, (ii) the demonstration in epidemiological
臨床的効果と対応しているという臨床試験の結果、に
studies of the prognostic value of the surrogate
依存している。ある医薬品における臨床的変数と代替
for the clinical outcome and (iii) evidence from clinical
変数との関係は、同じ疾患の治療に用いる医薬品であ
trials that treatment effects on the surrogate correspond
っても、作用機序の異なる医薬品について当てはまる
to effects on the clinical outcome. Relationships between
とは限らない。
clinical and surrogate variables for one product do not
necessarily apply to a product with a different mode of
action for treating the same disease.
2.2.7 Categorised Variables
2.2.7 カテゴリ化した変数
Dichotomisation or other categorisation of continuous or
連続変数若しくは順序変数の二分化又はその他のカテ
ordinal variables may sometimes be desirable. Criteria of
ゴリ化を行うことが、望ましい場合もある。「成功」
'success' and 'response' are common examples of
又は「反応あり」という基準は二分化のよくある例で
dichotomies which require precise specification in terms
あり、例えば、連続変数では（基準となる時点での値
of, for example, a minimum percentage improvement
から）何パーセント以上の改善か、又は順序評価尺
(relative to baseline) in a continuous variable, or a
度であれば、ある閾値レベル（例えば「良好」）以上
ranking categorised as at or above some threshold level
に分類されるか、という観点から二分化の方法を正確
(e.g., 'good') on an ordinal rating scale. The reduction of
に記述することが要求される。拡張期血圧が 90mmHg
diastolic blood pressure below 90mmHg is a common
を下回るまで低下するかどうかを基準にして二分化す
dichotomisation. Categorisations are most useful when
ることは、その例である。カテゴリ化が最も有用なの
they have clear clinical relevance. The criteria for
は、それが明確な臨床的意味を持つ場合である。試験
categorisation should be pre-defined and specified in the
の結果を知った後でカテゴリ化の基準を決めることは
protocol, as knowledge of trial results could easily bias
偏りを生じやすいため、カテゴリ化の判定基準は前も
the choice of such criteria. Because categorisation
って定め、治験実施計画書中に明記すべきである。カ
normally implies a loss of information, a consequence will
テゴリ化は一般に情報の損失となるため、結果として
be a loss of power in the analysis; this should be
解析での検出力の低下を招く。被験者数の計算では、
accounted for in the sample size calculation.
この点を考慮すべきである。
2.3 Design Techniques to Avoid Bias
2.3 偏りを回避するための計画上の技法
The most important design techniques for avoiding bias
臨床試験で偏りを回避するための最も重要な計画上の
in clinical trials are blinding and randomisation, and
技法は、盲検化及びランダム化（無作為化）であり、
these should be normal features of most controlled
これらは承認申請に利用することを目的とするほとん
clinical trials intended to be included in a marketing
どの比較臨床試験で標準的に採用すべきである。その
application. Most such trials follow a double-blind
ような試験は、ほとんどの場合二重盲検法によるもの
approach in which treatments are pre-packed in
であり、治験薬を適切なランダム割付表に従って事前
accordance with a suitable randomisation schedule, and
に箱詰めし、試験の実施に関係する者全てが個々の被
supplied to the trial centre(s) labelled only with the
験者に割付けられた試験治療を、その試験治療のコー
subject number and the treatment period so that no one
ドのみであっても、知ることのないよう被験者番号と
involved in the conduct of the trial is aware of the specific
治療期間のみを表示して治験実施施設に供給するとい
treatment allocated to any particular subject, not even as
うものである。2.3.1 節全体と 2.3.2 節のほとんどでは、
a code letter. This approach will be assumed in Section
上記の方法を前提とし、2.3.2 節最後にその他の方法を
2.3.1 and most of Section 2.3.2, exceptions being
述べる。
considered at the end.
Bias can also be reduced at the design stage by specifying
試験の実施に伴って発生する十分な解析が損なわれる
procedures in the protocol aimed at minimising any
可能性のあるあらゆる変則的な事例について、予想さ
anticipated irregularities in trial conduct that might
れるもの全て(様々な種類の治験実施計画書違反、試験
impair a satisfactory analysis, including various types of
治療の中止及び欠測値など)の発生頻度を最小にする
protocol violations, withdrawals and missing values. The
ための手段を治験実施計画書に明記することにより、
protocol should consider ways both to reduce the
計画段階で偏りを減じることも可能である。治験実施
frequency of such problems, and also to handle the
計画書では、そういった問題の発生頻度を減じる方法
problems that do occur in the analysis of data.
と、データ解析において起こる問題の対処法の両方を
考慮すべきである。
2.3.1 Blinding
2.3.1 盲検化
Blinding or masking is intended to limit the occurrence of
盲検化又はマスク化は、臨床試験の実施及び解釈にお
conscious and unconscious bias in the conduct and
ける意識的、無意識的な偏りの発生を制限するために
interpretation of a clinical trial arising from the influence
行われる。割付けられた試験治療を知ることが、被験
which the knowledge of treatment may have on the
者の募集と割付、それに引き続き行われるケア、被験
recruitment and allocation of subjects, their subsequent
者の試験治療に対する態度、評価項目の評価、試験治
care, the attitudes of subjects to the treatments, the
療を中止した被験者の取り扱い、解析からのデータの
assessment of end-points, the handling of withdrawals,
除外等に影響を及ぼし、偏りを発生させるためである。
the exclusion of data from analysis, and so on. The
盲検化の本質的な目的は、割付けられた試験治療を知
essential aim is to prevent identification of the
ることにより偏りが生じる可能性のある間は、試験治
treatments until all such opportunities for bias have
療が同定されるのを防ぐことである。
passed.
A double-blind trial is one in which neither the subject
二重盲検試験は、被験者並びに被験者の試験治療又は
nor any of the investigator or sponsor staff who are
臨床評価を行う治験責任(分担)医
involved in the treatment or clinical evaluation of the
師及び治験依頼者のスタッフのすべての者が被験者に
subjects are aware of the treatment received. This
割付けられた試験治療を知ることができないものであ
includes anyone determining subject eligibility,
る。これには、被験者の適格性の判断、評価項目の評
evaluating endpoints, or assessing compliance with the
価及び治験実施計画書遵守状況の評価に関わる者すべ
protocol. This level of blinding is maintained throughout
てが含まれる。この盲検化のレベルは治験実施中維持
the conduct of the trial, and only when the data are
され、許容できる質にまでデータクリーニングが行わ
cleaned to an acceptable level of quality will appropriate
れてはじめて、適切な関係者に割付が明らかにされる。
personnel be unblinded. If any of the sponsor staff who
被験者に対し試験治療又は臨床評価を行っていない治
are not involved in the treatment or clinical evaluation of
験依頼者のスタッフ（例えば、生体試料分析者、監査
the subjects are required to be unblinded to the
担当者、重大な有害事象報告にたずさわる者）に試験
treatment code (e.g. bioanalytical scientists, auditors,
治療コードの割付を明らかにする必要が生じた場合の
those involved in serious adverse event reporting), the
ために、治験依頼者は、試験治療コードの不適切な開
sponsor should have adequate standard operating
示を防ぐための標準業務手順書を持つべきである。単
procedures to guard against inappropriate dissemination
盲検試験とは、治験責任医師若しくはそのスタッフの
of treatment codes. In a single-blind trial the investigator
どちらか又は両方が割付けられた試験治療を知ってい
and/or his staff are aware of the treatment but the
るが、被験者が知ることはない試験であり、逆の場合
subject is not, or vice versa. In an open-label trial the
もありうる。非盲検試験とは、どの試験治療が割付け
identity of treatment is known to all. The double-blind
られたかが全員に知られている試験である。この中で
trial is the optimal approach. This requires that the
二重盲検試験が最適な方法である。二重盲検試験にお
treatments to be applied during the trial cannot be
いては、試験中実施される複数の試験治療が、実施前
distinguished (appearance, taste, etc.) either before or
にも実施中にも区別できないこと（外見、味、その他）
during administration, and that the blind is maintained
及び全試験期間中盲検を適切に維持することが必要で
appropriately during the whole trial.
ある。
Difficulties in achieving the double-blind ideal can arise:
二重盲検という理想の実現が困難な場合も生じる可能
the treatments may be of a completely different nature,
性がある。比較する試験治療が、例えば外科的療法と
for example, surgery and drug therapy; two drugs may
薬物療法のように、完全に異なる性質を持つ場合であ
have different formulations and, although they could be
る。治験薬剤型が異なる場合もある。カプセルの使用
made indistinguishable by the use of capsules, changing
によって識別不能にできたとしても、剤型の変更が薬
the formulation might also change the pharmacokinetic
物動態と薬力学の性質のどちらか又は両方を変える可
and/or pharmacodynamic properties and hence require
能性もあるため、この場合は剤型間の生物学的同等性
that bioequivalence of the formulations be established;
を立証する必要があろう。二つの治験薬の毎日の服薬
the daily pattern of administration of two treatments
パターンが異なる場合もある。こういった状況で二重
may differ. One way of achieving double-blind conditions
盲検の状態を達成する一つの方法は、「ダブルダミー」
under these circumstances is to use a 'double-dummy'
（用語集参照）技法を用いることである。しかし、こ
(see Glossary) technique. This technique may sometimes
の技法は、被験者の服薬に対する動機づけを低下させ
force an administration scheme that is sufficiently
服薬遵守に悪影響を与えるような通常はありえない服
unusual to influence adversely the motivation and
薬計画を強いる場合がある。また、例えば二重盲検を
compliance of the subjects. Ethical difficulties may also
保つために偽の手術を必要とする場合のように、倫理
interfere with its use when, for example, it entails
的な問題により、ダブルダミーの使用が妨げられるこ
dummy operative procedures. Nevertheless, extensive
とがある。それでも、これらの問題を克服するための
efforts should be made to overcome these difficulties.
十分な努力をすべきである。
The double-blind nature of some clinical trials may be
明らかに試験治療に由来する効果により、一部の臨床
partially compromised by apparent treatment induced
試験では部分的に二重盲検が破れるおそれがある。そ
effects. In such cases, blinding may be improved by
のような場合、ある種の検査結果（例えば、臨床検査
blinding investigators and relevant sponsor staff to
項目の一部）を治験責任(分担)医師及び関連する治験
certain test results (e.g. selected clinical laboratory
依頼者スタッフが知ることのないようにすることによ
measures). Similar approaches (see below) to minimising
り、盲検性が改善されるであろう。試験治療特有の又
bias in open-label trials should be considered in trials
は特定できるような効果により、個々の被験者の割付
where unique or specific treatment effects may lead to
が明らかになるおそれのある試験では、後述する非盲
unblinding individual patients.
検試験で偏りを最小にする手法と同様の手法を検討す
べきである。
If a double-blind trial is not feasible, then the
二重盲検試験が実施できない場合は、次に単盲検を選
single-blind option should be considered. In some cases
択することを検討すべきである。非盲検試験のみが実
only an open-label trial is practically or ethically
際的に又は倫理的に可能な場合もある。単盲検試験と
possible. Single-blind and open-label trials provide
非盲検試験は二重盲検試験に比べて柔軟性があるが、
additional flexibility, but it is particularly important that
次の試験治療が何であるか治験責任(分担)医師が知
the investigator's knowledge of the next treatment
ることにより、被験者の登録の決定に影響を与えない
should not influence the decision to enter the subject; this
ようにすることが重要である。登録の決定は、常に割
decision should precede knowledge of the randomised
付けられる試験治療を知る以前になされるべきであ
treatment. For these trials, consideration should be given
る。これら単盲検又は非盲検試験では、試験治療のラ
to the use of a centralized randomisation method, such as
ンダム割付を管理するために、電話による割付のよう
telephone randomisation, to administer the assignment
な一ヵ所でランダム化を行う方法を検討すべきであ
of randomized treatment. In addition, clinical
る。更に、臨床評価は被験者の処置を行っておらず、
assessments should be made by medical staff who are not
試験治療が盲検化された状態の医療スタッフによって
involved in treating the subjects and who remain blind to
なされるべきである。単盲検試験又は非盲検試験では、
treatment. In single-blind or open-label trials every effort
様々な既知の偏りの原因を最小にするために、あらゆ
should be made to minimise the various known sources of
る努力をなすべきであり、主要変数は可能な限り客観
bias and primary variables should be as objective as
的にすべきである。採用した盲検化の程度について、
possible. The reasons for the degree of blinding adopted
それを選択した理由は、偏りを減じるために用いた他
should be explained in the protocol, together with steps
の手段による処置とともに、治験実施計画書に述べる
taken to minimise bias by other means. For example, the
べきである。例えば治験依頼者は、解析のためにデー
sponsor should have adequate standard operating
タベースを公開する前のデータベースクリーニングの
procedures to ensure that access to the treatment code is
段階では、試験治療コードへのアクセスが適切に制限
appropriately restricted during the process of cleaning
されることを保証するために十分な標準業務手順書を
the database prior to its release for analysis.
用意すべきである。
Breaking the blind (for a single subject) should be
（被験者個人について）割付を明らかにすることは、
considered only when knowledge of the treatment
割付けられた試験治療を知ることが被験者のケアのた
assignment is deemed essential by the subject’s physician
め本質的であると主治医が考える場合に限り検討すべ
for the subject’s care. Any intentional or unintentional
きである。故意であろうとなかろうと、割付を明らか
breaking of the blind should be reported and explained at
にした場合は、開示の理由にかかわらず、治験終了時
the end of the trial, irrespective of the reason for its
に報告し、説明すべきである。割付けられた試験治療
occurrence. The procedure and timing for revealing the
を明らかにするための手続きと時期は記録しておくべ
treatment assignments should be documented.
きである。
In this document, the blind review (see Glossary) of data
本ガイドラインでは、試験完了（最後の被験者の最終
refers to the checking of data during the period of time
観察）から割付を明らかにするまでの間にデータをチ
between trial completion (the last observation on the last
ェックすることを、データの盲検下レヴュー（用語集
subject) and the breaking of the blind.
参照）と呼ぶ。
2.3.2 Randomisation
2.3.2 ランダム化(無作為化)
Randomisation introduces a deliberate element of chance
ランダム化は、臨床試験において、被験者への試験治
into the assignment of treatments to subjects in a clinical
療の割付に意図的に偶然の要素を取り入れており、後
trial. During subsequent analysis of the trial data, it
に試験データを解析する際に、試験治療の効果に関す
provides a sound statistical basis for the quantitative
る証拠の定量的な評価のための正しい統計的根拠を与
evaluation of the evidence relating to treatment effects.
える。また、ランダム化は予後因子が既知であるか未
It also tends to produce treatment groups in which the
知であるかにかかわらず、予後因子の分布が類似した
distributions of prognostic factors, known and unknown,
試験治療グループを作るために役立つものである。ラ
are similar. In combination with blinding, randomisation
ンダム化は、盲検化と組み合わせることで、試験治療
helps to avoid possible bias in the selection and allocation
の割付が予見可能な場合に、被験者の選択的割付によ
of subjects arising from the predictability of treatment
って生じる可能性のある偏りを回避することに役立
assignments.
つものである。
The randomisation schedule of a clinical trial documents
臨床試験のランダム割付表は、被験者への試験治療の
the random allocation of treatments to subjects. In the
ランダム割付を記録するものである。最も単純な状況
simplest situation it is a sequential list of treatments (or
では、割付表は一連の試験治療のリスト又は（クロス
treatment sequences in a crossover trial) or
オーバー試験では、試験治療の順序のリスト）被験者
corresponding codes by subject number. The logistics of
番号に対応するコードである。スクリーニングの段
some trials, such as those with a screening phase, may
階がある試験のような、一部の試験の実施手順は問題
make matters more complicated, but the unique
をより複雑なものにすることがあるが、被験者に対す
pre-planned assignment of treatment, or treatment
る試験治療又は試験治療の順についての事前に予定さ
sequence, to subject should be clear. Different trial
れた唯一通りの割付は明確にすべきである。試験計画
designs will require different procedures for generating
が異なれば、ランダム割付表の作成のためにも異なる
randomisation schedules. The randomisation schedule
手順が必要である。（必要な場合に備えて）ランダム
should be reproducible (if the need arises).
割付表は、再現可能なものとすべきである。
Although unrestricted randomisation is an acceptable
制約をおかないランダム化は受け入れ可能な方法では
approach, some advantages can generally be gained by
あるが、一般にはブロック別にランダム化する方が有
randomising subjects in blocks. This helps to increase the
利な点がある。このブロック別ランダム化の方法は、
comparability of the treatment groups, particularly when
試験治療グループ間の比較可能性を高めることに役立
subject characteristics may change over time, as a result,
つものである。これは例えば募集方針の変更により、
for example, of changes in recruitment policy. It also
結果として被験者の特徴が時間的に変化する可能性が
provides a better guarantee that the treatment groups
あるような場合である。また、ブロック別ランダム化
will be of nearly equal size. In crossover trials it provides
は試験治療グループの被験者数をほぼ等しくすること
the means of obtaining balanced designs with their
を保証する。クロスオーバー試験では、ブロック別ラ
greater efficiency and easier interpretation. Care should
ンダム化を採用することが、効率が高く解釈しやすい
be taken to choose block lengths that are sufficiently
バランスのとれた計画を得る手段となる。ブロックの
short to limit possible imbalance, but that are long
長さは、被験者数のバランスが崩れる可能性を制限で
enough to avoid predictability towards the end of the
きる程度に短く、しかしブロックの終りの方での予見
sequence in a block. Investigators and other relevant
可能性を回避できる程度に十分長くするように注意す
staff should generally be blind to the block length; the
べきである。治験責任医師及びその他関係するスタッ
use of two or more block lengths, randomly selected for
フは、一般にブロックの長さを知ることがないように
each block, can achieve the same purpose. (Theoretically,
すべきである。二つ以上の異なるブロックの長さを用
in a double-blind trial predictability does not matter, but
いて、それぞれのブロックで異なる長さをランダムに
the pharmacological effects of drugs may provide the
選ぶことにより、同じ目的を達成できる。（理論的に
opportunity for intelligent guesswork.)
は、二重盲検試験において予見可能性は問題とならな
い。しかし、治験薬の薬理作用により、知的な当て推
量をする機会を提供してしまうおそれがある。）
In multicentre trials (see Glossary) the randomisation
多施設共同治験（用語集参照）では、ランダム化の手
procedures should be organised centrally. It is advisable
続きは中央で準備すべきである。施設で層別する、又
to have a separate random scheme for each centre, i.e. to
は各施設にブロック全体を数個割付けるといった、各
stratify by centre or to allocate several whole blocks to
施設毎にランダム化する計画を立てることが推奨され
each centre. More generally, stratification by important
る。より一般的には、基準となる時点で測定された重
prognostic factors measured at baseline (e.g. severity of
要な予後因子（例えば、疾患の重症度、年齢、性、な
disease, age, sex, etc.) may sometimes be valuable in
ど）で層別することは、層内でバランスのとれた割付
order to promote balanced allocation within strata; this
を促進するために有益といってよい。これは小規模な
has greater potential benefit in small trials. The use of
試験では大きな潜在的利益となる。三つ又は四つ以上
more than two or three stratification factors is rarely
の層別因子を用いる必要はほとんどない。そのうえ、
necessary, is less successful at achieving balance and is
例数のバランスがとりにくく、実施が煩雑になる。後
logistically troublesome. The use of a dynamic allocation
述する動的割付法を用いることは、多数の層別因子の
procedure (see below) may help to achieve balance across
バランスを同時にとることに役立つであろう。そのた
a number of stratification factors simultaneously
めにはその後の試験手続きをこの型式の方法に応じて
provided the rest of the trial procedures can be adjusted
調整する必要がある。層別ランダム化に用いた因子は、
to accommodate an approach of this type. Factors on
後の解析の際にも考慮すべきである。
which randomization has been stratified should be
accounted for later in the analysis.
The next subject to be randomised into a trial should
ランダム割付けされ試験に組み入れられる被験者は、
always receive the treatment corresponding to the next
常に適切なランダム割付表（層別ランダム化の場合に
free number in the appropriate randomisation schedule
は、各層ごと）の中の、まだ割付けられていない最初
(in the respective stratum, if randomization is stratified).
の番号に対応した試験治療を受けるべきである。次の
The appropriate number and associated treatment for
被験者の該当番号とそれに対応する試験治療は、被
the next subject should only be allocated when entry of
験者が試験のランダム割付段階へ登録されたことを確
that subject to the randomised part of the trial has been
認してから、はじめて割付けるべきである。予見可能
confirmed. Details of the randomisation that facilitate
性を高めてしまうようなランダム化に関する内容の詳
predictability (e.g. block length) should not be contained
細（例えば、ブロックの長さ）については治験実施計
in the trial protocol. The randomisation schedule itself
画書に含めるべきではない。ランダム割付表自体は、
should be filed securely by the sponsor or an independent
全試験期間において、盲検性が適切に維持されること
party in a manner that ensures that blindness is properly
を保証する方法で、治験依頼者又は第三者によって安
maintained throughout the trial. Access to the
全に保管されなければならない。どの被験者について
randomisation schedule during the trial should take into
も緊急の場合には割付を明らかにしなければならない
account the possibility that, in an emergency, the blind
ことがあるため、試験中ランダム割付表にアクセスす
may have to be broken for any subject. The procedure to
る可能性を考慮すべきである。その際に従うべき手順、
be followed, the necessary documentation, and the
必要な証拠資料及び割付を明らかにした後の被験者の
subsequent treatment and assessment of the subject
治療と評価の仕方はすべて治験実施計画書に記述すべ
should all be described in the protocol.
きである。
Dynamic allocation is an alternative procedure in which
動的割付は割付方法の選択肢の一つであり、被験者へ
the allocation of treatment to a subject is influenced by
の試験治療の割付に、現在までに割付けられている試
the current balance of allocated treatments and, in a
験治療の例数バランスを反映させ、層別された試験で
stratified trial, by the stratum to which the subject
は被験者が属する層での試験治療の例数バランスを反
belongs and the balance within that stratum.
映させるものである。決定論的な動的割付法は避ける
Deterministic dynamic allocation procedures should be
べきであり、試験治療の一つ一つの割付にランダム化
avoided and an appropriate element of randomisation
の要素が適切に取り入れられるべきである。動的割付
should be incorporated for each treatment allocation.
法を取り入れた試験の二重盲検性を保つためには、あ
Every effort should be made to retain the double-blind
らゆる努力がなされるべきである。例えば、通常は電
status of the trial. For example, knowledge of the
話登録を用いることにより動的割付を管理している中
treatment code may be restricted to a central trial office
央治験事務局以外は試験治療コードを知ることができ
from where the dynamic allocation is controlled,
ないようにする方法が採られるであろう。この方法は
generally through telephone contact. This in turn
結果として、(電話登録時に)適格基準の追加確認を可
permits additional checks of eligibility criteria and
能とし、試験への登録を確定する。これらの特徴はあ
establishes entry into the trial, features that can be
る種の多施設共同治験では有益である。二重盲検試験
valuable in certain types of multicentre trial. The usual
において通常用いられる治験薬を事前に箱詰めし、ラ
system of pre-packing and labelling drug supplies for
ベルを貼り供給するシステムを、動的割付でも用いる
double-blind trials can then be followed, but the order of
ことができる。ただし、使用する順番は番号順ではな
their use is no longer sequential. It is desirable to use
くなる。中央治験事務局の職員に、試験治療コードを
appropriate computer algorithms to keep personnel at
知られることがないようにするため、適切なコンピュ
the central trial office blind to the treatment code. The
ータアルゴリズムを用いることが望ましい。動的割付
complexity of the logistics and potential impact on the
を検討する場合には、実施手順の複雑さと解析に与え
analysis should be carefully evaluated when considering
る潜在的な影響を慎重に評価すべきである。
dynamic allocation.
III. TRIAL DESIGN CONSIDERATIONS
III. 試験計画上で考慮すべきこと
3.1 Design Configuration
3.1 試験計画の構成
3.1.1 Parallel Group Design
3.1.1 並行群間比較計画
The most common clinical trial design for confirmatory
検証的試験で最もよく用いられる試験計画は並行群間
trials is the parallel group design in which subjects are
比較計画である。被験者はそれぞれ異なる試験治療が
randomised to one of two or more arms, each arm being
割り当てられている二つ以上の群の一つにランダムに
allocated a different treatment. These treatments will
割付けられる。これらの試験治療は、一つ又は複数の
include the investigational product at one or more doses,
用量の被験薬を含み、プラセボ若しくは実薬など一つ
and one or more control treatments, such as placebo
又は複数の対照治療も含むものであろう。並行群間比
and/or an active comparator. The assumptions
較計画の基礎となる仮定は、他のほとんどの計画にお
underlying this design are less complex than for most
ける仮定に比べ複雑ではない。しかし、他の計画と同
other designs. However, as with other designs, there may
様に試験の解析と解釈を複雑にする別な側面（例えば、
be additional features of the trial that complicate the
共変量、経時的繰り返し測定、要因間の交互作用、治
analysis and interpretation (e.g. covariates, repeated
験実施計画書違反、脱落（用語集参照）及び試験治療
measurements over time, interactions between design
の中止）が存在することがある。
factors, protocol violations, dropouts (see Glossary) and
withdrawals).
3.1.2 Crossover Design
3.1.2 クロスオーバー計画
In the crossover design, each subject is randomised to a
クロスオーバー計画では、各被験者は二つ又はそれ以
sequence of two or more treatments, and hence acts as
上の試験治療を行う順序をランダムに割付けられる。
his own control for treatment comparisons. This simple
したがって被験者自身を対照として試験治療比較が行
manoeuvre is attractive primarily because it reduces the
われることになる。この単純な計画上の工夫は、主と
number of subjects and usually the number of
して、それを行うことが、定められた検出力の達成
assessments needed to achieve a specific power,
に必要な被験者数と通常は評価件数を劇的に減少させ
sometimes to a marked extent. In the simplest 2×2
ることがあるという理由から魅力的である。最も単純
crossover design each subject receives each of two
な 2×2 クロスオーバー計画では、各被験者は、多くの
treatments in randomised order in two successive
場合ウオッシュアウト期間をはさんで連続した二つの
treatment periods, often separated by a washout period.
試験治療期間に、二種の試験治療のそれぞれをランダ
The most common extension of this entails comparing
ム化された順番で受ける。この計画の最も一般的な拡
n(>2) treatments in n periods, each subject receiving all n
張では、n（>2）個の試験治療を n 期間で各被験者が
treatments. Numerous variations exist, such as designs
すべて受け、試験治療を比較することになる。計画の
in which each subject receives a subset of n(>2)
拡張には、各被験者が n（>2）個の試験治療の一部だ
treatments, or ones in which treatments are repeated
けを受ける、同じ試験治療を繰り返し行う、といった
within a subject.
様々な変法が存在する。
Crossover designs have a number of problems that can
クロスオーバー計画は、結果の妥当性を損なうおそれ
invalidate their results. The chief difficulty concerns
のある多くの問題を抱えている。最大の問題は持ち越
carryover, that is, the residual influence of treatments in
し効果に関するものである。持ち越し効果とは、先行
subsequent treatment periods. In an additive model the
する試験治療が次に続く試験治療期間において及ぼす
effect of unequal carryover will be to bias direct
残存効果である。加法モデルでは、不均等な持ち越し
treatment comparisons. In the 2×2 design the carryover
効果が試験治療の直接の比較を偏らせる。2×2 クロス
effect cannot be statistically distinguished from the
オーバー計画では、持ち越し効果が試験治療と時期間
interaction between treatment and period and the test
の交互作用から統計的に分離できず、どちらの効果の
for either of these effects lacks power because the
検定も対応する対比が「被験者間」であるため検出力
corresponding contrast is 'between subject'. This problem
に欠ける。この問題は、試験治療の数が多い、より高
is less acute in higher order designs, but cannot be
次の計画ではそれほど深刻なものではないが、完全に
entirely dismissed.
排除することはできない。
When the crossover design is used it is therefore
したがって、クロスオーバー計画を用いる場合は、持
important to avoid carryover. This is best done by
ち越し効果を回避することが重要である。このために
selective and careful use of the design on the basis of
は、疾患の領域及び新しい薬剤の双方に関する十分な
adequate knowledge of both the disease area and the new
知識に基づき、クロスオーバー計画を選択的かつ慎重
medication. The disease under study should be chronic
に使用することが最善の結果を生むことになる。対
and stable. The relevant effects of the medication should
象とする疾患は慢性的で症状が安定しているべきであ
develop fully within the treatment period. The washout
る。薬剤に関連のある効果はその試験治療期間中に完
periods should be sufficiently long for complete
全に現れなければならない。ウオッシュアウト期間は
reversibility of drug effect. The fact that these conditions
薬剤効果が完全に消失するよう十分長くすべきであ
are likely to be met should be established in advance of
る。これらの条件が満たされていると考えてよいかに
the trial by means of prior information and data.
ついて、試験に先だって事前情報及びデータから確認
しておくべきである。
There are additional problems that need careful
クロスオーバー試験には、他にも注意を払わなければ
attention in crossover trials. The most notable of these
ならない問題がある。その中で最も注意すべきことは、
are the complications of analysis and interpretation
被験者の減失に起因する解析と解釈の複雑さである。
arising from the loss of subjects. Also, the potential for
また、持ち越し効果が存在する可能性があることから、
carryover leads to difficulties in assigning adverse events
後続の試験治療期間に発生した有害事象に対応した試
which occur in later treatment periods to the appropriate
験治療がどちらであるか特定することは困難である。
treatment. These, and other issues, are described in ICH
ICH E4 には、これらの問題と共に、他の問題も含め
E4. The crossover design should generally be restricted to
記述されている。クロスオーバー計画は、一般に試験
situations where losses of subjects from the trial are
からの被験者の減失が少ないと期待できる場合に限定
expected to be small.
すべきである。
A common, and generally satisfactory, use of the 2×2
2×2 クロスオーバー計画が一般的に使用されており、
crossover design is to demonstrate the bioequivalence of
通常その妥当性が満たされているのは、同一薬剤の異
two formulations of the same medication. In this
なる二つの剤型間で生物学的同等性を示す場合であ
particular application in healthy volunteers, carryover
る。特にこの健康志願者への適用の場合、二つの投与
effects on the relevant pharmacokinetic variable are most
期間の間のウオッシュアウト時間が十分長ければ、持
unlikely to occur if the wash-out time between the two
ち越し効果が、それに関連のある薬物動態変数に関し
periods is sufficiently long. However it is still important
て、発生することはほとんど考えられない。しかし、
to check this assumption during analysis on the basis of
得られたデータに基づいて、例えば各治験薬の投与開
the data obtained, for example by demonstrating that no
始時に薬物が検出できなかったことを示すことによっ
drug is detectable at the start of each period.
て、解析時にこの仮定をチェックすることはやはり重
要である。
3.1.3 Factorial Designs
3.1.3 要因計画
In a factorial design two or more treatments are
要因計画は、複数の試験治療の異なる組み合わせを複
evaluated simultaneously through the use of varying
数用いて、二つ以上の試験治療を同時に評価するもの
combinations of the treatments. The simplest example is
である。最も単純な例は、2×2 要因計画である。被験
the 2×2 factorial design in which subjects are randomly
者は二つの試験治療で可能な四つの組み合わせの一つ
allocated to one of the four possible combinations of two
にランダムに割付けられる。その四つとは、試験治療
treatments, A and B say. These are: A alone; B alone;
を A, B とすると、「A 単独」、「B 単独」、「AB 同
both A and B; neither A nor B. In many cases this design
時」、「どちらもなし」である。多くの場合、要因計
is used for the specific purpose of examining the
画は A と B の交互作用を調べるという特定の目的のた
interaction of A and B. The statistical test of interaction
めに用いられる。主効果の検定に基づいて必要な被験
may lack power to detect an interaction if the sample size
者数が計算されている場合には、交互作用の統計的検
was calculated based on the test for main effects. This
定はその検出力に欠けるであろう。要因計画が A と B
consideration is important when this design is used for
の併用効果を調べるために用いられる場合、特に二つ
examining the joint effects of A and B, in particular, if
の試験治療が併用される見込みのある場合、この配慮
the treatments are likely to be used together.
は重要である。
Another important use of the factorial design is to
要因計画の別の重要な利用法は、試験治療 C と D を同
establish the dose-response characteristics of the
時に使用する場合の用量-反応特性を立証することで
simultaneous use of treatments C and D, especially when
ある。それは特に、先行する試験から、それぞれ単剤
the efficacy of each monotherapy has been established at
での有効性がある用量において立証されている場合で
some dose in prior trials. A number, m, of doses of C is
ある。C について、通常ゼロ用量（プラセボ）を含ん
selected, usually including a zero dose (placebo), and a
だ m の用量が選択され、D についてもそれに近い数で
similar number, n, of doses of D. The full design then
ある n の用量が選択される。全ての組み合わせを含ん
consists of m×n treatment groups, each receiving a
だ計画は m×n 個の試験治療グループからなり、各グル
different combination of doses of C and D. The resulting
ープは C と D の異なる用量の組み合わせのうちの一つ
estimate of the response surface may then be used to help
を受ける。このようにして得られた反応曲面を用いる
to identify an appropriate combination of doses of C and
ことが、臨床適用のために適切な、C と D の用量の組
D for clinical use (see ICH E4).
み合わせを同定するのに役立つ場合がある（ICHE4 参
照）。
In some cases, the 2×2 design may be used to make
2×2 要因計画は、一方の試験治療の有効性を評価する
efficient use of clinical trial subjects by evaluatingthe
ための被験者数で両方の試験治療の有効性を評価する
efficacy of the two treatments with the same number of
ことにより、臨床試験の被験者を効率よく利用するた
subjects as would be required to evaluate theefficacy of
めに用いられる場合がある。この戦略は、死亡に関す
either one alone. This strategy has proved to be
る大規模試験で特に有益であることが示されている。
particularly valuable for very large mortality trials. The
この方法の効率と妥当性は、試験治療 A と B との間に
efficiency and validity of this approach depends upon the
交互作用がないことに依存している。交互作用がなけ
absence of interaction between treatments A and B so
れば、主たる有効性変数に関する A と B の効果は加法
that the effects of A and B on the primary efficacy
モデルに従い、このため A 単独の効果と B の効果に上
variables follow an additive model, and hence the effect of
乗せされた A の効果はほとんど等しくなる。クロスオ
A is virtually identical whether or not it is additional to
ーバー試験と同様に、この条件が満たされていると考
the effect of B. As for the crossover trial, evidence that
えてよいことを示す根拠は、試験に先だって事前情報
this condition is likely to be met should be established in
及びデータから確認しておくべきである。
advance of the trial by means of prior information and
data.
3.2 Multicentre Trials
3.2 多施設共同治験
Multicentre trials are carried out for two main reasons.
多施設共同治験が実施されることには、主として二つ
Firstly, a multicentre trial is an accepted way of
の理由がある。第一の理由は、多施設共同治験が、新
evaluating a new medication more efficiently; under
医薬品の有効性をより効率よく評価するための方法と
some circumstances, it may present the only practical
して認められていることである。場合によっては多施
means of accruing sufficient subjects to satisfy the trial
設共同治験が、妥当と考え得る範囲の期間内に試験の
objective within a reasonable timeframe. Multicentre
目的を満たすための十分な被験者を登録する唯一の実
trials of this nature may, in principle, be carried out at
用的な手段となることがある。このような特徴を持つ
any stage of clinical development. They may have several
多施設共同治験は、原則として、臨床開発のあらゆる
centres with a large number of subjects per centre or, in
段階で実施してよい。多施設共同治験は、施設当たり
the case of a rare disease, they may have a large number
の被験者数が多い数カ所の施設において行う場合があ
of centres with very few subjects per centre.
り、まれな疾患の試験の場合は施設当たりの被験者数
が少ない多数の施設において行うこともある。
Secondly, a trial may be designed as a multicentre (and
多施設（かつ多治験責任医師）共同治験が計画される
multi-investigator) trial primarily to provide a better
第二の理由は、得られた結果を一般化するためのより
basis for the subsequent generalisation of its findings.
適切な根拠を与えるためであろう。第二の理由は、多
This arises from the possibility of recruiting the subjects
施設共同治験がより広い患者集団から被験者を募集で
from a wider population and of administering the
きる可能性があること、及び臨床の広い範囲の現場で
medication in a broader range of clinical settings, thus
この医薬品が使用される可能性があることに基づいて
presenting an experimental situation that is more typical
いる。したがって、多施設共同治験は、将来使用され
of future use. In this case the involvement of a number of
る状況により近い実験状況を提供するものである。こ
investigators also gives the potential for a wider range of
の場合、多数の治験責任医師が参加することは、医薬
clinical judgement concerning the value of the
品の有益性に関して広範囲にわたる臨床的判断がなさ
medication. Such a trial would be a confirmatory trial in
れる可能性をも生み出すことになる。一般化を目的と
the later phases of drug development and would be likely
した多施設共同治験は、医薬品開発の後期の相での検
to involve a large number of investigators and centres. It
証的試験となり、多数の治験責任医師と施設が参加す
might sometimes be conducted in a number of different
ることになる。一般化可能性（用語集参照）を更に高
countries in order to facilitate generalisability (see
めるために、多施設共同治験が複数の異なる国にまた
Glossary) even further.
がって実施されることもあり得る。
If a multicentre trial is to be meaningfully interpreted
多施設共同治験の結果の意味が十分に解釈され、外挿
and extrapolated, then the manner in which the protocol
されるためには、治験実施計画書を実行する方法は明
is implemented should be clear and similar at all centres.
確で、すべての施設で同様のものであるべきである。
Furthermore the usual sample size and power
更に、通常行われる必要な被験者数と検出力の計算は、
calculations depend upon the assumption that the
施設が異なる場合でも比較する試験治療間の差の大き
differences between the compared treatments in the
さは同じであるという仮定に依存している。このよう
centres are unbiased estimates of the same quantity. It is
な背景を考慮して、共通の治験実施計画書を作成し、
important to design the common protocol and to conduct
以下のように多施設共同治験を実施することが重要で
the trial with this background in mind. Procedures
ある。実施手順はできるかぎり徹底して標準化すべき
should be standardised as completely as possible.
である。評価基準及び評価体系のばらつきは、治験責
Variation of evaluation criteria and schemes can be
任医師会議、試験前に行う関係者の訓練、試験実施中
reduced by investigator meetings, by the training of
の慎重なモニタリングを通して小さくすることができ
personnel in advance of the trial and by careful
る。適切な計画を立てるためには、一般に施設内で試
monitoring during the trial. Good design should
験治療毎に被験者の分布が同じになるよう心掛けるべ
generally aim to achieve the same distribution of subjects
きであり、適切な運営管理により、この計画の目的を
to treatments within each centre and good management
維持すべきである。後に施設間での試験治療効果の不
should maintain this design objective. Trials that avoid
均一性を考慮する必要性が判明した場合には、施設当
excessive variation in the numbers of subjects per centre
たりの被験者数が過度に異なることがないようにして
and trials that avoid a few very small centres have
いる試験及び極端に小規模な施設を含まない試験が有
advantages if it is later found necessary to take into
利である。それは、施設毎の重みを変えた場合でも、
account the heterogeneity of the treatment effect from
試験治療効果の重み付き推定値がそれほど異ならない
centre to centre, because they reduce the differences
からである。（この点は、すべての施設が小規模で、
between different weighted estimates of the treatment
施設の特徴が解析には現れない多施設共同治験には当
effect. (This point does not apply to trials in which all
てはまらない。）これらの予防策を採用しないことは、
centres are very small and in which centre does not
結果の均一性が疑わしいことと併せると、深刻な場合
feature in the analysis.) Failure to take these
には承認に関わる治験依頼者の主張に対して説得力の
precautions, combined with doubts about the
ある根拠を示すものとは見なせない程度まで多施設共
homogeneity of the results may, in severe cases, reduce
同治験の価値を減じるおそれがある。
the value of a multicentre trial to such a degree that it
cannot be regarded as giving convincing evidence for the
sponsor’s claims.
In the simplest multicentre trial, each investigator will
最も単純な状況の多施設共同治験では、個々の治験責
be responsible for the subjects recruited at one hospital,
任医師が一つの病院で募集された被験者に対し責任を
so that ‘centre’ is identified uniquely by either
もつため、「施設」は治験責任医師又は病院に対して
investigator or hospital. In many trials, however, the
一つに特定される。しかし、多くの場合、状況はもっ
situation is more complex. One investigator may recruit
と複雑である。恐らく一人の治験責任医師が数カ所の
subjects from several hospitals; one investigator may
病院で被験者を募集するということもあり、一人の治
represent a team of clinicians (subinvestigators) who all
験責任医師が、一つ又は複数の関連病院における自身
recruit subjects from their own clinics at one hospital or
の診察室で被験者を募集する臨床家（治験分担医師）
at several associated hospitals. Whenever there is room
のチームの代表であることもある。統計モデルにおけ
for doubt about the definition of centre in a statistical
る施設の定義に疑問の余地がある場合、治験実施計画
model, the statistical section of the protocol (see Section
書の統計の部（5.1 節参照）には、その多施設共同治
5.1) should clearly define the term (e.g. by investigator,
験の枠組みにおける施設という用語の定義を明確にす
location or region) in the context of the particular trial. In
べきである（例えば、治験責任医師毎なのか、場所な
most instances centres can be satisfactorily defined
のか、又は地域なのか）。ほとんどの場合、施設は治
through the investigators and ICH E6 provides relevant
験責任医師により定義することが十分可能で、ICH E6
guidance in this respect. In cases of doubt the aim should
はこれに関連した指針を示している。施設の定義が疑
be to define centres so as to achieve homogeneity in the
わしい場合には、主要変数の測定及び試験治療に影響
important factors affecting the measurements of the
を与える重要な要因が施設内で均一になるように施設
primary variables and the influence of the treatments.
を定義すべきである。解析の際に施設を併合するため
Any rules for combining centres in the analysis should be
のルールはすべて、可能な限り前もって治験実施計画
justified and specified prospectively in the protocol where
書中にその正当性も含めて記述すべきである。しかし
possible, but in any case decisions concerning this
どのような場合でも、とるべき手段に関する決定は、
approach should always be taken blind to treatment, for
常に試験治療について盲検下で、例えば盲検下レヴュ
example at the time of the blind review.
ーの際に行うべきである。
The statistical model to be adopted for the estimation and
試験治療の効果の推定と検定に用いる統計モデルは治
testing of treatment effects should be described in the
験実施計画書に記載すべきである。試験治療の主効果
protocol. The main treatment effect may be investigated
は、最初に施設と試験治療の交互作用を含まず施設間
first using a model which allows for centre differences,
差を考慮に入れるモデルを用いて調べることができ
but does not include a term for treatment-by-centre
る。モデルに常に交互作用を含めると、試験治療の効
interaction. If the treatment effect is homogeneous across
果が施設間で均一な場合、主効果の検定の効率が低下
centres, the routine inclusion of interaction terms in the
する。試験治療効果の不均一性が真に存在する場合に
model reduces the efficiency of the test for the main
は、主効果の解釈には様々な議論がある。
effects. In the presence of true heterogeneity of treatment
effects, the interpretation of the main treatment effect is
controversial.
In some trials, for example some large mortality trials
例えば死亡を評価する大規模試験で施設当たりの被験
with very few subjects per centre, there may be no reason
者数が少ない試験にみられる例として、施設が臨床的
to expect the centres to have any influence on the
に重要な影響を反映するとは考えにくいために施設が
primary or secondary variables because they are unlikely
主要変数又は副次変数に影響を与えると考える理由は
to represent influences of clinical importance. In other
ないといって差し支えないような場合がある。別の試
trials it may be recognised from the start that the limited
験では、施設当たりの被験者数が少ないために、統計
numbers of subjects per centre will make it impracticable
モデルに施設の効果を含めることが実行不可能である
to include the centre effects in the statistical model. In
ことが、あらかじめ認識できる場合もある。それらの
these cases it is not appropriate to include a term for
場合、モデルに施設の項を含めることは適切ではなく、
centre in the model, and it is not necessary to stratify the
施設で層別したランダム化を行うことも必ずしも必要
randomisation by centre in this situation.
ではない。
If positive treatment effects are found in a trial with
施設当たりの被験者数が不均一性を評価しうる規模の
appreciable numbers of subjects per centre, there should
試験で、試験治療の肯定的な効果が判明した場合、結
generally be an exploration of the heterogeneity of
論の一般化可能性に影響する可能性があるため、通常
treatment effects across centres, as this may affect the
は施設間における試験治療効果の不均一性を探索すべ
generalisability of the conclusions. Marked heterogeneity
きである。著しい不均一性は、個々の施設の結果を図
may be identified by graphical display of the results of
示すること又は試験治療と施設間の交互作用の有意性
individual centres or by analytical methods, such as a
検定などの解析手法によることでも確認される場合が
significance test of the treatment-by-centre interaction.
ある。交互作用の統計的有意性検定を用いる場合、試
When using such a statistical significance test, it is
験治療の主効果を検出することを目的に計画した試験
important to recognise that this generally has low power
では、一般に交互作用の検定の検出力は低いことを認
in a trial designed to detect the main effect of treatment.
識しておくことが重要である。
If heterogeneity of treatment effects is found, this should
試験治療効果の不均一性がみられた場合、その解釈に
be interpreted with care and vigorous attempts should be
は注意すべきであり、試験の運営管理面又は被験者の
made to find an explanation in terms of other features of
特徴といったそれとは別の面から説明できるかどうか
trial management or subject characteristics. Such an
を、積極的に調べるべきである。通常はその説明によ
explanation will usually suggest appropriate further
って、適切な追加解析と解釈が示唆される。説明がで
analysis and interpretation. In the absence of an
きない場合、例えば著しい量的交互作用（用語集参照）
explanation, heterogeneity of treatment effect as
から試験治療効果の不均一性の存在が明らかとなるこ
evidenced, for example, by marked quantitative
とは、施設に異なる重みを与えて試験治療効果の推定
interactions (see Glossary) implies that alternative
値を複数求め、試験治療効果の推定値の安定性を実証
estimates of the treatment effect may be required, giving
する必要があることを意味する。不均一性が著しい質
different weights to the centres, in order to substantiate
的交互作用（用語集参照）により特徴付けられるもの
the robustness of the estimates of treatment effect. It is
であれば、その理由を解明することは更に重要であり、
even more important to understand the basis of any
説明ができない場合は、試験治療効果を確実に予測す
heterogeneity characterised by marked qualitative
るために、追加の臨床試験を必要とするであろう。
interactions (see Glossary), and failure to find an
explanation may necessitate further clinical trials before
the treatment effect can be reliably predicted.
Up to this point the discussion of multicentre trials has
これまで、多施設共同治験に関する議論は、固定効果
been based on the use of fixed effect models. Mixed
モデルを用いることを前提としてきた。混合モデルも
models may also be used to explore the heterogeneity of
試験治療効果の不均一性を探索するために利用でき
the treatment effect. These models consider centre and
る。混合モデルでは、施設及び試験治療と施設の交互
treatment-by-centre effects to be random, and are
作用を変量効果として扱っており、特に施設数が多い
especially relevant when the number of sites is large.
場合に用いることが適切である。
3.3 Type of Comparison
3.3 比較の型式
3.3.1 Trials to Show Superiority
3.3.1 優越性を示すための試験
Scientifically, efficacy is most convincingly established by
科学的には、有効性を立証するには、プラセボ対照試
demonstrating superiority to placebo in a
験でプラセボに優ることを示すこと、実対照薬に優る
placebo-controlled trial, by showing superiority to an
ことを示すこと又は用量-反応関係を示すことが最も
active control treatment or by demonstrating a
説得力がある。この型式の試験を「優越性」試験（用
dose-response relationship. This type of trial is referred
語集参照）と呼ぶこととする。本ガイドラインでは、
to as a ‘superiority’ trial (see Glossary). Generally in this
特に断らない限り優越性試験を前提としている。
guidance superiority trials are assumed, unless it is
explicitly stated otherwise.
For serious illnesses, when a therapeutic treatment
重篤な疾患に対して優越性試験により有効であること
which has been shown to be efficacious by superiority
が示されている治療法が存在する場合、プラセボ対照
trial(s) exists, a placebo-controlled trial may be
試験は非倫理的と考えられることがある。その場合、
considered unethical. In that case the scientifically sound
実治療を対照として科学的に正しく用いることを考慮
use of an active treatment as a control should be
すべきである。プラセボ対照と実薬対照のどちらが適
considered. The appropriateness of placebo control vs.
切であるかは、個々の試験ごとに判断すべきである。
active control should be considered on a trial by trial
basis.
3.3.2 Trials to Show Equivalence or Non-inferiority
3.3.2 同等性又は非劣性を示すための試験
In some cases, an investigational product is compared to
優越性を示す目的以外にも、被験薬と標準治療とが比
a reference treatment without the objective of showing
較される場合がある。この型式の試験は目的に応じて
superiority. This type of trial is divided into two major
二つの主要なカテゴリに分けられる。一つは「同等性」
categories according to its objective; one is an
試験（用語集参照）で、もう一つは「非劣性」試験（用
'equivalence' trial (see Glossary) and the other is a
語集参照）である。
'non-inferiority' trial (see Glossary).
Bioequivalence trials fall into the former category. In
生物学的同等性試験は前者のカテゴリに属している。
some situations, clinical equivalence trials are also
また、例えば化合物が吸収されずそのために血中に現
undertaken for other regulatory reasons such as
れない場合に、後発医薬品と先発医薬品との臨床的同
demonstrating the clinical equivalence of a generic
等性を示すといった規制側の理由から、臨床的同等性
product to the marketed product when the compound is
試験が要求されることがある。
not absorbed and therefore not present in the blood
stream.
Many active control trials are designed to show that the
多くの実薬対照試験は、被験薬の有効性が実対照薬の
efficacy of an investigational product is no worse than
有効性よりも劣らないことを示すために計画され、し
that of the active comparator, and hence fall into the
たがって後者のカテゴリに属している。実薬対照試験
latter category. Another possibility is a trial in which
のもう一つの例は、被験薬の複数の用量と標準薬の推
multiple doses of the investigational drug are compared
奨用量又は複数の用量が比較される試験である。被
with the recommended dose or multiple doses of the
験薬の用量-反応関係を示すこと及び被験薬と実対照
standard drug. The purpose of this design is
薬とを比較することを同時に行うことがこの試験計画
simultaneously to show a dose response relationship for
の目的である。
the investigational product and to compare the
investigational product with the active control.
Active control equivalence or non-inferiority trials may
実薬対照同等性試験又は非劣性試験には、プラセボを
also incorporate a placebo, thus pursuing multiple goals
組み込んでもよく、そうすることで一つの試験で複数
in one trial; for example, they may establish superiority
の目標を達成できる。例えば、プラセボに対する優越
to placebo and hence validate the trial design and
性の立証とその結果として試験計画の妥当性を確認で
simultaneously evaluate the degree of similarity of
きると同時に、実対照薬に対する有効性及び安全性が
efficacy and safety to the active comparator. There are
どの程度類似しているかについても評価できる。プラ
well known difficulties associated with the use of the
セボを含まない又は被験薬の複数用量を用いない実薬
active control equivalence (or non-inferiority) trials that
対照同等性試験（又は非劣性試験）には、よく知られ
do not incorporate a placebo or do not use multiple doses
た問題点がある。その問題点とは、（優越性試験とは
of the new drug. These relate to the implicit lack of any
対照的に）内部妥当性を示すいかなる指標も必然的に
measure of internal validity (in contrast to superiority
存在していないことであり、このため外部情報による
trials), thus making external validation necessary. The
妥当性の確認を必要とする。同等性試験（又は非劣性
equivalence (or non-inferiority) trial is not conservative
試験）は本質的に保守的でないため、試験の計画上又
in nature, so that many flaws in the design or conduct of
は実施上の多くの不備が、同等であると結論づける方
the trial will tend to bias the results towards a conclusion
向へ結果を偏らせる傾向がある。これらの理由から、
of equivalence. For these reasons, the design features of
このような試験ではその計画上の特徴に特に注意すべ
such trials should receive special attention and their
きであり、慎重に実施する必要がある。例えば、登録
conduct needs special care. For example, it is especially
基準違反、服薬不遵守、試験治療の中止、追跡不能、
important to minimise the incidence of violations of the
欠測データ及び治験実施計画書からのその他の逸脱を
entry criteria, non-compliance, withdrawals, losses to
最小限に抑えることは特に重要であり、またこれらが
follow-up, missing data and other deviations from the
解析に与える影響も抑えることが重要である。
protocol, and also to minimise their impact on the
subsequent analyses.
Active comparators should be chosen with care. An
実対照薬は慎重に選択すべきである。適切な実対照薬
example of a suitable active comparator would be a
の例としては、広く使用されている治療法で、十分に
widely used therapy whose efficacy in the relevant
計画され記録されている一つ以上の優越性試験によっ
indication has been clearly established and quantified in
て適切な適応に対する有効性が明確に立証され定量的
well designed and well documented superiority trial(s)
に示されており、現在計画している実薬対照試験にお
and which can be reliably expected to exhibit similar
いても同様の有効性を示すことが十分に期待できるも
efficacy in the contemplated active control trial. To this
のがあげられる。このためには、新たに行う試験に関
end, the new trial should have the same important
連した、医学又は統計学の進歩を考慮した上で、新た
design features (primary variables, the dose of the active
に行う治験計画上の重要な特徴（主要変数、実対照薬
comparator, eligibility criteria, etc.) as the previously
の用量、適格基準など）を、実対照薬が臨床的に適切
conducted superiority trials in which the active
な有効性を明確に示した過去の優越性試験と同じにす
comparator clearly demonstrated clinically relevant
べきである。
efficacy, taking into account advances in medical or
statistical practice relevant to the new trial.
It is vital that the protocol of a trial designed to
同等性又は非劣性を証明するために計画された試験で
demonstrate equivalence or non-inferiority contain a
は治験実施計画書に同等性又は非劣性を示すために計
clear statement that this is its explicit intention. An
画されたということを明確に述べることが不可欠であ
equivalence margin should be specified in the protocol;
る。治験実施計画書には同等限界を明示しておくべき
this margin is the largest difference that can be judged as
である。同等限界とは、臨床的に許容できると判断し
being clinically acceptable and should be smaller than
うる最大の差であり、実対照薬の有効性を立証した優
differences observed in superiority trials of the active
越性試験において観測された差よりも小さいものであ
comparator. For the active control equivalence trial, both
るべきである。実薬対照同等性試験では、上側及び下
the upper and the lower equivalence margins are needed,
側両方の同等限界が必要であり、実薬対照非劣性試験
while only the lower margin is needed for the active
では下側同等限界のみが必要である。同等限界の大き
control non-inferiority trial. The choice of equivalence
さの選択には、十分な臨床的根拠を示すべきである。
margins should be justified clinically.
Statistical analysis is generally based on the use of
統計解析は、通常信頼区間に基づいて行われる（5.5
confidence intervals (see Section 5.5). For equivalence
節参照）。同等性試験では、両側信頼区間を用いるべ
trials, two-sided confidence intervals should be used.
きである。信頼区間全体が同等限界内に含まれる場合、
Equivalence is inferred when the entire confidence
同等であると推論する。両側信頼区間の使用は、試験
interval falls within the equivalence margins.
治療の差は同等限界の外側にあるという（複合）帰無
Operationally, this is equivalent to the method of using
仮説に対し、試験治療の差は同等限界の内側にあると
two simultaneous one-sided tests to test the (composite)
いう（複合）対立仮説を検定する、二つの片側検定を
null hypothesis that the treatment difference is outside
同時に行う方法と実際上同じものである。二つの帰無
the equivalence margins versus the (composite)
仮説には重なりがないため、第一種の過誤は適切に制
alternative hypothesis that the treatment difference is
御される。非劣性試験では、片側信頼区間を用いるべ
within the margins. Because the two null hypotheses are
きである。信頼区間を用いた方法は、（被験薬から対
disjoint, the type I error is appropriately controlled. For
照を引いた）試験治療間の差は下側同等限界に等しい
non-inferiority trials a one-sided interval should be used.
という帰無仮説に対して、試験治療間の差は下側同等
The confidence interval approach has a one-sided
限界よりも大きいという対立仮説を検定する片側仮説
hypothesis test counterpart for testing the null
検定に対応する。第一種の過誤の大きさの選択は、片
hypothesis that the treatment difference (investigational
側検定又は両側検定のどちらを選択するかとは別に検
product minus control) is equal to the lower equivalence
討すべきである。被験者数の計算は、これらの方法に
margin versus the alternative that the treatment
基づくべきである（3.5 節参照）。
difference is greater than the lower equivalence margin.
The choice of type I error should be a consideration
separate from the use of a one-sided or two-sided
procedure. Sample size calculations should be based on
these methods (see Section 3.5).
Concluding equivalence or non-inferiority based on
被験薬と実対照薬に差がないという帰無仮説の検定結
observing a non-significant test result of the null
果が有意でないことから、同等性又は非劣性が示され
hypothesis that there is no difference between the
たと結論することは不適切である。
investigational product and the active comparator is
inappropriate.
There are also special issues in the choice of analysis
解析対象集団の選択にも特別な問題が生じる。試験治
sets. Subjects who withdraw or dropout of the treatment
療グループ又は対照治療グループにおいて、試験治療
group or the comparator group will tend to have a lack of
を中止した被験者又はそれらのグループから脱落した
response, and hence the results of using the full analysis
被験者は、効果が現れにくいことから、最大の解析対
set (see Glossary) may be biased toward demonstrating
象集団（用語集参照）を用いた結果は同等性を示す
equivalence (see Section 5.2.3).
方向に偏るおそれがある（5.2.3 節参照）。
3.3.3 Trials to Show Dose-response Relationship
3.3.3 用量-反応関係を示すための試験
How response is related to the dose of a new
被験薬がどのような用量-反応関係を示すかは、開発の
investigational product is a question to which answers
すべての相から、様々な方法によ
may be obtained in all phases of development, and by a
って解答が得られる可能性がある問題である（ICH E4
variety of approaches (see ICH E4). Dose-response trials
参照）。用量-反応試験は多くの目的に役立つであろう。
may serve a number of objectives, amongst which the
中でも次に示すものは特に重要である。有効性の確認、
following are of particular importance: the confirmation
用量-反応曲線の形状と位置の研究、適切な開始用量の
of efficacy; the investigation of the shape and location of
推定、個人毎の用量の調整に最適な戦略の同定、それ
the dose-response curve; the estimation of an appropriate
以上臨床上の利益を見込むことができない最大用量の
starting dose; the identification of optimal strategies for
決定。プラセボ（ゼロ用量）を含めることが適切な場
individual dose adjustments; the determination of a
合にはプラセボを含め、多くの用量について集められ
maximal dose beyond which additional benefit would be
たデータを用いて、これらの目的に対応する必要があ
unlikely to occur. These objectives should be addressed
る。そのためには、用量-反応関係の推定に信頼区間の
using the data collected at a number of doses under
構成及びグラフ表示を用いた手法を適用することが、
investigation, including a placebo (zero dose) wherever
統計的検定を使用することと同程度に重要である。仮
appropriate. For this purpose the application of
説検定を用いる場合は、用量の順序関係又は用量-反応
procedures to estimate the relationship between dose and
曲線の形状に関する個々の問題（例えば単調性）に対
response, including the construction of confidence
応した方法を用いる必要があろう。予定している統計
intervals and the use of graphical methods, is as
的な手続きに関する内容の詳細は、治験実施計画書に
important as the use of statistical tests. The hypothesis
述べるべきである。
tests that are used may need to be tailored to the natural
ordering of doses or to particular questions regarding the
shape of the dose-response curve (e.g. monotonicity). The
details of the planned statistical procedures should be
given in the protocol.
3.4 Group Sequential Designs
3.4 逐次群計画
Group sequential designs are used to facilitate the
逐次群(群逐次)計画は、中間解析（4.5 節及び用語集参
conduct of interim analysis (see section 4.5 and
照）を実施するために用いられる。中間解析を可能に
Glossary). While group sequential designs are not the
する計画がいくつかある中で、逐次群計画が受け入れ
only acceptable types of designs permitting interim
可能な唯一の型式というわけではないが最もよく適用
analysis, they are the most commonly applied because it
されている。なぜならば、被験者の結果を試験期間中
is more practicable to assess grouped subject outcomes at
定期的にまとめて評価することは、個々の被験者の結
periodic intervals during the trial than on a continuous
果が利用可能になる都度評価するよりも実際的だから
basis as data from each subject become available. The
である。逐次群計画での統計手法は、試験治療の結果
statistical methods should be fully specified in advance of
及び試験治療の割付に関する情報が利用可能となる
the availability of information on treatment outcomes
（盲検解除、4.5 節参照）前に、完全に明記しておく
and subject treatment assignments (i.e. blind breaking,
べきである。独立データモニタリング委員会(効果安全
see Section 4.5). An Independent Data Monitoring
性評価委員会)（用語集参照）は、逐次群計画から得ら
Committee (see Glossary) may be used to review or to
れるデータの中間解析の実施又は検討のために利用さ
conduct the interim analysis of data arising from a group
れる（4.6 節参照）。逐次群計画は、死亡又は重大な
sequential design (see Section 4.6). While the design has
非致死性の評価項目を調べる大規模で長期にわたる試
been most widely and successfully used in large,
験で広く用いられ成功してきたが、その他の状況でも
long-term trials of mortality or major non-fatal
利用されることが増えてきている。特にすべての試験
endpoints, its use is growing in other circumstances. In
で安全性をモニターしなければならないことは共通の
particular, it is recognized that safety must be monitored
認識となっていることから、安全性の理由から早期中
in all trials and therefore the need for formal procedures
止を行うことも含めて正式な手続きの必要性を常に検
to cover early stopping for safety reasons should always
討しておくべきである。
be considered.
3.5 Sample Size
3.5 必要な被験者数
The number of subjects in a clinical trial should always
臨床試験の被験者数は、提示された問題に信頼のおけ
be large enough to provide a reliable answer to the
る解答を与えられるよう常に十分多くすべきである。
questions addressed. This number is usually determined
試験に必要な被験者数は、通常試験の主要な目的によ
by the primary objective of the trial. If the sample size is
り決められる。被験者数がその他の理由から決定され
determined on some other basis, then this should be
る場合には、その理由を明確にし正当化しておくべき
made clear and justified. For example, a trial sized on the
である。例えば、安全性に関する問題若しくは要求に
basis of safety questions or requirements or important
基づいた試験又は重要な副次目的に基づいて被験者数
secondary objectives may need larger numbers of
が決定される試験では、主要な有効性の問題に基づい
subjects than a trial sized on the basis of the primary
て被験者数が決定される試験よりも多くの被験者数を
efficacy question (see, for example, ICH E1a).
必要とするであろう（例えば、ICH E1A 参照）。
Using the usual method for determining the appropriate
適切な被験者数を決定するために用いられる通常の方
sample size, the following items should be specified: a
法を利用するためには、以下の項目を定めておくこと
primary variable, the test statistic, the null hypothesis,
が必要である。それらは、主要変数、検定統計量、帰
the alternative ('working') hypothesis at the chosen
無仮説、選択された用量での対立（「作業」）仮説（そ
dose(s) (embodying consideration of the treatment
の用量と選ばれた対象集団で検出すべき又は棄却すべ
difference to be detected or rejected at the dose and in the
き試験治療の差を考慮することも含めて）、誤って帰
subject population selected), the probability of
無仮説を棄却する確率（第一種の過誤）及び誤って帰
erroneously rejecting the null hypothesis (the type I
無仮説を棄却できない確率（第二種の過誤）であり、
error), and the probability of erroneously failing to reject
更に試験治療を中止した被験者及び治験実施計画書違
the null hypothesis (the type II error), as well as the
反を取り扱う方法も定める必要がある。検出力の評価
approach to dealing with treatment withdrawals and
のために、イベント発生率が主要な関心事項となる場
protocol violations. In some instances, the event rate is of
合には、試験に必要なイベント数から最終的な被験者
primary interest for evaluating power, and assumptions
数を外挿するための仮定も置くべきである。
should be made to extrapolate from the required number
of events to the eventual sample size for the trial.
The method by which the sample size is calculated should
被験者数を計算する方法は、計算に用いる見積値（分
be given in the protocol, together with the estimates of
散、平均値、反応割合、イベント発生率、検出すべき
any quantities used in the calculations (such as
差）とともに、治験実施計画書に定めておくべきであ
variances, mean values, response rates, event rates,
る。また、これらの見積値の根拠も示すべきである。
difference to be detected). The basis of these estimates
これらの仮定からの様々なずれに対して、必要な被験
should also be given. It is important to investigate the
者数がどの程度変わり易いか調べることは重要であ
sensitivity of the sample size estimate to a variety of
り、このためには実際に起こりうるずれの範囲に対応
deviations from these assumptions and this may be
する被験者数の範囲を示すことで実施することが容易
facilitated by providing a range of sample sizes
になるであろう。検証的試験では、通常これらの仮定
appropriate for a reasonable range of deviations from
は公表されたデータ又は先行する試験の結果に基づく
assumptions. In confirmatory trials, assumptions should
べきである。検出すべき試験治療の差は、患者の治療
normally be based on published data or on the results of
管理を行う上で臨床的に意味をもつ最小限度の効果に
earlier trials. The treatment difference to be detected
関する判断又は新しい試験治療の予想される効果の方
may be based on a judgement concerning the minimal
が大きい場合にはその効果に関する判断に基づいて決
effect which has clinical relevance in the management of
まるものであろう。慣例的に、第一種の過誤は 5%以
patients or on a judgement concerning the anticipated
下に設定され、多重性を考慮するために必要な調整が
effect of the new treatment, where this is larger.
なされる場合はそれに従って設定される。検証すべき
Conventionally the probability of type I error is set at 5%
仮説のもっともらしさ及び検定結果に望む影響力の強
or less or as dictated by any adjustments made necessary
さにより、第一種の過誤の的確な選択に影響が及ぼさ
for multiplicity considerations; the precise choice may be
れるであろう。
第二種の過誤は、慣例として 10%∼20%
influenced by the prior plausibility of the hypothesis
に設定される。第二種の過誤を実施可能な範囲ででき
under test and the desired impact of the results. The
る限り小さくすることは、特に繰り返すことが困難又
probability of type II error is conventionally set at 10% to
は不可能な試験の場合、治験依頼者の利益となる。慣
20%; it is in the sponsor’s interest to keep this figure as
例として用いている第一種の過誤の値及び第二種の過
low as feasible especially in the case of trials that are
誤の値とは異なる値を用いることも許容される場合が
difficult or impossible to repeat. Alternative values to the
あり、むしろそれが好ましいこともあり得る。
conventional levels of type I and type II error may be
acceptable or even preferable in some cases.
Sample size calculations should refer to the number of
被験者数の計算は、主要な解析で用いる解析対象集団
subjects required for the primary analysis. If this is the
に基づくべきである。解析対象集団が「最大の解析対
'full analysis set', estimates of the effect size may need to
象集団」である場合、効果の大きさに関する見積値は、
be reduced compared to the per protocol set (see
治験実施計画書に適合した対象集団（用語集参照）の
Glossary). This is to allow for the dilution of the
場合に比べて小さくする必要があろう。これは、試験
treatment effect arising from the inclusion of data from
治療を中止した被験者又は服薬遵守状況の悪い被験者
patients who have withdrawn from treatment or whose
を解析に含めることにより、試験治療の効果が薄めら
compliance is poor. The assumptions about variability
れることを考慮するためである。このときばらつきに
may also need to be revised.
関する仮定も再検討する必要があろう。
The sample size of an equivalence trial or a
同等性試験又は非劣性試験の被験者数（3.3.2 節参照）
non-inferiority trial (see Section 3.3.2) should normally
は、通常試験治療の差の信頼区間を用いて、試験治療
be based on the objective of obtaining a confidence
間の差が最大でも臨床的に許容できる範囲であること
interval for the treatment difference that shows that the
を示すという目的に基づいて計算すべきである。同等
treatments differ at most by a clinically acceptable
性試験での検出力が真の差をゼロとして設定されてい
difference. When the power of an equivalence trial is
る場合、真の差がゼロでなければ、この検出力を達成
assessed at a true difference of zero, then the sample size
するために必要な被験者数よりも少なく見積もられる
necessary to achieve this power is underestimated if the
ことになる。非劣性試験での検出力が差をゼロとして
true difference is not zero. When the power of a
設定されている場合、被験薬の効果が実対照薬の効果
non-inferiority trial is assessed at a zero difference, then
よりも小さければ、この検出力を達成するための必要
the sample size needed to achieve that power will be
な被験者数よりも少なく見積もられることになる。
「臨
underestimated if the effect of the investigational
床的に許容できる」差は、その選択に当たり市販後使
product is less than that of the active control. The choice
用される患者に対してどのような意味を持つかに関す
of a 'clinically acceptable’ difference needs justification
る正当な理由が必要であり、差が存在することを立証
with respect to its meaning for future patients, and may
するために計画する優越性試験において参照した前述
be smaller than the 'clinically relevant' difference
の「臨床的に適切な」差よりも小さくなるであろう。
referred to above in the context of superiority trials
designed to establish that a difference exists.
The exact sample size in a group sequential trial cannot
逐次群試験での正確な被験者数は、選択した中止指針
be fixed in advance because it depends upon the play of
と真の試験治療の差に依存する上に、偶然の動きにも
chance in combination with the chosen stopping
左右されるため、事前には固定できない。中止指針の
guideline and the true treatment difference. The design
設計には、試験を続けた際の被験者数の分布を考慮す
of the stopping guideline should take into account the
べきであり、通常これは期待被験者数及び最大被験者
consequent distribution of the sample size, usually
数により具体的に示される
embodied in the expected and maximum sample sizes.
When event rates are lower than anticipated or
イベント発生率が予想よりも低い場合、又はばらつき
variability is larger than expected, methods for sample
が予想よりも大きい場合は、割付を明らかにすること
size re-estimation are available without unblinding data
又は試験治療間の比較を行うことなく被験者数を見直
or making treatment comparisons (see Section 4.4).
すことができる（4.4 節参照）。
3.6 Data Capture and Processing
3.6 データの獲得と処理
The collection of data and transfer of data from the
治験責任医師から治験依頼者へのデータの収集と転送
investigator to the sponsor can take place through a
は、症例記録用紙、遠隔地モニタリングシステム、医
variety of media, including paper case record forms,
療コンピューターシステム、電子的転送等の様々な媒
remote site monitoring systems, medical computer
体で行うことができる。どのようなデータ獲得の手段
systems and electronic transfer. Whatever data capture
を用いても、収集する情報の様式及びその内容は治験
instrument is used, the form and content of the
実施計画書と完全に一致させるべきであり、臨床試験
information collected should be in full accordance with
の実施前に確定しておくべきである。収集する情報の
the protocol and should be established in advance of the
様式及びその内容は予定した解析の実施に必要なデー
conduct of the clinical trial. It should focus on the data
タに合わせて考えるべきである。必要なデータには、
necessary to implement the planned analysis, including
治験実施計画書遵守状況の確認又は重要な治験実施計
the context information (such as timing assessments
画書からの逸脱を明らかにするために必要な背景情報
relative to dosing) necessary to confirm protocol
（服薬に対応した評価の時期等）が含まれる。「欠測
compliance or identify important protocol deviations.
値」は「ゼロ」又は「該当せず」と区別できるように
‘Missing values’ should be distinguishable from the ‘value
すべきである。
zero’ or ‘characteristic absent’.
The process of data capture through to database
データベースの確定までのデータ獲得の手順は、GCP
finalisation should be carried out in accordance with GCP
に従って実行すべきである（ICH E6、5 節参照）。特
(see ICH E6, Section 5). Specifically, timely and reliable
に、質の高いデータベースの引き渡しを確かにし、予
processes for recording data and rectifying errors and
定した解析の履行を通した試験目的の達成を確かにす
omissions are necessary to ensure delivery of a quality
るためには、データの記録並びに誤り及び無記入の訂
database and the achievement of the trial objectives
正のために、適切なタイミングで信頼できる処理を実
through the implementation of the planned analysis.
施することが、必要である。
IV. TRIAL CONDUCT CONSIDERATIONS
IV. 試験実施上で考慮すべきこと
4.1 Trial Monitoring and Interim Analysis
4.1 治験モニタリングと中間解析
Careful conduct of a clinical trial according to the
治験実施計画書に従って臨床試験が慎重に実施されて
protocol has a major impact on the credibility of the
いるかどうかは、結果の信憑性に重要な影響を与える
results (see ICH E6). Careful monitoring can ensure that
（ICH E6 参照）。慎重なモニタリングによって、実
difficulties are noticed early and their occurrence or
施上の問題の所在を早期に発見するとともに、問題の
recurrence minimised.
発生又は再発を最小限に抑えることが保証できる。
There are two distinct types of monitoring that generally
モニタリングには、製薬企業が依頼する検証的試験の
characterise confirmatory clinical trials sponsored by the
性格を一般に左右するような二つの異なる型式が存在
pharmaceutical industry. One type of monitoring
する。一方の型式は試験の質の監視と関係したもので
concerns the oversight of the quality of the trial, while
あり、もう一方の型式は試験治療の比較のために割付
the other type involves breaking the blind to make
を明らかにすることを伴うものである（中間解析）。
treatment comparisons (i.e. interim analysis). Both types
治験モニタリングにおける二つの型式はどちらも、異
of trial monitoring, in addition to entailing different staff
なるスタッフの責任を伴ううえ、異なる型式の試験デ
responsibilities, involve access to different types of trial
ータ及び情報へのアクセスを必要とすることから、こ
data and information, and thus different principles apply
のための異なる原則が、潜在的な統計的及び運営上の
for the control of potential statistical and operational
偏りの制御に適用される。
bias.
For the purpose of overseeing the quality of the trial the
試験の質を監視するためには、治験モニタリングで、
checks involved in trial monitoring may include whether
治験実施計画書が守られているか、集積されたデータ
the protocol is being followed, the acceptability of data
が受け入れ可能か、予定している集積目標が達成され
being accrued, the success of planned accrual targets, the
ているか、計画時に用いた仮定は適切か、患者の試験
appropriateness of the design assumptions, success in
への継続的参加に成功しているか、などをチェックす
keeping patients in the trials, etc. (see Sections 4.2 to
る必要があろう（4.2 節から 4.4 節を参照）。この型式
4.4). This type of monitoring does not require access to
のモニタリングは、試験治療効果の比較に関する情報
information on comparative treatment effects, nor
へのアクセスを必要としないだけでなく、割付を明ら
unblinding of data and therefore has no impact on type I
かにしたデータを必要としないため、第一種の過誤へ
error. The monitoring of a trial for this purpose is the
の影響を与えるものではない。この目的での治験モニ
responsibility of the sponsor (see ICH E6) and can be
タリングは治験依頼者の責任であり（ICH E6 参照）、
carried out by the sponsor or an independent group
治験依頼者又は治験依頼者によって任命された独立し
selected by the sponsor. The period for this type of
たグループが行うことができる。この型式のモニタリ
monitoring usually starts with the selection of the trial
ングの期間は、通常試験実施施設が選択されたときに
sites and ends with the collection and cleaning of the last
始まり、最後の被験者のデータが収集されクリーニン
subject’s data.
グされたときに終了する。
The other type of trial monitoring (interim analysis)
治験モニタリングのもう一つの型式（中間解析）は、
involves the accruing of comparative treatment results.
比較のため試験治療の結果の集積を必要とする。中間
Interim analysis requires unblinded (i.e. key breaking)
解析は、割付を明らかにして（キーコードの開示）試
access to treatment group assignment (actual treatment
験治療グループにアクセスすること（実際の試験治療
assignment or identification of group assignment) and
の割付、又は割付グループの同定）を必要とし、比較
comparative treatment group summary information.
を行う試験治療グループ間の要約情報を必要とする。
This necessitates that the protocol (or appropriate
このため、ある種の偏りを防ぐ目的で、中間解析のた
amendments prior to a first analysis) contains statistical
めの統計解析計画を治験実施計画書中（又は最初に解
plans for the interim analysis to prevent certain types of
析を行う前に目的にあった改訂を行った場合、その改
bias. This is discussed in Sections 4.5 & 4.6.
訂中）に含める必要がある。これに関しては 4.5 節と
4.6 節で議論する。
4.2 Changes in Inclusion and Exclusion Criteria
4.2 選択基準と除外基準の変更
Inclusion and exclusion criteria should remain constant,
選択基準及び除外基準は、被験者募集期間を通じて、
as specified in the protocol, throughout the period of
治験実施計画書に明記されているとおり一定に保つべ
subject recruitment. Changes may occasionally be
きである。ときには基準を変更することが適切な場合
appropriate, for example, in long term trials, where
もある。例えば、長期にわたる試験において、その試
growing medical knowledge either from outside the trial
験以外又は中間解析による医学知識の蓄積により、登
or from interim analyses may suggest a change of entry
録基準の変更が示唆される場合である。登録基準の違
criteria. Changes may also result from the discovery by
反が日常的に起こること又は募集率の低さが深刻であ
monitoring staff that regular violations of the entry
ることが、登録基準の制限が強すぎたためであること
criteria are occurring, or that seriously low recruitment
をモニタリング担当者が発見することにより、登録基
rates are due to over-restrictive criteria. Changes should
準の変更がなされる場合もある。登録基準の変更は割
be made without breaking the blind and should always
付を明らかにしない状態で行うべきであり、治験実施
be described by a protocol amendment which should
計画書の改訂に常に記述すべきである。治験実施計画
cover any statistical consequences, such as sample size
書の改訂には、例えば、イベント発生率が異なること
adjustments arising from different event rates, or
により行われなければならない必要な被験者数の調整
modifications to the planned analysis, such as stratifying
などの統計的変更の内容、又は修正された選択/除外基
the analysis according to modified inclusion/exclusion
準に従った解析の層化など、予定した解析の修正を含
criteria.
めるべきである。
4.3 Accrual Rates
4.3 集積率
In trials with a long time-scale for the accrual of subjects,
被験者の集積が長期にわたる試験では、集積率をモニ
the rate of accrual should be monitored and, if it falls
ターすべきである。もしそれが予定している水準を大
appreciably below the projected level, the reasons should
きく下回る場合には、その理由を確認すべきであり、
be identified and remedial actions taken in order to
試験の検出力を保ち、選択的登録及びその他試験の質
protect the power of the trial and alleviate concerns
に関する別の側面についての懸念を和らげるための対
about selective entry and other aspects of quality. In a
応策をとるべきである。多施設共同治験では、個々の
multicentre trial these considerations apply to the
施設において、これらの配慮が適用される。
individual centres.
4.4 Sample Size Adjustment
4.4 必要な被験者数の調整
In long term trials there will usually be an opportunity to
長期にわたる試験では、通常、当初の計画で用いた被
check the assumptions which underlay the original
験者数の計算根拠となる仮定を確認するための機会が
design and sample size calculations. This may be
あろう。この確認は、試験計画の詳細が予備的情報若
particularly important if the trial specifications have
しくは不確実な情報、又はその両方に基づいている場
been made on preliminary and/or uncertain information.
合、特に重要であろう。盲検下のデータを用い中間で
An interim check conducted on the blinded data may
の確認を行うことにより、それまでの試験全体での、
reveal that overall response variances, event rates or
反応の分散、イベント発生率又は生存状況が予期して
survival experience are not as anticipated. A revised
いた状況と異なることが明らかにされる場合がある。
sample size may then be calculated using suitably
その場合、適切に修正した仮定に基づいて被験者数の
modified assumptions, and should be justified and
再計算を行うこととなるが、その正当性を明らかにし、
documented in a protocol amendment and in the clinical
治験実施計画書の改訂及び総括報告書に記録しなけれ
study report. The steps taken to preserve blindness and
ばならない。盲検性を維持するために行う手続きと共
the consequences, if any, for the type I error and the
に、可能であれば、第一種の過誤と信頼区間の幅に対
width of confidence intervals should be explained. The
する被験者数の変更による影響を説明すべきである。
potential need for re-estimation of the sample size should
被験者数の再見積もりが必要になる可能性がある場合
be envisaged in the protocol whenever possible (see
には、そのことを可能な限り治験実施計画書に述べる
Section 3.5).
べきである（3.5 節参照）。
4.5 Interim Analysis and Early Stopping
4.5 中間解析と早期中止
An interim analysis is any analysis intended to compare
中間解析とは、試験が正式に完了する前に行われる有
treatment arms with respect to efficacy or safety at any
効性又は安全性に関する試験治療群間の比較を意図し
time prior to formal completion of a trial. Because the
たすべての解析を指す。中間解析の回数、方法及び結
number, methods and consequences of these comparisons
果が試験の解釈に影響するため、実施するすべての中
affect the interpretation of the trial, all interim analyses
間解析は前もって慎重に計画し、治験実施計画書に記
should be carefully planned in advance and described in
述すべきである。特別な状況では、試験開始当初には
the protocol. Special circumstances may dictate the need
予定していなかった中間解析が必要となる場合があ
for an interim analysis that was not defined at the start
る。中間解析が必要となる場合、割付が明らかにされ
of a trial. In these cases, a protocol amendment
試験治療を比較するデータにアクセスする前に、治験
describing the interim analysis should be completed prior
実施計画書の改訂に中間解析について記述しておくべ
to unblinded access to treatment comparison data. When
きである。試験を継続すべきか中止すべきかの決定を
an interim analysis is planned with the intention of
目的として中間解析を計画する場合、統計的モニタリ
deciding whether or not to terminate a trial, this is
ング計画を指針とする逐次群計画を用いるのが通常で
usually accomplished by the use of a group sequential
ある（3.4 節参照）。このような中間解析の目的は、
design which employs statistical monitoring schemes as
研究中の試験治療の優越性が疑いなく立証された場
guidelines (see Section 3.4). The goal of such an interim
合、適切な試験治療の差を示す見込みのないことが判
analysis is to stop the trial early if the superiority of the
明した場合又は許容できない有害作用が明らかになっ
treatment under study is clearly established, if the
た場合に試験を早期に中止することにある。一般に、
demonstration of a relevant treatment difference has
有効性モニタリングのための棄却限界値は、安全性モ
become unlikely or if unacceptable adverse effects are
ニタリングのための棄却限界値よりも、試験を早期に
apparent. Generally, boundaries for monitoring efficacy
中止するための証拠をより多く必要とする（つまり、
require more evidence to terminate a trial early (i.e. they
より保守的とする必要がある）。治験実施計画とモニ
are more conservative) than boundaries for monitoring
タリングの目的により、複数の評価項目が用いられる
safety. When the trial design and monitoring objective
場合、それによる多重性にも注意を払う必要があろう。
involve multiple endpoints then this aspect of
multiplicity may also need to be taken into account.
The protocol should describe the schedule of interim
治験実施計画書には中間解析のスケジュール又は、例
analyses, or at least the considerations which will govern
えばアルファ消費関数を用いる柔軟な方法を予定して
its generation, for example if flexible alpha spending
いる様な場合には、少なくとも中間解析の方針につい
function approaches are to be employed; further details
て述べるべきである。より詳細な内容については最初
may be given in a protocol amendment before the time of
の中間解析時までに治験実施計画書の改訂に示しても
the first interim analysis. The stopping guidelines and
よい。中止の指針及びそれらの特質を治験実施計画書
their properties should be clearly described in the
又はその改訂に明確に述べるべきである。早期中止が
protocol or amendments. The potential effects of early
中止の判断に用いた変数以外の重要な変数に及ぼす影
stopping on the analysis of other important variables
響と、それらの変数の解析方法についても考慮すべき
should also be considered. This material should be
である。この内容は、独立データモニタリング委員会
written or approved by the Data Monitoring Committee
が存在する場合には、その委員会により記述されるか
(see Section 4.6), when the trial has one. Deviations from
又は承認されるべきである（4.6 節参照）。予定した
the planned procedure always bear the potential of
手順からの逸脱により常に試験結果の妥当性が損なわ
invalidating the trial results. If it becomes necessary to
れる可能性がある。試験に変更を加える必要性が生じ
make changes to the trial, any consequent changes to the
た場合、変更により必要となる統計手法の変更のすべ
statistical procedures should be specified in an
てを、できるだけ早い機会に治験実施計画書の改訂に
amendment to the protocol at the earliest opportunity,
明記すべきである。特にそのような変更が原因となる
especially discussing the impact on any analysis and
おそれがあるすべての解析及び推測への影響を議論し
inferences that such changes may cause. The procedures
ておかなければならない。選択した中間解析の方法が、
selected should always ensure that the overall
全体の第一種の過誤の確率を制御していることを保証
probability of type I error is controlled.
すべきである。
The execution of an interim analysis should be a
中間解析では、割付を明らかにしたデータと結果が必
completely confidential process because unblended data
要となりうるため、内容が全く漏れない手順により実
and results are potentially involved. All staff involved in
施しなければならない。試験に対するスタッフの態度
the conduct of the trial should remain blind to the results
の変化及び募集される患者の特徴の変化が起こり、試
of such analyses, because of the possibility that their
験治療の比較に偏りをもたらす原因となりかねないた
attitudes to the trial will be modified and cause changes
め、試験の実施に関係しているすべてのスタッフが、
in the characteristics of patients to be recruited or biases
中間解析の結果を知ることがないようにすべきであ
in treatment comparisons. This principle may be applied
る。この原則は、中間解析の実施に直接関係している
to all investigator staff and to staff employed by the
者を除く、治験責任医師に関係するスタッフ全員及び
sponsor except for those who are directly involved in the
治験依頼者に雇用されているスタッフにも適用される
execution of the interim analysis. Investigators should
といってよい。治験責任医師には、試験の継続若しく
only be informed about the decision to continue or to
は中止の決定、又は試験手順の変更の決定のみを知ら
discontinue the trial, or to implement modifications to
せるべきである。
trial procedures.
Most clinical trials intended to support the efficacy and
被験薬の有効性及び安全性を裏付けることを意図した
safety of an investigational product should proceed to full
臨床試験は、ほとんどの場合、予定した被験者数の集
completion of planned sample size accrual; trials should
積が完全に完了するまで継続すべきである。試験は、
be stopped early only for ethical reasons or if the power is
倫理的な理由又は検出力が容認できない場合に限り、
no longer acceptable. However, it is recognised that drug
早期に中止すべきである。しかし、医薬品開発計画に
development plans involve the need for sponsor access to
は、他の試験計画の立案など様々な理由から、比較の
comparative treatment data for a variety of reasons, such
ための試験治療データに治験依頼者がアクセスする必
as planning other trials. It is also recognised that only a
要性があることが認識されている。また、集積してい
subset of trials will involve the study of serious
く試験治療効果の比較に関して継続的なモニタリング
life-threatening outcomes or mortality which may need
が倫理的な理由から必要となるような、生命を脅かす
sequential monitoring of accruing comparative treatment
重篤な結果に関する研究又は死亡に関する研究がある
effects for ethical reasons. In either of these situations,
が、そのような試験は全体のごく一部であることも認
plans for interim statistical analysis should be in place in
識されている。どちらの状況でも中間統計解析の計画
the protocol or in protocol amendments prior to the
は、生じるおそれのある潜在的な統計的及び運営上の
unblinded access to comparative treatment data in order
偏りに対処するため、割付を明らかにして試験治療の
to deal with the potential statistical and operational bias
比較データにアクセスする前に、治験実施計画書又は
that may be introduced.
その改訂の適切な箇所に記載すべきである。
For many clinical trials of investigational products,
被験薬に関する臨床試験では多くの場合、特に公衆衛
especially those that have major public health
生上意義のある試験の場合は、有効性若しくは安全性
significance, the responsibility for monitoring
の一方、又はその両方の比較についてのモニタリング
comparisons of efficacy and/or safety outcomes should be
に対する責任は、外部の独立したグループに委ねられ
assigned to an external independent group, often called
るべきである。このグループは、独立データモニタリ
an Independent Data Monitoring Committee (IDMC), a
ング委員会、データ及び安全性モニタリング委員会、
Data and Safety Monitoring Board or a Data Monitoring
又はデータモニタリング委員会と呼ばれることがあ
Committee whose responsibilities should be clearly
り、その責任は明確にしておく必要がある。
described.
When a sponsor assumes the role of monitoring efficacy
治験依頼者が有効性又は安全性を比較するためのモニ
or safety comparisons and therefore has access to
タリングの役割を担い、割付を明らかにした情報への
unblinded comparative information, particular care
アクセス権を持つ場合、試験の完全性を維持し、情報
should be taken to protect the integrity of the trial and to
の共有を適切に管理し制限するために特別の注意が払
manage and limit appropriately the sharing of
われるべきである。治験依頼者は、内部モニタリング
information. The sponsor should assure and document
委員会が文書で書かれた標準業務手順書に従っている
that the internal monitoring committee has complied
こと及び意思決定を行った会議の議事録が中間解析の
with written standard operating procedures and that
結果とともに保持されていることを保証し記録に残す
minutes of decision making meetings including records of
べきである。
interim results are maintained.
Any interim analysis that is not planned appropriately
適切に計画されていない中間解析はすべて（試験の早
(with or without the consequences of stopping the trial
期中止の結論によらず）、試験の結果を損なう恐れが
early) may flaw the results of a trial and possibly weaken
あるとともに導いた結論の信憑性を低下させる可能性
confidence in the conclusions drawn. Therefore, such
がある。したがって、計画されていない中間解析は行
analyses should be avoided. If unplanned interim
うべきではない。もし予定外の中間解析を実施するな
analysis is conducted, the clinical study report should
らば、その解析の必要性及びどの程度割付を明らかに
explain why it was necessary, the degree to which
したかを総括報告書に説明すべきであり、生じるおそ
blindness had to be broken, provide an assessment of the
れのある偏りの大きさ及び結果の解釈への影響を評価
potential magnitude of bias introduced, and the impact
すべきである。
on the interpretation of the results.
4.6 Role of Independent Data Monitoring Committee
4.6 独立データモニタリング委員会の役割
(IDMC)
（ICH E6、1.25節及び5.52節参照）
(see Sections 1.25 and 5.52 of ICH E6)
An IDMC may be established by the sponsor to assess at
独立データモニタリング委員会は、臨床試験の進行状
intervals the progress of a clinical trial, safety data, and
況、安全性データ及び重要な有効性変数を何回かにわ
critical efficacy variables and recommend to the sponsor
たり評価するとともに治験依頼者に試験の継続、変更、
whether to continue, modify or terminate a trial. The
又は中止を勧告するために治験依頼者が設立できる。
IDMC should have written operating procedures and
独立データモニタリング委員会は、文書で書かれた業
maintain records of all its meetings, including interim
務手順書を持つべきであり、すべての会議の議事録と
results; these should be available for review when the
中間解析の結果を保持すべきである。また、これらは
trial is complete. The independence of the IDMC is
試験完了時には審査可能な状態にしておくべきであ
intended to control the sharing of important comparative
る。独立データモニタリング委員会の独立性は、比較
information and to protect the integrity of the clinical
を目的とした重要な情報の漏洩を管理すること及び臨
trial from adverse impact resulting from access to trial
床試験の情報にアクセスすることによる悪影響から試
information. The IDMC is a separate entity from an
験の完全性を守ることを目的としたものである。独立
Institutional Review Board (IRB) or an Independent
データモニタリング委員会は治験審査委員会又は倫理
Ethics Committee (IEC), and its composition should
委員会とは別の組織であり、統計学を含む適切な学識
include clinical trial scientists knowledgeable in the
を持った臨床試験の専門家から構成されるべきであ
appropriate disciplines including statistics. When there
る。独立データモニタリング委員会に治験依頼者を代
are sponsor representatives on the IDMC, their role
表する者が参加する場合、（例えば、主要な問題の採
should be clearly defined in the operating procedures of
決に参加できるかどうか等）その役割を委員会の業務
the committee (for example, covering whether or not they
手順書に明確に定めるべきである。委員会に参加した
can vote on key issues). Since these sponsor staff would
治験依頼者のスタッフは割付を明らかにした情報に対
have access to unblinded information, the procedures
しアクセス権を持つと考えられることから、業務手順
should also address the control of dissemination of
書には治験依頼者の組織に対し試験の中間結果の開示
interim trial results within the sponsor organisation.
を制限することについても述べるべきである。
V. DATA ANALYSIS CONSIDERATIONS
V. データ解析上で考慮すべきこと
5.1 Prespecification of the Analysis
5.1 解析の事前明記
When designing a clinical trial the principal features of
臨床試験の計画立案の際、データの最終統計解析の主
the eventual statistical analysis of the data should be
要な特徴は、治験実施計画書の統計の部に記述すべき
described in the statistical section of the protocol. This
である。統計の部には、主要変数について行うことと
section should include all the principal features of the
した検証的解析の主要な特徴のすべてと、予想される
proposed confirmatory analysis of the primary variable(s)
解析上の問題に対処する方法を含めるべきである。探
and the way in which anticipated analysis problems will
索的試験の場合には、統計の部の記述をより一般的な
be handled. In case of exploratory trials this section could
原則と方向性にとどめることができる。
describe more general principles and directions.
The statistical analysis plan (see Glossary) may be
統計解析計画書（用語集参照）は治験実施計画書完成
written as a separate document to be completed after
後に別の文書として作成することができる。統計解析
finalising the protocol. In this document, a more
計画書には、治験実施計画書に述べた解析の主要な特
technical and detailed elaboration of the principal
徴について、より技術的で詳細な記述を含めることが
features stated in the protocol may be included (see
できる（7.1 節参照）。統計解析計画書には、主要変
section 7.1). The plan may include detailed procedures
数、副次変数及びその他のデータに関する統計解析を
for executing the statistical analysis of the primary and
実行するための詳細な手順を記載してもよい。統計解
secondary variables and other data. The plan should be
析計画はデータの盲検下レヴュー（7.1 節で定義する）
reviewed and possibly updated as a result of the blind
の結果に基づいて再検討すべきであり、場合によって
review of the data (see 7.1 for definition) and should be
は更新し、割付を明らかにする前に完成させるべきで
finalised before breaking the blind. Formal records
ある。割付を明らかにした日付だけではなく、統計解
should be kept of when the statistical analysis plan was
析計画書を完成した日付も公式な記録として保持すべ
finalised as well as when the blind was subsequently
きである。
broken.
If the blind review suggests changes to the principal
盲検下レヴューから、治験実施計画書に述べた解析の
features stated in the protocol, these should be
主要な特徴となる事項の変更を提案しようとする場
documented in a protocol amendment. Otherwise, it will
合、変更の内容は治験実施計画書の改訂に記録すべき
suffice to update the statistical analysis plan with the
である。変更が主要事項でない場合は、盲検下レヴュ
considerations suggested from the blind review. Only
ーにより示唆された検討事項に基づいて統計解析計画
results from analyses envisaged in the protocol (including
書を更新すれば十分であろう。治験実施計画書（その
amendments) can be regarded as confirmatory.
改訂を含む）に予定された解析の結果のみが検証的で
あるとみなされる。
In the statistical section of the clinical study report the
総括報告書の統計の部には、一連の臨床試験の過程に
statistical methodology should be clearly described
おいて方法論に関する決定がいつなされたかも含め
including when in the clinical trial process methodology
て、統計的方法論を明確に記述すべきである（ICH E3
decisions were made (see ICH E3).
参照）。
5.2 Analysis Sets
5.2 解析対象集団
The set of subjects whose data are to be included in the
主要な解析でデータが用いられる被験者の集団は、治
main analyses should be defined in the statistical section
験実施計画書の統計の部で定義すべきである。更に、
of the protocol. In addition, documentation for all
試験の手続きが開始された（例えば、治験導入期の）
subjects for whom trial procedures (e.g. run-in period)
すべての被験者の記録は有用といってよい。この被験
were initiated may be useful. The content of this subject
者の記録の内容は、個々の試験の詳細に依存するが、
documentation depends on detailed features of the
少なくとも人口統計学的なデータ及び疾患の状態に関
particular trial, but at least demographic and baseline
する基準となる時点のデータは、可能な限り収集すべ
data on disease status should be collected whenever
きである。
possible.
If all subjects randomised into a clinical trial satisfied all
ランダム化が行われて臨床試験に参加したすべての被
entry criteria, followed all trial procedures perfectly with
験者が、すべての登録基準を満たし、追跡不能となる
no losses to follow-up, and provided complete data
こともなくすべての試験手順に従い、そしてデータの
records, then the set of subjects to be included in the
記録が完全であれば、解析に含める被験者の集団は自
analysis would be self-evident. The design and conduct of
明であろう。試験を計画し、実施するに当たっては、
a trial should aim to approach this ideal as closely as
この理想に可能な限り近づけることを目ざすべきであ
possible, but, in practice, it is doubtful if it can ever be
るが、実際には、これまでにこの理想が完全に達成さ
fully achieved. Hence, the statistical section of the
れたことがあるかどうかは疑わしい。したがって治験
protocol should address anticipated problems
実施計画書の統計の部には、予想される問題について、
prospectively in terms of how these affect the subjects
それらが被験者及び解析されるデータにどのように影
and data to be analysed. The protocol should also specify
響を与えるかについて前もって述べておくべきであ
procedures aimed at minimising any anticipated
る。更に治験実施計画書には、十分な解析を損なうと
irregularities in study conduct that might impair a
予想される研究実施中の変則的な事例のすべて(様々
satisfactory analysis, including various types of protocol
な種類の治験実施計画書違反、試験治療の中止、欠測
violations, withdrawals and missing values. The protocol
値を含む)を最小にするための手続きも明記すべきで
should consider ways both to reduce the frequency of
ある。治験実施計画書では、そのような問題の発生す
such problems, and also to handle the problems that do
る頻度を少なくするための方法と、データ解析中に起
occur in the analysis of data. Possible amendments to the
こる問題へ対処する方法とを考慮しておくべきであ
way in which the analysis will deal with protocol
る。治験実施計画書違反に対処する解析の方法を修正
violations should be identified during the blind review. It
する必要性があれば、盲検下レヴューの際に確認して
is desirable to identify any important protocol violation
おくべきである。すべての重要な治験実施計画書違反
with respect to the time when it occurred, its cause and
は、違反が起きた日時、違反が起きた理由及び試験結
influence on the trial result. The frequency and type of
果へ与える影響を明らかにすることが望ましい。治験
protocol violations, missing values, and other problems
実施計画書違反及び欠測値などの問題の発生頻度とそ
should be documented in the clinical study report and
の種類は総括報告書に記録すべきであり、それらが試
their potential influence on the trial results should be
験結果に対し、どのような影響を与える可能性がある
described (see ICH E3).
かを記述すべきである（ICH E3 参照）。
Decisions concerning the analysis set should be guided by
解析対象集団の選択に関する判断は、次の二つの原則
the following principles : 1) to minimise bias, and 2) to
に従うべきである: 1) 偏りを最小にすること、及び 2)
avoid inflation of type I error.
第一種の過誤の増大を回避すること。
5.2.1 Full Analysis Set
5.2.1 最大の解析対象集団
The intention-to-treat (see Glossary) principle implies
Intention-to-treat の原則（用語集参照）は、ランダム
that the primary analysis should include all randomised
化が行われた全被験者を主要な解析に含めるべきであ
subjects. Compliance with this principle would
ると主張するものである。この原則を遵守するには、
necessitate complete follow-up of all randomised subjects
臨床結果を得るためにランダム化が行われた全被験者
for study outcomes. In practice this ideal may be difficult
を完全に追跡することを必要とするであろう。しかし
to achieve, for reasons to be described. In this document
実際には、後に述べる理由からこの理想を達成するこ
the term 'full analysis set' is used to describe the analysis
とは難しいであろう。本ガイドラインでは「最大の解
set which is as complete as possible and as close as
析対象集団」という用語を用いて、ランダム化が行わ
possible to the intention-to-treat ideal of including all
れた全被験者を含める intention-to-treat という理想
randomised subjects. Preservation of the initial
に可能な限り完全に近づけた解析対象集団を表すこと
randomisation in analysis is important in preventing
にする。解析時にも最初のランダム化を維持すること
bias and in providing a secure foundation for statistical
は、偏りを防ぎ、統計的検定の強固な基盤を与える点
tests. In many clinical trials the use of the full analysis
で重要である。多くの臨床試験において、最大の解析
set provides a conservative strategy. Under many
対象集団を用いることは保守的な戦略となる。また多
circumstances it may also provide estimates of treatment
くの状況で、最大の解析対象集団により得られる試験
effects which are more likely to mirror those observed in
治療の効果の推定値は、後の日常診療での効果を反映
subsequent practice.
する可能性がより高いといってよい。
There are a limited number of circumstances that might
ランダム化が行われた被験者を最大の解析対象集団か
lead to excluding randomised subjects from the full
ら除外することになる状況は限られている。それらに
analysis set including the failure to satisfy major entry
は、主要な登録基準を満たしていない場合（適格基準
criteria (eligibility violations), the failure to take at least
違反）、試験治療を一回も受けていない場合、ランダ
one dose of trial medication and the lack of any data post
ム化後のデータがない場合などがある。そのような除
randomisation. Such exclusions should always be
外については常に理由を示すべきである。登録基準を
justified. Subjects who fail to satisfy an entry criterion
満たしていない被験者は、以下の条件下でのみ偏りを
may be excluded from the analysis without the possibility
導入する可能性なく除外できるであろう。
of introducing bias only under the following
circumstances:
(i) the entry criterion was measured prior to
(i) 登録基準はランダム化以前に評価されている
randomisation;
(ii) the detection of the relevant eligibility violations can
(ii) 除外の対象となる適格基準違反の発見は完全に客
be made completely objectively;
観的になされる
(iii) all subjects receive equal scrutiny for eligibility
(iii) すべての被験者が適格基準違反について同様の綿
violations; (This may be difficult to ensure in an
密さで調べられている（非盲検試験においてはこの保
open-label study, or even in a double-blind study if the
証は困難であり、二重盲検試験であっても割付を明ら
data are unblinded prior to this scrutiny, emphasising
かにした後では難しい。このことは盲検下レヴューの
the importance of the blind review.)
重要性を強調している。）
(iv) all detected violations of the particular entry
(iv) 特定の登録基準違反が発見された場合、それに関
criterion are excluded.
するすべての違反が除外される
In some situations, it may be reasonable to eliminate
ランダム化が行われた全被験者から試験治療を一度も
from the set of all randomised subjects any subject who
受けていない被験者を除くことが適切な場合もある。
took no trial medication. The intention-to-treat principle
これらの患者を除外しても、例えば試験治療を開始す
would be preserved despite the exclusion of these
るかどうかの判断が、割付けられた試験治療が何であ
patients provided, for example, that the decision of
るかに影響されない場合には、intention-to-treat の原
whether or not to begin treatment could not be influenced
則は保たれるであろう。ランダム化後のデータが全く
by knowledge of the assigned treatment. In other
ない被験者をランダム化が行われた全対象集団から除
situations it may be necessary to eliminate from the set
外しなければならない場合もある。これらの個々の除
of all randomised subjects any subject without data post
外から生じる可能性のある偏り又はその他の偏りを明
randomisation. No analysis is complete unless the
示しない限り、どのような解析も完全とはいえない。
potential biases arising from these specific exclusions, or
any others, are addressed.
When the full analysis set of subjects is used, violations
最大の解析対象集団を用いる場合、ランダム化後に起
of the protocol that occur after randomization may have
こる治験実施計画書違反は、特にそれらの発生が試験
an impact on the data and conclusions, particularly if
治療の割付と関連しているときに、データ及び結論に
their occurrence is related to treatment assignment. In
影響するおそれがある。ほとんどの場合、ランダム化
most respects it is appropriate to include the data from
後に違反があった被験者のデータを解析に含めること
such subjects in the analysis, consistent with the
が適切であり、intention-to-treat の原則とも一致す
intention-to-treat principle. Special problems arise in
る。試験治療を一度以上受けた後で試験治療を中止し、
connection with subjects withdrawn from treatment after
その後のデータがない被験者及び中止ではないが追跡
receiving one or more doses who provide no data after
不能となった被験者の扱いに関連して特別の問題が起
this point, and subjects otherwise lost to follow-up,
きる。なぜなら、これらの被験者を最大の解析対象集
because failure to include these subjects in the full
団に含めないことにより、この方法が大きく損なわれ
analysis set may seriously undermine the approach.
るおそれがあるからである。どのような理由にせよ被
Measurements of primary variables made at the time of
験者が追跡不能となった時点に測定された主要変数の
the loss to follow-up of a subject for any reason, or
測定値、又は追跡不能となった後に治験実施計画書に
subsequently collected in accordance with the intended
従って集められた主要変数の測定値は、この観点から
schedule of assessments in the protocol, are valuable in
重要である。主要変数を追跡不能となった後に収集す
this context; subsequent collection is especially
ることは、死亡又は重大な疾患の発生を主要変数とす
important in studies where the primary variable is
る研究では特に重要である。このようにしてデータを
mortality or serious morbidity. The intention to collect
収集するということは、治験実施計画書に記述すべき
data in this way should be described in the protocol.
である。最終観察値をそれ以降の値に外挿する方法か
Imputation techniques, ranging from the carrying
ら複雑な数理モデルの使用まで、欠測値を埋め合わせ
forward of the last observation to the use of complex
る試みとして様々な技法を用いることができる。最大
mathematical models, may also be used in an attempt to
の解析対象集団の被験者全てについて、主要変数の測
compensate for missing data. Other methods employed to
定値が利用できることを保証するために用いられる別
ensure the availability of measurements of primary
の方法では、被験者の臨床結果又はより単純化した臨
variables for every subject in the full analysis set may
床結果（例えば、成功か失敗か）について何らかの仮
require some assumptions about the subjects' outcomes
定が必要であろう。これらのうちどの戦略を用いる場
or a simpler choice of outcome (e.g. success / failure). The
合でも、治験実施計画書の統計の部にその理由ととも
use of any of these strategies should be described and
に記述し、数理モデルで用いる前提は明確に説明すべ
justified in the statistical section of the protocol and the
きである。対応する解析結果の安定性を示すことも重
assumptions underlying any mathematical models
要であり、用いる戦略自体が試験治療の効果の偏った
employed should be clearly explained. It is also
推定値を与える可能性がある場合には特に重要であ
important to demonstrate the robustness of the
る。
corresponding results of analysis especially when the
strategy in question could itself lead to biased estimates
of treatment effects.
Because of the unpredictability of some problems, it may
前もって予見できない問題もあるため、それらの変則
sometimes be preferable to defer detailed consideration of
的な事例を取り扱う方法に関する詳細な検討は、時期
the manner of dealing with irregularities until the blind
を遅らせて研究終了前に行うデータの盲検下レヴュー
review of the data at the end of the trial, and, if so, this
時に行った方がよい場合もあろう。そのような場合は、
should be stated in the protocol.
治験実施計画書に述べるべきである。
5.2.2 Per Protocol Set
5.2.2 治験実施計画書に適合した対象集団
The 'per protocol' set of subjects, sometimes described as
ときには「’valid case’(妥当例)」、「’the efficacy’
the 'valid cases', the 'efficacy' sample or the 'evaluable
sample(有効性サンプル)」、又は「’evaluable subjects’
subjects' sample, defines a subset of the subjects in the
sample(評価可能被験者サンプル)」と表される「治験
full analysis set who are more compliant with the
実施計画書に適合した」対象集団は、最大の解析対象
protocol and is characterised by criteria such as the
集団の被験者のうち治験実施計画書をより遵守してい
following:
る一部であり、以下のような基準から特徴づけられる。
(i) the completion of a certain pre-specified minimal
(i) 事前に定められた最低限の試験治療規定を完了し
exposure to the treatment regimen;
ていること
(ii) the availability of measurements of the primary
(ii) 主要変数の測定値が利用可能であること
variable(s);
(iii) the absence of any major protocol violations
(iii) 登録基準違反などの重大な治験実施計画書違反が
including the violation of entry criteria.
ないこと
The precise reasons for excluding subjects from the per
治験実施計画書に適合した対象集団から被験者を除外
protocol set should be fully defined and documented
する理由の詳細は、個々の試験の状況に合わせた適切
before breaking the blind in a manner appropriate to the
な方法により、割付を明らかにする前にすべて明確に
circumstances of the specific trial.
し、文書として記録すべきである。
The use of the per protocol set may maximise the
治験実施計画書に適合した対象集団を用いることは、
opportunity for a new treatment to show additional
解析において新しい試験治療が有効性を示す可能性を
efficacy in the analysis, and most closely reflects the
最も大きくするといってもよく、治験実施計画書の基
scientific model underlying the protocol. However, the
礎となっている科学的なモデルを最もよく反映するこ
corresponding test of the hypothesis and estimate of the
とになる。しかし、対応する仮説検定と試験治療の効
treatment effect may or may not be conservative
果の推定値は試験により保守的となる場合もならない
depending on the trial; the bias, which may be severe,
場合もある。治験実施計画書の遵守状況が試験治療及
arises from the fact that adherence to the study protocol
び臨床結果と関連している場合には、生じる偏りは重
may be related to treatment and outcome.
大なものとなるおそれがある。
The problems that lead to the exclusion of subjects to
治験実施計画書に適合した対象集団を作り出すために
create the per protocol set, and other protocol violations,
被験者を除外しなければならなくなるような問題及び
should be fully identified and summarised. Relevant
その他の治験実施計画書違反は、十分に把握し要約す
protocol violations may include errors in treatment
べきである。ここで言う治験実施計画書違反には、試
assignment, the use of excluded medication, poor
験治療の割付間違い、禁止薬剤の使用、低い服薬遵守
compliance, loss to follow-up and missing data. It is good
状況、追跡不能及び欠測値などがある。試験治療グル
practice to assess the pattern of such problems among
ープ間で、これらの問題の発生頻度及び発生時間のパ
the treatment groups with respect to frequency and time
ターンを評価することはよい対処法である。
to occurrence.
5.2.3 Roles of the Different Analysis Sets
5.2.3 二つの異なる解析対象集団の役割
In general, it is advantageous to demonstrate a lack of
一般に、解析に用いる被験者集団の選択の変更を行っ
sensitivity of the principal trial results to alternative
ても、主要な試験結果が変わらないことを示すことは
choices of the set of subjects analysed. In confirmatory
有益である。検証的試験では、最大の解析対象集団の
trials it is usually appropriate to plan to conduct both an
解析と治験実施計画書に適合した対象集団の解析との
analysis of the full analysis set and a per protocol
相違を明示的な議論と解釈の対象にできるよう、通常
analysis, so that any differences between them can be the
両方の解析を計画することが適切である。解析対象集
subject of explicit discussion and interpretation. In some
団の変更により結論がどの程度変わり易いかを、更に
cases, it may be desirable to plan further exploration of
探索する計画を立てた方が望ましい場合もあろう。最
the sensitivity of conclusions to the choice of the set of
大の解析対象集団の解析と治験実施計画書に適合した
subjects analysed. When the full analysis set and the per
対象集団の解析が本質的に同じ結論に達する場合、試
protocol set lead to essentially the same conclusions,
験結果の信用度は高くなる。しかしその場合でも、最
confidence in the trial results is increased, bearing in
大の解析対象集団の相当の割合を治験実施計画書に適
mind, however, that the need to exclude a substantial
合した解析で除かなければならない場合には、試験全
proportion of subjects from the per protocol analysis
体の妥当性が疑われることに注意する必要がある。
throws some doubt on the overall validity of the trial.
The full analysis set and the per protocol set play
最大の解析対象集団と治験実施計画書に適合した対象
different roles in superiority trials (which seek to show
集団は、（被験薬が優っていることを示そうとする）
the investigational product to be superior), and in
優越性試験と（被験薬が匹敵しうることを示そうとす
equivalence or non-inferiority trials (which seek to show
る、3.3.2 節参照）同等性試験又は非劣性試験とで異な
the investigational product to be comparable, see section
る役割を果たす。優越性試験では、最大の解析対象集
3.3.2). In superiority trials the full analysis set is used in
団から得られる有効性の推定値は、治験実施計画書に
the primary analysis (apart from exceptional
適合した解析ほど過度に楽観的ではないため、最大の
circumstances) because it tends to avoid over-optimistic
解析対象集団が（例外的な状況を除いて）主要な解析
estimates of efficacy resulting from a per protocol
に用いられる。なぜならば、最大の解析対象集団に含
analysis, since the noncompliers included in the full
まれる遵守状況の悪い被験者は、推定される試験治療
analysis set will generally diminish the estimated
の効果を一般に小さくするからである。しかし、同等
treatment effect. However, in an equivalence or
性試験又は非劣性試験では、最大の解析対象集団を使
non-inferiority trial use of the full analysis set is
用することは一般に保守的ではないため、その役割は
generally not conservative and its role should be
十分慎重に考慮すべきである。
considered very carefully.
5.3 Missing Values and Outliers
5.3 欠測値と外れ値
Missing values represent a potential source of bias in a
欠測値は、臨床試験において偏りを起こし得る代表的
clinical trial. Hence, every effort should be undertaken to
な原因である。したがって、データ収集及びデータマ
fulfil all the requirements of the protocol concerning the
ネジメントに関しては、治験実施計画書が要求する事
collection and management of data. In reality, however,
項について、すべてを満たすようあらゆる努力をすべ
there will almost always be some missing data. A trial
きである。しかし、実際には、ほとんど常に欠測値が
may be regarded as valid, nonetheless, provided the
いくつか存在するであろう。そうであったとしても、
methods of dealing with missing values are sensible, and
欠測値を扱う方法が適切で、特にそれらの方法が治験
particularly if those methods are pre-defined in the
実施計画書の解析計画中に事前に定められている場合
protocol. Definition of methods may be refined by
には、試験は妥当であるとみなすことができる。この
updating this aspect in the statistical analysis plan
観点から盲検下レヴューの間に統計解析計画書を見直
during the blind review. Unfortunately, no universally
すことにより、欠測値を扱う方法が洗練されるであろ
applicable methods of handling missing values can be
う。残念ながら、欠測値に対処する方法で、普遍的に
recommended. An investigation should be made
適用可能と薦められる方法はない。欠測値に対処する
concerning the sensitivity of the results of analysis to the
方法により解析結果がどの程度変わり易いかを、欠測
method of handling missing values, especially if the
の数が多い場合には特に、検討すべきである。
number of missing values is substantial.
A similar approach should be adopted to exploring the
同様の手法は、外れ値(その統計的定義はある程度恣意
influence of outliers, the statistical definition of which is,
的だが)の影響を探索するためにも用いるべきである。
to some extent, arbitrary. Clear identification of a
特定の値が外れ値であると明確に判断するのに最も説
particular value as an outlier is most convincing when
得力があるのは、統計的に正当であるだけではなく医
justified medically as well as statistically, and the
学的にも正当な場合であり、医学的な背景によりしば
medical context will then often define the appropriate
しば適切な対応が決定される例がみられる。治験実施
action. Any outlier procedure set out in the protocol or
計画書又は統計解析計画書に記載する外れ値対策は、
the statistical analysis plan should be such as not to
どのような場合でも先験的にどの試験治療グループの
favour any treatment group a priori. Once again, this
利益にもならないようにすべきである。この観点から、
aspect of the analysis can be usefully updated during
盲検下レヴュー中に再度解析を見直すことが有益であ
blind review. If no procedure for dealing with outliers
る。治験実施計画書中に外れ値に対処する方法が記載
was foreseen in the trial protocol, one analysis with the
されていない場合、実際の値を用いた解析のほかに、
actual values and at least one other analysis eliminating
外れ値の影響を除くか小さくする別の解析を少なくと
or reducing the outlier effect should be performed and
も一つ行うべきであり、それらの結果の間の差異を議
differences between their results discussed.
論すべきである。
5.4 Data Transformation
5.4 データ変換
The decision to transform key variables prior to analysis
重要な変数を変換するために必要な判断は解析の前に
is best made during the design of the trial on the basis of
行い、先行する臨床試験での類似データに基づいて治
similar data from earlier clinical trials. Transformations
験実施計画立案時に行うのが最善である。変換（例え
(e.g. square root, logarithm) should be specified in the
ば、平方根、対数）を行うことは、主要変数について
protocol and a rationale provided, especially for the
は特に治験実施計画書に明記すべきであり、その理論
primary variable(s). The general principles guiding the
的根拠を述べるべきである。統計手法の前提を満たす
use of transformations to ensure that the assumptions
ことを保証するための変換の一般的な原則は、標準的
underlying the statistical methods are met are to be
な教科書に書かれている。また、特定の変数について
found in standard texts; conventions for particular
の変換の慣例的方法は、多くの臨床領域別に開発され
variables have been developed in a number of specific
てきている。変数を変換するかどうか、変換するので
clinical areas. The decision on whether and how to
あればどのように変換するかという判断は、臨床的な
transform a variable should be influenced by the
解釈を容易にする尺度を選択するという観点も含めて
preference for a scale which facilitates clinical
行われるべきである。
interpretation. Similar considerations apply to other
derived variables, such as the use of change from
baseline, percentage change from baseline, the 'area
under the curve' of repeated measures, or the ratio of two
different variables. Subsequent clinical interpretation
should be carefully considered, and the derivation should
be justified in the protocol. Closely related points are
made in Section 2.2.2.
5.5 Estimation, Confidence Intervals and Hypothesis
Testing
5.5 推定、信頼区間及び仮説検定
The statistical section of the protocol should specify the
治験実施計画書の統計の部には、試験の主要な目的に
hypotheses that are to be tested and/or the treatment
対応した検定すべき仮説及び推定すべき試験治療の効
effects which are to be estimated in order to satisfy the
果のどちらか又は両方を明記すべきである。主要変数
primary objectives of the trial. The statistical methods to
については（望むべくは副次変数についても）、これ
be used to accomplish these tasks should be described for
らの目的を果たすための作業に用いる統計手法を記述
the primary (and preferably the secondary) variables,
すべきであり、基礎となる統計モデルを明確にすべき
and the underlying statistical model should be made
である。試験治療の効果の推定値は可能な限り信頼区
clear. Estimates of treatment effects should be
間とともに提示すべきものであることから、治験実施
accompanied by confidence intervals, whenever possible,
計画書の統計の部にはこれらを求める方法を示すべき
and the way in which these will be calculated should be
である。例えば共分散分析を用いて、精度の向上のた
identified. A description should be given of any intentions
め又は存在する可能性のある基準となる時点での差に
to use baseline data to improve precision or to adjust
ついて推定値を調整するために、基準となる時点での
estimates for potential baseline differences, for example
値を使用するいかなる目的に関しても説明を加えるべ
by means of analysis of covariance.
きである。
It is important to clarify whether one- or two-sided tests
片側検定を用いるか両側検定を用いるかを明確にする
of statistical significance will be used, and in particular
ことは重要であり、特に片側検定の使用については前
to justify prospectively the use of one-sided tests. If
もって理由づけることが重要である。仮説検定が適切
hypothesis tests are not considered appropriate, then the
ではないと考えられる場合には、統計的結論を導くた
alternative process for arriving at statistical conclusions
めの別な手続きを提示すべきである。推測を片側と考
should be given. The issue of one-sided or two-sided
えるか両側と考えるかには議論があり、統計学の文献
approaches to inference is controversial and a diversity of
にも様々な見解がみられる。承認申請のための試験で
views can be found in the statistical literature. The
は、片側検定の第一種の過誤を両側検定で慣例的に用
approach of setting type I errors for one-sided tests at
いている値の半分に設定する方法が好ましい。このよ
half the conventional type I error used in two-sided tests
うに設定することにより、試験治療間の差の大きさを
is preferable in regulatory settings. This promotes
推定するために通常用いられる両側信頼区間との整合
consistency with the two-sided confidence intervals that
性を図ることができる。
are generally appropriate for estimating the possible size
of the difference between two treatments.
The particular statistical model chosen should reflect the
選択する個別の統計モデルには、試験の計画段階での
current state of medical and statistical knowledge about
統計的な配慮を反映させると同時に、解析する変数に
the variables to be analysed as well as the statistical
関する現在の医学的及び統計的知識を反映させるべき
design of the trial. All effects to be fitted in the analysis
である。（例えば、分散分析モデルで）解析に含める
(for example in analysis of variance models) should be
効果のすべてを十分に明記すべきであり、予備的な結
fully specified, and the manner, if any, in which this set
果からこの効果の組を変更する可能性があるのであれ
of effects might be modified in response to preliminary
ば、そのやり方を説明すべきである。共分散分析に用
results should be explained. The same considerations
いる共変量の組に関しても同様の考慮が当てはまる
apply to the set of covariates fitted in an analysis of
（5.7 節も参照のこと）。統計的方法を選択する際に
covariance. (See also Section 5.7.). In the choice of
は、主要変数及び副次変数両方の統計的分布に対して
statistical methods due attention should be paid to the
十分な注意を払うべきである。（例えばパラメトリッ
statistical distribution of both primary and secondary
ク手法を用いるかノンパラメトリック手法を用いるか
variables. When making this choice (for example between
の）選択の際には、試験治療効果の大きさに関する統
parametric and non-parametric methods) it is important
計的推定値を、（有意性検定に加えて）信頼区間とと
to bear in mind the need to provide statistical estimates
もに示すことが必要だと心得ておくことが重要であ
of the size of treatment effects together with confidence
る。
intervals (in addition to significance tests).
The primary analysis of the primary variable should be
主要変数の主要な解析は、その裏付けとして行う主要
clearly distinguished from supporting analyses of the
変数又は副次変数の解析とは明確に区別すべきであ
primary or secondary variables. Within the statistical
る。治験実施計画書の統計の部又は統計解析計画書に
section of the protocol or the statistical analysis plan
は、主要変数及び副次変数以外のデータをどのように
there should also be an outline of the way in which data
要約し報告するかについての概要も記述すべきであ
other than the primary and secondary variables will be
る。その際、一連の試験にまたがる解析、例えば安全
summarised and reported. This should include a
性データの解析を一貫させるためにとったすべての手
reference to any approaches adopted for the purpose of
段についても言及すべきである。
achieving consistency of analysis across a range of trials,
for example for safety data.
Modelling approaches that incorporate information on
既知の薬理パラメータ、個々の被験者の治験実施計画
known pharmacological parameters, the extent of
書遵守の程度又はそのほかの生物学的な根拠に基づく
protocol compliance for individual subjects or other
データをモデル化する方法により、真の有効性又は潜
biologically based data may provide valuable insights
在的な有効性について、とりわけ試験治療の効果の推
into actual or potential efficacy, especially with regard to
定の点から、有益な理解が得られるであろう。そのよ
estimation of treatment effects. The assumptions
うなモデルの前提条件は常に明確にしておくべきであ
underlying such models should always be clearly
り、モデルから得られる結論の限界についても慎重に
identified, and the limitations of any conclusions should
記述すべきである。
be carefully described.
5.6 Adjustment of Significance and Confidence Levels
5.6 有意水準と信頼水準の調整
When multiplicity is present, the usual frequentist
多重性が存在する場合、臨床試験データの解析に対す
approach to the analysis of clinical trial data may
る通常の頻度論的立場からは、第一種の過誤の調整を
necessitate an adjustment to the type I error. Multiplicity
必要とするであろう。多重性は、例えば主要変数が複
may arise, for example, from multiple primary variables
数ある場合（2.2.2 節参照）、試験治療間の多重比較、
(see Section 2.2.2), multiple comparisons of treatments,
時間に伴う繰り返し評価、中間解析（4.5 節参照）か
repeated evaluation over time and/or interim analyses
ら生じるであろう。多重性を回避する又は減じる方法
(see Section 4.5). Methods to avoid or reduce multiplicity
は、それが利用できる場合には好ましいこともある。
are sometimes preferable when available, such as the
例えば、変数が複数ある場合に主要な変数の中でも重
identification of the key primary variable (multiple
要な変数を指定すること、多群比較の場合に試験治療
variables), the choice of a critical treatment contrast
間の重要な対比を選択すること、繰り返し測定の場合
(multiple comparisons), the use of a summary measure
に「曲線下面積」といった要約指標を使用すること、
such as ‘area under the curve’ (repeated measures). In
などである。検証的解析では、このようにして多重性
confirmatory analyses, any aspects of multiplicity which
を減じた後の段階でまだ残っている多重性のすべての
remain after steps of this kind have been taken should be
側面について治験実施計画書において明らかにすべき
identified in the protocol; adjustment should always be
である。調整は常に考慮すべきであり、調整方法の詳
considered and the details of any adjustment procedure
細、又はなぜ調整は必要ないと考えるのかという説明
or an explanation of why adjustment is not thought to be
は、統計解析計画書に述べるべきである。
necessary should be set out in the analysis plan.
5.7 Subgroups, Interactions and Covariates
5.7 部分集団、交互作用及び共変量
The primary variable(s) is often systematically related to
主要変数が試験治療によるものとは別の作用と、系統
other influences apart from treatment. For example,
的に関連している例がよくみられる。例えば、年齢や
there may be relationships to covariates such as age and
性といった共変量と関連がある場合、又は多施設共同
sex, or there may be differences between specific
治験では異なる施設で試験治療を受けた被験者という
subgroups of subjects such as those treated at the
部分集団間に差がみられる場合がある。共変量の影響
different centres of a multicentre trial. In some instances
又は部分集団の効果を調整することは予定した解析の
an adjustment for the influence of covariates or for
重要部分となる場合があり、したがって治験実施計画
subgroup effects is an integral part of the planned
書に記述すべきである。主要変数に重要な影響を及ぼ
analysis and hence should be set out in the protocol.
すと予想される共変量と要因は、試験開始前に議論し
Pre-trial deliberations should identify those covariates
て確認しておくべきであり、精度を向上させ、試験治
and factors expected to have an important influence on
療グループ間のバランスの欠如を埋め合わせるため、
the primary variable(s), and should consider how to
それらを解析でどう取り扱うかを考慮すべきである。
account for these in the analysis in order to improve
計画時に一つ以上の因子を層別因子として用いたなら
precision and to compensate for any lack of balance
ば、それらの因子を解析時に考慮することが適切であ
between treatment groups. If one or more factors are
る。調整することの潜在的な有益さが疑わしい場合に
used to stratify the design, it is appropriate to account for
はしばしば、調整しない解析に主要な関心を払うと宣
those factors in the analysis. When the potential value of
言し、調整した解析はそれを補うものとすることが薦
an adjustment is in doubt, it is often advisable to
められる。施設の効果及び基準となる時点での主要変
nominate the unadjusted analysis as the one for primary
数の測定値の役割には特別な注意を払うべきである。
attention, the adjusted analysis being supportive. Special
しかし、ランダム化後に測定された共変量を主要な解
attention should be paid to centre effects and to the role
析で調整することは薦められない。なぜならばそのよ
of baseline measurements of the primary variable. It is
うな共変量は試験治療の影響を受けている可能性があ
not advisable to adjust the main analyses for covariates
るからである。
measured after randomisation because they may be
affected by the treatments.
The treatment effect itself may also vary with subgroup
試験治療の効果自身もまた部分集団や共変量により異
or covariate - for example, the effect may decrease with
なる可能性がある。例えば、効果は年齢とともに減少
age or may be larger in a particular diagnostic category of
する場合があり、被験者の特別な診断分類では、より
subjects. In some cases such interactions are anticipated
大きな効果があることもある。そのような交互作用は
or are of particular prior interest (e.g. geriatrics), and
予想されたり、交互作用自体に重要な関心がある場合
hence a subgroup analysis, or a statistical model
（例えば高齢者医療など）があり、したがって部分集
including interactions, is part of the planned
団別解析又は交互作用を含んだ統計モデルは予定した
confirmatory analysis. In most cases, however, subgroup
検証的解析に含まれる場合がある。しかし、多くの場
or interaction analyses are exploratory and should be
合、部分集団別解析又は交互作用解析は探索的である
clearly identified as such; they should explore the
ため、探索的であることを明確に確認しておくべきで
uniformity of any treatment effects found overall. In
ある。それらの解析では、全体でみられた試験治療の
general, such analyses should proceed first through the
効果がすべて一様であるかどうかを探索すべきであ
addition of interaction terms to the statistical model in
る。一般に、そのような解析は、まず問題となってい
question, complemented by additional exploratory
る統計モデルに交互作用項を加えることから始めるべ
analysis within relevant subgroups of subjects, or within
きであり、適切な被験者の部分集団内、又は共変量に
strata defined by the covariates. When exploratory, these
よって定められる層内を更に探索的に解析することに
analyses should be interpreted cautiously; any conclusion
よって補われる。探索的解析である場合、これらの解
of treatment efficacy (or lack thereof) or safety based
析結果は慎重に解釈すべきである。試験治療の有効性
solely on exploratory subgroup analyses are unlikely to
（若しくは有効性がないこと）、又は安全性に関する
be accepted.
結論は、どのようなものであっても、探索的な部分集
団別解析のみに基づいていては受け入れ難い。
5.8 Integrity of Data and Computer Software Validity
5.8 データの完全性の維持とコンピュータソフトウェ
アの妥当性
The credibility of the numerical results of the analysis
数値で表された解析結果の信憑性は、データマネジメ
depends on the quality and validity of the methods and
ント（データ入力、保存、確認、訂正及び復元）及び
software (both internally and externally written) used
データの統計処理の両方に用いられる方法とソフトウ
both for data management (data entry, storage,
ェア（自作、外注、市販）の質及び妥当性に依存する
verification, correction and retrieval) and also for
ものである。したがって、データマネジメントは綿密
processing the data statistically. Data management
で効果的な標準業務手順書に基づいて行うべきであ
activities should therefore be based on thorough and
る。データマネジメント及び統計解析に用いるコンピ
effective standard operating procedures. The computer
ュータソフトウェアは信頼できるものを使用すべきで
software used for data management and statistical
あり、ソフトウエアの適切な検証手順を資料として用
analysis should be reliable, and documentation of
意すべきである。
appropriate software testing procedures should be
available.
VI. EVALUATION OF SAFETY AND TOLERABILITY
VI. 安全性及び忍容性評価
6.1 Scope of Evaluation
6.1 評価の範囲
In all clinical trials evaluation of safety and tolerability
すべての臨床試験において、安全性及び忍容性（用語
(see Glossary) constitutes an important element. In early
集参照）の評価は重要な要素である。初期の相では、
phases this evaluation is mostly of an exploratory nature,
この評価の大部分は探索的な性質のものであり、敏感
and is only sensitive to frank expressions of toxicity,
にとらえられるのは明らかな毒性の出現のみである。
whereas in later phases the establishment of the safety
しかし、後期の相では、被験薬の安全性及び忍容性の
and tolerability profile of a drug can be characterised
プロファイルを、より多くの被験者により十分に特徴
more fully in larger samples of subjects. Later phase
づけて確立することができる。後期の比較試験は、一
controlled trials represent an important means of
般にこの点での検出力を欠いているとしても、新たな
exploring in an unbiased manner any new potential
潜在的有害作用のすべてを偏りなく探索するための重
adverse effects, even if such trials generally lack power in
要な手段を提供するものである。
this respect.
Certain trials may be designed with the purpose of
ある種の試験は、他の医薬品又は被験薬の別な用量と
making specific claims about superiority or equivalence
比較して、安全性及び忍容性に関する優越性又は同等
with regard to safety and tolerability compared to
性についての具体的な主張のために計画される場合が
another drug or to another dose of the investigational
ある。このような承認に関わる具体的な主張は、対応
drug. Such specific claims should be supported by
する有効性の主張に対し要求されるのと同様に、検証
relevant evidence from confirmatory trials, similar to
的試験による適切な証拠によって確認されるべきであ
that necessary for corresponding efficacy claims.
る。
6.2 Choice of Variables and Data Collection
6.2 変数の選択とデータ収集
In any clinical trial the methods and measurements
どのような臨床試験でも、医薬品の安全性及び忍容性
chosen to evaluate the safety and tolerability of a drug
を評価するために選ばれる方法と測定値は、多くの要
will depend on a number of factors, including knowledge
因に依存する。その要因には、関連医薬品の有害作用
of the adverse effects of closely related drugs, information
についての知識、非臨床試験及び初期の臨床試験から
from non-clinical and earlier clinical trials and possible
の情報、個々の医薬品の薬力学的/薬物動態的特質から
consequences of the pharmacodynamic/pharmacokinetic
起こると考えられる結果、使用方法、研究対象となる
properties of the particular drug, the mode of
被験者の特徴並びに試験の期間といったものがある。
administration, the type of subjects to be studied, and the
臨床化学と血液学に関する臨床検査値、バイタルサイ
duration of the trial. Laboratory tests concerning clinical
ン及び臨床的有害事象（疾患、徴候及び症状）は、通
chemistry and haematology, vital signs, and clinical
常、安全性及び忍容性データの主要部を形成する。重
adverse events (diseases, signs and symptoms) usually
篤な有害事象の発生及び有害事象による試験治療の中
form the main body of the safety and tolerability data.
断については、登録することが特に重要である（ICH
The occurrence of serious adverse events and treatment
E2A と E3 参照）。
discontinuations due to adverse events are particularly
important to register (see ICH E2A and ICH E3).
Furthermore, it is recommended that a consistent
更に、異なる臨床試験からのデータを結びつけること
methodology be used for the data collection and
を容易にするために、試験プログラム全体を通して一
evaluation throughout a clinical trial program in order to
貫したデータ収集及び評価の方法論を用いることが薦
facilitate the combining of data from different trials. The
められる。共通の有害事象の辞書の使用は特に重要で
use of a common adverse event dictionary is particularly
ある。有害事象の辞書は、器官分類、基本語又は慣用
important. This dictionary has a structure which gives
語（用語集参照）という、三つの異なる水準で有害事
the possibility to summarise the adverse event data on
象データを要約できるように構成されている。有害事
three different levels; system-organ class, preferred term
象を要約する通常の水準は基本語であり、同一の器官
or included term (see Glossary). The preferred term is the
分類に属している基本語は、データの記述的提示の際
level on which adverse events usually are summarised,
にまとめることができる（ICH M1 参照）。
and preferred terms belonging to the same system-organ
class could then be brought together in the descriptive
presentation of data (see ICH M1).
6.3 Set of Subjects to be Evaluated and Presentation of
6.3 評価される被験者集団とデータの提示
Data
For the overall safety and tolerability assessment, the set
全体的な安全性及び忍容性を評価するのに用いられる
of subjects to be summarised is usually defined as those
被験者集団は、通常被験薬を少なくとも一回服用した
subjects who received at least one dose of the
被験者の集団である。安全性及び忍容性の変数は、こ
investigational drug. Safety and tolerability variables
れらの被験者から可能な限り包括的に有害事象の種
should be collected as comprehensively as possible from
類、重症度及び発現時と持続期間を含めて収集される
these subjects, including type of adverse event, severity,
べきである（ICH E2B 参照）。女性、高齢者（ICH E7
onset and duration (see ICH E2B). Additional safety and
参照）、重症者、又は共通の併用治療を受けた被験者
tolerability evaluations may be needed in specific
といった特定の属性別集団については、恐らく更なる
subpopulations, such as females, the elderly (see ICH
安全性及び忍容性評価が必要となるであろう。これら
E7), the severely ill, or those who have a common
の評価では、より個別の問題に答えることが必要であ
concomitant treatment. These evaluations may need to
ろう（ICH E3 参照）。
address more specific issues (see ICH E3).
All safety and tolerability variables will need attention
評価の際には、すべての安全性及び忍容性変数に注意
during evaluation, and the broad approach should be
を払う必要があるため、広範な方法を治験実施計画書
indicated in the protocol. All adverse events should be
に示すべきである。試験治療と関係していると考えら
reported, whether or not they are considered to be related
れるか否かにかかわらず、すべての有害事象を報告す
to treatment. All available data in the study population
べきである。評価の際には、研究対象集団の利用でき
should be accounted for in the evaluation. Definitions of
るデータのすべてを用いるべきである。測定単位と臨
measurement units and reference ranges of laboratory
床検査変数の参照範囲は注意深く定義すべきである。
variables should be made with care; if different units or
もし異なる単位又は異なる参照範囲を同一の試験で用
different reference ranges appear in the same trial (e.g. if
いるのであれば（例えば、二つ以上の検査機関が入っ
more than one laboratory is involved), then
ている場合）、統一的な評価を可能にするために測定
measurements should be appropriately standardised to
値を適切に標準化すべきである。毒性評価尺度の使用
allow a unified evaluation. Use of a toxicity grading scale
については、事前に定め、正当化しておくべきである。
should be prespecified and justified.
The incidence of a certain adverse event is usually
ある有害事象の発現は、通常有害事象を経験した被験
expressed in the form of a proportion relating number of
者数とその有害事象を発現する可能性のある被験者数
subjects experiencing events to number of subjects at
との関係を示す割合の形で表現される。しかし、発現
risk. However, it is not always self evident how to assess
の評価の仕方はいつも自明というわけではない。例え
incidence. For example, depending on the situation the
ば状況に応じて、試験治療が使用された被験者数、又
number of exposed subjects or the extent of exposure (in
は使用の程度（人年）を分母とすることが考えられる。
person-years) could be considered for the denominator.
計算の目的がリスクの推定であるか、試験治療グルー
Whether the purpose of the calculation is to estimate a
プ間での比較であるかにかかわらず、その定義を治験
risk or to make a comparison between treatment groups
実施計画書に示すことは重要である。この定義は、試
it is important that the definition is given in the protocol.
験治療が長期にわたることが予定され、かなりの割合
This is especially important if long-term treatment is
で試験治療の中止又は死亡が起こると予想される場
planned and a substantial proportion of treatment
合、特に重要である。そのような状況では、生存解析
withdrawals or deaths are expected. For such situations
の方法を考慮すべきであり、過小評価を避けるために
survival analysis methods should be considered and
累積有害事象発現率を計算すべきである。
cumulative adverse event rates calculated in order to
avoid the risk of underestimation.
In situations when there is a substantial background
徴候や症状に相当の背景ノイズが存在する状況では
noise of signs and symptoms (e.g. in psychiatric trials)
（例えば、精神科での試験）、異なる有害事象に対す
one should consider ways of accounting for this in the
るリスクの推定に背景ノイズを考慮する方法を考える
estimation of risk for different adverse events. One such
べきである。そのような方法の一つは、「試験治療下
method is to make use of the 'treatment emergent' (see
での発現」（用語集参照）という概念を用いることで
Glossary) concept in which adverse events are recorded
ある。「試験治療下での発現」では試験治療前の基準
only if they emerge or worsen relative to pretreatment
となる発現状況と比べて、新たに発現又は悪化した有
baseline.
害事象のみを記録する。
Other methods to reduce the effect of the background
軽度の有害事象は無視する、又は分子に加えるための
noise may also be appropriate such as ignoring adverse
基準として、繰り返しの来院で事象が観察し続けられ
events of mild severity or requiring that an event should
ることを要求するような背景ノイズの効果を減らすた
have been observed at repeated visits to qualify for
めの別の方法もまた適切な場合がある。そのような方
inclusion in the numerator. Such methods should be
法は治験実施計画書に正当性を説明しておくべきであ
explained and justified in the protocol.
る。
6.4 Statistical Evaluation
6.4 統計的評価
The investigation of safety and tolerability is a
安全性及び忍容性の研究は多次元的な問題である。ど
multidimensional problem. Although some specific
のような被験薬についても、何らかの特定の有害作用
adverse effects can usually be anticipated and specifically
は通常予測でき特定してモニターできるが、起こりう
monitored for any drug, the range of possible adverse
る有害作用の幅はたいへん広く、新しく、予想もされ
effects is very large, and new and unforeseeable effects
ない作用が常に生じうる。更に、禁止薬の使用のよう
are always possible. Further, an adverse event
な治験実施計画書違反の後で発生した有害事象は恐ら
experienced after a protocol violation, such as use of an
く偏りの原因となるであろう。このような背景がある
excluded medication, may introduce a bias. This
ことが、被験薬の安全性及び忍容性の解析的評価が統
background underlies the statistical difficulties
計的に困難となる原因となり、検証的試験から結論を
associated with the analytical evaluation of safety and
確定するような情報を得ることをむしろ例外としてい
tolerability of drugs, and means that conclusive
る。
information from confirmatory clinical trials is the
exception rather than the rule.
In most trials the safety and tolerability implications are
ほとんどの試験で、安全性及び忍容性関連事項を扱う
best addressed by applying descriptive statistical
には、記述統計の手法でデータを整理し、信頼区間が
methods to the data, supplemented by calculation of
解釈の助けとなる場合にはその計算を加えることが最
confidence intervals wherever this aids interpretation. It
善である。試験治療グループ内と被験者個人内両方で
is also valuable to make use of graphical presentations in
有害事象のパターンが示されるようなグラフ表示を利
which patterns of adverse events are displayed both
用することもまた有益である。
within treatment groups and within subjects.
The calculation of p-values is sometimes useful either as
ｐ値の計算は、関心のある特定の差を評価する補助と
an aid to evaluating a specific difference of interest, or as
して、又は多数の安全性及び忍容性変数に対して注目
a 'flagging' device applied to a large number of safety and
するだけの価値のある差を際立たせるための目印とし
tolerability variables to highlight differences worth
て、有用な場合がある。これは検査データに特に有用
further attention. This is particularly useful for
であり、この方法以外で検査データを適切に要約する
laboratory data, which otherwise can be difficult to
ことは難しい。検査データには、例えば試験治療ごと
summarise appropriately. It is recommended that
の平均の評価のような定量的な解析と、ある閾値を超
laboratory data be subjected to both a quantitative
える又は下回る数を数える定性的な解析の両方を行う
analysis, e.g. evaluation of treatment means, and a
ことが薦められる。
qualitative analysis where counting of numbers above or
below certain thresholds are calculated.
If hypothesis tests are used, statistical adjustments for
仮説検定を用いる場合、第一種の過誤を勘案して多重
multiplicity to quantify the type I error are appropriate,
性を統計的に調整することは適切ではあるが、通常は
but the type II error is usually of more concern. Care
第二種の過誤により注意を払うべきである。多重性の
should be taken when interpreting putative statistically
調整を行っていない場合、統計的に有意となった結果
significant findings when there is no multiplicity
の解釈には注意すべきである。
adjustment.
In the majority of trials investigators are seeking to
大多数の試験で、治験責任医師たちは、実対照薬又は
establish that there are no clinically unacceptable
プラセボに比べて安全性及び忍容性に関して臨床的に
differences in safety and tolerability compared with
許容できない差はないことを立証しようとしている。
either a comparator drug or a placebo. As is the case for
有効性に関する非劣性又は同等性評価の場合と同様
non-inferiority or equivalence evaluation of efficacy the
に、この状況では仮説検定よりも信頼区間を使用する
use of confidence intervals is preferred to hypothesis
ことが望ましい。信頼区間を用いると、生起数が少な
testing in this situation. In this way, the considerable
いことが原因となってみられることの多い、はなはだ
imprecision often arising from low frequencies of
しい精度の低さを明確に示すことができる。
occurrence is clearly demonstrated.
6.5 Integrated Summary
6.5 統合した要約
The safety and tolerability properties of a drug are
被験薬の安全性及び忍容性に関する特質は、一般に被
commonly summarised across trials continuously during
験薬を開発する過程で逐次的に、複数の試験を通して
an investigational product’s development and in
要約され、特に承認申請時には必ず要約されるもので
particular at the time of a marketing application. The
ある。しかし、この要約の有用性は高い質のデータを
usefulness of this summary, however, is dependent on
伴い適切に計画・実施された個々の比較試験に依存す
adequate and well-controlled individual trials with high
る。
data quality.
The overall usefulness of a drug is always a question of
被験薬の全体的な有用性は、常にリスクと利益のバラ
balance between risk and benefit and in a single trial
ンスの問題であり、リスクと利益の評価は通常全臨床
such a perspective could also be considered, even if the
試験プログラムを要約して行われるものであるが、単
assessment of risk/benefit usually is performed in the
一の試験でも、有用性が見込まれるかについて検討す
summary of the entire clinical trial program. (See section
ることは可能である（7.2.2 節参照）。
7.2.2)
For more details on the reporting of safety and
安全性及び忍容性に関連する報告の要求事項について
tolerability, see Chapter 12 of ICH E3.
の詳細は、ICH E3 12 章を参照すること。
VII. REPORTING
VII. 報告
7.1 Evaluation and Reporting
7.1 評価と報告
As stated in the Introduction, the structure and content
「I. はじめに」で述べたように、総括報告書の構成と
of clinical study reports is the subject of ICH E3. That
内容は、ICH E3 での主題である。ICH ガイドライン
ICH guidance fully covers the reporting of statistical
E3 は、臨床とその他の資料を適切に統合する統計作業
work, appropriately integrated with clinical and other
の報告について十分に網羅している。したがって、こ
material. The current section is therefore relatively brief.
の節は比較的簡潔にとどめる。
During the planning phase of a trial the principal
試験の計画段階では、解析の主要な特徴は 5 節に述べ
features of the analysis should have been specified in the
たように治験実施計画書に明記すべきである。試験の
protocol as described in Section 5. When the conduct of
実施が終了し、データが集積されて予備的な点検がで
the trial is over and the data are assembled and available
きるようになると、やはり 5 節で述べたように、予定
for preliminary inspection, it is valuable to carry out the
した解析の盲検下レヴューを実施することは有益であ
blind review of the planned analysis also described in
る。この解析前に行う検討では、試験治療を盲検化し
Section 5. This pre-analysis review, blinded to treatment,
た状態で、例えば被験者又はデータを解析対象集団か
should cover decisions concerning, for example, the
ら除外することに関する判断を行うべきである。また、
exclusion of subjects or data from the analysis sets;
変数変換の可能性の検討と外れ値の定義、最近の研究
possible transformations may also be checked, and
で明らかになった重要な共変量をモデルに加えるこ
outliers defined; important covariates identified in other
と、パラメトリック手法を用いるかノンパラメトリッ
recent research may be added to the model; the use of
ク手法を用いるか等について再検討して差し支えな
parametric or non-parametric methods may be
い。この時点で下された判断は、報告書に記述される
reconsidered. Decisions made at this time should be
べきである。一般に盲検下での判断は偏りをもたらす
described in the report, and should be distinguished from
可能性が小さいので、統計家が試験治療のコードを知
those made after the statistician has had access to the
った後での判断と区別しておくべきである。割付を明
treatment codes, as blind decisions will generally
らかにして行った中間解析に従事した統計家及びその
introduce less potential for bias. Statisticians or other
他のスタッフは、盲検下レヴュー又は統計解析計画の
staff involved in unblinded interim analysis should not
変更に参加すべきではない。また試験治療に由来する
participate in the blind review or in making
効果がデータ上明らかなことによって盲検が破れる可
modifications to the statistical analysis plan. When the
能性がある場合、盲検下レヴューには特別の注意を必
blinding is compromised by the possibility that treatment
要とするであろう。
induced effects may be apparent in the data, special care
will be needed for the blind review.
Many of the more detailed aspects of presentation and
提示と作表のより詳細な内容の多くは、盲検下レヴュ
tabulation should be finalised at or about the time of the
ー時、又はその前後までに固定すべきである。その結
blind review so that by the time of the actual analysis full
果、実際の解析時には、被験者の選択、データ選択と
plans exist for all its aspects including subject selection,
変換、データの要約と作表、推定と仮説検定を含むす
data selection and modification, data summary and
べての解析内容についての完全な解析計画が存在する
tabulation, estimation and hypothesis testing. Once data
ことになる。データの妥当性の立証が完了したら、解
validation is complete, the analysis should proceed
析は事前に定められた計画に従って進めるべきであ
according to the pre-defined plans; the more these plans
る。事前の計画が守られればそれだけ結果の信憑性は
are adhered to, the greater the credibility of the results.
増すことになる。治験実施計画書、治験実施計画書の
Particular attention should be paid to any differences
改訂、又はデータの盲検下レヴューに基づいて更新さ
between the planned analysis and the actual analysis as
れた統計解析計画に記述されている、予定した解析と
described in the protocol, protocol amendments or the
実際の解析の間のどんな差にも特別な注意を払うべき
updated statistical analysis plan based on a blind review
である。予定した解析からの逸脱に対しては、慎重な
of data. A careful explanation should be provided for
説明を行うべきである。
deviations from the planned analysis.
All subjects who entered the trial should be accounted for
試験に登録されたすべての被験者は、解析に含まれて
in the report, whether or not they are included in the
いるかどうかにかかわらず、報告書で明らかにすべき
analysis. All reasons for exclusion from analysis should
である。解析から除外した理由はすべて記録すべきで
be documented; for any subject included in the full
ある。最大の解析対象集団には含まれるが治験実施計
analysis set but not in the per protocol set, the reasons
画書に適合した対象集団には含まれない被験者につい
for exclusion from the latter should also be documented.
ては、治験実施計画書に適合した対象集団から除外さ
Similarly, for all subjects included in an analysis set, the
れた理由も記録すべきである。同様にして、ある解析
measurements of all important variables should be
対象集団に含まれる被験者すべてについて、すべての
accounted for at all relevant time-points.
重要な変数のすべての適切な時点での測定値も明らか
にしておくべきである。
The effect of all losses of subjects or data, withdrawals
被験者又はデータの減失、試験治療の中止及び重大な
from treatment and major protocol violations on the main
治験実施計画書違反が主要変数の主な解析に及ぼす影
analyses of the primary variable(s) should be considered
響について、慎重に考慮すべきである。追跡不能、試
carefully. Subjects lost to follow up, withdrawn from
験治療の中止、又は重大な治験実施計画書違反があっ
treatment, or with a severe protocol violation should be
た被験者は明らかにすべきであり、減失の理由、減失
identified, and a descriptive analysis of them provided,
と試験治療及び結果との関係を含めた、記述的解析を
including the reasons for their loss and its relationship to
行うべきである。
treatment and outcome.
Descriptive statistics form an indispensable part of
記述統計は報告書に欠くことのできない部分である。
reports. Suitable tables and/or graphical presentations
適切な表若しくはグラフ表示、又はその両方により、
should illustrate clearly the important features of the
主要変数及び副次変数並びに主な予後変数及び人口統
primary and secondary variables and of key prognostic
計学的変数の重要な特徴を明確に説明すべきである。
and demographic variables. The results of the main
試験の目的に関連する主な解析結果には、特別に慎重
analyses relating to the objectives of the trial should be
な記述的報告を行うべきである。有意性検定の結果を
the subject of particularly careful descriptive
報告する際には、限界値を越えたかどうかではなく、
presentation. When reporting the results of significance
正確な p 値（例えば「p=0.034」）を報告すべきであ
tests, precise p-values (e.g.'p=0.034') should be reported
る。
rather than making exclusive reference to critical values.
Although the primary goal of the analysis of a clinical
臨床試験の解析の主たる目標は、その主目的から生じ
trial should be to answer the questions posed by its main
た問題に答えることであるべきだが、割付を明らかに
objectives, new questions based on the observed data
した後の解析で、観察されたデータに基づいた新たな
may well emerge during the unblended analysis.
問題が生じるおそれがある。追加解析、そして恐らく
Additional and perhaps complex statistical analysis may
複雑な統計解析がその結果として必要になるであろ
be the consequence. This additional work should be
う。この追加解析は、治験実施計画書に予定していた
strictly distinguished in the report from work which was
解析の報告とは厳密に区別すべきである。
planned in the protocol.
The play of chance may lead to unforeseen imbalances
偶然により、多少とも予後に重要な意味を持つが、共
between the treatment groups in terms of baseline
変量として事前に定めていなかった基準となる時点の
measurements not pre-defined as covariates in the
測定値に関して、試験治療グループ間の予期しないバ
planned analysis but having some prognostic importance
ランスのくずれが起こる可能性がある。このバランス
nevertheless. This is best dealt with by showing that an
のくずれに対処するには、それを考慮する解析を追加
additional analysis which accounts for these imbalances
し、予定した解析と本質的に同じ結論に達することを
reaches essentially the same conclusions as the planned
示すことが、最善である。しかし両者が同じ結論に達
analysis. If this is not the case, the effect of the
しない場合には、バランスのくずれが結論に与える影
imbalances on the conclusions should be discussed.
響を議論すべきである。
In general, sparing use should be made of unplanned
一般に、予定していない解析は最小限にとどめるべき
analyses. Such analyses are often carried out when it is
である。予定外の解析は、試験治療の効果が何か別の
thought that the treatment effect may vary according to
要因に応じて変化する可能性があると考えられる場合
some other factor or factors. An attempt may then be
に実施される例がよくみられる。その場合、被験者集
made to identify subgroups of subjects for whom the
団の中で試験治療効果が特に有益な部分集団を明らか
effect is particularly beneficial. The potential dangers of
にする試みがなされるであろう。よく知られているよ
over-interpretation of unplanned subgroup analyses are
うに、予定していない部分集団別解析を拡大解釈する
well known (see also Section 5.7), and should be carefully
ことは潜在的に危険なことであり（5.7 節を参照）、
avoided. Although similar problems of interpretation
慎重に避けるべきである。試験治療が被験者のある部
arise if a treatment appears to have no benefit, or an
分集団に何の利益もないと思われる場合、又は有害作
adverse effect, in a subgroup of subjects, such
用があるように思われる場合にも同様の問題が生じる
possibilities should be properly assessed and should
が、それらの可能性について正しい評価を行うべきで
therefore be reported.
あり、したがって常に報告すべきである。
Finally statistical judgement should be brought to bear
最後に、臨床試験の結果の解析、解釈及び提示には統
on the analysis, interpretation and presentation of the
計的判断が下されるべきである。このためには、試験
results of a clinical trial. To this end the trial statistician
統計家は総括報告書に責任を持つチームの一員である
should be a member of the team responsible for the
べきであり、総括報告書を承認する存在であるべきで
clinical study report, and should approve the clinical
ある。
report.
7.2 Summarising the Clinical Database
7.2 臨床データベースの要約
An overall summary and synthesis of the evidence on
報告されているすべての臨床試験での安全性及び有効
safety and efficacy from all the reported clinical trials is
性の証拠を、全体的に要約して総合することは、承認
required for a marketing application (Expert report in
申請の要求事項である（EU での「専門家報告書
EU, integrated summary reports in USA, Gaiyo in
(Expert Report)」、米国での「総合要約報告書
Japan). This may be accompanied, when appropriate, by
(Integrated Summary Reports)」、日本での「資料概
a statistical combination of results.
要」）。これは、適切であれば、結果の統計的結合を
伴ってもよい。
Within the summary a number of areas of specific
要約の中には、以下のような多くの領域において特定
statistical interest arise: describing the demography and
の統計的興味が発生する。一連の臨床試験プログラム
clinical features of the population treated during the
において試験治療を受けた集団の人口統計学的及び臨
course of the clinical trial programme; addressing the key
床的特徴を記述すること。適切な（通常、比較を伴う）
questions of efficacy by considering the results of the
試験の結果を考慮にいれて有効性に関する主要な問題
relevant (usually controlled) trials and highlighting the
に答えること及び治験が相互に補強しあう、又は矛盾
degree to which they reinforce or contradict each other;
しあう程度を際立たせること。承認申請の根拠となる
summarising the safety information available from the
すべての治験を結合したデータベースから利用できる
combined database of all the trials whose results
安全性情報をまとめること及び安全性に関する潜在的
contribute to the marketing application and identifying
な問題を確認すること。臨床プログラムの設計段階で
potential safety issues. During the design of a clinical
は、特に試験を通じてよく結合される測定値に対して、
programme careful attention should be paid to the
後になって一連の試験の解釈が容易になるように測定
uniform definition and collection of measurements which
値を統一して定義し収集するよう、慎重な注意を払う
will facilitate subsequent interpretation of the series of
べきである。薬物治療、医療歴及び有害事象の詳細を
trials, particularly if they are likely to be combined
記録するために、共通の辞書を選んで使用すべきであ
across trials. A common dictionary for recording the
る。主要変数と副次変数を共通して定義することはほ
details of medication, medical history and adverse events
とんど常に有益であり、メタアナリシスのためには必
should be selected and used. A common definition of the
須である。主要な有効性変数の測定方法、ランダム化
primary and secondary variables is nearly always
時/登録時から評価を行うまでの期間、治験実施計画書
worthwhile, and essential for meta-analysis. The manner
違反例と逸脱例の取り扱い及び恐らく予後因子の定義
of measuring key efficacy variables, the timing of
についても、そのようにしない妥当な理由がない限り、
assessments relative to randomisation/entry, the
矛盾なく保つべきである。
handling of protocol violators and deviators and perhaps
the definition of prognostic factors, should all be kept
compatible unless there are valid reasons not to do so.
Any statistical procedures used to combine data across
いくつかの試験を通してデータを結合するために用い
trials should be described in detail. Attention should be
た統計手法はすべて、詳細に記述
paid to the possibility of bias associated with the
すべきである。結合の際には、試験を選択することに
selection of trials, to the homogeneity of their results, and
伴う偏りの可能性、結果の一様性及
to the proper modelling of the various sources of
びばらつきの様々な原因を適切にモデル化することに
variation. The sensitivity of conclusions to the
注意を払うべきである。また、用いた仮定及び試験の
assumptions and selections made should be explored.
選択に対して結論がどの程度変わり易いかを探索すべ
きである。
7.2.1 Efficacy Data
7.2.1 有効性データ
Individual clinical trials should always be large enough
一つ一つの臨床試験は、常にその目的を果たせるだけ
to satisfy their objectives. Additional valuable
の十分な規模で行うべきである。本質的に同一である
information may also be gained by summarising a series
ような主要な有効性の問題を扱っている一連の臨床試
of clinical trials which address essentially identical key
験を要約することで、更なる有益な情報が得られるで
efficacy questions. The main results of such a set of trials
あろう。そのような一連の試験の主たる結果は、通常
should be presented in an identical form to permit
は推定値と信頼限界を中心とする表又は図として、比
comparison, usually in tables or graphs which focus on
較可能な同一形式で提示すべきである。これらの推定
estimates plus confidence limits. The use of
値を結合するためのメタアナリシス技法の使用が有用
meta-analytic techniques to combine these estimates is
な補足となる例がよくみられる。なぜならば、メタア
often a useful addition, because it allows a more precise
ナリシス技法は試験治療効果の大きさに関するより精
overall estimate of the size of the treatment effects to be
度の高い包括的な推定値を与え、試験の結果の完備し
generated, and provides a complete and concise summary
た簡潔な要約を与えるからである。例外的な状況下で
of the results of the trials. Under exceptional
はあるが、全体的な仮説検定を行うメタアナリシス手
circumstances a meta analytic approach may also be the
法が、有効性の十分な全体的証拠を与える最も適切な
most appropriate way, or the only way, of providing
方法、又は唯一の方法となる場合もある。このような
sufficient overall evidence of efficacy via an overall
目的でメタアナリシス技法を用いる場合には、メタア
hypothesis test. When used for this purpose the
ナリシスを実施するための研究計画を前もって準備す
meta-analysis should have its own prospectively written
べきである。
protocol.
7.2.2 Safety Data
7.2.2 安全性データ
In summarising safety data it is important to examine
安全性データの要約では、潜在的な毒性を示すいかな
the safety database thoroughly for any indications of
る徴候に対しても徹底的に安全性データベースを調べ
potential toxicity, and to follow up any indications by
ることが重要であり、裏づけるパターンを探索してそ
looking for an associated supportive pattern of
の徴候を追跡することが重要である。医薬品のあらゆ
observations. The combination of the safety data from all
る人体曝露に関する安全性データを結合することが、
human exposure to the drug provides an important
重要な情報源となる。なぜならば、そのようにして被
source of information, because its larger sample size
験者数が多くなることによりまれな有害事象を検出
provides the best chance of detecting the rarer adverse
し、恐らく有害事象のおおよその発現数を推定する可
events and, perhaps, of estimating their approximate
能性が最も高くなるからである。しかし、このような
incidence. However, incidence data from this database
データベースからの有害事象発現データは、比較する
are difficult to evaluate because of the lack of a
グループを欠いていることから評価が困難であり、こ
comparator group, and data from comparative trials are
の困難さを克服するためには比較試験からのデータが
especially valuable in overcoming this difficulty. The
特に有益である。それぞれの対照薬について十分なデ
results from trials which use a common comparator
ータを提供するため、共通の対照薬（プラセボ又は特
(placebo or specific active comparator) should be
定の実対照薬）を用いている試験の結果は結合し、別々
combined and presented separately for each comparator
に提示すべきである。
providing sufficient data.
All indications of potential toxicity arising from
データの探索から判明した毒性を持つ可能性を示す徴
exploration of the data should be reported. The
候はすべて報告すべきである。これらの潜在的有害作
evaluation of the reality of these potential adverse effects
用がどれだけ現実に起こりうるかの評価には、多数の
should take account of the issue of multiplicity arising
比較の実施によって生じる多重性の問題を考慮すべき
from the numerous comparisons made. The evaluation
である。評価には、有害事象の発生に曝露期間若しく
should also make appropriate use of survival analysis
は追跡期間又はその両方が潜在的に関連しているかど
methods to exploit the potential relationship of the
うかを探索するため、生存解析手法を適切に使用すべ
incidence of adverse events to duration of exposure
きである。確認された有害作用に関連するリスクは、
and/or follow-up. The risks associated with identified
リスクと利益の関係を正しく評価するために適切に定
adverse effects should be appropriately quantified to
量化すべきである。
allow a proper assessment of the risk/benefit
relationship.
GLOSSARY
用語集
Bayesian Approaches
ベイズ流の方法
Approaches to data analysis that provide a posterior
一部のパラメータ（例えば、試験治療の効果）につい
probability distribution for some parameter (e.g.
て、観察されたデータとパラメータの事前確率分布か
treatment effect), derived from the observed data and a
ら事後確率分布を与えるデータ解析の方法。得られた
prior probability distribution for the parameter. The
事後分布を、統計的推測の基礎とする。
posterior distribution is then used as the basis for
statistical inference.
Bias (Statistical & Operational)
偏り（統計的及び運営上の）
The systematic tendency of any factors associated with
臨床試験の計画、実施、解析及び結果の評価と関連し
the design, conduct, analysis and evaluation of the
た因子の影響により、試験治療の効果の推定値と真の
results of a clinical trial to make the estimate of a
値に系統的な差が生じること。試験実施中の逸脱によ
treatment effect deviate from its true value. Bias
って起こる偏りを「運営上の」偏りとよび、それ以外
introduced through deviations in conduct is referred to as
の原因によって起こる偏りを「統計的」偏りとよぶ。
'operational' bias. The other sources of bias listed above
are referred to as 'statistical'.
Blind Review
盲検下レヴュー
The checking and assessment of data during the period of
試験完了（最後の被験者の最終観察）から割付を明ら
time between trial completion (the last observation on
かにするまでの間に予定した解析を固定する目的で実
the last subject) and the breaking of the blind, for the
施するデータの評価と点検。
purpose of finalising the planned analysis.
Content Validity
内容的妥当性
The extent to which a variable (e.g. a rating scale)
ある変数（例えば、評価尺度）で測定しようとしてい
measures what it is supposed to measure.
るものを、どれだけ間違いなく測定しているか、その
程度。
Double-Dummy
ダブルダミー
A technique for retaining the blind when administering
臨床試験で二つの試験治療の区別がつく場合に、医薬
supplies in a clinical trial, when the two treatments
品投与時の盲検を維持する技法。試験治療 A について、
cannot be made identical. Supplies are prepared for
実際の製剤と共にそれと区別不能なプラセボを用意
Treatment A (active and indistinguishable placebo) and
し、また試験治療 B についても、実際の製剤と共にそ
for Treatment B (active and indistinguishable placebo).
れと区別不能なプラセボを用意する。被験者は二組の
Subjects then take two sets of treatment; either A (active)
試験治療(一つは A の実際の製剤と B のプラセボ、も
and B (placebo), or A (placebo) and B (active).
う一つは A のプラセボと B の実際の製剤)のどちらか
を受ける。
Dropout
脱落
A subject in a clinical trial who for any reason fails to
治験実施計画書が要求する最終観察以前に、何らかの
continue in the trial until the last visit required of
理由で臨床試験の継続ができない被験者。
him/her by the study protocol.
Equivalence Trial
同等性試験
A trial with the primary objective of showing that the
二つ以上の試験治療に対する反応が、臨床的に重要な
response to two or more treatments differs by an amount
意味を持つほど異ならないことを示すことが主要な目
which is clinically unimportant. This is usually
的の試験。このことは、通常臨床的に許容できる差で
demonstrated by showing that the true treatment
ある上側同等限界と下側同等限界の間に、試験治療間
difference is likely to lie between a lower and an upper
の真の差が存在する可能性が高いことを示すことによ
equivalence margin of clinically acceptable differences.
り証明される。
Frequentist Methods
頻度論的方法
Statistical methods, such as significance tests and
有意性検定及び信頼区間といった統計的方法であり、
confidence intervals, which can be interpreted in terms of
その意味は同一実験状況下という仮説的な繰り返しの
the frequency of certain outcomes occurring in
もとで起こるある結果の頻度という観点から解釈でき
hypothetical repeated realisations of the same
る。
experimental situation.
Full Analysis Set
最大の解析対象集団
The set of subjects that is as close as possible to the ideal
Intention-to-treat の原則に可能な限り近づけた被験
implied by the intention-to-treat principle. It is derived
者集団。最大の解析対象集団は、ランダム化が行われ
from the set of all randomised subjects by minimal and
た全被験者から、除くべき理由のある最低限の被験者
justified elimination of subjects.
を除外した集団である。
Generalisability, Generalisation
一般化可能性、一般化
The extent to which the findings of a clinical trial can be
臨床試験で得た知見を、その試験に参加した被験者か
reliably extrapolated from the subjects who participated
らより広い患者集団とより広い医療現場へ外挿するこ
in the trial to a broader patient population and a broader
とが信頼をもってできる程度。
range of clinical settings.
Global Assessment Variable
総合評価変数
A single variable, usually a scale of ordered categorical
被験者の疾患の状態又は疾患の状態の変化についての
ratings, which integrates objective variables and the
客観的変数と治験責任(分担)医師の全体的な印象を統
investigator's overall impression about the state or
合した、通常、順序カテゴリの評価尺度である単一の
change in state of a subject.
変数。
Independent Data Monitoring Committee (IDMC) (Data
独立データモニタリング委員会/効果安全性評価委員
and Safety Monitoring Board, Monitoring Committee,
会（データ及び安全性モニタリング委員会、モニタリ
Data Monitoring Committee)
ング委員会、データモニタリング委員会）
An independent data-monitoring committee that may be
臨床試験の進行状況、安全性データ及び重要な有効性
established by the sponsor to assess at intervals the
評価項目を何回かにわたって評価するとともに、治験
progress of a clinical trial, the safety data, and the
依頼者に試験の継続、修正、又は中止を勧告するため
critical efficacy endpoints, and to recommend to the
に、治験依頼者が必要に応じて設立する委員会。
sponsor whether to continue, modify, or stop a trial.
Intention-To-Treat Principle
Intention-To-Treat の原則
The principle that asserts that the effect of a treatment
治療に用いる治療方針により得られる効果は、実際に
policy can be best assessed by evaluating on the basis of
受けた試験治療ではなく、被験者を治療しようとする
the intention to treat a subject (i.e. the planned
意図（予定した試験治療規定）に基づくことにより最
treatment regimen) rather than the actual treatment
もよく評価できる、ということを主張する原則。この
given. It has the consequence that subjects allocated to a
原則から、一つの試験治療グループに割付けられた被
treatment group should be followed up, assessed and
験者は、予定した試験治療のコースを遵守したかどう
analysed as members of that group irrespective of their
かにかかわらず、割付けられたグループのまま追跡さ
compliance to the planned course of treatment.
れ、評価され、解析されるべきであることが導かれる。
Interaction (Qualitative & Quantitative)
交互作用（質的及び量的）
The situation in which a treatment contrast (e.g.
試験治療の対比（被験薬と対照薬との差など）が、他
difference between investigational product and control) is
の要因（施設など）により変わる状況。量的な交互作
dependent on another factor (e.g. centre). A quantitative
用とは、要因のレベルが異なるとそれに応じて対比の
interaction refers to the case where the magnitude of the
大きさが変わることであり、質的な交互作用とは、要
contrast differs at the different levels of the factor,
因のレベルの少なくとも一つにおいて、対比の方向ま
whereas for a qualitative interaction the direction of the
でもが変わることである。
contrast differs for at least one level of the factor.
Inter-Rater Reliability
評価者間信頼性
The property of yielding equivalent results when used by
異なる評価者が異なる機会に評価をする場合、同じ結
different raters on different occasions.
果を与える特性。
Intra-Rater Reliability
評価者内信頼性
The property of yielding equivalent results when used by
同一評価者が異なる機会に評価をする場合、同じ結果
the same rater on different occasions.
を与える特性。
Interim Analysis
中間解析
Any analysis intended to compare treatment arms with
試験の正式な完了以前に、有効性又は安全性に関して
respect to efficacy or safety at any time prior to the
試験治療群間を比較することを意図して行われるあら
formal completion of a trial.
ゆる解析。
Meta-Analysis
メタアナリシス
The formal evaluation of the quantitative evidence from
同じ問題を扱う二つ以上の試験から得られる定量的な
two or more trials bearing on the same question. This
証拠について形式に則って行う評価。最も一般的なメ
most commonly involves the statistical combination of
タアナリシスでは、様々な試験の要約統計量を統計的
summary statistics from the various trials, but the term
に結合するが、生データを結合する場合もメタアナリ
is sometimes also used to refer to the combination of the
シスと呼ぶ場合がある。
raw data.
Multicentre Trial
多施設共同治験
A clinical trial conducted according to a single protocol
単一の治験実施計画書に基づいて、二つ以上の施設で、
but at more than one site, and therefore, carried out by
したがって二人以上の治験責任医師によって実施され
more than one investigator.
る臨床試験。
Non-Inferiority Trial
非劣性試験
A trial with the primary objective of showing that the
被験薬への反応が比較薬剤（実薬又はプラセボ）より
response to the investigational product is not clinically
も臨床的に劣らないことを示すことが主要な目的の試
inferior to a comparative agent (active or placebo
験。
control).
Preferred and Included Terms
基本語及び慣用語
In a hierarchical medical dictionary, for example
MedDRA のような階層的医学辞書では、慣用語とは治
MedDRA, the included term is the lowest level of
験責任医師の記述がコード化される最下層の辞書用語
dictionary term to which the investigator description is
である。基本語とは、発生頻度を報告するために用い
coded. The preferred term is the level of grouping of
られるものであり、慣用語をグループ化するレベルで
included terms typically used in reporting frequency of
ある。例えば、「左腕に痛み」という治験責任医師の
occurrence. For example, the investigator text “Pain in
記述は、慣用語では「関節痛(Joint Pain)」としてコー
the left arm” might be coded to the included term “Joint
ド化され、基本語レベルでは「関節痛(Arthralgia)」と
pain”, which is reported at the preferred term level as
して報告される。
“Arthralgia”.
Per Protocol Set (Valid Cases, Efficacy Sample,
治験実施計画書に適合した対象集団（妥当例、有効性
Evaluable Subjects Sample)
サンプル、評価可能被験者サンプル）
The set of data generated by the subset of subjects who
データの集合であり、そのデータは基礎となる科学的
complied with the protocol sufficiently to ensure that
モデルに従い試験治療の効果をよく示すと十分考えら
these data would be likely to exhibit the effects of
れる程度に治験実施計画書を遵守した部分集団から得
treatment, according to the underlying scientific model.
られる。遵守には、試験治療への曝露、測定値の利用
Compliance covers such considerations as exposure to
可能性及び大きな治験実施計画書違反がないことが含
treatment, availability of measurements and absence of
まれる。
major protocol violations.
Safety & Tolerability
安全性及び忍容性
The safety of a medical product concerns the medical risk
医療用医薬品の安全性は、臨床試験では通常臨床検査
to the subject, usually assessed in a clinical trial by
（臨床化学、血液学を含む）、バイタルサイン、臨床
laboratory tests (including clinical chemistry and
的有害事象（疾患、徴候、症状）、その他特別な安全
haematology), vital signs, clinical adverse events
性検査（心電図、眼科学など）によって評価される、
(diseases, signs and symptoms), and other special safety
被験者の医療上のリスクに関するものである。医療用
tests (e.g. ECGs, ophthalmology). The tolerability of the
医薬品の忍容性とは、明白な有害作用が被験者にとっ
medical product represents the degree to which overt
てどれだけ耐えうるかの程度を示す。
adverse effects can be tolerated by the subject.
Statistical Analysis Plan
統計解析計画書
A statistical analysis plan is a document that contains a
統計解析計画書とは、治験実施計画書に記されている
more technical and detailed elaboration of the principal
解析の主要な特徴のより技術的な詳細を述べた文書で
features of the analysis described in the protocol, and
あり、主要変数、副次変数、その他のデータに関する
includes detailed procedures for executing the statistical
統計解析を実行するための詳細な手順を含むものであ
analysis of the primary and secondary variables and
る。
other data.
Superiority Trial
優越性試験
A trial with the primary objective of showing that the
被験薬への反応が比較薬剤（実薬又はプラセボ）より
response to the investigational product is superior to a
も臨床的に優れることを示すことが主要な目的の試
comparative agent (active or placebo control).
験。
Surrogate Variable
代替変数
A variable that provides an indirect measurement of
臨床的効果を直接測定することが実際的でない場合
effect in situations where direct measurement of clinical
に、効果の間接的な測定値を示す変数。
effect is not feasible or practical.
Treatment Effect
試験治療の効果
An effect attributed to a treatment in a clinical trial. In
臨床試験における試験治療の効果。ほとんどの臨床試
most clinical trials the treatment effect of interest is a
験では、関心のある試験治療の効果は、二つ以上の試
comparison (or contrast) of two or more treatments.
験治療間の比較（又は対比）である。
Treatment Emergent
試験治療下での発現
An event that emerges during treatment having been
試験治療前には存在しておらず試験治療期間に出現し
absent pre-treatment, or worsens relative to the
た事象、又は試験治療前の状態に比べて悪化した事象。
pre-treatment state.
Trial Statistician
試験統計家
A statistician who has a combination of
本ガイドライン中の原則を実行するために、十分な理
education/training and experience sufficient to
論又は実地の教育及び経験を併せ持ち、かつ当該試験
implement the principles in this guidance and who is
の統計的側面に責任を持つ統計家。
responsible for the statistical aspects of the trial.

Download Report