Methodological perspectives for register-based health system ...

4 downloads 232 Views 4MB Size Report
A pragmatically useful health system performance monitoring should ..... However, the development of new tools for data analysis has not solved the actual.
Reijo Sund

Methodological Perspectives for Register-Based Health System Performance Assessment Developing a Hip Fracture Monitoring System in Finland

National Research and Development Centre for Welfare and Health Research Report 174

Editorial board Matti Heikkilä, chairman Marjatta Bardy Marko Elovainio Mika Gissler Riitta Haverinen Timo Tuori Matti Virtanen

The research presented in this series has been approved for publication after undergoing a formal referee evaluation process.

© Author and STAKES

Translation of the Swedish abstract: Done Information Oy

Cover design: Harri Heikkilä Layout: Christine Strid

ISBN 978-951-33-2114-7 (nid.) ISSN 1236-0732 ISBN 978-951-33-2132-1 (PDF) Gummerus Printing Vaajakoski 2008 Finland

Dedicated to the memory of my father

Acknowledgements First of all, I want to praise my beloved wife Anne, the queen of my heart and the goddess of my soul, for giving a purpose to my life. I would also like to express my gratitude to my mother Irja and my sister Paula for their unconditional love, and to my goddaughter Saana-Emilia for bringing so much sunshine that this world has become a happier place. The preparation of this thesis would not have been possible without a research grant for the project ’Register-based data in hospital outcomes research’ from the Academy of Finland. I am grateful to Professor Pekka Rissanen and Adjunct Professor Ilmo Keskimäki for guiding me to the world of health services research. I also thank the collaborators within the ’Performance, effectiveness and cost of treatment episodes’ -project led by Research Professor Unto Häkkinen. In addition, I want to acknowledge the role of my other co-authors and the colleagues I have worked with. Thank you; you know who you are. I express my sincere appreciation to Professor Emeritus Seppo Mustonen. His extraordinary creativity, talent, and experience combined with the ability to say the right things at the right time have offered an invaluable source of inspiration and support during my whole statistical career. Moreover, the general computing environment provided by Survo, developed by Professor Mustonen, offered an excellent platform for register-based data analysis. I also thank Professor Risto Lehtonen, Professor Hannu Niemi and Professor Esa Läärä for comments and discussions concerning the manuscript of this thesis. I am thankful to the official reviewers of the dissertation, Adjunct Professor Timo Alanko and Adjunct Professor Helka Hytti. Their comments helped me to further simplify and clarify the message. Finally, one thing has become clear to me during the preparation of this thesis. It is extremely time-consuming to conduct and publish multidisciplinary scientific research that aims to offer alternative solutions to pragmatic problems. Although I had to make some compromises, I still hope that this thesis is able to reflect my personal way of thinking as well as my enthusiasm for scientific research.

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment



Abstract Reijo Sund. Methodological Perspectives for Register-Based Health System Performance Assessment. Developing a Hip Fracture Monitoring System in Finland. STAKES, Research Report 174. Helsinki 2008. ISBN 978-951-33-2114-7 The resources of health systems are limited. There is a need for information concerning the performance of the health system for the purposes of decisionmaking. This study is about utilization of administrative registers in the context of health system performance evaluation. In order to address this issue, a multidisciplinary methodological framework for register-based data analysis is defined. Because the fixed structure of registerbased data indirectly determines constraints on the theoretical constructs, it is essential to elaborate the whole analytic process with respect to the data. The fundamental methodological concepts and theories are synthesized into a data sensitive approach which helps to understand and overcome the problems that are likely to be encountered during a register-based data analyzing process. A pragmatically useful health system performance monitoring should produce valid information about the volume of the problems, about the use of services and about the effectiveness of provided services. A conceptual model for hip fracture performance assessment is constructed and the validity of Finnish registers as a data source for the purposes of performance assessment of hip fracture treatment is confirmed. Solutions to several pragmatic problems related to the development of a register-based hip fracture incidence surveillance system are proposed. The monitoring of effectiveness of treatment is shown to be possible in terms of care episodes. Finally, an example on the justification of a more detailed performance indicator to be used in the profiling of providers is given. In conclusion, it is possible to produce useful and valid information on health system performance by using Finnish register-based data. However, that seems to be far more complicated than is typically assumed. The perspectives given in this study introduce a necessary basis for further work and help in the routine implementation of a hip fracture monitoring system in Finland. Keywords: administrative registers, methodology, secondary data, hip fractures, health services, performance, effectiveness, incidence, profiling, care episodes

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment



Abstract in Finnish Reijo Sund. Methodological Perspectives for Register-Based Health System Performance Assessment. Developing a Hip Fracture Monitoring System in Finland [Menetelmällisiä näkökulmia rekisteriperusteiseen terveydenhuoltojärjestelmän vaikuttavuuden arviointiin. Lonkkamurtumien seurantajärjestelmän kehittäminen Suomessa]. Stakes, Tutkimuksia 174. Helsinki 2008. ISBN 978-951-33-2114-7 Terveydenhuoltojärjestelmän resurssit ovat rajallisia. Jotta näitä resursseja pystyttäisiin hyödyntämään mahdollisimman tarkoituksenmukaisella tavalla, tarvitaan tietoa terveydenhuoltojärjestelmän vaikuttavuudesta. Tässä tutkimuksessa tarkastellaan kuinka suomalaisia rekisteriaineistoja voidaan käyttää päätöksentekoa tukevan terveydenhuollon vaikuttavuustiedon tuottamiseen. Tutkimuksessa kehitetään menetelmällinen viitekehys rekisteriperusteisen data-analyysin tueksi yhdistämällä eri tieteenaloilta periytyviä menetelmällisiä ideoita. Lähtökohtana on, että rekistereitä hyödyntävän tutkimusprosessin tulee edetä myös aineiston ehdoilla, koska hallinnollisiin tarkoituksiin kerättyjen rekisteriaineistojen tietosisältö rajoittaa käyttömahdollisuuksia. Kehitetyn viitekehyksen puitteissa näiden välttämättömien rajoitusten syitä ja seurauksia on mahdollista käsitellä systemaattisella tavalla, joka auttaa ymmärtämään ja ratkaisemaan rekisteriperusteisessa tutkimuksessa tyypillisesti kohdattavia ongelmia. Terveydenhuollon vaikuttavuutta arvioitaessa tarvitaan tietoa niin terveysongelmien ilmaantuvuudesta, palvelujen käytöstä kuin hoidon laadustakin. Näitä tarkoituksia varten tässä tutkimuksessa muodostetaan käsitteellinen malli lonkkamurtumien seurantajärjestelmän tietotarpeille ja varmistetaan, että rekisteriaineistojen validiteetti on näiltä osin riittävä. Useaan lonkkamurtumien ilmaantuvuuden seurantaan liittyvään käytännölliseen ongelmaan esitetään rekisteriaineistojen kanssa sopusoinnussa oleva ratkaisu ja osoitetaan kuinka hoidon toteutumista voidaan arvioida rekisterien avulla koko hoitoketjun osalta. Lopuksi tarkastellaan kuinka hoidon laatua on mahdollista vertailla eri sairaaloiden välillä käyttäen yksityiskohtaisemmin perusteltua vaikuttavuusindikaattoria. Johtopäätöksenä voidaan todeta, että suomalaiset rekisteriaineistot tarjoavat erinomaiset mahdollisuudet terveydenhuollon vaikuttavuutta koskevan tiedon tuottamiseen, vaikka hyödyllisen ja tieteellisyyden kriteerit täyttävän tiedon tuottaminen onkin varsin haasteellista. Tässä tutkimuksessa esitetyt näkökulmat muodostavat perustellun lähestymistavan rekisteritietojen entistä tehokkaampaan hyödyntämiseen ja auttavat lonkkamurtuman rutiiniluonteisemman seurantajärjestelmän kehittämisessä. Avainsanat: hallinnolliset rekisterit, metodologia, toisen käden aineistot, lonkkamurtumat, terveydenhuolto, vaikuttavuus, ilmaantuvuus, laatu, hoitoketjut



Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

Abstract in Swedish Reijo Sund. Methodological Perspectives for Register-Based Health System Performance Assessment. Developing a Hip Fracture Monitoring System in Finland [Metodologiska perspektiv på registerbaserad utvärdering av effektiviteten i hälso- och sjukvårdssystem. Utveckling av ett system för uppföljning av höftfrakturer i Finland]. Stakes, Forskningsrapport 174. Helsingfors 2008. ISBN 978-951-33-2114-7 Hälso- och sjukvårdssystemets resurser är begränsade. För att dessa resurser skall kunna utnyttjas på ett så ändamålsenligt sätt som möjligt, behövs information om hälso- och sjukvårdssystemets effektivitet. I denna undersökning granskas hur finska registermaterial kan användas för att producera uppgifter om hälso- och sjukvårdens effektivitet som stöd för beslutsfattande. I undersökningen utvecklas en metodologisk referensram till stöd för dataanalysen genom att kombinera metodologiska idéer från olika vetenskapsgrenar. Utgångspunkten är, att en undersökningsprocess som utnyttjar register behöver ske på materialets villkor, eftersom innehållet i registermaterial som insamlats för administrativa ändamål begränsar materialets användningsmöjligheter. Med hjälp av den utvecklade referensramen finns det möjlighet att hantera orsakerna till dessa oundvikliga begräsningar och deras följder på ett systematiskt sätt, som underlättar förståelsen och lösningen av de problem som är typiska för registerbaserade undersökningar. Vid bedömningen av hälso- och sjukvårdens effektivitet krävs uppgifter om hälsoproblemens incidens, användningen av tjänsterna och vårdens kvalitet. För dessa ändamål utvecklas i denna undersökning en begreppsmodell för databehovet när det gäller uppföljningssystem för höftfrakturer och bekräftas att validiteten hos data i registermaterialet till dessa delar är tillräcklig. En metod som är förenlig med registermaterialen och som löser flera praktiska problem som är förknippade med uppföljningen av incidensen av höftfrakturer presenteras. Dessutom visas hur genomförandet av vården kan utvärderas med hjälp av registren för hela vårdkedjans del. Slutligen granskas hur vårdens kvalitet kan jämföras mellan olika sjukhus med hjälp av en mer detaljerat underbyggd effektivitetsindikator. Som slutsats kan man konstatera, att de finska registermaterialen erbjuder utmärkta möjligheter att producera uppgifter om hälso- och sjukvårdens effektivitet, även om det innebär en ganska stor utmaning att försöka ta fram uppgifter som uppfyller kriterierna för användbarhet och vetenskaplighet. De synpunkter som presenteras i den här undersökningen bildar en underbyggd angreppsmetod som gör det lättare än tidigare att utnyttja registeruppgifter på ett effektivt sätt och hjälp­er vid utvecklandet av ett med rutinmässigt system för uppföljning av höftfrakturer. Nyckelord: administrativa register, metodologi, andrahandsmaterial, höftfrakturer, hälso- och sjukvård, effektivitet, incidens, kvalitet, vårdkedjor

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment



Contents Acknowledgements Abstract Abstract in Finnish Abstract in Swedish List of original publications............................................................................................ 13 1

Introduction......................................................................................................... 15 1.1 Aim of the study.............................................................................................. 16 1.2 Structure of the study...................................................................................... 17

2 Register-based data analysis............................................................................. 19 2.1 From technical data processing to a knowledge discovery process............... 19 2.2 Secondary data as a methodological problem............................................... 20 2.3 Prerequisites in the register-based data analysis............................................ 21 2.3.1 Principles of measurement.................................................................. 22 2.3.2 Information science............................................................................. 23 2.3.3 Statistical computing........................................................................... 25 2.3.4 Statistics................................................................................................ 26 2.3.5 Theory.................................................................................................. 27 3 Register-based health system performance assessment............................ 28 3.1 Finnish health system...................................................................................... 28 3.2 Information production in the Finnish context............................................ 29 3.3 Measurement of the performance of a health system................................... 29 3.4 Assumptions for the production of concrete health system performance information .............................................................................. 30 3.4.1 Register-based health system performance monitoring.................... 31 4 Register-based data on hip fractures – sources and validity.................. 33 4.1 Hip fracture..................................................................................................... 33 4.1.1 Hip fracture treatment in Finland...................................................... 33 4.1.2 Register data on hip fractures in Finland........................................... 34 4.1.3 Previous register-based hip fracture studies in Finland.................... 35 4.1.4 Register data for the current study..................................................... 35 4.2 Validity of register data in the case of hip fracture........................................ 36 4.2.1 Completeness of registration.............................................................. 37 4.2.2 Conceptual model for hip fracture performance monitoring.......... 37 4.2.3 Consistency between prospective and register-based data................ 40 4.2.4 Conclusions on validity....................................................................... 41 5 Hip fracture incidence....................................................................................... 43 5.1 Aging-related hip fractures............................................................................. 44 5.1.1 A method for identifying a first aging-related hip fracture............... 45 5.2 Risk factor extraction...................................................................................... 47 5.3 Risk population data....................................................................................... 49 5.4 Hip fracture incidence between 1998 and 2002 in Finland........................... 51 5.5 Conclusions on hip fracture incidence monitoring...................................... 54

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

11

6 Hip fracture treatment..................................................................................... 55 6.1 State space for a hip fracture care episode..................................................... 55 6.2 Effectiveness in terms of care process............................................................. 57 6.3 Multivariate responses in outcomes research................................................ 57 6.3.1 Traceplots for care episode visualization............................................ 58 6.4 Comparing the care episode profiles of subpopulations............................... 61 6.4.1 Risk adjustment................................................................................... 61 6.4.2 Summarization of the follow-up data................................................ 62 6.5 Conclusions on monitoring hip fracture treatment...................................... 64 7

Operative delay as a performance indicator................................................ 66 7.1 Effects of different operative delays on mortality.......................................... 66 7.2 Profiling of providers...................................................................................... 67 7.3 Adjusted effect of operative delay on mortality............................................. 72 7.4 Provider-level hypotheses............................................................................... 73 7.5 Conclusions on operative delay as a performance indicator......................... 76

8

Discussion.............................................................................................................. 77

References......................................................................................................................... 80 Original publications

12

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

List of original publications [1] Sund R: Utilisation of administrative registers using scientific knowledge discovery. Intelligent Data Analysis 2003, 7(6):501–519. [2] Sund R, Nurmi-Lüthje I, Lüthje P, Tanninen S, Narinen A, Keskimäki I: Comparing properties of audit data and routinely collected register data in case of performance assessment of hip fracture treatment in Finland. Methods of Information in Medicine 2007, 46(5):558–566. [3] Sund R: Utilization of routinely collected administrative data in monitoring of aging dependent hip fracture incidence. Epidemiologic Perspectives & Innovations 2007, 4(2). [4] Sund R, Kauppinen S: Kuinka laskea ikääntyneiden pitkäaikaisasiakkaiden määriä rekisteritietojen perusteella? Sosiaalilääketieteellinen Aikakauslehti 2005, 42(2):137–144. [5] Sund R: Lonkkamurtumien ilmaantuvuus Suomessa 1998–2002. Duodecim 2006, 122(9):1085–1091. [6] Sund R, Liski A: Quality effects of operative delay on mortality in hip fracture treatment. Quality and Safety in Health Care 2005, 14(5):371–377.

Articles are included in the thesis with permissions from the publishers.

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

13

1 Introduction

1

Introduction

The resources available to health systems are limited. The equitable, efficient, and effective use of these resources requires administrative planning and political willingness. Information is needed concerning the performance of the health system for the purposes of decision-making [10]. Recently, a continuous evidence-based quality improvement aiming to “do the right thing right” has become very popular [11]. Essentially, this approach can be seen as an application of organizational learning and knowledge management ideas to the health care context [12]. To fulfill the requirements of this paradigm, information should be evidence-based, well-organized, pragmatic, task-specific, and available when and where it is needed [13]. It is obvious that information systems have a central role in this kind of information production. A comprehensive health information system should be suitable for clinical work as well as nationwide health policy purposes [14, 15]. The idea of such population data-based health information systems is not new [16], and some good examples do exist [17]. Finland has had a long tradition of maintaining administrative information systems, and extensive nationwide registration—using an individual’s single personal identification number for all systems—is exceptional also from the international point of view [18]. However, even though the potential advantages of Finnish information systems are well recognized, the utilization of these data sources has been rather limited. This controversy is not specific to Finnish health information systems. In fact, advances in information technology have made it possible to produce and store all kinds of data effectively, but the emphasis has been on technical aspects and not on the information itself [19]. For as long as there have been such information systems, the problem of giving too much data in an unusable form has been a common complain [20]. In fact, the predominant belief that secondary data stored in the health information system consist of autonomous, atom-like building blocks is fundamentally erroneous. Therefore, it is also erroneous to believe that the production of more detailed data would solve the basic problems. On the contrary, the more detailed and complex are the variables to be recorded, the more background information and tacit knowledge is required for secondary utilization. Such a fact is known as the law of medical information [21]. In this sense, it can be expected that as the amount of data increases with the introduction of electronic patient records and correspondingly complex information systems, it will be even more difficult to transform data into useful information. Traditionally, the problems in analyzing data have been considered as statistical issues, but statistical research focusing on probabilistic inference based on mathematics has not been able to offer enough concrete help in the analysis

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

15

1 Introduction

of growing amounts of data [22] . The practical need for information has led to the development of alternative ways of analyzing data, such as data mining [23]. However, the development of new tools for data analysis has not solved the actual problem of transforming raw data into useful information. As a matter of fact, it is naïve to believe that there would be some magic trick to overcome problems arising from the philosophical and methodological limitations of empirical research. Rather than trying to squeeze the data into a predefined model or saying too much on what can and cannot be done, data analysis should work to achieve an appropriate compromise between the practical problems and the data [24]. This kind of activity has been characterized as “greater statistics”, which tends to be inclusive, and eclectic with respect to methodology, while being closely associated with other disciplines and also practiced by many nonstatisticians [25]. Because formal statistical expertise provides an excellent basis for the understanding and evaluation of methodological ideas, statisticians ought to take advantage of the situation, get involved in interdisciplinary activities, learn from the experience, expand their own minds—and thereby also their field—and act as catalysts for the dissemination of insights and methodologies [26].

1.1 Aim of the study This thesis is about scientific research methodology in the context of register-based health system performance assessment. The application field is health services research. Health services research differs from most areas of applied research, since it does not exist in isolation from the decision-making procedures needed in health policy. A multidisciplinary approach and knowledge is required in health services research, including perspectives from at least biological, medical, social, clinical, management, economic, statistical and information sciences [27]. Another special aspect here is the use of register-based data. Since registerbased data have been originally produced for other purposes than for this specific research, the traditional methodological assumptions concerning the nature of data are not valid. In addition, information that allows for the monitoring of health system performance should be available if a health information system is to be adequate. This means that the perspectives of routine statistics production and the technical possibilities of information systems must also be considered simultaneously by means of a more research-oriented approach. Unfortunately, there are no widely accepted principles for the production of register-based statistics, which makes the development of register-statistical methodology challenging [28]. These issues make things rather complex. It is not possible to rely solely on a particularly well-developed paradigm and so the only reasonable way is to accept the diversity of the approaches and learn to deal with them simultaneously. This

16

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

1 Introduction

can be done by “raising” the ideas from the application level to the philosophical– methodological level. However, the abstract ideas may become real only through the applications. In other words, the innovative results can be achieved only by rethinking the methodology separately for each specific application. In short, the aim of this thesis is to give multidisciplinary methodological perspectives for scientific register-based information production for the purposes of health system performance assessment. More specifically, the goals are 1) to develop a methodological framework that helps to utilize register-based data effectively, 2) to demonstrate how the register-based data can be used as a data source for the performance monitoring of a health system. The specific application area is hip fracture from the public health point of view.

1.2 Structure of the study This thesis consists of this summary and the following six original contributions. [1] Sund R: Utilisation of administrative registers using scientific knowledge discovery. Intelligent Data Analysis 2003, 7(6):501–519. [2] Sund R, Nurmi-Lüthje I, Lüthje P, Tanninen S, Narinen A, Keskimäki I: Comparing properties of audit data and routinely collected register data in case of performance assessment of hip fracture treatment in Finland. Methods of Information in Medicine 2007, 46(5):558–566. [3] Sund R: Utilization of routinely collected administrative data in monitoring of aging dependent hip fracture incidence. Epidemiologic Perspectives & Innovations 2007, 4(2). [4] Sund R, Kauppinen S: Kuinka laskea ikääntyneiden pitkäaikaisasiakkaiden määriä rekisteritietojen perusteella? Sosiaalilääketieteellinen Aikakauslehti 2005, 42(2):137–144. [5] Sund R: Lonkkamurtumien ilmaantuvuus Suomessa 1998-2002. Duodecim 2006, 122(9):1085–1091. [6] Sund R, Liski A: Quality effects of operative delay on mortality in hip fracture treatment. Quality and Safety in Health Care 2005, 14(5):371–377. This summary synthesizes the methodological ideas from these contributions. The second chapter is about register-based data analysis in general. It begins with the database-oriented approach to data analysis and then extends the perspective to a more comprehensive research-process approach which is able to deal with secondary data, which is the topic of article 1. The latter part of chapter two gives a brief review of certain multidisciplinary ideas that help in understanding the problems likely to be encountered during a register-based data analyzing process. The general methodological ideas that I have presented mainly in articles 1 and 

Articles are included in the thesis with permissions from the publishers.

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

17

1 Introduction

3 and developed further elsewhere [7, 8], are in this chapter reflected against the existing literature. As a whole, chapter two defines a methodological framework for register-based data analysis and describes fundamental assumptions that have been applied in the later chapters. In chapter three, some fundamental background information on health system performance assessment in Finland is given and a perspective for the applied research problem is fixed with certain focusing assumptions. This chapter extends and focuses the introduction given in article 2 on the basis of my other publications [7–9]. The fourth chapter includes a brief introduction to hip fracture, registerbased sources of information on hip fractures, and a review on previous registerbased research on hip fractures. The latter part of the fourth chapter summarizes the contents of article 2, which defined a conceptual model for the hip fracture performance assessment and examined the validity of Finnish registers as a data source for the purposes of a performance assessment of hip fracture treatment. Chapter five, based on articles 3–5, deals with the pragmatic problems related to the development of a surveillance system for register-based hip fracture incidence. The sixth chapter updates the conceptual model on the hip fracture treatment process presented in article 1 so as to be compatible with the conceptual model on hip fracture performance monitoring reported in article 2 and gives a very basic example of an application of the conceptual model. I have already presented the ideas summarizing the approach elsewhere [9], and prepared several manuscripts that deal with the application in more detail. Chapter seven gives a more detailed example on the justification of a performance indicator, and elaborates on the methodological ideas presented in article 6, including a basic model for the profiling of providers first described in article 1. Finally, in the eighth chapter, the motives for this study are once more discussed, the idea of the methodological approach is briefly summarized, and whether the aims of the study have been fulfilled is evaluated.

18

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

2 Register-based data analysis

2 Register-based data analysis A register is an information system which continuously records subject-based data for a particular complete set of subjects. A register contains a logically coherent collection of related data with some inherent meaning, typically reflecting some events occurring in the observational reality. A register is implemented for a specific purpose and has an intended group of users. These properties also fulfill the definition of a database and hence a register can be considered as a database [29]. Typically a register is a large collection of data and is maintained using computers. As long as there have been (computerized) databases, there has been an increasing interest to provide information at the correct level of detail to support decisionmaking [30, 31].

2.1 From technical data processing to a knowledge discovery process In principle, it is quite straightforward to implement procedures that take register data as input and produce a sensible summarization of them as an output. For example, intensive cross-tabulations and basic visualization tools can be used for the comprehensive summarization of stored data. In addition, implementations of many traditional statistical techniques are suitable for such processing. However, the massive size of a database may impose certain computational difficulties in applying methods initially developed for smaller data sets [32]. Therefore, more recent and computationally efficient methods and algorithms offer an invaluable enhancement to data analysis [33, 34]. In practice, the description of data analysis as a finite series of precisely encoded rules needed to transform raw data in a database into interesting information gives an overly simplistic impression of data analysis, because it remains unclear what should actually be done in order to achieve information in data analysis. In this sense, it is convenient to consider data analysis as a collection of tasks, such as exploratory data analysis, descriptive and predictive modeling, pattern discovery, and identification [35]. In fact, it has been suggested that reasonable results can be achieved by considering the whole series of actions required to transform data into information as a knowledge discovery process [36]. A descriptive model of a standard process of data mining—which gives detailed pragmatic guidelines for performing a knowledge discovery process—has been developed [37]. In this business-oriented tradition, the goals of the process are evaluated in terms of its ability to produce interesting information. Suggested criteria for interesting

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

19

2 Register-based data analysis

Context Debate

Problem Question Perspective

Idea Theory

Data

Analysis

Answer

Figure 1. Research process schema

information are, for instance, evidence, non-redundancy, simplicity, novelty, and usefulness [38]. The problem with such a pragmatically useful description of a businessoriented knowledge discovery process is that certain important aspects of traditional empirical research and scientific method have been disregarded. Therefore I have complemented the process model to give a more comprehensive and detailed description of the phases commonly encountered during an empirical research process that incorporates register-based data [1]. The improved schema for a knowledge discovery process is presented in Figure 1. The main phases in such process are: understanding the phenomenon, understanding the problem, understanding data, data preprocessing, modeling, evaluation and reporting [1]. The case study of this thesis represents an application of this schema.

2.2 Secondary data as a methodological problem If the schema presented in Figure 1 is compared to a standard scientific inquiry, the most important difference is the connection between the problem and the data. In fact, typically a theory of a given phenomenon of interest should drive the primary data collection. This makes it possible to concentrate on just those parts of observable reality that are considered most relevant for the current theoretical purpose. A similar approach is not possible with register-data, as the formulation of a problem in terms of register-based data is opportunistic, given that the measurement can only be based on existing secondary data originally produced for some other purposes than the research problem at hand [39]. In fact, the limitations of secondary data are determined by the choices made in relation to the production of the data, such as a decision to collect only easily available data and using fixed categorizations in the data production, with consequent variation in the production practices and categorizations used, and a lack of flexibility in the information systems to take into account changes in phenomena and society [40].

20

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

2 Register-based data analysis

Bias/Noise

Phenomenon

Coding

Register

Decoding

Data

Figure 2. Schematic diagram of information communication via administrative registers

I have proposed that the methodological challenges related to the utilization of register-based data can be illustrated using ideas from communication theory [1]. A schematic diagram illustrating information communication via registers is presented in Figure 2. First, it is assumed that some phenomenon exists that can be observed. Since it is impossible to completely observe all details or perform exact measurements, some kind of coding is used to describe things. This coded signal is then stored in a database. The noise and bias can be interpreted as an explanation for measurement compromises, possible inconsistencies and coding errors, and coding practices existing in the stored signal. When this signal is then utilized, it must be decoded into a suitable form. This phase is also subject to noise and bias caused by incompatibility of choices and interpretations made by the data producer and the data user. Even the decoded signal (data) is not a final phase in the research process, because further analysis and processing is needed in order to transform the data into information. Even though this is a very simple and technical representation of communication, it seems to contain the essential elements needed in the commonsense understanding of secondary data.

2.3 Prerequisites in the register-based data analysis The schemas in Figure 1 and 2 give a basic overview of the process of registerbased data analysis, but leave many pragmatic details unanswered. Even though the actual realization of the process is determined by the research problem and the available register data, it has been pointed out that the effective use of registerdata presumes skills in at least four areas: in computer science, in statistics, in the principles of measurement, and in the theory of the subject matter [41]. In the following, I briefly review and synthesize the most important ideas, assumptions and approaches that were prerequisites for the case study of this thesis.

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

21

2 Register-based data analysis

2.3.1 Principles of measurement Typically it is assumed that reality can be confronted by recording observations that reflect the phenomenon of interest. Measurement aims to create data as symbolic representations of the observations. The two main aspects for measurement are the representational and the practical approach [42]. Representational measurements are quantifications of attributes of objects existing in reality, while the practical measurements are operationalizations of the phenomena of interest. All representational measurements resulting in concrete data can be formulated in terms of a practical approach. The operationalization used in the practical approach for measurement determines how the phenomenon P that becomes visible via observations O is mapped to data D. In the pragmatic sense, operationalization can be taken as successful if it becomes possible to make valid interpretations I of symbolic data D in regard to the phenomenon P, i.e. that a satisfactory saturation between the phenomenon, conceptualization and data is achieved [1]. The key point is that three recognized levels of contextual dependencies accompanying the empirical research are explicitly separated: a theory driven observation (O), the operationalized data (D), and a theoretical interpretation of the data (I) [43]. It is particularly important to notice that the actual information is not assumed to be contained in the data, but is something that has to be produced from the data and the pre-knowledge. Such an idea has been presented more formally in terms of an infological equation I = i(D,S,t), which states that the information I is produced from the data D and the preknowledge S by the interpretation process i during time t [44]. From this point of view it is obvious that any sharing of data can only be a proxy for the process of sharing of information, because the unbiased sharing of information would require the background knowledge S to be identical with the producer and the users of data. In this sense it is not surprising that data as such are of no value and become interesting only if there is also a meaning and a context for them [45]. The range of possible interpretations of the data can be reduced by increasing the common background knowledge of the potential data users by offering descriptive data about data, that is, metadata [46]. While it is true that the increased amount of metadata increases the proportion of shared information, the more diverse contexts the data have to be usable in, the more work is required to disentangle the data from the context of its production [21]. These are also starting points in register-based data analysis, and the problem is to find some shared perspective between the original and intended data utilization purposes in terms of available data and (more or less tacit) metadata [3]. Following

22

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

2 Register-based data analysis

the ideas of a cognitive fit approach [47], there could be a problem-solving task related to phenomenon P, but only data Dp’ are available that tell about some other phenomenon P’. A “true” task solution Ip can only be approximated with a conditional task solution Ip|p’ . The conditional task solution is based on conditional data Dp|p’,which are a result of some intelligent transformation of available data Dp’ and are based on the cognitive fit between the internal representation of hypothetical “true” data Dp and available data Dp’ . In practice this means that an extra interpretation-operationalization phase is required in the research process if secondary data are to be utilized [1], i.e. that data in a register need to be decoded to become suitable research data (Figure 2: register → decoding → data). This important idea has been applied throughout this dissertation. The problem with the approach is that phenomena are different in terms of measurement possibilities. As the law of medical information states, it is more difficult to utilize more complex variables for purposes other than those originally intended [21]. In this sense, it is in practice fruitful to evaluate the amount of subjectivity in the variables. One criterion is the separation between representational and practical measurement: the measurement is representational only if it is reasonable to assume that the data D are direct reflections of observations O that refer to some entity existing in reality [42], i.e. that the phenomenon of interest is directly observable. Another useful criterion is related to the amount of additional background knowledge S+ required in the interpretation of data D. I have suggested that if there is no need for knowledge beyond the shared human common sense, the measurement can be characterized as stable [8]. In fact, it is known that space and time provide a shared biological-development basis for ordering human common sense as our world of sensory observations is arranged to observe things changing in space over time [48]. If measurement is representational and stable, it is factual. These ideas have an important role in this thesis, because the purpose has been to minimize unnecessary subjectivity by maximizing the use of factual data. For example, the hip fracture care episodes are reconstructed in terms of factual data.

2.3.2 Information science A concrete data management requires some kind of data structures. In fact, the raw data are nothing but a sequence of bits stored in a database. There must be some a priori fixed rules for the interpretation and handling of bit sequences, which define the basic data objects, such as integers, floating point numbers and character strings. A data model specifies which kinds of data and data manipulations are permissible [49]. New types of data objects and associated operations can be easily constructed by combining the existing types and operations [50]. This idea makes it possible to provide a conceptual representation of data which hides storage and implementation details which are of no interest to most database users [29].

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

23

2 Register-based data analysis

Phenomenon Taxonomy

Knowledge component

⇓ Concept

Partonomy

⇓ Object Host

Attributes

Logical component

Theoretical measurement properties

Time

Place

Realized observation

Data component

Figure 3. An overview of the extended pyramid framework

More generally, data modeling can be used to construct (computer-based) symbol structures which capture the meaning of data and organize it in ways that make it understandable and useful [51]. Data modeling is restricted in the sense that only what is (or can be) represented is considered to exist [52]. A recently introduced unified meta-information architecture of statistics (UMAS) provides a comprehensive conceptual framework for data modeling [53]. The aspects of the complicated framework that are useful in pragmatic register-based data analysis can be illustrated with the help of a pyramid framework that is based on the cognitive principles of how humans store everyday knowledge [48]. A slightly extended pyramid framework is presented in Figure 3. Following the ideas that I have presented independently [7], but which the UMAS also captures, the framework had to be complemented with an additional logical component. A semantic object (conceptual entity), reflecting some phenomenon in reality, is a basic element in the framework. Taxonomy and partonomy reflect the principles of a cognitive categorization of the object and represent also the schemas needed to interpret the observational data. These parts form a knowledge component of the framework, i.e. reflect the required pre-knowledge that cannot be extracted from the data. The object must be logically defined by identifying an observable host for a concept and by determining which attributes related to the concept are to be observed. The logical component also includes theoretical measurement properties of variables given in terms of random variables that represent the (assumed) operationalization of the variables. The data component of the framework represents the actual data by means of three distinct data perspectives: place (where), time (when), and realized observation (what). The whole path from a knowledge component to the realized observation is needed to

24

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

2 Register-based data analysis

describe what is actually observed. The (extended) pyramid framework provides a human–computer environment for (register-based) data analysis that incorporates models of human cognition so that the utilization of large databases becomes more effective than without the framework [54]. A generalized event-sequence approach that I have presented in article 1 is obviously a special case of this more general framework. The framework also had an important role in the development of the conceptual model in article 2.

2.3.3 Statistical computing Information sciences also deal with more pragmatic tasks concerning the acquisition and preparation of data, which can also be classified as statistical tasks. The acquisition of relevant data in practice may be a difficult and time-consuming task, because the sources of data need first to be identified and the use of register-based individual-level data for scientific research requires permissions to be applied for [55]. Technically the problem is similar to the one of extracting and integrating data from multiple sources into a new data view (data warehouse) [56]. An important special case aims to identify the instances which include data about the same realworld entity [57]. Such duty is better known as record linkage [58]. The record linkage becomes problematic, however, if the entity identifiers in the data sources are not identical and probabilistic techniques are needed to predict the equivalence [59]. More generally, data integration may require schema matching, which aims to find compatible interpretations between multiple database structures [60]. For example, typical mortality data have a different structure to hospital discharge data, but in practice it is useful to define a data structure that allows incorporation of data from both sources. There are also some commonly encountered data preparation tasks [1]. Data cleaning task involves detecting and removing errors and inconsistencies from data in order to improve the quality of data [61]. In statistics literature, data cleaning is known as editing and imputation [62], and in applied mathematics as error correction [63]. Data cleaning deals with logical errors, such as violations against integrity constraints or duplicate observations. The nature of data cleaning is technical, meaning that such corrective manipulation of data could be done no matter what the actual research problem under investigation is. Data reduction aims to produce a reduced representation of data which is much smaller in volume than the original data set, yet produces (almost) the same answers to the research problem [64]. This kind of data preparation can consist of anything from simple dropping of unimportant variables, combination of several variables or observations into single one, or more radical changes in aggregate levels to even more complicated analyses. For example, in some cases it may be reasonable to assume a multivariate normal distribution for variables of interest, which means

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

25

2 Register-based data analysis

that only the covariance matrix and means (sufficient statistics) are needed for subsequent analyses. Data abstraction tries to embed an intelligent interpretation (enrichment) of “raw” data into analyses so that the resulting derived data set would be at the level of abstraction corresponding to the current problem [65]. In practice this kind of abstraction may be feasible by processing data with explicit algorithms so that important a priori schemas that are not directly available in data become formalized in terms of the data. For example, register-based data are usually patient-specific, while medical knowledge is patient-independent and consists of generalizations that apply across patients. A complication after a surgical operation is a medical concept, but from individual-based hospital discharge data it must be abstracted by using some rules, such as a list of particular diagnosis codes with appropriate time stamps recorded in the data. Data abstraction always results in a problem-specific derived data set. In summary, the data preparation has been characterized as a process that requires human participation, which means that as much art as science is needed in good data preparation [66]. Whatever preprocessing tasks are applied, it is obvious that incorporated explanatory analyses offer insights and realistic perspectives into the data. It can be further stated that a sophisticated preprocessing—which is full of ideologically dependent qualitative choices—in order to scale matters down to a size more suitable for specific analyses is the most important and time-consuming part of register-based data analysis [1]. Data sensitive preprocessing had a key role in the empirical part of this thesis.

2.3.4 Statistics Statistics is a science devoted to the production, analysis, modeling, and presentation of data. Statistical models help to distinguish systematic patterns from random fluctuations, measurement errors, and confounding biases occurring in the data. However, the traditional mathematically oriented statistical paradigm has had some difficulties in adapting to a new situation that requires concrete analyses of massive (register-based) datasets [22]. One reason is that the basic assumptions about the independent observations and the sampling error as the main source of uncertainty are often violated in complex register-based data sets which possibly include total populations [67]. Further, the traditional statistical significance has become as an issue, because even the practically unimportant differences easily become statistically significant within the large data sets. In addition, with massive datasets it can be expected that there are some distortions and errors in the data, and typically it is unfeasible to manually check all of them. On the other hand, the reasonable use of data typically requires the use of problem-specific data abstraction rules which transform the data to a useful form. This kind of data analysis is typical for “soft” qualitative analysis, but with massive register-based

26

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

2 Register-based data analysis

datasets, strong technical skills in quantitative data management are also required. In summary, a flexible approach for statistical inference is typically beneficial with register-based data. Moreover, statistics offers not only a set of tools for problemsolving, but also a formal way of thinking about the modeling of the actual problem [68]. In this sense, statistical thinking also has a key role in the process of registerbased data analysis. In fact, one useful perspective for register-based data analysis is to consider the whole process of information communication (Figure 2), and try to explicate the sources of noise and bias that may distort the communication [69]. The basic idea is—in addition to a traditional modeling of the phenomenon of interest—to take into account suspected sources of bias, possibly with the help of a priori information [70]. Such is one way to deal with the particular restrictions of secondary data by using theory driven statistical modeling. Chapter 7 in this thesis also incorporates this kind of statistical thinking. Another option for statistical modeling is to apply statistical algorithms directly to data and let the data speak for themselves [71]. Also this approach is often useful with massive secondary data sets, because data exploration and description plays a more general role than it does in the case of small data sets. The screening of data for significant associations without having specific hypotheses may be questionable [72], but it may also be erroneous to assume that data are automatically in concordance with the theory. Within this approach it becomes possible to be data sensitive so that empirical justification is given to the proposed operationalizations. Most analyses in this thesis are of this type.

2.3.5 Theory Theory of the subject matter is needed for the formulation of the research problem. As shown in Figure 1, theory is also the driving force in the generation of the question and in the choice of the perspective, as well as in the interpretation of the data and analyses. In other words, the theory determines the framework within which justified data analysis becomes possible. The special feature in register-based data analysis is that more than one theory must be simultaneously dealt with [3]. For meaningful results there is a need to find some communality in terms of realized data between problem-driven subject matter theory and the (more or less unknown) theories that have been used during the production of the register-based data. The extended pyramid framework helps to structure the data in a concrete and methodologically sound way that makes the finding of suitable compromises much easier. For example, the components of data that have the most stable measurement properties are obviously the ones that can be most easily reused in various contexts and the basics of many useful theoretical approaches can be built on these factual main elements.

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

27

3 Register-based health system performance assessment

3

Register-based health system performance assessment

In the previous chapter it was argued that the register-based data analysis should be considered as a research process. Obviously that is true also in the case of scientific health system performance assessment. In this chapter some essential background information needed for the understanding of the phenomena of interest are given. The applied research problem is also introduced and the perspective is fixed with certain focusing assumptions.

3.1 Finnish health system A (public) health system is a creation of the human community, and the health system of any society can only be understood in the light of its societal operating principles and policies [73]. The Finnish societal system is typically characterized as a Nordic welfare state, which is internationally rather exceptional. The Finnish health care system is very decentralized, and the country’s numerous (>400) local authorities (municipalities) are responsible for arranging services [74]. Each municipality is a member of one of the 21 hospital district joint authorities, which are responsible for organizing specialized medical services and coordinating hospital treatment in its own district. Secondary and tertiary level medical care is provided by a hierarchy of regional, central and university teaching hospitals. Services for older people are provided in both social and health care, both being incorporated into the same national planning and financing system [75]. The organization and financing of health care has long been considered a public responsibility [76]. There are also many recent reforms to the Finnish health system, such as the setting of thresholds for admission onto waiting lists for elective surgical procedures, the introduction of a set of maximum waiting-time targets for non-urgent examinations and treatments, a national electronic patient record and a project aiming to restructure municipalities and services [77].

28

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

3 Register-based health system performance assessment

3.2 Information production in the Finnish context One important tool in the maintenance of a Nordic welfare state has been to produce information which supports the (nationwide) decision-making. The main emphasis has been on producing official statistics and indicators that reflect important aspects of health and social welfare, and therefore offer information for decision-making and controlling purposes. However, the national-level control mechanisms were decomposed in 1993 due to the change from direct authorative control (imperatives, rules, norms, earmarked funds) into indirect control based on information (instructions, guidelines, hopes, total funds) [78]. This new nationwide policy on “steering by information” has aimed to produce information for meso-level decision-makers (such as municipalities and hospital districts) [79]. The problem is that now there should be information to suit the purposes of hundreds of local policies instead of one global policy as in the past [80]. Moreover, the potential users of information may easily refuse to act on the given information if they consider it to be irrelevant for their purposes [81]. In this sense, there is an obvious need for methodological studies that aim to make policy relevant information production more efficient.

3.3 Measurement of the performance of a health system Even though there does not exist a universal value base to all health care systems, most health systems in developed countries aim to promote, restore and maintain health [82]. The common goal is the optimization of the health of individual patients and populations in an equitable, efficient, and effective manner that is acceptable to patients, providers and administrators [83]. The growing need for appropriate services with limited resources and the concern about the continuing inequities in health and in access to health care means that there is much interest among decision-makers in improving the performance of health systems [84]. However, performance measurement has proved to be very difficult in practice [85]. The main problem in performance measurement seems to be that there is little agreement on the philosophy of measurement and on what to measure [86]. For example, at least 15 dimensions for health care performance can be identified from the existing frameworks [87]. In fact, the problem seems to be related to a more general change in societies and in information production [88]. The evaluation of performance has become a more powerful organizing concept in activities of societies and there is a real need for information about performance. However, it has been claimed that the

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

29

3 Register-based health system performance assessment

abstract rhetoric of administration blurs the actual meaning of performance: there are no more norms about what and how things should be done, but the political responsibility is implicitly included by using vague formulations of strategic objectives which delegate the actual implementation to “actors” [89]. In this sense, performance assessment as such is an empty concept [88]. Only if the goals are fixed and operationalized to a measurable form can the concept of performance become interesting and potentially useful. Concrete quantitative measurement of performance has mainly been conducted using various performance indicators [90–92]. Recently the whole idea of performance measurement seems to be focused on the sophisticated development and reporting of indicators [93-95]. Furthermore, methods for evidence synthesis and assessment in the context of multiple indicators have been suggested [96, 97]. As the indicators certainly reflect something that can be measured, it is typically useful to compare indicators between the appropriate subpopulations, because descriptive data on relative performance may help to identify “best practices” or “what works”. However, the causal mechanism underlying differences between the indicators typically remains unknown in such an approach [98], which “invalidates” the appealing analogy to the experimental research design [99]. In this sense, registerbased indicators without detailed justification of their theoretical-methodological basis do not represent scientifically valid information.

3.4 Assumptions for the production of concrete health system performance information In this thesis, I focus on the methodology of producing health system performance information. The key assumption is that the routinely collected register data are to be used for performance assessment purposes. The use of register-data in health research is known to be prone to several problems, such as the perceived lack of value of administrative data, privacy and confidentiality, data availability, population coverage, registration period, record linkage possibilities, lack of clinical data, data format, coding systems, coding practices, completeness of registration, accuracy of registered data, data processing, size of data, and discovery of chance occurrences [100–104]. I do not claim that the problems are not real, but adopt the view that proponents of certain approaches have been more interested in advocating their ready-made mechanical procedures than in understanding alternative logics of interpretation [105]. In fact, many of the mentioned problems are related to particular study designs or properties of existing data sets and therefore relevant only for certain types of research questions or data. For the most fruitful results it seems to be essential to elaborate the whole analytic process with respect to the data, because the fixed structure of secondary data indirectly determines constraints on

30

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

3 Register-based health system performance assessment

the theoretical constructs [106]. In this sense, the methodological approach for register-based data analysis presented in Chapter 2 also provides a basic framework for the register-based performance assessment of the health system. I focus on the data sensitive analyzing approach, and the aim is a step-by-step transformation of data into as objective information as possible by maximizing the use of the most factual register data. I assume that the interesting dimensions of health system performance assessment can be examined by using the framework which conceptualizes health system performance assessment in terms of structure, process and outcomes [107]. In short, the structure comprises resources that are devoted to producing actions whose primary purpose is to improve health, while the process means the realized utilization of these resources, and the outcomes reflect the effects of resource utilization on health. More recent formulations of the framework emphasize also the importance of a context and health policy goals that have led to certain health system implementations [10, 108]. The definition of what kind of expectations the commonly accepted abstract health policy goals truly reflect is difficult [109]. However, it can be defined that effectiveness in this context measures how the health system (or resources spent on it) affects the health of the target population [10]. In this thesis I concentrate on the effectiveness dimension of health system performance, because effectiveness seems to be a key dimension in several performance frameworks [87]. It is also known that the production of health benefits (effectiveness) plays a central role in assessing the cost of producing health benefits (efficiency) as well as the distribution of these benefits and costs across groups (equity) [10]. In addition, there is an acute need for information on effectiveness for the purposes of information steering in Finland [110].

3.4.1 Register-based health system performance monitoring The Finnish registers offer an exceptional coverage of health and social welfare data [18]. In principle, a separate scientific study utilizing these data could be conducted to answer some specific question. However, from the information steering point of view, the available register-based data should be routinely converted to useful information. As it is unfeasible to repeat separate studies manually over and over again, a better alternative is to incorporate new information production pipelines to the health information system that can be used in the production of routine statistics [14]. In fact, the implementation of a nationwide health system performance monitoring system is one of the main development goals of for the information systems of the health and social welfare services in the near future [111].

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

31

3 Register-based health system performance assessment

In the Finnish proposals for the health information system, register data (raw data) and evidence-based decision support (know-how of transforming the raw data into information about issues that should be known) are essential ingredients for the continuously improving epidemiological and quality components that routinely produce new scientifically valid information [14, 15, 112]. Following these proposals, health system performance monitoring should consist of two components. The epidemiological component is needed for monitoring the volume and incidence of health problems, which are essential information for the evaluation of prevention strategies and which also help to prepare for changes in the need for care caused by changes in the population structure. The quality component concentrates on producing information on the effectiveness and quality of treatment, which are essential information for the evaluation of health system performance and for the purposes of finding ways to improve the health system. For the purposes of this thesis, I assume that pragmatically useful health system performance monitoring should be able to produce valid information about the volume and incidence of the problems, use (and costs) of services as well as about the effectiveness (and quality) of provided services. It is well known that the performance assessment is practically feasible in a reasonable way only if it is carefully developed and tailored for each specific health problem separately [106, 113]. Therefore, I concentrate only on one health problem, hip fracture.

32

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

4 Register-based data on hips fractures – sources and validity

4 Register-based data on hip fractures – sources and validity The assumptions and theoretical considerations in the chapters above create adequate starting points for more concrete health system performance assessment. The case study in this dissertation is intended to offer perspectives and methods for the implementation of a nationwide system for monitoring health system performance. Hip fracture is used as a concrete example. A brief review of essential background knowledge is given first. Thereafter, the feasibility of using Finnish register data in the case of hip fracture treatment performance assessment is discussed based on article 2.

4.1 Hip fracture Hip fractures are common injuries among older people, and associated with substantial morbidity and mortality [114]. The term hip fracture refers to a fracture of the upper end of the thigh bone (femur). Most hip fractures in persons aged 50 years and over result from moderate low-energy trauma, usually a fall from a standing height or lower [115]. For younger persons it is more likely that a case of hip fracture results from a high-energy trauma, such as traffic accidents or a fall from a height [116]. Prevention has focused on minimizing the risk of falls and on reducing the injury potential of those falls [117]. About 7000 (of which more than 95% occur for patients aged 50 years and over) hip fractures per year occur in Finland currently [5]. Ageing among populations is increasing hip fracture patients’ mean age and the number and severity of their pre-existing co-morbidities, which is likely to cause additional problems in patients’ treatment and rehabilitation in the future [118]. Sometimes the hip fracture can be interpreted as an indication of the “beginning of the end” (patients were doing well until they broke a hip and went downhill quickly) and sometimes as an “end of the beginning” (hip fracture signals that the cumulative effect of small declines has reached a critical level) [119].

4.1.1 Hip fracture treatment in Finland Virtually all suspected hip fracture patients are first referred for examination and treatment to the nearest hospital with orthopedic services. The main objective in hip fracture treatment is to return the patient to his or her level of function before

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

33

4 Register-based data on hips fractures – sources and validity

the fracture [115]. The diagnosis of fracture of the hip is straightforward, using xray examination. A surgical operation is performed on the majority of patients. The main methods used in treatment are reduction of the fracture using internal fixation and hip replacement arthoplasty. The care pathway for a hip fracture patient is rather complex with several phases such as surgical management and rehabilitation [120], and is known to result in diverse episode profiles in Finland [121]. Typically a patient is transferred for rehabilitation to the health center serving the patient’s resident municipality after a short postoperative hospital treatment [122]. Finnish health centers are local primary health care units, which also contain inpatient wards. Other institutional environments of care include residential homes and service housing with 24-hour assistance, which both correspond to the nursing home type of care. Non-institutional services utilized by hip fracture patients include outpatient health services, home nursing, ordinary service housing, homehelp services, and support for informal care [75]. For six Finnish hospitals, patients aged 50 and over had an average mortality at 30 days after the fracture of about 7%, 17% at four months, 26% at one year, and about 50% at four years [123]. At four months, about 40% of patients lived at home, about 15% were unable to walk, and about 8% had a lot of pain in the injured hip [123]. The functional capacity of the patients does not typically revert to the level prior to the fracture [124]. Hip fractures are also costly to the society. The average patient-specific costs during the first post-fracture year in Finland were estimated to be around €14 410 and more than €35 000 in case of a previously home-dwelling individual who becomes a long-term care patient following the fracture [125]. Treatment processes as well as the outcomes vary considerably between areas and hospitals, and improved auditing of hip fracture treatment has been suggested [126, 127]. A recent Finnish current care guideline on the management of hip fracture patients proposes that a nationwide hip fracture register allowing continuous auditing should be established in Finland [128]. In this sense, there is a pragmatic justification for the methodological studies aimed at transforming routinely collected register data into relevant hip-fracture-specific information about the performance of the health system. In addition, hip fracture is a good choice for a pilot study on performance assessment, because it can also be viewed as a tracer condition in health systems, testing how well health and social services are integrated in the provision of acute care, rehabilitation, and continuing support for a large and vulnerable group of patients [129].

4.1.2 Register data on hip fractures in Finland Finland has a long history of collecting data on health and social services. At the end of 20th century, there were about twenty different routinely collected national administrative health registers [18, 55]. A unique national personal identification

34

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

4 Register-based data on hips fractures – sources and validity

number is used in all Finnish registers which can be utilized as a linkage key in order to combine data from multiple sources. A particularly important register for the purposes of health services research is the Finnish Health and Social Welfare Care Registers (including the Finnish hospital discharge register). These registers contain data on all inpatient care periods in hospitals, in health centers, in residential homes and in service housing with 24-hour assistance. The data warehouse of the Finnish Hospital Benchmarking Project (nowadays a part of the Finnish Health Care Register) is the corresponding register for hospitals, but also incorporates data on outpatient visits. In general, the complete registration combined with easily linkable registers makes large, longitudinal population-based studies feasible in Finland.

4.1.3 Previous register-based hip fracture studies in Finland The first nationwide register data based on hospital discharges in Finland are available for the year 1960, although the continuous hospital discharge data collection began no earlier than 1967. Hip fractures were not reported separately, but were combined with other fractures of limbs [130]. The register data from 1968 included hip fractures as a separate group [131]. The hospital discharge data from 1968 is also a data source for the first hip-fracture-specific registerbased study in Finland [132]. Most of the register-based studies concerning hip fractures have had an epidemiological perspective [133–137]. In the late 1980s and early 1990s, a project aimed at improving the reporting of treatment and costs data based on registers had hip fractures as a separate group [138]. At the same time, the first register-based small-area analyses examining treatment practices in terms of surgical operations reported data on hip fractures treated with arthoplasty [139]. The next step in the utilization of register-based data on the description of hip fracture treatment practices was the reconstruction of the care episodes of the patients [121, 140]. There followed a pilot study examining the effectiveness of hip fracture treatment using register-based data [141, 142].

4.1.4 Register data for the current study The data used in this study were also based on Finnish registers. As virtually all hip fracture patients are treated at the hospital inpatient ward and given that hip fracture is easy to diagnose, it is very likely that the patient population can be easily identified from the hospital discharge data by using a simple database query with a list of diagnosis codes. To make sure that all hip fracture patients were included in the study population, a total population of fracture of femur (corresponding to

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

35

4 Register-based data on hips fractures – sources and validity

ICD-10: S72) in the period 1998–2002 was identified in the Finnish Health Care register. In order to capture the medical histories of these patients, data on all inpatient (1987–2002) and outpatient hospital care (1998–2002), residential home care and care in service houses with 24-hour assistance (1997–2002), and deaths for this population were obtained from the Finnish hospital discharge register, the Finnish Health and Social Welfare Care Register, the data warehouse of the Finnish Hospital Benchmarking Project and the National Causes of Death statistics using the unique national identification numbers of the patient population. It was first time that the hospital outpatient visits from the benchmarking project data and the data on inpatient care in social institutions from the Social Welfare Care Register were included in a hip fracture specific register-study. The results of simple database queries were integrated into a new data set containing 988 762 records for 39 041 patients. Each record in this data set corresponded to one care period in inpatient institutional care or outpatient visit in hospital (or death), and included variables such as patient and provider IDnumbers, age, sex, area codes, and diagnosis and operation codes, as well as dates of admission, operation and discharge (or death).

4.2 Validity of register data in the case of hip fracture Valid data is a prerequisite for any information system aiming to produce practically useful information on health system performance. Several studies have reported a high completeness of registration and a high validity for most variables in the Finnish registers [143–146]. However, there have been some suspicions about the quality of register data in the case of hip fracture [147], and recent a Finnish current care guideline has requested more validity analyses [128]. Moreover, there have not been studies where the validity of data is judged against the health system performance assessment purposes. Therefore, an investigation aimed at clarifying these issues was conducted [2]. Prospectively collected data for 106 consecutive hip fracture cases in the Kuusankoski Regional Hospital between January 1, 1999 and January 31, 2000 and the register data for patients treated during the same period in the surgical ward of the Kuusankoski Regional Hospital were compared and the observed differences were checked from the medical records.

36

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

4 Register-based data on hips fractures – sources and validity

4.2.1 Completeness of registration It was rather straightforward to check whether the patients in the register data having a recorded hip fracture diagnosis (ICD-10: S72.0, S72.1, S72.2) in a particular hospital period were found in the prospective data and vice versa. As it was very likely that all hip fracture patients were identified from the register or prospective data, and as the hip fracture diagnosis could be confirmed from the medical records, it was possible to evaluate whether all true hip fracture cases were identifiable from the data sources. It turned out that there were in total 105 patients with a confirmed hip fracture diagnosis. The audit data missed one patient and the register data two patients. That indicated very good completeness of registration in both data sources. Three extra hip fracture candidates clearly had a false positive diagnosis in the register, and a further five candidates had been excluded from the prospective data on the basis of additional information, i.e. register data may slightly overestimate the number of fresh hip fractures unless appropriate data abstraction rules are used for the exclusion of extra cases. In summary, this analysis confirmed that the validity of the data seems to be acceptable for simple incidence monitoring purposes.

4.2.2 Conceptual model for hip fracture performance monitoring It was more difficult to evaluate the quality of register data for performance assessment purposes, because data validity must be judged against the intended utilization purposes [148]. However, there is no obvious golden-standard that would have told which aspects and issues of performance monitoring should have been evaluated between the data sources [149]. In addition, it was obvious that the quality criteria could change with time. For instance, the detailed diagnosis of hip fracture is more important in the operating room than during the final stages of rehabilitation, i.e. the requirements for correct diagnosis in data may be more important in a surgical ward than in a nursing home. Therefore, in order to make any statements concerning the quality of data, it was essential to first outline the properties of required measures carefully under a fixed conceptual model for performance assessment (determine how data result from the theory), and then evaluate the observed data against these requirements (examine how well the theory can be reconstructed using the properties of actual observed data). The same idea was expressed more explicitly in chapter 2.3.1. In the case of hip fracture treatment, the importance of practical and reliable information on evidence-based performance assessment has been recognized [126], and good experiences of the Swedish Rikshöft-registry have led to the formalization of data production procedures for the purposes of a standardized audit of hip

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

37

4 Register-based data on hips fractures – sources and validity

fractures in Europe (SAHFE) [127]. The suggested prospective collection of clinical data represents the practical “state-of-the-art” consensus for data requirements for assessing the treatment of hip fracture. This work has produced a list of SAHFEvariables which reflect the operationalizations of patient-level concepts that have been considered important in the case of hip fracture. However, the more or less randomly ordered list of variables is not a useful conceptual model. In order to define a conceptual model, the SAHFE-variables were chosen to represent an adequate base for data requirements in terms of patient level concepts. The extended pyramid framework presented in Chapter 2 was used in the systematic characterization of these concepts. Each phenomenon represented by a concept was assumed to be attributable to an individual on a continuous time scale. It was also assumed that each concept can be described using a systems approach with a (limited or unlimited) number of states so that the system is always in some of these states at any time point. All concepts were classified on the basis of a subjective evaluation of potentially suitable measurement scales and types of statistical distributions (theoretical measurement properties). The second classification dimension was the temporal stability of the concept, i.e. it was considered, for example, at what time intervals the concept makes sense and how often the concept should be measured. In addition to these statistical classification principles, a contextual interpretation was linked to each concept in order to make sure that the resulting operationalization offers desirable insights on the phenomenon of interest. The linkage was mainly based on the conceptualization used in SAHFE-metadata, but it was complemented using theoretical models of performance assessment [10, 150, 151]. After this initial classification phase, the concepts were further classified into more general groups in terms of their contextual interpretation, temporal stability and theoretical measurement properties. In the third phase selected measures from the SAHFE-list, from the prospective Kuusankoski data, and from the register data were mapped to the conceptual model and the goodness of the fit in terms of comprehensibility and simplicity of the model was evaluated. These three qualitative phases were repeated until the resulting conceptual model was considered satisfactory. The final conceptual model presented in Table 1 gave a pragmatic mapping between concepts from a performance assessment theory, SAHFE metadata and structured data entry in prospective data and secondary register-based data in terms of contextual interpretation, temporal stability and theoretical measurement properties. Of the model dimensions, the biological constants do not change in time and therefore one measurement is generalizable to all times. Biological events, accident/fall history, and hip fracture event represent dimensions for which actual values of measures are recorded in the proximity of some observable event. All other dimensions relate to phenomena that potentially change in time and should be continuously monitored. In practice continuous monitoring is impossible, and actual measurements of continuously changing phenomena can be made only

38

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

4 Register-based data on hips fractures – sources and validity

Table 1. Conceptual model dimensions and their realizations in different data sources SAHFE measure

Prospective data measure

Register data measure

Biological events

date of birth, death, year of menopause, menarche

date of birth and death

date of birth and death

Biological constants (dna)

sex

sex

sex

Biological facts

Biological measures Physical examinations

height, weight

Laboratory tests

hemoglobin, creatine, albumin, bone density

Demographic history Household composition

living alone

marital status, living alone

Place of living

residential status

residential status, area of living

area of living, level of care1

smoking, alcohol intake

alcohol usage

alcohol- related inpatient care1

pain, psychological state, fear of fall

pain

Diseases/symptoms

comorbidity, complications

complications

Physical functioning

walking

walking, activities of daily living need for care

Cognitive functioning

abbreviated mental test score

Socioeconomic history Education, occupation, economic resources Health related behavior Risk behavior Diet, activities Subjective quality of life Overall well-being Objective need for care recorded diagnoses, use of care1

Use of care Technical aid

walking aids

Medication Health care utilization

use of assistive technology, changes in living environment use of painkillers

provider, type of stay, length of stay

type of care, rehabilitation, dates for hospital visits

provider, type of care, admission and discharge dates, diagnoses, operations

date of fall, place of fall

date of fall, place of fall

place of and reason for hospitalized injury1

Properties of fracture

time of fracture, occurrence place, fracture type, fracture side

day of fracture, reason for fracture, occurrence place, fracture type, fracture side

place of and reason for hospitalized injury

Initial treatment process

hospital ID, admission time, admission source, start of operation, reason for operative delay, type of surgeon and anesthetist, type of anesthetic, operative method, length of surgery, type of prophylaxis, complications in operation, reoperations, time to mobilization, discharge time, discharge destination

admission day, admission source, day of operation, reason for operative delay, operative method, complications in operation, reoperations, discharge day, discharge destination

hospital ID, admission day, admission source, diagnoses, day of operation, operation codes, discharge day, discharge destination

Clinical stability

ASA grade, comorbidities, coexisting fractures, reason for operative delay in medically unfit patients

ASA grade, reason for operative recorded diagnoses, use of care1 delay in medically unfit patients

Accident/fall history Properties of fall Hip fracture event

1) Indirect measurement requiring record linkage.

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

39

4 Register-based data on hips fractures – sources and validity

at selected time points. In the prospective data, the measurement was done in connection with a hip fracture event and for certain measures also at two weeks, at four months, and one year following the fracture. In the register data, the recording took place at each discharge. The constructed conceptual model explicitly revealed commonalities between the prospective and register data. As expected, the content of prospective data was richer in clinical detail than the register data, even after using indirect measures of certain concepts. The conceptual model also showed which dimensions were comparable between the available data sources. In the actual empirical analyses these comparable dimensions were examined by using the available complete prospective data for 106 hip fractures and corresponding register-based data.

4.2.3 Consistency between prospective and register- based data Three types of comparisons between the data sources were used depending on the theoretical measurement properties of the variables. In the type I comparisons, the same rather easily measurable variable was available in both data sources, and detected differences could be further checked from the medical records of the patients. The percentage of cases with a recorded value for the variable (completeness) and proportion of correctly recorded values for cases with some recorded value (accuracy) were calculated for such variables. Completeness and accuracy were very good for age, sex, area of living, admission and discharge information as well as for hip fracture diagnosis and operation day and type. Side of fracture was seldom recorded and accuracy of information on place of accident was poor in the register. In the type II comparisons, conceptually similar measures which had substantially different operationalizations in both data sources were examined. From this category, follow-up information was a particularly important issue for performance assessment purposes. Using the register-based data it was straightforward to extract information on the use of inpatient health services and mortality on a daily basis. The derivation of corresponding information from the prospective data was much more complicated and required extra assumptions and interpolation. The overall follow-up patterns were very similar indicating that the register-based data suits better for the follow-up of use of inpatient health services than prospective data. However, some extra work needed for the prospective collection of follow-up data may be useful in order to produce data on issues that are not available in the register-based data, but offer clinically interesting followup information, such as the use of technical aids, pain, and validated measures of functional disability and quality of life.

40

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

4 Register-based data on hips fractures – sources and validity

Type III comparisons were actually a mixture of type I and type II comparisons. This approach was used to evaluate the more detailed diagnosis of hip fracture. Using a detailed hip fracture classification of femoral neck fracture, trochanteric fracture or subtrochanteric fracture, the overall agreement between prospective and register data—calculated as a sum of equivalent classifications in data sources divided by sample size—was 86.3%. The 95% confidence interval for overall agreement— calculated with a nonparametric bootstrapping technique [152]—was 79.4–92.2%. The proportion of agreements that were not due to chance was measured using a delta coefficient, which is based on the model of multiple-choice tests and avoids certain known deficiencies of the traditionally used kappa coefficient [153]. Kappa may be considered as an approximation of delta, as both yield values that are very similar when at least one marginal distribution is balanced or when both marginal distributions are moderately unbalanced in the same direction [154]. According to the estimated delta coefficient a high proportion 81.3% (95% CI: 71.3–91.3%) of agreements were not due to chance. The delta coefficient was somewhat higher than the kappa coefficient of 73.5% (95% CI: 61.2–85.7%) indicating that the assumptions of kappa were not satisfied. In addition, the accuracy of diagnoses was addressed by considering bone anatomy as a biological entity identifying true fracture status [155]. This made it possible to assume that the trait underlying different fracture types was continuous, and a simple latent trait model in the form of a polychoric correlation coefficient was estimated [156], yielding a value of 0.92 (95% CI: 0.86–0.98) which also reflected a very high consistency between the register and prospective data.

4.2.4 Conclusions on validity In order to analyze the quality of register data for the purposes of performance assessment, a conceptual model had to be constructed. There were several novel aspects in the model. The statistical properties of the concepts had a very important role in the classifications and the special features of different data sources were simultaneously taken into account during the construction process. As a result, the presented conceptual model revealed the most important assumptions affecting the actual results of analyses, and showed how it was possible to produce a pragmatically useful mapping between problem-oriented and data-driven theories. As expected, the register data lacked clinical detail, but had a data structure that allowed the complete observation of inpatient care history, and therefore outperformed prospective data in this respect. The empirical validity analyses revealed that the completeness of registration was very good, but appropriate data abstraction rules should be used in order to avoid overestimation of hip fracture cases. Also the accuracy of the most easily measurable variables and the quality of follow-up information in the register was found to be very good. In addition,

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

41

4 Register-based data on hips fractures – sources and validity

detailed hip fracture classification showed a high consistency between the register and prospective data. As a conclusion, register-based data seemed to be a valid data source for the purposes of the performance assessment of hip fracture treatment. However, there was room for improvements in the register. An inflexible data structure for the performed operations causes problems if a patient has several operations during a single hospitalization. There was also a need for additional data, while especially the validated functional disability or quality of life measure would have been useful. The content of these register data could also be improved by giving standardized instructions that guide practitioners to record the relevant additional operation codes available in the classification, such as on which side the fracture was. Also a voluntary input of additional hip fracture event data to the centralized register should be made possible, because the best option for practical performance assessment purposes would be to combine SAHFE-type of data with the register data.

42

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

5 Hip fracture incidence

5

Hip fracture incidence

The epidemiological component of the hip fracture surveillance system has an important role, because without continuous monitoring of hip fracture incidence it would be difficult to assess the effectiveness of prevention interventions and also to prepare for changes in the need of care caused by changes in the population structure. This chapter is based on articles 3–5. In principle, the Finnish register data are well suited to the hip fracture incidence calculation. Hospital admission and discharge days are easily observable facts (i.e. have stable measurement properties in the terminology presented in chapter 2), which are correctly and completely recorded in the Finnish data. However, in practice it is not straightforward to determine the true number of hip fracture events from the register data. In fact, there are several recognized potential pitfalls in using hospital discharge data for calculating the number of injuries [157, 158]. Even though all hip fractures are identifiable from the hospital discharge data (virtually all hip fracture patients require inpatient hospital treatment and hip fracture is easy to diagnose), there are several discussable details, such as the exact diagnosis codes to be used in the identification, the use of secondary diagnoses in the hip fracture identification, and exclusion of certain nonstandard cases. However, from the methodological perspective a much more important source of bias (which is independent of the use of diagnosis codes) is related to the detection of multiple hospitalizations of a single patient. At the conceptual level this means that all events related to the same underlying disorder should be recognized. A so-called care episode approach provides a sound methodological framework for dealing with multiple hospitalizations in terms of discharge data [159]. In the case of hip fracture, it is known that a typical treatment process starts with an emergency admission to a hospital after the hip fracture event followed by an operation at the surgical ward and continues with rehabilitation at other inpatient wards. With a careful development of data abstractions rules to exclude multiple hospitalizations it was possible to demonstrate that several previous Finnish studies on hip fracture incidence had overestimated the numbers of hip fractures [5]. More specifically, the decision to use the calendar year boundaries to combine multiple hospitalizations in these studies was artificial and had no epidemiological justification, because the underlying disorder had nothing to do with the beginning or end of the year. In fact, it was likely that many patients having their fracture during the final months of each year were erroneously counted as separate cases for two consecutive years [3]. However, these considerations, combined with the problem that even the clinical criterion of a fresh hip fracture may vary, point out that it is impossible to

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

43

5 Hip fracture incidence

obtain a unique estimate for the number of hip fractures using register-based data. This ambiguity reflects the undesirable incompatibility between the theoretical model and data. As it was not possible to change data, there was a need for an alternative theoretical interpretation that would be more compatible with the restrictions of data.

5.1 Aging-related hip fractures It is known that hip fracture incidence increases exponentially with age [160]. This kind of population level functional dependency means that the condition is related to aging [161]. Biological aging can be seen as a complex process occurring stochastically in organs and tissues, which results in irreversible damage accumulation and vulnerability to the failures in maintaining the integrity of tissues and organs [162], and the components of a biological system are prone to aging-related failures in the system [163]. Most aging-related conditions are closely connected to alterations in certain tissue or organ, although a well-known exception is death, which is a result of fatal failure(s) in any vital components of human body. Following this line of though, it became obvious that a hip fracture was also a special case in aging-related conditions. In fact, hip fracture is not just an agingrelated failure in some tissue or organ, but typically a result of an accidental event (such as fall). Cumulative damage in some tissues (such as bone and muscle) increases the probability of (accidental or pathological) fracture, but it is also likely that some aging-related conditions increase the risk of accident. In this sense, hip fracture seemed to capture the effects of non-fatal failures in components of the human body, which most likely appear in the form of some kind of disabilities. Therefore, by drawing a methodological analogy to the event of death, it could be hypothesized that the event of hip fracture actually represents a multidimensional disability event (“disability death”). From this point of view, it was clear that the occurrence of first aging-related (low energetic) hip fracture would be the most interesting one, because it gives the upper limit for the time of development of critical hip fracture risk factors (for a hip-fracture-related chronic disability condition). Therefore, the actual problem of identifying the interesting hip fracture episodes could be transformed to the identification of the first aging-related hip fractures of the patients. This theoretically justifiable approach turned out to be in better concordance with the register data than the identification of all hip fracture events [3]. In terms of chapter 2, an extra interpretation–operationalization phase was needed.

44

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

5 Hip fracture incidence

5.1.1 A method for identifying a first aging-related hip fracture The pragmatic problem in identifying the first hip fractures was that only a limited backward follow-up was possible with the register data, i.e. some fractures could have occurred outside the observation window. A straightforward solution would have been to use all available data for the determination of first hip fractures, but it was considered more convenient to use the same “hip fracture free” clearance period, that is shorter than the minimum backward follow-up time in the data, for all individuals. In order to justify the length of clearance period, it was assumed that there may be a cut-off point after which the probability of having a new hip fracture is reduced to the level it would be even without a preceding hip fracture, i.e. that the preceding hip fracture would be an “uninformative” predictor of the new hip fracture [3]. In the concrete analyses, the first admissions (index points) with the diagnosis of hip fracture in 1998–2002 were identified for each patient. For each index point, backward time to the previous hip fracture admission of the same patient was calculated. Time was measured in months. There were followup data from the period 1987–2002, so the minimum (backward) follow-up time was 11 years. If there was no previous hip fracture admission, the observation was considered to be censored at the beginning of 1987. The discrete hazard probabilities of the form P(T = t | T ≥ t), reflecting the conditional probability for an event occurrence at time t on condition that no event had occurred until t, were estimated for each follow-up time 0 < t ≤ 160 as h(t) =

c(t) , n(t)

where c(t) was the number of events at time t and n(t) was the number of persons at risk at time t. To diminish the degree of random variation, estimated hazard rates were smoothed using a simple moving average technique with polynomial weights [164]. The smoothed hazard rates were referred as observed probabilities. For the calculation of corresponding probabilities in the case that no preceding hip fractures had occurred (expected probabilities), there was first a need for reasonable estimates for the incidence of first hip fractures. It was assumed that clearance periods between one and ten years are reasonable a priori, and that the average incidence between the years 1998 and 2002 can be interpolated to the whole observation period. After the preparation of risk population data (described

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

45

5 Hip fracture incidence

below in chapter 5.3) it became possible to calculate age- and sex-group specific incidence rates by using standard methods [165]. Sexes were analyzed separately and clearance periods of one and ten years were used. A logarithmic transformation was made for the estimated age-group specific incidence rates, which linearized the exponential increase of rates with age. After that, a linear regression model was fitted to the logarithmic incidence rates so that an interpolated incidence rate could be predicted for any age by using the estimated model [166]. For ages below 40 years, a constant incidence rate was used in the prediction, because an assumption of the exponential increase in incidence with age was not satisfied for young ages [167]. As age and sex were known for each index period identified from the data, it was possible to subtract one month from each age and then predict the expected probability of a hip fracture after a one-month (backward) follow-up for each case in the risk population by using the estimated models. As the incidence rates were person-year based, they had to be divided by 12 so that they corresponded to a time period of approximately one month. An average expected probability of hip fracture could then be calculated by summing the expected probabilities across the risk population and finally dividing by the size of the risk population. By subtracting one month more from each age and making the predictions again, an average expected probability of hip fracture after a two-months (backward) followup could be derived. Following the same procedure, expected probabilities up till a 160-month (backward) follow-up were calculated. The observed (solid line) and expected probabilities (dotted lines) are presented in Figure 4. The curve of observed probabilities showed that there was a nonstabilized risk for a hip fracture admission up to about 120 months. The difference between expected probabilities based on one- and ten-year clearance periods was quite small, and the observed risk also remained higher than expected during the first 7–10 years. Based on these analyses, a conservative ten-year clearance period was selected to make sure that the risk was reduced to the same level as that without a preceding hip fracture. The ten-year criterion has also been used in other studies [168].

46

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

5 Hip fracture incidence

Figure 4. Conditional (hazard) probabilities of having a preceding hip fracture as a function of backward time in months from the first fracture in 1998–2002. The smaller picture is a tenfold magnification of the final months. Dotted curves represent the expected probabilities of having a hip fracture with one year (upper) and ten year (lower) clearance periods.

5.2 Risk factor extraction The use of a clearance period in the determination of first hip fractures identified the persons with first hip fracture in addition to the determination of the number of first hip fractures. This made it possible to extract the status of certain hip fracture risk factors available in the data in relation to these patients. Three different risk factor extraction techniques, internal, external, and empirical extraction, were used for this purpose. Internal extraction corresponded to the use of data abstraction rules within the register data. For example, using the Finnish data it was not enough to determine the index hip fracture period, but the actual day of hip fracture had to be inferred using appropriate rules. The basic idea in the rule construction was to find one simple rule at a time that was applicable to the majority of the population, and each additional rule was then developed and applied only for the remaining population. The index period could be a very long stay in residential care, while the diagnosis referred to an accident that happened near the discharge day, or the index period could be a surgical period, while the actual accident had happened

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

47

5 Hip fracture incidence

before the admission. Therefore, the first rule aimed to identify hospitalizations with an operation code for a hip fracture procedure in the proximity of the candidate period, and the second rule identified hospitalizations at the surgical ward. The third rule was developed to identify patients who had died shortly after the first diagnosis. With these rules a reasonable index period could be found for about 96.9% of patients, while the last three percent required more complicated identification strategies. After the detection of the most reasonable index period, the basic covariates (such as the patient’s age, sex, area of living) could be extracted from that period. Type of fracture was determined on the basis of diagnoses from the operative surgical period if possible, because those were most likely the first ones based on xray examination. Further abstraction rules were required for the definition of more complex covariates. For example, a person was classified as a long-term care patient if there were at least 90 days of inpatient care or a recorded administrative decision for long-term care immediately before the fracture. In external extraction, variables linking individuals to aggregate levels (such as area codes) were first internally extracted and then external data describing aggregate units were linked back to each individual with the help of these variables. For example, the area code (municipality) at the index period was used to classify each patient as rural or urban (including semi-urban areas) using the official grouping defined by Statistics Finland (rural municipalities are those municipalities in which less than 60 per cent of the population lives in urban settlements).

Figure 5. Daily numbers of the hip fractures in Finland 1998–2002

48

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

5 Hip fracture incidence

In empirical extraction, the observed data were used to justify the definitions. For example, the Finnish register data allowed accurate calculation of the daily numbers of hip fractures, also separately for long-term care patients. After smoothing with a moving average technique, a small but clear seasonal variation attributable to non-institutionalized persons was detected (Figure 5). Without these kinds of preliminary analyses it would have been more difficult to justify that winter season should last from November to April and correspondingly the summer season from May to October. The method developed above for the definition of the length of clearance period was another example of empirical extraction.

5.3 Risk population data For incidence calculations, data on risk population were also needed, and official population figures (taken on the last day of the years 1997–2002) in 5-year groups were obtained for each municipality. Municipality worked as an aggregate unit in external extraction, and also allowed easy determination of population figures for any combination of these basic units. It was more difficult to determine the risk population for internally extracted risk factors. There were no data available in the official statistics on the numbers of people in long-term institutional care with the required, rather detailed stratification. Therefore, data on long-term institutional care for the period 1997– 2002 were extracted from the Finnish Social and Welfare Care register using the recorded decision for long-term care or more than 90 days of continuous care in a single institution as the extraction criteria (n = 532 169). In principle, these data allowed the calculation of the total numbers of clients in long-term institutional care with the required stratification, as the register included individual-level data on all inpatient hospital and nursing home care in Finland. However, it turned out that the calculation technique used in the derivation of official statistics utilized only census data (collected at the final day of each year), while the number of long-term care patients seemed to be much higher if discharge data were also taken into account [4]. This was examined more closely by cross-tabulating both numbers with the stratification of sex, year, age, and type of institution. The proportion p of census data clients (O) in combined data clients (E) corresponded to the prevalence proportion p = O / E. In order to describe the differences in the proportions and to estimate the missing E for the latest year, it was assumed that the counts O were Poisson distributed and a Poisson regression model was formulated of the type log(O / E) = b0 + b1x1 + b2x2 +···+ bkxk,

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

49

5 Hip fracture incidence

where xi are the covariates and the model parameters βi are log relative risks [169]. The final model estimated with the years 1997–2001 included main effects for age, sex, and type of institution as well as the year-effect for each age-institution combination. The fit of the model was very good in terms of χ2-test, and no oddities were detected in residual plots. As an assumption of binomial distribution instead of Poisson distribution was theoretically more elegant, the model with a log-binomial formulation was also estimated [170]. The point estimates were practically identical, but the variances of the coefficients were somewhat smaller, and turned out to be almost equivalent to the robust variance estimates in Poisson regression. As a conclusion, an easily estimable Poisson regression model resulted in good point estimates, but gave more conservative (wider) confidence intervals. The proportions increased with age and also with year in social institutions, but were smaller among men and in social institutions in general. Reasons for most differences could be traced back to the definition of long-term care, but also to the properties of the register data especially in the Social Welfare Care register. It was concluded that the approach where both census and discharge data had been taken into account was better suited for epidemiological purposes [4]. The practical problem from the incidence calculation point of view was then to calculate the numbers of clients on the final days of the years 1997–2002. That was straightforward for all but the most recent year which required prediction using the estimated model, because no discharge data concerning the year to follow were available. The procedures described above resulted finally in simultaneously observed risk populations on the final day of the years 1997-2002 with stratification by sex, age, urbanity, and institutionalization. A common method has been to interpolate average population at risk by using the mean of observed census data on two consecutive years. In order to generalize the interpolation so as to also incorporate the seasons, the population figures were plotted against the years in each group with stratification by sex, age, institutionalization, and urbanity. Visual inspections suggested that the associations between the years and population figures were rather simple, and that simple curve-linear models could be used to approximate the relationship. Therefore, linear regression models with constant, year, and squared year as exploratory variables were used to summarize the curve-linear trend in the population figures. These models allowed an interpolation of population figures to any day during 1997-2002. These models were then used in the calculation of approximate follow-up times with the required stratification including seasons and years. These denominators were desirable in terms of accuracy, but not perfect, because an assumption of a stable population (no short-term fluctuations in migration or mortality) was needed for interpolating census day population figures to appropriate mean follow-up times [171]. In addition, it was not possible to exclude persons who had had hip fracture during the preceding ten years. Bias

50

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

5 Hip fracture incidence

resulting from keeping the prevalent pool in the risk population was estimated to be small because of low incidence and high mortality, and to be towards conservative estimates (i.e. to underestimation rather than overestimation of group specific incidences).

5.4 Hip fracture incidence between 1998 and 2002 in Finland After the adequate data were finally assembled, it was straightforward to examine incidence rates with standard methods [165]. Standardized incidence was higher among women and long-term care patients, and a long increase in standardized incidence in Finland was found to be stabilized [5]. From the hip fracture surveillance system point of view, it was particularly important to be able to monitor the small-area variation in hip fracture incidence. The estimation of the hospital-district-specific standardized incidence rates was performed by using a hierarchical estimator, because the small sex- and agegroup-specific numbers of events were prone to extensive random variation. More specifically, it was assumed that better estimates (in terms of mean squared error) could be obtained by a weighted sum of area specific estimates and an appropriate guess [172]. In an empirical Bayes-approach the guesses are chosen to be the average estimates obtained without area stratification [173]. The hospital-district-specific weights and group-specific rates were estimated for each year by using an iterative algorithm [172]. According to the results [5], the incidence had been somewhat higher in Helsinki, but otherwise the differences were rather small (Figure 6). The constructed population-based data also allowed the examination of the simultaneous impact of several risk factors on hip fracture incidence [3]. Year was not considered as an interesting factor, so the data from years 1999–2002 (during which the average incidence had remained constant) were combined. A Poisson regression model was used in the analysis [169]. There were a lot of significant interactions between risk factors, and it seemed to be more fruitful to model the cross-tabulated data in the style of log-linear analysis [174] instead of aiming at determining the independent impacts of the risk factors on the incidence by means of epidemiological modeling. In this sense, the modeling goal was to compress the representation of the cross-tabulated data, and all main effects and statistically significant interactions up to the third degree were included in the model. The fit of the final model was reasonable in terms of χ2-test, and residuals did not reveal any exceptional observations. The relaxation of the Poisson assumption by using a negative binomial regression model [175] turned out to be unnecessary as it was after estimation almost equal to the Poisson model, indicating that no significant overdispersion was apparent.

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

51

5 Hip fracture incidence

225

275

325

375

425

Helsinki

1998 1999 2000 2001 2002

Keski-Suomi

1998 1999 2000 2001 2002

Varsinais-Suomi

1998 1999 2000 2001 2002

Hyvinkää-Porvoo

1998 1999 2000 2001 2002

Kymenlaakso

1998 1999 2000 2001 2002

Satakunta

1998 1999 2000 2001 2002

L-Pohja

1998 1999 2000 2001 2002

Jorvi-Peijas

1998 1999 2000 2001 2002

Lohja-L-Uusimaa

1998 1999 2000 2001 2002

Pirkanmaa

1998 1999 2000 2001 2002

Päijät-Häme

1998 1999 2000 2001 2002

Vaasa

1998 1999 2000 2001 2002

P-Karjala

1998 1999 2000 2001 2002

Kanta-Häme

1998 1999 2000 2001 2002

P-Pohjanmaa

1998 1999 2000 2001 2002

I-Savo

1998 1999 2000 2001 2002

Lappi

1998 1999 2000 2001 2002

P-Savo

1998 1999 2000 2001 2002

Keski-Pohjanmaa

1998 1999 2000 2001 2002

E-Savo

1998 1999 2000 2001 2002

E-Karjala

1998 1999 2000 2001 2002

Kainuu

1998 1999 2000 2001 2002

E-Pohjanmaa

1998 1999 2000 2001 2002

200

250

300

350

400

450

Figure Figure 6. Age- 6. and sex-adjusted hip fracture incidence in Finland 1998-2002 by hospital district, Ageand sex-adjusted hip fracture incidence in Finland 1998–2002 by hospital district, rates per 100 000 persons years. Year-specific rates with confidence intervals areeach rates per 100 000 persons years. Year-specific rates with confidence intervals are plotted for plotted for each hospital district, from 1998 (uppermost) to 2002 (lowest). The short vertical hospitalline district, from 1998district (uppermost) to 2002 (lowest). The short vertical line each hospital at each hospital corresponds to the average rate in 1998–2002 in at that hospital

The dashed gives rate the overall mean incidence during 1998–2002 Finland.line gives districtdistrict. corresponds to theline average in 1998-2002 in that hospital district. Theindashed the overall mean incidence during 1998-2002 in Finland.

52

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

5 Hip fracture incidence

As there were quite a lot of parameter estimates, it was assumed that the simplified cross-tabulation based on the model predictions had captured the systematic characteristics of the original data and removed the random noise, and the actual effects were investigated in the terms of group-specific predicted rates. The main results illustrated in Figure 7 indicated that there was a higher hip fracture incidence among older and institutionalized persons. The incidence among women managing at home increased more rapidly that among men, but in institutions the risks were rather similar. The incidence was higher during the winter time for non-institutionalized persons. The effect of urbanization was more complicated suggesting that the effect was different between men and women, and that there was a tendency for higher incidence among institutionalized persons, the youngest of the men and oldest of the women in urban areas. The results could be interpreted from the population aging point of view, and seemed to reflect aging-related disability [3, 5]: Institutionalization is a sign of reduced coping with daily activities [176], while women tend to have more disabilities than men as age increases [177], and the additional hip fractures during winter occur mostly among persons with a good enough functional status to allow walking outdoors [178]. As reduced functional capacity and coping with physical activities of daily living are known to be important risk factors for hip fracture [179], it was hypothesized that hip fracture incidence would be proportional to the prevalence of disability related hip fracture risk factors in the underlying population. The hypothesis was not directly falsified and requires further examination, as a

Men

Men Men



Women Incidence Incidence (logarithmic scale), women(logarithmic scale), women

Incidence Incidence (logarithmic scale), men (logarithmic scale), men 2500 Institutionalized

2500

Institutionalized

1000

Winter Summer

Winter Summer

Urban

Urban 1000

Rural

Rural 400

400

Women

Women

2500

2500 Institutionalized

1000

1000

400

400 Non-institutionalized

50

Non-institutionalized

150

150

Urban

Non-institutionalized

50

Rural

Rural

60

70

60 80

70 90

80 Age

90

50

Winter Summer

60

Age

Non-institutionalized

150

150

Urban

50

Urban Rural

Institutionalized Urban Rural

Winter Summer

70

60 80

70 90

80

90 Age

Age

Figure 7. Hip fracture incidence in Finland in 1999-2001 by sex, age, institutionalization, season, and urbanity. Rates are per 100 000 person years, logarithmic scale. Dotted lines refer to rates Figure 7.urban Hip areas. fracture incidence Finland inby1999-2001 by sex, age, institutionalization, for Figure 7. Hip fracture incidence in Finland inin 1999-2001 sex, age, institutionalization, season,

season,

urbanity. are person per 100years, 000 person years,scale. logarithmic Dotted and urbanity.and Rates are perRates 100 000 logarithmic Dottedscale. lines refer to lines rates refer for to rates for urban areas. urban areas. Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

53

5 Hip fracture incidence

high correlation between the prevalence of outdoor walking ability and hip fracture incidence was detected [3].

5.5 Conclusions on hip fracture incidence monitoring The feasibility of producing information for the purposes of monitoring hip fracture incidence based on Finnish administrative register data was demonstrated in the current chapter. It was pointed out that unwary use of Finnish register data had resulted in biased estimates of the numbers of hip fractures. As there were problems in the reliable identification of all hip fractures, the definition of hip fracture was linked to population aging. Within the renewed theory only the first aging-related hip fracture of each individual had to be found from the data, and a new method was developed to support this purpose. A systematic account of pragmatic problems in the extraction of risk factor and risk population data and in the data analyses was given with proposed solutions. The empirical results suggested that hip fracture incidence would be linked to the prevalence of population level disability. This connection may be important from the monitoring point of view, because under this hypothesis, hip fracture incidence trends would also tell about more general disability trends of the population.

54

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

6 Hip fracture treatment

6 Hip fracture treatment Not all hip fractures can be prevented so it is important that the effectiveness and quality of treatment following hip fracture are continuously evaluated. The basic idea in this kind of outcomes research is to elucidate cause-and-effect relationships between health actions (treatments, interventions) and health gains (outcomes) [150]. Usually large scale empirical investigations that aim to do that are observational studies [180], as it is not possible to assign subjects at random to treatment or control groups, as would be done in a controlled experiment. It is quite common that administrative register data are used in outcomes research [103]. Good examples exist, but the actual assumptions that would justify the use of register data in outcomes research are seldom reported. Such practice makes it impossible to evaluate whether the key assumptions have been uncritically taken as given or whether the assumptions have been carefully tailored to the case under study. Therefore, it was considered important to provide detailed justification for the register-based monitoring of hip fracture treatment, i.e. to aim at finding an adequate compromise between problem-oriented and data-driven theories as proposed in chapter 2.

6.1 State space for a hip fracture care episode The available data consisted of individual-level data on the use of health services for hip fracture patients. Such data reported directly observable events that determine whether an individual had used certain health service at a particular moment in time. The events describing the use of health services connected data to an idea of record linkage, where such observable events are assembled to the “book of life” of an individual [181]. In this sense, data provided descriptions of patientspecific paths through the observable components of the health system. As the aim was not to analyze complete health service usage, but to examine hip fracture related treatment, an episode-of-care approach was adopted for operationalization purposes [182]. A care episode refers to a series of health-related events linked to a particular health problem [183]. That is, the episodes-of-care approach helps to define health-problem-specific boundaries for the health system structure and to identify interesting elements from the care processes [159]. The first task in the reconstruction of hip fracture care episodes was to identify the beginning of the episode. A natural choice for an index event was the surgical hospitalization after the first aging-dependent hip fracture as described in subchapter 5.1. In addition to being compatible with incidence monitoring, a

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

55

6 Hip fracture treatment

restriction to first aging-dependent hip fracture homogenized the study population appropriately. The next task was to categorize the usage data so that the observed events would have a relevant interpretation from the hip fracture point of view. The idea was to construct a state space that could be used to describe possible transitions across a set of care regions in time [184, 185]. Many different ad hoc classifications of health services had been utilized in the analyses of hip fracture treatment [120, 186, 187]. In addition, theories and models describing the processes of care for the elderly were also applicable to the case of hip fracture [188]. More specifically, it has been suggested that care and services intended for older persons could be categorized in terms of the level of care and the need for care [189]. The level of care is a description of the intensity and type of service, while the need for care is attributable to the patient’s characteristics (such as health status, severity of medical condition and functional disabilities). Following this idea, the actual care regions were considered to consist of four different levels of care that were identifiable from the data (in increasing order of intensity): 1) home (including home care, ordinary service houses and outpatient care), 2) nursing home (service houses with 24-hour assistance and residential homes), 3) health center (inpatient ward of local primary care unit), 4) hospital. In addition, an absorbing death state was included to the state space. These levels of care are common to the whole country in the sense that all the more specialized components of local structures can be classified into these main categories. Validation analyses reported in article 2 showed that this categorization was compatible with the one used in a standardized hip fracture audit, and that the register data had excellent properties in this respect. The constructed state space is presented graphically in Figure 8.

Home Nursing home Initial hip fracture hospitalization

Health center Hospital Death

Figure 8. State space for the hip fracture care episode

56

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

6 Hip fracture treatment

6.2 Effectiveness in terms of care process The constructed state space was well suited to the description of care processes. In fact, by using only data that were based on directly observable events, it became possible to produce hard facts about the care episodes. The problem was the lack of theoretical links between the care episode data and the outcomes of care. However, by assuming that the outcome of interest was effectiveness—operationalized as a change in health of the target population—and that the patients were not treated ambiguously, it was possible to hypothesize that the decision to treat a patient in a given care region is associated with the patient’s health status [190-192]. In other words, it was assumed that the transitions between care regions reflected the appraised need for care, which is also a sign of a qualitative change in health status. For instance, an acute admission to a hospital typically reflects a sudden drop in health status of a patient, while a discharge to lower levels of care would be expected to reflect a somewhat improved health status. This assumption was supported by empirical evidence [75, 176] showing that at the population level there had been a strong positive correlation between the intensity-ordered levels of care and the need for care in Finland. Using the notation from chapter 2.3.1, there was a problem-solving task related to effectiveness, where a mental representation for the task solution Iuser(effectiveness) | producer(health service usage), was based on the data Duser(effectiveness) | producer(health service usage), that had been derived from the externally available data Dproducer(health service usage), and that had an acceptable cognitive fit with the internal representation of suitable data Duser(effectiveness). As a conclusion, an appropriate compromise between the data-driven and the problem-driven theories could be reached in the terms of available data.

6.3 Multivariate responses in outcomes research By following the episodes-of-care approach it became clear that the whole realization of the care process was important in performance measurement. This was somewhat conflicting with the traditional outcomes research models that typically deal with univariate response variables or try to combine information from several indicators [96]. In the methodological sense it was obviously more proper to deal with a multivariate response variable that described the whole care process. This posed certain challenges, because no well-established methods were available for concrete analyses. More specifically, the available register data allowed complete observation of realized levels of care on a daily basis for each individual. The care process of an individual j consisted of a sequence of multinomial observations, i.e. corresponded to the repeated categorical response profile

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

57

6 Hip fracture treatment

Yj = Y1j ,..., Ykj , where k is the length of a sequence. One option for data analysis would have been to use a multivariate probability distribution for the responses [193]. However, from a pragmatic point of view it was considered more important to be able to summarize the multivariate responses in a comprehensive, yet intuitively meaningful way. This kind of exploratory perspective for the data analysis of categorical response profiles has been developed within the sequence methods framework, where the basic idea is to analyze whether useful patterns can be found among the sequences [194]. However, no methods matching the targeted purpose were found in the literature. The most relevant studies concentrated on the clustering of sequences to appropriate groups [195, 196], but did not offer much help on the visual summarization of sequences within the groups.

6.3.1 Traceplots for care episode visualization I show here that it was possible to achieve the required comprehensive summarization by simply plotting the actual realizations of patient-specific care episodes. First, it was assumed that some symbol is linked to each possible category. It was obvious that each care episode could then be expressed with a sequence of these symbols. For example with the hip fracture case, if 3 stands for hospital use, 0 for being home, and 4 for death, the sequence

333333300000000000033444444444 tells that the individual was followed for 30 days (one symbol for each day); first she was seven days at the hospital, and then after twelve days at home she had a readmission to hospital where she died. Matters became more complicated when there were sequences for more than one individual. Instead of using numerical symbols, it was beneficial to use colors for telling the realized state. With the hip fractures it was natural to use a gray scale where a darker color reflected the more intensive level of care. By using the colors to indicate the level of care, it was easy to plot diagrams in which each row corresponded to a care episode of one individual. This idea is illustrated in the Figure 9, where the register-based hip fracture care episodes with one-year followup were plotted for the randomly ordered cohort of patients living in the town of Espoo during 1998–2001 (n = 516).

58

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

6 Hip fracture treatment

Figure 9. Unsorted traceplot for the hip fracture care episodes in Espoo 1998-2001. Each row Figure 9. Unsorted traceplot for the hip fracture care episodes in Espoo 1998–2001. Each

corresponds to row a care episodetoof oneepisode patient, which consists of 365 ofdaily realizations corresponds a care of one patient, which consists 365 daily realizations(one year (one year follow-up) of the level of care, where the darker color reflects more intensive level

follow-up) of the level of care, where the=darker reflects more intensive level of=care of care (white = home, light gray nursingcolor home, middle gray = health center, dark gray hospital, black = death).home, middle gray=health center, dark gray=hospital, (white=home, light gray=nursing

black=death). The presentation of care episodes in a random order resulted in a diagram without any simple interpretation. Intuitively, it was clear that the sorting of episodes could reveal some patterns among the sequences. As there were no known solutions to this kind of problem, a heuristic algorithm for the sorting was developed. The basic idea in the algorithm was to measure similarity between the care episodes and to find an optimal location for each episode in the figure. As the aim was to find a visually simple presentation, common sense logic suggested that the episodes with a lot of common realized states had to be similar. This idea was formalized by measuring similarity between two episodes in terms of the number of equal daily states, i.e. as a sum of the daily specific Hamming-distances (zero if the realized states were the same, otherwise one). In the actual algorithm two linked lists were used. The first singly linked list (unsorted list) was initialized to contain all episodes in a random order, and the second doubly circular linked list (sorted list) with a single seed episode consisting of only death statuses. The fundamental loop in the algorithm aimed to move one episode from the unsorted list to an optimal place in the sorted list. More specifically, the unsorted list was looped over one episode at a time, and distances

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

59

6 Hip fracture treatment

Death without discharge to home Death after discharge to home Long-term care at health centre

Discharge to home

Long-term care at nursing home Mixed care episodes

Figure 10. Sorted traceplot for the hip fracture care episodes in Espoo 1998-2001. Each row Figure 10. Sorted traceplot for the hip fracture care episodes in Espoo 1998–2001. Each

corresponds to thecorresponds care episode oneepisode patient, which consists 365 daily (one year row to theofcare of one patient, whichofconsists of 365realizations daily realizations the level care, where darker more color reflects morelevel intensive level follow-up) of(one the year levelfollow-up) of care, of where theofdarker colorthe reflects intensive of care of care (white = home, light gray = nursing home, middle gray = health center, dark gray =

(white=home,hospital, light gray=nursing black = death).home, middle gray=health center, dark gray=hospital, black=death).

between this current candidate episode and the preceding and following episodes at each possible place in the sorted list were calculated. The episode with the smallest distance in the unsorted list was then moved to the place in the sorted list that had given this smallest distance. The fundamental loop was repeated until all episodes from the unsorted list had been moved to the sorted list. A sorted traceplot for the Espoo cohort is presented in the Figure 10. The sorted episodes had a simple structure which identified several qualitatively different types of episodes. In this sense, the face validity of the developed algorithm was pretty good. From the methodological point of view the aim of achieving an easier interpretation from the sorting reminds one about the use of rotation in factor analysis: there was no unique solution and the interpretability of the resulting simple structure was the criterion that finally determined the utility of the solution.

60

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

6 Hip fracture treatment

6.4 Comparing the care episode profiles of subpopulations Even though the illustrations of the realized care processes of total populations based on the factual data were useful as such, there was also a need for comparisons of multivariate responses in different subpopulations as especially the comparisons may tell something about the effects that are of interest. In order to illustrate such comparisons with the hip fracture data, patients who were institutionalized at the time of fracture were selected as a base group. This was an interesting group from the health services point of view, because it could be interpreted as representing the upper boundary of the expected use of resources for hip fracture patients. In addition, the recent Finnish clinical guideline for hip fracture treatment had requested further research on the outcomes of those patients who already were in long-term care at the time of fracture In order to examine how institutionalization changes the treatment to be received, the patients managing at home (at the time of fracture) were considered an interesting group to be compared with the institutionalized patients. More specifically, the patient was defined as home-dwelling at the time of hip fracture if there had not been care during the 30 days before the fracture, and no long-term care in the year before the fracture. This population had a low initial need for care, and it represented the lower boundary for the expected resource usage. After the exclusion of patients aged less than 65 at the time of facture, there were 4325 patients identified as institutionalized at the time of hip fracture in 1998–2001 in Finland. This was about 21.2% of all patients aged 65 or more with the fracture of a proximal femur. The population of home-dwelling patients was larger and consisted of 10 347 patients.

6.4.1 Risk adjustment An observational study is considered biased, if the populations to be compared have differences in observed or unobserved background factors in ways that confound the outcomes of interest [180]. In the strict sense, the comparison between institutionalized and home-dwelling patients was not an observational study, because it was not reasonable to conceive of the groups as interventions and the interest was not on the causal effects of any intervention but on the observed differences between the groups. Anyhow, there were background factors, such as age and sex, whose impact had to be controlled, and an actual method that could be used to control for these confounding variables was needed [197]. Regression models have been widely used for the adjustment [151]. However, in the case of current multivariate responses, that was not a realistic approach. A

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

61

6 Hip fracture treatment

more attractive and pragmatic choice for adjustment was the matching framework [198]. In addition to the fact that matching works with any response (even if the response was still unobserved at the time of matching), it can ensure that the characteristics of the compared populations truly overlap, and that each patient remains intact as a single patient and observed differences directly reflect disparities between real patients and not between any artificial predictions based on estimated model parameters [199]. As the number of covariates to be adjusted was small, a pair directly matched according to sex, fracture type, and age (in years) was chosen for each patient in the institutionalized population (n = 4 325) from the randomly ordered set of homedwelling patients (n = 10 347) without replacement. Perfect matches were found for 98.7% of the patients, while the rest had minimal differences in age. If the number of factors to be matched would have been larger, other matching techniques (such as the propensity score approach) could have been utilized [198].

6.4.2 Summarization of the follow-up data In order to get a comprehensive explorative view of the data, the traceplots describing the care episodes were considered to be potentially useful. As the subpopulation to be compared had a different initial need for care, the main interest was on the summarization of the need for care after the hip fracture. It was also known that the Institutionalized patients

Home-dwelling patients

Figure 11. Traceplots for the hip fracture care episodes of institutionalized patients in Finland in

Figure 11. Traceplots for the hip fracture care episodes of institutionalized patients in Finland in 1998–2001 and for home-dwelling patients matched for age, sex, and fracture corresponds a care episode of to oneapatient, which consists 120 daily realizations (four-month type. Each to row corresponds care episode of one ofpatient, which consists of 120 daily follow-up) of the level of care, where theof darker color reflects more intensive levelcolor of carereflects a realizations (four-month follow-up) the level of care,a where the darker more intensive level of care (white home,gray=health light gray =center, nursing home, middle gray = health (white=home, light gray=nursing home,=middle dark gray=hospital, center, dark gray = hospital, black = death). black=death). 1998-2001 and for home-dwelling patients matched for age, sex, and fracture type. Each row

62

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

6 Hip fracture treatment

levels of care described the intensity of care in an ordinal fashion, and that one step in the levels of care corresponds to approximately a constant step in the functional dependency measures [75, 176]. Therefore, it was assumed that an average level of care over the follow-up period would work as a proxy variable for the need for care of an individual. These averages were calculated and the traceplots were sorted according to them. As Figure 11 reveals, even the straightforward mean sorting resulted in rather simple and interpretable structures. For both subpopulations the sorted order could be interpreted as an approximate continuum from good health (top) to bad health (bottom). As expected, there were huge differences between the institutionalized and home-dwelling patients. Another way to summarize follow-up data in the case of hip fracture treatment was developed in Sweden in the middle-1980s [200]. In these state diagrams, the proportion of patients in each state is calculated for each day, and the daily proportions are then plotted. A logarithmic scale on the time dimension is typically used in order to emphasize the beginning of the follow-up time in relation to its importance in the hip fracture case. It turned out that the state diagrams corresponded to a kind of margin of the traceplot, i.e. could be derived by summarizing the vertical dimension of the traceplot with proportions of patients in different states. The state diagrams in Figure 12 clearly demonstrate the most obvious differences between the institutionalized and home-dwelling patients, such as the higher mortality among the institutionalized patients and the greater proportions of lower levels of care for the home-dwelling patients. In fact, the state diagrams gave an easily interpretable yet comprehensive view of the multivariate response by reporting simultaneously a huge number of measures that could be used as traditional performance indicators, such as day-specific mortalities.

B. Home-dwelling patients (matched pairs of A. Institutionalized patients 100

institutionalized patients)

%

100

80

%

80

60

60 Death

40

Death

40

Hospital

Hospital

Health center

Health center

Nursing home

20

Nursing home

20

Home 0 Before fracture

10

15

30

60 90 120 180 365 Days after fracture (logarithmic scale)

Home 0 Before fracture

10

15

30

60 90 120 180 365 Days after fracture (logarithmic scale)

Figure 12. State diagrams for hip fracture patients aged 65 or more

Figure 12. State diagrams for hip fracture patients aged 65 or more

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

63

6 Hip fracture treatment

The information in the state diagram could be further compressed. An obvious way was to compress the multivariate response to the state indicator that reported the mean proportions of different levels of care during the follow-up after the fracture for both subpopulations. Such state indicators are reported in Table 2. These percentages would correspond to areas in the Figure 12, if a standard time scale (instead of a logarithmic one) had been used in the figure. Table 2. Percentages of days spent at different levels of care during the one-year followup of hip fracture patients Home

Nursing home

Health centre

Hospital

Death

Institutionalized patients

6.3

32.3

26

3.7

31.7

Home-dwelling patients*

54.0

6.6

17.1

4.8

17.6

* Age, sex, and fracture type matched pairs of institutionalized patients.

6.5 Conclusions on monitoring hip fracture treatment In summary, this chapter aimed to provide a justification for the register-based monitoring of hip fracture treatment. First a state-space model for hip fracture treatment that was compatible with the register-based data was developed. Then assumptions which were used to link the developed data-driven model to the theory of effectiveness measurement were discussed. It was pointed out that the combination of theory and data naturally leads to the use of multivariate responses, and an illustration of summarization of multivariate response with novel methodology were given. In addition, an example of adjusted comparisons between multivariable responses was given. Finally it was demonstrated how complicated responses could be reduced to simpler summarizations and further to more traditional indicators. In conclusion, this chapter demonstrated how it was possible to find an adequate compromise between problem-oriented and data-driven theories. This idea can be illustrated within the general framework for register-based health services research that I have proposed [9]. Figure 13 visualizes the general structure of the conceptual framework. The figure should be read from left to right. The components that are closer to each other in the figure are more directly connected in the framework. The two central paths from left to right in the figure correspond to theory-oriented modeling (solid line) and to data-driven modeling (dashed line). It must also be remembered that the selected health problem gives a third and most concrete theory to be utilized in the actual application of the framework.

64

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

6 Hip fracture treatment

Public Health

Equity Quality

Health system

Goals

Effectiveness Outcomes

Structure

information systems

Register-

assessment model

Processes Health

Performance

Care episodes

based data Preprocessing

Event history analysis

Figure 13. Visualization of the conceptual framework for register-based health services research. A solid line reflects the theory-driven approach, and the dashed line the datadriven approach. The approaches are most compatible under the oval.

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

65

7 Operative delay as a performance indicator

7 Operative delay as a performance indicator In the previous chapter it was demonstrated what kind of assumptions were needed to transform register-based data into factual information about hip fracture treatment. In this chapter, it is illustrated what kind of issues may be related to the justification of the use of a performance indicator. This chapter is mainly based on article 6, but the presentation of a model for the provider profiling originates from article 1. A surgical operation is performed for most patients during the acute management of hip fracture. A typical hip fracture patient is confined to hospital bed-rest before the surgical operation. A fairly short delay between hospital admission and surgical operation has been advised in the clinical guidelines [128, 129], but considerable variation in the operative delays between the providers has been reported [201–203]. This discrepancy, which points out a potential area for improvements, makes the operative delay an attractive performance indicator [204]. However, the evidence concerning the timing of surgery in relation to the overall outcomes of hip fracture treatment has been quite mixed. This is unfortunate, because the operative delay should have confirmed effects on outcomes, otherwise improvements may possibly be targeted at issues which do not improve health [205]. Therefore, it was considered important to examine the association between operative delay and mortality more carefully. The data were restricted to patients aged 65 or more and operated on using an internal fixation, a prosthesis, or a total hip replacement during 1998-2001 in Finland. The study population consisted of 16 881 patients, which was 83.3% of all first-time hip fracture patients aged 65 or older in Finland 1998–2001.

7.1 Effects of different operative delays on mortality The first open question was the definition of the time period for delayed surgery [204]. The register data included admission and operation days, so the operative delay could be calculated only with limited accuracy, as the number of nights spent in a hospital before the operation. Time from admission to death was measured in days. The cumulative probabilities of death for different operative delays were calculated using Kaplan-Meier estimators. As can be seen from Figure 14,

66

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

7 Operative delay as a performance indicator

40

Proportion of death patients, % 5+ nights 3 4

30

0 1 2

20

10

0

10 30

60

90

120

180

240

300

365 Days

Figure 14. Cumulative probabilities of unadjusted mortality following hip fracture for different operative delays Figure 14. Cumulative probabilities of unadjusted mortality following hip fracture for different

operative delays waiting times from 0–2 nights had the same effect on mortality, but there was a clear increase in mortality with longer waiting times. Therefore, early surgery was defined as consisting of patients with a 0–2 nights waiting time and late surgery as consisting of patients with at least 3 nights waiting time.

7.2 Profiling of providers Next, it was interesting to compare whether the potential performance indicator (proportion of late surgery patients) varied by hospital. This kind of activity has been commonly referred to as the profiling of providers [206]. It has been pointed out that the results of provider profiling have been commonly interpreted as a ranking of providers, even though the extent of variation attributable to the quality of provider may have been negligible [207], and an appropriate statistical modeling has been considered as a necessary prerequisite for justified conclusions in provider profiling [208]. In fact, it has been suggested that there exist variations between providers for at least three reasons: 1) differences may be attributable to a random variation related to the size of the provider, 2) the patient case-mix varies from provider to provider, and 3) providers may differ in the performance of their care [209].

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

67

7 Operative delay as a performance indicator

There were several commonly applied risk-adjustment methods available for the case of a binary response variable [210]. Following the standard approach, a model for the probability of late surgery was constructed by using a logistic regression model ¥ p ´ log ¦ µ = B 0 + B1x1 + B2x2 + ··· + Bkxk , §1–p¶ where xi are the predictor variables and the model parameters bi are log odds ratios. Age, sex, fracture type, comorbidities, pre-admission residence, inpatient care preceding fracture, admission weekday, and year were used as predictors. The estimated probability of late surgery was calculated for each individual j as pj = logit–1(Bˆ0 + Bˆ 1x1j + Bˆ 2x2j + ··· + Bˆ kxkj), where Bˆi are estimated parameters, and logit–1 (z) =

exp(z) .. 1 + exp(z)

The fit of the model was acceptable yielding a c-statistics value 0.65, indicating a reasonable ability for discrimination. Since the focus in profiling was on providers and not on individuals, the observed (Y) and expected outcomes (p) were then aggregated to the provider level [207], i.e. Oi = Yj

3

and

Ei =

3p, j

where Yj refers to the individual’s observed status of late surgery and the sums are over patients treated by provider i. The ratio of observed to expected numbers of late surgery O / E was then used to quantify the risk-adjusted differences between providers. This quantity can be interpreted as the relative risk of a given provider to have the same share of late surgery patients as would have been expected on average for the similar patient population [210]. However, as the adjustment is practically always only partial, there remains residual variability between the providers that may result in overestimated differences in profiling analyses [211], and it has been recommended to use of hierarchical (random effect) models in the profiling [208]. The advantages of hierarchical modeling include the automatic removal of regression to the mean

68

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

7 Operative delay as a performance indicator

problem by shrinkage, an improved precision for estimates among small providers, an adequate adjustment for multiple comparisons, and a more realistic framework for comparison judgments [212]. The common approach in the modeling of the O / E-ratio has been to assume that the observed counts Oi being non-negative integers describing frequencies of rare events have a Poisson distribution with mean γi , that is Oi ~ Poisson (gi). Two types of hierarchical models based on the Poisson assumption were constructed, a gamma-Poisson model [213] and a Poisson-lognormal model [214]. The gamma-Poisson case was modeled as log gi = log Ei + log(λi), where the expected outcomes Ei adjust the patient-level variation and λi ~ Gamma (a, b) describe the provider-level variation of interest. A vague exponential prior was used for a and a gamma prior for b. The posterior distributions of λi corresponded to the shrunken O / E-ratios and were of primary interest. In the Poisson-lognormal case the model took the form where

log γi = log Ei + θi ,

θi ~ N(µ,σ2), which could have been easily extended to incorporate provider-level covariates. A normal prior was used for µ and a gamma prior for precision 1/σ2 . Here the posterior distributions for exp(θi) were the interesting ones. In principle, the conditions justifying the Poisson approximation for the distribution of Oi were not very well satisfied, because there was a fixed number of patients at each provider and the proportions of late surgery patients were quite large. Therefore, a beta-binomial model [213] and a logistic model [207] were also formulated. For both models it was assumed that Oi ~ Binomial (pi ,ni), where ni stands for the number of patients at provider i. The proportion pi was modeled in the beta-binomial case as and in the logistic case as

Research Report 174 STAKES 2008

pi ~ Beta (a, b),

Methodological Perspectives for Register-Based Health System Performance Assessment

69

7 Operative delay as a performance indicator

where

logit(pi ) = θi ,

θi ~ N (µ,σ2). The gamma priors were used for α and β, and the priors for µ and 1/ σ2 were the same as in the Poisson-lognormal model. The posterior distributions for (ni / Ei)pi gave the shrunken O / E-ratios in both cases. The actual estimation was carried out by using Markov chain Monte Carlo (MCMC) simulation [215]. The mixing of the estimation procedures was examined by using two chains in each estimation, and the convergence was evaluated on the basis of Gelman-Rubin convergence plots [216]. After an initial 1000 burnin iterations, a further 10 000 iterations were used in the actual estimation of the parameters. The complexity and relative fit of the models were assessed with the deviance information criterion (DIC) [217]. The obtained shrunken O / E-ratios were further multiplied by the average share of late surgery patients so that the reporting could be done in terms of easily interpretable risk-adjusted proportions for providers [207]. The actual results were rather similar for all models. In terms of DIC, the fit was somewhat better with binomial assumption than with the Poisson assumption. With the binomial assumption variance tend to be smaller if the provider was small or the proportion of late surgery patients was large. The shrinkage was a bit stronger for models with non-normal random effects distribution. As the purpose was not to rank the providers or identify unusual performance, but to indicate the potentiality for improvements, the most conservative gamma-Poisson model was selected in order to avoid too strong interpretations. The potentiality for improvements was assessed by assuming that the percentage of late surgery patients can be reduced to some potentially achievable level [218]. The level was determined empirically from the provider-specific shrunken estimates by assuming that if 20% of providers have achieved that level or better, the others should also be able to do it. In addition, the 20th centile is about one standard deviation from the mean and can be interpreted to reflect the best practice proportion [213]. As can be seen from the Figure 15, there was extensive variation in the proportions of late surgery between the providers, and the 95% credible intervals crossed the potentially achievable share of 92.7% of early surgery patients only for less than half of the providers.

70

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

7 Operative delay as a performance indicator

0

10

20

30

40

50

60 %

Figure 15. Provider-specific risk-adjusted percentages of patients with latewith hip fracture Figure 15. Provider-specific risk-adjusted percentages of patients late hip surgery. fracture surgery. Thin parts of lines correspond to 95% credible intervals and thick parts correspond to Thin parts of lines correspond to 95% credible intervals and thick parts correspond to 50% credible 50% credible intervals. The diamonds indicate the mean risk-adjusted shrunken proportions intervals. The diamonds indicate the mean risk-adjusted shrunken proportions and the crosses the and the crosses the originally observed proportions. The dotted line refers to the potentially originallyachievable observedpercentage proportions. The dotted line refers to late the potentially (7.7%) of patients with the surgery. achievable percentage (7.7%) of patients with the late surgery.

As the difference between unadjusted mortalities in late and early surgery groups was considerable, it would have been straightforward to infer that the reduction of the operative delay would also diminish the mortality. However, there were no such huge variations in provider-specific mortalities, and the linkage between operative delay and mortality had to be examined more carefully.

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

71

7 Operative delay as a performance indicator

7.3 Adjusted effect of operative delay on mortality In fact, because of the potential biasing effects of different patient characteristics between the early and late surgery group, there was first a need to adjust for these factors. As the data included times from admission to death or end of follow-up, a Cox proportional hazards model were used to control for the observed patient characteristics. A proportional hazard model specifies the hazard for individual j as

λj(t) = λ0(t)exp(b1x1j + b2x2j + ··· + bkxkj), where λ0(t) is a nonnegative function of time t known as baseline hazard, xi are the predictor variables and the model parameters bi are log hazard ratios [219]. In the Cox model, the baseline hazard function λ0(t) is considered as a nuisance parameter, and parameters bi are estimated semi-parametrically without fixing the form of λ0(t). The estimated hazard ratio between the late surgery and early surgery was 1.24 (95% CI: 1.15 to 1.34) as no other covariates were included. The hazard ratio for the late surgery reduced to 1.18 (95% CI: 1.09 to 1.28) after the inclusion of age, sex, fracture type, comorbidities, pre-admission residence, inpatient care preceding fracture, admission weekday, year, and hospital dummies as covariates. Since there were no detailed clinical data available, it was likely that certain relevant covariates were missing from the model. In addition, the patients were known to be nested to hospitals. In the case of survival data, the concept of frailty offers a suitable way to incorporate random effects in the proportional hazards model to account for association and unobserved heterogeneity. A frailty may be considered as an unobserved random factor Z that has a multiplicative effect on the hazard function of an individual or a group of individuals. More specifically, a constant shared frailty model is a conditional independence model having the form

λj(t | Zi) = Ziµi(t)exp(b1x1j + b2x2j + ··· + bkxkj), which corresponds to the hazard function of the individual j belonging to group i given the group specific frailty Zi, where µi(t) is a baseline hazard rate that may or may not depend on i, and b and x are as above [220]. The frailty Z was assumed to have a form of gamma distribution and to vary between hospitals i, but a common baseline hazard λ0(t) was assumed. This model resulted in a slightly smaller hazard ratio 1.17 (95% CI: 1.08 to 1.27) than with the fixed effects model. However, the fixed and random effect models failed to fulfill the assumption of proportional hazards [221]. Therefore, a stratified Cox model, which allowed a

72

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

7 Operative delay as a performance indicator

different baseline hazards for groups of interest but common values for coefficients bi , was estimated with robust sandwich variance estimators which took the clustering by hospitals into account [219]. Stratification was done in terms of age, sex, cancer, dementia, and hospital. The resulting model seemed to fulfill the proportional hazards assumption and gave a hazard ratio estimate of 1.15 (95% CI: 1.03 to 1.29), still indicating a higher mortality among late surgery patients. The plotting of smoothed hazard functions revealed that there was clearly increased risk of death during the four months following the hip fracture in early and late surgery groups. The plotting of excess cumulative mortality (late surgery mortality – early surgery mortality) showed that the difference between the groups first rapidly increased to 3% and then continued to increase until a maximum of about 5% at 1–3 years. Based on these considerations, a binary variable indicating one-year mortality was selected to summarize information in the actual time-todeath variable. It was further hypothesized that logistic regression analyses could approximately replicate the actual results obtained with the Cox model by using a simpler binary response variable. The concrete analyses confirmed this hypothesis to be true. With the binary response it was straightforward to apply a pseudorandomizing instrumental variable approach to the current problem. The basic idea with the approach is to define an instrument variable that causes variation in the treatment variable of interest (early or late surgery in this case), but has no direct effect on the response variable (one-year mortality), because it then becomes possible to estimate how much the variation in the treatment variable induced by the instrument affects the response variable [222]. The pseudo-randomized assignment of patients into the early and late surgery groups was done by using the admission day of the week as an instrumental variable [223, 224]. This variable had an effect on operative delay, but no direct effect on mortality in the case of hip fracture [225]. In practice, the approach took a simple form: to test whether the risk-adjusted one-year mortality was equal for each admission day of the week. It turned out that, according to a χ2-test with null hypothesis of equal mortalities for each admission day of the week, the difference in adjusted one-year mortality was not attributable to the operative delay. This result was in contrast with the results from the Cox models above.

7.4 Provider-level hypotheses The clear difference of the estimates of the operative delay effect on mortality between the methods indicated that there was a need for additional methods to explain such differences. Fortunately, the results pointing out the huge variation between providers in the shares of late surgery patients made it possible to formulate novel hypotheses to address the problem. First, it was assumed that

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

73

7 Operative delay as a performance indicator

an acceptable delay in hip fracture surgery corresponds to the clinical decision to postpone the operation and that the clinical decisions remain nearly constant within providers. Under this assumption, the potentially achievable lower limit in the proportion of late surgery patients could also be interpreted as the upper limit for the proportion of acceptable delayed patients. Correspondingly, the expected proportion of unacceptable delayed patients was the proportion of late surgery patients exceeding this upper limit. This inference resulted in a hypothesis stating that the overall mortality of hip fracture patients should increase with the rising share of the late surgery patients given that the longer operative delay would have an adverse effect on mortality. Another novel provider-level hypothesis was that the long-term mortality of the late surgery patients is higher if only the patients who are unfit for surgery are delayed, since the unfit condition for the surgery is also a risk factor for one-year mortality. In order to test these hypotheses, the provider-level heterogeneity was further examined in terms of simultaneous provider specific shares of the late surgery patients and one-year mortality. The provider-specific overall mortalities were decomposed into mortalities of early and late surgery patients by recognizing the fact that the weighted average of these mortalities gives the overall mortality, i.e. that moverall (p) = [p · mlate (p)] + [(1 – p) · mearly (p)], where p corresponds to the proportion of late surgery patients and m(p) to mortality with proportion p of late surgery patients. It was straightforward to calculate these three (adjusted) mortalities for each provider, and use a scatter plot between the share of late surgery patients and mortalities to describe the associations. In order to extract trends from these associations, it was, after preliminary examinations, assumed that the trends between the share of late surgery patients and overall as well as early surgery mortalities are linear, i.e. that they have the forms and

moverall (p) = a + bp, mearly (p) = c + dp,

where p is the proportion of late surgery patients, and a, b, c and d are regression coefficients to be estimated from the observed data. Late surgery mortality could then be calculated deterministically from the weighted average formula given above. Due to the extensive random variation in mortality figures among small providers, there was a need for hierarchical modeling giving shrunken estimates. More specifically, it was assumed that better estimates (in terms of mean squared error) could be obtained by a weighted sum of observed mortalities and appropriate guesses, i.e. that

74

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

7 Operative delay as a performance indicator

si = wioi + (1 – wi)ei, where si is the shrunken mortality estimate for provider i, oi corresponds to observed and ei to guessed mortality, and wi is the (probability) weight to be estimated from the data [173]. The guesses were chosen to be the trends extracted from the data by using the formulas given above, and the weights and shrunken estimates were calculated by using Stein estimation in the form of an iterative algorithm [172]. This simultaneous examination of the provider-level proportions of the late surgery patients and one-year mortality illustrated in Figure 16 revealed that there was an almost flat (positive) trend between the greater share of late surgery patients and one-year overall mortality. This confirmed that the actual independent effect of operative delay on mortality is very small. Another result indicated that the smaller proportion of late surgery patients was clearly (nonlinearly) associated with a higher mortality for these patients. In other words, the providers with low proportions of late surgery patients had been able to select the patients with such severity that immediate operation would not have been clinically justified.

One year mortality, % 70 60 50 40 30 20 10 0

0

10

20

30

40

50

Share of patients with late surgery, %

Figure 16. A scatter plot of adjusted mortality for the operated hip fracture patients and the Figure 16. of A hip scatter plotpatients of adjusted mortality operated hip fracture patients and the share of share fracture with late surgeryfor for the different providers. (The triangles describe the overall mortality of the operated hip fracture patients for each provider, and the crosses hip fracture patients with late surgery for different providers. (The triangles describe the overall the mortality of the late surgery patients for each provider. The line with a positive slope is mortality of the hip fracture patients each provider, the crosses the mortality of the the trend of operated overall mortality, and the dottedfor curve describes theand association between the share ofpatients late surgery patients and mortality across providers.)slope is the trend of overall late surgery for each provider. The line withthea positive

mortality, and the dotted curve describes the association between the share of late surgery patients and mortality across the providers.)

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

75

7 Operative delay as a performance indicator

The provider-level associations were further analyzed for five severity groups of patients by using the same methodology. The severity was measured as predicted one-year mortality for observed patient characteristics, and groups were formed by dividing the patients into five equivalently sized classes on the basis of sorted severity score. Intuitively speaking, the least severe patients were young, coming from home and were without severe medical conditions. Correspondingly the most severe patients were older, with much comorbidity and coming from residential care. For the patients in the least severe group, mortality was higher for the late surgery patients for all providers. Probably this indicated that in this group, the hipfracture-related mortality was caused by the medical problems which necessitated late surgery and the prolonged surgical delay did not itself increase the mortality. For the patients in the most severe group, mortality was also higher for the late surgery patients for all providers. However, for this group the late surgery mortality was lower in the providers with a small share of late surgery patients and there were no differences in early surgery mortalities between the providers. It seems that in this group, it was essential to perform early surgery for all patients who could bear it, since the significantly prolonged surgical delay makes the patient’s condition worse and increases mortality. For groups 2–4 the interpretation was more difficult than for extreme patients. Since the mortality of early surgery patients increased significantly and was even higher than the mortality of the late surgery patients for the providers with a large share of late surgery patients, it seems that in these groups an operation too early may have caused more harm than the prolonged waiting time. In summary, these results actually gave reasonable explanations for the mixed results in the existing literature.

7.5 Conclusions on operative delay as a performance indicator The analyses presented above demonstrated that methods that have been traditionally used may result in biased results if used uncritically in the case of register-based data, with such a bias being difficult to detect. The problem was that the register data did not include detailed clinical data about the individualspecific factors that may have postponed the operation, and that the actual selection mechanism related to becoming a late surgery patient varied between the providers. In this case the formulation of novel provider-level hypotheses helped to reveal these associations indirectly from the register-based data. In conclusion, the average effect of operative delay on mortality was small, but the provider-level association between the share of late surgery patients and non-optimal treatment was obvious, and the proportion of patients with a prolonged waiting time for a hip fracture operation seems to be an effective quality indicator, since it clearly indicates an area for improvement.

76

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

8 Discussion

8 Discussion As a statistician and a scientist I was quite enthusiastic when I first participated in a study that used Finnish administrative registers as a data source. The massive amount of data had been shown to be of good quality and with an internationally exceptional coverage as a result of unique individual identification numbers allowing easy linkages, offering excellent opportunities for valuable health services research. There were many good examples of such research, and it seemed that the administrative data could be used effectively as long as the potential pitfalls were avoided. However, practical data analyses quickly revealed that the meaningful use of data was very challenging, and that the most fundamental parts of the analysis corresponded more closely to a qualitative than to a quantitative methodological approach. This troubled me, because—even though it was possible to successfully use administrative data in health services research—the links between scientific theories and secondary data were mostly extremely fuzzy. It was also clear that the data sources and therefore the methodological problems abroad were not identical to Finnish ones, and the optimal solution for the problems in a Finnish context obviously requires that the methodology for a solution is developed from the Finnish context point of view. In this sense, the main question in the utilization of register-based data turned out to be: How the use of secondary register data could be justified by using strong methodological arguments tailored to the Finnish context? In this thesis I have tried to give an answer to that question. The first aim of this study was to develop a methodological framework that helps in the effective utilization of register-based data. The problem was approached from a multidisciplinary methodological perspective. Chapter 2 gave a brief review to the methodological approaches, issues, and assumptions that had a fundamental role in the more applied work of later chapters. In this sense, Chapter 2 could also be interpreted as representing summarizations of those philosophic-methodological ideas that are likely to be fruitful in the characterization and solving of the concrete problems in register-based data analysis in general, and thereby sketching the core of the framework. Two main ideas of the framework should be noted. First is the aim to find suitable compromises between the problem-oriented and data-driven theories, and the second is the empirical justification of theory-driven operationalized definitions, i.e. the confirmation that the theoretical constructs are in concordance with the register-based data. Both of these ideas were used throughout this study, combined with the idea of minimizing the overall bias via the maximization of the use of the most factual data, and reflected the need for data sensitive methodological

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

77

8 Discussion

thinking. In this sense, the framework is not just about the methods, but about the use of the right methods in the right situations. There are two potential problems with the framework: 1) the framework is not a sufficient condition for the register-based data analysis, because a strong and more or less tacit background knowledge on multidisciplinary methodological issues (including application experience) is a prerequisite for a full scale understanding and evaluation of the approach, and 2) the framework is not a necessary condition for a register-based data analysis, because it may be possible to end up with similar results without any knowledge of the framework. However, as the presented framework included a multidisciplinary collection of extensions of well-grounded methodological approaches, it is likely that a skilled methodologist would find at least some ideas helpful in the efficient utilization of register-based data analysis. The second aim was to demonstrate how register-based data can be used as a data source for the performance monitoring of a health system, i.e. to connect the methodological developments to the practical needs of information and further to concrete health policy goals, such as the “steering by information” policy regime. The actual demonstration included four components. The first was the validity of register data, the second concentrated on the epidemiological perspective of the monitoring system, the third on the monitoring of treatment in terms of care episodes, and the fourth gave an example on the justification of a more detailed performance indicator. At the end of each chapter the main conclusions were summarized. From the methodological point of view, each component aimed to find an acceptable compromise between the theory and the data. Especially the first three components also included the tailoring of conceptual definitions and assumptions. Data sensitive empirical analyses had a central role in each component, and throughout these chapters the empirical results were novel contributions to the medical literature. The development of a comprehensive hip fracture monitoring system was such a huge task, that it was not possible to include everything into this thesis. In fact, several manuscripts developing issues further are already under preparation. These include a review article about prerequisites in register-based research, several papers that extend the care episode ideas presented in Chapter 6 or develop more detailed risk-adjustment methods, and working papers that compare the costs of treatment [226] and model the lengths of stay in treatment processes [227]. In this sense, the practical work is still in progress, but the perspectives given in this thesis introduce a necessary basis for further work and help in the routine implementation of a hip fracture monitoring system [228]. It should be noted that this study concentrated on methodological issues in the wide sense. The main idea in the analyses was to solve a problem, not to use some particular model or technique. Statistical thinking was the key that made it possible to deal with multidisciplinary methodological ideas at a sufficiently abstract level. After an extraction of the “wireframe model”, the actual search for

78

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

8 Discussion

a suitable method or analytical approach was in some cases straightforward, but occasionally there was also a need to extend the available methods or even create new ones. In this sense, the methodological problems considered important were not on the (mathematical) details of any single model, but on finding appropriate methodological solutions to various interactively encountered problems. This kind of approach naturally required a lot of preliminary and sensitivity analyses, but it was feasible to report only a small selection of those, and mostly the ones that were required to justify the use of the selected models. The drawback in this kind of underreporting is that the actual process of transforming register-based data into useful information may seem to be easier than it really was [3]. All in all, the methodological approach demonstrated in this thesis represents an alternative style of reasoning for scientific research utilizing register-based data. This style of reasoning could be characterized as methodological constructionism or—in the sense of desire for factual data for data analyses—as hermeneutic positivism. But that is another story.

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

79

References

References 1.

Sund R: Utilisation of administrative registers using scientific knowledge discovery. Intelligent Data Analysis 2003, 7(6):501–519. 2. Sund R, Nurmi-Lüthje I, Lüthje P, Tanninen S, Narinen A, Keskimäki I: Comparing properties of audit data and routinely collected register data in case of performance assessment of hip fracture treatment in Finland. Methods Inf Med 2007, 46(5):558–566. 3. Sund R: Utilization of routinely collected administrative data in monitoring of aging dependent hip fracture incidence. Epidemiol Perspect Innov 2007, 4(2). 4. Sund R, Kauppinen S: Kuinka laskea ikääntyneiden pitkäaikaisasiakkaiden määriä rekisteritietojen perusteella? Sos Laaketiet Aikak 2005, 42(2):137–144. 5. Sund R: Lonkkamurtumien ilmaantuvuus Suomessa 1998-2002. Duodecim 2006, 122(9):1085–1091. 6. Sund R, Liski A: Quality effects of operative delay on mortality in hip fracture treatment. Qual Saf Health Care 2005, 14(5):371–377. 7. Sund R: Huono-osaisuus tiedollisena haasteena. In: Toinen Tieto. Kirjoituksia Huono-osaisuuden Tunnistamisesta. Edited by Hänninen S, Karjalainen J, Lahti T. Helsinki: Sosiaali- ja terveysalan tutkimus- ja kehittämiskeskus; 2005. 8. Sund R, Nylander O, Palonen T: Raa’asta rekisteriaineistosta terveyspoliittisesti relevanttiin informaatioon. Yhteiskuntapolitiikka 2004, 69(4):372–379. 9. Sund R: Terveydenhuoltojärjestelmän vaikuttavuuden arvioinnista. Yhteiskuntapolitiikka 2005, 70(5):507–513. 10. Aday LA, Begley CE, Lairson DR, Slater CH: Evaluating the Healthcare System: Effectiveness, Efficiency, and Equity. Second Edition. Chicago: Health Administration Press; 1998. 11. Gray JAM: Evidence-Based Healthcare. How to Make Health Policy and Management Decisions? New York: Churchill Livingstone; 1997. 12. Brailer DJ: Management of knowledge in the modern health care delivery system. Jt Comm J Qual Improv 1999, 25(2):55– 67.

80

Methodological Perspectives for Register-Based Health System Performance Assessment

13. Tierney WM: Improving clinical decisions and outcomes with information: a review. Int J Med Inform 2001, 62(1):1–9. 14. Härö AS (ed.): Planning Information Services for Health. Decision-SimulationApproach. Health Services Research by the National Board of Health in Finland 22. Helsinki: National Board of Health in Finland; 1981. 15. Nenonen M, Nylander O, Rintanen H, Suominen T: What might a health information system look like? Inform Prim Care 2002, 10:135–138. 16. White KL: The ecology of medical care: origins and implications for populationbased healthcare research. Health Serv Res 1997, 32(1):11–21. 17. Roos NP: Establishing a population data-based policy unit. Med Care 1999, 37(6 Suppl):JS15–26. 18. Gissler M: Routinely collected registers in Finnish health research. In: Statistics, Registries, and Science Experiences from Finland. Edited by Alho J. Keuruu: Statistics Finland; 1999. 19. Shani M: The impact of information on medical thinking and health care policy. Int J Med Inform 2000, 58-59:3–10. 20. Bevan JM: Some general remarks on the nature and purpose of medical information systems. Statistician 1972, 21(4):237–243. 21. Berg M, Goorman E: The contextual nature of medical information. Int J Med Inform 1999, 56(1-3):51–60. 22. Friedman HP, Goldberg JD: Knowledge discovery from databases and data mining: New paradigms for statistics and data analysis. Biopharmaceutical Report 2000, 8(2):1–12. 23. Han J, Kamber M: Data Mining: Concept and Techniques. San Fransisco: Morgan Kaufman; 2001. 24. Tukey JW: The future of data analysis. Ann Math Statist 1962, 33(1):1–67. 25. Chambers JM: Greater or lesser statistics: A choice for future research. Stat Comput 1993, 3:182–184. 26. Huber PJ: Massive datasets workshop: four years after. J Comput Graph Stat 1999, 8:635–652.

Research Report 174 STAKES 2008

References

27. St Leger AS, Schnieden H, WalsworthBell JP: Evaluating Health Services’ Effectiveness. Milton Keynes: Open University Press; 1992. 28. Wallgren A, Wallgren B: Register-Based Statistics: Administrative Data for Statistical Purposes. Chichester: John Wiley & Sons; 2007. 29. Elmasri R, Navathe SB: Fundamentals of Database Systems. Third Edition. Reading: Addison-Wesley; 2000. 30. Marshall JT: Canada’s national vital statistics index. Popul Stud 1947, 1(2): 204–211. 31. Nordbotten S: A statistical file system. Stat Tidskr 1966, 4(2):99–103. 32. Wegman EJ: Huge data sets and the frontiers of computational feasibility. J Comput Graph Stat 1995, 4(4):281–295. 33. Hastie T, Tibshirani R, Friedman J: The Elements of Statistical Learning; Data Mining, Inference, and Prediction. New York: Springer-Verlag; 2001. 34. Bishop CM: Pattern Recognition and Machine Learning. New York: Springer; 2006. 35. Hand DJ, Mannila H, Smyth P: Principles of Data Mining. Cambridge: The MIT Press; 2001. 36. Fayyad U, Piatetsky-Shapiro G, Smyth P: The KDD process for extracting useful knowledge from volumes of data. Commun ACM 1996, 39(11):27–34. 37. Shearer C: The CRISP-DM model: the new blueprint for data mining. Journal of Data Warehousing 2000, 5(4):13–22. 38. Silberschatz A, Tuzhilin A: What makes patterns interesting in knowledge discovery systems. IEEE T Knowl Data En 1996, 8(6):970–974. 39. Cartwright DW, Armknecht PA: Statistical uses of administrative records. Rev Public Data Use 1980, 8(1):13–27. 40. Irvine J, Miles I, Evans J (eds.): Demystifying Social Statistics. London: Pluto Press; 1979. 41. David M: Access to data: the frustration and utopia of the researches. Rev Public Data Use 1980, 8:327–337. 42. Hand DJ: Measurement Theory and Practice: The World Through Quantification. London: Hodder Arnold; 2004. 43. Faber J, Scheper WJ: Social scientific explanations? On Quines’s legacy and contextual fallacies. Qual Quant 2003, 37:135–150.

Research Report 174 STAKES 2008

44. Langefors B: Essays on Infology. Summing Up and Planning for the Future. Gothenburg: Department of Information Systems, University of Göteborg; 1993. 45. Hand DJ: Data, metadata and information. Stat J UN Econ Comm Eur 1993, 10:143– 151. 46. Sundgren B: Making statistical data more available. Int Stat Rev 1996, 64(1):23– 38. 47. Khatri V, Vessey I, Ram S, Ramesh V: Cognitive fit between conceptual schemas and internal problem representations: The case of geospatio-temporal conceptual schema comprehension. IEEE T Prof Commun 2006, 49(2):109–127. 48. Peuquet DJ: Representations of Space and Time. New York: The Guilford Press; 2002. 49. Codd EF: Data models in database management. ACM SIGMOD Record 1981, 11(2):112–114. 50. Armstrong DJ: The quarks of objectoriented development. Commun ACM 2006, 49(2):123–128. 51. Mylopaolos J: Information modeling in the time of the revolution. Inform Syst 1998, 23(3/4):127–155. 52. Chandrasekaran B, Josephson JR, Benjamins VR: What are ontologies, and why do we need them? IEEE Intelligent Systems 1999, 14(1):20–26. 53. Froeschl KA, Grossmann W, Del Vecchio V (eds.): The Concept of Statistical Metadata. Edinburgh: A Network of Excellence for Harmonising and Synthesising the Development of Statistical Metadata; 2003. 54. Mennis JL, Peuquet DJ, Qian L: A conceptual framework for incorporating cognitive principles into geographical database representation. Int J Geogr Inf Sci 2000, 14(6):501–520. 55. Gissler M, Haukka J: Finnish health and social welfare registers in epidemiological research. Norsk Epidemiologi 2004, 14(1):113–120. 56. Lenzerini M: Data integration: A theoretical perspective. In: Proceedings of 21th ACM SIGMOD-SIGACTSIGART Symposium on Principles of Database Systems. New York: ACM Press; 2002: 233–246. 57. Hernández MA, Stolfo SJ: Real-world data is dirty: Data cleansing and the merge/purge problem. Data Min Knowl Disc 1998, 2(1):1–31.

Methodological Perspectives for Register-Based Health System Performance Assessment

81

References

58. Gu L, Baxter R, Vickers D, Rainsford C: Record Linkage: Current Practice and Future Directions. CMIS Technical Report 03/83. Canberra: Commonwealth Scientific and Industrial Research Organication (CSIRO); 2003. 59. Fellegi IP, Sunter AB: A Theory for record linkage. J Am Stat Assoc 1969, 64(328):1183–1210. 60. Rahm E, Bernstein PA: A survey of approaches to automatic schema matching. VLDB J 2001, 10:334–350. 61. Rahm E, Do HH: Data cleaning: problems and current approaches. IEEE Data Eng Bull 2000, 23(4):3–13. 62. Fellegi IP, Holt D: A systematic approach to automatic edit and imputation. J Am Stat Assoc 1976, 71(353):17–35. 63. Bruni R: Error correction for massive datasets. Optim Method Softw 2005, 20(2–3):291–310. 64. Barbará D, DuMouchel W, Faloutsos C, Haas PJ, Hellerstein JM, Ioannidis Y, Jagadish HV, Johnson T, Ng R, Poosala V et al: The New Jersey data reduction report. IEEE Data Eng Bull 1997, 20(4):3–42. 65. Shahar Y: A framework for knowledgebased temporal abstraction. Artif Intell 1997, 90:79–133. 66. Pyle D: Data Preparation for Data Mining. San Fransisco: Morgan Kaufmann; 1999. 67. Hand DJ: Data mining: New challenges for statisticians. Soc Sci Comput Rev 2000, 18(4):442–449. 68. Box GEP: Science and statistics. J Am Stat Assoc 1976, 71(356):791–799. 69. Maclure M, Schneeweiss S: Causation of bias: the episcope. Epidemiology 2001, 12(1):114–122. 70. Greenland S: Multiple-bias modelling for analysis of observational data (with discussion). J R Stat Soc Ser A Stat Soc 2005, 168:267–306. 71. Breiman L: Statistical modeling: the two cultures (with discussion). Stat Sci 2001, 16(3):199–231. 72. Jensen D: Data snooping, dredging and fishing: the dark side of data mining. SIGKDD Explorations 2000, 1(2):52– 54. 73. Thomas RK: Society and Health. Sociology for Health Professionals. New York: Kluwer Academic / Plenum Publishers; 2003. 74. Järvelin J: Health Care Systems in Transition: Finland. Copenhagen:

82

Methodological Perspectives for Register-Based Health System Performance Assessment

75.

76.

77.

78. 79.

80.

81.

82.

83.

84.

European Observatory on Health Care Systems; 2002. Kauppinen S, Forss A, Säkkinen S, Noro A, Voutilainen P: Care and Services for Older People 2002. Official Statistics of Finland, Social Security 2003:1. Helsinki: National Research and Development Centre for Welfare and Health; 2003. Häkkinen U, Lehto J: Reform, change, and continuity in Finnish health care. J Health Polit Policy Law 2005, 30(1– 2):79–96. OECD: OECD Reviews of Health Systems. Finland. Paris: Organisation for economic co-operation and development; 2005. Laaksovirta TH: Tiedonvälittymisen trendit ja informaatio-ohjaus. Yhteiskuntapolitiikka 2000, 65(4):355–358. Smith PC, Häkkinen U: Information strategies for decentralization. In: Decentralization in Health Care: Strategies and Outcomes. Edited by Saltman R, Bankauskaite V, Vrangbaek K. Milton Keynes: Open University Press; 2006. Rauhala P-L, Simpura J, Uusitalo H: Hyvinvoinnin tutkimusperinteet, hyvinvointipolitiikka ja 1990-luvus uudet avaukset. Yhteiskuntapolitiikka 2000, 65(3):191–207. Nylander O, Ståhle P, Nenonen M: Informaatio-ohjauksesta tietointensiiviseen vuorovaikutukseen terveydenhuollon kehittämisessä. Yhteiskuntapolitiikka 2003, 68(1):3–18. WHO: The World Health Raport 2000, Health Systems: Improving Performance. Geneva: World Health Organization; 2000. Naylor CD, Iron K, Handa K: Measuring health system performance: problems and opportunies in the era of assessment and accountability. In: Measuring Up Improving Health System Performance in OECD Countries. Edited by Smith PC. Paris: Organization for Economic Co-operation and Development; 2002: 13–34. Hurst J: Performance measurement and improvement in OECD health systems: overview of issues and challenges. In: Measuring Up Improving Health System Performance in OECD Countries. Edited by Smith PC. Paris: Organization for Economic Co-operation and Development; 2002:35–54.

Research Report 174 STAKES 2008

References

85. Eddy DM: Performance measurement: problems and solutions (with discussion). Health Aff 1998, 17(4):7– 41. 86. Loeb JM: The current state of performance measurement in health care. Int J Qual Health Care 2004, 16 Suppl 1:i5–9. 87. Arah OA, Westert GP, Hurst J, Klazinga NS: A conceptual framework for the OECD Health Care Quality Indicators Project. Int J Qual Health Care 2006, 18 Suppl 1:5–13. 88. Rajavaara M: Vaikuttavuusyhteiskunta. Sosiaalisten Olojen Arvostelusta Vaikutusten Todentamiseen. Sosiaali- ja Terveysturvan Tutkimuksia 84. Helsinki: Kansaneläkelaitos; 2007. 89. Sulkunen P: The welfare state and the state of knowledge: The Finnish case. In: Research on the Study of the Nordic Welfare State Renvall Institute Publications 16. Edited by Marjanen J, Stenius H, Vauhkonen J. Helsinki: Renvall Institute; 2004:58–74. 90. Klazinga N, Stronks K, Delnoij D, Verhoeff A: Indicators without a cause. Reflections on the development and use of indicators in health care from a public health perspective. Int J Qual Health Care 2001, 13(6):433–438. 91. Freeman T: Using performance indicators to improve health care quality in the public sector: a review of the literature. Health Serv Manage Res 2002, 15(2):126–137. 92. Mattke S, Epstein AM, Leatherman S: The OECD Health Care Quality Indicators Project: history and background. Int J Qual Health Care 2006, 18 Suppl 1:1–4. 93. Mainz J: Developing evidence-based clinical indicators: a state of the art methods primer. Int J Qual Health Care 2003, 15 Suppl 1:i5–11. 94. Bird SM, Cox D, Farewell VT, Goldstein H, Holt T, Smith PC: Performance indicators: good, bad, and ugly. J R Stat Soc Ser A Stat Soc 2005, 168(1):1–27. 95. Berg M, Meijerink Y, Gras M, Goossensen A, Schellekens W, Haeck J, Kallewaard M, Kingma H: Feasibility first: developing public performance indicators on patient safety and clinical effectiveness for Dutch hospitals. Health Policy 2005, 75(1):59–73. 96. Ades AE, Sutton AJ: Multiparameter evidence synthesis in epidemiology

Research Report 174 STAKES 2008

97.

98.

99.

100. 101. 102.

103.

104.

105.

106.

and medical decision-making: current approaches. J R Stat Soc Ser A Stat Soc 2006, 169(1):5–35. Hauck K, Street A: Performance assessment in the context of multiple objectives: a multivariate multilevel analysis. J Health Econ 2006, 25(6):1029– 1048. Lipsey MW: Theory as method: small theories of treatments (with discussion). In: Research Methodology: Strengthening Causal Interpretations of Nonexperimental Data. Edited by Sechrest L, Perrin E, Bunker J. Washington DC: U.S. Department of Health and Human Resources; 1990: 33– 60. Walshe K: Understanding what works-and why--in quality improvement: the need for theory-driven evaluation. Int J Qual Health Care 2007, 19(2):57–59. Potvin L, Champagne F: Utilization of administrative files in health research. Soc Indic Res 1986, 18:409–423. Connell FA, Diehr P, Hart LG: The use of large data bases in health care studies. Annu Rev Public Health 1987, 8:51–74. Romano PS, Luft HS: Getting the most out of messy data: Problems and approaches for dealing with large administrative data sets. In: Medical Effectiveness Research Data Methods. Edited by Grady ML, Schwartz HA. Rockville: Agency for Health Care Policy and Research; 1992: 57–75. Powell AE, Davies HT, Thomson RG: Using routine comparative data to assess the quality of health care: understanding and avoiding common pitfalls. Qual Saf Health Care 2003, 12(2):122–128. Motheral B, Brooks J, Clark MA, Crown WH, Davey P, Hutchins D, Martin BC, Stang P: A checklist for retrospective database studies—report of the ISPOR Task Force on Retrospective Databases. Value Health 2003, 6(2):90–97. Schwartz HA: Medical effectiveness research methods and data. In: Medical Effectiveness Research Data Methods. Edited by Grady ML, Schwartz HA. Rockville: Agency for Health Care Policy and Research (AHCPR); 1992: 1–3. Wray NP, Ashton CM, Kuykendall DH, Hollingsworth JC: Using administrative databases to evaluate the quality of medical care: a conceptual framework. Soc Sci Med 1995, 40(12):1707–1715.

Methodological Perspectives for Register-Based Health System Performance Assessment

83

References

107. Donabedian A: Evaluating the quality of medical care. Milbank Mem Fund Q 1966, 44(3):Suppl:166–206. 108. Handler A, Issel M, Turnock B: A conceptual framework to measure performance of the public health system. Am J Public Health 2001, 91(8):1235– 1239. 109. STAKES: From Values to Choices. Report of the Working Group on Health Care Prioritisation. Helsinki: National Research and Development Centre for Welfare and Health; 1995. 110. Stenvall J, Syväjärvi A: Onks Tietoo? Valtion Informaatio-Ohjaus Kuntien Hyvinvointitehtävissä. Helsinki: Valtiovaranministeriö; 2006. 111. Hämäläinen H, Gissler M, Muuri A: Sosiaali- ja Terveydenhuollon Tietouudistus 2005. Palvelujärjestelmä ja Sen Toimivuus -ryhmän raportti. Helsinki: Sosiaali- ja terveysministeriö; 2004. 112. Nenonen M, Nylander O: A theoretical framework for health information systems. Int J Healthcare Technology and Management 2002, 4(6):467–477. 113. OECD: A Disease-Based Comparison of Health Systems. What is Best and at What Cost? Paris: Organisation for Economic Co-operation and Development; 2003. 114. Liporace FA, Egol KA, Tejwani N, Zuckerman JD, Koval KJ: What’s new in hip fractures? Current concepts. Am J Orthop 2005, 34(2):66–74. 115. Zuckerman JD: Hip fracture. N Engl J Med 1996, 334(23):1519–1525. 116. Robinson CM, Court-Brown CM, McQueen MM, Christie J: Hip fractures in adults younger than 50 years of age. Epidemiology and results. Clin Orthop Relat Res 1995(312):238–246. 117. Kannus P, Sievänen H, Palvanen M, Järvinen T, Parkkari J: Prevention of falls and consequent injuries in elderly people. Lancet 2005, 366(9500):1885– 1893. 118. Marks R, Allegrante JP, Ronald MacKenzie C, Lane JM: Hip fractures among the elderly: causes, consequences and control. Ageing Res Rev 2003, 2(1):57–93. 119. Heithoff KA, Lohr KN (eds.): Hip Fracture: Setting Priorities for Effectiveness Research. Washington, DC: National Academy Press; 1990.

84

Methodological Perspectives for Register-Based Health System Performance Assessment

120. Currie CT, Hoy D, Tierney AJ, BryanJones J, Lapsley I: HipMod: Development of a multi-agent audit-based computer simulation of hip fracture care. Health Inf J 2003, 9(3):183–197. 121. Noro A, Rissanen P, Laitinen O, Hammar T, Pöyry P: Lonkkamurtumapotilaiden hoito- ja kotiuttamiskäytännöt ja kotihoidossa selviäminen. Gerontologia 2000, 14(2):83–88. 122. Huusko T, Karppi P, Avikainen V, Kautiainen H, Sulkava R: Significant changes in the surgical methods and length of hospital stay of hip fracture patients occurring over 10 years in Central Finland. Ann Chir Gynaecol 1999, 88(1):55–60. 123. Heikkinen T, Willig R, Hänninen A, Koskinen K, Mannismäki P, Alavaikko A, von und zu Fraunberg M, Jalovaara P: Hip fractures in Finland—a comparison of patient characteristics and outcomes in six hospitals. Scand J Surg 2004, 93(3):234–240. 124. Nurmi I, Narinen A, Lüthje P, Tanninen S: Functional outcome and survival after hip fracture in elderly: a propective study of 106 consecutive patients. J Orthopaed Traumatol 2004, 5(1):7–14. 125. Nurmi I, Narinen A, Lüthje P, Tanninen S: Cost analysis of hip fracture treatment among the elderly for the public health services: a 1-year prospective study in 106 consecutive patients. Arch Orthop Trauma Surg 2003, 123(10):551–554. 126. Treml J, Kroker PB: Hip fracture audit: time for standards to be evidence-based. Int Orthop 2000, 24(4):181–183. 127. Parker MJ, Currie CT, Mountain JA, Thorngren KG: Standardised audit of hip fracture in Europe (SAHFE). Hip Int 1998, 8(1):10–15. 128. Duodecim: Käypä hoito suositus. Lonkkamurtumapotilaiden hoito. Duodecim 2006, 122(3):358–379. 129. SIGN: Prevention and Management of Hip Fracture in Older People. Edinburgh: Scottish Intercollegiate Guidelines Network (SIGN); 2002. 130. Lääkintöhallitus: Yleissairaaloiden käyttö Suomessa vuonna 1960. Lääkintöhallituksen Tutkimuksia 1. Helsinki: Lääkintohallitus; 1966. 131. Slätis P: Tapaturmapotilaiden Sairaansijatarve. Yleissairaaloiden Käytön Analyysi Tapaturmapotilaiden Osalta Vuonna 1968 ja Ennuste Vuodeksi 1980.

Research Report 174 STAKES 2008

References

132.

133. 134. 135.

136.

137.

138.

139.

140.

141.

142.

Lääkintöhallituksen Tutkimuksia 10. Helsinki: Lääkintöhallitus; 1972. Alhava EM, Puittinen J: Fractures of the upper end of the femur as an index of senile osteoporosis in Finland. Ann Clin Res 1973, 5(6):398–403. Mattila K: Reisiluun yläosan murtuman ilmaantuvuus Suomessa. Sos Laaketiet Aikak 1983, 20:53–59. Lüthje P: Incidence of hip fracture in Finland. A forecast for 1990. Acta Orthop Scand 1985, 56:223–225. Simonen O: Incidence of femoral neck fractures: Senile osteoporosis in Finland in the years 1970-1985. Calcif Tissue Int 1991, 49:S8–S10. Kurttio P, Gustavsson N, Vartiainen T, Pekkanen J: Exposure to natural fluoride in well water and hip fracture: a cohort analysis in Finland. Am J Epidemiol 1999, 150(8):817–824. Kannus P, Niemi S, Parkkari J, Palvanen M, Vuori I, Järvinen M: Hip fractures in Finland between 1970 and 1997 and predictions for the future. Lancet 1999, 353(9155):802–805. Hokkanen E, Vauramo E, Mikkola P, Sippo-Tujunen I: Poistoprojektin Loppuraportti. Aiheita 45/1993. Helsinki: Sosiaali- ja terveysalan tutkimus- ja kehittämiskeskus; 1993. Keskimäki I, Teperi J, Aro S, Pohjanlahti J-P: Leikkaustoimenpiteiden Yleisyyden Alueellinen Vaihtelu Suomessa. Stakes, Raportteja 83. Helsinki: Sosiaali- ja terveysalan tutkimus- ja kehittämiskeskus; 1993. Rissanen P,NoroA: Ikääntyvien Potilaiden Hoito- ja Kotiuttamiskäytännöt. Rekisteripohjainen Analyysi Aivohalvausja Lonkkamurtumapotilaista. Aiheita 44/1999. Helsinki: Sosiaali- ja terveysalan tutkimus- ja kehittämiskeskus; 1999. Rissanen P, Sund R, Nordback I, Rousi T, Idänpään-Heikkilä U: Lonkkamurtuman Hoidon Vaikuttavuuden Rekisteriperusteinen Mittaaminen ja Vertailu. Aiheita 21/2002. In. Helsinki: Sosiaali- ja terveysalan tutkimus- ja kehittämiskeskus; 2002. Rissanen P, Sund R, Linna M, IdänpäänHeikkilä U, Rousi T, Nordback I: Vaikuttaako sairaalassa suoritettujen leikkausten määrä lonkkamurtuman hoidon tuloksiin? Suomen Lääkärilehti 2003, 58(12):1419–1423.

Research Report 174 STAKES 2008

143. Keskimäki I, Aro S: Accuracy of data on diagnoses, procedures and accidents in the Finnish Hospital Discharge Register. Int J Health Sci 1991, 2(1):15–21. 144. Lüthje P, Nurmi I, Kataja M, Heliövaara M, Santavirta S: Incidence of pelvic fractures in Finland in 1988. Acta Orthop Scand 1995, 66(3):245–248. 145. Pajunen P, Koukkunen H, Ketonen M, Jerkkola T, Immonen-Räihä P, KärjäKoskenkari P, Mähönen M, Niemelä M, Kuulasmaa K, Palomäki P et al: The validity of the Finnish Hospital Discharge Register and Causes of Death Register data on coronary heart disease. Eur J Cardiovasc Prev Rehabil 2005, 12(2):132–137. 146. Tolonen H, Salomaa V, Torppa J, Sivenius J, Immonen-Raiha P, Lehtonen A: The validation of the Finnish Hospital Discharge Register and Causes of Death Register data on stroke diagnoses. Eur J Cardiovasc Prev Rehabil 2007, 14(3):380–385. 147. Huusko TM, Karppi P, Avikainen V, Kautiainen H, Sulkava R: The changing picture of hip fractures: dramatic change in age distribution and no change in ageadjusted incidence within 10 years in Central Finland. Bone 1999, 24(3):257– 259. 148. Sorensen HT, Sabroe S, Olsen J: A framework for evaluation of secondary data sources for epidemiological research. Int J Epidemiol 1996, 25(2):435–442. 149. Chin MH, Muramatsu N: What is the quality of quality of medical care measures? Rashomon-like relativism and real-world applications (with discussion). Perspect Biol Med 2003, 46(1):5–23. 150. Kane RL (ed.): Understanding Health Care Outcomes Research. Gaithersburg: Aspen Publishers; 1997. 151. Iezzoni LI: Risk Adjustment for Measuring Health Care Outcomes. Third Edition. Chigago: Health Administration Press; 2003. 152. Graham P, Bull B: Approximate standard errors and confidence intervals for indices of positive and negative agreement. J Clin Epidemiol 1998, 51(9):763–771. 153. Andres AM, Marzo PF: Delta: A new measure of agreement between two raters. Br J Math Stat Psychol 2004, 57(1):1–19.

Methodological Perspectives for Register-Based Health System Performance Assessment

85

References

154. Andres AM, Marzo PF: Chance-corrected measures of reliability and validity in K x K tables. Stat Methods Med Res 2005, 14(5):473–492. 155. Audige L, Bhandari M, Hanson B, Kellam J: A concept for the validation of fracture classifications. J Orthop Trauma 2005, 19(6):401–406. 156. Olsson U: Maximum likelihood estimation of the polychoric correlation coefficient. Psychometrika 1979, 44(4): 443–460. 157. Langley J, Stephenson S, Cryer C, Borman B: Traps for the unwary in estimating person based injury incidence using hospital discharge data. Inj Prev 2002, 8(4):332–337. 158. Boufous S, Williamson A: Reporting of the incidence of hospitalised injuries: numerator issues. Inj Prev 2003, 9(4): 370–375. 159. Hornbrook MC, Hurtado AV, Johnson RE: Health care episodes: definition, measurement and use. Med Care Rev 1985, 42(2):163–218. 160. Melton LJ, 3rd: Epidemiology of hip fractures: implications of the exponential increase with age. Bone 1996, 18(3 Suppl):121S–125S. 161. Brody JA, Schneider EL: Diseases and disorders of aging: an hypothesis. J Chronic Dis 1986, 39(11):871–876. 162. Holliday R: The multiple and irreversible causes of aging. J Gerontol A Biol Sci Med Sci 2004, 59(6):B568–572. 163. Gavrilov LA, Gavrilova NS: The reliability theory of aging and longevity. J Theor Biol 2001, 213(4):527–545. 164. Wang J-L: Smoothing hazard rate. In: Encyclopedia of Biostatistics Second Edition. Edited by Armitage P, Colton T, vol. 7. Chichester: John Wiley & Sons; 2005: 4986–4997. 165. Kirkwood BR, Sterne JAC: Essential Medical Statistics. Second Edition. Malden: Blackwell Science; 2003. 166. Johnell O, Gullberg B, Allander E, Kanis JA: The apparent incidence of hip fracture in Europe: a study of national register sources. MEDOS Study Group. Osteoporos Int 1992, 2(6):298–302. 167. Brody JA, Grant MD: Age-associated diseases and conditions: implications for decreasing late life morbidity. Aging Clin Exp Res 2001, 13(2):64–67. 168. Giversen IM: Time trends of age-adjusted incidence rates of first hip fractures: a

86

Methodological Perspectives for Register-Based Health System Performance Assessment

169.

170.

171.

172. 173.

174. 175. 176.

177.

178.

179.

180. 181. 182.

register-based study among older people in Viborg County, Denmark, 1987–1997. Osteoporos Int 2006, 17(4):552–564. Frome EL, Checkoway H: Use of Poisson regression models in estimating incidence rates and ratios. Am J Epidemiol 1985, 121(2):309–323. Barros AJ, Hirakata VN: Alternatives for logistic regression in cross-sectional studies: an empirical comparison of models that directly estimate the prevalence ratio. BMC Med Res Methodol 2003, 3:21. Schlaud M, Brenner MH, Hoopmann M, Schwartz FW: Approaches to the denominator in practice-based epidemiology: a critical overview. J Epidemiol Community Health 1998, 52 Suppl 1:13S–19S. Greenland S: Principles of multilevel modelling. Int J Epidemiol 2000, 29(1): 158–167. Efron B, Morris C: Stein’s estimation rule and its competitors--An empirical Bayes approach. J Am Stat Assoc 1973, 68(341):117–130. Agresti A: Categorical Data Analysis. New York: Wiley; 2002. Hilbe JM: Negative Binomial Regression. Cambridge: Cambridge University Press; 2007. Laukkanen P, Karppi P, Heikkinen E, Kauppinen M: Coping with activities of daily living in different care settings. Age Ageing 2001, 30(6):489–494. Murtagh KN, Hubert HB: Gender differences in physical disability among an elderly cohort. Am J Public Health 2004, 94(8):1406–1411. Finsen V: Improvements in general health among the elderly: a factor in the rising incidence of hip fractures? J Epidemiol Community Health 1988, 42(2):200–203. Walter LC, Lui LY, Eng C, Covinsky KE: Risk of hip fracture in disabled community-living older adults. J Am Geriatr Soc 2003, 51(1):50–55. Rosenbaum PR: Observational Studies. Second Edition. New York: SpringerVerlag; 2002. Dunn HL: Record linkage. Am J Public Health 1946, 36:1412–1416. Rosen AK, Mayer-Oakes A: Episodes of care: theoretical frameworks versus current operational realities. Jt Comm J Qual Improv 1999, 25(3):111–128.

Research Report 174 STAKES 2008

References

183. Solon JA, Feeney JJ, Jones SH, Rigg RD, Sheps CG: Delineating episodes of medical care. Am J Public Health Nations Health 1967, 57(3):401–408. 184. Navarro V: A systems approach to health planning. Health Serv Res 1969, 4(2):96– 111. 185. Fetter RB, Thompson JD: A decision model for the design and operation of a progressive patient care hospital. Med Care 1969, 7(6):450–462. 186. Intrator O, Berg K: Benefits of home health care after inpatient rehabilitation for hip fracture: health service use by Medicare beneficiaries, 1987–1992. Arch Phys Med Rehabil 1998, 79(10):1195– 1199. 187. Braithwaite RS, Col NF, Wong JB: Estimating hip fracture morbidity, mortality and costs. J Am Geriatr Soc 2003, 51(3):364–370. 188. Chaussalet TJ, El-Darzi E: Modelling the process of care. Health Care Manag Sci 2001, 4(1):5–66. 189. Lagergren M: ASIM: a system for monitoring and evaluating the longterm care of the elderly and disabled. Health Serv Res 1993, 28(1):27–44. 190. Smallwood RD, Murray GR, Silva DD, Sondik EJ, Klainer LM: A medical service requirements model for health system design. Proc IEEE Inst Electr Electron Eng 1969, 57(11):1880–1887. 191. Smallwood RD, Sondik EJ, Offensend FL: Toward an integrated methodology for the analysis of health-care systems. Oper Res 1971, 19(6):1300–1322. 192. Tilquin C: Modeling health services systems. Med Care 1976, 14(3):223–240. 193. Jokinen J: Fast estimation algorithm for likelihood-based analysis of repeated categorial responses. Comput Stat Data Anal 2006, 51:1509–1522. 194. Abbot A: Sequence analysis: New methods for old ideas. Annu Rev Sociol 1995, 21:93–113. 195. McVicar D, Anyadike-Danes M: Predicting successful and unsuccessful transitions from school to work by using sequence methods. J R Stat Soc Ser A Stat Soc 2002, 165:317–334. 196. Cadez I, Heckermann D, Meek C, Smyth P, White S: Model-based clustering and visualization of navigation patterns on a web site. Data Min Knowl Disc 2003, 7:399–424.

Research Report 174 STAKES 2008

197. Greenland S, Morgenstern H: Confounding in health research. Annu Rev Public Health 2001, 22:189–212. 198. Rubin DB: Matched Sampling for Causal Effects. Cambridge: Cambridge University Press; 2006. 199. Rosenbaum PR, Silber JH: Matching and thick description in an observational study of mortality after surgery. Biostatistics 2001, 2(2):217–232. 200. Holmberg S, Thorngren KG: Rehabilitation after femoral neck fracture. 3053 patients followed for 6 years. Acta Orthop Scand 1985, 56(4):305–308. 201. Freeman C, Todd C, Camilleri-Ferrante C, Laxton C, Murrell P, Palmer CR, Parker M, Payne B, Rushton N: Quality improvement for patients with hip fracture: experience from a multisite audit. Qual Saf Health Care 2002, 11(3):239–245. 202. Heikkinen T, Willig R, Hänninen A, Koskinen K, Mannismäki P, Alavaikko A, von und zu Fraunberg M, Jalovaara P: Hip fractures in Finland--a comparison of patient characteristics and outcomes in six hospitals. Scand J Surg 2004, 93(3):234–240. 203. Bottle A, Jarman B, Aylin P, Taylor R: Some way to go for consistent implementation of guidance on hip fracture. BMJ 2004, 328(7448):1097. 204. Fairbank J, Goldacre MJ, Mason A, Wilkinson E, Fletcher J, Amess M, Eastwood A, Cleary R: Health Outcome Indicators: Fractured Proximal Femur. Report of a Working Group to the Department of Health. Oxford: National Centre for Health Outcomes Development; 1999. 205. Brook RH, McGlynn EA, Cleary PD: Quality of health care. Part 2: measuring quality of care. N Engl J Med 1996, 335(13):966–970. 206. Brand DA, Quam L, Leatherman S: Medical practice profiling: concepts and caveats. Med Care Res Rev 1995, 52(2):223–251. 207. Goldstein H, Spiegelhalter D: League tables and their limitations: Statistical issues in comparisons of institutional performance (with discussion). J R Stat Soc Ser A Stat Soc 1996, 159(3):385– 443. 208. Christiansen CL, Morris CN: Improving the statistical approach to health care

Methodological Perspectives for Register-Based Health System Performance Assessment

87

References

209.

210.

211.

212.

213.

214.

215. 216.

217.

218.

88

provider profiling. Ann Intern Med 1997, 127(8 Pt 2):764–768. Normand S-LT, Glickman ME, Gatsonis CA: Statistical methods for profiling providers of medical care: issues and applications. J Am Stat Assoc 1997, 92(439):803–814. DeLong ER, Peterson ED, DeLong DM, Muhlbaier LH, Hackett S, Mark DB: Comparing risk-adjustment methods for provider profiling. Stat Med 1997, 16(23):2645–2664. Burgess JF, Jr., Christiansen CL, Michalak SE, Morris CN: Medical profiling: improving standards and risk adjustments using hierarchical models. J Health Econ 2000, 19(3):291–309. Ohlssen DI, Sharples LD, Spiegelhalter DJ: A hierarchical modelling framework for identifying unusual performance in health care providers. J R Stat Soc Ser A Stat Soc 2007, 170(4):865–890. Howley PP, Gibberd R: Using hierarchical models to analyse clinical indicators: a comparison of the gamma-Poisson and beta-binomial models. Int J Qual Health Care 2003, 15(4):319–329. Marshall EC, Spiegelhalter DJ: Institutional performance. In: Multilevel Modelling of Health Statistics. Edited by Leyland AH, Goldstein H. Chichester: John Wiley & Sons; 2001: 127–142. Gilks WR, Richardson S, Spiegelhalter D: Markov Chain Monte Carlo in Practice. Boca Raton: Chapman & Hall; 1996. Brooks SP, Gelman A: Alternative methods for monitoring convergence of iterative simulations. J Comput Graph Stat 1998, 7:434–455. Spiegelhalter D, Best NG, Carlin BP, van der Linde A: Bayesian measures of model complexity and fit (with discussion). J R Stat Soc Ser B 2002, 64:583–639. Gibberd R, Hancock S, Howley P, Richards K: Using indicators to quantify the potential to improve the quality of

Methodological Perspectives for Register-Based Health System Performance Assessment

219. 220. 221.

222.

223.

224.

225.

226.

227.

228.

health care. Int J Qual Health Care 2004, 16(Suppl 1):i37–43. Therneau TM, Grambsch PM: Modeling Survival Data. Extending the Cox Model. New York: Springer; 2000. Hougaard P: Analysis of Multivariate Survival Data. New York: Springer; 2000. Grambsch PM, Therneau TM: Proportional hazards tests and diagnostics based on weighted residuals. Biometrika 1994, 81(3):515–526. Newhouse JP, McClellan M: Econometrics in outcomes research: the use of instrumental variables. Annu Rev Public Health 1998, 19:17–34. Ho V, Hamilton BH, Roos LL: Multiple approaches to assessing the effects of delays for hip fracture patients in the United States and Canada. Health Serv Res 2000, 34(7):1499–1518. McGuire KJ, Bernstein J, Polsky D, Silber JH: The 2004 Marshall Urist award: delays until surgery after hip fracture increases mortality. Clin Orthop Relat Res 2004, 428(428):294–301. Bell CM, Redelmeier DA: Mortality among patients admitted to hospitals on weekends as compared with weekdays. N Engl J Med 2001, 345(9):663–668. Liski A, Sund R: A new register based method for comparing the costs of health care episodes. In: Nordic meeting on register-based health research: September 22-23 2005; Helsinki, Finland; 2005. Riihimäki J, Vehtari A, Sund R, Lampinen J: A Bayesian nonparametric approach for modelling the lengths of stay in a treatment process. In: The international conference of the Royal Statistical Society (RSS 2006): 10–14 September 2006; Queen’s University Belfast, Northern Ireland, Great Britain; 2006. Sund R: Lonkkamurtuman hoitoketjun arviointi kehittyy. CHESS Online 2005 (3):9.

Research Report 174 STAKES 2008

I

Intelligent Data Analysis 7 (2003) 501–519 IOS Press

501

Utilisation of administrative registers using scientific knowledge discovery Reijo Sund

National Research and Development Centre for Welfare and Health (STAKES), P.O.Box 220, 00531 Helsinki, Finland Tel.: +358 9 3967 2469; Fax: +358 9 3967 2485; E-mail: reijo.sund@stakes.fi Received 1 November 2002 Revised 11 December 2002 Accepted 25 February 2003 Abstract. The volume of data being produced for administrative purposes is increasing rapidly. Data must be analysed in order to extract useful information to support decision making. The demand for evidence-based information means that the analysis must be conducted according to the principles of scientific research. Unfortunately, the massive second-hand data sets seem not to fit very well into the traditional methodological paradigm. A secondary data source imposes limitations on the formulation of a problem and concepts, because the measurement can only be based on existing data. The aim of this paper is to present a methodological framework for the utilisation of administrative registers in the creation of scientifically valid information. This is done by discussing fruitful methodological aspects encountered in the practical knowledge-discovery process. The ideas presented originate from many different fields, such as statistics, data mining and sociology. The emphasis lies on understanding connections between problem, data and analysis in the case of massive secondary administrative data sources. Keywords: Scientific research process, secondary data, administrative registers, event history analysis, statistical data mining, health care provider profiling

1. Introduction Data have been produced for hundreds of years. The reasons for such production were originally administrative in nature. There was a need for systematically collected numerical facts on a particular subject. Later, a belief in the advantages of quantitative information accompanied by advances in statistical techniques led to the birth of a widely used methodological paradigm for scientific research. Even more recently, advances in information technology have made it possible to more effectively collect and store larger and larger data sets. Administrative information systems have been at the forefront of data gathering, since there have been a growing demand for evidence-based information to support decision making and other administrative purposes. In many cases, however, raw data as such are of little value. Data must be analysed in order to convert them to useful information. Unfortunately data-driven analysis and the massive size of data sets seem not to fit very well into the traditional methodological paradigm. A traditional way of creating scientifically valid information is to design and carry out a scientific study, in which data production, analysis and reporting are all strictly problem-specific. Data represent tailored measurements of observational reality that are needed in problem solving. Analysis is then used 1088-467X/03/$8.00  2003 – IOS Press. All rights reserved

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

502

R. Sund / Utilisation of administrative registers using scientific knowledge discovery

to reveal ‘how things work’ in empirical data. By interpreting these empirical results in the theoretical context and discussing relevant limitations, it is hopefully possible to create a sophisticated answer to a problem in such a way that it can be evaluated by the scientific community. The most important challenge for the traditional paradigm is the nature of data. If it is not possible to produce tailored data according to the problem in question, the whole problem-solving process becomes more restricted. The secondary data sources utilised impose limitations on possibilities for technical analysis. Moreover, the formulation of a problem and the associated concepts inevitably becomes opportunistic, because the measurement can only be based on existing data [6]. For some researchers, even the validity of data is suspect if they were not produced for the specific purpose for which they are used. Such criticism is not necessarily based on evidence concerning the quality of the information contained by the data but rather on philosophical beliefs regarding the connection between data origin and data quality [35]. In addition, the growing size of data sets has raised some issues that must be dealt with in a different way: data exploration and description plays a more general role than it does in the case of small data sets; efficient algorithms (and computers) are needed in analyses; the unfeasibility of manual analysis results in separation between the data and the analyst; the manual management of all errors in the data is not possible; and the chance occurrence of patterns is more likely in large data sets. In addition, some databases include the whole population and not just a sample, which changes the nature of the whole statistical inference [13]. In other words, in the analysis of secondary data, the problem must be defined under preconditions arising from the data. Moreover, massive data sets are usually so overwhelming that their processing and analysis is difficult (‘data overload syndrome’). It may also be tempting to screen the data for significant associations without having an adequate prior hypothesis (‘snooping/dredging/fishing’). Rather than tackling too much on what can and can not be done, analysis should achieve an appropriate compromise between the practical problem and the data. This kind of activity has been characterised as ‘greater statistics’, which tends to be inclusive, eclectic with respect to methodology, closely associated with other disciplines and also practised by many non-statisticians [3]. Moreover, the larger the data sets are, the more important the general-science and collaboration aspects of the analysis process seem to become relative to the ‘statistical’ aspects. Since statistical research has traditionally focused on probabilistic inference based on mathematics, it has not been able to offer very much concrete help in these different circumstances. The practical need for information has led to the development of alternative ways of analysing data, such as data mining [9]. However, these strategies for analysing data have much in common, and formal statistical expertise also provides an excellent basis for the understanding and evaluation of these ‘new’ ideas. Statisticians ought to take advantage of the situation, get involved in interdisciplinary activities, learn from the experience, expand their own minds – and thereby their field – and act as catalysts for the dissemination of insights and methodologies [15]. 1.1. The aim of the paper The aim of this paper is to present a methodological framework for the utilisation of administrative registers in the creation of scientifically valid information. This is done by discussing the fundamental methodological issues encountered in the practical knowledge discovery process. The ideas presented originate from many different fields, such as statistics, data mining and sociology. The paper emphasises a broader understanding of connections between problem, data and analysis in the case of massive secondary data sources.

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

R. Sund / Utilisation of administrative registers using scientific knowledge discovery

Debate Context

Problem Question Perspective

Idea Theory

503

Data

Method Analysis

Answer

Fig. 1. Research process schema.

The development of this paper is heavily influenced by the context of health services research, a field in which the most valuable official data sets are individual-based. In these circumstances, the histories of individuals recorded in administrative data are the most general starting point for any analysis – and a fruitful starting point for this paper, too. A practical real-world example – used to demonstrate the research made by following the guidelines presented in the paper – is also taken from the context of health services research. 2. The scientific knowledge-discovery process The ultimate goal in all research is – or at least should be – to search for appropriate answers to relevant problems. To find sensible and scientifically valid solutions requires not only a knowledge of the phenomena being investigated but also the capacity to think originally about the nature of inference and the process of conceptualisation. The whole problem-solving process must be conducted in a systematic way, using appropriate methods. Even though it is not in general possible to split research process completely into distinct phases, certain common phases are encountered repeatedly and even simultaneously during the research process. The process does not proceed in such a way that one phase is completed first, after which one moves on to the next. Findings made during the research process inevitably have effects on the final conclusions, since these preliminary findings provide a deeper understanding of the phenomenon and data in question. One sophisticated characterisation of complex problem-solving, is known as a knowledge-discovery process (see e.g. [2,4,8]). However, despite the practical usefulness of a process description of this kind, some critical issues concerning the scientific nature of the research process are not mentioned at all. To see what is actually happening in the research process, it is helpful to place it in a wider context. For this purpose, an alternative schema for a research process is presented in Fig. 1. The interactive phases related to this schema are: understanding the phenomenon, understanding the problem, understanding data, data preprocessing, modelling, evaluation and reporting. Research is conducted in order to obtain an answer to some problem. The researcher has to choose the perspective from which solving the problem is approached, since research requires rigorous communication and careful definitions of concepts related to a phenomenon of interest. However, all researchers have some more or less latent foreknowledge, which provides directions and criteria for understanding the world and also guides understanding gained and interpretations made during the research process. The researcher must be aware that these choices and interpretations are his or her ideologically and historically dependent decisions. The choice of perspective and conceptions may even prevent one from seeing (or enable one to see) something important or interesting. In order to move from the theoretical level to the empirical level, the researcher must operationalise the research frame in such a way that it is possible to produce useful information from the phenomenon

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

504

R. Sund / Utilisation of administrative registers using scientific knowledge discovery

and thus create some answers to the problem. This kind of operationalisation is fundamental to the plausibility of research. This is the stage at which the research defines the point of view from which the phenomenon and concepts are examined. Data represent the empirical world in research. Observed information can not cover all details which are relevant to the phenomenon of interest. The observation and measurement must also be dealt with in a conceptual context. This means that some compromises must be made between the exactness of measurement and the ambiguity of observation. Definitions can not replace the empirical information, but definitions affect the organisation of information and the decisions based on such information. From this point of view, it is easy to argue that data should be produced strictly according to the research problem. That is not the case with secondary data sources, such as administrative registers, since secondary data are originally produced for some other purposes. The available data and the conceptual definitions used in their production are not necessarily compatible with the research problem at hand. As an extreme example, imagine that your observational world is all that you see, hear, smell and feel. Would you trade your observational world for an old black-and-white photo, if you could trade it for a high-quality three-dimensional digital image with full colour scale, including infrared? All in all, the researcher must decide what to observe, how to conceptualise variables, what kind of operationalisation to make and how to acquire appropriate data. The same freedom also applies to the choice of analysing strategy, interpretation of the results and discussion of alternative interpretations. Therefore it is important to reflect upon decisions made during the research process. Every scientific explanation is valid only in so far as the investigator can provide a rationale for what has been done. At each stage of the research, decisions and arguments must not rest on the uncritical or automatic use of established techniques but on a set of logical decisions which must be taken on theoretical grounds with full consciousness of what is being done. Results must be justified. In the example above, one obvious question to be answered is: Why was that particular black-and-white photo chosen? In this example, ideology may be viewed as the position of and lighting for the camera, and historical dependency as the moment of taking the picture in question. Since in the case of administrative registers the data typically have certain common properties, it is possible to make some suggestions of which it is useful to be aware if data originating from such a source are to be utilised. The next section briefly describes a few ideas that have been found very worthwhile in practice. In the research process these ideas may be seen as aspects of the following three phases: understanding the problem, understanding data, and data preprocessing. 2.1. Understanding the problem Time is an essential factor in many problem domains. For example, disease processes evolve in time, and patient records give the history of patients. In research on such dynamic phenomena, interest very often focuses on the sequences of events which occur in time and reflect changes in research objects. A good example is an employment career, which can be regarded as consisting of the sequences and timing of the various jobs and of their association with other events, such as changes in marital status and place of residence. In fact, the importance of the longitudinal event history approach has long been recognised in many areas, especially in the social sciences, econometrics and medical research (see e.g. [5,19,34]). 2.1.1. Operationalisation For the purposes of problem solving, the problem has to be ‘matched’ against the corresponding domain knowledge and data. This can be done by characterising the important properties of the phenomenon carefully, i.e. by operationalising it as a system. A system is defined as a ‘group of things or parts

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

R. Sund / Utilisation of administrative registers using scientific knowledge discovery

B: Outpatient care

A: No care

505

D: Death

C: Hospital care

Fig. 2. A system shown as a directed graph.

working together or connected in some way so as to form a whole’. This definition of a system requires that the concepts, objects and restrictions related to the system be adequately described. In the case of event histories, a common approach to characterising the phenomenon is to categorise events in such a way that they describe transitions across a set of discrete states in time. In the simplest case only one qualitative change is possible – for example, the transition from the state ‘alive’ to the state ‘dead’. However, in many cases one transition can not describe the dynamics of a phenomenon in a realistic way. For example, the slow development of a disease which finally results in the state ‘death’, can not be modelled accurately using only the time elapsing between the first diagnosis and death. If the system is constructed in a reasonable way, the individuals’ event histories can be regarded as ‘paths’ through the system. Pictorially the system can be presented as a directed graph whose nodes and edges have characteristics which describe the properties of possible states and transitions in a system (Fig. 2). This kind of presentation can be criticised as simplistic or even confusing, since it may seem that nothing happens to an individual as long as he or she remains in the same state. On the other hand, it has been found that this kind of formalisation of a problem is easily understandable even for technical outsiders and that it thus provides a common language for the experts from different fields which is required for effective co-operation between members of a research group. As a matter of fact, the presentation technique described above does not restrict the number of possible states. If it seems too unrealistic to assume that nothing happens during a stay in a state or that stays in the same state are not comparable, it is not a problem to split a state into a greater number of more specific states. In general there is no limit to this kind of hierarchical splitting. The choice of an appropriate degree of hierarchical structure is a good example of a fundamental decision on how to reach a compromise between exactness of measurement, ambiguity of observation and – most importantly – the practical needs of the problem-solving process in question. 2.2. Understanding data In register-based analyses, the data themselves impose restrictions on the whole problem-solving task. Such data are known as secondary data, in contrast to the primary data obtained by the investigator through direct observation or interviewing. The use of secondary data raises particular problems associated with the need to take account of the process by which the data were compiled. In general, data must have a form appropriate for storage and analysis purposes, and in addition for intuitive interpretation.

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

506

R. Sund / Utilisation of administrative registers using scientific knowledge discovery

Noise

Phenomenon

Coding

Register

Decoding

Data

Fig. 3. Schematic diagram of information communication via administrative registers.

2.2.1. The nature of data If we apply the traditional Shannon’s communication model in the context of administrative registers, the transmission of information in register utilisation can be regarded as having a structure such as that presented in Fig. 3 (cf. [31]). The main idea is that some parts of the phenomenon of interest are observed and coded to signal, which is then transmitted using a possibly noisy channel into a database and then decoded in order to obtain proper data concerning the phenomenon. Even though this is a very simple and technical representation of communication, it seems to contain the essential elements needed in the common-sense understanding of secondary data. In other words, it is assumed that there exists some phenomenon which can be observed. Since it is impossible to completely observe all details or make exact measurements, some kind of coding is used to describe things. This coded signal is then stored in a database. The noise can be interpreted as an explanation for measurement compromises, possible inconsistencies and coding errors, and coding practices existing in the stored signal. When this signal is to be utilised, it must be decoded into understandable data, i.e. into a form which tells us something about the phenomenon. This phase is also subject to noise caused by incompatibility of choices and interpretations made by the data producer and the data user. In other words, there are many problematic preconditions in the case of administrative data. Most importantly, the limitations of data are determined by the choices made by the producer of the data: easily available data are also easier to collect; data are produced using fixed categorisations and may be dependent on producers’ interpretations; there may be many data producers, with consequent variation in production practices and categorisations used; and information systems may not be flexible enough to take changes in phenomena and society into account. In practice, secondary data are interpreted in order to utilise them for purposes other than those for which they were originally intended. The most typical problems encountered in such a situation are the lack of data on important background phenomena and discrepancies between the users and producers of data as regards their beliefs and knowledge concerning the nature of data. It is possible to grasp the real nature of the data only if one understands (a) what factors have led to the production of certain data and a particular information system, (b) what are the sources and consequences of the type of organisation and data structures used, and (c) whether the data have an effect on the understanding of their domain [24,32]. 2.2.2. Data structures Event-history data consist of observations of the form (τ k ,Dk ), where τk is an ‘occurrence time’ and Dk is an ‘explanation’ for the event (and n is the number of observations and τ 1 � τ2 � . . . τn and τi < τj for at least one observation i �= j and i, j, k = 1, 2, . . . , n). Usually Dk consists of a set of attributes (variables). However, not all the attributes necessarily contain important or interesting information. Some attributes can be irrelevant to the solving of particular problem or can be easily derived from other attributes. It is often reasonable to divide up the relevant information into two subsets of attributes. The first subset defines an event type E k and the other includes important

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

507

R. Sund / Utilisation of administrative registers using scientific knowledge discovery

tk

Ek

0

A

4

C

8

B

10

C

13

D

A B C D 0

2

4

A

6

0

8

C

10

2

4

B

6

8

12

C

10

14

16

18

16

18

time

D

12

14

time

S = < {(0,A)}, {(4,C)}, {(8,B)}, {(10,C)}, {(13,D)} > Fig. 4. An example of event-history data and the corresponding event sequence.

‘covariate’ information ik . For the sake of generality it is useful to allow a transformation f k for the occurrence time τk . If not stated otherwise, fk (τk ) = τk (k = 1, 2, . . . , n). Let m be the number of distinct occurrence times, t i be the ith distinct occurrence time (i = 1, 2, . . . , m) and the event set A i be the set of relevant information on observations occurring at the same time, i.e. Ai = {(ti , Ek , ik )}, where i = 1, 2, . . . , m and for every i, k is over the observations for which ti = fk (τk ). The generalised event sequence S is defined to be a queue of event sets sorted by the (transformed) occurrence time, i.e. S =< A1 , A2 , . . . , Am >. In addition, let the conditional event sequence, to be called an event subsequence, be a sequence S θ =< Ai |Ai ∈ S and condition θ is true >, where i = 1, . . . , m. The definition of the generalised event sequence given above is not very rigorous and can be used in the absence of any knowledge about the event-history framework. However, if one uses this framework, generalised event sequences acquire a constructive and intuitively clear interpretation. Moreover, transformations allowed in generalised event sequences make it possible to use the same data structure in the implementation of different statistical and data mining methods. 2.2.3. An example of a generalised event sequence Figure 4 shows an example of event-history data and the corresponding event sequence, which could be produced by system – say system P, presented in Fig. 2. Two graphical representations of this particular event history are also shown. The first is an event-history description of transitions in a system where an individual actually stays in a current state until there is a transition to another state. In the second, only the events occurring are marked on the figure. Since the time between two consecutive events is also the length of stay in a particular state, it is often very useful to include a ‘length of stay’ attribute in the covariate attributes ik , even though it can be easily calculated from the corresponding occurrence times. In the definition of a generalised event sequence, parallel occurrences of events are allowed. In principle, the systems approach can be rendered valid in this case by defining each combination of event

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

508

R. Sund / Utilisation of administrative registers using scientific knowledge discovery A

a)

0

C

2

4

6

G

0

B

C

8

10

G A

D

12

14

16

18

time

2

4

6

8

10

12

14

16

18

F B

C

8

10

D

b) 0

F

C

2

4

6

12

14

16

18

time

time

S = < {(0,A), (0,G)}, {(4,C)}, {(8,B), (8,F)}, {(10,C)}, {(13,D)} > Fig. 5. An example of a situation with parallel occurrence times.

sets as a ‘new’ event type. In other words, one can think of the explanation for an event type as a combined description of events occurring at the same time. However, in many cases it is reasonable to assign the parallel events to different event types, since it is possible to find ‘natural’ interpretations for these parallel events. The data shown in Fig. 4 constitute a description of movements of individual X in a system P. Let Q be a system with two states (state F: ‘married’ and state G: ‘not married’). As a result, data corresponding to the movements of individual X in the system Q have the same form as the data in Fig. 4. Figure 5a shows an example of this kind of situation. Now there are two event histories for individual X, one corresponding to the path through the system P and the other to that through the system Q. In this case it is known that the systems which ‘generate’ the data are parallel, but the two event histories can be combined into one event sequence (Fig. 5b). In the case of two parallel systems, the information concerning the ‘source system’ of an observation is a very valuable covariate. In general, there can, of course, be more than two data-generating systems. If there is a need to restrict analyses to the observations obtained from some particular system, this can be done easily by using an event subsequence conditional to the corresponding system. In practice there are always event histories for more than one individual. It is thus trivial to include a covariate which identifies an individual, for example his or her social security number, while the whole data set still has the form of a generalised event sequence. Again it is possible to restrict the analyses to a pretermined set of observations by using event subsequences with suitable conditions. 2.2.4. Censoring An additional point to consider is the fact that in practice, data constitute only a narrow window on the dynamics of a phenomenon. In other words, the observations contained in the data set fulfil the condition a < ti < b, where a and b are finite constants and t i is the occurrence time of an event. The problems caused by a limited observation window are illustrated in Fig. 6, which shows examples of different types of ‘censoring’. Each case corresponds to an individual’s length of stay in some particular state, i.e. each case is combination of two events: transition to the state of interest and away from it. Dotted parts of the lines are unobserved; this corresponds to the case where the ‘true’ transition to or from the state is not observed. Censoring in fact guides the possibilities for analysis, and it must be taken into account in such analyses. In the cases (a) and (g), there are no observed transitions to or from the state. This kind of censoring can be very problematic if the very first (or last) occurrence of some event type is considered more important than other occurrences (for example the first diagnosis of schizophrenia or the first backsurgery operation). In the case (b), the true transition to the state is not observed, but the individual was

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

R. Sund / Utilisation of administrative registers using scientific knowledge discovery

509

a b c d e f g h t1

t2

time

Fig. 6. Examples of censoring.

in that state when the observation began, and the transition time from the state is known. In the case (c) both transitions are observed, but there is a potential problem, because the data outside the observation window are not complete for all individuals (typical situation for hospital discharge data). The case (d) corresponds to the uncensored observation. In the case (e), there has been an unknown or ‘wrong’ transition from the state (drop-outs or ‘competing risk’). It is also possible that the follow-up has ended before the occurrence of the event of interest (case f) or the transitions to and from the state are outside of the observation window (case h). 2.2.5. Statistical interpretations of a generalised event sequence A generalised event sequence can be interpreted as a sample path of a marked point process, if the occurrence times for all events are distinct. This leads to a very general family of statistical hazard-rate models suitable for censored data (see e.g. [1]). In practice, all methods in the event-history framework are special cases of this general interpretation. For example, there are many situations in which the ‘calendar time’ of event occurrence is not important, since the ‘real’ information is the time between two consecutive events. In other words, the ‘starting time’ of a follow-up can vary between individuals. A traditional solution, very common in survival analysis, is to transform the time axis from calendar time to ‘failure time’. This can be done using a transformation function f k (τk ) = τk − bk , where bk is the occurrence time of ‘starting event’ of a corresponding individual and k = 1, 2, . . . , n (see e.g. [7,18]). Assuming that the probability of the next event type (state) depends only on time spent in the current state, an appropriate choice for a model is a semi-Markov-model (see e.g. [29]). By restricting the probabilities to change only at discrete time points, the semi-Markov model can be formulated using the Markov chain, where the state space is expanded in a proper way (see e.g. [14]). In the traditional Markov chain, the probability of the next event type (state) depends only on the current state (first-order Markov property), and the probabilities are time-homogeneous. In fact, the Markov chain interpretation corresponds to a situation in which the exact occurrence time is not important and only the order of observations matters. A generalised event sequence can be transformed to an event type sequence of this kind by using the transformation f k (τk ) = k(k = 1, 2, . . . , n). Moreover, by choosing the transformation function f k so that it is of the form fk (τk ) = c, where c is an arbitrary constant and k = 1, 2, . . . , n, the time dimension of the event sequence can be eliminated and the sequence reduces to the data miner’s classical market basket model (see e.g. [12], Chapter 6) .

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

510

R. Sund / Utilisation of administrative registers using scientific knowledge discovery

2.3. Data preprocessing Usually massive second-hand data sets contain so much information and so many domain-specific features, inaccuracies and problems that raw data as such are not usable. In order to use data in problem solving, there must be understanding of the connections between the problem and the data. In registerbased analyses, the problem, domain knowledge and data determine the most suitable model for the final problem solving. Typically it is possible to construct data sets of the event-history type using register data. Preprocessing can also be interpreted as a kind of technical operationalising phase in the research process. 2.3.1. Data abstraction Often the connections between highly specific raw data and the highly abstract domain knowledge are so complicated that it is not possible to find any direct links between data and knowledge. An intelligent interpretation of raw data must be embedded into analyses, so that the resulting derived data set is at the level of abstraction corresponding to the current problem. Since noise is an unavoidable phenomenon, some kind of data validation and verification which makes use of knowledge should also be performed. This kind of task, performed in order to abstract higher-level concepts from possibly time-stamped data, is called data abstraction (see e.g. [20], Chapter 2; [30]). In the discovery of medical knowledge, data are usually patient-specific, while medical knowledge is patient-independent and consists of generalisations that apply across patients. For example, a complication after surgical operation is a medical concept, but from individual-based data it must be abstracted by using some ‘rules’, such as a list of some particular diagnosis codes recorded in data[RF6]. 2.3.2. Data cleaning Real-world data are very often more or less incomplete, noisy and inconsistent. Data cleaning involves detecting and removing errors and inconsistencies from data in order to improve the quality of data (see e.g. [25]). Compared to data abstraction, cleaning is more data-driven and technically oriented. In other words, corrections of erroneous and inconsistent codes as well as missing values can be usually made to the whole database, but data abstraction always results in problem-specific derived data sets. 2.3.3. Data integration and reduction Two other common types of preprocessing are known as data integration and data reduction. The idea in data integration is to include data from multiple sources in analyses; the process is also known as record linkage (see e.g. [26,36]). Data reduction results in a reduced representation of a data set which is much smaller in volume than the original data set, yet produces the same (or almost the same) analytical results (see e.g. [12], Chapter 3). Data reduction can in fact consist of anything from simple database queries to very complicated analyses. 2.3.4. Preprocessing tasks in the event-history framework Sometimes it can in fact be difficult to distinguish what is operationalising, what is preprocessing and what is modelling. All these phases need interpretation and evaluation of the results. Since these are also highly domain-specific and problem-specific, it is difficult to give any general suggestions concerning tasks and methods for analyses, other than that the problem in question implicitly defines the most suitable technique and that the assumptions underlying techniques applied must be valid.

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

R. Sund / Utilisation of administrative registers using scientific knowledge discovery

511

However, some non-trivial preprocessing tasks in the event-history framework could include: ‘defining’ the state space for some system (What happens to a patient after a surgical operation?); finding interesting and frequent combinations of patterns (What combinations of diagnosis and operation codes are frequent?); assigning patterns to adequate hierarchies (What diagnoses relate to complications of a surgical operation?); and confirming the expert’s ‘hypotheses’ about the phenomenon from data. Most analysing techniques are feasible only in the case of moderately small data sets, since these typically need access to the whole data set, and the processing time will be directly proportional to the physical file size. In fact, many data mining methods are very well suited in such circumstances and are clearly something that is needed in practical data analysis. It can be stated that a sophisticated preprocessing operation incorporating non-technical domain knowledge in order to scale things down to a size fit for more specific statistical analyses is the most important and time-consuming part of register-based data analysis. One very straightforward but extremely useful preprocessing technique is ‘pattern remapping’. The idea is to first ‘forget’ the time dimension in the generalised event sequence and to use a levelwise search (see e.g. [21]) to extract the frequent patterns from data, regardless of occurrence times. These patterns (such as a list of medical diagnoses) are then given to a domain expert, who can identify and make hierarchical groupings for the relevant patterns. In the remapping phase, interesting patterns are then assigned to appropriate event types simply by ‘renaming’ interesting patterns according to suggestions made by the domain expert. Finally, the time dimension is restored and records lacking interesting event types are removed. This abstraction usually results in a considerably reduced data set with an interpretation corresponding to the current problem. 3. Practical example: Hip-fracture surgery This example is a simplified extract from a study, which aimed 1) to develop and implement registerbased performance indicators to measure the effectiveness of surgical treatment of hip fracture and 2) to evaluate and compare the effectiveness of health-care providers. The complete results are reported elsewhere [27,33]. The study is a part of a larger project which aims to develop register-based methods for the measurement of effectiveness in specialised health care. 3.1. Defining the problem The first task of the project was to build up a research group consisting of experts from different fields. The group defined the actual problem more meticulously: the idea was to identify all hip-fracture patients from the Finnish Health Care Register and follow the life events that they encountered after hip-fracture surgery according to register data. A simplified system related to this particular problem is shown in Fig. 7. As can be seen, the first hip-fracture operation performed on an individual patient has a key role in the characterisation of this phenomenon. Actually, the state preceding the first hip-fracture operation also matters from the clinical point of view, since the patients coming from home are usually in better condition than patients who are already in residential or hospital care. However, the major interest is in the events and pathways of care following the first hip-fracture operation. In this case, these events are classified into four categories. If everything goes well, the patient should return home. Hip fracture is a serious condition for the elderly, and it can also be a starting point or catalyst for other problems which may result in the need for residential or hospital care. A hip-fracture operation may even be followed by fatal complications. This

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

512

R. Sund / Utilisation of administrative registers using scientific knowledge discovery

Home

Home Hip-fracture operation Residential/ hospital care

Residential/ hospital care Complication

Death

Fig. 7. System characteristics of significant life events related to hip-fracture surgery.

particular categorisation was chosen because the event types comprised the events of interest from the viewpoint of the original problem; and even more importantly, these life events are recorded in various registers. For the sake of generality, the system’s formulation also allows there to be multiple events after hipfracture surgery. As a matter of fact, the pathways of care are more interesting than single events in situations such as a cost-effectiveness evaluation. In the formulation of the system, only death is an absorbing state; all other states can be followed by any other state, i.e. only the death state ends a path in the system. Moreover, even though these four states are distinct, they are not necessarily independent. For example, death may be more probable after complications, or it may not be very likely that the patient will return home if long-term residential care is approved. The absorbing death state causes even more problems, because it is not possible to say of any individual that there is an increased risk of complications after death. However, it is possible to speculate what would have happened if death had not occurred. All in all, this kind of quite simple system definition seems to result in an extremely complicated competing risks model, and more simplifications are needed in the actual modelling. 3.2. Understanding and preprocessing data A cohort of patients with hip fracture in 1998 or 1999 was identified in the Finnish Hospital Care Register using a simple diagnosis-group abstraction (all patients with at least one ICD-10: S72 diagnosis in 1998 or 1999). Using the unique personal identity codes of the patient cohort, data on all inpatient and outpatient hospital care and deaths for this cohort were obtained from the Finnish Health Care Register, the data warehouse of the Finnish Hospital Benchmarking Project and the National Causes of Death Register. The results of these straightforward database queries were integrated into a new data set containing 167 952 records for 17 099 patients. Each record in this data set corresponds to one care episode in hospital (or death), not any actual event of the system, i.e. each observation includes information, such as patient and hospital ID-numbers, age, sex, area codes and diagnosis and operation codes, as well as dates of admission and discharge (or death). Data cleaning was performed in order to correct impossible simultaneous hospital episodes, systematic

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

R. Sund / Utilisation of administrative registers using scientific knowledge discovery

513

errors in the use of symptom vs. cause diagnoses, and some missing or erroneous attribute values in area codes. Many types of censoring occur in this data set: the first hip-fracture operation (or other important event) can be outside the observation window; some hospital episodes may have begun before 1998; follow-up finishes at the end of 1999 (there are census data available for the last day of every year in the Finnish Health Care Register); and follow-up may be terminated by the death of a patient. Operation codes corresponding to hip-fracture surgery were abstracted into two different operation types. Using this and the diagnosis-group abstraction of hip fracture, hip-fracture operations were identified from the data. Since the state preceding a hip-fracture operation was also important, the histories of the patients were traced backwards and the preceding state was classified as home or residential/hospital care by using data abstraction of a more complicated nature. The forward-direction abstractions were even more complicated, and all event types needed special abstractions and techniques. For example, the acute complication events were identified using the pattern remapping technique, in which all clinically relevant complication diagnoses were remapped to one event type. 3.3. Modelling, evaluation and reporting For statistical modelling purposes it was assumed that any acute complication event occurring after a hip-fracture operation is an outcome which reflects the effectiveness of the surgical treatment. In addition, deaths and the upper limit of the observation window were assumed to cause censoring of the event of interest. With these assumptions, the modelling reduced to standard survival analysis (see e.g. [7]) where the variables of interest are the time between a hip-fracture operation and a complication or censoring event, and the censoring indicator. These variables were calculated for all of those patients who were (a) aged over 60 years and (b) had been living at home before surgery. The final preprocessed data set had 8824 records, each containing relevant variables for one patient. In survival analysis, the distribution function of failure time random variable T is F (t) = P (T � t), where t � 0, and S(t) = P (T > t) = 1− F (t), t � 0, is the corresponding survival function. Assuming F to be absolute continuous and f to be the corresponding density, the hazard function is defined by r(t) = f (t)/S(t), t � 0. The differential r(t)dt = P (T ∈ dt|T � t) has the intuitive interpretation of ‘the conditional failure probability at time t, given survival to at least t’. Moreover, the hazard function determines uniquely the distribution function, and hazard-based models are often a convenient way to handle censored observations. In spite of the fact that the time dimension includes a lot of information, the actual effectiveness indicators should be as easy as possible to interpret. In this study the continuous time scale was reduced to a simple dichotomous scale which merely indicates whether or not the event of interest has occurred in the case of an individual during some particular ‘limit’ time period. However, it is not obvious how such ‘limit’ times should be chosen. In addition to using clinical knowledge, it is also possible to utilise data empirically in this task. Since the outcome was an acute complication, the hazard function of acute complication occurrences was estimated. According to the hazard function presented in Fig. 8, the probability of acute complications was higher during the first 30-day period after a surgical operation. This finding based on the data corresponded to the domain knowledge and provided some evidence that the data abstraction was done in a proper way. Since there are censored observations in the data, the proportion of the risk population for whom an event of interest occurred within the ‘limit’ time period does not necessarily correspond to the ‘true’

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

514

R. Sund / Utilisation of administrative registers using scientific knowledge discovery

Hazard function for acute P 0.01 0.009 0.008 0.007 0.006 0.005 0.004 0.003 0.002 0.001 0

0

30 60 100 150 200 250 300 350 400 450 500 550

Time (days) Fig. 8. Hazard function for acute complication.

occurrence rate. In other words, rates must be estimated using more sophisticated methods. In this study the censoring was assumed to be independent (survival experience in the future is not statistically altered by censoring and survival experience in the past). The ‘limit’ time rates were estimated using product-limit (Kaplan-Meier) estimators. In addition, the counts of events were needed, and the corrected ‘observed’ counts were calculated using the relation O i = Ni Fˆi (t), where i indexes the risk population, Ni is the size of risk population i and Fˆi (t) is the estimated cumulative probability of event occurrence at time t (‘limit’ time rate) in risk population i. 3.4. Profiling the providers The most useful information is obtained when rates are evaluated for health-care providers, such as hospitals or hospital districts. Profiling analyses of this kind also allow comparisons of effectiveness between providers. However, the profiling can be quite complicated, since there is variation between providers for at least three reasons: 1) differences may be attributable to random variation caused by the size of the provider, 2) the patient case-mix may vary from provider to provider, and 3) providers may differ in the effectiveness of their care. For these reasons, a statistical model for provider profiling in which provider differences are modelled explicitly was constructed in the study. For an individual patient j , an observed outcome has the form (Y j |xj , zj ), where xj is a vector of patient characteristics and z j is a provider-specific effectiveness component. The expected outcome for a patient is E(Yj |xj ), i.e. a constant degree of effectiveness is assumed. Obviously, if the expected outcome is subtracted from that observed, the remaining residual reflects the effectiveness of care of a provider. In the case of binary outcomes, a logistic regression is a suitable tool for the calculation of the expected outcomes. The idea is to construct and estimate a model in which the observed outcome is a dependent variable and patient characteristics are independent variables. Using this kind of model, it is possible to calculate predicted values for all individuals using patient characteristics and estimated values of parameters with the inverse logit transformation.

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

R. Sund / Utilisation of administrative registers using scientific knowledge discovery

515

Since the focus in profiling is on providers and not�on individuals, � the observed and expected outcomes must be aggregated to the provider level, i.e. O i = Yj and Ei = logit−1 (xj β), where the sums are over patients treated by provider i and β is an estimated parameter vector. Traditionally the ratio of observed to expected outcomes multiplied by the mean rate is used as a risk-adjusted rate for a provider. This quantity forms the basis for comparisons between providers and can be interpreted as the estimated event rate for a given provider if the population of patients treated were identical to the nationwide case-mix. Since the observed outcomes O i are non-negative integers describing frequencies of events, they can be assumed to have a Poisson distribution with unknown mean µ i . That is, Oi ∼ Poisson(µi), where log µi = log Ei + θi and i is the provider index. In other words, it is assumed that the expected outcomes Ei adjust the patient characteristics, and θ i describes the variation caused by provider. The use of logarithms guarantees that θ i remains positive in the model. Case-mix adjustment does not necessarily eliminate the variation in performance indicators in such a way that the indicators reflect a provider‘s effectiveness of care. For example, in data sets with a hierarchical structure there often exists correlations between observations, and this may result in overestimated differences in profiling analyses. Differences in the sizes of providers may also cause problems. For example, rates can not be estimated accurately for small providers. Assuming the exchangeability of providers (i.e. that the results for all providers are equal if there is infinite number of (similar) patients), a two-level hierarchical model can be used to solve the problems mentioned above. A simple solution is to assume that variation caused by providers is normally distributed, i.e. θi ∼ N (α, σ 2 ), where exp(α) is the ‘general’ case-mix-adjusted risk ratio and σ 2 describes the variance between providers (in logarithmic scale). This kind of hierarchical Bayes model needs appropriate prior distributions for the hyperparameters α and σ 2 , such as α ∼ N (0, 106 ) and σ −2 ∼ Γ(0.001, 0.001). The estimation of posterior distributions for parameters of interest (in this case θi s for all providers i) can be done using Markov chain Monte Carlo (MCMC) simulation [22]. Multiplying the obtained adjusted risk ratios by the mean rate results in easily interpretable riskadjusted rates for providers. The use of a hierarchical multilevel model eliminates many drawbacks of traditional profiling analyses. Multilevel models are well suited to the simultaneous calculation of many confidence intervals (multiple comparisons problem), and they give more conservative estimates for differences between providers than do traditional methods [11]. 3.4.1. Example drawn from the results Figure 9 shows the hospital-district-specific risk-adjusted rates for 30-day complication rates. The mean 30-day acute complication rate was 13.0%, and there were no significant differences between districts according to the 95% confidence intervals of rates. However, according to the 50% confidence intervals, hospital districts 9 and 15 seem to have slightly higher rates than others. For more results, see [27] and [33]. 4. Conclusions This paper has presented a methodological framework for the utilisation of administrative registers in the creation of scientifically valid information. This has been done by discussing essential methodological criteria encountered in the practical research process and by combining fruitful methodological ideas from different fields, such as statistics, data mining and sociology. The emphasis has been on the understanding of connections between problem, data and analysis in the case of secondary data sources.

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

516

R. Sund / Utilisation of administrative registers using scientific knowledge discovery

Hyvinkaa-Porvoo Lohja-L-Uusimaa Helsinki Jorvi-Peijas V-Suomi Satakunta K-Hame Pirkanmaa P-Hame Kymenlaakso E-Karjala E-Savo I-Savo P-Karjala P-Savo K-Suomi E-Pohjanmaa Vaasa K-Pohjanmaa P-Pohjanmaa Kainuu L-Pohja Lappi

33 34 31 32 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21

(dottedline).

10 11 12 13 14 15 16 17 Fig. 9. Hospital district-specific risk-adjusted 30-day complication rates in 1998 and 1999. Rates as complications per 100 patients. Thin parts of lines correspond to 95% confidence intervals and thick parts correspond to 50% confidence intervals. Mean rate is 13 complications per 100 patients (dotted line).

In the introduction, the problem caused by a changing paradigm of data analysis was discussed. Different perspectives are needed in analyses. Bringing carefully defined ideas originating from some field into the body of knowledge concerning other fields may create new possibilities or solve some problems – if the scholars working in the ‘object field’ are open-minded enough. Moreover, some pieces of knowledge, such as the criteria for scientific information and the phases of research processes, may be ‘self-evident’ issues for many researchers and data analysts, but the listing of these principles is useful in any case, since it helps to anticipate and avoid the most crucial pitfalls. The paper contains some key points concerning the research process when it is based on massive secondary data sources. The secondary nature and massive size of data sets highlight the importance of scientific issues as compared to the technical questions. Effective research utilising massive secondary data sources requires broad expertise and active collaboration, since it is waste of time to reinvent things which are trivial to some other scholars. This can easily lead to the problem that there is no common language shared by experts in different fields. The ‘slaves of ideologies’ think differently and have different latent assumptions, so it can be difficult to find suitable compromises needed in order to achieve reasonable results. An even bigger mistake is to forget the possibly major impact that decisions made during the execution of a study can have on its outcomes. In addition, beliefs concerning the nature of secondary data are not the same for producers and users, and this generates unknown amounts of ‘noise’ for the results if such issues are not considered carefully. Sophisticated preprocessing – incorporating knowledge from the non-technical domain in order to scale things down to a size fit for more detailed

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

R. Sund / Utilisation of administrative registers using scientific knowledge discovery

517

statistical analyses – is the most important and time-consuming element in the register-based data analysis. All in all, it can be stated that ‘data are not collected, but produced; research results are not findings, but creations’ [17]. There is a great deal of literature concerning the utilisation of register-based data in research (see e.g. [6,10,16,23,28,35,37]). However, the perspective is usually very closely connected to the domain field or to some particular problem, which makes it difficult to extract more generally applicable ideas. Moreover, a purely problem-oriented approach can easily lead to a situation in which the contents of the data are taken too literally. For example, one pitfall is to think that medical diagnoses are recorded in administrative registers in just as detailed a manner as the diagnosis classification allows. There are also two very common erroneous beliefs concerning the utilisation of register data in research. It has been stated that the formation of research data is often technically easy, and that register data are ‘hard’ data representing the truth and providing evidence-based quantitative information. In spite of the fact that time-consuming and expensive production of new data is avoided, according to the principles presented above, the data preprocessing, which is full of ideologically dependent qualitative choices, is the most fundamental and the most difficult part of register-based research. Data analysis is never ‘easy record linking’. It is at least as important a part of the research as the domain knowledge. All data analysts should remember that they are not just assistants to ‘real researchers’ but full and equal members of the research group. Collaboration is the key to obtaining the results. Nobody can be an expert on everything. The principles of the research process that were presented above are the basis for the actual utilisation framework. Since the ideas were somewhat abstract in nature, some suggestions for concrete and practically useful interpretations were considered more carefully. The event-history framework was suggested as a well-developed option in outlining the characteristics of dynamic phenomena. A systems approach to the operationalisation of dynamic phenomenon was described as a helpful tool in generating a common language for experts from different fields. The structure of event-history data was shown to have the form of a generalised event sequence and to be a suitable data structure for the handling of second-hand register data. We drew attention to aspects of censoring which, in connection with eventhistory data, implicitly define possible model types for problem solving and provide a glimpse of the data-generating process, which has an intuitive interpretation in the event-history framework. Moreover, a wide variety of suitable data structures for traditional models can be produced from the generalised event sequence as special cases using the time-transformation property presented. In addition, common preprocessing types were reviewed, some examples of possible preprocessing tasks were given and a pattern remapping technique was formulated. Finally the ideas were illustrated by means of a practical example. It was seen that register-based data analysis becomes very complicated and challenging even in seemingly simple situations. All in all, this paper is simply a monologue presented by a statistician who is specialised in computational statistics and has a background in the social sciences, research interests in problems related to health services research, and experience in the production of official statistics, the maintenance of information systems, the development of statistical software, the teaching of statistics, and consultation with other researchers. In other words, the issues presented in this article are just an ‘insider’s’ collection of pieces of preliminary knowledge which are essential to the conduct of scientific research based on administrative registers. Acknowledgements This study is supported in part by research grant from the Academy of Finland.

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

518

R. Sund / Utilisation of administrative registers using scientific knowledge discovery

References [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18] [19] [20] [21] [22] [23] [24] [25] [26] [27]

[28] [29]

E. Arjas, Survival Models and Martingale Dynamics, Scandinavian Journal of Statistics 16 (1989), 177–225. R.J. Brachman and T. Anand, The Process of Knowledge Discovery in Databases, in: Advances in Knowledge Discovery and Data Mining, U.M. Fayyad, G. Piatetsky-Shapiro, P. Smyth and R. Uthurusamy, eds, AAAI Press, Menlo Park, 1996, pp. 37–58. J.M. Chambers, Greater or Lesser Statistics: A Choice for Future Research, Statistics and Computing 3 (1993), 182–184. P. Chapman, J. Clinton, R. Kerber, T. Khabaza, T. Reinartz, C. Shearer and R. Wirth, CRISP-DM 1.0 : Step-by-step data mining guide, The CRISP-DM consortium, 2000. D.G. Clayton, The Analysis of Event History Data: A Review of Progress and Outstanding Problems, Statistics in Medicine 7 (1988), 819–841. F.A. Connel, P. Diehr and L.G. Hart, The use of large data bases in health care studies, Annual Reviews of Public Health 8 (1987), 51–74. D.R. Cox and D. Oakes, Analysis of Survival Data, Chapman and Hall, London, 1984. U. Fayyad, G. Piatetsky-Shapiro and P. Smyth, The KDD Process for Extracting Useful Knowledge from Volumes of Data, Communications of the ACM 39 (1996), 27–34. J.H. Friedman, The Role of Statistics in the Data Revolution, Bulletin of the International Statistical Institute, 52nd Session, Proceedings, Book 1, 1995, pp. 121-124. M. Gissler, Routinely Collected Registers in Finnish Health Research, in: Statistics, Registries, and Research – Experiences from Finland, J. Alho, ed., Statistics Finland, Keuruu, 1999, pp. 241–254. H. Goldstein and D.J. Spiegelhalter, League Tables and Their Limitations: Statistical Issues in Comparisons of Institutional Performance (with discussion), Journal of the Royal Statistical Society, series A 159 (1996), 385–443. J. Han and M. Kamber, Data Mining: Concepts and Techniques, Morgan Kaufmann Publishers, San Francisco, 2001. D.J. Hand, Data Mining: New Challenges for Statisticians, Social Science Computer Review 18 (2000), 442–449. R.A. Howard, Dynamic Probabilistic Systems, Volume I: Markov Models, Volume II: Semi-Markov and Decision Processes, John Wiley & Sons, New York, 1971. P.J. Huber, Massive Datasets Workshop: Four Years After, Journal of Computational and Graphical Statistics 8 (1999), 635–652 P. Huston and D. Naylor, Health services research: reporting on studies using secondary data sources, Canadian Medical Association Journal 155 (1996), 1697–1702. J. Irvine, I. Miles and J. Evans, Introduction: Demystifying Social Statistics, in: Demystifying Social Statistics, J. Irvine, I. Miles and J. Evans, eds, Pluto Press, London, 1979, pp. 1–7. J.D. Kalbfleisch and R.L. Prentice, The Statistical Analysis of Failure Time Data, John Wiley & Sons, New York, 1980. T. Lancaster, The Econometric Analysis of Transition Data, Econometric Society Monographs 17, Cambridge University Press, Cambridge, 1990. N. Lavraˇc, E. Keravnou and B. Zupan, Intelligent data analysis in medicine, in: Encyclopedia of computer science and technology, (Vol. 42)(Supp. 27), Marcel Dekker, New York, 2000, pp. 113–157. H. Mannila and H. Toivonen, Levelwise Search and Border of Theories in Knowledge Discovery, Data Mining and Knowledge Discovery 1 (1997), 241–258. E.C. Marshall and D.J. Spiegelhalter, Institutional Performance, in: Multilevel Modelling of Health Statistics, A.H. Leyland and H. Goldstein, eds, Wiley Series in Probability and Statistics, John Wiley & Sons, Chichester, 2001, pp. 127–142. C.J. McDonald and S.L. Hui, The analysis of Humongous Databases: Problems and Promises, Statistics in Medicine 10 (1991), 511–520. I. Miles and J. Irvine, The Critique of Official Statistics, in: Demystifying Social Statistics, J. Irvine, I. Miles and J. Evans, eds, Pluto Press, London, 1979, pp. 113–129. E. Rahm and H.H. Do, Data Cleaning: Problems and Current Approaches, Bulletin of the IEEE Computer Society Technical Committee on Data Engineering 23 (2000), 3–13. Record Linkage Techniques – 1997, in: Proceedings of an International Workshop and Exposition (March 20–21, 1997, Arlington, VA), W. Alvey and B. Jamerson, eds, Federal Committee on Statistical Methodology, Office of Management and Budget, Washington DC, 1997. P. Rissanen, R. Sund, I. Nordback, T. Rousi and U. Id¨anp¨aa¨ n-Heikkil¨a, Lonkkamurtuman hoidon vaikuttavuuden rekisteriperusteinen mittaaminen ja vertailu (Register-based measurement of effectiveness of surgical treatment of hip fracture, in Finnish), Aiheita 21/2002, National Research and Development Centre for Welfare and Health (STAKES), Helsinki, 2002. L.L. Roos and J.P. Nicol, A Research Registry: Uses, Development, and Accuracy, Journal of Clinical Epidemiology 52 (1999), 39–47. Semi-Markov Models: Theory and Applications, J. Janssen, ed., Plenum Press, New York and London, 1986.

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

R. Sund / Utilisation of administrative registers using scientific knowledge discovery [30] [31] [32] [33] [34] [35] [36] [37]

519

Y. Shahar, A framework for knowledge-based temporal abstraction, Artificial Intelligence 90 (1997), 79–133. C.E. Shannon, A mathematical theory of communication, Bell System Technical Journal 28 (1948), 379–423. P. Starr, The Sociology of Official Statistics, in: The Politics of Numbers, W. Alonso and P. Starr, eds, Russel Sage Foundation, New York, 1987, pp. 7–56. R. Sund, Health Care Provider Profiling: Effectiveness of Surgical Treatment of Hip Fracture, Nordic Health Econometric Workshop, August 22, Helsinki, Finland, 2002. N.B. Tuma and M.T. Hannan, Social Dynamics: Models and Methods, Academic Press, Orlando, 1984. B.A. Virnig and M. McBean, Administrative Data for Public Health Surveillance and Planning, Annual Reviews of Public Health 22 (2001), 213–230. W.E. Winkler, The State of Record Linkage and Current Research Problems, US Bureau of the Census, Technical Report 4/1999, 1999. N.P. Wray, C.M. Ashton, D.H. Kuykendall and J.C. Hollingsworth, Using Administrative Databases to Evaluate the Quality of Medical Care: A Conceptual Framework, Social Science and Medicine 40 (1995), 1707–1715.

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

II

xxn ^ ÓääÇ

-V…>ÌÌ>ÕiÀ “L

œ“«>Àˆ˜} *Àœ«iÀ̈ià œv Õ`ˆÌ >Ì> >˜` ,œṎ˜iÞ œiVÌi` ,i}ˆÃÌiÀ >Ì> ˆ˜ >Ãi œv *iÀvœÀ“>˜Vi ÃÃiÃÓi˜Ì œv ˆ« À>VÌÕÀi /Ài>̓i˜Ì ˆ˜ ˆ˜>˜` ,° -՘`£] ° ÕÀ“ˆ‡Ø̅iÓ] *° Ø̅iÎ] -° />˜˜ˆ˜i˜Î] ° >Àˆ˜i˜Ó] ° iΈ“BŽˆ£ £ >̈œ˜> ,iÃi>ÀV… >˜` iÛiœ«“i˜Ì i˜ÌÀi vœÀ 7iv>Ài >˜` i>Ì…] iÃˆ˜Žˆ] ˆ˜>˜` Ó i>Ì… i˜ÌÀi œv œÕۜ> ,i}ˆœ˜] œÕۜ>] ˆ˜>˜` Î ÕÕÃ>˜ŽœÃŽˆ ,i}ˆœ˜> œÃ«ˆÌ>] ÕÕÃ>˜ŽœÃŽˆ] ˆ˜>˜` -Փ“>ÀÞ

!HƒaR»~Êa³W /œ Vœ“«>Ài «ÀœÃ«iV̈Ûi >Õ`ˆÌ `>Ì> >˜` ÃiVœ˜`>ÀÞ >`“ˆ˜ˆÃÌÀ>̈Ûi Ài}ˆÃÌiÀ `>Ì> ˆ˜ ̅i «Àœ`ÕV̈œ˜ œv «iÀvœÀ“>˜Vi >ÃÃiÃÓi˜Ì ˆ˜vœÀ“>̈œ˜ ˆ˜ ̅i V>Ãi œv …ˆ« vÀ>VÌÕÀi ÌÀi>̓i˜Ì] >˜` ̜ VÀœÃÇÛ>ˆ`>Ìi ̅i µÕ>ˆÌÞ œv ˆ˜vœÀ“>̈œ˜° a»|Ž[³W ˆÀÃÌ] > Vœ˜Vi«ÌÕ> “œ`i vœÀ ̅i «iÀvœÀ“‡ >˜Vi >ÃÃiÃÓi˜Ì œv …ˆ« vÀ>VÌÕÀi ÌÀi>̓i˜Ì Ü>à `ivˆ˜i`° /…ˆÃ “œ`i Ü>à ̅i˜ Ṏˆâi` ˆ˜ Vœ“«>ÀˆÃœ˜Ã LiÌÜii˜ ̅i «ÀœÃ«iV̈Ûi >Õ`ˆÌ `>Ì> Vœ˜ViÀ˜ˆ˜} £äÈ Vœ˜ÃiVṎÛi …ˆ« vÀ>VÌÕÀi «>̈i˜Ìà vÀœ“ ̅i ÕÕÃ>˜ŽœÃŽˆ ,i}ˆœ˜> œÃ«ˆÌ> >˜` VœÀÀi뜘`ˆ˜} Ài}ˆÃÌiÀ `>Ì> vÀœ“ ̅i ˆ˜˜ˆÃ… i>Ì… >Ài ,i}ˆÃÌiÀ >˜` ̅i >ÕÃià œv i>̅ ,i}ˆÃÌiÀ° 7i iÝ>“ˆ˜i` ̅i Vœ“«iÌi˜iÃà œv Ài}ˆÃÌÀ>̈œ˜ œv «>̈i˜Ìà >˜` >Ãœ ̅i >VVÕÀ>VÞ >˜` `i}Àii œv Vœ“‡ «iÌi˜iÃà œv ̅i Ài}ˆÃÌiÀi` `>Ì>° "LÃiÀÛi` `ˆvviÀi˜Vià ÜiÀi V…iVŽi` >}>ˆ˜ÃÌ Ì…i “i`ˆV> ÀiVœÀ`ð +a³Ä…»³W ,i}ˆÃÌiÀ `>Ì> >VŽ Vˆ˜ˆV> `iÌ>ˆ] LÕÌ œÕÌ«iÀ‡ vœÀ“ «ÀœÃ«iV̈Ûi `>Ì> ˆ˜ ̅i ÀiVœÀ`ˆ˜} œv ˆ˜«>̈i˜Ì V>Ài …ˆÃ̜ÀÞ° œ“«iÌi˜iÃà œv ̅i Ài}ˆÃÌiÀ `>Ì> ˆÃ ÛiÀÞ }œœ`° /…i >VVÕÀ>VÞ œv i>ÈÞ “i>ÃÕÀ>Li Û>Àˆ>Lià ˆ˜ ̅i Ài}‡ ˆÃÌiÀ ˆÃ >Ì i>ÃÌ ™x¯° /…i >}Àii“i˜Ì LiÌÜii˜ Ài}ˆÃÌiÀ >˜` >Õ`ˆÌ `>Ì> Ü>à nȰί vœÀ `iÌ>ˆi` …ˆ« vÀ>VÌÕÀi `ˆ‡ >}˜œÃˆÃ° *œÞÃiÀˆ> VœÀÀi>̈œ˜ LiÌÜii˜ ̅i v՘V̈œ˜> `i«i˜`i˜VÞ Û>Àˆ>Lià Ü>à ä°Èn°

ŽŠR…ij~ŽŠ³W ,i}ˆÃÌiÀ >˜` >Õ`ˆÌ `>Ì> …>Ûi ViÀÌ>ˆ˜ ˆ“ˆÌ>̈œ˜Ã >˜` «ÀœLi“Ã] LÕÌ LœÌ… Ãii“ ̜ Li ÃՈÌ>Li vœÀ ̅i «iÀvœÀ“>˜Vi >ÃÃiÃÓi˜Ì œv …ˆ« vÀ>VÌÕÀi ÌÀi>̇ “i˜Ì° /œ ˆ“«ÀœÛi ̅i vi>ÈLˆˆÌÞ œv ̅i >`“ˆ˜ˆÃÌÀ>̈Ûi Ài}ˆÃÌiÀ `>Ì>] ̅i ۜÕ˜Ì>ÀÞ ˆ˜«ÕÌ œv >``ˆÌˆœ˜> …ˆ« vÀ>VÌÕÀi iÛi˜Ì `>Ì> ̜ ̅i Ài}ˆÃÌiÀ ŜՏ` Li “>`i «œÃ‡ ÈLi° -Ì>˜`>À`ˆâi` ˆ˜ÃÌÀÕV̈œ˜Ã ̅>Ì }Ո`i ̅i ÕÃi œv >Û>ˆ>Li Ài}ˆÃÌiÀ V>ÃÈvˆV>̈œ˜Ã ˆ˜ > Ãi˜ÃˆLi Ü>Þ ÜœÕ` ˆ“«ÀœÛi ̅i µÕ>ˆÌÞ œv `>Ì>°

iÞܜÀ`Ã

6>ˆ`ˆÌÞ] Vœ“«iÌi˜iÃÃ] >VVÕÀ>VÞ] ˆ˜vœÀ“>̈œ˜ ÃÞÃÌi“Ã] Vœ˜Vi«ÌÕ> “œ`i i̅œ`à ˜v i` ÓääÇÆ {È\ xxnqxÈÈ `œˆ\£ä°££ÈäÉ äÎnÓ i̅œ`à ˜v i` xÉÓääÇ Research Report 174 STAKES 2008

˜ÌÀœ`ÕV̈œ˜ /…i ̅iœÀïV> «Àˆ˜Vˆ«ià œv ̅i «iÀvœÀ“‡ >˜Vi >ÃÃiÃÓi˜Ì œv …i>Ì… ÃÞÃÌi“à …>Ûi Lii˜ ܈`iÞ ÃÌÕ`ˆi` Q£] ÓR° *iÀvœÀ“>˜Vi >Ç ÃiÃÓi˜Ì ÀiµÕˆÀià > “Տ̈`ˆÃVˆ«ˆ˜>ÀÞ >«‡ «Àœ>V… >˜` ˆÃ VœÃiÞ Vœ˜˜iVÌi` ̜ ˆÃÃÕià ÃÕV… >à µÕ>ˆÌÞ œv …i>Ì… V>Ài QÎ] {R] ÃÞÃÌi“à >˜>ÞÈà QxR] Ž˜œÜi`}i “>˜>}i“i˜Ì QÈR] >˜` `iVˆÃˆœ˜ ÃÕ««œÀÌ ÃÞÃÌi“à QÇR° œ˜Ìˆ˜Õ‡ œÕà ˆ“«ÀœÛi“i˜Ì ÌiV…˜ˆµÕià >Ài ÀiµÕˆÀi` ˆ˜ ̅i `iÈ}˜ >˜` “>˜>}i“i˜Ì œv ˆ˜vœÀ“>̈œ˜ ÃÞÃÌi“à ÃՈÌ>Li vœÀ «iÀvœÀ“>˜Vi >ÃÃiÃÇ “i˜Ì Qn] ™R° /…iÀi i݈ÃÌ ÃiÛiÀ> “i̅œ`œ‡ œ}ˆV> V…>i˜}iÃ\ Vœ“«ˆV>Ìi` ˆ˜ÌiÀ>V̈œ˜Ã LiÌÜii˜ ̅i «…ÞȜœ}ˆV> œ˜ÃiÌ œv `ˆÃi>Ãi] …i>Ì… ÃÞÃÌi“] ÌÀi>̓i˜Ì `iVˆÃˆœ˜Ã] …Õ“>˜ Li…>ۈœÀ] ÜVˆiÌÞ >˜` iÝ«i˜`ˆÌÕÀià ˜ii` ̜ Li ˆ˜Ìi}À>Ìi` ՘`iÀ ̅i Ã>“i ̅iœÀïV> “œ`i Q£ä] ££R° ˜ «À>V̈Vi] ̅i “œÃÌ ÕÃivՏ >««Àœ>V… ̜ «iÀvœÀ“>˜Vi >ÃÃiÃÓi˜Ì Ãii“à ̜ Li > `ˆÃi>Ãi‡L>Ãi` Vœ“«>ÀˆÃœ˜ œv i«ˆ‡ Ü`ià œv V>Ài Q£Ó] £ÎR° ,iiÛ>˜Ì «iÀvœÀ“‡ >˜Vi VÀˆÌiÀˆ> V>˜ Li iÝÌÀ>VÌi` vÀœ“ ̅i iۈ`i˜Vi‡L>Ãi` ÌÀi>̓i˜Ì }Ո`iˆ˜ià Q£{R° /Þ«ˆV>Þ Vœ“«>ÀˆÃœ˜Ã >Ài “>`i LiÌÜii˜ ̅i «ÀœÛˆ`iÀà œv V>Ài Q£xR] ̅œÕ}… ˆÌ ÀiµÕˆÀià ̅i >`ÕÃ̓i˜Ì œv Ž˜œÜ˜ ÀˆÃŽ v>V̜Àà ̅>Ì “>Þ `ˆÃÌÕÀL ̅i Vœ“«>ÀˆÃœ˜Ã Q£È‡£nR° *À>V̈V> >˜` Àiˆ>Li ˆ˜vœÀ“>̈œ˜ œ˜ iۈ‡ `i˜Vi‡L>Ãi` «iÀvœÀ“>˜Vi >ÃÃiÃÓi˜Ì ˆ˜ ̅i V>Ãi œv …ˆ« vÀ>VÌÕÀi ÌÀi>̓i˜Ì ˆÃ LiVœ“ˆ˜} ˆ˜VÀi>Ș}Þ ˜iViÃÃ>ÀÞ Q£™R° œœ` iÝ«iÀˆ‡ i˜Vià œv ̅i -Üi`ˆÃ… ,ˆŽÃ…Ÿv̇Ài}ˆÃÌÀÞ …>Ûi i` ̜ ̅i vœÀ“>ˆâ>̈œ˜ œv `>Ì> «Àœ`ÕV̈œ˜ vœÀ ̅i ÃÌ>˜`>À`ˆâi` >Õ`ˆÌ œv …ˆ« vÀ>VÌÕÀià ˆ˜ ÕÀœ«i ­- ® QÓäR° >Ì> Vœ“«Þˆ˜} ܈̅ ̅i - ÃÌ>˜`>À`à …>Ûi >Ãœ Lii˜ «Àœ`ÕVi` ˆ˜ ܓi …œÃ«ˆÌ>Ã ˆ˜ ˆ˜>˜`

QÓ£‡ÓÎR° /…ˆÃ Žˆ˜` œv «ÀœÃ«iV̈Ûi VœiV̈œ˜ œv Vˆ˜ˆV> `>Ì> œLۈœÕÃÞ Ài«ÀiÃi˜Ìà ̅i «À>V̈V> ºÃÌ>Ìi‡œv‡Ì…i‡>ÀÌ» Vœ˜Ãi˜ÃÕà vœÀ `>Ì> ÀiµÕˆÀi“i˜Ìà vœÀ >ÃÃiÃȘ} ̅i ÌÀi>̇ “i˜Ì œv …ˆ« vÀ>VÌÕÀið 1˜vœÀÌ՘>ÌiÞ] ̅i Ãi«>À>Ìi `>Ì> «Àœ`ÕV̈œ˜ ÀiµÕˆÀià iÝÌÀ> ܜÀŽ >˜` ÀiÜÕÀViÃ] >˜` ̅iÀivœÀi ˆÌ ˆÃ ՘‡ ˆŽiÞ ̅>Ì Ì…i iÝÌi˜ÃˆÛi ۜÕ˜Ì>ÀÞ `>Ì> Vœ‡ iV̈œ˜ ÀiµÕˆÀi` LÞ - ÌÞ«i >Õ`ˆÌ `>Ì> ܜՏ` Li vi>ÈLi ˆ˜ > …œÃ«ˆÌ>Ã° Ì ÜœÕ` Li «À>V̈V> >˜` VœÃ̇ivviV̈Ûi ˆv ÀœṎ˜iÞ VœiVÌi` >`“ˆ˜ˆÃÌÀ>̈Ûi `>Ì> VœÕ` Li ÕÃi` vœÀ «iÀvœÀ“>˜Vi >ÃÃiÃÓi˜Ì «ÕÀ«œÃià QÓ{R° ˜ v>VÌ] ˆ˜ ˆ˜>˜` >`“ˆ˜ˆÃÌÀ>̈Ûi Ài}ˆÃÌiÀà …>Ûi Lii˜ Ṏˆâi` ˆ˜ «iÀvœÀ“>˜Vi >ÃÃiÃÇ “i˜Ì ˆ˜ ̅i V>Ãi œv …ˆ« vÀ>VÌÕÀià QÓx‡ÓÇR° ˜ «Àˆ˜Vˆ«i] ̅i ˆ˜˜ˆÃ… …i>Ì… Ài}ˆÃÌiÀà œvviÀ > ÛiÀÞ >ÌÌÀ>V̈Ûi >˜` vi݈Li i˜ÛˆÀœ˜‡ “i˜Ì vœÀ ÀiÃi>ÀV… «ÕÀ«œÃiÃ] LiV>ÕÃi ̅i ՘ˆÛiÀÃ> «iÀܘ> ˆ`i˜ÌˆvˆV>̈œ˜ ˜Õ“LiÀà >Ài ÕÃi` ˆ˜ > Ài}ˆÃÌiÀà QÓnR° iÌiÀ“ˆ˜ˆÃ̈V ÀiVœÀ` ˆ˜Ž>}i V>˜ Li ÕÃi` >˜` ̅i «œÌi˜Ìˆ> `ˆvvˆVՏ̈ià ܈̅ Vœ“«iÝ «ÀœL>LˆˆÃ̈V ˆ˜Ž‡ >}ià >Ài ̅Õà >ۜˆ`i` Qә‡Î£R° Ãœ ̅i >V‡ VÕÀ>VÞ œv ̅i “œÃÌ ˆ“«œÀÌ>˜Ì Û>Àˆ>Lià ˆ˜ ̅i ˆ˜˜ˆÃ… Ài}ˆÃÌiÀà ˆÃ Ž˜œÜ˜ ̜ Li }œœ` QÎӇÎxR° 1˜vœÀÌ՘>ÌiÞ] ̅i >Û>ˆ>Li iۈ‡ `i˜Vi Vœ˜ViÀ˜ˆ˜} ̅i Û>ˆ`ˆÌÞ œv Ài}ˆÃÌiÀ `>Ì> V>˜ ˜œÌ Li `ˆÀiVÌÞ }i˜iÀ>ˆâi` ̜ ̅i V>Ãi œv …ˆ« vÀ>VÌÕÀi° "˜i ÃÌÕ`Þ œv >VÕÌi «i‡ ۈV vÀ>VÌÕÀià ˆ˜ ˆ˜>˜` `ÕÀˆ˜} £™nn …>à ˆ˜‡ ÛiÃ̈}>Ìi` ̅i VœÀÀi뜘`i˜Vi LiÌÜii˜ ̅i Ài}ˆÃÌiÀ `>Ì> >˜` ̅i “i`ˆV> ÀiVœÀ`à œv ££{ «>̈i˜Ìà ­£ä¯ œv £Ó£Ó «>̈i˜Ìî QÎÈR° /…i >VVÕÀ>VÞ œv ̅i `>Ì> œv ̅i Ài}ˆÃÌiÀ Ü>à vœÕ˜` ̜ Li >Ì i>ÃÌ ™x¯ vœÀ “œÃÌ ˆ“«œÀÌ>˜Ì Û>Àˆ>Lià iÝVÕ`ˆ˜} ÃiVœ˜`>ÀÞ `ˆ>}˜œÃˆÃ ­n䯮 >˜` «>Vi œv ˆ˜ÕÀÞ ­Çx¯®° ˜ >˜œÌ…iÀ ÃÌÕ`Þ ˆ˜ÛiÃ̈}>̈˜} …ˆ« vÀ>VÌÕÀi ˆ˜Vˆ`i˜Vi ˆ˜ ,iViˆÛi`\ œÛi“LiÀ ££] ÓääxÆ >VVi«Ìi`\ -i«Ìi“LiÀ n] ÓääÈ

Methodological Perspectives for Register-Based Health System Performance Assessment

xx™ *iÀvœÀ“>˜Vi ÃÃiÃÓi˜Ì œv ˆ« À>VÌÕÀi /Ài>̓i˜Ì

œ˜i …i>Ì… V>Ài `ˆÃÌÀˆVÌ vÀœ“ £™nӇΠ>˜` vÀœ“ £™™Ó‡Î ˆ˜ ˆ˜>˜`] ̅i `>Ì> œ˜ …ˆ« vÀ>V‡ ÌÕÀi «>̈i˜Ìà ­˜ r ÈÈn® ˆ`i˜Ìˆvˆi` vÀœ“ …œÃ‡ «ˆÌ> vˆià Ü>à VœÀÀiVÌi` ÕȘ} ˆ˜vœÀ“>̈œ˜ vÀœ“ “i`ˆV> ÀiVœÀ`à QÎÇR° /…i …œÃ«ˆÌ> `ˆÃ‡ V…>À}i Ài}ˆÃÌiÀ “ˆÃÃi` Ȱȯ œv «>̈i˜Ìà ˆ˜ £™nӇΠ>˜` Ӱȯ ˆ˜ £™™Ó‡Î] >˜` > …ˆ}… «iÀ‡ Vi˜Ì>}i œv v>Ãi œÀ “Տ̈«i `>Ì> ÜiÀi vœÕ˜` ­{£°n¯ ˆ˜ £™nӇΠ>˜` Ó£°Ç¯ ˆ˜ £™™Ó‡Î®° iÈ`ià ̅iÃi] Üi `œ ˜œÌ Ž˜œÜ œv >˜Þ œÌ…iÀ ˆ˜˜ˆÃ… ÃÌÕ`ˆià ̅>Ì >˜>Þâi ̅i >VVÕÀ>VÞ œÀ Vœ“«iÌi˜iÃà œv ̅i ˜>̈œ˜> …œÃ«ˆÌ> `ˆÃ‡ V…>À}i Ài}ˆÃÌiÀ ˆ˜ Ài}>À` ̜ vÀ>VÌÕÀi «>̈i˜Ìð Àœ“ > }i˜iÀ> «iÀëiV̈Ûi] >VVÕÀ>VÞ >˜` Vœ“«iÌi˜iÃà V>˜ ˜œÌ Li `ivˆ˜i` ՘ˆµÕiÞ ܈̅œÕÌ >˜ >ÃÃՓ«Ìˆœ˜ œv > }œ`i˜ ÃÌ>˜`>À` œv “i>ÃÕÀi“i˜Ì] >˜` `>Ì> Û>ˆ`ˆÌÞ “ÕÃÌ >Ãœ Li Õ`}i` >}>ˆ˜ÃÌ Ì…i ˆ˜Ìi˜`i` Ṏˆâ>̈œ˜ «ÕÀ«œÃià QÎn] ΙR° ˜ ÃÌÕ`ވ˜} ˆ˜Vˆ`i˜Vi] ̅i `ivˆ˜ˆÌˆœ˜ œv > }œ`i˜ ÃÌ>˜‡ `>À` ˆÃ ÃÌÀ>ˆ}…ÌvœÀÜ>À`] LÕÌ Ì…ˆÃ ˆÃ ˜œÌ ̅i V>Ãi ˆ˜ «iÀvœÀ“>˜Vi >ÃÃiÃÓi˜Ì° œÀÌՇ ˜>ÌiÞ] ̅i `>Ì> ÀiµÕˆÀi“i˜Ìà vœÀ ˆ˜Vˆ`i˜Vi V>VՏ>̈œ˜ >˜` vœÀ `iÌiÀ“ˆ˜ˆ˜} ̅i ÃÌ>À̈˜} «œˆ˜Ìà œv V>Ài i«ˆÃœ`ià ˆ˜ «iÀvœÀ“>˜Vi >Ç ÃiÃÓi˜Ì >Ài ÛiÀÞ Ãˆ“ˆ>À Q{äR° /…ˆÃ ÀiÃi“‡ L>˜Vi LiÌÜii˜ Ìܜ >««Àœ>V…ià ˆ“«ˆVˆÌÞ ŜÜà ̅>Ì >ÃÃՓ«Ìˆœ˜Ã œv > }œ`i˜ ÃÌ>˜`>À` >Ài ÛiÀÞ Ãˆ“«ˆÃ̈V ˆ˜ ˆ˜Vˆ`i˜Vi ÃÌÕ`ˆiÃ\ ViÀÌ>ˆ˜ œLÃiÀÛ>Li v>VÌà >Ài “i>ÃÕÀi` >Ì > vˆÝi` ̈“i «œˆ˜Ì° ˜ «iÀvœÀ“>˜Vi >ÃÃiÃÇ “i˜Ì ̅i “>ˆ˜ ˆ˜ÌiÀiÃÌ ˆÃ œ˜ ̅i `ޘ>“ˆV «ÀœViÃÃià œv V>Ài] ܅ˆV… iÃÃi˜Ìˆ>Þ “i>˜Ã `i>ˆ˜} ܈̅ `ˆvviÀi˜Ì «iÀëiV̈Ûià œv “i>ÃÕÀi“i˜Ì V…>˜}ˆ˜} ˆ˜ ̈“i Q{£R° œÀ ˆ˜ÃÌ>˜Vi] ˆÌ ˆÃ œLۈœÕà ̅>Ì Ì…i `iÌ>ˆi` `ˆ>}˜œÃˆÃ œv …ˆ« vÀ>VÌÕÀi ˆÃ “œÀi ˆ“«œÀÌ>˜Ì ˆ˜ ̅i œ«iÀ>̈˜} Àœœ“ ̅>˜ `ÕÀˆ˜} ̅i vˆ˜> ÃÌ>}ià œv Ài…>LˆˆÌ>̈œ˜] >˜` >˜ >ÃÃՓ«Ìˆœ˜ œv Vœ˜ÃÌ>˜Ì >VVÕÀ>VÞ ˆÃ «À>V̈V>Þ ՘Ài>‡ ˆÃ̈V° œÀiœÛiÀ] ̅i >VÌÕ> `ˆ>}˜œÃˆÃ œv …ˆ« vÀ>VÌÕÀi ˆÃ œ˜Þ >˜ ˆ˜ÌiÀ«ÀiÌ>̈œ˜ iÛi˜ >vÌiÀ Ãiiˆ˜} ݇À>ÞÃ] ܅ˆV… ÃÕ}}iÃÌà ̅>Ì Ì…i >Ç ÃՓ«Ìˆœ˜ œv > }œ`i˜ ÃÌ>˜`>À` “>Þ Li iÀ‡ Àœ˜iœÕà Q{ÓR° /…ˆÃ «œÌi˜Ìˆ> >“Lˆ}ՈÌÞ ˆÃ Vi>ÀiÀ ˆ˜ œÌ…iÀ `ˆ>}˜œÃià ÃÕV… >à `i“i˜Ìˆ> œÀ ÃV…ˆâœ«…Ài˜ˆ>] >˜` ˆ˜iۈÌ>Li ˆ˜ ̅i V>Ãià œv “œÀi Vœ“«iÝ Vœ˜Vi«Ìà ÃÕV… >à «œÃ̇ œ«iÀ>̈Ûi Vœ“«ˆV>̈œ˜ œÀ …i>Ì… ÃÌ>ÌÕà Q{ÎR° /…iÃi «ÀœLi“à LiVœ“i iÛi˜ “œÀi Vœ˜‡ VÀiÌi >˜` VÀˆÌˆV>] ˆv ÃiVœ˜`>ÀÞ `>Ì> q ˆ˜ œÌ…iÀ ܜÀ`à `>Ì> VœiVÌi` œÀˆ}ˆ˜>Þ vœÀ ܓi œÌ…iÀ «ÕÀ«œÃi q >Ài ̜ Li ÕÃi` QÓ{] ÓÈ] {{R°

/œ “>Ži >˜Þ ÃÌ>Ìi“i˜Ìà Vœ˜ViÀ˜ˆ˜} ̅i µÕ>ˆÌÞ œv ­ÃiVœ˜`>ÀÞ® `>Ì>] ˆÌ ˆÃ iÃÃi˜Ìˆ> ̜ vˆÀÃÌ œÕ̏ˆ˜i ̅i «Àœ«iÀ̈ià œv ÀiµÕˆÀi` “i>ÃÕÀià V>ÀivՏÞ ՘`iÀ > vˆÝi` Vœ˜Vi«ÌÕ> “œ`i ­`iÌiÀ“ˆ˜i …œÜ `>Ì> ÀiÃÕÌ vÀœ“ ̅i ̅iœÀÞ®] >˜` ̅i˜ iÛ>Õ>Ìi ̅i œLÃiÀÛi` `>Ì> >}>ˆ˜ÃÌ Ì…iÃi ÀiµÕˆÀi“i˜Ìà ­iÝ>“ˆ˜i …œÜ Üi ̅i ̅iœÀÞ V>˜ Li ÀiVœ˜ÃÌÀÕVÌi` ÕȘ} ̅i «Àœ«iÀ̈ià œv >VÌÕ> œLÃiÀÛi` `>Ì>®° ˜ œÌ…iÀ ܜÀ`Ã] ̜ >˜>Þâi ̅i µÕ>ˆÌÞ œv >Û>ˆ‡ >Li `>Ì> vœÀ «iÀvœÀ“>˜Vi >ÃÃiÃÓi˜Ì > «À>}“>̈V>Þ ÕÃivՏ Vœ“«Àœ“ˆÃi LiÌÜii˜ «ÀœLi“‡œÀˆi˜Ìi` >˜` `>Ì>‡`ÀˆÛi˜ ̅iœÀˆià ˆÃ ÀiµÕˆÀi`° ˜ ̅ˆÃ Ãi˜Ãi] ̅i `ivˆ˜ˆÌˆœ˜ vœÀ > }œ`i˜ ÃÌ>˜`>À` œv “i>ÃÕÀi“i˜Ì LiVœ“ià …iÀ“i˜iṎV\ ̅i ŽiÞ ˆÃÃÕi ˆÃ ̜ ՘`iÀÃÌ>˜` Ü…Þ Ì…i `>Ì> >Ài ˆŽi ̅iÞ >Ài ܈̅œÕÌ vˆÝˆ˜} ̅i Ài>ˆÌÞ ÕȘ} >˜Þ Ș}i `>Ì> ÜÕÀVi° œÜiÛiÀ] ˆÌ V>˜ Li >ÃÃՓi` ̅>Ì «Àˆ“>ÀÞ `>Ì> VœiVÌi` vœÀ …ˆ« vÀ>VÌÕÀi >Õ`ˆÌ «ÕÀ‡ «œÃià VœÀÀi뜘` “œÀi VœÃiÞ ̜ >˜Þ Ài>‡ ܘ>Li }œ`i˜ ÃÌ>˜`>À` ̅>˜ ÃiVœ˜`>ÀÞ Ài}ˆÃÌiÀ `>Ì> ̅>Ì ÜiÀi œÀˆ}ˆ˜>Þ VœiVÌi` vœÀ œÌ…iÀ «ÕÀ«œÃið

"LiV̈Ûià /…i >ˆ“à œv ̅ˆÃ ÃÌÕ`Þ ÜiÀi ̜ `ivˆ˜i > Vœ˜‡ Vi«ÌÕ> “œ`i vœÀ «Àœ`ÕVˆ˜} `>Ì> vœÀ «iÀ‡ vœÀ“>˜Vi >ÃÃiÃÓi˜Ì ˆ˜ ̅i V>Ãi œv …ˆ« vÀ>V‡ ÌÕÀi ÌÀi>̓i˜Ì] ̜ Vœ“«>Ài Ìܜ `ˆvviÀi˜Ì ÜÕÀVià œv `>Ì> q «ÀœÃ«iV̈Ûi Vˆ˜ˆV> >Õ`ˆÌ `>Ì> ­`iÈ}˜i` vœÀ “œ˜ˆÌœÀˆ˜} ÌÀi>̓i˜Ì µÕ>ˆÌÞ® >˜` ÃiVœ˜`>ÀÞ >`“ˆ˜ˆÃÌÀ>̈Ûi Ài}‡ ˆÃÌiÀ `>Ì> ­ÕÃi` “>ˆ˜Þ ˆ˜ Vœ“«ˆˆ˜} ÃÌ>̈Ç ̈Vî q ˆ˜ ̅i «Àœ`ÕV̈œ˜ œv «iÀvœÀ“>˜Vi >ÃÃiÃÓi˜Ì ˆ˜vœÀ“>̈œ˜] >˜` >Ãœ ̜ VÀœÃÇ Û>ˆ`>Ìi ̅i µÕ>ˆÌÞ œv ̅iÃi `>Ì> ÜÕÀViÃ

i̅œ`à œÀ ̅i `ivˆ˜ˆÌˆœ˜ œv ̅i Vœ˜Vi«ÌÕ> “œ`i] ̅i ˆÃÌ œv - ‡Û>Àˆ>Lià Ü>à V…œÃi˜ ̜ Ài«ÀiÃi˜Ì >˜ >`iµÕ>Ìi L>Ãi vœÀ `>Ì> ÀiµÕˆÀi‡ “i˜Ìà ­…ÌÌ«\ÉÉÜÜÜ°Ã>…vi°œÀÌ°Õ°ÃiÉ}Ո`i° …Ì“®° /…iÃi >Ài >VÌÕ> œ«iÀ>̈œ˜>ˆâ>̈œ˜Ã œv ̅œÃi «>̈i˜Ì iÛi Vœ˜Vi«Ìà Vœ˜Ãˆ`iÀi` ˆ“«œÀÌ>˜Ì ˆ˜ ̅i V>Ãi œv …ˆ« vÀ>VÌÕÀi° ˜ ̅ˆÃ ÃÌÕ`Þ] ̅i `ˆ“i˜Ãˆœ˜Ã œv ̅i Vœ˜Vi«ÌÕ> “œ`i ÜiÀi >LÃÌÀ>VÌi` vÀœ“ ̅i >VÌÕ> œ«iÀ‡

>̈œ˜>ˆâi` “i>ÃÕÀiÃ] >˜` ̅i˜ Vœ“«i‡ “i˜Ìi` ÕȘ} ̅iœÀïV> “œ`iÃ œv «iÀ‡ vœÀ“>˜Vi >ÃÃiÃÓi˜Ì Q£] £n] {x] {ÈR° >V… «…i˜œ“i˜œ˜ Ài«ÀiÃi˜Ìi` LÞ > Vœ˜Vi«Ì Ü>à >ÃÃՓi` ̜ Li >ÌÌÀˆLÕÌ>Li ̜ >˜ ˆ˜`ˆÛˆ`Õ> œ˜ > Vœ˜Ìˆ˜ÕœÕà ̈“i ÃV>i° Ì Ü>à >Ãœ >Ç ÃՓi` ̅>Ì i>V… Vœ˜Vi«Ì V>˜ Li `iÃVÀˆLi` ÕȘ} > ÃÞÃÌi“à >««Àœ>V… ܈̅ > ­ˆ“ˆÌi` œÀ ՘ˆ“ˆÌi`® ˜Õ“LiÀ œv ÃÌ>Ìià Ü ̅>Ì Ì…i ÃÞÇ Ìi“ ˆÃ >Ü>Þà ˆ˜ ܓi œv ̅iÃi ÃÌ>Ìià >Ì i>V… ̈“i «œˆ˜Ì° /…i Vœ˜Vi«Ìà ÜiÀi ̅i˜ V>Ãȇ vˆi` ˆ˜Ìœ “œÀi }i˜iÀ> }ÀœÕ«Ã ˆ˜ ÌiÀ“à œv ̅iˆÀ Vœ˜ÌiÝÌÕ> ˆ˜ÌiÀ«ÀiÌ>̈œ˜] Ìi“«œÀ> ÃÌ>LˆˆÌÞ >˜` ̅iœÀïV> “i>ÃÕÀi“i˜Ì «Àœ«‡ iÀ̈ið /…ˆÃ µÕ>ˆÌ>̈Ûi V>ÃÈvˆV>̈œ˜ «Àœ‡ Vi`ÕÀi Ü>à Ài«i>Ìi` ՘̈ ̅i ÀiÃՏ̈˜} Vœ˜‡ Vi«ÌÕ> “œ`i Ü>à Vœ˜Ãˆ`iÀi` Ã>̈Ãv>V̜ÀÞ ˆ˜ ̅i Ãi˜Ãi œv "VV>“¿Ã À>âœÀ «Àˆ˜Vˆ«i° Õ`ˆÌ `>Ì> ÜiÀi VœiVÌi` «ÀœÃ«iV̈ÛiÞ vœÀ £äÈ Vœ˜ÃiVṎÛi …ˆ« vÀ>VÌÕÀi «>̈i˜Ìà ˆ˜ ̅i ÕÕÃ>˜ŽœÃŽˆ ,i}ˆœ˜> œÃ«ˆÌ> Li‡ ÌÜii˜ >˜Õ>ÀÞ £] £™™™ >˜` >˜Õ>ÀÞ Î£] Óäää QÓÓ] ÓÎR° *>̈i˜Ìà ÌÀi>Ìi` ˆ˜ ̅i ÃÕÀ}ˆV> Ü>À` >Ì Ì…i ÕÕÃ>˜ŽœÃŽˆ ,i}ˆœ˜> œÃ«ˆÌ> `ÕÀˆ˜} ̅i Ã>“i «iÀˆœ` ÜiÀi ˆ`i˜Ìˆvˆi` vÀœ“ ̅i ˆ˜˜ˆÃ… i>Ì… >Ài ,i}ˆÃÌiÀ] >˜` > ÀiVœÀ`à œv ̅iÃi «>̈i˜Ìà vÀœ“ £™nLJÓääÓ ÜiÀi iÝÌÀ>VÌi` vÀœ“ ̅i ˆ˜˜ˆÃ… œÃ«ˆÌ> ˆÃV…>À}i ,i}ˆÃÌiÀ] ̅i ˆ˜˜ˆÃ… i>Ì… >Ài ,i}ˆÃÌiÀ] >˜` ̅i >ÕÃià œv i>̅ ,i}ˆÃÌiÀ ÕȘ} ̅i ՘ˆµÕi «iÀܘ> ˆ`i˜ÌˆvˆV>̈œ˜ ˜Õ“LiÀà œv ̅i «>̈i˜Ì «œ«Õ>̈œ˜° iÌiVÌi` `ˆvviÀi˜Vià LiÌÜii˜ `>Ì> ÜÕÀVià ÜiÀi vÕÀ̅iÀ V…iVŽi` “>˜Õ>Þ vÀœ“ ̅i “i`ˆV> ÀiVœÀ`à œv ̅iÃi «>̈i˜Ìà ­*®° /…i i̅ˆV> Vœ““ˆÌÌii œv ̅i ޓi˜>>ŽÃœ œÃ«ˆÌ> ˆÃÌÀˆVÌ >««ÀœÛi` ̅i ÃÌÕ`Þ° *iÀ“ˆÃȜ˜ ̜ ÕÃi ̅i `>Ì> Ü>à œLÌ>ˆ˜i` vÀœ“ ̅i ÕÕÃ>˜‡ ŽœÃŽˆ ,i}ˆœ˜> œÃ«ˆÌ>] ̅i >̈œ˜> ,i‡ Ãi>ÀV… >˜` iÛiœ«“i˜Ì i˜ÌÀi vœÀ 7iv>Ài >˜` i>Ì…] >˜` >Ãœ vÀœ“ -Ì>̈Ã̈Và ˆ˜>˜`° /…i ÃiiVÌi` “i>ÃÕÀià œv `ˆvviÀi˜Ì `>Ì> ÜÕÀVià ÜiÀi vˆÀÃÌ “>ÌV…i` ̜ ̅i `ivˆ˜i` Vœ˜Vi«ÌÕ> “œ`i° /…i Vœ“«iÌi˜iÃà œv Ài}‡ ˆÃÌÀ>̈œ˜ œv …ˆ« vÀ>VÌÕÀi «>̈i˜Ìà >à Üi >à ̅i >VVÕÀ>VÞ >˜` `i}Àii œv Vœ“«iÌi˜iÃà œv ̅i Ài}ˆÃÌiÀi` `>Ì> ÜiÀi iÝ>“ˆ˜i` ˆ˜ LœÌ… ̅i >Õ`ˆÌ >˜` Ài}ˆÃÌiÀ `>Ì> ÕȘ} ÃÌ>˜`>À` “i̅œ`œœ}Þ QÎnR° œÀ ܓi Û>Àˆ>Lià ˆÌ Ü>à “œÀi Ài>ܘ>Li ̜ “i>ÃÕÀi >}Àii“i˜Ì ̅>˜ >VVÕÀ>VÞ° /…ˆÃ ÀiµÕˆÀi` >««Àœ>V…ià L>Ãi` œ˜ “œ`iˆ˜} ̅i `iVˆÃˆœ˜‡“>Žˆ˜} «ÀœViÃÃ] ÃÕV… >à > «œÞV…œÀˆV VœÀÀi>̈œ˜ “œ`i ­> vœÀ“ œv >Ìi˜Ì ÌÀ>ˆÌ “œ`i® Q{ÇR° /…i «Àœ«œÀ‡ i̅œ`à ˜v i` xÉÓääÇ

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

xÈä -՘` iÌ >°

̈œ˜ œv >}Àii“i˜Ìà ̅>Ì ÜiÀi ˜œÌ `Õi ̜ V…>˜Vi Ü>à “i>ÃÕÀi` ÕȘ} > `iÌ> Vœiv‡ vˆVˆi˜Ì] ܅ˆV… ˆÃ L>Ãi` œ˜ ̅i “œ`i œv “Տ̈«i‡V…œˆVi ÌiÃÌà >˜` >ۜˆ`à ViÀÌ>ˆ˜ Ž˜œÜ˜ `ivˆVˆi˜Vˆià œv ̅i Ž>««> VœivvˆVˆi˜Ì Q{nR° >««> “>Þ Li Vœ˜Ãˆ`iÀi` >à >˜ >«‡ «ÀœÝˆ“>̈œ˜ œv `iÌ>] >à LœÌ… ވi` Û>Õià ̅>Ì >Ài ÛiÀÞ Ãˆ“ˆ>À ܅i˜ >Ì i>ÃÌ œ˜i “>À‡ }ˆ˜> `ˆÃÌÀˆLṎœ˜ ˆÃ L>>˜Vi` œÀ ܅i˜ LœÌ… “>À}ˆ˜> `ˆÃÌÀˆLṎœ˜Ã >Ài “œ`iÀ>ÌiÞ ՘‡ L>>˜Vi` ˆ˜ ̅i Ã>“i `ˆÀiV̈œ˜ Q{™R°

,iÃՏÌà /…i `ˆ“i˜Ãˆœ˜Ã œv ̅i Vœ˜Vi«ÌÕ> “œ`i >˜` ÃiiVÌi` “i>ÃÕÀià >ÌÌÀˆLÕÌ>Li ̜ ̅iÃi `ˆ“i˜Ãˆœ˜Ã ˆ˜ Û>ÀˆœÕà `>Ì> ÜÕÀVià >Ài Ài‡ «œÀÌi` ˆ˜ />Li £° "v ̅i `ˆ“i˜Ãˆœ˜Ã] ̅i Lˆœœ}ˆV> Vœ˜ÃÌ>˜Ìà `œ ˜œÌ V…>˜}i ˆ˜ ̈“i >˜` ̅iÀivœÀi œ˜i “i>ÃÕÀi“i˜Ì ˆÃ }i˜iÀ>‡ ˆâ>Li ̜ > ̈“i𠈜œ}ˆV> iÛi˜ÌÃ] >VVˆ‡ `i˜ÌÉ v> …ˆÃ̜ÀÞ] >˜` …ˆ« vÀ>VÌÕÀi iÛi˜Ì Ài«ÀiÃi˜Ì `ˆ“i˜Ãˆœ˜Ã vœÀ ܅ˆV… >VÌÕ> Û>Õià œv “i>ÃÕÀià >Ài ÀiVœÀ`i` ˆ˜ ̅i «ÀœÝˆ“ˆÌÞ œv ܓi œLÃiÀÛ>Li iÛi˜Ì°  œÌ…iÀ `ˆ“i˜‡ Ȝ˜Ã Ài>Ìi ̜ «…i˜œ“i˜> ̅>Ì «œÌi˜Ìˆ>Þ V…>˜}i ˆ˜ ̈“i >˜` ŜՏ` Li Vœ˜Ìˆ˜ÕœÕÃÞ “œ˜ˆÌœÀi`° ˜ «À>V̈Vi] ̈“i “ÕÃÌ Li vˆÝi` vœÀ >VÌÕ> “i>ÃÕÀi“i˜Ì° ˜ ̅i >Õ`ˆÌ `>Ì>] ̅i “i>ÃÕÀi“i˜Ì ˆÃ `œ˜i ˆ˜ Vœ˜˜iV̈œ˜ ܈̅ > …ˆ« vÀ>VÌÕÀi iÛi˜Ì >˜` vœÀ ViÀÌ>ˆ˜ “i>ÃÕÀià >Ãœ Ìܜ ÜiiŽÃ] vœÕÀ “œ˜Ì…Ã] >˜` œ˜i Þi>À >vÌiÀ ̅i vÀ>VÌÕÀi° ˜ ̅i Ài}ˆÃÌiÀ `>Ì>] ̅i Ài‡ VœÀ`ˆ˜} Ì>Žià «>Vi >Ì i>V… `ˆÃV…>À}i°à i݇ «iVÌi`] ̅i Vœ˜Ìi˜Ì œv >Õ`ˆÌ `>Ì> ˆÃ ÀˆV…iÀ ̅>˜ ˆ˜ ̅i Ài}ˆÃÌiÀ `>Ì>] iÛi˜ >vÌiÀ ÕȘ} ˆ˜‡ `ˆÀiVÌ “i>ÃÕÀià œv ViÀÌ>ˆ˜ Vœ˜Vi«Ìð œÜ‡ iÛiÀ] ̅i “i>ÃÕÀi“i˜Ì vÀiµÕi˜VÞ ˆ˜ Ài}ˆÃÌiÀ `>Ì> ˆÃ ÃÕ«iÀˆœÀ ̜ >Õ`ˆÌ `>Ì>] >œÜˆ˜} Vœ“‡ «iÌi œLÃiÀÛ>̈œ˜Ã œv iÛi˜Ì …ˆÃ̜Àˆià vœÀ ˆ˜‡ «>̈i˜Ì V>Ài°

œ“«iÌi˜iÃà œv ,i}ˆÃÌÀ>̈œ˜ /…iÀi ÜiÀi £ä{ «>̈i˜Ìà ˆ˜ ̅i «ÀœÃ«iV̈Ûi `>Ì> ­Ìܜ …>` > ˜iÜ vÀ>VÌÕÀi >Ì Ì…i œÌ…iÀ È`i œv ̅i …ˆ« `ÕÀˆ˜} ̅i vœœÜ‡Õ«®° ˜ ̅i Ài}‡ ˆÃÌiÀ ̅iÀi ÜiÀi £££ «>̈i˜Ìà ܅œ ÜiÀi >`‡ “ˆÌÌi` ̜ ̅i ÃÕÀ}ˆV> ˆ˜«>̈i˜Ì Ü>À` >Ì Ì…i ÕÕÃ>˜ŽœÃŽˆ ,i}ˆœ˜> œÃ«ˆÌ> `ÕÀˆ˜} ̅i

ÃÌÕ`Þ «iÀˆœ` ܈̅ > …ˆ« vÀ>VÌÕÀi `ˆ>}˜œÃˆÃ° Ì ÌÕÀ˜i` œÕÌ Ì…>Ì Ì…iÀi ÜiÀi ˆ˜ ̜Ì> £äx «>‡ ̈i˜Ìà ܈̅ > Vœ˜vˆÀ“i` …ˆ« vÀ>VÌÕÀi `ˆ>}‡ ˜œÃˆÃ ­/>Li Ó®° /…i >Õ`ˆÌ `>Ì> “ˆÃÃi` œ˜i V>Ãi >˜` ̅i Ài}ˆÃÌiÀ `>Ì> Ìܜ V>Ãià ˆ˜`ˆV>̇ ˆ˜} ÛiÀÞ }œœ` Vœ“«iÌi˜iÃð /…i «>̈i˜Ì “ˆÃÇ ˆ˜} vÀœ“ ̅i >Õ`ˆÌ `>Ì> …>` >˜ ˆ“«>VÌi` …ˆ« vÀ>VÌÕÀi ­vi“œÀ> ˜iVŽ® >˜` Ü>à iÀÀœ˜iœÕÃÞ iÝVÕ`i` vÀœ“ ̅i «ÀœÃ«iV̈Ûi ÃÌÕ`Þ° œÌ… V>Ãià “ˆÃȘ} vÀœ“ ̅i Ài}ˆÃÌiÀ `>Ì> …>` > `ˆ>}˜œÃˆÃ œv `ˆÃÌ> vi“ÕÀ vÀ>VÌÕÀi ÀiVœÀ`i` ˆ˜ ̅i Ài}ˆÃÌiÀ° /…Àii iÝÌÀ> …ˆ« vÀ>VÌÕÀi «>‡ ̈i˜Ìà ܈̅ Vi>ÀÞ v>Ãi …ˆ« vÀ>VÌÕÀi `ˆ>}‡ ˜œÃià ÜiÀi vœÕ˜` ˆ˜ ̅i Ài}ˆÃÌiÀ° œÀ ̅i vˆÀÃÌ «>̈i˜Ì] ̅iÀi Ü>à > Vœ`ˆ˜} iÀÀœÀ ˆ˜ ̅i “i`ˆ‡ V> ÀiVœÀ` `ˆ>}˜œÃˆÃ ­-{Ó°ä - -ÇÓ°ä® >˜` ̅i ÃiVœ˜` «>̈i˜Ì …>` > `ˆÃÌ> vi“ÕÀ vÀ>V‡ ÌÕÀi] LÕÌ vœÀ ̅i ̅ˆÀ` «>̈i˜Ì >˜Þ iÝ«>˜>̈œ˜ vœÀ >˜ œLۈœÕà iÀÀœÀ Ü>à ˜œÌ vœÕ˜`° "˜i «>‡ ̈i˜Ì ܈̅ …ˆ« vÀ>VÌÕÀi `ˆ>}˜œÃˆÃ ˆ˜ ̅i Ài}‡ ˆÃÌiÀ …>` > «iÀˆ«ÀœÃ̅ïV …ˆ« vÀ>VÌÕÀi >˜` ̅iÀivœÀi ̅ˆÃ «>̈i˜Ì Ü>à ˜œÌ > …ˆ« vÀ>VÌÕÀi «>̈i˜Ì° ˜œÌ…iÀ «>̈i˜Ì Ü>à >`“ˆÌÌi` Li‡ V>ÕÃi œv ÃÕëiVÌi` …ˆ« vÀ>VÌÕÀi] LÕÌ ˜œ vÀ>V‡ ÌÕÀi Ü>à vœÕ˜`° /…Àii «>̈i˜Ìà iÝVÕ`i` vÀœ“ ̅i >Õ`ˆÌ `>Ì> ÜiÀi …>ۈ˜} œ˜‡}œˆ˜} ÌÀi>̓i˜Ì vœÀ > ÀiVi˜Ì i>ÀˆiÀ …ˆ« vÀ>VÌÕÀi q œ˜i œ«iÀ>Ìi` œ˜ >Ì Ì…i ÕÕÃ>˜ŽœÃŽˆ ,i‡ }ˆœ˜> œÃ«ˆÌ> >˜` Ìܜ >Ì Ãœ“i œÌ…iÀ …œÃ‡ «ˆÌ>° Ì >Ãœ ÌÕÀ˜i` œÕÌ Ì…>Ì vœÕÀ «>̈i˜Ìà ˆ˜‡ VÕ`i` ˆ˜ ̅i >Õ`ˆÌ `>Ì> …>` …>` q >VVœÀ`ˆ˜} ̜ ̅i Ài}ˆÃÌiÀ q >˜ i>ÀˆiÀ …ˆ« vÀ>VÌÕÀi `ÕÀˆ˜} ̅i «ÀiVi`ˆ˜} Ìi˜ Þi>Àð /…i È`i œv ̅i vÀ>V‡ ÌÕÀi ˆÃ ÀiVœÀ`i` ÛiÀÞ Ãi`œ“ ˆ˜ ̅i Ài}ˆÃÌiÀ] LÕÌ Ì…i “i`ˆV> ÀiVœÀ`à ÀiÛi>i` ̅>Ì œ˜Þ œ˜i œv ̅iÃi «>̈i˜Ìà …>` ̅i ˜iÜ vÀ>VÌÕÀi œ˜ ̅i Ã>“i È`i >à ̅i i>ÀˆiÀ vÀ>VÌÕÀi° /…i «œÃˆÌˆÛi >}Àii“i˜Ì LiÌÜii˜ Ài}ˆÃÌiÀ >˜` >Õ`ˆÌ `>Ì> Ü>à ÛiÀÞ }œœ` ­™{°™¯®] >˜` ܜՏ` Li iÛi˜ …ˆ}…iÀ ˆv >««Àœ«Àˆ>Ìi `>Ì> >LÃÌÀ>V̈œ˜ ÀՏià vœÀ ˆ`i˜ÌˆvˆV>̈œ˜ œv v>Ãi «œÃˆÌˆÛià >Ài `ivˆ˜i`° v `>Ì> >LÃÌÀ>V̈œ˜ ÀՏià >Ài ˜œÌ ÕÃi`] ̅i Ài}ˆÃÌiÀ `>Ì> ܈ œÛiÀ‡ iÃ̈“>Ìi ̅i ˜Õ“LiÀ œv ˜iÜ …ˆ« vÀ>VÌÕÀið

œ“«iÌi˜iÃà œv ,i}ˆÃÌÀ>̈œ˜ vœÀ ,i‡œ«iÀ>̈œ˜Ã >˜` iÜ À>VÌÕÀià /ܜ œv ̅i £äx …ˆ« vÀ>VÌÕÀi «>̈i˜Ìà ˜ii`i` >˜ >VÕÌi Ài‡œ«iÀ>̈œ˜ `ÕÀˆ˜} ̅i vˆÀÃÌ >`“ˆÃ‡ Ȝ˜° iˆÌ…iÀ œv ̅iÃi Ài‡œ«iÀ>̈œ˜Ã Ü>à `i‡ ÌiVÌ>Li vÀœ“ ̅i Ài}ˆÃÌiÀ `>Ì>° ˜ v>VÌ] ̅i

`>Ì> ÃÌÀÕVÌÕÀi œv ̅i ˆ˜˜ˆÃ… i>Ì… >Ài ,i}ˆÃÌiÀ >œÜà œ˜Þ œ˜i œ«iÀ>̈œ˜ `>Þ Ìœ Li ÀiVœÀ`i` `ÕÀˆ˜} œ˜i «iÀˆœ`] >˜` ÀiVœÀ`ˆ˜} œv Ìܜ ȓˆ>À œ«iÀ>̈œ˜ Vœ`ià vœÀ ̅i Ã>“i «iÀˆœ` “>Þ Ãii“ ̜ Li >˜ iÀÀœÀ° ˜ LœÌ… V>ÃiÃ] ̅i `>Ì> ˆ˜ ̅i Ài}ˆÃÌiÀ VœÀÀi뜘`i` ̜ ̅i >ÌÌiÀ œ«iÀ>̈œ˜° /…Àii «>̈i˜Ìà ܈̅ > ˜iÜ …ˆ« vÀ>VÌÕÀi `ÕÀˆ˜} ̅i vˆÀÃÌ Þi>À œv vœœÜ‡Õ« ÜiÀi `i‡ ÌiVÌi` vÀœ“ ̅i Ài}ˆÃÌiÀ° /ܜ ÜiÀi >Ãœ vœÕ˜` vÀœ“ ̅i >Õ`ˆÌ `>Ì> >˜` ̅i ̅ˆÀ` Ü>à Vœ˜vˆÀ“i` ÕȘ} ̅i ˆ˜vœÀ“>̈œ˜ ˆ˜ ̅i “i`ˆV> ÀiVœÀ`° /…i ­Vœ˜vˆÀ“i`® Ài}ˆÃÌiÀ `>Ì> >Ãœ ÀiÛi>i` ̅>Ì œ˜i œv ̅iÃi ̅Àii «>̈i˜Ìà …>` > ̅ˆÀ` œ«iÀ>̈œ˜ LiV>ÕÃi œv «ÀœLi“à ܈̅ ̅i …ˆ« «ÀœÃ̅iÈà `ÕÀˆ˜} ̅i vˆÀÃÌ Þi>À œv vœœÜ‡Õ«°

VVÕÀ>VÞ >˜` œ“«iÌi˜iÃà œv >ÈÞ i>ÃÕÀ>Li 6>Àˆ>Lià /…i Vœ“«iÌi˜iÃà >˜` >VVÕÀ>VÞ œv ̅i Ài}‡ ˆÃÌiÀ `>Ì> ÜiÀi iÝ>“ˆ˜i` ÕȘ} £äÈ …ˆ« vÀ>VÌÕÀi iÛi˜Ìà ܈̅ Vœ“«iÌi >Õ`ˆÌ `>Ì>°

œ“«iÌi˜iÃà ˆ˜`ˆV>Ìià ̅i «iÀVi˜Ì>}i œv V>Ãià ܈̅ > ÀiVœÀ`i` Û>Õi vœÀ ̅i Û>Àˆ>Li ˆ˜ µÕiÃ̈œ˜° VVÕÀ>VÞ ÌiÃ ̅i «Àœ«œÀ̈œ˜ œv VœÀÀiVÌÞ ÀiVœÀ`i` Û>Õià vœÀ V>Ãià …>ۇ ˆ˜}>Ì i>ÃÌ Ãœ“i ÀiVœÀ`i` Û>Õið /…i VœÀ‡ ÀiV̘iÃà q Ìiˆ˜} ̅i œÛiÀ> ṎˆÌÞ ­“i>ÃÕÀi` >à ̅i «Àœ`ÕVÌ œv Vœ“«iÌi˜iÃà >˜` >VVÕÀ>VÞ® œv ̅i Û>Àˆ>Li q Ü>à ÛiÀÞ }œœ` vœÀ “œÃÌ œv ̅i Û>Àˆ>Lià ­/>Li ή° /…i >VVÕÀ>VÞ œv >`“ˆÃȜ˜ ÜÕÀVi >˜` `ˆÃ‡ V…>À}i `iÃ̈˜>̈œ˜ Ü>à ˆ“«ÀœÛi` LÞ ÕȘ} ÀiVœÀ` ˆ˜Ž>}i ˆ˜ÃÌi>` œv Û>Àˆ>Lià ˆ˜ ̅i ˆ˜`iÝ >`“ˆÃȜ˜ «iÀˆœ`° ˆÃV…>À}i `>Þ >Ãœ ÀiµÕˆÀi` ÀiVœÀ` ˆ˜Ž>}i] LiV>ÕÃi i>V… ÌÀ>˜ÃviÀ vÀœ“ œ˜i Ü>À` ̜ >˜œÌ…iÀ iÛi˜ ˆ˜ ̅i Ã>“i …œÃ«ˆÌ> ÀiÃՏÌà ˆ˜ > ˜iÜ `ˆÃ‡ V…>À}i ˆ˜ ̅i Ài}ˆÃÌiÀ° ˆÃȘ} œ«iÀ>̈œ˜ Vœ`ià >Ài > ˜ÕˆÃ>˜Vi ˆ˜ ̅i V>Ãi œv …ˆ« vÀ>V‡ ÌÕÀiÃ] LiV>ÕÃi ̅iÞ “>Þ >Ãœ ˆ˜`ˆV>Ìi Vœ˜‡ ÃiÀÛ>̈Ûi ÌÀi>̓i˜Ì° /…i `>Ì> ÃÌÀÕVÌÕÀi œv ̅i Ài}ˆÃÌiÀ `œià ˜œÌ ÃÕˆÌ Üi ̜ ÀiVœÀ`ˆ˜} >VÕÌi Ài‡œ«iÀ>̈œ˜Ã `ÕÀˆ˜} ̅i Ã>“i …œÃ«ˆ‡ Ì> «iÀˆœ`] ÀiÃՏ̈˜} ˆ˜ ̅i ÀiVœÀ`ˆ˜} œv ÜÀœ˜} Vœ`ià >˜` `>Þà vœÀ «Àˆ“>ÀÞ œ«iÀ‡ >̈œ˜° /…i iÝÌÀ> œ«iÀ>̈œ˜ Vœ`i ˆ˜`ˆV>̈˜} ̅i È`i œv vÀ>VÌÕÀi Ü>à ÕÃi` ÛiÀÞ À>ÀiÞ° VVÕÀ>VÞ vœÀ ̅i «>Vi œv >VVˆ`i˜Ì Ü>à «œœÀ° /…ˆÃ Ü>à “>ˆ˜Þ LiV>ÕÃi œv ˆ˜‡ V>«>LˆˆÌÞ œv ̅i  ‡£ä V>ÃÈvˆV>̈œ˜ ̜

i̅œ`à ˜v i` xÉÓääÇ Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

xÈ£ *iÀvœÀ“>˜Vi ÃÃiÃÓi˜Ì œv ˆ« À>VÌÕÀi /Ài>̓i˜Ì

/>Li £

œ˜Vi«ÌÕ> “œ`i `ˆ“i˜Ãˆœ˜Ã >˜` ̅iˆÀ Ài>ˆâ>̈œ˜Ã ˆ˜ `ˆvviÀi˜Ì `>Ì> ÜÕÀVià - “i>ÃÕÀi

Õ`ˆÌ `>Ì> “i>ÃÕÀi

,i}ˆÃÌiÀ `>Ì> “i>ÃÕÀi

ˆœœ}ˆV> iÛi˜ÌÃ

`>Ìi œv LˆÀ̅] `i>̅] Þi>À œv “i˜œ«>ÕÃi] “i˜>ÀV…i

`>Ìi œv LˆÀ̅ >˜` `i>̅

`>Ìi œv LˆÀ̅ >˜` `i>̅

ˆœœ}ˆV> Vœ˜ÃÌ>˜Ìà ­`˜>®

ÃiÝ

ÃiÝ

ÃiÝ

ˆœœ}ˆV> v>VÌÃ

ˆœœ}ˆV> “i>ÃÕÀià *…ÞÈV> iÝ>“ˆ˜>̈œ˜Ã

…iˆ}…Ì] Üiˆ}…Ì

>LœÀ>̜ÀÞ ÌiÃÌÃ

…i“œ}œLˆ˜] VÀi>̈˜i] >LՓˆ˜] Lœ˜i `i˜ÃˆÌÞ

i“œ}À>«…ˆV …ˆÃ̜ÀÞ œÕÃi…œ` Vœ“«œÃˆÌˆœ˜

ˆÛˆ˜} >œ˜i

“>ÀˆÌ> ÃÌ>ÌÕÃ] ˆÛˆ˜} >œ˜i

*>Vi œv ˆÛˆ˜}

ÀiÈ`i˜Ìˆ> ÃÌ>ÌÕÃ

ÀiÈ`i˜Ìˆ> ÃÌ>ÌÕÃ] >Ài> œv ˆÛˆ˜}

>Ài> œv ˆÛˆ˜}] …aÊa… Žn R:°a”

ӜŽˆ˜}] >Vœ…œ ˆ˜Ì>Ži

>Vœ…œ ÕÃ>}i

:…RŽ|Ž… °a…:»a[ ~Šœ:»~aŠ» R:°a”

«>ˆ˜] «ÃÞV…œœ}ˆV> ÃÌ>Ìi] vi>À œv v>

*>ˆ˜

ˆÃi>ÃiÃÉÃޓ«Ìœ“Ã

Vœ“œÀLˆ`ˆÌÞ] Vœ“«ˆV>̈œ˜Ã

œ“«ˆV>̈œ˜Ã

°aRŽ°[a[ [~:tŠŽ³a³X ija Žn R:°a”

*…ÞÈV> v՘V̈œ˜ˆ˜}

Ü>Žˆ˜}

Ü>Žˆ˜}] >V̈ۈ̈ià œv `>ˆÞ ˆÛˆ˜}

˜ii` vœÀ V>Ài

œ}˜ˆÌˆÛi v՘V̈œ˜ˆ˜}

LLÀiۈ>Ìi` “i˜Ì> ÌiÃÌ ÃVœÀi

-œVˆœiVœ˜œ“ˆV …ˆÃ̜ÀÞ

`ÕV>̈œ˜] œVVÕ«>̈œ˜] iVœ˜œ“ˆV ÀiÜÕÀVià i>Ì… Ài>Ìi` Li…>ۈœÀ ,ˆÃŽ Li…>ۈœÀ ˆiÌ] >V̈ۈ̈ià -ÕLiV̈Ûi µÕ>ˆÌÞ œv ˆvi "ÛiÀ> Üi‡Liˆ˜} "LiV̈Ûi ˜ii` vœÀ V>Ài

1Ãi œv V>Ài /iV…˜ˆV> >ˆ`

Ü>Žˆ˜} >ˆ`Ã

i`ˆV>̈œ˜

ÕÃi œv >ÃÈÃ̈Ûi ÌiV…˜œœ}Þ] V…>˜}ià ˆ˜ ˆÛˆ˜} i˜ÛˆÀœ˜“i˜Ì ÕÃi œv «>ˆ˜ŽˆiÀÃ

i>Ì… V>Ài Ṏˆâ>̈œ˜

«ÀœÛˆ`iÀ] ÌÞ«i œv ÃÌ>Þ] i˜}̅ œv ÃÌ>Þ

ÌÞ«i œv V>Ài] Ài…>LˆˆÌ>̈œ˜] `>Ìià vœÀ …œÃ«ˆÌ> ۈÈÌÃ

«ÀœÛˆ`iÀ] ÌÞ«i œv V>Ài] >`“ˆÃȜ˜ >˜` `ˆÃV…>À}i `>ÌiÃ] `ˆ>}˜œÃiÃ] œ«iÀ>̈œ˜Ã

`>Ìi œv v>] «>Vi œv v>

`>Ìi œv v>] «>Vi œv v>

œ…:Ra Žn :Š[ °a:³ŽŠ nŽ° |Ž³œ~»:…~Ña[ ~ŠƒÄ°Í”

*Àœ«iÀ̈ià œv vÀ>VÌÕÀi

̈“i œv vÀ>VÌÕÀi] œVVÕÀÀi˜Vi «>Vi] vÀ>VÌÕÀi ÌÞ«i] vÀ>VÌÕÀi È`i

`>Þ œv vÀ>VÌÕÀi] Ài>ܘ vœÀ vÀ>VÌÕÀi] œVVÕÀÀi˜Vi «>Vi] vÀ>VÌÕÀi ÌÞ«i] vÀ>VÌÕÀi È`i

«>Vi œv >˜` Ài>ܘ vœÀ …œÃ«ˆÌ>ˆâi` ˆ˜ÕÀÞ

˜ˆÌˆ> ÌÀi>̓i˜Ì «ÀœViÃÃ

…œÃ«ˆÌ> ˆ`] >`“ˆÃȜ˜ ̈“i] >`“ˆÃȜ˜ ÜÕÀVi] ÃÌ>ÀÌ œv œ«iÀ>̈œ˜] Ài>ܘ vœÀ œ«iÀ>̈Ûi `i>Þ] ÌÞ«i œv ÃÕÀ}iœ˜ >˜` >˜iÃ̅ïÃÌ] ÌÞ«i œv >˜iÃ̅ïV] œ«iÀ>̈Ûi “i̅œ`] i˜}̅ œv ÃÕÀ}iÀÞ] ÌÞ«i œv «Àœ«…ޏ>݈Ã] Vœ“«ˆV>̈œ˜Ã ˆ˜ œ«iÀ>̈œ˜] Àiœ«iÀ>̈œ˜Ã] ̈“i ̜ “œLˆˆâ>̈œ˜] `ˆÃV…>À}i ̈“i] `ˆÃV…>À}i `iÃ̈˜>̈œ˜

>`“ˆÃȜ˜ `>Þ] >`“ˆÃȜ˜ ÜÕÀVi] `>Þ œv œ«iÀ>̈œ˜] Ài>ܘ vœÀ œ«iÀ>̈Ûi `i>Þ] œ«iÀ>̈Ûi “i̅œ`] Vœ“«ˆV>̈œ˜Ã ˆ˜ œ«iÀ>̈œ˜] Àiœ«iÀ>̈œ˜Ã] `ˆÃV…>À}i `>Þ] `ˆÃV…>À}i `iÃ̈˜>̈œ˜

…œÃ«ˆÌ> ˆ`] >`“ˆÃȜ˜ `>Þ] >`“ˆÃȜ˜ ÜÕÀVi] `ˆ>}˜œÃiÃ] `>Þ œv œ«iÀ>̈œ˜] œ«iÀ>̈œ˜ Vœ`iÃ] `ˆÃV…>À}i `>Þ] `ˆÃV…>À}i `iÃ̈˜>̈œ˜

ˆ˜ˆV> ÃÌ>LˆˆÌÞ

- }À>`i] Vœ“œÀLˆ`ˆÌˆiÃ] Vœi݈Ã̈˜} vÀ>VÌÕÀiÃ] Ài>ܘ vœÀ œ«iÀ>̈Ûi `i>Þ ˆ˜ “i`ˆV>Þ ՘vˆÌ «>̈i˜ÌÃ

- }À>`i] Ài>ܘ vœÀ œ«iÀ>̈Ûi `i>Þ ˆ˜ “i`ˆV>Þ ՘vˆÌ «>̈i˜ÌÃ

°aRŽ°[a[ [~:tŠŽ³a³X ija Žn R:°a”

VVˆ`i˜ÌÉv> …ˆÃ̜ÀÞ *Àœ«iÀ̈ià œv v> ˆ« vÀ>VÌÕÀi iÛi˜Ì

£® ˜`ˆÀiVÌ “i>ÃÕÀi“i˜Ì ÀiµÕˆÀˆ˜} ÀiVœÀ` ˆ˜Ž>}i

i̅œ`à ˜v i` xÉÓääÇ Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

xÈÓ -՘` iÌ >°

˜ LœÌ… `>Ì> ÃiÌÃ\ ™n ˜iÜ …ˆ« vÀ>VÌÕÀi «>̈i˜Ìà s œ:»~aŠ»³ Ë~»| a:°…~a° n°:R»Ä°a

ˆÃȘ} vÀœ“ Ài}ˆÃÌiÀ\ Ó v>Ãi‡˜i}>̈Ûi Ài}ˆÃÌiÀ `ˆ>}˜œÃiÃ

ˆÃȘ} vÀœ“ >Õ`ˆÌ `>Ì>\ £ ÛiÀˆvˆi` ˜iÜ «>̈i˜Ì  Žœa°:»a[ ~Š Ž»|a° |Ž³œ~»:… ” °aRaŠ» a:°…~a° n°:R»Ä°a ” a:°…~a° »Ž»:… œ°Ž³»|a³~³ ” ³Ä³œaR»a[ |~œ n°:R»Ä°a Î v>Ãi‡«œÃˆÌˆÛi Ài}ˆÃÌiÀ `ˆ>}˜œÃiÃ

*œÃˆÌˆÛi >}Àii“i˜Ì\ ™{°™¯ ­™x¯ \ ™ä°Î‡™È°n¯® "ÛiÀiÃ̈“>̈œ˜ ˆ˜ ̅i Ài}ˆÃÌiÀ\ Ç°Ó¯ ­™x¯‡ \ ΰӇ£Î°Ç¯® 1˜`iÀiÃ̈“>̈œ˜ ˆ˜ ̅i Ài}ˆÃÌiÀ\ £°™¯ ­™x¯‡ \ ä°Ó‡È°Ç¯®

6>Àˆ>Li

œ“«iÌi˜iÃã

VVÕÀ>VÞÓ

œÀÀiV̘iÃÃÎ

*iÀܘ> ˆ`i˜ÌˆvˆV>̈œ˜ ˜Õ“LiÀ{

£ää¯

£ää¯

£ää¯

œ“i “Õ˜ˆVˆ«>ˆÌÞ

£ää¯

£ää¯

£ää¯

`“ˆÃȜ˜ `>Þ

£ää¯

£ää¯

£ää¯

`“ˆÌÌi` vÀœ“

£ää¯

nΰä¯

`“ˆÌÌi` vÀœ“x

£ää¯

™x°Î¯

ˆÃV…>À}i `>Þ

£ää¯

ˆÃV…>À}i ̜

£ää¯

x

ˆÃV…>À}i ̜

£ää¯

™x°Î¯

£ää¯x

£ää¯

™È°Ó¯x

™È°Ó¯

£ää¯

£ää¯

™™°£¯

™Ç°£¯

™È°Ó¯

*Àˆ“>ÀÞ œ«iÀ>̈œ˜

™™°£¯

™n°£¯

™Ç°Ó¯

Ó°n¯

*>Vi œv >VVˆ`i˜ÌÈ

£ää¯

™Î°{¯

>ˆ˜ `ˆ>}˜œÃˆÃ ­…ˆ« vÀ>VÌÕÀi®

£ää¯

/>Li Î

œ“«iÌi˜iÃà >˜` >VVՇ À>VÞ œv i>ÈÞ “i>ÃÕÀ>Li Û>Àˆ>Lià ˆ˜ ̅i Ài}ˆÃÌiÀ

nΰä¯

"«iÀ>̈œ˜ `>Þ -ˆ`i œv vÀ>VÌÕÀi

/>Li Ó

œ“«iÌi˜iÃà œv `>Ì> ÃiÌÃ

Ó°n¯

È£°Î¯

xǰί

™n°£¯

™n°£¯

£® *iÀVi˜Ì>}i œv V>Ãià ܈̅ ÀiVœÀ`i` Û>Õi vœÀ ̅i Û>Àˆ>Li Ó® *Àœ«œÀ̈œ˜ œv VœÀÀiVÌÞ ÀiVœÀ`i` Û>Õià vœÀ V>Ãià ܈̅ ܓi ÀiVœÀ`i` Û>Õi ή /…i œÛiÀ> ṎˆÌÞ œv ̅i Û>Àˆ>Li ­«Àœ`ÕVÌ œv Vœ“«iÌi˜iÃà >˜` >VVÕÀ>VÞ® {® œ˜Ì>ˆ˜Ã `>Ìi œv LˆÀ̅ >˜` ÃiÝ x® `i˜ÌˆvˆV>̈œ˜ LÞ Ṏˆâˆ˜} ÀiVœÀ` ˆ˜Ž>}i È® ˜Vœ“«>̈Li V>ÃÈvˆV>̈œ˜ ˆ˜ ̅i Ài}ˆÃÌiÀ

/>Li {

VVÕÀ>VÞ œv …ˆ« vÀ>VÌÕÀi `ˆ>}˜œÃˆÃ ˆ˜ ̅i Ài}ˆÃÌiÀ -i˜ÃˆÌˆÛˆÌÞ

-«iVˆvˆVˆÌÞ

*œÃˆÌˆÛi «Ài`ˆV̈Ûi Û>Õi

i}>̈Ûi «Ài`ˆV̈Ûi Û>Õi

i“œÀ> ˜iVŽ vÀ>VÌÕÀi

™È°Ç¯ ­™£°x‡£ä䯮

nä°x¯ ­Èn°Ó‡™£°Ç¯®

nn°£¯ ­Ç™°Ç‡™x°Î¯®

™{°Î¯ ­nx°Ç‡£ä䯮

/ÀœV…>˜ÌiÀˆV vÀ>VÌÕÀi

Èn°È¯ ­xÓ°™‡nΰǯ®

™n°x¯ ­™x°£‡£ä䯮

™È°ä¯ ­nǰ䇣ä䯮

nx°Ç¯ ­ÇÇ°x‡™Î°£¯®

-ÕLÌÀœV…>˜ÌiÀˆV vÀ>VÌÕÀi nΰί ­{Ó°™‡£ä䯮

™È°™¯ ­™Ó°n‡£ä䯮

ÈÓ°x¯ ­Óx°ä‡£ä䯮

™n°™¯ ­™È°Ç‡£ä䯮

Ãi«>À>Ìi …œ“i >VVˆ`i˜Ìà vÀœ“ ̅i >VVˆ‡ `i˜Ìà œVVÕÀÀˆ˜} ˆ˜ ÀiÈ`i˜Ìˆ> V>Ài >˜` v>Ã œVVÕÀÀˆ˜} œÕÌ`œœÀà vÀœ“ ̅i œ˜ià œVVÕÀ‡ Àˆ˜} ˆ˜`œœÀð

VVÕÀ>VÞ >˜` ,iˆ>LˆˆÌÞ œv ˆ« À>VÌÕÀi ˆ>}˜œÃià ,i}ˆÃÌiÀ `ˆ>}˜œÃˆÃ Ü>à ÀiVœÀ`i` >à …ˆ« vÀ>V‡ ÌÕÀi ­ ‡£ä\ -ÇÓ°ä] -ÇÓ°£] -ÇÓ°Ó® ˆ˜

™n°£¯ ­™x¯ \ ™Î°{‡™™°n¯® œv V>Ãià ­/>Li ή° œÜiÛiÀ] ̅i iÛ>Õ>̈œ˜ œv ̅i iÝ>VÌ >VVÕÀ>VÞ œv ̅i “>ˆ˜ `ˆ>}˜œÃˆÃ Ü>à ˜œÌ ÃÌÀ>ˆ}…ÌvœÀÜ>À`] >à `ˆvviÀi˜Ì V>ÃÈvˆ‡ V>̈œ˜Ã œv …ˆ« vÀ>VÌÕÀià ÜiÀi ÕÃi` ˆ˜ ̅i Ài}ˆÃÌiÀ >˜` ˆ˜ ̅i >Õ`ˆÌ `>Ì>° œœÜˆ˜} ̅i “i̅œ`œœ}ˆV> ÃÕ}}iÃ̈œ˜Ã] ̅i >VVÕÀ>VÞ œv `ˆ>}˜œÃià Ü>à >``ÀiÃÃi` LÞ Vœ˜Ãˆ`iÀˆ˜} ̅i Lœ˜i >˜>̜“Þ >à > Lˆœœ}ˆV> i˜ÌˆÌÞ ˆ`i˜Ìˆvވ˜} ÌÀÕi vÀ>VÌÕÀi ÃÌ>ÌÕà QxäR° ˆ« vÀ>VÌÕÀià >Ài Vœ““œ˜Þ V>ÃÈvˆi` ˆ˜Ìœ ˆ˜ÌÀ>‡ >˜` iÝÌÀ>‡V>«ÃՏ>À >VVœÀ`ˆ˜} ̜ ̅iˆÀ Ài>̈œ˜Ã…ˆ« ̜ ̅i V>«ÃՏ>À >ÌÌ>V…“i˜Ì œv ̅i …ˆ«° ˜ ÌiÀ“à œv VÕÀÀi˜Ì `>Ì>] vÀ>VÌÕÀià œv ̅i ˜iVŽ œv ̅i vi“ÕÀ >Ài ˆ˜ÌÀ>‡V>«ÃՏ>À] >˜` œÌ…iÀ …ˆ« vÀ>VÌÕÀià >Ài iÝÌÀ>‡V>«ÃՏ>À° /…i Lœ˜i >˜>̜“Þ “>Þ Li ÕÃi` ̜ ÃÕL`ˆÛˆ`i iÝÌÀ>‡V>«ÃՏ>À vÀ>VÌÕÀià vÕÀ̅iÀ ˆ˜Ìœ L>܇ ViÀۈV>] ÌÀœV…>˜ÌiÀˆV >˜` ÃÕLÌÀœV…>˜ÌiÀˆV vÀ>VÌÕÀið 1˜vœÀÌ՘>ÌiÞ] ̅iÀi ˆÃ ˜œ Ãi«>À>Ìi `ˆ>}˜œÃˆÃ Vœ`i ˆ˜  ‡£ä vœÀ L>ÜViÀۈV> vÀ>VÌÕÀi° 1Ș} > `iÌ>ˆi` vÀ>VÌÕÀi V>ÃÈvˆ‡ V>̈œ˜ œv vi“œÀ> ˜iVŽ vÀ>VÌÕÀi] ÌÀœV…>˜ÌiÀˆV vÀ>VÌÕÀi œÀ ÃÕLÌÀœV…>˜ÌiÀˆV vÀ>VÌÕÀi] ̅i >}Àii“i˜Ì LiÌÜii˜ >Õ`ˆÌ `>Ì> >˜` Ài}ˆÃ‡ ÌiÀ Ü>à nȰί ­™x¯ \ Ǚ°{‡™Ó°Ó¯®° *>ˆÀ‡ ܈Ãi >}Àii“i˜Ì >˜>ÞÃià ÀiÛi>i` ̅>Ì vi‡ “œÀ> ˜iVŽ >˜` ÃÕLÌÀœV…>˜ÌiÀˆV vÀ>VÌÕÀià ÜiÀi VœÀÀiVÌÞ ˆ`i˜Ìˆvˆi` ˆ˜ ̅i Ài}ˆÃÌiÀ ­…ˆ}… Ãi˜ÃˆÌˆÛˆÌÞ®] LÕÌ ˆ`i˜ÌˆvˆV>̈œ˜ œv ÌÀœ‡ V…>˜ÌiÀˆV vÀ>VÌÕÀià Ü>à ˜œÌ >à >VVÕÀ>Ìi ­/>Li {®° /…ˆÃ ˆÃ “œÃÌ «ÀœL>LÞ LiV>ÕÃi ̅i iÝÌÀi“ˆÌˆià >Ài i>ÈiÀ ̜ ˆ`i˜ÌˆvÞ VœÀÀiV̏ް /…ˆÃ ˆ˜ÌiÀ«ÀiÌ>̈œ˜ ˆÃ ÃÕ««œÀÌi` LÞ Ì…i v>VÌ Ì…>Ì Ì…i Ài}ˆÃÌiÀ `ˆ>}˜œÃià œv ÌÀœV…>˜ÌiÀˆV vÀ>VÌÕÀi ÜiÀi «À>V̈V>Þ >Ü>Þà ÌÀÕi ÌÀœ‡ V…>˜ÌiÀˆV vÀ>VÌÕÀià ­…ˆ}… «œÃˆÌˆÛi «Ài`ˆV̈Ûi Û>Õi®° /…ˆÃ Ü>à ˜œÌ ̅i V>Ãi vœÀ vi“œÀ> ˜iVŽ vÀ>VÌÕÀià œÀ ÃÕLÌÀœV…>˜ÌiÀˆV Ài}ˆÃÌiÀ‡ `ˆ>}˜œÃià ܅ˆV… ÜiÀi œVV>Ȝ˜>Þ ÕÃi` vœÀ ÌÀÕi ÌÀœV…>˜ÌiÀˆV vÀ>VÌÕÀià ­œÜ «œÃˆÌˆÛi «Ài`ˆV̈Ûi Û>Õi®° ˆÃV>ÃÈvˆV>̈œ˜ >Ì Ì…i LœÀ`iÀà œv ̅i ÌÀœV…>˜ÌiÀˆV Ài}ˆœ˜ œv ̅i vi“ÕÀ “>Þ Li œ˜i Ài>ܘ° ˜œÌ…iÀ V>ÕÃi ˆÃ ̅i  ‡£ä V>ÃÈvˆV>̈œ˜ Ṏˆâi` ˆ˜ ˆ˜‡ >˜`] ܅ˆV… ÃÕ}}iÃÌà ̅i ÕÃi œv vi“œÀ> ˜iVŽ `ˆ>}˜œÃˆÃ vœÀ ՘ëiVˆvˆi` …ˆ« vÀ>VÌÕÀið /…i «Àœ«œÀ̈œ˜ œv >}Àii“i˜Ìà ̅>Ì ÜiÀi ˜œÌ `Õi ̜ V…>˜Vi Ü>à ÛiÀÞ }œœ` >à “i>ÃÕÀi` ܈̅ ̅i `iÌ> VœivvˆVˆi˜Ì ­n£°Î¯Æ ™x¯ \ Ç£°Î‡™£°Î¯® œÀ ̅i Ž>««> VœivvˆVˆi˜Ì ­Çΰx¯Æ ™x¯ \ È£°Ó‡nx°Ç¯®° ˜ >``ˆÌˆœ˜] ȘVi ˆÌ Ü>à Ài>ܘ>Li ̜ >ÃÃՓi ̅>Ì Ì…i ՘`iÀÞˆ˜} ÌÀ>ˆÌ ˆÃ Vœ˜Ìˆ˜ÕœÕÃ] ̅i «œÞ‡

i̅œ`à ˜v i` xÉÓääÇ Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

xÈÎ *iÀvœÀ“>˜Vi ÃÃiÃÓi˜Ì œv ˆ« À>VÌÕÀi /Ài>̓i˜Ì

V…œÀˆV VœÀÀi>̈œ˜ VœivvˆVˆi˜Ì Ü>à V>VՇ >Ìi` ˆ˜`ˆV>̈˜} > ÛiÀÞ …ˆ}… Vœ˜ÃˆÃÌi˜VÞ LiÌÜii˜ ̅i >Õ`ˆÌ `>Ì> >˜` Ài}ˆÃÌiÀ ­ä°™ÓÆ ™x¯ \ ä°nȇ䰙n®°

}Àii“i˜Ì LiÌÜii˜ ՘V̈œ˜> i«i˜`i˜VÞ i>ÃÕÀià ˜ ̅i >Õ`ˆÌ `>Ì>] v՘V̈œ˜> `i«i˜`i˜VÞ Ü>à “i>ÃÕÀi` ÕȘ} >˜ >V̈ۈ̈ià œv `>ˆÞ ˆÛˆ˜} ­ ® ÌÞ«i œv ÃV>i QÓÎR° ˜ ̅i ˆ˜˜ˆÃ… i>Ì… >Ài ,i}ˆÃÌiÀ] œÛiÀ> `i«i˜`i˜VÞ ­>˜` ˜ii` vœÀ V>Ài® ˆÃ ÀiVœÀ`i` ÕȘ} > ÃV>i œv vˆÛi V>Ìi}œÀˆià Qx£R° /…i Vœ“«iÌi˜iÃà œv `i«i˜`i˜VÞ Û>Àˆ>Li ˆ˜ ̅i Ài}ˆÃÌiÀ Ü>à £ä䯰 /…i LœÝ«œÌà `iÃVÀˆLˆ˜} `i«i˜`i˜VÞ “i>ÃÕÀià ­>Ì `ˆÃV…>À}i® >Ài Ŝܘ ˆ˜ ˆ}‡ ÕÀi £° /…i Õ««iÀ V>Ìi}œÀˆià œv ̅i Ài}ˆÃÌiÀ‡ L>Ãi` “i>ÃÕÀi >Ài Vœ“Lˆ˜i` `Õi ̜ ̅i Ó> ˜Õ“LiÀ œv œLÃiÀÛ>̈œ˜Ã° /…i «œÞÃiÀ‡ ˆ> VœÀÀi>̈œ˜ LiÌÜii˜ ̅i Û>Àˆ>Lià ˆÃ ä°Èn ­™x¯ \ ä°xx‡ä°n£®] Ŝ܈˜} À>̅iÀ }œœ` Vœ˜ÃˆÃÌi˜VÞ° i>˜Ã >˜` “i`ˆ>˜Ã Li…>Ûi Ài>ܘ>LÞ] LÕÌ ˆ}ÕÀi £ ˆ˜`ˆV>Ìià ̅>Ì Ì…i “œÃÌ Vœ““œ˜Þ ÕÃi` V>Ìi}œÀÞ ˆ˜ ̅i Ài}‡ ˆÃÌiÀ `>Ì> …>à > ÛiÀÞ Üˆ`i À>˜}i° Ì Ãii“à ̅>Ì …ˆ}… Ài}ˆÃÌiÀ V>Ìi}œÀˆâ>̈œ˜Ã ÌÀÕÞ Ài‡ viVÌ L>` v՘V̈œ˜>ˆÌÞ] LÕÌ >L܏ÕÌi ˆ˜ÌiÀ‡ «ÀiÌ>̈œ˜ œv ̅i Ài}ˆÃÌiÀ ÃV>i >Ì >˜ ˆ˜`ˆÛˆ`Շ > iÛi “>Þ Li iÀÀœ˜iœÕð

œœÜ‡Õ« ˜vœÀ“>̈œ˜ -Ì>Ìi `ˆ>}À>“à vœÀ ̅i Ài}ˆÃÌiÀ >˜` >Õ`ˆÌ `>Ì> ÃՓ“>Àˆâˆ˜} ̅i vœœÜ‡Õ« ˆ˜vœÀ“>‡ ̈œ˜ >Ài Ŝܘ ˆ˜ ˆ}ÕÀi Ó° /…i œÛiÀ> Å>«ià >Ài «ÀiÌÌÞ Ãˆ“ˆ>À] LÕÌ Ì…i Ài}ˆÃÌiÀ `>Ì> …>Ûi “œÀi >VVÕÀ>VÞ° /…i `>Ìià œv `i>̅Ã] i˜}̅à œv ˆ˜ˆÌˆ> …œÃ«ˆÌ>ˆâ>̈œ˜Ã] >˜` ÀiÈ`i˜Ìˆ> ÃÌ>ÌÕÃià >Ì vœÕÀ “œ˜Ì…à >˜` >Ì œ˜i Þi>À >vÌiÀ ̅i vÀ>VÌÕÀi ˆ˜ ̅i >Õ`ˆÌ `>Ì> >Ài >VVÕÀ>Ìi] LÕÌ Ì…i vœœÜ‡Õ« ˆ˜vœÀ“>̈œ˜ Vœ˜ViÀ˜ˆ˜} >VÕÌi V>Ài ÃÕLÃiµÕi˜Ì ̜ ̅i ˆ˜‡ ˆÌˆ> …œÃ«ˆÌ>ˆâ>̈œ˜ ˆ˜ ̅i >Õ`ˆÌ `>Ì> ˆÃ «>ÀÌÞ ˆ˜Vœ“«iÌi >˜` ̅iÀivœÀi ÌiV…˜ˆV>Þ `ˆv‡ vˆVÕÌ Ìœ …>˜`i° /…ˆÃ V>˜ Li “œÃÌ Vi>ÀÞ Ãii˜ vÀœ“ ̅i «Àœ«œÀ̈œ˜ œv «>̈i˜Ìà ˆ˜ ˜ÕÀÇ ˆ˜} …œ“iÃ] ܅ˆV… ˆÃ vˆÀÃÌ œÛiÀiÃ̈“>Ìi` >˜` >ÌiÀ œ˜ ՘`iÀiÃ̈“>Ìi` ­ˆ}° Ó®°

ˆ}° £ ՘V̈œ˜> `i«i˜`i˜VÞ “i>ÃÕÀià ˆ˜ Ài}ˆÃÌiÀ >˜` >Õ`ˆÌ `>Ì>° Õ`ˆÌ `>Ì> …>à >˜ >V̈ۈ̈ià œv `>ˆÞ ˆÛˆ˜} ­ ® ÌÞ«i œv ÃՓ ÃV>i QÓÎR° ,i}ˆÃÌiÀ `>Ì> iÛiÃ >Ài\ >® Vœ“«iÌiÞ œÀ >“œÃÌ ˆ˜`i«i˜`i˜Ì] L® œVV>Ȝ˜> ˜ii` vœÀ V>Ài] V® ÀiVÕÀÀi˜Ì ˜ii` vœÀ V>Ài] `³® >“œÃÌ ­œÀ ̜Ì>Þ® Vœ˜Ìˆ˜ÕœÕà ˜ii` vœÀ V>Ài Qx£R° ÀœÃà ÀiviÀà ̜ “i>˜ ˆ˜ LœÝ«œÌ°

ˆÃVÕÃȜ˜ ˜ ̅ˆÃ ÃÌÕ`Þ Ìܜ vœÀ“à œv `>Ì> «Àœ`ÕV̈œ˜ vœÀ «iÀvœÀ“>˜Vi >ÃÃiÃÓi˜Ì «ÕÀ«œÃià ÜiÀi Vœ“«>Ài` ˆ˜ ̅i V>Ãi œv …ˆ« vÀ>VÌÕÀi ÌÀi>̇ “i˜Ì°  ÕÃ̈vˆ>Li Vœ“«>ÀˆÃœ˜ ÀiµÕˆÀi` > `ivˆ˜ˆÌˆœ˜ œv ̅i Vœ˜Vi«ÌÕ> “œ`i] ܅ˆV… >œÜi` vœÀ > ÃÞÃÌi“>̈V ÃÌÀÕVÌÕÀˆ˜} œv ˆ˜ŽÃ LiÌÜii˜ œLÃiÀÛ>Li “i>ÃÕÀià >˜` ̅iœÀï‡ V> Vœ˜Vi«Ìð -ÕV… “œ`iÃ >Ài œ˜Þ À>ÀiÞ Ài«œÀÌi` Q{ÈR] iÛi˜ ̅œÕ}… ̅i iÝ«ˆVˆÌ ˆ˜‡ ÌÀœ`ÕV̈œ˜ œv > Vœ˜Vi«ÌÕ> “œ`i œLۈœÕÃÞ Ãˆ}˜ˆvˆV>˜ÌÞ ˆ“«ÀœÛià ̅i ՘`iÀÃÌ>˜`ˆ˜} œv ̅i «ÀœLi“ >˜` “>Žià ̅i “œÃÌ ˆ“«œÀÌ>˜Ì >ÃÃՓ«Ìˆœ˜Ã >vviV̈˜} ̅i >VÌÕ> ÀiÃՏÌà œv >˜>ÞÃià ۈÈLi° -ÞÃÌi“>̈V >««Àœ>V…ià V>˜ Li ÕÃi` vœÀ ̅i `iÛiœ«“i˜Ì œv > Vœ˜Vi«ÌÕ> “œ`i QxÓR° +ՈÌi œvÌi˜ Vœ˜Vi«ÌÕ> “œ`iÃ …>` ̜ Li Ài«ÀiÃi˜Ìi` Ü ̅>Ì ˆÌ LiVœ“ià «œÃÈLi ̜ Vœ˜Ãˆ`iÀ ÃiÛiÀ> `ˆvviÀi˜Ì «iÀëiV̈ÛiÃ

ȓՏÌ>˜iœÕÃÞ Qx·xÈR° /…ˆÃ ˆÃ «>À̈VՏ>ÀÞ V…>i˜}ˆ˜} ˆv `>Ì> >Ài ÕÃi` vœÀ œÌ…iÀ «ÕÀ‡ «œÃià ̅>˜ œÀˆ}ˆ˜>Þ ˆ˜Ìi˜`i` QxÇR° ˜ ̅ˆÃ ÃÌÕ`Þ] > «À>}“>̈V “>««ˆ˜} LiÌÜii˜ Vœ˜‡ Vi«Ìà vÀœ“ > «iÀvœÀ“>˜Vi >ÃÃiÃÓi˜Ì ̅i‡ œÀÞ] - “iÌ>`>Ì>] >˜` ÃÌÀÕVÌÕÀi` `>Ì> i˜ÌÀÞ ˆ˜ «ÀœÃ«iV̈Ûi >Õ`ˆÌ `>Ì> >˜` ÃiVœ˜`‡ >ÀÞ Ài}ˆÃÌiÀ `>Ì> Ü>à Vœ˜ÃÌÀÕVÌi`° ˜ ëˆÌi œv ̅i }i˜iÀ>ˆÌÞ œv ̅i VÕÀÀi˜Ì “œ`i] ˆÌ ˆÃ ˆ“ˆÌi` ˆ˜ ̅i Ãi˜Ãi ̅>Ì ˆÌ `i>Ã ܈̅ ˆ˜`ˆ‡ ۈ`Õ> iÛi Vœ˜Vi«Ìà œ˜Þ° œÀ Li˜V…“>ÀŽ‡ ˆ˜} «ÕÀ«œÃiÃ] «ÀœÛˆ`iÀ iÛi «…i˜œ“i˜> ̅>Ì }ˆÛi iÝ«>˜>̈œ˜Ã œv œLÃiÀÛi` `ˆvviÀi˜Vià LiÌÜii˜ «ÀœÛˆ`iÀà œv V>Ài ŜՏ` >Ãœ Li Vœ˜Ãˆ`iÀi`° "˜i ÛiÀÞ ˆ“«œÀÌ>˜Ì ˆ˜`ˆÛˆ`Õ> iÛi “i>ÃÕÀi q ˜>“iÞ VœÃÌà q ˆÃ >Ãœ “ˆÃȘ} vÀœ“ ̅i “œ`i° Ì ÜœÕ` …>Ûi ëiVˆ> “i>ÃÕÀi“i˜Ì «Àœ«iÀ̈iÃ] LiV>ÕÃi ˆÌ ˆÃ VՓՏ>̈Ûi ˆ˜ ̈“i° à iÝ«iVÌi`] ̅i Ài}ˆÃÌiÀ `>Ì> >VŽi` Vˆ˜ˆV> `iÌ>ˆ° œÜiÛiÀ] “œÃÌ œv ̅i “œÀi `iÌ>ˆi` “i>ÃÕÀià ˆ˜ ̅i >Õ`ˆÌ `>Ì> ÜiÀi >V‡ i̅œ`à ˜v i` xÉÓääÇ

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

xÈ{ -՘` iÌ >°

ˆ}° Ó

-Ì>Ìi `ˆ>}À>“à vœÀ Ài}ˆÃÌiÀ >˜` >Õ`ˆÌ `>Ì>

ÌÕ>Þ Ài>Ìi` ̜ «…i˜œ“i˜> ̅>Ì Vœ˜Ìˆ˜Õ‡ œÕÃÞ V…>˜}i ˆ˜ ̈“i° -ÕV… `>Ì> ÀiµÕˆÀi Ì>VˆÌ Ž˜œÜi`}i ̜ Li ÕÃivՏ ˆ˜ > “œÀi }i˜iÀ> Vœ˜ÌiÝÌ QxnR° "˜ ̅i œÌ…iÀ …>˜`] Ài}ˆÃÌiÀ `>Ì> …>Ûi > `>Ì> ÃÌÀÕVÌÕÀi ̅>Ì >œÜà ̅i Vœ“‡ «iÌi œLÃiÀÛ>̈œ˜ œv ˆ˜«>̈i˜Ì V>Ài …ˆÃ̜ÀÞ] >˜` ̅iÀivœÀi œÕÌ«iÀvœÀ“à «ÀœÃ«iV̈Ûi `>Ì> ˆ˜ ̅ˆÃ Ãi˜Ãi ˆv ܓi `i>Þ ˆ˜ `>Ì> «Àœ`ÕV‡ ̈œ˜ ˆÃ ̜iÀ>Li° >Žˆ˜} ÕÃi œv `i“œ‡ }À>«…ˆV] ÜVˆœiVœ˜œ“ˆV >˜` “i`ˆV>̈œ˜ …ˆÃ̜Àˆià `>Ì> q ܅ˆV… >Ài] ˆ˜ «Àˆ˜Vˆ«i] >Û>ˆ>Li ˆ˜ œÌ…iÀ ˆ˜˜ˆÃ… Ài}ˆÃÌiÀà q VœÕ` >Ãœ ˆ“«ÀœÛi ViÀÌ>ˆ˜ `iÌ>ˆÃ œv Ài}ˆÃÌiÀ `>Ì>] LÕÌ ˆÌ ÀiµÕˆÀià iÝÌi˜ÃˆÛi ܜÀŽ ̜ œLÌ>ˆ˜ >˜` «Ài«ÀœViÃà ÃÕV… q «œÃÈLÞ iÝ«i˜ÃˆÛi q `>Ì>° ˜ ̅ˆÃ ÃÌÕ`Þ] ˆÌ Ü>à >ÃÃՓi` ̅>Ì > …ˆ« vÀ>VÌÕÀi «>̈i˜Ìà ÜiÀi ˆ`i˜Ìˆvˆ>Li vÀœ“ ̅i Ài}ˆÃÌiÀ œÀ >Õ`ˆÌ `>Ì>° /…ˆÃ >ÃÃՓ«Ìˆœ˜ ˆÃ Û>ˆ` Ü v>À >à «>̈i˜Ìà ÜiÀi ÌÀi>Ìi` >Ì Ì…i œÀ‡ ̅œ«i`ˆV ˆ˜«>̈i˜Ì Ü>À` >Ì Ì…i ÕÕÃ>˜ŽœÃŽˆ ,i}ˆœ˜> œÃ«ˆÌ>] LiV>ÕÃi ˆÌ ˆÃ ÛiÀÞ Õ˜‡ ˆŽiÞ ̅>Ì Ìܜ Vœ“«iÌi] LÕÌ “ÕÌÕ>Þ `ˆv‡ viÀi˜Ì Ài}ˆÃÌÀ>̈œ˜Ã ܜՏ` “ˆÃà > «>̈i˜Ì° ˜ v>VÌ] iÛi˜ ̅i «>̈i˜Ìà ÌÀi>Ìi` Vœ˜ÃiÀÛ>̈ÛiÞ œÀ œ«iÀ>Ìi` œ˜ >Ì >˜œÌ…iÀ …œÃ«ˆÌ> >Ài ÀœÕ‡ ̈˜iÞ ÀiviÀÀi` ̜ >˜ œÀ̅œ«i`ˆV ˆ˜«>̈i˜Ì Ü>À` >˜` ܜՏ` Li Ài}ˆÃÌiÀi`° ˜ ̅iœÀÞ] ܓi «>̈i˜Ìà >`“ˆÌÌi` ̜ ̅i œÕÌ«>̈i˜Ì i“iÀ}i˜VÞ `i«>À̓i˜Ì ܅œ ÜiÀi ˜œÌ Àiȇ `i˜Ì ˆ˜ ̅i œ«iÀ>̈œ˜ >Ài> œv ̅i ÕÕÃ>˜‡ ŽœÃŽˆ ,i}ˆœ˜> œÃ«ˆÌ> >˜` ̅Õà `ˆÀiV̏Þ

ÌÀ>˜ÃviÀÀi` ̜ ̅iˆÀ œÜ˜ œV> …œÃ«ˆÌ>] VœÕ` …>Ûi Lii˜ “ˆÃÃi` vÀœ“ ̅i `>Ì>° œÜiÛiÀ] ˆ˜ œÕÀ ÃÌÕ`Þ V>«ÌÕÀi‡ÀiV>«ÌÕÀi >˜>ÞÃià iÃ̈‡ “>̈˜} ̅i ̜Ì> ˜Õ“LiÀ œv V>Ãià ÜiÀi ˜œÌ >««ˆV>Li] LiV>ÕÃi ̅iÀi Ü>à ÃÌÀÕVÌÕÀ> `i‡ «i˜`i˜VÞ LiÌÜii˜ `>Ì> ÜÕÀVià ­Ì…i œVVÕÀ‡ Ài˜Vi œv > «>̈i˜Ì ˆ˜ >Õ`ˆÌ `>Ì> >Ãœ ˆ˜`ˆV>Ìi` iÝ«iVÌi` œVVÕÀÀi˜Vi ˆ˜ Ài}ˆÃÌiÀ `>Ì>® Qx™] ÈäR° /…iÀivœÀi ̅i Vœ“«iÌi˜iÃà Ü>à iÛ>Õ‡ >Ìi` LÞ ˆ`i˜Ìˆvވ˜} > …ˆ« vÀ>VÌÕÀi V>Ãià vÀœ“ >Õ`ˆÌ `>Ì> >˜` Ài}ˆÃÌiÀ `>Ì>] >˜` ̅i˜ > ˆ˜Vœ“«>̈LˆˆÌˆià ÜiÀi V…iVŽi` >}>ˆ˜ÃÌ Ì…i ˆ˜vœÀ“>̈œ˜ ˆ˜ “i`ˆV> ÀiVœÀ`ð ˜ ̅ˆÃ Ãi˜Ãi] ̅i `ˆvviÀi˜Vià ˆ˜ Vœ“«iÌi˜iÃà Li‡ ÌÜii˜ `>Ì> ÜÕÀVià ˆ˜`ˆV>Ìi ̅i `ˆvviÀi˜Ì `ivˆ˜ˆÌˆœ˜Ã œv …ˆ« vÀ>VÌÕÀi œÀ iÀÀœÀà ˆ˜ Ài}‡ ˆÃÌiÀ `>Ì>° /…i Vœ“«iÌi˜iÃà œv >Õ`ˆÌ `>Ì> Ü>à i݇ Vii˜Ì] >˜` >Ãœ ̅i Vœ“«iÌi˜iÃà œv Ài}‡ ˆÃÌiÀ `>Ì> Ãii“i` ̜ Li ÛiÀÞ }œœ`° ˜ v>VÌ] ̅iÀi ˆÃ > Ìi˜`i˜VÞ Ìœ œÛiÀiÃ̈“>̈œ˜ ˆv ˜œ >««Àœ«Àˆ>Ìi `>Ì> >LÃÌÀ>V̈œ˜ ÀՏià >Ài ÕÃi`° œÀ iÝ>“«i] ˆÌ ˆÃ “œÀi Ài>ܘ>Li ̜ VœÕ˜Ì ˜iÜ …ˆ« vÀ>VÌÕÀi V>Ãià ̅>˜ iÛiÀÞ «>̈i˜Ì ܈̅ >˜ œ˜}œˆ˜} …ˆ«‡vÀ>VÌÕÀi‡Ài>Ìi` V>Ài i«ˆÃœ`i QÈ£] ÈÓR° œÜiÛiÀ] ՘iÃà ̅i Vˆ˜ˆ‡ V> Õ`}“i˜Ì œv > …ˆ« vÀ>VÌÕÀi V>Ãi ˆÃ >VVՇ À>ÌiÞ >˜` Vœ“«iÌiÞ ÀiVœÀ`i`] Ài}ˆÃÌiÀ‡ L>Ãi` iÃ̈“>Ìià vœÀ ̅i ˜Õ“LiÀà œv …ˆ« vÀ>V‡ ÌÕÀià >Ài «Àœ˜i ̜ Lˆ>ð /…i >««Àœ«Àˆ>Ìi˜iÃà œv ÃÕV… `>Ì> >LÃÌÀ>V̈œ˜ ÀՏià V>˜ Li «>À‡ ̈>Þ ÛiÀˆvˆi` LÞ Vœ“«>Àˆ˜} ̅i …œÃ«ˆÌ>‡

ëiVˆvˆV ˜Õ“LiÀà œv Vœ˜ÃiVṎÛi …ˆ« vÀ>V‡ ÌÕÀià `ÕÀˆ˜} «>À̈VՏ>À ̈“i «iÀˆœ`à q >Û>ˆ‡ >Li vÀœ“ «ÀœÃ«iV̈Ûi ÃÌÕ`ˆià q ̜ Ài}ˆÃÌiÀ‡ `>Ì>‡L>Ãi` iÃ̈“>Ìià vœÀ ̅i Ã>“i …œÃ«ˆÌ>Ã >˜` ̅i Ã>“i ̈“i «iÀˆœ`ð Ì “ÕÃÌ Li ˜œÌi` ̅>Ì iÛi˜ ̅ˆÃ Žˆ˜` œv > `ivˆ˜ˆÌˆœ˜ …>à ܓi `À>ÜL>VŽÃ ˆ˜ ˆ˜Vˆ`i˜Vi V>VՏ>̈œ˜Ã° /…i ˜Õ“LiÀ œv «>̈i˜Ìà ˆÃ …œÃ«ˆÌ>‡Ã«iVˆvˆV] LÕÌ Ì…i >Û>ˆ>Li ÀˆÃŽ‡«œ«Õ>̈œ˜ `>Ì> ÀiviÀà ̜ }iœ}À>«…ˆV> >Ài>à >˜` œ˜i V>˜ ̅iÀivœÀi >À}Õi ̅>Ì «>̈i˜Ìà ˆÛˆ˜} ˆ˜ ̅i Ã>“i >Ài> LÕÌ œ«iÀ>Ìi` ˆ˜ ܓi œÌ…iÀ …œÃ«ˆÌ> ŜՏ` >Ãœ Li VœÕ˜Ìi` >à V>ÃiÃ] >˜` «>̈i˜Ìà vÀœ“ œÌ…iÀ >Ài>à ŜՏ` Li iÝVÕ`i`° "ÕÀ `ivˆ˜ˆÌˆœ˜ vœÀ > }œ`i˜ ÃÌ>˜`>À` œv “i>ÃÕÀi“i˜Ì ˆ˜ ̅ˆÃ ÃÌÕ`Þ Ü>à …iÀ“i˜iṎV° ˜ «À>V̈Vi] ̅Àii ÌÞ«ià œv Vœ“«>ÀˆÃœ˜Ã LiÌÜii˜ `>Ì> ÜÕÀVià ÜiÀi Ṏˆâi` ­iÛi˜ ̅œÕ}… ̅i LœÕ˜`>Àˆià LiÌÜii˜ ÌÞ«ià >Ài ˜œÌ Vi>À VÕÌ®° /…i vˆÀÃÌ ÌÞ«i Ü>à >««ˆi` ˆ˜ >V‡ VÕÀ>VÞ Vœ“«>ÀˆÃœ˜Ã\ ̅i Ã>“i À>̅iÀ i>ÈÞ “i>ÃÕÀ>Li Û>Àˆ>Li Ü>à >Û>ˆ>Li ˆ˜ LœÌ… `>Ì> ÜÕÀViÃ] >˜` `iÌiVÌi` `ˆvviÀi˜Vià VœÕ` Li vÕÀ̅iÀ V…iVŽi` vÀœ“ ̅i “i`ˆV> ÀiVœÀ`à ­i°}° />Li ή° /…i ÃiVœ˜` ÌÞ«i Ü>à vœÀ ̅i Vœ“«>ÀˆÃœ˜ œv ̅iœÀïV>Þ ȓˆ>À ˆ˜vœÀ‡ “>̈œ˜ ̅>Ì …>à ÃÕLÃÌ>˜Ìˆ>Þ `ˆvviÀi˜Ì œ«iÀ‡ >̈œ˜>ˆâ>̈œ˜Ã ˆ˜ LœÌ… `>Ì> ÜÕÀVià ­i°}° ˆ}ð £ >˜` Ó®° /…i ̅ˆÀ` ÌÞ«i œv Vœ“«>ÀˆÃœ˜ Ü>à > Vœ“Lˆ˜>̈œ˜ œv ̅i vˆÀÃÌ Ìܜ ÌÞ«iÃ\ LœÌ… >««Àœ>V…ià V>˜ Li >««ˆi` ­i°}° >˜>‡ ÞÃià œv ̅i `iÌ>ˆi` vÀ>VÌÕÀi V>ÃÈvˆV>̈œ˜®°

i̅œ`à ˜v i` xÉÓääÇ

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

xÈx *iÀvœÀ“>˜Vi ÃÃiÃÓi˜Ì œv ˆ« À>VÌÕÀi /Ài>̓i˜Ì

/…i ŽiÞ ˆ`i> Ü>à ̜ “>Ži ÛiÀÃ>̈i Vœ“«>Àˆ‡ ܘà ̅>Ì >Ài “i̅œ`œœ}ˆV>Þ ÕÃ̈vˆi` >˜` «À>V̈V>Þ ÀiiÛ>˜Ì° /…i >VVÕÀ>VÞ œv ̅i “œÃÌ i>ÈÞ “i>ÃÕÀ‡ >Li Û>Àˆ>Lià ˆ˜ ̅i Ài}ˆÃÌiÀ ˆÃ ÛiÀÞ }œœ`° œÜiÛiÀ] ̅i `>Ì> ÃÌÀÕVÌÕÀi vœÀ ̅i ÀiVœÀ`‡ ˆ˜} œv «iÀvœÀ“i` œ«iÀ>̈œ˜Ã ˆÃ ˜œÌ œ«Ìˆ“> ˆ˜ ̅i Ài}ˆÃÌiÀ° œÀ iÝ>“«i] œ˜Þ œ˜i œ«iÀ‡ >̈œ˜ `>Þ ˆÃ >œÜi` vœÀ œ˜i …œÃ«ˆÌ>ˆâ>̈œ˜ iÛi˜ ̅œÕ}… ̅i «>̈i˜Ì “>Þ Li Ài‡œ«iÀ>Ìi` œ˜ `ÕÀˆ˜} ̅i Ã>“i …œÃ«ˆÌ>ˆâ>̈œ˜°/…i Vœ˜‡ Ìi˜Ì œv ̅iÃi vˆi`à VœÕ` >Ãœ Li ˆ“«ÀœÛi` LÞ }ˆÛˆ˜} ÃÌ>˜`>À`ˆâi` ˆ˜ÃÌÀÕV̈œ˜Ã ̅>Ì }Ո`i «À>V̈̈œ˜iÀà ̜ >Ãœ ÀiVœÀ` ̅i Ài‡ iÛ>˜Ì >``ˆÌˆœ˜> œ«iÀ>̈œ˜ Vœ`ià >Û>ˆ>Li ˆ˜ ̅i V>ÃÈvˆV>̈œ˜] ÃÕV… >à ܅ˆV… È`i ̅i vÀ>VÌÕÀi ˆÃ œ˜° -ÕV… >``ˆÌˆœ˜> “iÌ>`>Ì> ŜՏ` >Ãœ Li vi>ÈLi ̜ ÕÃi ̜ ˆ“«ÀœÛi ̅i µÕ>ˆÌÞ œv Ài}ˆÃÌiÀ `>Ì> ˆ˜ }i˜iÀ> ˆ˜ V>Ãià ܅iÀi ̅i ÕÃi œv ÃÌÀÕVÌÕÀi` `>Ì> i˜ÌÀˆià ˆÃ Ž˜œÜ˜ ̜ Û>ÀÞ LiÌÜii˜ Vœ`iÀà QΙ] ÈÎ] È{R° œÀ ̅i «ÕÀ«œÃià œv «iÀvœÀ“>˜Vi >ÃÃiÃÇ “i˜Ì] vœœÜ‡Õ« ˆ˜vœÀ“>̈œ˜ ˆÃ «>À̈VՏ>ÀÞ iÃÃi˜Ìˆ>° œÀÌ>ˆÌÞ >˜` i˜}̅ œv ˆ˜ˆÌˆ> …œÃ«ˆÌ>ˆâ>̈œ˜ «iÀˆœ` ÜiÀi ÀiVœÀ`i` iµÕ>Þ ˆ˜ LœÌ… `>Ì> ÃiÌÃ] LÕÌ œÌ…iÀ܈Ãi ˆÌ Ü>à ˜œÌ i>ÃÞ Ìœ `iÀˆÛi ˆ˜vœÀ“>̈œ˜ ̅>Ì Ü>à iÝ>VÌÞ Vœ“«>̈Li° ˜vœÀ“>̈œ˜ œ˜ ̅i ÕÃi œv ˆ˜«>̈i˜Ì …i>̅ ÃiÀۈVià œ˜ > `>ˆÞ L>Èà Ü>à >Û>ˆ>Li `ˆÀiVÌÞ vÀœ“ ̅i Ài}ˆÃÌiÀ ܅ˆi `iÀˆÛˆ˜} ȓˆ>À ˆ˜vœÀ“>̈œ˜ vÀœ“ ̅i >Õ`ˆÌ `>Ì> ÀiµÕˆÀi` ViÀÌ>ˆ˜ >ÃÃՓ«Ìˆœ˜Ã >˜` ˆ˜ÌiÀ«œ>̈œ˜° /…i VœiV̈œ˜ œv vœœÜ‡Õ« ˆ˜vœÀ“>̈œ˜ vÀœ“ ̅i >Õ`ˆÌ `>Ì> ÀiµÕˆÀià iÝÌÀ> ܜÀŽ >˜` ˆÃ «Àœ˜i ̜ `ˆvviÀi˜Ì Lˆ>Ãià >à ˆÃ ̅i ÀœṎ˜i VœiV̈œ˜ œv Ài}ˆÃÌiÀ `>Ì>°

Ûi˜ ̅œÕ}… ̅i ÕÃi œv …i>Ì… ÃiÀۈVià ˆÃ ÀiVœÀ`i` LiÌÌiÀ ˆ˜ ̅i Ài}ˆÃÌiÀ] œÌ…iÀ “i>ÃÕÀi“i˜Ìà œv ÕÃivՏ Vœ˜Vi«Ìà ÃÕV… >à ÕÃi œv ÌiV…˜ˆV> >ˆ`] v՘V̈œ˜> `ˆÃ>LˆˆÌÞ >˜` «>ˆ˜ œvviÀ ˆ˜vœÀ“>̈œ˜ ̅>Ì ˆÃ ˜œÌ `ˆ‡ ÀiVÌÞ >Û>ˆ>Li vÀœ“ ̅i Ài}ˆÃÌiÀ° œÜiÛiÀ] LiV>ÕÃi ̅i Ài}ˆÃÌiÀ ˆ˜VÕ`ià > `ˆÃ>LˆˆÌÞ “i>ÃÕÀi] ˆÌ >VÌÕ>Þ œvviÀà Vœ“«i“i˜Ì>ÀÞ ˆ˜vœÀ“>̈œ˜ ­>Ì >`“ˆÃȜ˜] `ˆÃV…>À}i >˜` Vi˜ÃÕà «œˆ˜Ìî° /…ˆÃ ˆ˜vœÀ“>̈œ˜ ܜՏ` Li “œÀi Û>Õ>Li ˆv ܓi Vœ““œ˜Þ ÕÃi` v՘V̈œ˜> `ˆÃ>LˆˆÌÞ œÀ µÕ>ˆÌÞ œv ˆvi “i>ÃÕÀi …>` Lii˜ >Û>ˆ>Li ˆ˜ ̅i Ài}ˆÃÌiÀ ˆ˜ÃÌi>` œv > ˜œ˜‡Û>ˆ`>Ìi` >` …œV “i>ÃÕÀi QÈxR° "˜ ̅i œÌ…iÀ …>˜`] ̅i Ài}ˆÃÌiÀ ˆ˜VÕ`ià ˆ˜vœÀ“>̈œ˜ Vœ˜ViÀ˜ˆ˜} ̅i «ÀœÛˆ`iÀ‡Ã«i‡ VˆvˆV ÕÃi œv …i>Ì… ÃiÀۈVià ܈̅ `ˆ>}˜œÃ̈V‡

Ài>Ìi` }ÀœÕ« ˆ˜vœÀ“>̈œ˜ vœÀ ̅i ̜Ì> «>‡ ̈i˜Ì «œ«Õ>̈œ˜° /…ˆÃ >œÜà ̅i V>VՏ>̈œ˜ œv VœÃÌà LÞ ÃՓ“ˆ˜} Õ« ̅i ÃՈÌ>LÞ ­«ÀˆVi®‡ Üiˆ}…Ìi` ˆ˜vœÀ“>̈œ˜ œ˜ ̅i ÕÃi œv …i>Ì… ÃiÀۈVið /…ˆÃ Ã>“i ˆ`i> vœÀ V>VՏ>̈˜} VœÃÌà V>˜ Li vÕÀ̅iÀ Ṏˆâi`] ̅œÕ}… ̅i Üiˆ}…Ìà ̅ˆÃ ̈“i VœÀÀi뜘` ˜œÌ ̜ «ÀˆVià LÕÌ Ìœ ̅i “i>˜ …i>Ì… œÀ `ˆÃ>LˆˆÌÞ ÃÌ>ÌÕà œv «iÀܘà ÀiViˆÛˆ˜} ëiVˆvˆV vœÀ“à œv V>Ài] >˜` ̅ˆÃ ÀiÃՏÌà ˆ˜ …i>Ì… ÃÌ>ÌÕà “i>ÃÕÀi ܅ˆV… ˆÃ ՘Lˆ>Ãi` ­ˆ˜ ̅i Ã>“i Ãi˜Ãi >à VœÃÌà V>˜ Li ՘Lˆ>Ãi`® >Ì Ì…i «œ«Õ>̈œ˜ iÛi° Ý>“ˆ˜>̈œ˜ œv ÃÕV… ˆ`i>à ˆÃ œÕÌ œv ̅i ÃVœ«i œv ̅ˆÃ ÃÌÕ`Þ] LÕÌ “>Þ œvviÀ vÀՈÌvՏ `ˆÀiV̈œ˜Ã vœÀ vÕÀ̅iÀ ÀiÃi>ÀV…° à > Vœ˜VÕȜ˜] Ài}ˆÃÌiÀ >˜` >Õ`ˆÌ `>Ì> LœÌ… …>Ûi ViÀÌ>ˆ˜ ˆ“ˆÌ>̈œ˜Ã >˜` «ÀœLi“Ã] LÕÌ Ãii“ ̜ Li ÃՈÌ>Li vœÀ ̅i «iÀvœÀ“>˜Vi >ÃÃiÃÓi˜Ì œv …ˆ« vÀ>VÌÕÀi ÌÀi>̓i˜Ì° 6œÕ˜‡ Ì>ÀÞ ˆ˜«ÕÌ œv >``ˆÌˆœ˜> …ˆ« vÀ>VÌÕÀi iÛi˜Ì `>Ì> ̜ ̅i Ài}ˆÃÌiÀ ŜՏ` Li “>`i «œÃÈLi] LiV>ÕÃi ̅i LiÃÌ Vœ“«Àœ“ˆÃi vœÀ «À>V̈V> «iÀvœÀ“>˜Vi >ÃÃiÃÓi˜Ì «ÕÀ«œÃià ܜՏ` Li Ài>V…i` LÞ > «ÀœÃ«iV̈Ûi ÀiVœÀ`ˆ˜} œv …ˆ« vÀ>VÌÕÀi iÛi˜Ì `>Ì> Vœ“Lˆ˜i` ܈̅ iÝÌÀ>V‡ ̈œ˜ œv V>Ài …ˆÃ̜Àˆià œv ̅iÃi «>̈i˜Ìà vÀœ“ ̅i Ài}ˆÃÌiÀð VŽ˜œÜi`}“i˜Ìà /…ˆÃ ܜÀŽ Ü>à vˆ˜>˜Vˆ>Þ ÃÕ««œÀÌi` LÞ > ÀiÃi>ÀV… }À>˜Ì vÀœ“ ̅i V>`i“Þ œv ˆ˜>˜`° /…i v՘`ˆ˜} >}Àii“i˜Ì i˜ÃÕÀi` ̅i >Õ̅œÀÿ ˆ˜`i«i˜`i˜Vi ˆ˜ `i‡ È}˜ˆ˜} ̅i ÃÌÕ`Þ] ˆ˜ÌiÀ«Àï˜} ̅i `>Ì>] >˜` ÜÀˆÌˆ˜} >˜` «ÕLˆÃ…ˆ˜} ̅i Ài«œÀÌ°

,iviÀi˜Vià £° `>Þ ] i}iÞ

] >ˆÀܘ ,] ->ÌiÀ °

Û>Õ>̈˜} ̅i …i>Ì…V>Ài ÃÞÃÌi“\ vviV̈Ûi˜iÃÃ] ivvˆVˆi˜VÞ] >˜` iµÕˆÌÞ° Ә` i`ˆÌˆœ˜° …ˆV>}œ\ i>Ì… `“ˆ˜ˆÃÌÀ>̈œ˜ *ÀiÃÃÆ £™™n° Ó° " ° i>ÃÕÀˆ˜} Õ« q “«ÀœÛˆ˜} …i>Ì… ÃÞÃÌi“ «iÀvœÀ“>˜Vi ˆ˜ " VœÕ˜ÌÀˆið *>ÀˆÃ\ "À}>˜ˆ‡ Ã>̈œ˜ vœÀ Vœ˜œ“ˆV œ‡œ«iÀ>̈œ˜ >˜` iÛi‡ œ«“i˜ÌÆ ÓääÓ° ΰ >Ó>˜ ] ->vÀ>˜ ] />Ži`> ° +Õ>ˆÌÞ œv …i>Ì… V>Ài\ ˆ˜vœÀ“>̈Và vœÕ˜`>̈œ˜Ã° i̅œ`à ˜v i` ÓääÎÆ {Ó ­x®\ x䙇£n° {° ““i˜ÜiÀ̅ ] >ÕÝ ,] ՏˆŽœÜΈ ] œ…˜i] À>˜`˜iÀ ,] Àˆ} ] iÌ >° i`ˆV> ˆ˜vœÀ“>̈Và >˜` ̅i µÕ>ˆÌÞ œv …i>Ì…\ ˜iÜ >««Àœ>V…ià ̜ ÃÕ«‡ «œÀÌ «>̈i˜Ì V>Ài q vˆ˜`ˆ˜}à vÀœ“ ̅i  9i>À‡ LœœŽ œv i`ˆV> ˜vœÀ“>̈Và Óääΰ i̅œ`à ˜v i` ÓääÎÆ {Ó ­Ó®\ £nx‡™° x° ““i˜ÜiÀ̅ ] …iÀà ] ˆV…ÃÌ>`ÌiÀ ,] >ÕÝ ,] *œ… 1] ,iÃV… ° -ÞÃÌi“à >˜>ÞÈà ˆ˜ …i>Ì… V>Ài\ vÀ>“iܜÀŽ >˜` iÝ>“«i° i̅œ`à ˜v i` ÓääÓÆ {£ ­Ó®\ £Î{‡{ä°

È° -Ìiv>˜iˆ ° ˜œÜi`}i “>˜>}i“i˜Ì ̜ ÃÕ««œÀÌ «iÀvœÀ“>˜Vi‡L>Ãi` “i`ˆVˆ˜i° i̅œ`à ˜v i` ÓääÓÆ {£ ­£®\ Îȇ{ΰ Ç° ˆiÃV…ˆ ] ÕvœÕÀ  ] -Ì>VVˆ˜ˆ *] œÕÛiÀ˜iÌ ] œÕ…>``œÕ "° i`ˆV> `iVˆÃˆœ˜ ÃÕ««œÀÌ ÃÞÃÌi“Ã\ œ` `ˆi““>à >˜` ˜iÜ «>À>`ˆ}“ö i̅œ`à ˜v i` ÓääÎÆ {Ó ­Î®\ £™ä‡n° n° iœ˜>À` ° /œÌ> µÕ>ˆÌÞ ˆ˜ ˆ˜vœÀ“>̈œ˜ ÃÞÃÌi“à “>˜>}i“i˜Ì\ ˆÃÃÕià vœÀ ̅i …i>Ì… V>Ài ˆ˜`ÕÃÌÀÞ° i̅œ`à ˜v i` £™™nÆ ÎÇ ­Ó®\ £xȇÈä° ™° ގB˜i˜ *] >Àˆ“>> ° -ÕVViÃà >˜` v>ˆÕÀi v>V‡ ̜Àà ˆ˜ ̅i Ài}ˆœ˜> …i>Ì… ˆ˜vœÀ“>̈œ˜ ÃÞÃÌi“ `i‡ È}˜ «ÀœViÃà q ÀiÃՏÌà vÀœ“ > Vœ˜ÃÌÀÕV̈Ûi iÛ>Õ‡ >̈œ˜ ÃÌÕ`Þ° i̅œ`à ˜v i` ÓääÈÆ {x ­£®\ nx‡™° £ä° -Ìiv>˜iˆ ° ˜œÜi`}i >˜` «ÀœViÃà “>˜>}i‡ “i˜Ì ˆ˜ …i>Ì… V>Ài œÀ}>˜ˆâ>̈œ˜Ã° i̅œ`à ˜v i` Óää{Æ {Î ­x®\ xÓx‡Îx° ££° >À`i -] 7œvv  ] ÕÌÃV…> 1] 7iÌÌiÀ /] ˜>Õ« *°

-‡- q œ˜Vi«Ìà vœÀ ӜœÌ… ˆ˜Ìi}À>̈œ˜ œv …i>Ì… V>Ài ˆ˜vœÀ“>̈œ˜ ÃÞÃÌi“ Vœ“«œ˜i˜Ìà ˆ˜Ìœ iÃÌ>LˆÃ…i` «ÀœViÃÃià œv «>̈i˜Ì V>Ài° i̅œ`à ˜v i` ÓääÈÆ {x ­£®\ £ä‡n° £Ó° " °  `ˆÃi>Ãi‡L>Ãi` Vœ“«>ÀˆÃœ˜ œv …i>Ì… ÃÞÃÌi“ð 7…>Ì ˆÃ LiÃÌ >˜` >Ì Ü…>Ì VœÃ̶ *>ÀˆÃ\ "À‡ }>˜ˆÃ>̈œ˜ vœÀ Vœ˜œ“ˆV œ‡œ«iÀ>̈œ˜ >˜` iÛi‡ œ«“i˜ÌÆ Óääΰ £Î° À>`Þ ] -V…Ü>ÀÌâ ] i`ˆÌœÀð i`ˆV> ivviV‡ ̈Ûi˜iÃà ÀiÃi>ÀV… `>Ì> “i̅œ`ð ,œVŽÛˆi\ }i˜‡ VÞ vœÀ i>Ì… >Ài *œˆVÞ >˜` ,iÃi>ÀV…Æ £™™Ó° £{° Փ>À ] -“ˆÌ… ] *ˆÃ>˜iˆ ] >˜}i“ˆ ] -Ìiv>˜iˆ ° ˆ˜ˆV> }Ո`iˆ˜ià >à «>˜Ã q >˜ œ˜Ìœœ}ˆV> ̅iœÀÞ° i̅œ`à ˜v i` ÓääÈÆ {x ­Ó®\ Óä{‡£ä° £x° …>ˆ 7] > , ] Å -] ,œÃi˜ ] œÃŽœ‡ ܈Ìâ ° Û>Õ>̈œ˜ œv Vœ“«ˆV>̈œ˜ À>Ìià >vÌiÀ VœÀœ˜>ÀÞ >ÀÌiÀÞ LÞ«>Ãà ÃÕÀ}iÀÞ ÕȘ} >`“ˆ˜ˆÃ‡ ÌÀ>̈Ûi `>Ì>° i̅œ`à ˜v i` £™™nÆ ÎÇ ­Ó®\ £™Ó‡Óää° £È° ½œœÀi 7] -ˆVœÌÌi ] /ˆµÕˆ˜ ° ,ˆÃŽ >`ÕÃ̓i˜Ì ˆ˜ œÕÌVœ“i >ÃÃiÃÓi˜Ì\ ̅i …>ÀÃœ˜ Vœ“œÀLˆ`ˆÌÞ ˆ˜`iÝ° i̅œ`à ˜v i` £™™ÎÆ ÎÓ ­x®\ ÎnӇǰ £Ç° "½ œ˜˜i ,] ˆ“ ° 1̈ˆÌÞ œv ̅i …>ÀÃœ˜ Vœ“œÀLˆ`ˆÌÞ ˆ˜`iÝ Vœ“«ÕÌi` vÀœ“ ÀœṎ˜iÞ Vœ‡ iVÌi` …œÃ«ˆÌ> `ˆÃV…>À}i `ˆ>}˜œÃˆÃ Vœ`ið i̅œ`à ˜v i` ÓäääÆ Î™ ­£®\ LJ££° £n° iâ✘ˆ ° ,ˆÃŽ `ÕÃ̓i˜Ì vœÀ “i>ÃÕÀˆ˜} …i>Ì… V>Ài œÕÌVœ“ið /…ˆÀ` i`ˆÌˆœ˜° …ˆ}>}œ\ i>Ì… `“ˆ˜ˆÃÌÀ>̈œ˜ *ÀiÃÃÆ Óääΰ £™° /Ài“ ] ÀœŽiÀ * ° ˆ« vÀ>VÌÕÀi >Õ`ˆÌ\ ̈“i vœÀ ÃÌ>˜`>À`à ̜ Li iۈ`i˜Vi‡L>Ãi`° ˜Ì "À̅œ« ÓäääÆ Ó{ ­{®\ £n£‡Î° Óä° *>ÀŽiÀ ] ÕÀÀˆi /] œÕ˜Ì>ˆ˜ ] /…œÀ˜}Ài˜ ° -Ì>˜`>À`ˆÃi` >Õ`ˆÌ œv …ˆ« vÀ>VÌÕÀi ˆ˜ ÕÀœ«i ­- ®° ˆ« ˜Ì £™™nÆ n ­£®\ £ä‡x° Ó£° iˆŽŽˆ˜i˜ /] 7ˆˆ} ,] B˜˜ˆ˜i˜ ] œÃŽˆ˜i˜ ] >˜˜ˆÃ“BŽˆ *] >Û>ˆŽŽœ ] iÌ >° ˆ« vÀ>VÌÕÀià ˆ˜ ˆ˜>˜` q > Vœ“«>ÀˆÃœ˜ œv «>̈i˜Ì V…>À>VÌiÀˆÃ̈Và >˜` œÕÌVœ“ià ˆ˜ ÃˆÝ …œÃ«ˆÌ>Ã° -V>˜`  -ÕÀ} Óää{Æ ™Î ­Î®\ ÓÎ{‡{ä° ÓÓ° ÕÀ“ˆ ] >Àˆ˜i˜ ] Ø̅i *] />˜˜ˆ˜i˜ -° œÃÌ >˜>ÞÈà œv …ˆ« vÀ>VÌÕÀi ÌÀi>̓i˜Ì >“œ˜} ̅i i`iÀÞ vœÀ ̅i «ÕLˆV …i>Ì… ÃiÀۈViÃ\ > £‡Þi>À «ÀœÃ«iV̈Ûi ÃÌÕ`Þ ˆ˜ £äÈ Vœ˜ÃiVṎÛi «>̈i˜Ìð ÀV… "À̅œ«/À>Փ> -ÕÀ} ÓääÎÆ £ÓÎ ­£ä®\ xx£‡{°

i̅œ`à ˜v i` xÉÓääÇ Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

xÈÈ -՘` iÌ >°

Óΰ ÕÀ“ˆ ] >Àˆ˜i˜ ] Ø̅i *] />˜˜ˆ˜i˜ -° ՘V‡ ̈œ˜> œÕÌVœ“i >˜` ÃÕÀۈÛ> >vÌiÀ …ˆ« vÀ>VÌÕÀi ˆ˜ i`iÀÞ\ > «Àœ«iV̈Ûi ÃÌÕ`Þ œv £äÈ Vœ˜ÃiVṎÛi «>‡ ̈i˜Ìð  "À̅œ«>i` /À>Փ>̜ Óää{Æ x ­£®\ LJ£{° Ó{° *œÜi  ] >ۈià /] /…œ“Ãœ˜ ,° 1Ș} ÀœÕ‡ ̈˜i Vœ“«>À>̈Ûi `>Ì> ̜ >ÃÃiÃà ̅i µÕ>ˆÌÞ œv …i>Ì… V>Ài\ ՘`iÀÃÌ>˜`ˆ˜} >˜` >ۜˆ`ˆ˜} Vœ““œ˜ «ˆÌv>Ã° +Õ> ->v i>Ì… >Ài ÓääÎÆ £Ó ­Ó®\ £ÓӇn° Óx° ,ˆÃÃ>˜i˜ *] -՘` ,] œÀ`L>VŽ ] ,œÕÈ /] `B˜‡ «BB˜‡iˆŽŽˆB 1° ,i}ˆÃÌiÀ L>Ãi` ivviV̈Ûi˜iÃà “i>ÃÕÀi“i˜Ì ˆ˜ ̅i V>Ãi œv …ˆ« vÀ>VÌÕÀi ÌÀi>̓i˜Ì ­ˆ˜ ˆ˜˜ˆÃ…®° iÃˆ˜Žˆ\ >̈œ˜> ,iÃi>ÀV… >˜` i‡ Ûiœ«“i˜Ì i˜ÌÀi vœÀ 7iv>Ài >˜` i>Ì…Æ ÓääÓ° ÓÈ° -՘` ,° 1̈ˆÃ>̈œ˜ œv >`“ˆ˜ˆÃÌÀ>̈Ûi Ài}ˆÃÌiÀà ÕȘ} ÃVˆi˜ÌˆvˆV Ž˜œÜi`}i `ˆÃVœÛiÀÞ° ˜Ìiˆ}i˜Ì >Ì> ˜>ÞÈà ÓääÎÆ Ç ­È®\ x䣇£™° ÓÇ° -՘` ,] ˆÃŽˆ ° +Õ>ˆÌÞ ivviVÌà œv œ«iÀ>̈Ûi `i>Þ œ˜ “œÀÌ>ˆÌÞ ˆ˜ …ˆ« vÀ>VÌÕÀi ÌÀi>̓i˜Ì° +Õ> ->v i>Ì… >Ài ÓääxÆ £{ ­x®\ ÎÇ£‡Ç° Ón° ˆÃÏiÀ ] >ՎŽ> ° ˆ˜˜ˆÃ… …i>Ì… >˜` ÜVˆ> Üiv>Ài Ài}ˆÃÌiÀà ˆ˜ i«ˆ`i“ˆœœ}ˆV> ÀiÃi>ÀV…° œÀÎ «ˆ`i“ˆœœ}ˆ Óää{Æ £{ ­£®\ ££Î‡Óä° Ó™° ,œœÃ ] 7>`> ° ,iVœÀ` ˆ˜Ž>}i ÃÌÀ>Ìi}ˆið *>ÀÌ \ Ã̈“>̈˜} ˆ˜vœÀ“>̈œ˜ >˜` iÛ>Õ>̈˜} >«‡ «Àœ>V…ið i̅œ`à ˜v i` £™™£Æ Îä ­Ó®\ ££Ç‡Óΰ Îä° 7>`> ] ,œœÃ ] >ÞivÃŽÞ ] -ˆ˜}i̜˜ ° ,iVœÀ` ˆ˜Ž>}i ÃÌÀ>Ìi}ˆiÃ\ *>ÀÌ ° *œÀÌ>Li Üv̇ Ü>Ài >˜` `iÌiÀ“ˆ˜ˆÃ̈V “>ÌV…ˆ˜}° i̅œ`à ˜v i` £™™£Æ Îä ­Î®\ Ó£ä‡{° Σ° "LiÀ>ˆ}˜iÀ 7] -ÌՅˆ˜}iÀ 7° ,iVœÀ` ˆ˜Ž>}i ˆ˜ ̅i >˜ViÀ ,i}ˆÃÌÀÞ œv /ÞÀœ] ÕÃÌÀˆ>° i̅œ`à ˜v i` ÓääxÆ {{ ­x®\ ÈÓȇÎä° ÎÓ° iΈ“BŽˆ ] Àœ -° VVÕÀ>VÞ œv `>Ì> œ˜ `ˆ>}‡ ˜œÃiÃ] «ÀœVi`ÕÀià >˜` >VVˆ`i˜Ìà ˆ˜ ̅i ˆ˜˜ˆÃ… œÃ«ˆÌ> ˆÃV…>À}i ,i}ˆÃÌiÀ° ˜Ì  i>Ì… -Vˆ £™™£Æ Ó ­£®\ £x‡Ó£° Îΰ >…̈ ,] *i˜Ì̈B ° /…i Û>ˆ`ˆÌÞ œv `i>̅ ViÀ̈vˆ‡ V>ÌiÃ\ ÀœṎ˜i Û>ˆ`>̈œ˜ œv `i>̅ ViÀ̈vˆV>̈œ˜ >˜` ˆÌà ivviVÌà œ˜ “œÀÌ>ˆÌÞ ÃÌ>̈Ã̈Vð œÀi˜ÃˆV -Vˆ ˜Ì Óää£Æ ££x ­£‡Ó®\ £x‡ÎÓ° Î{° i««BB ] 6ˆÀÌ>“œ ] iˆ˜œ˜i˜ "*° 6>ˆ`>̈œ˜ œv ÃÌÀœŽi `ˆ>}˜œÃˆÃ ˆ˜ ̅i >̈œ˜> œÃ«ˆÌ> ˆÃ‡ V…>À}i ,i}ˆÃÌiÀ >˜` ̅i ,i}ˆÃÌiÀ œv >ÕÃià œv i>̅ ˆ˜ ˆ˜>˜`° ÕÀ  «ˆ`i“ˆœ £™™™Æ £x ­Ó®\ £xx‡Èä° Îx° *>Õ˜i˜ *] œÕŽŽÕ˜i˜ ] i̜˜i˜ ] iÀŽŽœ> /] ““œ˜i˜‡,Bˆ…B *] BÀB‡œÃŽi˜Ž>Àˆ *] iÌ >° /…i Û>ˆ`ˆÌÞ œv ̅i ˆ˜˜ˆÃ… œÃ«ˆÌ> ˆÃV…>À}i ,i}ˆÃÌiÀ >˜` >ÕÃià œv i>̅ ,i}ˆÃÌiÀ `>Ì> œ˜ VœÀœ˜>ÀÞ …i>ÀÌ `ˆÃi>Ãi° ÕÀ  >À`ˆœÛ>ÃV *ÀiÛ ,i‡ …>Lˆ ÓääxÆ £Ó ­Ó®\ £ÎӇǰ ÎÈ° Ø̅i *] ÕÀ“ˆ ] >Ì>> ] iˆŸÛ>>À> ] ->˜Ì>‡ ۈÀÌ> -° ˜Vˆ`i˜Vi œv «iÛˆV vÀ>VÌÕÀià ˆ˜ ˆ˜>˜` ˆ˜ £™nn° VÌ> "À̅œ« -V>˜` £™™xÆ ÈÈ ­Î®\ Ó{x‡n° ÎÇ° ÕÕΜ /] >À««ˆ *] ۈŽ>ˆ˜i˜ 6] >Ṏ>ˆ˜i˜ ] -ՏŽ>Û> ,° /…i V…>˜}ˆ˜} «ˆVÌÕÀi œv …ˆ« vÀ>VÌÕÀiÃ\ `À>“>̈V V…>˜}i ˆ˜ >}i `ˆÃÌÀˆLṎœ˜ >˜` ˜œ V…>˜}i ˆ˜ >}i‡>`ÕÃÌi` ˆ˜Vˆ`i˜Vi ܈̅ˆ˜ £ä Þi>Àà ˆ˜ i˜‡ ÌÀ> ˆ˜>˜`° œ˜i £™™™Æ Ó{ ­Î®\ ÓxLJ™°

În° -œÀi˜Ãi˜ /] ->LÀœi -] "Ãi˜ °  vÀ>“iܜÀŽ vœÀ iÛ>Õ>̈œ˜ œv ÃiVœ˜`>ÀÞ `>Ì> ÜÕÀVià vœÀ i«ˆ`i“ˆ‡ œœ}ˆV> ÀiÃi>ÀV…° ˜Ì  «ˆ`i“ˆœ £™™ÈÆ Óx ­Ó®\ {Îx‡{Ó° Ι° 7ˆ˜Ì…iÀiˆŽ ,° º7i vˆ ˆ˜ œÕÀ ܜÀŽˆ˜} ՘`iÀ‡ ÃÌ>˜`ˆ˜}»\ œ˜ Vœ`iÃ] V>ÃÈvˆV>̈œ˜Ã >˜` ̅i «Àœ‡ `ÕV̈œ˜ œv >VVÕÀ>Ìi `>Ì>° i̅œ`à ˜v i` ÓääÎÆ {Ó ­{®\ {n™‡™È° {ä° œÀ˜LÀœœŽ  ] ÕÀÌ>`œ 6] œ…˜Ãœ˜ , ° i>Ì… V>Ài i«ˆÃœ`iÃ\ `ivˆ˜ˆÌˆœ˜] “i>ÃÕÀi“i˜Ì >˜` ÕÃi° i` >Ài ,iÛ £™nxÆ {Ó ­Ó®\ £È·ӣn° {£° …ˆ˜ ] ÕÀ>“>ÌÃÕ ° 7…>Ì ˆÃ ̅i µÕ>ˆÌÞ œv µÕ>ˆÌÞ œv “i`ˆV> V>Ài “i>ÃÕÀiö ,>Ŝ“œ˜‡ˆŽi Ài>̈ۈÓ >˜` Ài>‡ÜœÀ` >««ˆV>̈œ˜Ã° *iÀëiVÌ ˆœ i` ÓääÎÆ {È ­£®\ x‡ÓäÆ `ˆÃVÕÃȜ˜ Ó£‡Î° {Ó° Õ`ˆ}i ] …>˜`>Àˆ ] i>“ ° œÜ Àiˆ>Li >Ài Àiˆ>LˆˆÌÞ ÃÌÕ`ˆià œv vÀ>VÌÕÀi V>ÃÈvˆV>̈œ˜Ã¶  ÃÞÃÌi“>̈V ÀiۈiÜ œv ̅iˆÀ “i̅œ`œœ}ˆið VÌ> "À̅œ« -V>˜` Óää{Æ Çx ­Ó®\ £n{‡™{° {ΰ ˜} ° ,i«ÀiÃi˜Ì>̈œ˜Ã œv …i>Ì… Vœ˜Vi«ÌÃ\ > Vœ}˜ˆÌˆÛi «iÀëiV̈Ûi°  ˆœ“i` ˜vœÀ“ ÓääÓÆ Îx ­£®\ £Ç‡Ó{° {{° 7À>Þ *] Å̜˜ ] Õގi˜`> ] œˆ˜}Ç ܜÀ̅  ° 1Ș} >`“ˆ˜ˆÃÌÀ>̈Ûi `>Ì>L>Ãià ̜ iÛ>Õ>Ìi ̅i µÕ>ˆÌÞ œv “i`ˆV> V>Ài\ > Vœ˜Vi«ÌÕ> vÀ>“iܜÀŽ° -œV -Vˆ i` £™™xÆ {ä ­£Ó®\ £ÇäLJ£x° {x° >˜i ,] i`ˆÌœÀ° 1˜`iÀÃÌ>˜`ˆ˜} …i>Ì… V>Ài œÕ̇ Vœ“ià ÀiÃi>ÀV…° >ˆÌ…iÀÃLÕÀ}\ ëi˜ *ÕLˆÃ…iÀÃÆ £™™Ç° {È° -…ÀœÞiÀ ] œ˜`œ˜ ] -i̅ˆ ] >ÀÅ> ] ÀœÛiÀ ] >““iÀ“iˆÃÌiÀ  ° ,i>̈œ˜Ã…ˆ«Ã LiÌÜii˜ «>̈i˜Ì‡Ài>Ìi` ÀˆÃŽ v>V̜ÀÃ] «ÀœViÃÃiÃ] ÃÌÀÕVÌÕÀiÃ] >˜` œÕÌVœ“ià œv V>À`ˆ>V ÃÕÀ}ˆV> V>Ài°

œ˜Vi«ÌÕ> “œ`iÃ° i` >Ài £™™xÆ ÎÎ ­£ä -Õ««®\ "-ÓȇÎ{° {Ç° Àˆ«VÃ>Ž ] iˆÌ>˜ ° i>ÃÕÀˆ˜} >}Àii“i˜Ì ˆ˜ “i`ˆV> ˆ˜vœÀ“>̈Và Àiˆ>LˆˆÌÞ ÃÌÕ`ˆið  ˆœ“i` ˜vœÀ“ ÓääÓÆ Îx ­Ó®\ ™™‡££ä° {n° ˜`Àià ] >À✠*° iÌ>\  ˜iÜ “i>ÃÕÀi œv >}Àii“i˜Ì LiÌÜii˜ Ìܜ À>ÌiÀð À  >̅ -Ì>Ì *Ãއ V…œ Óää{Æ xÇ ­£®\ £‡£™° {™° ˜`Àià ] >À✠*° …>˜Vi‡VœÀÀiVÌi` “i>Ç ÕÀià œv Àiˆ>LˆˆÌÞ >˜` Û>ˆ`ˆÌÞ ˆ˜  y  Ì>Lið -Ì>Ì i̅œ`à i` ,ià ÓääxÆ £{ ­x®\ {Ç·™Ó° xä° Õ`ˆ}i ] …>˜`>Àˆ ] >˜Ãœ˜ ] i>“ °  Vœ˜Vi«Ì vœÀ ̅i Û>ˆ`>̈œ˜ œv vÀ>VÌÕÀi V>ÃÈvˆ‡ V>̈œ˜Ã°  "À̅œ« /À>Փ> ÓääxÆ £™ ­È®\ {䣇Ȱ x£° ŸÀŽ}Ài˜ ] Àˆià ] BŽŽˆ˜i˜ 1] Àœ““iÃ ° >Ãi‡“ˆÝ >`ÕÃ̓i˜Ì >˜` ivvˆVˆi˜VÞ “i>ÃÕÀi‡ “i˜Ì° -V>˜`  *ÕLˆV i>Ì… Óää{Æ ÎÓ ­È®\ {È{‡Ç£° xÓ° -V…˜>Li ° ,i«ÀiÃi˜Ìˆ˜} >˜` «ÀœViÃȘ} “i`ˆ‡ V> Ž˜œÜi`}i ÕȘ} vœÀ“> Vœ˜Vi«Ì >˜>ÞÈð i̅œ`à ˜v i` ÓääÓÆ {£ ­Ó®\ £Èä‡Ç° xΰ i}œÕiÌ *] ˆiÃV…ˆ ] …>ÌiˆiÀ ° iVˆÃˆœ˜ ÃÕ«‡ «œÀÌ ÃÞÃÌi“à vÀœ“ ̅i ÃÌ>˜`«œˆ˜Ì œv Ž˜œÜi`}i Ài«ÀiÃi˜Ì>̈œ˜° i̅œ`à ˜v i` £™™xÆ Î{ ­£‡Ó®\ ÓäӇn° x{° À>˜`Ì ] -՘ ] …>À«i˜ÌˆiÀ *] >`Ž>À˜ˆ *° ˜Ìi}À>̈œ˜ œv 7iL‡L>Ãi` >˜` * ‡L>Ãi` Vˆ˜ˆV>

ÀiÃi>ÀV… `>Ì>L>Ãið i̅œ`à ˜v i` Óää{Æ {Î ­Î®\ ÓnLJ™x° xx° À>˜`Ì ] œ…i˜ ] -…ˆv“>˜ ] ˆiÀ *] >`Ž>À˜ˆ *] À>܏iÞ -° ««Àœ>V…ià >˜` ˆ˜‡ vœÀ“>̈Và ̜œÃ ̜ >ÃÈÃÌ ˆ˜ ̅i ˆ˜Ìi}À>̈œ˜ œv ȓˆ‡ >À Vˆ˜ˆV> ÀiÃi>ÀV… µÕiÃ̈œ˜˜>ˆÀið i̅œ`à ˜v i` Óää{Æ {Î ­Ó®\ £xȇÈÓ° xÈ° Û>˜ ˆ˜˜iŽi˜ ° œ˜Ãˆ`iÀ>̈œ˜Ã vœÀ ̅i Ài«Ài‡ Ãi˜Ì>̈œ˜ œv “iÌ>‡`>Ì> vœÀ ̅i ÃÕ««œÀÌ œv ÃÌÀÕV‡ ÌÕÀi` `>Ì> i˜ÌÀÞ° i̅œ`à ˜v i` ÓääÎÆ {Ó ­Î®\ ÓÓȇÎx° xÇ° >ÕÝ ,° ˜œÜi`}i‡L>Ãi` `iVˆÃˆœ˜ ÃÕ««œÀÌ vœÀ `ˆ>}˜œÃˆÃ >˜` ̅iÀ>«Þ\ œ˜ ̅i “Տ̈«i ÕÃ>LˆˆÌÞ œv «>̈i˜Ì `>Ì>° i̅œ`à ˜v i` £™n™Æ Ón ­Ó®\ ș‡ÇÇ° xn° iÀ} ] œœÀ“>˜ ° /…i Vœ˜ÌiÝÌÕ> ˜>ÌÕÀi œv “i`ˆV> ˆ˜vœÀ“>̈œ˜° ˜Ì  i` ˜vœÀ“ £™™™Æ xÈ ­£‡Î®\ x£‡Èä° x™° 7 ° ˜ÌiÀ˜>̈œ˜> 7œÀŽˆ˜} ÀœÕ« vœÀ ˆÃ‡ i>Ãi œ˜ˆÌœÀˆ˜} >˜` œÀiV>Ã̈˜}° >«ÌÕÀi‡ÀiV>«‡ ÌÕÀi >˜` “Տ̈«i‡ÀiVœÀ` ÃÞÃÌi“à iÃ̈“>̈œ˜ \ ˆÃ̜ÀÞ >˜` ̅iœÀïV> `iÛiœ«“i˜Ì° “  «ˆ`i‡ “ˆœ £™™xÆ £{Ó ­£ä®\ £ä{LJxn° Èä° Ài˜˜iÀ ° ««ˆV>̈œ˜ œv V>«ÌÕÀi‡ÀiV>«ÌÕÀi “i̅œ`à vœÀ `ˆÃi>Ãi “œ˜ˆÌœÀˆ˜}\ «œÌi˜Ìˆ> ivviVÌà œv ˆ“«iÀviVÌ ÀiVœÀ` ˆ˜Ž>}i° i̅œ`à ˜v i` £™™{Æ ÎÎ ­x®\ xäӇȰ È£° >˜}iÞ ] -Ìi«…i˜Ãœ˜ -] ÀÞiÀ ] œÀ“>˜ ° /À>«Ã vœÀ ̅i ՘Ü>ÀÞ ˆ˜ iÃ̈“>̈˜} «iÀܘ L>Ãi` ˆ˜ÕÀÞ ˆ˜Vˆ`i˜Vi ÕȘ} …œÃ«ˆÌ> `ˆÃV…>À}i `>Ì>° ˜ *ÀiÛ ÓääÓÆ n ­{®\ ÎÎӇǰ ÈÓ° Ãœ«  ] >˜}iÞ  ° iÌiÀ“ˆ˜ˆ˜} vˆÀÃÌ >`“ˆÃ‡ Ȝ˜Ã ˆ˜ > …œÃ«ˆÌ> `ˆÃV…>À}i vˆi ۈ> ÀiVœÀ` ˆ˜Ž‡ >}i° i̅œ`à ˜v i` £™™nÆ ÎÇ ­£®\ ÎӇǰ Èΰ œÃ ,] ,œÕŽi“> ] Û>˜ ˆ˜˜iŽi˜ ] `i 7ˆ`i ] Û>˜ `iÀ iˆ ° Ài ÃÌÀÕVÌÕÀi` `>Ì> ÃÌÀÕVÌÕÀi` ˆ`i˜ÌˆV>Þ¶ ˜ÛiÃ̈}>̈˜} ̅i ՘ˆvœÀ“ˆÌÞ œv «i`ˆ‡ >ÌÀˆV «>̈i˜Ì `>Ì> ÀiVœÀ`i` ÕȘ} "«i˜- ° i̅œ`à ˜v i` ÓääxÆ {{ ­x®\ ÈΣ‡n° È{° 7ˆˆ>“à ° i>ÃÕÀˆ˜} ̅i Vœ“«iÌi˜iÃà >˜` VÕÀÀi˜VÞ œv Vœ`ˆvˆi` Vˆ˜ˆV> ˆ˜vœÀ“>̈œ˜° i̅‡ œ`à ˜v i` ÓääÎÆ {Ó ­{®\ {nӇn° Èx° >Þ ] 6œ˜`À> 6° i˜iÀˆV ÛiÀÃÕà `ˆÃi>Ãi‡Ã«i‡ VˆvˆV ˆ˜ÃÌÀՓi˜Ìà ˆ˜ µÕ>ˆÌއœv‡ˆvi >ÃÃiÃÓi˜Ì œv V…Àœ˜ˆV œLÃÌÀÕV̈Ûi «Õ“œ˜>ÀÞ `ˆÃi>Ãi° i̅œ`à ˜v i` ÓääÈÆ {x ­Ó®\ Ó££‡x°

œÀÀi뜘`i˜Vi ̜\ ,iˆœ -՘` >̈œ˜> ,iÃi>ÀV… >˜` iÛiœ«“i˜Ì i˜ÌÀi vœÀ 7iv>Ài >˜` i>Ì… ­-/ -® "ÕÌVœ“ià >˜` iµÕˆÌÞ ÀiÃi>ÀV… *°"° œÝ ÓÓä ääxΣ iÃˆ˜Žˆ ˆ˜>˜`

‡“>ˆ\ Àiˆœ°Ã՘`JÃÌ>Žiðvˆ

i̅œ`à ˜v i` xÉÓääÇ

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

III

Epidemiologic Perspectives & Innovations

BioMed Central

Open Access

Analytic Perspective

Utilization of routinely collected administrative data in monitoring the incidence of aging dependent hip fracture Reijo Sund* Address: National Research and Development Centre for Welfare and Health (STAKES), Health Services and Policy Research, P.O. Box 220, FIN00531 Helsinki, Finland Email: Reijo Sund* - [email protected] * Corresponding author

Published: 7 June 2007 Epidemiologic Perspectives & Innovations 2007, 4:2

doi:10.1186/1742-5573-4-2

Received: 13 October 2005 Accepted: 7 June 2007

This article is available from: http://www.epi-perspectives.com/content/4/1/2 © 2007 Sund; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract Societies are facing challenges as the public health burden increases in tandem with population aging. Local information systems are needed that would allow a continuous monitoring of the incidence and effectiveness of treatments. This study investigates the possibilities of routinely collected administrative data as a data source for hip fracture incidence monitoring in Finland. The study demonstrates that a straightforward use of register data results in biased estimates for the numbers of hip fractures. An interpretation of hip fractures from the population aging point of view offers an alternative perspective for hip fracture incidence calculation. This enables development of a generalizable method for probabilistic detection of starting points of hip fracture care episodes. Several risk factor and risk population extraction techniques required in registerbased data analyses are also demonstrated. Finally, it is shown that empirical evidence suggests that hip fracture incidence is proportional to population level disability prevalence. In conclusion, Finnish administrative data makes it possible to derive data for rather detailed population level risk factor stratification. Certain limitations of register-based data can be partly avoided by synthesizing data-sensitive methodological solutions during the analysis process.

Background

Societies are facing challenges as the public health burden increases in tandem with population aging. In order to fulfill the growing need for information for prevention and performance assessment purposes, local information systems are needed that would allow a continuous monitoring of the incidence and effectiveness of treatments for important health problems. However, additional data production requires funding and resources, and it would be beneficial if the required information could be produced using existing administrative data such as hospital discharge registers.

In fact, there is a great deal literature on doing descriptive epidemiology with administrative data [1,2]. The main problem with secondary administrative data is that the straightforward application of standard epidemiological practices may not be feasible, because the data collection cannot be tailored to meet the needs of the exact research problem, as with separate primary data collection. Therefore, it is obvious that the validity of secondary data depends on the research question in mind [3]. In other words, an extra interpretation and preprocessing phase that aims to find an adequate compromise between problem-driven and data-driven approaches becomes an important part of the research process [4]. Even though Page 1 of 11 (page number not for citation purposes)

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

Epidemiologic Perspectives & Innovations 2007, 4:2

this phase is commonly encountered while using secondary data in epidemiological studies, it is seldom reported explicitly. In fact, a more coherent report of the results can be achieved by stating that the required compromises were known a priori even if those choices were actually "findings" of the preprocessing phase. The drawback in underreporting is that the data utilization may seem to be easier than it really is. In the worst case, this may lead to uncritical and mechanical repetition of "good examples" in circumstances where the assumptions of those examples are not valid. This study examines the possibilities of routinely collected Finnish administrative data as a data source for an epidemiologic surveillance system. The basic idea is to demonstrate what kind of issues may require some "rethinking" during a concrete empirical research process that utilizes secondary data. Hip fracture incidence monitoring is used as an example application. First, an example is given of how conventional use of data turns out to produce improper estimates. Because data are not fully compatible with the traditional conceptualization, some alternative theoretical ideas originating from health services and aging research are used as additional links between the problem of hip fracture incidence monitoring and secondary data. After that several pragmatic issues confronted during the empirical research process on hip fracture incidence are examined from a methodological point of view. This study aims to offer practically useful methodological perspectives for the utilization of data by emphasizing the importance of continuous data-sensitive problem solving.

Analysis

Conventional approaches for determining the annual numbers of hip fractures by using hospital discharge data Hip fractures represent a worldwide major public health burden whose impact is expanding as the population ages, with hip fracture incidence rates increasing exponentially with age [5,6]. For an individual, hip fracture is a serious and painful condition that requires and invariably results in acute hospital treatment. Hip fracture is also relatively easy to diagnose (compared with many other health problems), and practically all recorded hip fracture diagnoses (regardless whether principal or secondary) in hospital discharge abstracts reflect hip fracture treatment. Therefore, it can be expected that hospital discharge data are a good source for the identification of hip fracture patients. Pitfalls in the use of hospital discharge data There are also several known potential pitfalls in using hospital discharge data for calculating the numbers of injures [7-10]. One general nuisance is that only hospitalized injuries are observed. In the case of hip fracture that

http://www.epi-perspectives.com/content/4/1/2

is not a problem, because virtually all patients with hip fracture require hospital treatment, which results in a discharge record with a hip fracture diagnosis. Another potential pitfall is related to multiple hospitalizations of a single patient. In principle, record linkage allows detection of multiple hospitalizations of the same patient, and the real problem is to define which hospitalizations should be considered as readmissions. There are also other (more or less data-source specific) problems — such as the use of diagnosis codes in the selection of cases — which are typically special cases of common challenges for the use of secondary register data in (epidemiological) research, and require careful data preprocessing that incorporates tacit knowledge formalized in terms of appropriate data abstraction rules [2-4]. Hospital discharge data in Finland In Finland, hospital discharge data are available in the Finnish Health Care Register which records data for all inpatient care discharges in institutions with 24-hour personnel and for outpatient surgical operations. Census data are also collected at the final day of each year in order to capture the ongoing care periods. The register is nationwide, which in international terms is exceptional in that it has such extensive legislative coverage of all public and private service providers. Each record in the register includes data on patient and provider ID-numbers, age, sex, area codes, and diagnosis and operation codes, as well as dates of admission, operation and discharge. Patient ID is a unique identification number given to all Finnish citizens and permanent residents. A system of unique identification numbers has been operating since 1968 and is used in all Finnish registers. Importantly, this system allows complete deterministic record linkage within and between Finnish registers [11]. Data quality is also shown to be good [12-18]. In the case of hip fracture, the completeness is very good and the accuracy of easily measurable variables is at least 95% [18]. Register based numbers of hip fractures in Finland Finnish register data have been previously used as a data source for the calculation of the number of hip fractures [19,20]. For the purposes of this study, all discharge abstracts with a primary or secondary hip fracture diagnosis (10th revision of the International Classification of Diseases diagnoses S72.0, S72.1 and S72.2) from 1998–2002 were identified in the Finnish Health Care Register. The mean number of hospital discharge records (including census records) with hip fracture diagnosis was 14430 per year during 1998–2002 in Finland (Table 1).

However, the annual numbers of hip fracture discharge records do not tell the actual number of hip fracture events. A typical care chain for a hip fracture patient consists of acute hospital treatment on a surgical ward and

Page 2 of 11 (page number not for citation purposes)

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

Epidemiologic Perspectives & Innovations 2007, 4:2

http://www.epi-perspectives.com/content/4/1/2

Table 1: Number of hip fractures in Finland 1998–2002

1998 14089 7817 7575

1999 13818 7661 7465

2000 14192 7643 7425

2001 14978 7924 7706

2002 15071 8030 7854

Mean 14430 7815 7605

Admissions with hip fracture diagnosis Of which: - recorded hip fracture operation - recorded hip fracture operation or at least two years gap to earlier record with hip fracture diagnosis - recorded hip fracture operation or at least two months gap to earlier record with hip fracture diagnosis

13219

12965

13381

14142

14145

13570

5934 6742

5972 6631

6100 6702

6221 6924

6195 6937

6084 6787

7246

7094

7136

7411

7405

7258

Admissions with first record with hip fracture diagnosis in ten years Of which: - persons aged 50 or more

5990

5853

5932

6083

6141

6000

5551

5413

5543

5644

5667

5564

Records with hip fracture diagnosis in the Finnish Health Care Register The number of patients with hip fracture diagnosis The number of patients with hip fracture discharge

follow-up care on a general ward or in a specialist rehabilitation unit. In addition, a patient may be readmitted for the same fracture after initial discharge to home or to a long-term facility. In order to prevent multiple counting of cases, an individual-based record linkage is recommended for the detection of multiple hospitalizations for the same fracture [10]. This is not a problem with Finnish data, and the previous Finnish studies have identified all hospital discharge records with hip fracture diagnosis and then used calendar year boundaries to exclude multiple hospitalizations of each patient. The corresponding annual number of patients with hip fracture discharge was on average 7605 per year between 1998 and 2002 (Table 1). Unfortunately, the use of calendar year boundaries makes such an exclusion approach artificial and has no epidemiological justification, because it is clear that calendar year boundaries result in the fact that the related hip fracture free clearance periods (time from the beginning of the year to a first fracture hospitalization of the year) vary per patient. This has two serious consequences. First, it is likely that many patients having their fracture during the final months of each year are erroneously counted as separate cases for two years, because it takes at least four months until the maximum restoration in terms of residential status (hospitalizations) is reached at the population level [21]. Second, one patient may have more than one hip fracture per year resulting in some undercounting. These drawbacks are an indication of need for a more appropriate estimate of the annual number of hip fracture events. It is reasonable to hypothesize that virtually all recorded hospitalizations with a hip fracture diagnosis combined with hip fracture operation represent a new hip fracture,

and that the date of an acute admission for surgery can be used as an estimator for the actual occurrence date of the hip fracture. The mean number of such operations was 6084 per year in Finland during 1998–2002 (Table 1). Unfortunately, this simple definition underestimates the true number of hip fractures. Even though a small number of (incorrectly recorded) re-operations may be erroneously included, the definition excludes patients treated conservatively (non-operatively) as well as patients who die before the operation. It is also known that the recording of hip fracture operations in The Finnish Health Care Register has not been totally complete [18,22]. This means that even some hip fracture related admissions without recorded operations should be considered indications of fresh hip fracture. The problem is to define which admissions reflect fresh hip fractures, and an easy solution is to use a constant hip fracture free clearance period to sort out new admissions from readmissions. If the new admission is defined as a hip fracture related record with no other hip fracture admissions for that individual in the previous two years, the mean number of hip fractures is 6787 per year (Table 1). As the use of long clearance periods may exclude some true subsequent fractures, the numbers of hip fractures were calculated also with a twomonth hip fracture free clearance period. With this criterion, the mean number of hip fractures was 7258 per year (Table 1). In principle, further diagnosis-based exclusion rules could be utilized in order to exclude certain nonstandard cases, such as pathological hip fractures, arthrosis related fractures, and cases with multiple fractures or orthopedic aftercare. However, variations in coding practices partially invalidate the use of such data abstraction rules, which assume recording of secondary diagnoses. In practice, even the clinical criterion of hip fracture may have some

Page 3 of 11 (page number not for citation purposes)

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

Epidemiologic Perspectives & Innovations 2007, 4:2

regional variation which makes it impossible to obtain a unique estimate for the number of hip fractures using register data. In any case, following the reasoning described above, it is likely that the mean number of hip fractures is somewhere between 6800–7200 per year in Finland between 1998 and 2002. The estimate for the number of hip fractures in 1998 is 6742–7246. Even the upper limit is significantly smaller than in the earlier Finnish study [20], where the annual number of hip fractures in 1998 was reported to be 7698. The reasons for the small difference in relation to the number of hip fracture discharges in 1998 (Table 1) are not known, but are most probably due to difficulties in reconstruction of the exact operationalization of some inclusion and exclusion criteria used in the earlier study. Also some overestimation is likely to occur because of the artificial definition of fresh hip fracture, as was described above. In conclusion, the earlier study reports a number that seems to be about 10% too large (corresponding to about 700 hip fractures). Alternative theoretical approaches As was demonstrated above, it is difficult to derive adequate estimates for the number of hip fractures even with the careful use of register data. The main problem is that the secondary data are not completely compatible with the theoretical objective of calculating the number of fresh hip fracture events. In other words, it would be beneficial if the theoretical objectives could be modified so that they are in agreement with the limits of the available register data. Fortunately, certain methodological ideas originating from health services and aging research are also useful in epidemiological applications. Care episode approach The definition for an incident case is a common problem in epidemiologic research. At the conceptual level this means that all events related to the same underlying disorder should be recognized. This is the fundamental idea in the care episode approach [23]. The care episode approach is widely applied in health economics and in health services research. The key point is that the care episode approach offers a sound methodological framework for dealing with multiple hospitalizations in terms of hospital discharge data, and it also provides a sound basis for the measurement of incidence, because an appropriate episode measure combining related hospitalizations is less subject to over- or undercounting of cases [24].

Unfortunately, a sound methodological framework does not automatically solve empirical difficulties [25]. One of the biggest practical problems with linked register data is to determine the starting point of the care episode. In principle, the starting point of a care episode for hip frac-

http://www.epi-perspectives.com/content/4/1/2

ture should be rather easy to determine, because the actual event of fracture (or hospitalization following that event) is an obvious index point. However, in practice it is challenging to make a distinction between true subsequent hip fractures and hospitalizations due to ongoing treatment episodes or reoperations in terms of data. In standard practice, it is common to use the first health problemrelated event available in the data as an index point. Another widely used approach is to use certain clearance periods to determine the appropriate index points. A third technique is to use external complementary data in determining (the number of) incident cases [26]. Other approaches seem to be rare [27]. Intuitively, it seems to be a good idea to determine an individual's first hip fracture occurrence and consider it as a starting point for a (chronic) care episode. In terms of hospital discharge data this corresponds to the detection of an individual's first hospitalization with recorded hip fracture diagnosis. The problem is that the hip fracture itself is not a chronic condition, but a remediable health deficiency. Therefore, an alternative theoretical interpretation for the first hip fracture is needed. Gompertzian interpretation of aging-related hip fractures One particularly interesting fact is that hip fracture incidence increases exponentially with age [5]. This kind of functional dependency (Gompertz law) has very strong interpretations from the point of view of population aging [28-30]. In this sense, perspectives from aging research may strengthen the traditional epidemiological interpretations (and vice versa) [31,32]. Biologically, aging can be seen as a complex process occurring stochastically in organs and tissues after reaching adulthood (and mainly after reproductive maturity), which results in irreversible damage accumulation and vulnerability to the failures in maintaining the integrity of tissues and organs [33,34]. A particularly fruitful approach is to consider aging-related events of interest as failures in components of a biological system [35]. An exponential increase of incidence with age means that the intensity of failures is constant (as the probability of failure is constant in time, the cumulative probability of failure is exponential). Therefore, it is not surprising that most aging-related (Gompertzian) conditions are closely connected to (cumulative) alterations in certain tissues or organs.

However, hip fracture is not just an aging-related failure in some single tissue or organ, but is typically the result of an accidental event (such as a fall). In other words, cumulative damage in some tissues (such as bone and muscle) increases the probability of (accidental or pathological) fracture, but it is also likely that some aging-related conditions increase the risk of accident. In this sense, hip fracture seems to capture effects of non-fatal failures in several

Page 4 of 11 (page number not for citation purposes)

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

Epidemiologic Perspectives & Innovations 2007, 4:2

components of the human body. Among the Gomperzian conditions, only death seems to have a similar multidimensional interpretation as it captures fatal failure(s) in any vital components of human body. In other words, an interesting (methodological) analogy can be seen between death and hip fracture. It is also obvious that serious but non-fatal failures in components of the human body result in some kind of disabilities. Such disabilities — typically measured in terms of reduced functional capacity and coping with physical activities of daily living — are important risk factors for hip fracture [36,37]. In other words, it is reasonable to hypothesize that the prevalence of such risk factors in the underlying population is related to the incidence of (aging dependent) hip fractures. Following this interpretation, it becomes clear that the occurrence of first agingdependent hip fracture gives an approximate upper limit for the time of development of critical hip fracture risk factors, because the risk factors must have exceeded the critical level before the event of (low-energetic aging dependent) hip fracture. From this it follows that the first aging-related hip fracture can be seen as an indication of a chronic (disability) condition, and any subsequent hip fractures represent just a nuisance for related population level interpretations. In conclusion, there are theoretical justifications for the determination of the first agingrelated hip fracture occurrences, which are in concordance with the restrictions of available hospital discharge data. This interpretation is also of particular importance in a theoretical sense, because it suggests that a commonly available time series of hip fracture incidence may also reflect the more general disability trends of the population, which is a testable hypothesis. Calculating the number of hip fractures So far, the methodological problem of the definition of the hip fracture episode has been reduced to a detection of first aging-related hip fractures in the population (starting points of the first hip fracture care episodes). This is not necessarily a straightforward task in practice, because the available data allow only limited backward follow-up time and it remains unknown whether the first hip fracture found in the data really is the first hip fracture of the person. Probabilistic methods can be used to correct the number of observed persons to the number of persons with the first appearance of a chronic condition [27].

However, if there is a need to identify the persons with a first hip fracture in terms of care episode (instead of just calculating the number of persons with a first hip fracture), it is inconvenient that the actual backward followup times (definitions of first hip fractures) vary between persons. In addition, the subsequent hip fracture of a person is biasing interpretations only as far as the risk of sub-

http://www.epi-perspectives.com/content/4/1/2

sequent hip fracture is significantly higher than the risk of first aging-related hip fracture. In other words, there may be a cut-off point after which the probability of having a new hip fracture is reduced to the level it would be even without preceding hip fracture, i.e. preceding hip fracture is an "uninformative" predictor of the new hip fracture. By examining the observed and expected probabilities of preceding hip fractures it becomes possible to give justification for a suitable clearance period to be used. Importantly, this probabilistic argument for the definition of the starting points of the care episodes also generalizes easily to other disorders. Probabilistic determination of the starting point of care episode In order to determine the cut-off point for a clearance period, the first admissions (index points) with the (principal or secondary) diagnosis of hip fracture in 1998– 2002 were identified for each patient. To ensure sufficient (backward) follow-up time for each patient, all available discharge records of the hip fracture population from 1987–2002 were obtained from the registers using the unique personal identification numbers as linkage keys. For each index point, backward time to the previous hip fracture admission of the same patient was calculated. Time was measured in months. There were data from the years 1987–2002, so the minimum (backward) follow-up time was 11 years.

The first task was to calculate the expected probabilities of earlier hip fractures in the hip fracture population of interest. The problem here is that reasonable estimates for the incidence of first hip fractures are needed for the calculation. In this study, it was hypothesized that clearance periods between one and ten years may result in a reasonable definition of first hip fracture, and the mean age- and sexgroup specific incidences between 1998 and 2002 were calculated by using one year and ten year clearance periods. In order to calculate the actual expected probabilities, the logarithms of the mean hip fracture incidence rates for different age-groups were used in estimating the exponential trend in age-group specific rates by using (log-)linear regression analysis for both sexes separately [38]. This functional relationship allows the interpolation of an incidence rate for persons who are over 40 years old [29]. The individuals with an observed hip fracture were then followed (backwards) in time, with age correspondingly corrected at each time point for all persons in the risk population. With a known age and sex distribution of the risk population, it was possible to predict the expected probability of hip fracture by using the estimated log-linear relationship between age and hip fracture incidence. For ages below 40 years, a constant incidence rate was used in the prediction. These probabilities were then summed across the risk population resulting in an expected number of hip fractures that was finally divided by the size of the risk

Page 5 of 11 (page number not for citation purposes)

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

Epidemiologic Perspectives & Innovations 2007, 4:2

population, giving an expected (conditional) probability of hip fracture (for each time point). The expected probabilities are shown in Figure 1 using dotted lines. As can be seen, the difference between expected probabilities based on one- and ten-year clearance periods is quite small. In order to calculate the observed probabilities, a hazard function giving the (conditional) probabilities of having a preceding hip fracture admission as a function of backward time (i.e. time was measured from index admission to a preceding hip fracture admission) in months was estimated nonparametrically using a product-limit estimator, and smoothed using a polynomial moving average. As the estimated hazard function in Figure 1 shows there was an increased and non-stabilized (non-constant) risk for an admission involving hip fracture occurring about 120 months after the previous admission related to hip fracture and the observed risk remains higher than expected in the ten-year period. If a more formal way is needed to determine where the lines meet, it is straightforward to calculate confidence intervals for the hazard function. In conclusion, it seems that a clearance period between seven and ten years is needed until the risk is reduced to the same level as that without a preceding hip fracture. In this study, a conservative ten-year clearance period was selected. The ten-year criterion has also been used in other studies [39].

http://www.epi-perspectives.com/content/4/1/2

Estimating the number of new hip fractures with limited data It is unfortunate that no shorter than a ten-year criterion seems to be suitable for hip fractures, because in most countries there may not be the required data available for such a long backward follow-up period. However, if an estimate for the overall number of first-ever hip fractures is enough, and there is no need to identify which patients actually had their first hip fracture, data with limited backward follow-up can also be used.

The numbers of patients without a preceding hip fracture as a function of backward time in the logarithmic scale for 1998 are drawn in Figure 2. As can be seen, there is almost a linear relationship between logarithmic time and the number of patients without a preceding hip fracture. In other words, even data with quite a short backward follow-up time allow the estimation of such a linear trend resulting in a reasonable prediction for the number of hip fracture patients at the ten year cut-off point (or at any other desirable cut-off point) for a clearance period. For example, if only one year data for backward follow-up are available, the idea is to calculate time from index admission to previous fracture or to the beginning of (backward) follow-up. Then for each day (preceding the index admission) the number of patients with a longer time distance for the previous fracture is calculated. Finally, a logarithmic transformation is applied to the day variable, and a linear regression model (where the number of fractures is a response and constant and logarithmic day are explanatory variables) is estimated. Predictions from this linear model can be easily extrapolated to any cut-off point of interest. Risk factor extraction Using Finnish register data it is feasible to identify hip fracture patients who have their first aging-related hip fracture in the sense described above. This makes it possible to determine the status of certain hip fracture risk factors available in the data — such as age, sex, institutionalization, urbanity, season and year [40-45] — in relation to these patients. Methodologically, three different risk factor extraction techniques can be separated: internal, external, and empirical.

2002 1 time Probability backward Figure of having in months a preceding from the hipfirst fracture fracture as ainfunction 1998– of Probability of having a preceding hip fracture as a function of backward time in months from the first fracture in 1998– 2002. The smaller picture is a tenfold magnification of the final months. Dotted curves represent the expected probabilities of having a hip fracture (upper curve is calculated with a one-year clearance period and the lower curve with a tenyear clearance period, see text for more information).

Internal extraction Internal extraction corresponds to the use of data abstraction rules within the register data. For example, using the Finnish data it is not enough to determine the index hip fracture admission, but the actual day of hip fracture must be inferred using appropriate algorithms. It is obvious that the care episode related to hip fracture starts from the first contact with the health care system after the actual event of fracture. However, the index admission could be for a long stay in residential care, while the diagnosis referred to an accident that happened near the discharge

Page 6 of 11 (page number not for citation purposes)

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

Epidemiologic Perspectives & Innovations 2007, 4:2

Figure Number tion of backward 2of patients time without in 1998 preceding in Finlandhip fracture as a funcNumber of patients without preceding hip fracture as a function of backward time in 1998 in Finland.

day, or the index admission could be for a surgical period, while the actual accident had happened before the admission. After the detection of initial surgical admission, more abstraction rules can be developed. For example, the most accurate diagnoses can be extracted from the data corresponding to the surgical treatments following the fractures, since those are the first ones based on x-ray and physical examinations. For the purposes of this study, a person was classified as an institutionalized long-term care patient if he or she was admitted to the surgical ward from some institution providing inpatient care, and if he or she also had a recorded administrative decision for long-term care or had received inpatient care for more than six months during the year preceding the fracture.

http://www.epi-perspectives.com/content/4/1/2

Empirical extraction In empirical extraction, preliminary analyses of the available data are used to justify the definitions of variables and data abstraction rules required in the internal extraction. For example, it is not obvious what kind of definition for seasons should be used. This problem can be solved when using the Finnish register data, which allow accurate calculation of daily numbers of hip fractures. After smoothing out the random variation in absolute numbers by a moving-average technique, the mean of daily numbers of the new hip fractures was 15.2 for persons aged 50 or older (Figure 3). There was small but clear seasonal variation so that 53.5 per cent of fractures occurred during the winter/spring season (from November to April) compared to 46.5% during the summer/fall season (from May to October). Data also revealed that there had been some very "hazardous" days during the winter season, but seasonal variation was almost completely due to non-institutionalized persons (Figure 3). Using these results (of preliminary analyses) in the definition of seasons is an example of empirical extraction. In fact, the method developed above for the definition of first aging-related hip fracture is another example of empirical extraction. Risk population data For incidence calculations data on risk population are also needed. Typically, the official population figures are available in administrative databases with stratification according to age, sex and area of living. In this study, population figures (taken on the last day of the years 1997– 2002) in 5-year groups were obtained for each municipality (local administrative unit in Finland) from the Social and Health Service Statistical Database (SOTKA). Municipality works as an aggregate unit in external extraction, and also allows easy determination of population figures for any combination of these basic units.

External extraction In external extraction, variables that link individuals to aggregate levels are first internally extracted and then external data that describe aggregate units are linked to each individual. For example, for the purposes of this study, the area code (municipality) at the index admission was used to classify each patient as rural or urban (including semi-urban areas) using the official grouping defined by Statistics Finland (rural municipalities are those municipalities in which less than 60 per cent of the population lives in urban settlements). In general, any other patient characteristic attributable to area-specific phenomena could also be used here instead of urbanity.

Figure Daily numbers 3 of hip fractures in Finland 1998–2002 Daily numbers of hip fractures in Finland 1998–2002.

Page 7 of 11 (page number not for citation purposes)

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

Epidemiologic Perspectives & Innovations 2007, 4:2

Risk population for internally extracted risk factors It is more difficult to determine risk population for internally extracted risk factors. In this study, long-term institutional care was one risk factor of interest. Fortunately, data from the Finnish Health Care Register can be used to calculate the total numbers of clients in long-term institutional care on the last day of the years 1997–2002, since the register includes all inpatient hospital and nursing home care in Finland. This individual level data can be easily aggregated to appropriate groups (such as stratification by age, sex and municipality), and further subdivision of population figures according to long-term institutional care is possible. However, during the research process it turned out that the calculation technique used in official statistics concerning institutional care was inappropriate for the purposes of epidemiological studies. Therefore, the significantly downwards biased numbers were corrected using a technique reported elsewhere [46].

This procedure finally resulted in simultaneously observed risk populations on the final day of the years 1997–2002 with stratification by sex, age, urbanity, and institutionalization. The derivation of such exceptionally detailed nationwide population figures was possible because of common aggregate units (municipalities) in the databases and registers. Trends for risk populations Observed risk populations on the final day of the years of interest are not good approximations for the size of the risk population (or the follow-up time for risk population). A common method is to interpolate the mean population by using observed census data on two consecutive years (approximating also the follow-up time of the risk population, if measured in person years). This method can be generalized by modeling the trends extractable from observed risk populations. In this study, regression models with the constant, year and square(year) as regressors were fitted for each group with a stratification by sex, age, institutionalization, and urbanity (with age groups 50 to 64, 65 to 74, 75 to 84 and 85+). These models were then used in the approximation of the required risk populations varying by year and–importantly–by (empirically extracted) seasons, too. Finally, there were the observed numbers of hip fracture events and the follow-up times (in person years) for 320 groups. Limitations of risk population data It is well known that an accurate number of the population at risk is an essential requirement for incidence calculations [47]. In this study it was possible to derive population-based denominators for each group of interest. These denominators are desirable in terms of accuracy, but not perfect, because an assumption of a stable

http://www.epi-perspectives.com/content/4/1/2

population (no short-term fluctuations in migration or mortality) is needed for interpolating census day population figures to appropriate mean follow-up times. In addition, as a ten-year clearance period was used in this study, a person is not at risk of a new hip fracture for the 10 years following a hip fracture and should be — in principle — left out of the risk population. However, this correction was not done in this study, because adequate (group-specific) hip fracture prevalence data were not available. The bias resulting from keeping the prevalent pool in the risk population was considered to be insignificant, as the number of hip fractures is very small in relation to the population in younger age-groups and mortality following the hip fracture is very high for the older age-groups. In fact, relative bias [1-(uncorrected incidence/corrected incidence)] of mean incidence seemed to be less than 2%. Moreover, even if some groupspecific bias exists, the direction is towards conservative estimates (underestimation rather than overestimation of incidences). In conclusion, there is still room for improvement in deriving risk population data. However, the problems are insignificant in terms of the "iceberg phenomenon" involved in the denominator problem [47], and there is no reason to believe that the data used in this study would not reflect the true epidemiology of hip fracture. Hip fracture incidence After data preparation, it is simple to examine the univariate effects of population-level risk factors using standard methodology. It is quite straightforward to extend such analyses for other stratifications of interest, such as area specific estimates (possibly requiring empirical-Bayes estimation) revealing regional variation in hip fracture incidence. Results of such basic analyses are reported elsewhere [48]. Only exemplary results on the simultaneous effects of risk factors with an interpretation offering a new epidemiological perspective for hip fracture are given here.

In standard practice, only a few risk factors are typically considered simultaneously in population level incidence studies. Risk factors of interest are included into the model, and estimates are interpreted as independent effects (effects adjusted for other risk factors in the model). The problem is that there may be complicated interactions between the risk factors, which distort straightforward interpretations. In this study, the Poisson-regression model was used in the simultaneous examination of several risk factors. Other analyses have indicated that the (long-lasting) increase in age-adjusted incidence had recently stabilized

Page 8 of 11 (page number not for citation purposes)

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

Epidemiologic Perspectives & Innovations 2007, 4:2

in Finland, with the age-adjusted incidence almost constant between 1999 and 2002 [48]. As the interest in this study was not in secular trends of incidence but in the effects of risk factors, the final Poisson regression model was estimated using the combined data from 1999 to 2002. All main effects (sex, age, institutionalization, urbanity, and season) and statistically significant interactions up to the third degree were included in the model. The goodness of fit was acceptable in terms of Pearson chisquare statistic. Because there were a lot of significant interactions between risk factors, it was rather difficult to report all relevant estimates in easily interpretable form. Therefore, the group-specific rates adjusted for all other factors were calculated by anti-logging the corresponding linear predictors (without the offset) based on the estimated model. These adjusted rates represent the systematic effects extracted from the data by the model. Overlapping confidence intervals of two groups indicate that there is no statistically significant difference between rates in these groups. Alternative epidemiological perspective for hip fracture incidence The results in Table 2 show that there is a higher hip fracture incidence among older persons and women than among younger persons and men managing at home. However, there is no significant effect of sex on incidence among institutionalized persons. The incidence is higher during the winter time for all persons with a functional status potentially good enough to allow for walking outdoors (non-institutionalized under age 85). Urbanization is not associated with significantly higher hip fracture incidence in Finland. As it is known that institutionalization is a sign of reduced coping with daily activities [49] and that women tend to have more functional limitations and physical disability than men as age increases [50], the results are in concordance with the hypothesis suggesting that hip fracture incidence is proportional to the prevalence of disability related hip fracture risk factors in the underlying population.

In order to test this hypothesis, a correlation was calculated between the nationwide data on the prevalence of outdoor walking ability — being a strong disabilityrelated risk factor for hip fracture [51] — of persons aged 65 to 84 and the hip fracture incidences for the corresponding age and sex groups for each year between 1998– 2002. Data on walking ability status for the population was extracted from consecutive nationwide surveys on health behavior among older people conducted by the National Public Health Institute [52]. Linear regression analysis showed that the prevalence of individuals unable to walk outdoors explained 97.5% of the variation in the hip fracture incidence. In conclusion, the hypothesized association between hip fracture incidence and prevalence

http://www.epi-perspectives.com/content/4/1/2

of (disability-related) hip fracture risk factors was not falsified, and deserves further examination.

Conclusion

This study evaluated the feasibility of producing information for the purposes of monitoring hip fracture incidence based on Finnish administrative register data. This type of information production is not a simple task, but requires creative use of rather complicated methodology. It was shown that straightforward use of register data results in biased estimates of the numbers of hip fractures, and that even the sophisticated use of data is dependent on the more or less ambiguous definitions of actual hip fractures. In this study, the definition of hip fracture was linked to population aging. This reasoning resulted in the suggestion that hip fracture-related disability has certain interesting methodological similarities to the event of death. Moreover, an example from the results empirically supported this interpretation by demonstrating that hip fracture incidence is associated with a certain type of agingrelated disability. Actually it seems that hip fracture incidence is directly proportional to the prevalence of population level disability. In this sense, hip fracture incidence trends also reveal the more general disability trends of the population. Methodologically, while using register-based data, the determination of first aging-related hip fractures is easier than the determination of all hip fractures. Moreover, the detection of the first aging-related hip fracture event closely resembles the determination of the starting point of the hip fracture care episode. A method was developed based on an observed (and expected) hazard function, that allows probabilistic justification for the starting point of the care episode. This method is applicable in the general care episode approach for a wide variety of health problems regardless of the intention to use care episodes in incidence calculations. Furthermore, three different techniques for risk factor (and corresponding risk population) extraction were demonstrated. Finnish administrative data makes it possible to derive data for a rather detailed population level risk factor stratification, but such a feature has certain more or less data-specific limitations which can be partly avoided by developing methodological solutions to the encountered problems. One area requiring further work that is beyond the scope of this study is the more careful estimation of hip fracture prevalence in Finland. Finally, this study tries to demonstrate that the traditional methodological paradigm with an assumption of theorydriven data collection and fixed methods has limited suitability for data analyses that utilize secondary data. Two

Page 9 of 11 (page number not for citation purposes)

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

Epidemiologic Perspectives & Innovations 2007, 4:2

http://www.epi-perspectives.com/content/4/1/2

Table 2: Adjusted incidence rates per 100000 person years for hip fracture in Finland 1999–2002

Urban

Rural

Winter

Summer

Winter

Summer

Rate

95% CI

Rate

95% CI

Rate

95% CI

Rate

95% CI

Non-institutionalized persons Men 50–64 Women 50–64 Men 65–74 Women 65–74 Men 75–84 Women 75–84 Men 85+ Women 85+

62 45 191 223 608 966 1797 2805

57, 67 41, 49 178, 205 211, 237 574, 644 932, 1002 1668, 1936 2693, 2922

49 35 158 185 503 799 1622 2531

45, 53 32, 39 147, 170 174, 197 474, 533 769, 830 1504, 1748 2426, 2640

46 46 176 234 561 925 1858 2567

40, 53 39, 53 159, 195 214, 254 516, 609 878, 974 1680, 2054 2423, 2720

36 36 145 193 464 765 1676 2316

31, 42 31, 42 131, 161 177, 211 427, 504 725, 807 1515, 1854 2184, 2457

Institutionalized persons Men 50–64 Women 50–64 Men 65–74 Women 65–74 Men 75–84 Women 75–84 Men 85+ Women 85+

1380 1214 1688 2018 2106 2495 2330 2529

1074, 1773 908, 1622 1437, 1983 1775, 2294 1883, 2355 2332, 2670 2066, 2628 2376, 2692

1277 1123 1646 1967 2053 2432 2480 2691

993, 1643 840, 1503 1400, 1934 1730, 2237 1836, 2296 2272, 2604 2200, 2795 2531, 2861

916 1096 1387 1884 1735 2134 2152 2068

689, 1218 795, 1511 1149, 1674 1616, 2196 1511, 1992 1946, 2339 1861, 2489 1894, 2257

848 1014 1352 1836 1691 2080 2290 2200

638, 1128 735, 1400 1120, 1633 1575, 2142 1473, 1942 1897, 2282 1981, 2647 2017, 2400

main ideas should be noted. First is the empirical justification of theory-driven operationalized definitions, and second is the theoretical model that is compatible with theory-driven as well as data-driven approaches. For example, in this study register data gave only uncertain estimates for the number of all hip fractures, so the definition and theory underlying the hip fractures had to be revised. After theory revision, empirical justification for the extraction of risk factors was utilized, which finally gave results that supported the revised theory. In fact, it may be difficult to make a clear distinction between definitions, preliminary analyses justifying these definitions, and actual results of the study. In conclusion, it seems that the more detailed (secondary) the available data is, the more alternative perspectives are needed in the analyses of such data and in the reporting of the results of the analyses.

References 1. 2. 3. 4.

5. 6. 7. 8. 9.

Competing interests

The author(s) declare that they have no competing interests.

Authors' contributions

10. 11.

RS carried out the whole research process. 12.

Acknowledgements This work was financially supported by the research grant from the Academy of Finland. The funding agreement ensured the author's independence in designing the study, interpreting the data, writing, and publishing the report.

13.

Bright RA, Avorn J, Everitt DE: Medicaid data as a resource for epidemiologic studies: strengths and limitations. J Clin Epidemiol 1989, 42:937-945. Baron JA, Weiderpass E: An introduction to epidemiological research with medical databases. Ann Epidemiol 2000, 10:200-204. Sorensen HT, Sabroe S, Olsen J: A framework for evaluation of secondary data sources for epidemiological research. Int J Epidemiol 1996, 25:435-442. Sund R: Utilisation of administrative registers using scientific knowledge discovery. Intelligent Data Analysis 2003, 7:501-519 [http://iospress.metapress.com/openurl.asp?genre=arti cle&issn=1088-467X&volume=7&issue=6&spage=501]. Melton LJ 3rd: Epidemiology of hip fractures: implications of the exponential increase with age. Bone 1996, 18:121S-125S. Cummings SR, Melton LJ: Epidemiology and outcomes of osteoporotic fractures. Lancet 2002, 359:1761-1767. Langley J, Stephenson S, Cryer C, Borman B: Traps for the unwary in estimating person based injury incidence using hospital discharge data. Inj Prev 2002, 8:332-337. Boufous S, Williamson A: Reporting of the incidence of hospitalised injuries: numerator issues. Inj Prev 2003, 9:370-375. Smith GS, Langlois JA, Buechner JS: Methodological issues in using hospital discharge data to determine the incidence of hospitalized injuries. Am J Epidemiol 1991, 134:1146-1158. Brophy S, John G, Evans E, Lyons RA: Methodological issues in the identification of hip fractures using routine hospital data: a database study. Osteoporos Int 2006, 17:405-409. Gissler M, Haukka J: Finnish health and social welfare registers in epidemiological research. Norsk Epidemiologi 2004, 14:113-120 [http://www.ub.ntnu.no/journals/norepid/2004-1/2004(1)_21Gissler.pdf]. Keskimäki I, Aro S: Accuracy of data on diagnoses, procedures and accidents in the Finnish Hospital Discharge Register. Inj J Health Sci 1991, 2:15-21. Leppälä JM, Virtamo J, Heinonen OP: Validation of stroke diagnosis in the National Hospital Discharge Register and the Register of Causes of Death in Finland. Eur J Epidemiol 1999, 15:155-160.

Page 10 of 11 (page number not for citation purposes)

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

Epidemiologic Perspectives & Innovations 2007, 4:2

14. 15. 16.

17. 18.

19. 20.

21. 22. 23. 24. 25. 26. 27. 28. 29. 30. 31. 32. 33. 34. 35. 36. 37.

38.

Lüthje P, Nurmi I, Kataja M, Heliövaara M, Santavirta S: Incidence of pelvic fractures in Finland in 1988. Acta Orthop Scand 1995, 66:245-248. Makikyro T, Isohanni M, Moring J, Hakko H, Hovatta I, Lonnqvist J: Accuracy of register-based schizophrenia diagnoses in a genetic study. Eur Psychiatry 1998, 13:57-62. Pajunen P, Koukkunen H, Ketonen M, Jerkkola T, Immonen-Raiha P, Karja-Koskenkari P, Mahonen M, Niemela M, Kuulasmaa K, Palomaki P, Mustonen J, Lehtonen A, Arstila M, Vuorenmaa T, Lehto S, Miettinen H, Torppa J, Tuomilehto J, Kesaniemi YA, Pyorala K, Salomaa V: The validity of the Finnish Hospital Discharge Register and Causes of Death Register data on coronary heart disease. Eur J Cardiovasc Prev Rehabil 2005, 12:132-137. Poikolainen K: Accuracy of hospital discharge data: five alcohol-related diseases. Drug Alcohol Depend 1983, 12:315-322. Sund R, Nurmi-Lüthje I, Lüthje P, Tanninen S, Narinen A, Keskimäki I: Comparing properties of audit data and routinely collected register data in case of performance assessment of hip fracture treatment in Finland. Methods Inf Med 2007, 46:(in press). Kannus P, Niemi S, Parkkari J, Palvanen M, Vuori I, Järvinen M: Hip fractures in Finland between 1970 and 1997 and predictions for the future. Lancet 1999, 353:802-805. Lüthje P, Nurmi I, Palvanen M, Kannus P: Epidemiology and prevention possibilities of proximal femur fracture [in Finnish]. Suomen Lääkärilehti 2001, 56:1615-1619 [http://www.fimnet.fi/cl/ laakarilehti/pdf/2001/SLL142001-1615.pdf]. Heikkinen T, Jalovaara P: Four or twelve months' follow-up in the evaluation of functional outcome after hip fracture surgery? Scand J Surg 2005, 94:59-66. Sund R, Liski A: Quality effects of operative delay on mortality in hip fracture treatment. Qual Saf Health Care 2005, 14:371-377. Solon JA, Feeney JJ, Jones SH, Rigg RD, Sheps CG: Delineating episodes of medical care. Am J Public Health Nations Health 1967, 57:401-408. Hornbrook MC, Hurtado AV, Johnson RE: Health care episodes: definition, measurement and use. Med Care Rev 1985, 42:163-218. Rosen AK, Mayer-Oakes A: Episodes of care: theoretical frameworks versus current operational realities. Jt Comm J Qual Improv 1999, 25:111-128. Kegler SR: Reporting incidence from a surveillance system with an operational case definition of unknown predictive value positive. Epidemiol Perspect Innov 2005, 2:7. Brameld KJ, Holman CD, Lawrence DM, Hobbs MS: Improved methods for estimating incidence from linked hospital morbidity data. Int J Epidemiol 2003, 32:617-624. Brody JA, Schneider EL: Diseases and disorders of aging: an hypothesis. J Chronic Dis 1986, 39:871-876. Brody JA, Grant MD: Age-associated diseases and conditions: implications for decreasing late life morbidity. Aging (Milano) 2001, 13:64-67. Holliday R: The close relationship between biological aging and age-associated pathologies in humans (with discussion). J Gerontol A Biol Sci Med Sci 2004, 59:B543-53. Khaw KT: Epidemiological aspects of ageing. Philos Trans R Soc Lond B Biol Sci 1997, 352:1829-1835. Brenner H, Arndt V: Epidemiology in aging research. Exp Gerontol 2004, 39:679-686. Yin D, Chen K: The essential mechanisms of aging: Irreparable damage accumulation of biochemical side-reactions. Exp Gerontol 2005, 40:455-465. Holliday R: The multiple and irreversible causes of aging. J Gerontol A Biol Sci Med Sci 2004, 59:B568-72. Gavrilov LA, Gavrilova NS: The reliability theory of aging and longevity. J Theor Biol 2001, 213:527-545. Walter LC, Lui LY, Eng C, Covinsky KE: Risk of hip fracture in disabled community-living older adults. J Am Geriatr Soc 2003, 51:50-55. Stel VS, Pluijm SM, Deeg DJ, Smit JH, Bouter LM, Lips P: Functional limitations and poor physical performance as independent risk factors for self-reported fractures in older persons. Osteoporos Int 2004, 15:742-750. Kanis JA, Johnell O, De Laet C, Jonsson B, Oden A, Ogelsby AK: International variations in hip fracture probabilities: implications for risk assessment. J Bone Miner Res 2002, 17:1237-1244.

http://www.epi-perspectives.com/content/4/1/2

39.

40.

41.

42.

43.

44. 45. 46. 47.

48. 49. 50. 51. 52.

Giversen IM: Time trends of age-adjusted incidence rates of first hip fractures: a register-based study among older people in Viborg County, Denmark, 1987-1997. Osteoporos Int 2006, 17:552-564. Brennan nee Saunders J, Johansen A, Butler J, Stone M, Richmond P, Jones S, Lyons RA: Place of residence and risk of fracture in older people: a population-based study of over 65-year-olds in Cardiff. Osteoporos Int 2003, 14:515-519. Sanders KM, Nicholson GC, Ugoni AM, Seeman E, Pasco JA, Kotowicz MA: Fracture rates lower in rural than urban communities: the Geelong Osteoporosis Study. J Epidemiol Community Health 2002, 56:466-470. Sugarman JR, Connell FA, Hansen A, Helgerson SD, Jessup MC, Lee H: Hip fracture incidence in nursing home residents and community-dwelling older people, Washington State, 19931995. J Am Geriatr Soc 2002, 50:1638-1643. Chevalley T, Herrmann FR, Delmi M, Stern R, Hoffmeyer P, Rapin CH, Rizzoli R: Evaluation of the age-adjusted incidence of hip fractures between urban and rural areas: the difference is not related to the prevalence of institutions for the elderly. Osteoporos Int 2002, 13:113-118. Crawford JR, Parker MJ: Seasonal variation of proximal femoral fractures in the United Kingdom. Injury 2003, 34:223-225. Bulajic-Kopjar M: Seasonal variations in incidence of fractures among elderly people. Inj Prev 2000, 6:16-19. Sund R, Kauppinen S: How to determine the number of older people in long-term care using register-based data? (in Finnish). Sos Laaketiet Aikak 2005, 42:137-144. Schlaud M, Brenner MH, Hoopmann M, Schwartz FW: Approaches to the denominator in practice-based epidemiology: a critical overview. J Epidemiol Community Health 1998, 52 Suppl 1:13S-19S. Sund R: Hip fracture incidence in Finland 1998-2002 (in Finnish). Duodecim 2006, 122:1085-1091 [http://www.terveysportti.fi/ xmedia/duo/duo95708.pdf]. Laukkanen P, Karppi P, Heikkinen E, Kauppinen M: Coping with activities of daily living in different care settings. Age Ageing 2001, 30:489-494. Murtagh KN, Hubert HB: Gender differences in physical disability among an elderly cohort. Am J Public Health 2004, 94:1406-1411. Meriläinen S, Nevalainen T, Luukinen H, Jalovaara P: Risk factors for cervical and trochanteric hip fracture during a fall on the hip. Scand J Prim Health Care 2002, 20:188-192. Sulander T, Helakorpi S, Nissinen A, Uutela A: Health behaviour among Finnish Elderly, Spring 2003, with trends 1993-2003. Helsinki, National Public Health Institute; 2004.

Publish with Bio Med Central and every scientist can read your work free of charge "BioMed Central will be the most significant development for disseminating the results of biomedical researc h in our lifetime." Sir Paul Nurse, Cancer Research UK

Your research papers will be: available free of charge to the entire biomedical community peer reviewed and published immediately upon acceptance cited in PubMed and archived on PubMed Central yours — you keep the copyright

BioMedcentral

Submit your manuscript here: http://www.biomedcentral.com/info/publishing_adv.asp

Page 11 of 11 (page number not for citation purposes)

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

IV

-"- / /

 Ê1- /ÊÓääx\Ê{ÓÊ£ÎÇq£{{

-LKB@DLJK8@;FK

+UINKALASKEAIKËËNTYNEIDENPITKËAIKAISASIAKKAIDEN MËËRIËREKISTERITIETOJENPERUSTEELLA 4ËSSËTUTKIMUKSESSAKUVATAANKAHDELLAERITAVALLALASKETUTIKËËNTYNEIDENPITKËAIKAISASIAKKAIDEN LUKUMËËRËTIKË JASUKUPUOLIRYHMITTËINVUOSILLEn4UTKIMUSOSOITTAA ETTËPELKËSTËËN YHDENVUODENASIAKASLASKENTATIEDOISTALASKEMALLAPITKËAIKAISASIAKKAIDENLUKUMËËRËONSELVËSTI PIENEMPIKUINSILLOIN KUNOTETAANHUOMIOONMYÚSMYÚHEMPIENASIAKASLASKENTOJENJAPËËTTYNEIDEN HOITOJAKSOJENTIEDOT+UMPIKAANTAPAEITËYSINVARMASTIOLETOISTAOIKEAMPIJARIIPPUUKIN KËYTTÚTARKOITUKSESTA KUMPAATAPAAKANNATTAAKËYTTËË REIJO SUND, SARI KAUPPINEN

*/($!.4/

«ˆ`i“ˆœœ}ˆÃˆÃÃ>ÊÌÕ̎ˆ“ՎÈÃÃ>ʜ˜ÊÕÃiˆ˜ÊÌ>À‡ «iiˆÃÌ>ÊÃՅÌiÕÌÌ>>ÊÌ>«>…ÌՓˆi˜ÊÕŽÕ“BBÀˆBʘˆˆ‡ ÌBÊÛ>ÃÌ>>ۈˆ˜ÊÀˆÃŽˆÛBiÃ̟ˆ…ˆ˜°Ê Àˆ>ˆÃˆÃÌ>Ê̈>Ã̜̈i‡ ̜Ž>˜˜œˆÃÌ>ʏŸÞÌÞÞʅi«œÃ̈Ê-՜“i˜ÊÛBiÃ̟À>Ži˜‡ ˜iÌÌ>ʎÕÛ>>ۈ>Ê̈i̜>ʈŽB‡Ê>ÊÃՎի՜ˆÀޅ“ˆÌ‡ ÌBˆ˜°ÊŽBB˜ÌޘiˆÌBʎœÃŽiۈÃÃ>ÊÌ>ÀŽ>ÃÌiÕˆÃÃ>ʜ˜Ê ŽÕˆÌi˜Žˆ˜ÊÕÃiˆ˜ÊÌ>ÀŽœˆÌՎÃi˜“ÕŽ>ˆÃÌ>ÊiÀœÌi>Ê ˆB˜Ê>ÊÃՎի՜i˜ÊˆÃBŽÃˆÊ…œˆ`œ˜ÊiÀˆÊ«œÀÌ>ˆiÊ ŽÕՏÕۈ>Ê>È>ŽŽ>ˆÌ>Ê̜ˆÃˆÃÌ>>˜°Ê ÀˆÌވÃi˜ÊÌBÀŽiB˜Ê Àޅ“B˜Ê“Õœ`œÃÌ>Û>ÌÊ«ˆÌŽB>ˆŽ>ˆÃiÃÃ>Êޓ«BÀˆÛ՜‡ ÀœŽ>ṎÃiÃÃ>ʅœˆ`œÃÃ>ʜiÛ>ÌʈŽBB˜ÌޘiiÌ°Ê/B“BÊ Àޅ“BʎÕÛ>ÃÌ>>Ê«>ÛiÕBÀiÃÌi“BÃÌBÊÀˆˆ««ÕÛ>ˆÃÌ>Ê »…Õœ˜œŽÕ˜ÌœˆÃÌ>»ÊœÕŽŽœ>]ʍœ˜Ž>ʎ>ˆŽŽˆi˜ÊB‡ Ãi˜Ìi˜Êiˆ˜Þ“«BÀˆÃ̟ʜ˜ÊÛ>ÀȘÊÃ>“>˜Ž>Ì>ˆ˜i˜°Ê *ˆÌŽB>ˆŽ>ˆÃÌ>Ê Þ“«BÀˆÛ՜ÀœŽ>ṎÃÌ>Ê …œˆÌœ>Ê Ã>>Û>ÌʈŽBB˜ÌޘiiÌÊ>È>ŽŽ>>ÌÊۜˆÛ>Ìʜ>ÊÛ>˜…>ˆ˜‡ Žœ`ˆÃÃ>]ÊÌiÀÛiÞ`i˜…Õœœ˜ÊÛ՜`iœÃ>Ã̜…œˆ`œÃÃ>Ê Ì>ˆÊÛ>˜…>ˆ˜ŽœÌˆ>ÊÛ>ÃÌ>>Û>˜ÊÌi…œÃÌiÌ՘ʫ>ÛiÕ‡ >ÃՓˆÃi˜Ê«ˆˆÀˆÃÃB°Ê Bˆ`i˜Ê«>ÛiÕi˜Ê>È>ŽŽ>ˆÃÌ>Ê Ã>>`>>˜Ê…œˆÌœ>ŽÃœŽœ…Ì>ˆÃÌ>Ê̈i̜>Ê-Ì>ŽiȘʅœˆ‡ ̜ˆ“œˆÌÕÃÀiŽˆÃÌiÀiˆÃÌBÊ­-Ì>ŽiÃÊÓääή°Êˆ˜iˆÃ̜œ˜Ê ŽiÀBÌBB˜Ê Û՜ÈÌÌ>ˆ˜Ê Ž>…`i˜Ê ÌÞÞ««ˆÃˆBÊ Ìˆi̜>\Ê Û՜`i˜Êœ«ÕÃÃ>ʎiÎi˜Êœiˆ`i˜Ê…œˆÌœ>ŽÃœi˜Ê ̈i`œÌÊ­˜ˆˆ˜ÊÃ>˜œÌÕÌÊ>È>Ž>Ï>Îi˜Ì>̈i`œÌ®ÊÃiŽBÊ «BBÌÌޘiˆ`i˜Ê…œˆÌœ>ŽÃœi˜Ê̈i`œÌ°ÊÀÞ«Ì>Ì՘Êގ‡ ȏŸˆ˜ÌˆÌ՘˜ˆÃÌii˜Ê>ÛՏ>ʜ˜Ê“>…`œˆÃÌ>Êޅ`ˆÃÌBBÊ Ž>ˆŽŽˆÊÃ>“>˜Ê>È>ŽŽ>>˜Ê…œˆÌœ>ŽÃœÌÊ̜ˆÃˆˆ˜Ã>°Ê ˆ˜iˆÃ̜˜Ê>ÛՏ>Êۜˆ`>>˜Ê“ÞŸÃʏ>Îi>Ê>È>Ž>Ç “BBÀBÌÊ«BˆÛBŽœ…Ì>ˆÃiÃ̈°

6ˆÀ>ˆÃˆˆ˜Ê̈>Ã̜ˆ…ˆ˜Ê­ŽÃ°Ê““°ÊŽBB˜Ìޘiˆ`i˜Ê ÜÈ>>ˆ‡Ê>ÊÌiÀÛiÞë>ÛiÕÌÊÓääӮʫˆÌŽB>ˆŽ>ˆÃ>È>Ž‡ Ž>>Ìʏ>ÎiÌ>>˜ÊÛ>ˆ˜ÊŽÞÃiiÃÃBʜiÛ>˜Ê̈>Ã̜Û՜‡ `i˜Ê>È>Ž>Ï>Îi˜Ì>̈i̜i˜Ê>ÛՏ>°Ê/BŸˆ˜Ê«ˆÌŽB‡ >ˆŽ>ˆÃ>È>ŽŽ>ˆŽÃˆÊ“BBÀˆÌiBB˜Ê˜iʅi˜ŽˆŸÌ]ʍœˆ>Ê œ˜Ê>È>Ž>Ï>Îi˜Ì>̈i̜i˜Ê«iÀÕÃÌii>Êۜˆ“>ÃÃ>Ê œiÛ>Ê«ˆÌŽB>ˆŽ>ˆÃ…œˆ`œ˜Ê«BB̟ÃÊÌ>ˆÊœÌŽ>Ê>È>Ž>Ç >Îi˜Ì>ˆ“œˆÌՎÃi˜Ê̈i̜i˜Ê“ÕŽ>>˜ÊœÛ>ÌÊȈ…i˜Ê “i˜˜iÃÃBʜiiÌÊޅÌB>ŽÃœˆÃiÃ̈ʅœˆ`œÃÃ>ʎÞÃiiÃÃBÊ œiÛ>ÃÃ>ʅœˆÌœ«>ˆŽ>ÃÃ>ÊޏˆÊ™äÊ«BˆÛBB° ՘ÊÃiÛˆÌiÌBB˜ÊÀˆÃŽˆÛBiÃ̟BÊi«ˆ`i“ˆœœ}ˆÃÌ>Ê ÌÕ̎ˆ“ÕÃÌ>ÊÛ>ÀÌi˜]ʜ˜ÊÌBÀŽiBBÊÃ>>`>ÊÌ>ۜˆÌiÌÌÕ>Ê ŽÞÃiiÃÃBÊ œiÛ>Ê ÛBiÃ̟Àޅ“BÊ “>…`œˆÃˆ““>˜Ê Ž>ÌÌ>Û>Ã̈°Ê Bˆ˜Êœi˜ÊÃ>>ÌÌ>>ʜ>ÊÌ>À«ii˜ÊŽBÞ̇ ÌBBʅÞÛBŽÃˆÊ“ÞŸÃʓޟ…i“«ˆi˜Ê>È>Ž>Ï>Îi˜Ìœ‡ i˜Ê>Ê«BBÌÌޘiˆ`i˜Ê…œˆÌœ>ŽÃœi˜Ê̈i̜>° /B“B˜ÊÌÕ̎ˆ“ÕŽÃi˜ÊÌ>ÀŽœˆÌՎÃi˜>ʜ˜ÊŽÕÛ>Ì>Ê ˆŽBB˜Ìޘiˆ`i˜Ê«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜ÊÕŽÕ“BB‡ ÀBÌʈŽB‡Ê>ÊÃՎի՜ˆÀޅ“ˆÌÌBˆ˜ÊÛ՜ȏiÊ£™™Çq ÓääÓÊÃiŽBÊÃiÛˆÌÌBB]ʓˆÌi˜Ê>ˆ˜œ>ÃÌ>>˜Êޅ`i˜ÊÛ՜‡ `i˜Ê>È>Ž>Ï>Îi˜Ì>̈i`œˆÃÌ>ʏ>ÎiÌÌÕÊ«ˆÌŽB>ˆ‡ Ž>ˆÃ>È>ŽŽ>ˆ`i˜Ê ÕŽÕ“BBÀBÊ iÀœ>>Ê ÃˆˆÌB]Ê œŽ>Ê Ã>>`>>˜]ʎ՘ʜÌiÌ>>˜Ê…Õœ“ˆœœ˜Ê“ÞŸÃʓޟ…i“‡ «ˆi˜Ê>È>Ž>Ï>Îi˜Ìœi˜Ê>Ê«BBÌÌޘiˆ`i˜Ê…œˆÌœ>Ž‡ ܍i˜Ê̈i`œÌ°Ê

!).%)34/*!-%.%4%,-¯4 /Õ̎ˆ“ÕÃÌ>ÊÛ>ÀÌi˜ÊÜÈ>>ˆ‡Ê>ÊÌiÀÛiÞ`i˜…Õœœ˜Ê …œˆÌœˆ“œˆÌÕÃÀiŽˆÃÌiÀˆÃÌBÊ«œˆ“ˆÌ̈ˆ˜ÊŽ>ˆŽŽˆÊޏˆÊxä‡ Û՜̈>ˆ`i˜Ê>È>ŽŽ>ˆ`i˜Ê«ˆÌŽB>ˆŽ>ˆÃ…œˆ`œ˜ÊŽÀˆÌii‡



Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

ÀˆÌÊÌBÞÌÌBÛBÌʅœˆÌœ>ŽÃœÌÊ­ÃiŽBÊ>È>Ž>Ï>Îi˜Ì>̈i‡ `œÌÊiÌÌBÊ«BBÌÌޘiˆ`i˜Ê…œˆÌœ>ŽÃœi˜Ê̈i`œÌ®ÊiÀˆ‡ ŽœˆÃÃ>ˆÀ>>˜…œˆ`œ˜ÊÛ՜`iœÃ>Ã̜…œˆ`œÃÃ>Ê­Ã>ˆÀ>>‡ >®]Ê«iÀÕÃÌiÀÛiÞ`i˜…Õœœ˜ÊÛ՜`iœÃ>Ã̜…œˆ`œÃÃ>Ê ­ÌiÀÛiÞÎiÎÕî]ÊÛ>˜…>ˆ˜Žœ`ˆÃÃ>ÊÌ>ˆÊÛ>ÃÌ>>Û>ÃÃ>Ê Ìi…œÃÌiÌÕÃÃ>Ê«>ÛiÕ>ÃՓˆÃiÃÃ>ÊÛ՜ȏÌ>Ê£™™Çq ÓääÓ°ÊœˆÌœ>ŽÃœ>ʍ>Ê>È>Ž>Ï>Îi˜Ì>̈i̜>ʜˆÊ ޅÌii˜ÃBÊxÎÓʣș° ˆ˜iˆÃ̜ÃÌ>ʏ>ÎiÌ̈ˆ˜ÊÛ՜Èi˜Ê£™™ÇqÓääÓÊۈˆ‡ “iˆÃÌi˜Ê«BˆÛˆi˜Ê«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜ÊÕŽÕ“BB‡ ÀBÌʎBÞÌÌBi˜Ê«iŽŽˆBÊÌ>ÀŽ>ÃÌiÕÛ՜`i˜Ê>È>Ž>Ç >Îi˜Ì>̈i̜>Ê­"®Ê>ÊÛ>ÃÌ>>Û>ÌʏՎՓBBÀBÌÊÌBއ `i˜ÌBi˜ÊÌ>ÀŽ>ÃÌiÕÛ՜`i˜Ê>È>Ž>Ï>Îi˜Ì>̈i̜>Ê «BBÌÌޓˆÃˆ“œˆÌՎȏ>ÊÃiŽBʓޟ…i““ˆBÊ>È>‡ Ž>Ï>Îi˜Ì>̈i`œˆ>Ê­ ®°Ê/>ÀŽ>ÃÌiÕ«BˆÛB˜ÊˆŽBÊ «ÞÃÌÞÌ̈ˆ˜Ê«BBÌÌii“BB˜ÊÌ>ÀŽ>Ã̈]ʎœÃŽ>ʈŽBʅœˆ‡ ̜>ŽÃœ˜Ê>ÕÃÃ>ʍ>ʏœ«ÕÃÃ>ÊÃiŽBʅœˆÌœ>ŽÃœ˜Ê>ŽÕ‡Ê >ʏœ««Õ«BˆÛBʜˆÛ>ÌÊ̈i`œÃÃ>°ÊÀÞ«Ì>ÌÌÕ>Êގȇ Ÿˆ˜ÌˆÌ՘˜ÕÃÌ>ʎBÞÌÌBi˜ÊÛ>À“ˆÃÌiÌ̈ˆ˜]ÊiÌÌBʍœŽ>ˆ‡ ˜i˜Ê>È>Ž>Ãʏ>ÎiÌ̈ˆ˜ÊŽÕ˜>Žˆ˜Ê«BˆÛB˜BʓՎ>>˜Ê Û>ˆ˜ÊŽiÀÀ>˜° ˆ˜iˆÃ̜ʍ>iÌ̈ˆ˜ÊˆÃBŽÃˆÊÃՎի՜i˜]ʈB˜Ê>ʏ>ˆ‡ ̜ÃÌÞÞ«ˆ˜Ê“ÕŽ>>˜ÊÀޅ“ˆˆ˜°Ê-Վի՜ˆÀޅ“BBÊ ˆ˜`iŽÃœˆÌˆˆ˜Êˆ\BÊ­ˆÊrÊ£]ÓÆÊ£ÊrʓˆiÃʍ>ÊÓÊrʘ>ˆ˜i˜®]Ê ˆŽBÀޅ“BBʍ\BÊ­ÊrÊ£]ÊÓ]ÊÎ]Ê{ÆÊ£ÊrÊxäqÈ{‡]ÊÓÊrÊÈxq Ç{‡]ÊÎÊrÊÇxqn{‡Ê>Ê{ÊrÊޏˆÊnx‡Û՜̈>>̮ʍ>ʏ>ˆÌœÃ‡ ÌÞÞ««ˆBʈ˜`iŽÃˆBʎʭŽÊrÊ£]ÊÓÆÊ£ÊrÊÌiÀÛiÞÎiÎÕÃÊÌ>ˆÊ Ã>ˆÀ>>>ʍ>ÊÓÊrÊÛ>˜…>ˆ˜ŽœÌˆÊÌ>ˆÊÛ>ÃÌ>>Û>®°Ê BˆBÊ “iÀŽˆ˜˜ŸˆBÊÛ՜`i˜ÊÝÊ>È>Ž>Ï>Îi˜Ì>̈i̜ˆ…ˆ˜Ê «iÀÕÃÌÕÛ>ʏՎՓBBÀBÊÀޅ“BÃÃBʈŽÊœ˜Ê"ˆŽ­Ý®Ê>Ê ŽœŽœÊ>ˆ˜iˆÃ̜>ʅޟ`ޘÌBi˜ÊÃ>>ÌÕÊÛ>ÃÌ>>Û>ʏՎՇ “BBÀBÊ ˆŽ­Ý®°Ê `iii˜Ê«ˆŽ­Ý®ÊrÊ"ˆŽ­Ý®É ˆŽ­Ý®Êœ˜Ê >Îi˜Ì>̈i̜i˜ÊœÃÕÕÃʎœŽœÊ>ˆ˜iˆÃ̜˜Ê«iÀÕÃÌii>Ê Ã>>`ÕÃÌ>ʏՎՓBBÀBÃÌBÊÀޅ“BÃÃBʈŽÊÛ՜˜˜>ÊÝ°Ê />ۜˆÌÌii˜>ʜˆÊ“Õœ`œÃÌ>>ʓ>ˆ]ʍœÌ>ʎBÞ̇ ÌBi˜ÊœˆÃˆÊ“>…`œˆÃÌ>Ê>Àۈœˆ`>ÊÀޅ“ˆi˜ÊˆŽÊÛBˆÃˆBÊ iÀœ>ʜÃÕÕ`iÃÃ>ʫʍ>Ê̜ˆÃ>>Ì>Êi˜˜ÕÃÌ>>Ê ˆŽÊrÊ"ˆŽÉ « ˆŽÊ …>ÕÌՏiÊ Û՜`iiÊ Ý]Ê ŽÕ˜Ê " ˆŽÊ Ì՘˜iÌ>>˜°Ê "iÌÌ>“>>]ÊiÌÌBÊ«ˆŽÊHÊ­£É ˆŽ®*œˆÃܘ­λˆŽ ˆŽ®]ʍœÃ‡ Ã>Ê눍ŽÊœ˜ÊÌiœÀiiÌ̈˜i˜ÊœÃÕÕÃÊÀޅ“BÃÃBʈŽÊ>Ê눍Ž ˆŽÊ œ˜Ê"ˆŽ\˜Êœ`œÌÕÃ>Àۜ]Ê«BB`ÞÌ̈ˆ˜ÊޏiˆÃÌiÌÌÞޘʏœ}‡ ˆ˜i>>ÀˆÃii˜Ê*œˆÃܘ‡Ài}ÀiÃȜ“>ˆˆ˜°ÊœÃŽ>ÊÃiˆ‡ ÌiÌÌBÛB˜BʜˆÛ>ÌʜÃÕÕ`iÌ]Ê«ˆÌˆÊ“>ˆÃÃ>Ê«>ˆ˜œÌÌ>>Ê ŽÕÌ>Žˆ˜Ê…>Û>ˆ˜Ìœ>Ê«ˆŽÊÕŽÕ“BBÀBBÊ ˆŽ°Ê,ޅ“ˆÌ‡ ÌiÞ“ÕÕÌÌՍˆi˜ÊˆÃBŽÃˆÊ“>ˆÃÃ>ʎBÞÌiÌ̈ˆ˜ÊÃiˆÌÌB‡ ÛB˜BʓÕÕÌÌՍ>˜>ʓޟÃÊÛ՜ÌÌ>°Ê>ˆ>ÊiÃ̈“œˆÌ>‡ iÃÃ>ʎBÞÌiÌ̈ˆ˜ÊÛ՜Èi˜Ê£™™ÇqÓää£Ê…>Û>ˆ˜Ìœ>ʍ>Ê «ÞÀˆÌ̈ˆ˜Ê i˜˜ÕÃÌ>“>>˜Ê Û՜`i˜Ê ÓääÓÊ œÃÕÕ`iÌÊ «ˆŽ­ÓääӮʎ>ˆŽˆiÊÀޅ“ˆiʈŽ°

45,/+3%4 />ՏՎœˆÃÃ>ʣʍ>ÊÓʜ˜ÊiÈÌiÌÌÞÊ«ˆÌŽB>ˆŽ>ˆÃ>È>Ž‡ Ž>ˆ`i˜ÊÕŽÕ“BBÀBÌʈŽB‡Ê>ÊÃՎի՜ˆÀޅ“ˆÌÌBˆ˜Ê

Û՜Èi˜Ê£™™ÇqÓääÓÊۈˆ“iˆÃˆ˜BÊ«BˆÛˆ˜B°Ê ˜Ãˆ˜Êœ˜Ê À>«œÀ̜ˆÌÕÊÃ>ˆÀ>>>ÃÃ>ÊÌ>ˆÊÌiÀÛiÞÎiÎՎÃi˜ÊÛ՜`i‡ œÃ>Ã̜…œˆ`œÃÃ>ʜiۈi˜ÊÕŽÕ“BBÀBÌÊ«iŽŽˆi˜ÊŽÞ‡ ÃiiÃÃBʜiÛ>˜ÊÛ՜`i˜Ê>È>Ž>Ï>Îi˜Ì>̈i̜i˜Ê«i‡ ÀÕÃÌii>ʍ>ÊÃi˜ÊBŽii˜ÊŽœŽœÊ>ˆ˜iˆÃ̜˜Ê«iÀÕÃÌii>°Ê >Îi˜Ì>̈i̜i˜ÊޅÌiÞ`iÃÃBʜ˜Ê“iÀŽˆÌÌÞʘBŽÞ‡ ۈˆ˜Ê“ÞŸÃʜÃÕÕÃʎœŽœÊ>ˆ˜iˆÃ̜˜Ê«iÀÕÃÌii>ʏ>Îi‡ ÌՈÃÌ>ʏՎՓBBÀˆÃÌB°Ê6>ÃÌ>>Û>ÌÊ̈i`œÌʜÛ>ÌʓޟÃÊ Û>˜…>ˆ˜Žœ`iˆiÊÌ>ˆÊÛ>ÃÌ>>ۈi°Ê6ˆˆ“iˆÃˆÃÃBÊÃ>À>Ž‡ ŽiˆÃÃ>ʜÛ>ÌʎœŽœÊ«œ«Õ>>̈œ˜ÊÕŽÕ“BBÀBʍ>ʘˆˆ‡ `i˜Ê>È>ŽŽ>ˆ`i˜ÊœÃÕÕÃ]ʍœÌŽ>Ê>ˆ˜iˆÃ̜˜Ê«iÀÕÃÌii>Ê œˆÛ>ÌÊÃ>“>>˜Ê>ˆŽ>>˜ÊÃiŽBÊÛ՜`iœÃ>Ã̜>ÊiÌÌBÊ Û>˜…>ˆ˜Žœ`ˆÃÃ>ÊÌ>ˆÊÌi…œÃÌiÌÕÃÃ>Ê«>ÛiÕ>ÃՓˆÃiÃÃ>° />ՏՎœˆ`i˜Ê£Ê>ÊÓÊ̈i`œˆÃÌ>ʅ>Û>ˆÌ>>˜]ÊiÌÌBÊ «iŽŽˆˆ˜Ê>Îi˜Ì>̈i̜ˆ…ˆ˜Ê«iÀÕÃÌÕÛ>ÌʏՎՓBBÀBÌÊ œÛ>ÌÊ£äqÎäʯʫˆi˜i“«ˆBʎՈ˜ÊÛ>ÃÌ>>Û>Ìʏ>Îi̇ ÌՈ˜>ʎœŽœÊ>ˆ˜iˆÃ̜ÃÌ>ʍ>ÊiÌÌBʜÃÕՎÈÃÃ>ʜ˜ÊÃi‡ ÛBBÊÛ>ˆ…ÌiÕ>ÊiÀˆÊÀޅ“ˆi˜Ê>ÊÛ՜Èi˜ÊÛBˆB°Ê/>À‡ Ž>ÃÌiÌ>iÃÃ>ÊގÈÌÌBˆÃÌi˜Ê“ÕÕÌÌՍˆi˜ÊÛ>ˆŽÕÌՎÈ>Ê œÃÕՎȈ˜]ʎ>ÌÌ>ÛÕÕÃÊ«>À>˜iiÊۜˆ“>ŽŽ>ˆ““ˆ˜Ê ˆB˜ÊŽ>ÃÛ>iÃÃ>°Ê>ÌÌ>ÛÕÕÃÊ«>À>˜iiʓޟÃÊÌ>ÀŽ>ÃÌi‡ ÕÛ՜`i˜ÊŽ>ÃÛ>iÃÃ>°Êˆi…ˆBʎ>ÌÌ>ÛÕÕÃʜ˜Ê…Õœ‡ ˜œ“«ˆÊ>ÊÛ>˜…>ˆ˜ŽœÌˆi˜ÊÌ>ˆÊÛ>ÃÌ>>ۈi˜ÊœÃÕÕ`iÌÊ œÛ>ÌÊ«ˆi˜i“«ˆB°ÊœÃŽ>ʓÕÕÌÌՍˆi˜ÊÛBˆBʜ˜ÊŽÕˆ‡ Ìi˜Žˆ˜ÊÃiÛˆBÊÃBB˜˜Ÿ˜“ÕŽ>ˆÃÕՎÈ>]ʅ>…“œÌi̇ ̈ˆ˜ÊœÃÕՎÈi˜ÊŽBÞÌÌBÞÌޓˆÃÌBʓޟÃʓœ˜ˆ“ÕÕÌÌՇ >“>ˆ˜Ê>ÛՏ>° />ՏՎœÃÃ>ÊÎʜ˜ÊÀ>«œÀ̜ˆÌÕÊ>ˆ˜iˆÃ̜>ÊÌ>ÀŽœˆ‡ ÌՎÃi˜“ÕŽ>ˆÃˆ““ˆ˜ÊŽÕÛ>>Û>˜Ê“>ˆ˜ÊÌՏœŽÃiÌ°Ê "ÃÕÕÌÌ>ʜ˜ÊȈÃÊÃiˆÌiÌÌÞʈŽBÀޅ“BB]ʏ>ˆÌœÃÌÞÞ«‡ «ˆB]ÊÃՎի՜Ì>ÊÃiŽBʈŽB‡Ê>ʏ>ˆÌœÃÀޅ“ˆÌÌBˆÃÌBÊ Û՜ÈiviŽÌˆBʎÕÛ>>ۈi˜Ê“ÕÕÌÌՍˆi˜Ê>ÛՏ>°Ê>ˆ˜Ê ޅÌii˜Ãœ«ˆÛÕÕÃÊ>ˆ˜iˆÃ̜˜ÊŽ>˜ÃÃ>ʓˆÌ>ÌÌ՘>ÊχӇ ÌiÃ̈Bʜ˜ÊiÀˆ˜œ“>ˆ˜i˜Ê­«ÊrÊä]™{Ç®]ÊiˆŽBʓˆ˜ŽBB˜Ê ˆ˜ÌiÀ>ŽÌˆœÌiÀ“ˆ˜Ê ˆÃBB“ˆ˜i˜Ê i˜BBÊ «>À>˜Ì>˜ÕÌÊ “>ˆ>ʓiÀŽˆÌÌBÛBÃ̈°Ê*iÀÕÃÀޅ“B˜Bʜ˜ÊŽBÞÌiÌÌÞÊ Û>˜…>ˆ˜Žœ`ˆÃÃ>ÊÌ>ˆÊÛ>ÃÌ>>Û>ÃÃ>Ê>ÃÕۈ>ÊޏˆÊn{‡ Û՜̈>ˆÌ>ʘ>ˆÃˆ>ʍ>ÊÛ՜ȓÕÕÌÌՍ>˜Ê«iÀÕÃÛ՜`iŽÃˆÊ œ˜ÊÛ>ˆÌÌÕÊÛ՜ÈÊÓäää°ÊŽBÀޅ“B˜]ʏ>ˆÌœÃÌÞÞ«ˆ˜Ê>Ê ÃՎի՜i˜Ê«BBÛ>ˆŽÕÌՎÃiÌʜÛ>ÌÊ̈>Ã̜ˆÃiÃÌˆÊ “iÀŽˆÌÃiۈB]ÊÛ>ˆŽŽ>ʘiʎœ˜ÌÀœœˆ`>>˜Ê̜ˆÃÌi˜Ã>Ê >ÊÀޅ“ˆÌÌBˆÃi˜ÊÛ՜ȓÕÕÌÌՍ>˜ÊÃՅÌii˜°Ê>ÌÌ>‡ ÛÕÕÃÊȈÃÊ«>À>˜iiÊÃiÛBÃ̈ʈB˜Ê“ÞŸÌBÊ­«ÊÊä]ää䣮°Ê "ÃÕÕ`iÌʜÛ>Ìʈܓ«ˆ>ÊÃ>ˆÀ>>œˆ`i˜Ê>ÊÌiÀÛiÞÎiÇ ŽÕÃÌi˜ÊÛ՜`iœÃ>Ã̜ˆ>ʎՈ˜ÊÛ>˜…>ˆ˜Žœ`iˆÃÃ>ÊÌ>ˆÊ Û>ÃÌ>>ۈÃÃ>Ê­«ÊÊä]ää䣮ʍ>ʎ>ÌÌ>ÛÕÕÃʜ˜Ê«>Ài“«ˆÊ ˜>ˆÃÌi˜ÊœÃ>Ì>Ê­«ÊrÊä]ääÓ®°Êˆ˜i>>Àˆ˜i˜ÊÛ՜ÈiviŽ‡ ̈ʜ˜ÊŽ>ÃÛ>Û>ÊÛ>˜…>ˆ˜ŽœÌˆi˜ÊÌ>ˆÊÛ>ÃÌ>>ۈi˜ÊœÃ>Ì>Ê Ž>ˆŽˆÃÃ>ʈŽBÀޅ“ˆÃÃB°Ê->ˆÀ>>œˆ`i˜Ê>ÊÌiÀÛiÞÎiÇ ŽÕÃÌi˜ÊÛ՜`iœÃ>Ã̜ˆ>ʜ˜Ê>ÎiÛ>ÊÌÀi˜`ˆÊ˜ÕœÀˆ“‡ “>ÃÃ>ʈŽBÀޅ“BÃÃB]ʓÕÌÌ>ʓՈÃÃ>ʈŽBÀޅ“ˆÃÃBÊ Ìˆ>Ã̜ˆÃiÃ̈ʓiÀŽˆÌÃiÛBBʏˆ˜i>>ÀˆÃÌ>ÊÌÀi˜`ˆBÊiˆÊ œiʅ>Û>ˆÌÌ>ۈÃÃ>°



Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

/>ՏՎŽœÊ£° ŽBB˜Ìޘiˆ`i˜Ê«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜Ê“BBÀBÌÊ-՜“iÃÃ>Ê£™™ÇqÓääÓ]ʓˆi…iÌ°

£™™Ç £™™n £™™™ Óäää Óää£ ÓääÓ

xäqÈ{ʈi…iÌ /iÀÛiÞÎiÎÕÃÊÌ>ˆÊÃ>ˆÀ>>> >Îi˜Ì>£ "ÃÕÕÃÓ >ˆŽŽˆÎ ÈÓÈ nä]{ʯ ÇǙ xÇÇ Ç™]nʯ ÇÓΠȣä n£]Óʯ Çx£ Èän Ç{]Çʯ n£{ x™x Ç{]nʯ Ǚx È È xÇÓ ÇÎ]{Ê¯È ÇǙÈ

6>˜…>ˆ˜ŽœÌˆÊÌ>ˆÊÛ>ÃÌ>>Û> >Îi˜Ì>£ "ÃÕÕÃÓ >ˆŽŽˆÎ ÎnÎ Çx]£Ê¯ x£ä Î{{ ș]Èʯ {™{ ÎÈ{ ș]Èʯ xÓÎ În£ Çä]nʯ xÎn {™Ó n£]™Ê¯ Èä£ È È x£™ ÇÇ]™Ê¯È ÈÈÈÈ

œi““>Ì 9…Ìii˜ÃB{ *BBiŽŽBˆÃˆBx £Ón{ ä]{ʯ £Ó£{ ä]Óʯ £ÓÇÓ ä]Óʯ £Î{Ç ä]{ʯ £Î™{ ä]£Ê¯ È £{ÎnÈ

£™™Ç £™™n £™™™ Óäää Óää£ ÓääÓ

ÈxqÇ{ʈi…iÌ /iÀÛiÞÎiÎÕÃÊÌ>ˆÊÃ>ˆÀ>>> >Îi˜Ì> "ÃÕÕà >ˆŽŽˆ ™ÇÓ nä]xʯ £ÓäÇ ™Èx nÎ]£Ê¯ ££È£ ™{Ç nÎ]Îʯ ££ÎÇ n™{ n{]Îʯ £äÈä ™Ó™ nÓ]Çʯ ££ÓÎ n{{ nÎ]Çʯ £ään

6>˜…>ˆ˜ŽœÌˆÊÌ>ˆÊÛ>ÃÌ>>Û> >Îi˜Ì> "ÃÕÕà >ˆŽŽˆ £ÓÓ£ Çn]xʯ £xxÈ £Ó{Î nä]£Ê¯ £xx£ £ÓÇx n£]xʯ £xÈ{ £ÓÇ{ nä]™Ê¯ £xÇx £ÓÈ{ nx]{ʯ £{nä £Îxn nÎ]™Ê¯ £È£™

œi““>Ì 9…Ìii˜ÃB *BBiŽŽBˆÃˆB ÓÇÎÇ ä]™Ê¯ ÓșΠä]Çʯ ÓÈnÈ ä]Èʯ ÓÈÓx ä]{ʯ Óxnx ä]Çʯ ÓÈ£{

£™™Ç £™™n £™™™ Óäää Óää£ ÓääÓ

Çxqn{ʈi…iÌ /iÀÛiÞÎiÎÕÃÊÌ>ˆÊÃ>ˆÀ>>> >Îi˜Ì> "ÃÕÕà >ˆŽŽˆ £Î™n nÓ]{ʯ £È™Ç £ÎÇ{ nÎ]äʯ £Èxx £{Ó£ n{]Èʯ £ÈǙ £{{Ó nÎ]Óʯ £ÇÎ{ £{ÎÇ n{]{ʯ £ÇäÓ £{n£ nx]Îʯ £ÇÎÈ

6>˜…>ˆ˜ŽœÌˆÊÌ>ˆÊÛ>ÃÌ>>Û> >Îi˜Ì> "ÃÕÕà >ˆŽŽˆ Ó££Î nä]Óʯ ÓÈÎ{ ӣΣ Ǚ]£Ê¯ Óș{ ÓÎxÎ nÓ]Çʯ Ón{x Ó{È{ nÓ]Çʯ әÇn ÓÈxÈ nx]Èʯ Î£ä£ Ón£ä nx]Îʯ Îәx

œi““>Ì 9…Ìii˜ÃB *BBiŽŽBˆÃˆB {ÎäÈ ä]Èʯ {ÎÓ£ ä]Èʯ {xäÓ ä]xʯ {ÈnÎ ä]Èʯ {ÇÇä ä]Çʯ xääÈ

£™™Ç £™™n £™™™ Óäää Óää£ ÓääÓ

nxqʈi…iÌ /iÀÛiÞÎiÎÕÃÊÌ>ˆÊÃ>ˆÀ>>> >Îi˜Ì> "ÃÕÕà >ˆŽŽˆ ™{{ n{]Çʯ £££{ ™{£ nn]Îʯ £äÈÈ ™äÇ n{]Èʯ £äÇÓ n™™ n{]xʯ £äÈ{ n™n nÇ]äʯ £äÎÓ nÇ{ nn]Îʯ 䙙ä

6>˜…>ˆ˜ŽœÌˆÊÌ>ˆÊÛ>ÃÌ>>Û> >Îi˜Ì> "ÃÕÕà >ˆŽŽˆ £nÎÎ n{]Îʯ Ó£Çx £™Î™ n{]{ʯ Ó £™Çx nÎ]Óʯ ÓÎÇx ÓäÇn nÎ]nʯ Ó{n£ Ó£xÈ nÈ]nʯ Ó{nÎ ÓÓxx nÈ]xʯ ÓÈän

œi““>Ì 9…Ìii˜ÃB *BBiŽŽBˆÃˆB ÎÓÈ{ ä]nʯ ÎÎ{£ ä]Çʯ Î{ә ä]xʯ Îxän £]£Ê¯ Î{™Î ä]Èʯ Îxnä

£®ÊÞÃiiÃÃBʜiÛ>˜ÊÛ՜`i˜Ê>È>Ž>Ï>Îi˜Ì>̈i̜i˜Ê“ÕŽ>ˆ˜i˜Ê«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜Ê“BBÀB Ó®ÊÈ>Ž>Ï>Îi˜Ì>̈i̜i˜Ê“ÕŽ>ˆ˜i˜Ê“BBÀBÊÃՅÌiÕÌiÌÌ՘>ʓՈ>Ê«ˆÌŽB>ˆŽ>ˆÃ…œˆ`œ˜ÊŽÀˆÌiiÀˆÌÊÌBÞÌÌBۈBʅœˆÌœˆ“œˆ‡ ÌՎȏ>ÊÌBÞ`i˜˜iÌޘÊ>ˆ˜iˆÃ̜˜Ê“ÕŽ>ˆÃii˜Ê“BBÀBB˜ ήÊ>ˆŽŽˆi˜Ê«ˆÌŽB>ˆŽ>ˆÃ…œˆ`œ˜ÊŽÀˆÌiiÀˆÌÊÌBÞÌÌBۈi˜Ê…œˆÌœˆ“œˆÌՎÈi˜Ê«iÀÕÃÌii>ʏ>ÎiÌÌÕÊ«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜Ê “BBÀB {®Ê*ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜Ê“BBÀBÊޅÌii˜ÃBÊ­«>ÛiÕÌÞÞ«ˆ˜Ê“ÕŽ>>˜ÊiÀœÌÌii“>ÌÌ>® x®Ê ˆˆ`i˜Ê>È>ŽŽ>ˆ`i˜ÊœÃÕÕÃ]ʍœÌŽ>Ê>ˆ˜iˆÃ̜˜Ê“ÕŽ>>˜ÊœˆÛ>ÌÊÃ>“>>˜Ê>ˆŽ>>˜ÊÃiŽBÊÌiÀÛiÞ`i˜…Õœœ˜ÊÛ՜`iœÃ>Ã̜>Ê iÌÌBÊÛ>˜…>ˆ˜Žœ`ˆÃÃ>ÊÌ>ˆÊÛ>ÃÌ>>Û>ÃÃ> Ȯʈ…>ۜˆ`ÕÌʏÕÛÕÌʜÛ>ÌÊÛ՜`i˜ÊÓääÓÊi˜˜ÕÃÌiÌÌՍ>Ê>Àۜ>Ê­Ž>ÌÜÊÃiˆÌÞÃÊÌiŽÃ̈ÃÌB®

/>ՏՎœˆÃÃ>ʣʍ>ÊÓʜ˜ÊÀ>«œÀ̜ˆÌÕʓޟÃÊÛ՜‡ `i˜ÊÓääÓÊi˜˜ÕÃÌiÌÕÌʜÃÕÕ`iÌʍ>Ê>È>ŽŽ>ˆ`i˜ÊÕ‡ ŽÕ“BBÀBÌÊ­ˆ…>ۜˆ`ÕÌʏÕÛÕÌ®°ÊœÃŽ>ÊÃ>“>>˜Ê>ˆ‡ Ž>>˜ÊÛ՜`iœÃ>Ã̜>ʍ>ÊÛ>˜…>ˆ˜Žœ`ˆÃÃ>ʜiۈŽÃˆÊ “iÀŽˆÌÌލBʜ˜ÊŽiΈ“BBÀˆ˜Ê˜œˆ˜Ê«ÕœˆÊ«ÀœÃi˜Ì̈>]Ê œ˜Êi˜˜ÕÃÌiÌÌÕ>Ê ŽœŽœ˜>ˆÃÕŽÕ“BBÀBBÊ ŽœÀ>ÌÌÕÊ Û>ÃÌ>>Û>Ã̈Ê>>ëBˆ˜Ê>ˆÌœÃÌÞÞ««ˆŽœ…Ì>ˆÃÌi˜Êi˜‡ ˜ÕÃÌiˆ`i˜ÊÃՓ“>>˜Ê˜B…`i˜°

0/($).4! /BÃÃBÊÌÕ̎ˆ“ÕŽÃiÃÃ>ʅ>Û>ˆÌ̈ˆ˜]ÊiÌÌBÊ«iŽBÃÌBB˜Ê ޅ`i˜ÊÛ՜`i˜Ê>È>Ž>Ï>Îi˜Ì>̈i`œˆÃÌ>ʏ>Îi“>‡ >Ê«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜ÊÕŽÕ“BBÀBʜ˜ÊÃiÛBÃÌˆÊ «ˆi˜i“«ˆÊŽÕˆ˜Êȏœˆ˜]ʎ՘ʜÌiÌ>>˜Ê…Õœ“ˆœœ˜Ê “ÞŸÃʓޟ…i“«ˆi˜Ê>È>Ž>Ï>Îi˜Ìœi˜Ê>Ê«BBÌÌއ ˜iˆ`i˜Ê…œˆÌœ>ŽÃœi˜Ê̈i`œÌ°Ê/B…B˜Êœ˜Ê“œ˜Ì>Ê “>…`œˆÃÌ>ÊÃÞÞÌB°Ê



Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

/>ՏՎŽœÊÓ°Ê ŽBB˜Ìޘiˆ`i˜Ê«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜Ê“BBÀBÌÊ-՜“iÃÃ>Ê£™™ÇqÓääÓ]ʘ>ˆÃiÌ°

£™™Ç £™™n £™™™ Óäää Óää£ ÓääÓ

xäqÈ{Ê >ˆÃiÌ /iÀÛiÞÎiÎÕÃÊÌ>ˆÊÃ>ˆÀ>>> >Îi˜Ì>£ "ÃÕÕÃÓ >ˆŽŽˆÎ {xx Çn]Çʯ xÇn {Î{ ÇÎ]™Ê¯ xnÇ {ÓÇ Çn]nʯ x{Ó {äÈ ÇÎ]Çʯ xx£ Ι{ Ç£]{ʯ xxÓ È È {xÎ Ç{]ÈÊ¯È ÈäÇÈ

6>˜…>ˆ˜ŽœÌˆÊÌ>ˆÊÛ>ÃÌ>>Û> >Îi˜Ì>£ "ÃÕÕÃÓ >ˆŽŽˆÎ ÎÈ£ Ç{]Îʯ {nÈ Î{{ ÇÎ]nʯ {ÈÈ Îx™ ÇÎ]Çʯ {nÇ ÎÈÎ ÇÇ]Óʯ {Çä {ÓÈ nÓ]Èʯ x£È È È {ÓÈ Ç™]ÓÊ¯È xÎnÈ

œi““>Ì 9…Ìii˜ÃB{ *BBiŽŽBˆÃˆBx £äx{ ä]™Ê¯ £ä{Ç ä]Èʯ £äÓÎ ä]Èʯ £ä£n ä]Îʯ £äÈ{ ä]{ʯ È ££Î™È

£™™Ç £™™n £™™™ Óäää Óää£ ÓääÓ

ÈxqÇ{Ê >ˆÃiÌ /iÀÛiÞÎiÎÕÃÊÌ>ˆÊÃ>ˆÀ>>> >Îi˜Ì> "ÃÕÕà >ˆŽŽˆ £ÓnÈ n£]£Ê¯ £xnÈ £ÓÓä nÓ]Çʯ £{ÇÈ ££ÎÓ nÎ]™Ê¯ £Î{™ ££ÎÓ nÓ]™Ê¯ £ÎÈx £ä{ä nÎ]Çʯ £Ó{Ó £ääÓ nx]£Ê¯ ££ÇÇ

6>˜…>ˆ˜ŽœÌˆÊÌ>ˆÊÛ>ÃÌ>>Û> >Îi˜Ì> "ÃÕÕà >ˆŽŽˆ £näÇ ÇÇ]™Ê¯ ÓΣ™ £nä£ ÇÇ]Èʯ ÓÎÓÓ £nn{ nÎ]Óʯ ÓÓÈx £n££ n£]nʯ ÓÓ£{ £™än nÈ]Îʯ ÓÓ£Ó ÓäÎÈ nx]Îʯ ÓÎnÇ

œi““>Ì 9…Ìii˜ÃB *BBiŽŽBˆÃˆB ÎnÇÇ ä]Çʯ ÎÇÇÎ ä]Çʯ ÎÈä£ ä]{ʯ ÎxÈÎ ä]{ʯ Î{Ι ä]{ʯ Îx{È

£™™Ç £™™n £™™™ Óäää Óää£ ÓääÓ

Çxqn{Ê >ˆÃiÌ /iÀÛiÞÎiÎÕÃÊÌ>ˆÊÃ>ˆÀ>>> >Îi˜Ì> "ÃÕÕà >ˆŽŽˆ {äÓÇ nÎ]™Ê¯ {Ç™Ç {äÇn nx]™Ê¯ {Ç{x ÎnäÎ nÈ]Îʯ {{äÇ ÎnäÓ nÈ]Îʯ {{äÈ ÎÇÇx nÈ]nʯ {Î{n ÎÈÎx nÈ]nʯ {£™ä

6>˜…>ˆ˜ŽœÌˆÊÌ>ˆÊÛ>ÃÌ>>Û> >Îi˜Ì> "ÃÕÕà >ˆŽŽˆ ÈnÈ£ nä]xʯ nxÓx șÈÈ nä]nʯ nÈÓÈ Ç{£Ç nÎ]£Ê¯ n™Ó£ ÇÈÈÎ n{]äʯ ™£ÓÇ n££n nÇ]Çʯ ™ÓxÓ nx£Ó nÈ]Çʯ ™n£x

œi““>Ì 9…Ìii˜ÃB *BBiŽŽBˆÃˆB £ÎÓÓÓ ä]nʯ £ÎÓÈx ä]nʯ £ÎÓxÓ ä]Èʯ £Î{{x ä]Çʯ £ÎxÎÈ ä]xʯ £Î™Îx

£™™Ç £™™n £™™™ Óäää Óää£ ÓääÓ

nxqÊ >ˆÃiÌ /iÀÛiÞÎiÎÕÃÊÌ>ˆÊÃ>ˆÀ>>> >Îi˜Ì> "ÃÕÕà >ˆŽŽˆ {Ç{£ nÇ]Îʯ x{ÎÎ {Ǚ{ n™]Èʯ xÎxä {näÇ n™]£Ê¯ xÎ™È {ÇÇÈ n™]Èʯ xÎә {n£{ ™ä]xʯ xÎÓÓ {{Çn n™]nʯ {™nÇ

6>˜…>ˆ˜ŽœÌˆÊÌ>ˆÊÛ>ÃÌ>>Û> >Îi˜Ì> "ÃÕÕà >ˆŽŽˆ nn£Î nx]Óʯ £äÎ{È ™£{È nx]äʯ £äÇÈÈ ™xÎ{ nÈ]£Ê¯ ££äÇÓ ™Çäx n{]nʯ ££{{£ £äÎx™ä n™]nʯ ££xÎÈ nÇ]™Ê¯ £Ó{{È £ä™{Óä

œi““>Ì 9…Ìii˜ÃB *BBiŽŽBˆÃˆB £xÈÇx ä]Çʯ £Èä£Ç ä]Èʯ £ÈÎnn ä]xʯ £ÈÈ{Î ä]nʯ £ÈÇÈÇ ä]xʯ £ÇÎ{È

£®ÊÞÃiiÃÃBʜiÛ>˜ÊÛ՜`i˜Ê>È>Ž>Ï>Îi˜Ì>̈i̜i˜Ê“ÕŽ>ˆ˜i˜Ê«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜Ê“BBÀB Ó®ÊÈ>Ž>Ï>Îi˜Ì>̈i̜i˜Ê“ÕŽ>ˆ˜i˜Ê“BBÀBÊÃՅÌiÕÌiÌÌ՘>ʓՈ>Ê«ˆÌŽB>ˆŽ>ˆÃ…œˆ`œ˜ÊŽÀˆÌiiÀˆÌÊÌBÞÌÌBۈBʅœˆÌœˆ“œˆ‡ ÌՎȏ>ÊÌBÞ`i˜˜iÌޘÊ>ˆ˜iˆÃ̜˜Ê“ÕŽ>ˆÃii˜Ê“BBÀBB˜ ήÊ>ˆŽŽˆi˜Ê«ˆÌŽB>ˆŽ>ˆÃ…œˆ`œ˜ÊŽÀˆÌiiÀˆÌÊÌBÞÌÌBۈi˜Ê…œˆÌœˆ“œˆÌՎÈi˜Ê«iÀÕÃÌii>ʏ>ÎiÌÌÕÊ«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜Ê “BBÀB {®Ê*ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜Ê“BBÀBÊޅÌii˜ÃBÊ­«>ÛiÕÌÞÞ«ˆ˜Ê“ÕŽ>>˜ÊiÀœÌÌii“>ÌÌ>® x®Ê ˆˆ`i˜Ê>È>ŽŽ>ˆ`i˜ÊœÃÕÕÃ]ʍœÌŽ>Ê>ˆ˜iˆÃ̜˜Ê“ÕŽ>>˜ÊœˆÛ>ÌÊÃ>“>>˜Ê>ˆŽ>>˜ÊÃiŽBÊÌiÀÛiÞ`i˜…Õœœ˜ÊÛ՜`iœÃ>Ã̜>Ê iÌÌBÊÛ>˜…>ˆ˜Žœ`ˆÃÃ>ÊÌ>ˆÊÛ>ÃÌ>>Û>ÃÃ> Ȯʈ…>ۜˆ`ÕÌʏÕÛÕÌʜÛ>ÌÊÛ՜`i˜ÊÓääÓÊi˜˜ÕÃÌiÌÌՍ>Ê>Àۜ>Ê­Ž>ÌÜÊÃiˆÌÞÃÊÌiŽÃ̈ÃÌB®

0)4+¯!)+!)3!3)!++55$%.+2)4%%2)4%)6¯46¯,44¯-¯44¯/,% 6/)-!33!6)%,¯,!3+%.4!0¯)6¯.¯ *ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽÕÕÃÊiˆÊÛBÌÌB“BÌÌBʜiʜÕÌÊۜˆ‡ “>ÃÃ>Ê>È>Ž>Ï>Îi˜Ì>«BˆÛB˜B]ÊÛ>ˆŽŽ>ʓޟ…i“‡ «ˆi˜Êˆ“œˆÌÕÃÌi˜Ê“ÕŽ>>˜Ê…œˆÌœ>ŽÃœÊœÃœˆÌÌ>ÕÌÕÕÊ ÞˆÊ™äÊÛ՜ÀœŽ>ÕÌÌ>ʎiÃÌBÛBŽÃˆ°Ê՘Ê>ˆ˜iˆÃ̜ÃÌ>Ê «œˆ“ˆÌ>>˜Ê«ˆÌŽB>ˆŽ>ˆÃ…œˆ`œ˜ÊÌ՘˜ÕÓiÀŽˆÌÊÌBÞ̇ ÌBÛBÌʅœˆÌœ>ŽÃœÌ]Ê«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽÕÕÃʎœ…`ˆÃ‡

ÌÕÕʘBˆÃÃBʅœˆÌœ>ŽÃœˆÃÃ>ÊÌ>Ž>ÕÌÕÛ>Ã̈ʎœŽœÊ…œˆ‡ ̜>ŽÃœœ˜°Ê œÃÊÛ>ÀȘ>ˆ˜i˜Ê«BB̟ÃÊ«ˆÌŽB>ˆŽ>ˆÃ…œˆ`œÃÌ>Ê œ˜ÊÌi…ÌÞÊÛ>ÃÌ>ʏ>Îi˜Ì>«BˆÛB˜ÊBŽii˜]Ê>È>ŽŽ>>˜Ê iˆÊŽÕՏՎ>>˜Êœ>ʓՎ>˜>ʏ>Îi˜Ì>«BˆÛB˜Ê«ˆÌŽB‡ >ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜ÊÕŽÕ“BBÀBÃÃB°ÊœˆiŽˆ˜Ê>È>‡ Ž>ÃÀޅ“ˆiÊ«ˆÌŽB>ˆŽ>ˆÃ…œˆ`œ˜Ê«BB̟ÃÌBÊiˆÊi`iÃÊ Ìi…`B]ʍœœˆ˜Ê>È>Ž>Ãʜ˜Ê«ˆÌŽB>ˆŽ>ˆ˜i˜ÊÛ>ÃÌ>]Ê



Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

/>ՏՎŽœÊΰ *ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜Ê>È>Ž>Ï>Îi˜˜œˆÃÃ>ÊÀ>«œÀ̜ˆÌ>Û>>˜ÊœÃÕÕÌii˜ÊÛ>ˆŽÕÌÌ>ۈ>ÊÌiŽˆŸˆÌBʎÕÛ>>Û>˜Ê *œˆÃܘ‡Ài}ÀiÃȜ“>ˆ˜ÊÌՏœŽÃiÌ£° *iÀÕÜÃÕÕÃ

ä]nÈÓä

™x‡¯ÊÕœÌÌ>“ÕÃÛBˆ ä]nxÈ ä]nș

"ÃÕÕà ÀiviÀi˜ÃÈÀޅ“B˜Ê œÃÕÕ`iÃÌ>

™x‡¯ÊÕœÌÌ>“ÕÃÛBˆ

ŽBÀޅ“B xäqÈ{ ÈxqÇ{ Çxqn{ nx³

ä]nÇxx ä]™{xn ä]™Èxx £]ääää

ä]nxÈ ä]™ÎÎ ä]™xÈ

*>ÛiÕÌÞÞ««ˆ 6՜`iœÃ>Ã̜ 6>˜…>ˆ˜ŽœÌˆ

£]äÓnn £]ääää

£]䣙

-Վի՜ˆ ˆià >ˆ˜i˜

ä]™nÎ{ £]äää

ä]™ÇÎ

­ÀiviÀi˜ÃÈÀޅ“B®

ä]™näÎ £]äÓ{{ £]ääÇÎ £]äÓә £]ääÈÈ £]äÓän £]ääÈÓ £]ä䙙

£]äÎn ­ÀiviÀi˜ÃÈÀޅ“B® ä]™™{ ­ÀiviÀi˜ÃÈÀޅ“B®

"ÃÕÕÃÊi`iˆÃi˜ÊÛ՜`i˜Ê œÃÕÕ`iÃÌ> 6՜ÈiviŽÌˆÊ­«iÀÕÃÛ՜ÈÊÓäää® xäqÈ{ÊÛ՜`iœÃ>Ã̜ xäqÈ{ÊÛ>˜…>ˆ˜ŽœÌˆ ÈxqÇ{ÊÛ՜`iœÃ>Ã̜ ÈxqÇ{ÊÛ>˜…>ˆ˜ŽœÌˆ Çxqn{ÊÛ՜`iœÃ>Ã̜ Çxqn{ÊÛ>˜…>ˆ˜ŽœÌˆ nx³ÊÛ՜`iœÃ>Ã̜ nx³ÊÛ>˜…>ˆ˜ŽœÌˆ

ä]n™x ä]™x™ ä]™Çx

™x‡¯ÊÕœÌÌ>“ÕÃÛBˆ ä]™ÈÓ £]ääÓ ä]™™{ £]ä£Ó ä]™™n £]ä£{ ä]™™n £]ää{

ä]™™™ £]ä{Ç £]äÓ£ £]äÎ{ £]ä£x £]äÓÇ £]ä£x £]ä£È

£®Ê />ՏՎœÃÃ>ÊÀ>«œÀ̜ˆ`Տ>ʓ>ˆ>Êۜˆ`>>˜Ê>Îi>ÊÌ>ՏՎœˆ`i˜Ê£Ê>ÊÓÊi˜˜ÕÃÌiÌÕÌʜÃÕÕ`iÌ°Ê Ãˆ“iÀŽˆŽÃˆÊÇxqn{‡ Û՜̈>>ÌÊÛ՜`iœÃ>Ã̜>ʜiÛ>Ìʓˆi…iÌ\Êä]nÈÓäÊ­«iÀÕÜÃÕÕîÊ×Êä]™Èxx­ˆŽBÀޅ“BÊÇxqn{®Ê×Ê£]äÓnnÊ­Û՜`iœÃ>Ã̜®ÊÊ ×Êä]™nÎ{­“ˆiîÊ×Ê£]ääÈÈ­Û՜ÈiviŽÌˆÊÓäääÊq€ÊÓä䣮Ê×Ê£]ääÈÈÊ­Û՜ÈiviŽÌˆÊÓää£Êq€ÊÓääÓ®ÊrÊä]nxÎ

ŽÕ˜Ê…œˆÌœ>ŽÃœÊœ˜ÊŽiÃÌB˜ÞÌÊޏˆÊ™äÊÛ՜ÀœŽ>ÕÌÌ>°Ê

ȓiÀŽˆŽÃˆÊÌi…œÃÌiÌÌÕÊ«>ÛiÕ>ÃՓˆ˜i˜ÊiˆÊœiÊ >ˆÌœÃ…œˆÌœ>]ʍœÌi˜ÊÃi˜Ê>È>ŽŽ>ˆiÊiˆÊÌi…`BÊ«ˆÌŽB‡ >ˆŽ>ˆÃ…œˆ`œ˜Ê«BB̟ÃÌB°ÊޟÎBB˜ÊÃ>ˆÀ>>>ÌÊiˆÛBÌÊ «BB>È>ÃÃ>ʜiÊÌ>ÀŽœˆÌiÌÌÕÊ«ˆÌŽB>ˆŽ>ˆÃ…œˆÌœ>ÊÛ>À‡ Ìi˜]ʍœÌi˜Ê˜ˆˆÃÃBʜœÊ«iÀÕÃÌÕÕÊÛ>ˆ˜Ê…>Àۜˆ˜Ê«ˆÌŽB‡ >ˆŽ>ˆÃ…œˆ`œ˜Ê«BB̟ŽÃii˜Ê>ÊÃÞÞÊ«ˆÌŽBB˜Ê…œˆÌœ>Ž‡ ܜ˜Êœ˜ÊÛ>ÀȘ>ˆÃi˜ÊÃ>ˆÀ>>˜…œˆ`œ˜Ê«ˆÌŽˆÌÌޓˆ˜i˜Ê iȓiÀŽˆŽÃˆÊŽœ“«ˆŽ>>̈œˆ`i˜ÊÌ>Žˆ>°

`iBʓ>ˆ˜ˆÌÌՍi˜ÊÃiˆŽŽœi˜ÊÛ>ˆŽÕÌÕÃÌi˜Ê>À‡ ۈœˆ“ˆÃiŽÃˆÊ>ˆ˜iˆÃ̜ÃÌ>ʏ>ÎiÌ̈ˆ˜ÊŽœÀŽiˆ˜Ì>>˜Ê™äÊ Û՜ÀœŽ>ÕÌÌ>ʏ>Îi˜Ì>«BˆÛB˜Bʅœˆ`œÃÃ>ʜiˆ`i˜Ê >È>ŽŽ>ˆ`i˜Ê“BBÀBÌÊiÀˆÊÀޅ“ˆÃÃB°Ê6>ˆŽŽ>ʘBˆ˜Ê Ã>>`ÕÌʏÕÛÕÌÊÈÃBÌBÛBÌʓޟÃʘiÊ>iʙäÊÛ՜Àœ‡ Ž>ÕÌÌ>ʅœˆ`œÃÃ>ʜiiÌ]ʍœˆiÊi…ŽBʜ˜Žˆ˜ÊÌi…ÌÞÊ «ˆÌŽB>ˆŽ>ˆÃ…œˆ`œ˜Ê«BB̟Ã]ʘBÞÌÌBBÊȏÌB]ÊiÌÌBÊ«ˆÌ‡ ŽB>ˆŽ>ˆÃ>È>ŽŽÕÕ`i˜Ê»Ì>Ž>ÕÌÕÛ>ʎˆÀ>ÕÌՓˆ˜i˜»Ê “ÞŸÃʅœˆÌœ>ŽÃœ˜Ê«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽÕÕ`i˜ÊŽÀˆÌii‡ Àˆ˜ÊÌBÞÌÌޓˆÃÌBÊi`iÌB˜iiiÊ>>iÊÃiˆÌÌBBÊÛ>Ì>‡ œÃ>˜ÊÌiÀÛiÞÎiÎÕÃÌi˜Ê>ÊÃ>ˆÀ>>œˆ`i˜Ê>È>Ž>Ï>Ç

Ži˜Ìœi˜Ê«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜Ê«ÕÕÌÌՓˆÃiÃÌ>Ê >Îi˜Ì>̈i`œˆÃÃ>°Ê-i˜Êȍ>>˜ÊÛ>˜…>ˆ˜Žœ`iˆÃÃ>ʍ>Ê Û>ÃÌ>>ۈÃÃ>ʜ˜Ê>È>ŽŽ>ˆÌ>]ʍœˆ`i˜Ê«ÕÕÌÌՓˆ˜i˜Ê >Îi˜Ì>̈i̜i˜Ê«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆÃÌ>ÊiˆÊÃiˆÌÞÊ Ì>Ž>ÕÌÕÛ>>ʎˆÀ>ÕÌՓˆÃi>°Ê Bˆ`i˜Ê»ÃiˆÌÌB“Ḃ ̟“ˆi˜»ÊœÃÕÕÃʜ˜Ê˜ÕœÀˆ˜Ì>ʈŽBÀޅ“BBÊ­xäqÈ{‡Û°®Ê ÕŽÕ՘ʜÌÌ>“>ÌÌ>ʓÕÕÌ>“>˜Ê«ÀœÃi˜Ìˆ˜ÊÕœŽŽ>>°Ê

(/)4/),-/)453!).%)34/.+!44!6553/.'%,-!4 6>ˆŽŽ>ʓBBÀBˆÃiÃ̈ÊÃÕÕÀˆ˜ÊœÃ>ʏ>Îi˜Ì>̈i`œˆÃÌ>Ê «ÕÕÌÌÕۈÃÌ>Ê«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆÃÌ>ʘBÞÌÌBBŽˆ˜ÊÃi‡ ˆÌÌÞÛB˜ÊȏB]ÊiÌÌBʅiÊiˆÛBÌÊ̜`iˆÃÕÕ`iÃÃ>ÊۈiBÊ œiʜii̎>>˜Ê«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆÌ>ʏ>Îi˜Ì>«Bˆ‡ ÛB˜B]ʜÃ>ʅ>Û>ˆÌՈÃÌ>ÊiÀœˆÃÌ>ÊۜˆÊŽÕˆÌi˜Žˆ˜Êœ…ÌÕ>Ê “ÞŸÃÊ«ÕÕÌÌiˆÃÌ>ʅœˆÌœˆ“œˆÌÕÃ>ˆ˜iˆÃ̜i˜ÊŽ>ÌÌ>‡ ÛÕÕ`iÃÃ>°Ê/B“BÊۜˆÃˆÊÃiˆÌÌBBÊiÀœ>ÊÌiÀÛiÞ`i˜‡ …Õœœ˜Ê…œˆÌœˆ“œˆÌÕÃÌi˜ÊœÃ>Ì>]ÊȏBʎœŽi“ÕÇ «iÀBˆÃiÃ̈Ê>Àۈœˆ`>>˜]ÊiÌÌBÊÌiÀÛiÞ`i˜…ÕœœÃÃ>Ê >È>Ž>Ï>Îi˜Ì>ˆ“œˆÌՎÃiÌÊÃ>>ÌiÌ>>˜ÊBÌÌBBÊÕÃi‡ >““ˆ˜Êˆ“œˆÌÌ>“>ÌÌ>ʎՈ˜Ê«BBÌÌޘiiÌʅœˆÌœ>Ž‡



Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

ÜÌ]ʍœÌŽ>ÊÌՏiÛ>ÌÊÕÃiˆ˜ÊÃ՜À>>˜Ê>È>Ž>Ã̈i̜BÀ‡ iÃÌi“BÃÌB°Ê -œÃˆ>>ˆ…Õœœ˜ÊœÃ>Ì>ÊÌB“BÊiˆÊÃi˜Êȍ>>˜Ê˜Bއ ÌBÊ̜`i˜˜BŽŸˆÃiÌB°Ê՘ÊÜÈ>>ˆ…Õœœ˜Ê…œˆÌœˆ‡ “œˆÌÕÃ̈i̜>ÊÛiÀÀ>Ì>>˜Ê“Õˆ…ˆ˜Ê̈>Ã̜ˆ…ˆ˜]ʘBÞ̇ ÌBÛBÌÊ>È>Ž>Ï>Îi˜Ì>̈i`œÌʅÞۈ˜ÊŽ>ÌÌ>ۈÌ>°Ê-i˜Ê ȍ>>˜Ê…œˆÌœ«BˆÛˆi˜Ê“BBÀBʜ˜ÊœˆÃÃ>Žˆ˜Ê«>ÛiÕˆÃ‡ Ã>ʅœˆÌœˆ“œˆÌՎÈÃÃ>Ê>…>ˆÃi“«ˆÊŽÕˆ˜Ê“ÕˆÃÃ>Ê̈‡ >Ã̜ˆÃÃ>]ʓˆŽBÊۈˆÌÌ>>ÊȈ…i˜]ÊiÌÌBÊ«BBÌÌޘiˆ`i˜Ê …œˆÌœ>ŽÃœi˜Ê̈i`œÌÊiˆÛBÌÊi…ŽBʜˆÃˆŽ>>˜Ê˜ˆˆ˜Ê Ž>ÌÌ>ۈ>°Ê/B“Bʍœ…ÌÕÕÊ̜`i˜˜BŽŸˆÃiÃ̈ÊȈÌB]ÊiÌÌBÊ ÃœÃˆ>>ˆ…ÕœœÃÃ>ʎ>ŽÃˆÊŽœ“>ÜÃ>>ʅœˆÌœˆ“œˆ‡ ÌՎÈÃÌ>ÊÌi…`BB˜ÊiÀˆBB˜Ê>È>Ž>Ã̈i̜BÀiÃÌi“BÇ ÌBʍœŽœÊÌ>i˜Ì>i˜Ê˜iÊ"‡* ‡œ…i“>>˜ÊÌ>ˆÊ «>«iÀˆi°Ê/BBÊÌ>Û>>ÊÌi…ÌޘBÊ>È>Ž>Ï>Îi˜Ì>Ê œ˜Ê̜`i˜˜BŽŸˆÃiÃ̈ʅi«œ“«ˆÊ>ˆ˜iˆÃ̜ʎœœÌ>ʎՈ˜Ê «BBÌÌޘiiÌʅœˆÌœ>ŽÃœÌ°Ê -œÃˆ>>ˆ…ÕœœÃÃ>ʅœˆÌœˆ“œˆÌÕÃ>ˆ˜iˆÃ̜>ʜ˜Ê ŽÕˆÌi˜Žˆ˜ÊŽiÀBÌÌÞÊÛ>ÃÌ>ÊÛ>>>ʎޓ“i˜i˜ÊÛ՜ÌÌ>]Ê œÌi˜Ê Û>ÀȘŽˆ˜Ê Ì>ÀŽ>ÃÌiÕ>ŽÃœ˜Ê >ŽÕ«Õœi>Ê Ž>ÌÌ>ÛÕÕÃʜ˜ÊœÕÌʘގވÃÌBʅiˆŽœ“«>>°Ê-ˆÌi˜Ê Ì>ÀŽ>ÃÌiÕ>ŽÃœ˜Ê>ŽÕÛ՜Ș>ʘBŽÞÛBÊÃÕÕÀi“«ˆÊ iÀœÊ>Îi˜Ì>̈i̜i˜Ê>ʎ>ˆŽŽˆi˜Ê­“ÞŸÃʓޟ…i“‡ «ˆi˜®Êˆ“œˆÌÕÃÌi˜Ê«iÀÕÃÌii>ʏ>ÎiÌՈÃÃ>Ê«ˆÌŽB>ˆ‡ Ž>ˆÃ>È>ŽŽ>ˆ`i˜Ê“BBÀˆÃÃBÊÃiˆÌÌÞÞʘˆ“i˜œ“>>˜Ê Ž>ÌÌ>ÛÕÕ`i˜ÊÛ՜ÈÌÌ>ˆÃi>Ê«>À>˜i“ˆÃi>°Ê

6)2(%%49+3)44¯)3)33¯(/)4/),-/)45+3)33! ޟÃÊۈÀ…iiÌÊގÈÌÌBˆÃˆÃÃBʅœˆÌœˆ“œˆÌՎÈÃÃ>Êۜˆ‡ Û>Ìʜ>ÊÃÞޘBʜÃ>>˜ÊÕÛՈÃÃ>ʘBŽÞۈÃÌBÊiÀœˆÃÌ>°Ê œÃŽÕÃÊ«ˆÌŽB>ˆŽ>ˆÃ…œˆ`œ˜Ê«BB̟ÃÊۜˆÊœ>ÊÌi…ÌÞ]Ê “ÕÌÌ>ÊÈÌBÊiˆÊ…Õœ“>Ì>ʎˆÀ>Ì>ʅœˆÌœˆ“œˆÌÕÃ̈i̜ˆ‡ …ˆ˜°Ê ÀˆÌވÃiÃ̈ÊÜÈ>>ˆ…ÕœœÃÃ>]ʎ՘Ê̈i`œÌʎiÀB‡ ÌBB˜Ê«BB>È>ÃÃ>ʎBȘ]ÊÌB>ˆÃÌ>Ê̈i̜>ÊiˆÊÛBÌÌB‡ “BÌÌBʏB…`iÌBÊÌ>ÀŽ>ÃÌ>“>>˜]ÊÛ>>˜ÊŽœ…Ì>ʍBÌiÌBB˜Ê ÌޅBŽÃˆ°Ê ˆÃBŽÃˆÊÌՏœ‡Ê>ʏB…ÌŸ«BˆÛBÌÊۜˆÛ>Ìʜ>ʍœÃÃ>ˆ˜Ê ˆ“œˆÌՎÈÃÃ>ÊÛBBÀˆ˜°Ê՘ʅœˆÌœˆ“œˆÌÕÃ>ˆ˜iˆÃ̜>Ê -Ì>ŽiÈÃÃ>ʎœœÌ>>˜]ÊÃ>“>˜ÊÛ՜`i˜Êˆ“œˆÌՎÈÃÌ>Ê Ì>ÀŽ>ÃÌiÌ>>˜]ÊiÌÌBÊ«BBiŽŽBˆÃˆBʅœˆÌœ>ŽÃœ>ÊiˆÊ œi°Ê ÀˆÊÛ՜Èi˜Êˆ“œˆÌՎÈ>ÊiˆÊŽÕˆÌi˜Ž>>˜ÊÛiÀÀ>‡ Ì>]ʍœœˆ˜Ê“ÞŸ…i““BÃÃBʈ“œˆÌՎÃiÃÃ>ÊۜˆÊœ>Ê ÀˆÃ̈ÀˆˆÌ>ˆ˜i˜Ê̈i̜ÊÌՏœ«BˆÛBÃÌB°Ê Bˆ˜Êœi˜Ê“ÞŸ‡ …i““B˜Êˆ“œˆÌՎÃi˜Ê“ÕŽ>>˜Ê>È>ŽŽ>>ÃÌ>ÊÌՏiiÊ «ˆÌŽB>ˆŽ>ˆÃ>È>Ž>Ã]ÊÛ>ˆŽŽ>Ê>ˆŽ>ˆÃi“«ˆÊ­>È>Ž>Ç >Îi˜Ì>®ˆ“œˆÌÕÃÊiˆÊÈÌBʘBÞÌB°Ê"˜Ê“>…`œÌœ˜Ì>Ê Ã>˜œ>]ʜÛ>̎œÊ>ˆŽ>ˆÃi““ˆ˜ÊÛ>ˆÊ“ÞŸ…i““ˆ˜ÊÌՏ‡ ii˜Êˆ“œˆÌՎÃi˜ÊÌՏœ«BˆÛB̈i`œÌʜˆŽiˆ˜°Ê

0¯¯,,%++¯)3%4(/)4/*!+3/4 9ŽÃˆÊ…œˆÌœˆ“œˆÌÕÃÀiŽˆÃÌiÀˆÌˆi̜i˜Êœ˜}i“ˆÃÌ>Ê œÛ>ÌÊ«BBiŽŽBˆÃiÌʅœˆÌœ>ŽÃœÌ°ÊÞÃiÊiˆÊ«BB>È>Ç Ã>ʜiÊ>ˆ˜iˆÃ̜˜Ê«ÕÕÌÌiiˆÃÕÕ`iÃÌ>]ÊÛ>>˜ÊȈÌB]Ê

iÌÌBÊ>È>Ž>Ãʜ˜ÊœˆŽi>Ã̈ʎˆÀœˆ>ʎ>…`iÃÃ>Ê«>ˆŽ>Ç Ã>°Ê/B“Bʏˆi˜iiÊÃÕÕÀˆ““>ŽÃˆÊœÃ>ŽÃˆÊÃiÕÀ>ÕÃÌ>Ê ÃˆˆÌB]ÊiÌÌBʏޅÞi…ŽŸi˜ÊÃ>ˆÀ>>>ŽBޘ̈i˜Ê>>ŽÃˆÊiˆÊ œŽ>Ê«>ˆŽ>ÃÃ>ÊÛBÌÌB“BÌÌBÊÌi…`BÊՏœÃŽˆÀ>ÕÃÌ>Ê Û>˜…>ˆ˜Žœ`ˆÃÌ>°Ê *>ÛiÕ>ÃՓˆÃiÃÃ>Ê >È>ŽŽ>>˜Ê >ÃՓˆ˜i˜Ê«iÀÕÃÌÕÕÊÛ՜ŽÀ>Ü«ˆ“ÕŽÃii˜]ʍœÌi˜Ê …B˜ÌBÊiˆÊ̈iÌi˜ŽBB˜ÊŽˆÀœˆÌiÌ>ÊՏœÃÊiȓiÀŽˆŽÃˆÊ Ã>ˆÀ>>>ŽBޘ˜ˆ˜Ê>>ŽÃˆ°ÊœˆÌœˆ“œˆÌÕÃÌi˜Ê“Õ‡ Ž>>˜Ê>È>ŽŽ>ˆÃÌ>ʘœˆ˜Ê«ÕœˆÊ«ÀœÃi˜Ì̈>ʜˆÊ>Îi˜‡ Ì>«BˆÛB˜BʎˆÀœˆ>ÊÃiŽBÊÃ>ˆÀ>>>˜ÊÌ>ˆÊÌiÀÛiÞÎiÇ ŽÕŽÃi˜ÊÛ՜`iœÃ>Ã̜>ÊiÌÌBÊÛ>˜…>ˆ˜Žœ`ˆÃÃ>ÊÌ>ˆÊ Û>ÃÌ>>Û>ÃÃ>°ÊœÃŽ>Ê«BBiŽŽBˆÃÞÞÃʜ˜Ê˜Bˆ˜Ê«ˆi˜‡ ÌB]ÊiˆÊÈÌBÊۜˆÊ«ˆÌBBÊiÀˆÌވÃi˜ÊÛ>Ž>Û>˜>Ê«ÕÕÌÌii˜>Ê iÌi˜Žˆ˜ÊŽÕ˜Ê>È>˜Ê̈i`œÃÌ>“ˆÃi˜ÊBŽii˜ÊÃiʜ˜Ê …i«œ…ŽœÊŽœÀ>Ì>ʎBÞÌiÌÌBiÃÃBʅœˆÌœˆ“œˆÌÕÃ>ˆ‡ ˜iˆÃ̜>ÊÌÕ̎ˆ“ÕÃÌ>ÀŽœˆÌՎȈ˜°

2!*/)445-).%.0)4+¯!)+!)30¯¯4¾+3%,,)3)). (/)4/),-/)4534)%4/)(). 6>ˆŽŽ>Êi`iBʜ˜Ê«œ…`ˆÌÌÕÊiÀBˆÌBʅœˆÌœˆ“œˆÌÕÃÀi‡ ŽˆÃÌiÀˆˆ˜ÊˆˆÌÌÞۈBÊ«œÌi˜Ìˆ>>ˆÃˆ>ʜ˜}i“ˆ>]ʜ˜ÊÌBÀ‡ ŽiBBÊ «ˆÌBBÊ “ˆiiÃÃB]Ê iÌÌBÊ ÌBÃÃBÊ ÌÕ̎ˆ“ÕŽÃiÃÃ>Ê ŽBÞÌiÌÌÞÊ>ˆ˜iˆÃ̜ÊiˆÊÛ>ÀȘ>ˆÃiÃ̈Ê̜`ˆÃÌ>ʓˆÌBB˜Ê …œˆÌœˆ“œˆÌÕÃÀiŽˆÃÌiÀˆ˜ÊŽ>ÌÌ>ÛÕÕ`iÃÌ>]ÊȏBÊ«iŽ‡ Žˆi˜Ê«ˆÌŽB>ˆŽ>ˆÃÕÕ`i˜ÊŽÀˆÌiiÀˆÌÊÌBÞÌÌBۈi˜Ê…œˆÌœ‡ >ŽÃœi˜ÊŽBÞÌ̟Ê>ˆ…iÕÌÌ>>ʍœÊȘB˜ÃBʜ˜}i“ˆ>°Ê œÃÊ«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>>˜Ê…œˆÌœ>ŽÃœÌÊiȓiÀŽˆŽÃˆÊ Ã>ˆÀ>>>ŽBޘ˜ˆ˜ÊÌ>Žˆ>ʎiÎiÞÌÞÛBÌÊ>iʙäÊÛ՜Àœ‡ Ž>Õ`iÃÃ>]Ê>È>Ž>Ãʈ“>˜Ê“iÀŽˆ˜ÌBBÊ«ˆÌŽB>ˆŽ>ˆÃ‡ …œˆ`œ˜Ê«BB̟ŽÃiÃÌBÊiˆÊÌՏiʏ>ˆ˜Ž>>˜Ê“ÕŽ>>˜Ê«ˆÌ‡ ŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜Ê«œˆ“ˆ˜Ì>>˜°Ê/B“Bʜ˜}i“>Ê œ˜Êœi“>ÃÃ>ÊÀˆˆ««Õ“>ÌÌ>ÊȈÌB]ʎBÞÌiÌBB˜ŽŸÊ«ˆÌ‡ ŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜Ê“BBÀB˜Ê>Îi“ˆÃii˜Ê«iŽBÇ ÌBB˜Êޅ`i˜ÊÛ՜`i˜Ê>È>Ž>Ï>Îi˜Ì>̈i̜>ÊÛ>ˆÊ “ÞŸÃʓޟ…i“«ˆBʅœˆÌœˆ“œˆÌՎÈ>°Ê*ˆÌŽB>ˆŽ>ˆ‡ ÃÕÕ`i˜ÊŽÀˆÌiiÀiˆBÊÀ>>ÌՏ>Ê>ˆ˜iˆÃ̜>ÊiˆÊ«ÞÃÌÞÌBÊ “ÞŸÃŽBB˜ÊŽÕÛ>>“>>˜ÊŽ>ÌÌ>Û>Ã̈Ê>È>ŽŽ>>˜ÊŽœŽœÊ …œˆÌœŽi̍Õ>]ÊȏBÊ>È>ŽŽ>>˜Ê…œˆÌœ>ŽÃœÊÃ>>ÌÌ>>Ê Ìœ`iˆÃÕÕ`iÃÃ>ʜ>ÊÀiŽˆÃÌiÀˆÃÃB]ʓÕÌÌ>ÊiˆÊÌՏiÊ “ÕŽ>>˜Ê>ˆ˜iˆÃ̜œ˜]ʍœÃÊÃiÊiˆÊÌBÞÌBÊ«ˆÌŽB>ˆŽ>ˆÃÕՇ `i˜ÊŽÀˆÌiiÀiB°Ê BÞÌiÌÌBiÃÃBÊÃiŽBÊ«œˆÃ̜ˆ“œˆÌÕÇÊiÌÌBʏ>Îi˜‡ Ì>̈i̜>ʜˆÃˆÊ>È>ŽŽ>ˆ`i˜ÊÕŽÕ“BBÀBÌÊ«iÀˆ>>̇ ÌiiÃÃ>ʓ>…`œˆÃÌ>Ê«BBÌiBʍœŽ>ˆÃiiÊ«BˆÛBiÊ iÀˆŽÃii˜°Ê/B“BʎՈÌi˜Žˆ˜Êœ…Ì>>ʅ>˜Ž>ÕՎȈ˜]Ê ÃˆBÊÀ>>ÌÌ>iÃÃ>Ê>È>ŽŽ>>ÌÊÀiŽˆÃÌiÀˆÃÌBÊ«ˆÌŽB>ˆŽ>ˆÃ‡ «BB̟ŽÃi˜Ê«iÀÕÃÌii>]Ê«BˆÛˆÌÌBˆÃÌi˜Ê«œ«Õ>>̈œˆ‡ `i˜ÊŽÕÛ>>>ÃÌ>ÊÌՏiiÊ»Ã>…>“>ˆ˜i˜»ÊÈÌi˜]ÊiÌÌBÊ ŽÕ˜Žˆ˜ÊÛ՜`i˜Êۈˆ“iˆÃi˜Ê«BˆÛB˜Ê>ÊÃiÕÀ>>Û>˜Ê Û՜`i˜Êi˜Ãˆ““BˆÃi˜Ê«BˆÛB˜Ê«œ«Õ>>̈œÌÊiÀœ>Û>ÌÊ ÌœˆÃˆÃÌ>>˜Êœ«>ʎޓ“i˜i˜Ê«ÀœÃi˜Ì̈>°Ê Bˆ˜ÊÃÕÕÀˆÊ «BˆÛB“ÕÕ̜ÃÊiˆÊœiÊ̜`iˆÃÕÕ`iÃÃ>ÊÀi>ˆÃ̈˜i˜°Ê “ˆŸÊœ…ÌÕՎˆ˜ÊȈÌB]ÊiÌÌBʎ>ˆŽˆiÊ>È>ŽŽ>ˆiÊiˆÊ



Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

ŸÞ`Þʏ>Îi˜˜>˜ÊBŽii˜Ê…œˆ`œ˜Ê«BBÌÌޓˆÃˆ“œˆ‡ ÌÕÃÌ>ÊÌ>ˆÊÃiÕÀ>>Û>˜ÊÛ՜`i˜Ê>Îi˜Ì>̈i̜>]ʍœŽ>Ê ÌBÞÌÌBˆÃˆÊ«ˆÌŽB>ˆŽ>ˆÃ…œˆ`œ˜ÊŽÀˆÌiiÀˆÌ°Ê/œˆÃˆ˜ÊÃ>‡ ˜œi˜ÊȈÃʜÃ>iÊ>È>ŽŽ>ˆÃÌ>ÊiˆÊŸÞ`ÞʍœŽ>ÊÛ՜`iiÊ «ˆÌŽB>ˆŽ>ˆÃ…œˆ`œ˜ÊŽÀˆÌiiÀˆÌÊÌBÞÌÌBÛBBʏ>Îi˜Ì>̈i‡ ̜>ʍ>Ê̜ˆÃ>>Ì>ʜÃ>iÊiˆÊ>Îi˜Ì>̈i`œ˜ÊBŽii˜Ê ŸÞ`ÞʎÀˆÌiiÀˆÌÊÌBÞÌÌBÛBBʅœˆ`œ˜Ê«BBÌÌޓˆÃˆ“œˆ‡ ÌÕÃÌ>°Ê->“>˜>ˆ˜i˜Êˆ“ˆŸÊœ˜Ê…>Û>ˆÌÌÕÊ>ˆŽ>ˆÃi“‡ “ˆ˜Ê“ÞŸÃÊΈÌÜvÀi˜ˆ>`ˆ>}˜œœÃˆ>ÊÀ>>ÌՏiÊ>È>‡ Ž>͜ՎœiÊ­-՘`ÊÓäää]ÊxÓqxÇ®°Ê

Ži˜Ì>̈i`œÌʜˆÃˆÛ>ÌʓՎ>˜>ÊÌ>ÀŽ>ÃÌiÕˆÃÃ>ÊÀˆˆ«‡ «Õ“>ÌÌ>ÊȈÌBÊÌBÞÌÌBÛB̎ŸÊ˜iÊ«ˆÌŽB>ˆŽ>ˆÃ…œˆ`œ˜Ê ŽÀˆÌiiÀˆÌÊÛ>ˆÊiˆÛBÌ°Ê/B>ˆÃi˜Ê>ˆ˜iˆÃ̜˜Ê>ÛՏ>ÊÃ>>‡ Ì>ˆÃˆˆ˜ÊˆÃB̈i̜>ʓޟÃÊȈÌB]ʈ“œˆÌiÌ>>˜ŽœÊB…ÌŸ‡Ê >ÊÌՏœ«BˆÛBÌʜˆŽi>˜ÃÕ՘Ì>ˆÃiÃ̈°Ê `iii˜ÊÌB>ˆ‡ Ãi˜ÊŽœŽœ˜>ˆÃ>ˆ˜iˆÃ̜˜Ê>ÛՏ>ʜˆÃˆÊ“>…`œˆÃÌ>Ê >Îi>Ê>È>ŽŽ>>˜ÊŽœŽœÊ>ˆÌœÃ…œˆ`œ˜ÊŽiÃ̜]ʍœŽ>Ê œ˜Ê…œˆÌœˆ“œˆÌÕÃÀiŽˆÃÌiÀˆ˜Ê»œ}ˆˆŽ>˜»Ê­rÊ̈i`œÌÊ œÛ>Ìʍ>ŽÃœŽœ…Ì>ˆÃˆ>]ʍœÌi˜ÊiȓiÀŽˆŽÃˆÊÃ>ˆÀ>>>‡ ŽBޘ̈ʎ>̎>ˆÃiiÊÛ>˜…>ˆ˜ŽœÌˆ…œˆÌœ>ŽÃœ˜®ÊÌ>Žˆ>Ê “ÕÕÌi˜Êœ˜}i“>ˆÃÌ>°Ê

0)4+¯!)+!)3!3)!++55$%.-¯¯2)4%,-¯33¯+%()4%44¯6¯¯ ,iŽˆÃÌiÀˆ«œˆ“ˆ˜Ìœ>ÊÛ>ÀÌi˜Ê“BBÀˆÌi“B˜Êœ˜ÊœŽ>Ê Ì>«>ՎÃiÃÃ>ʜÌ>Û>ÊގÈÃiˆÌÌiˆ˜i˜]ʍœœˆ˜ÊȈÌBÊ ÌՏiiʏ՜˜ÌiiÌ>>˜Ê…i«œÃ̈ʅÞۈ˜ÊÌiŽ˜ˆ˜i˜°Ê އ ŽÞˆ˜i˜Ê «œˆ“ˆ˜Ì>Ì>«>Ê œ…Ì>>Ê Ãˆˆ…i˜]Ê iÌÌBÊ iiˆÊ >È>ŽŽ>ˆiʜiÊÌi…ÌÞÊ«ˆÌŽB>ˆŽ>ˆÃ…œˆ`œ˜Ê«BB̟ÃÌB]Ê …iÊÌՏiÛ>ÌʓՎ>>˜Ê«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜Ê“BB‡ ÀBB˜ÊÛ>ˆ˜]ʍœÃʅiʜÛ>ÌʜiiÌÊޅ`iÃÃBÊ«>ˆŽ>ÃÃ>ÊޏˆÊ Žœ“iʎÕՎ>ÕÌÌ>°ÊBÞÌB˜˜ŸÃÃBÊ>È>ŽŽ>ˆ`i˜Ê«>‡ ÛiÕŽi̍ÕÌʎՈÌi˜Žˆ˜ÊiÀœ>Û>Ì\ʍœÌŽÕÌÊ>È>ŽŽ>>ÌÊ œÛ>ÌÊޅ`iÃÃBÊ«>ˆŽ>ÃÃ>ÊÛ՜È>ʍ>ʍœÌŽÕÌÊÌ>>ÃÊÛ>ˆ…‡ Ì>Û>ÌʅœˆÌœ«>ˆŽŽ>>ÊÕÃi>““ˆ˜°Ê ˆÃBŽÃˆÊ«ˆÌŽB>ˆŽ>ˆÃ«BB̟ÃÊۜˆ`>>˜ÊÌi…`BÊiÀˆÊ ŽÕ˜˜ˆÃÃ>]ÊiÀˆÊ>ˆÌœÃÌÞÞ«iˆÃÃBʍ>ÊiÀˆ>ˆÃˆiÊ>È>Ž‡ Ž>ˆiÊiÀˆÊÌ>Û>>°Ê ȓiÀŽˆŽÃˆÊÌBÃÃBÊ>ˆ˜iˆÃ̜ÃÃ>Ê …>Û>ˆÌ̈ˆ˜]ÊiÌÌBÊÛ>˜…i““ˆÃÃ>ʈŽBÀޅ“ˆÃÃBʏ>Îi˜‡ Ì>‡>ˆ˜iˆÃ̜i˜ÊŽ>ÌÌ>ÛÕÕÃÊ«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜Ê “BBÀB˜ÊŽÕÛ>>>˜>ʜˆÊ«>Ài“«ˆÊŽÕˆ˜Ê˜ÕœÀi““ˆÃ‡ Ã>]ÊÃ>“œˆ˜Ê˜>ˆÃˆ>ʎ>ÌÌ>ÛÕÕÃʜˆÊ«>Ài“«ˆÊŽÕˆ˜Ê “ˆi…ˆB°Ê6>˜…i““ˆiÊ>È>ŽŽ>ˆiʍ>ʘ>ˆÃˆiÊ«BB‡ ̟ÃʍœŽœÊÌi…`BB˜Ê˜œ«i>““ˆ˜ÊÌ>ˆÊÃiʜ˜Ê«>Ài““ˆ˜Ê ̈i`œÃÃ>]ʍœœˆ˜ÊÃiÊÌՏiiÊÛ>À“i““ˆ˜Ê“ÕŽ>>˜Ê “ÞŸÃʏ>Îi˜Ì>̈i̜ˆ…ˆ˜°ÊœÃŽ>Ê«ˆÌŽB>ˆŽ>ˆÃii˜Ê>ˆ‡ ̜ŜˆÌœœ˜ÊȈÀÀÞÌBB˜Ê˜œˆ˜ÊnӇÛ՜̈>>˜>]ʜÛ>ÌÊ iȓiÀŽˆŽÃˆÊnxÊÛ՜ÌÌ>ÊÌBÞÌÌB˜iiÌʜiiÌʎiΈ“BB‡ Àˆ˜Ê>ˆÌœÃ…œˆ`œÃÃ>ʍœÊ“ÕÕÌ>“>˜ÊÛ՜`i˜°Ê*BB̟ÃÊ œ˜ÊÌBŸˆ˜ÊÌi…ÌÞʍœÊŽ>Õ>˜ÊÈÌÌi˜Ê>ʓޟÃʏ>Îi˜Ì>‡ ̈i̜ˆ…ˆ˜Ê̈i̜ʓiÀŽˆÌBB˜Ê̜`i˜˜BŽŸˆÃi““ˆ˜ÊŽÕˆ˜Ê ÕÕ`i““ˆiÊ>È>ŽŽ>ˆi°Ê-Վի՜Ìi˜ÊÛBˆÃÌBÊiÀœ>Ê œ˜ÊÛ>ˆŽi>“«ˆÊÃiˆÌÌBB]ʓÕÌÌ>ÊȈ˜BÊۜˆÊœ>ʎÞÃiÊ “ˆiÃÌi˜Ê>ʘ>ˆÃÌi˜ÊiÀˆ>ˆÃiÃÌ>Ê«>ÛiÕi˜ÊŽBÞ̟ÃÌBÊ ­ˆŽBB˜ÌޘiˆÃÌBʘ>ˆÃˆÃÌ>ʜ˜Ê>ˆÌœÃ…œˆ`œÃÃ>ʈܓ«ˆÊ œÃ>ʎՈ˜Ê“ˆi…ˆÃÌB®°Ê >Û>ˆÌÌՍi˜Ê«œÌi˜Ìˆ>>ˆÃÌi˜Ê«ÕÕÌÌiˆ`i˜ÊÃވ`i˜Ê Ì>ÀŽi“«ˆÊÃiÛˆÌÌB“ˆ˜i˜ÊœˆÃˆŽˆ˜ÊŽˆˆ˜ÌœˆÃ>ʏˆÃBÌ>À‡ Ž>ÃÌiÕi˜ÊŽœ…`i]ÊȏBÊÃiʎiÀ̜ˆÃˆÊÌ>ÀŽi““ˆ˜ÊÃiŽBÊ …œˆÌœˆ“œˆÌÕÃ>ˆ˜iˆÃ̜˜ÊÕœÌiÌÌ>ÛÕÕ`iÃÌ>ÊiÌÌBÊ«ˆÌ‡ ŽB>ˆŽ>ˆÃ…œˆ`œ˜ÊŽÀˆÌiiÀˆ˜ÊÌ>ÀŽœˆÌՎÃi˜“ÕŽ>ˆÃÕՇ `iÃÌ>°Ê/B“BʎՈÌi˜Žˆ˜ÊÛ>>̈ÈÊ>ˆ˜iˆÃ̜>]ʍœÃÃ>Ê «ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜ÊŽœ…œÀ̈˜ÊŽ>ˆŽŽˆÊގȏŸˆ˜‡ ̈Ì՘˜ÕŽÃi>ʈ`i˜ÌˆwœˆÌÕÛ>ÌʅœˆÌœ>ŽÃœÌʍ>ʏ>Ç

*/(4/0¯¯4¾+3%4 6>ˆŽŽ>Êޅ`i˜ÊÛ՜`i˜Ê>È>Ž>Ï>Îi˜Ì>̈i`œˆÃÌ>Ê >ÎiÌÌÕÊ«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜ÊÕŽÕ“BBÀBʜ˜Ê ÃiÛBÃ̈ʫˆi˜i“«ˆÊŽÕˆ˜Êȏœˆ˜]ʎ՘ʜÌiÌ>>˜Ê…Õœ‡ “ˆœœ˜Ê“ÞŸÃʓޟ…i“«ˆi˜Ê>È>Ž>Ï>Îi˜Ìœi˜Ê>Ê «BBÌÌޘiˆ`i˜Ê…œˆÌœ>ŽÃœi˜Ê̈i`œÌ]ÊiˆÊÃiÊ̜`ˆÃÌ>]Ê iÌÌBÊ>È>Ž>Ï>Îi˜˜œˆÃÌ>Ê«œˆ“ˆÌÌÕÊ«ˆÌŽB>ˆŽ>ˆÃ‡ >È>ŽŽ>ˆ`i˜Ê“BBÀBʜˆÃˆÊÃiÛBÃ̈Ê̜`iˆÃÌ>Ê«ˆÌŽB‡ >ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜Ê“BBÀBBÊ«ˆi˜i“«ˆ°Êޟ…i“‡ «ˆi˜Êˆ“œˆÌÕÃÌi˜Ê̈i`œÌÊiˆÛBÌÊ>Õ̜“>>Ì̈ÃiÃ̈ʜiÊ œˆŽiiˆÃi“«ˆ>Ê >Ê ŽœÃŽ>Ê «ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽÕÕÃÊ ˜BÞÌÌBBʓޟ…i“«ˆBʈ“œˆÌՎÈ>ʅޟ`ޘ˜iÌÌBiÃÃBÊ ÌՏiÛ>˜Êۜˆ“>>˜Ê»ˆˆ>˜Ê>ˆŽ>ˆÃˆ˜»]ÊۜˆÊ“ÞŸ…i“‡ «ˆi˜Êˆ“œˆÌÕÃÌi˜Ê>ÛՏ>ʏ>ÎiÌÌÕÊ«ˆÌŽB>ˆŽ>ˆÃ>È>Ž‡ Ž>ˆ`i˜Ê“BBÀBʜ>ʍœ«>ʏˆˆ>˜ÊˆÃœ°Ê6՜È̈>Ã̜>Ê >ÎiÌÌ>iÃÃ>ʜ˜Ê“ÕÕÌi˜Žˆ˜Ê«>Ài“«ˆÊŽBÞÌÌBBʍœŽ>Ê Û՜ÈÊÃ>“>>Ê«œˆ“ˆ˜Ì>Ì>«>>ʍ>ÊÃ>“>>Ê>ˆ˜iˆÃ̜>°Ê ՈÌi˜Žˆ˜ÊiȓiÀŽˆŽÃˆÊ>ÎiÌÌ>iÃÃ>ʏœ˜ŽŽ>‡ “ÕÀÌՓ>˜Êˆ“>>˜ÌÕÛÕÕÌÌ>Ê«ˆÌŽB>ˆŽ>ˆÃ…œˆ`œÃÃ>Ê œiۈi˜Ê…i˜ŽˆŸˆ`i˜ÊœÃ>Ì>ÊqÊÌ>ˆÊ>ÀۈœˆÌ>iÃÃ>Ê«ˆÌ‡ ŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜ÊÌ>ÀۈÌÃi“ˆ˜i˜ÊÛ՜`i«>ˆŽŽœ‡ i˜Ê“BBÀBBÊqʜ˜Ê>ˆ˜iˆÃ̜˜ÊŽÕÛ>ÌÌ>Û>ʓ>…`œˆ‡ ȓ“>˜ÊŽ>ÌÌ>Û>Ã̈ʫˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜Ê«œ«Õ‡ >>̈œÌ>°Ê/B>ˆÃÌ>ʎBÞÌ̟Ì>ÀŽœˆÌÕÃÌ>ÊÛ>ÀÌi˜ÊŽ>˜‡ ˜>ÌÌ>>ÊȈÃʅޟ`ޘÌBBʏ>Îi˜˜>ÃÃ>ʓޟÃʓޟ‡ …i“«ˆi˜Ê Û՜Èi˜Ê …œˆÌœˆ“œˆÌÕÃ̈i̜>°Ê œÃŽ>Ê Ì՜Àiˆ““>iÊÛ՜`iiÊiˆÊŽÕˆÌi˜Ž>>˜ÊœiʎBÞ̟ÃÃBÊ “ÞŸ…i“«ˆBʈ“œˆÌՎÈ>]ʓ՜`œÃÌiÌ̈ˆ˜ÊÌBÃÃBÊÌÕ̇ Žˆ“ÕŽÃiÃÃ>ʓޟÃÊ̈>Ã̜ˆ˜i˜Ê“>ˆ]ʍœ˜Ž>Ê>ÛՏ>Ê œˆÊ “>…`œˆÃÌ>Ê i˜˜ÕÃÌ>>Ê “ÞŸÃÊ Ì՜Àiˆ““>iÊ Û՜`iiʓՈ`i˜ÊÛ՜Èi˜ÊŽ>˜ÃÃ>ÊÛiÀÌ>ˆÕŽi«œˆ‡ ˜i˜Ê«ˆÌŽB>ˆŽ>ˆÃ>È>ŽŽ>ˆ`i˜ÊÕŽÕ“BBÀB° œ««Õi˜Êœ«ÕŽÃˆÊÀiŽˆÃÌiÀˆÌˆi̜>ʎBÞÌiÌÌBiÃÃBÊ œ˜ÊȈÃÊ>ˆ˜>ʜÌiÌÌ>Û>ʅ՜“ˆœœ˜]ʓˆÌi˜Ê>ʓˆ>ˆ‡ ÈÃÌ>ʏB…ÌiˆÃÌBʘiʜ˜ÊŽœœÌÌհʈ˜iˆÃ̜i˜ÊŽBÞÌÌBB˜Ê œ˜ÊBÀŽiۈi˜ÊÌՏŽˆ˜Ìœi˜Ê“>…`œˆÃÌ>“ˆÃiŽÃˆÊœŽ>Ê Ì>«>ՎÃiÃÃ>ÊÃiÛˆÌiÌÌBÛBʈÌÃiii˜]ÊiÌÌBÊ>ˆ˜iˆÃÌœÊ ŽÕÛ>>ÊÛ>À“>Ã̈ÊÈÌBÊ>È>>]ʍœÌ>ʅB˜ÊœiÌÌ>>ÊÃi˜Ê ŽÕÛ>>Û>˜Ê­-՘`ÊÓääÎ]Ê-՘`ʍ>Ê Þ>˜`iÀʍ>Ê*>œ‡ ˜i˜ÊÓää{®°Ê



Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

3UND2 +AUPPINEN3(OWTODETERMINETHENUMBEROFOLDERPEOPLEINLONG TERMCAREUSINGREGISTER BASEDDATA 3OSIAALILËËKETIETEELLINENAIKAKAUSLEHTIn*OURNALOF3OCIAL-EDICINEn /…ˆÃÊÃÌÕ`ÞÊÀi«œÀÌÃÊ̅iÊ>}iÊ>˜`ÊÃiÝÊëiVˆwVÊ VœÕ˜ÌÃʜvʜ`iÀÊ«iœ«iʈ˜Êœ˜}‡ÌiÀ“ÊV>Àiʈ˜Ê ˆ˜>˜`ÊvÀœ“Ê£™™ÇqÓääÓ]ÊL>Ãi`ʜ˜ÊÌܜÊ`ˆv‡ viÀi˜ÌÊV>VՏ>̈œ˜ÊÌiV…˜ˆµÕiðÊÌÊÌÕÀ˜ÃʜÕÌÊ Ì…>ÌÊ܅i˜ÊÕȘ}ʜ˜ÞÊ̅iÊȘ}iÊÞi>ÀÊVi˜ÃÕÃÊ `>Ì>]Ê̅iʘՓLiÀʜvʏœ˜}‡ÌiÀ“ÊVˆi˜ÌÃÊ>«‡

+)2*!,,)3553

ŽBB˜Ìޘiˆ`i˜ÊÜÈ>>ˆ‡Ê>ÊÌiÀÛiÞë>ÛiÕÌÊÓääÓ°Ê -՜“i˜Ê6ˆÀ>ˆ˜i˜Ê̈>Ã̜]Ê-œÃˆ>>ˆÌÕÀÛ>ÊÓääÎ\£°Ê -Ì>ŽiÃ]ÊiÃˆ˜Žˆ° -՘`Ê,°Ê/ˆ>Ã̜ˆÃˆ>ʓi˜iÌi“ˆBÊ`ޘ>>“ˆÃÌi˜Ê «œÌˆ>뜫Տ>>̈œˆ`i˜Ê“>ˆ˜Ì>“ˆÃii˜°Ê />«>…ÌՓ>…ˆÃ̜Àˆ>‡>˜>ÞÞÈ>Ê …œˆÌœˆ“œˆÌÕÃÀiŽˆÃÌiÀˆ˜ÊΈÌÜvÀii˜ˆŽœˆi°Êˆ…iˆÌ>Ê ÓÈÉÓäää°Ê-Ì>ŽiÃ]ÊiÃˆ˜Žˆ° -՘`Ê,°Ê1̈ˆÃ>̈œ˜ÊœvÊ>`“ˆ˜ˆÃÌÀ>̈ÛiÊÀi}ˆÃÌiÀÃÊÕȘ}Ê ÃVˆi˜ÌˆwVʎ˜œÜi`}iÊ`ˆÃVœÛiÀްʘÌiˆ}i˜ÌÊ >Ì>Ê ˜>ÞÈÃÊÓääÎ\Ç\È\xä£qx£™° -՘`Ê,]Ê Þ>˜`iÀÊ"]Ê*>œ˜i˜Ê/°Ê,>>½>ÃÌ>Ê ÀiŽˆÃÌiÀˆ>ˆ˜iˆÃ̜ÃÌ>ÊÌiÀÛiÞ뜏ˆˆÌ̈ÃiÃ̈ÊÀiiÛ>˜Ì̈ˆ˜Ê ˆ˜vœÀ“>>̈œœ˜°Ê9…ÌiˆÃŽÕ˜Ì>«œˆÌˆˆŽŽ>Ê Óää{\ș\{\ÎÇÓqÎǙ°

«i>ÀÊ̜ÊLiÊviÜiÀÊ̅>˜Ê܅i˜ÊÕȘ}Ê>``ˆÌˆœ˜>Ê ˆ˜vœÀ“>̈œ˜ÊvÀœ“Ê̅iÊ`ˆÃV…>À}iÊ>LÃÌÀ>VÌÃ°Ê iˆÌ…iÀʜvÊ̅iÃiÊÌiV…˜ˆµÕiÃʈÃÊLiÌÌiÀÊ̅>˜Ê ̅iʜ̅iÀ]Ê>˜`Ê̅iʈ˜Ìi˜`i`ÊÕÃiʜvÊ`>Ì>Ê`i‡ ÌiÀ“ˆ˜iÃÊ܅ˆV…ʈÃʓœÀiÊÃՈÌ>Li°

, "Ê-1



6/]Ê̈>Ã̜̈iÌiˆˆB 6>ˆŽÕÌÌ>ÛÕÕ`i˜Ê>ʜˆŽiÕ`i˜“ÕŽ>ˆÃÕÕ`i˜Ê ÌÕ̎ˆ“ÕÃÀޅ“B -Ì>ŽiÃ

-,Ê1**

]ÊiÀˆŽœˆÃÃÕ՘˜ˆÌÌiˆ> /ˆ>Ã̜̇Àޅ“B -Ì>ŽiÃ

-Ì>ŽiðÊ-œÃˆ>>ˆ…Õœœ˜Ê>ÊÌiÀÛiÞ`i˜…Õœœ˜Ê …œˆÌœˆ“œˆÌÕÃÊÓää{ÊqʓBBÀˆÌÌiÞÌʍ>ʜ…iˆÃÌÕÃ°Ê -Ì>ŽiÃ]ʜ…iˆÌ>ʍ>ʏ՜ŽˆÌՎÈ>ÊÓääÎ\ΰÊ-Ì>ŽiÃ]Ê iÃˆ˜Žˆ°



Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

V

Alkuperäistutkimus Reijo Sund

Lonkkamurtumien ilmaantuvuus Suomessa 1998–2002 Väestön ikääntyessä lonkkamurtumat ovat merkittävä terveydenhuollon haaste Suomessa. Lonkkamurtumien määrä on pitkän aikaa kasvanut suhteessa nopeammin kuin vastaava riskiväestö, mutta viime vuosina vakioidun ilmaantuvuuden kasvun on raportoitu tasaantuneen monissa maissa. Ikääntymiseen liittyvien lonkkamurtumien ilmaantuvuus ei kasvanut Suomessa vuosina 1998–2002. Talvisin murtumia tapahtui enemmän kuin muina aikoina, poikkeuksena pitkäaikaisessa laitoshoidossa olevat. Pitkäaikaisessa laitoshoidossa tapahtuneiden lonkkamurtumien ilmaantuvuudessa ei ollut sukupuolten välisiä eroja. Lonkkamurtumia ilmeni poikkeuksellisen paljon Helsingissä, mutta muuten sairaanhoitopiirien väliset ilmaantuvuuserot olivat varsin pieniä. Koska lonkkamurtuman ilmaantuvuus heijastaa väestön toimintakykyä, on vakioidun ilmaantuvuuden tasaantuminen selitettävissä väestön odotetun eliniän ja toimintakyvyn muutosten yhteisvaikutukset huomioon ottavalla yleisellä ikääntymisteorialla.

V

äestön ikääntyessä reisiluun yläosan mur‑ tumat eli lonkkamurtumat ovat merkittävä terveydenhuollon haaste Suomessa, koska lonkkamurtuman ilmaantuvuus kasvaa ekspo‑ nentiaalisesti suhteessa ikään (Melton 1996, Brody ja Grant 2001). Lonkkamurtumien il‑ maantuvuudessa on suuria eroja eri maiden välillä (Kanis ym. 2002), ja lonkkamurtumien määrän on havaittu kasvavan nopeammin kuin riskiväestön (Kannus ym. 1999). Ikävakioidun ilmaantuvuuden kasvun on kuitenkin viime vuosina raportoitu tasaantuneen monissa mais‑ sa, ja syiden on arvailtu liittyvän murtumien onnistuneeseen ehkäisyyn tai ikääntyneiden kohentuneeseen terveydentilaan (Rogmark ym. 1999, Boufous ym. 2004). Lonkkamurtumien ilmaantuvuudesta Suomes‑ sa ei ole julkaistu koko maan kattavia alueellisia tietoja ja koko maan tiedotkin ovat usean vuo‑ den takaa (Kannus ym. 1999, Lüthje ym. 2001). Lisäksi näiden rekisteritietoihin perustuvien

tutkimusten tulokset eivät ole yhdenmukaisia Keski‑Suomen tilannetta kuvaavan tutkimuk‑ sen kanssa; viimeksi mainitussa tutkimuksessa ei havaittu ikävakioidun ilmaantuvuuden kas‑ vua ajanjaksojen 1982–83 ja 1992–93 välillä (Huusko ym. 1999). Suomessa ei ole tutkittu lonkkamurtumien ilmaantuvuutta pitkäaikai‑ sessa laitoshoidossa eikä myöskään murtumiin liittyvää kausivaihtelua. Tässä tutkimuksessa tarkastellaan ikääntymiseen liittyvien lonkka‑ murtumien ilmaantuvuutta Suomessa vuosina 1998–2002 edellä mainittujen tiedon tarpeiden tyydyttämiseksi.

Aineisto ja menetelmät Stakesin ylläpitämästä valtakunnallisesta hoitoilmoitus‑ rekisteristä poimittiin vuosien 1998–2002 lonkkamurtu‑ matapaukset käyttäen rajauskriteerinä ICD‑10:n pää‑ tai sivudiagnooseja S72.0, S72.1 ja S72.2. Rekisteristä poi‑ mittiin henkilötunnusta käyttäen tiedot kaikista näiden potilaiden hoitojaksoista vuosilta 1987–2002. Hoitojakso

1085

Duodecim 2006;122:1085–91

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

luokiteltiin lonkkamurtumaan liittyväksi, jos sen pää‑ tai sivudiagnoosina oli lonkkamurtuma (ICD‑9: 820, ICD‑10: S72.0–2), ja vuosien 1998–2002 hoitojaksojen osalta edel‑ leen leikkaushoitojaksoksi, jos rekisteristä löytyi merkintä lonkkaleikkauksesta (Nomeskon toimenpideluokituksen suomalainen versio: NFB10‑99, NFJ42‑64). Saman hen‑ kilön lonkkamurtumaan liittyvien hoitojaksojen väliset erot laskettiin päivän tarkkuudella rekisteriin kirjattujen tulo‑ ja poistopäivien perusteella. Koska tässä tutkimuksessa ensisijaisena kiinnostuksen kohteena olivat ikääntymisen seurauksena ilmenevät lonk‑ kamurtumat, rajoituttiin tarkastelemaan 50 vuotta täyttä‑ neitä henkilöitä. Murtuman luokiteltiin tapahtuneen pit‑ käaikaisessa laitoshoidossa, jos murtuma rekisteritietojen perusteella tapahtui laitoshoidossa (henkilö tuli sairaalaan pitkäaikaista hoitoa antavasta laitoksesta) ja henkilö täytti murtuman tapahtumahetkellä väljennetyt pitkäaikaishoi‑ don kriteerit (yli kolme kuukautta yhtäjaksoista laitoshoi‑ toa tai yli kuusi kuukautta laitoshoitoa murtumaa edeltä‑ neen vuoden kuluessa). Vuosien 1998–2002 aikana tapah‑ tunutta lonkkamurtumaa pidettiin henkilön ensimmäisenä lonkkamurtumana, jos hänellä ei edellisen kymmenen vuo‑ den aikana ollut lonkkamurtumaan liittyviä hoitojaksoja. Näin määritelty lonkkamurtuman tapahtumahetki ilmaisee samalla myös sen, mihin mennessä ikääntymiseen liittyvät lonkkamurtuman riskitekijät ovat kullakin potilaalla vii‑ meistään ylittäneet kriittisen rajan, jos kyse on vähäener‑ giaisesta murtumasta. Riskiväestön laskemiseen käytettiin virallisia ikä‑ ja su‑ kupuoliryhmittäisiä väestötietoja. Pitkäaikaispotilaiden riskiväestötiedot muodostettiin rekisteritietojen avulla (ks. Sund ja Kauppinen 2005). Ilmaantuvuusluvut saatiin jaka‑ malla lonkkamurtumien lukumäärä vastaavalla riskiväes‑ tön henkilövuosina mitatulla seuranta‑ajalla. Vertailuihin sopivat vakioidut ilmaantuvuusluvut laskettiin suoraa va‑ kiointia käyttäen. Ne kertovat, mikä on kokonaisilmaan‑ tuvuus vakioväestössä, jos ryhmittäiset ilmaantuvuudet oletetaan samoiksi kuin tarkastelun kohteena olevassa vä‑ estössä. Vakioväestönä käytettiin Suomen keskimääräistä väestöä vuosina 1998–2002. Vakioidut ilmaantuvuusluvut laskettiin myös sairaanhoitopiireittäin (Helsingin ja Uuden‑ maan sairaanhoitopiiri jaettiin edelleen sairaanhoitoaluei‑ den mukaan neljään osaan). Puutteellisten tietojen takia Ahvenanmaa jätettiin pois tarkasteluista. Koska lonkka‑

murtumien sairaanhoitopiireittäiset määrät olivat varsin pieniä, käytettiin laskennassa empiiristä Bayes‑estimointia (Greenland 2000).

Tulokset Vuosien 1998–2002 aikana lonkkamurtuma‑ diagnoosillisia hoitojaksoja alkoi vuodessa noin 13 500 (taulukko 1). Alkaneita hoitojaksoja oli keskimäärin noin 7 200 potilaalla vuodessa. Re‑ kisteriin kirjattuja lonkkamurtumaleikkauksia tehtiin vuosien 1998–2002 aikana keskimäärin runsaat 6 000 vuodessa. Määrä on luultavas‑ ti murtumien todellista lukumäärää pienempi. Murtumaleikkauksen lisäksi uudeksi murtumak‑ si voidaan tulkita lonkkamurtumadiagnoosilli‑ nen hoitojakso, jos edeltäneestä lonkkamurtu‑ man hoitojaksosta on kulunut riittävän kauan. Käyttäen kriteerinä kahden vuoden eroa edelli‑ seen lonkkamurtumahoitojaksoon murtumien lukumääräksi saatiin keskimäärin noin 6 800 vuodessa (taulukko 1). Höllennettäessä aika‑ kriteeriä kahteen kuukauteen murtumien kes‑ kimääräiseksi lukumääräksi saatiin noin 7 200 vuodessa. Vaikka kliiniset kriteerit täyttävien murtumien lukumäärää ei rekisteritietoja käyt‑ täen ole mahdollista yksiselitteisesti laskea, vuo‑ sien 1998–2002 aikana näyttäisi tapahtuneen keskimäärin noin 7 000 lonkkamurtumaa vuo‑ dessa. Ensimmäisten lonkkamurtumien määrä oli vuosien 1998–2002 aikana keskimäärin noin 6 000 vuodessa (taulukko 1). Näistä runsaat 5 500 tapahtui vuosittain 50 vuotta täyttäneil‑ le. Kyseisistä lonkkamurtumapotilaista valta‑

TAulukko 1. Lonkkamurtumien määrät Suomessa vuosina 1998–2002.

Alkaneet hoitojaksot, joissa diagnoosina lonkkamurtuma merkintä lonkkaleikkauksesta merkintä lonkkaleikkauksesta tai vähintään 2 vuotta edellisestä hoitojaksosta, jossa diagnoosina lonkkamurtuma merkintä lonkkaleikkauksesta tai vähintään 2 kuukautta edellisestä hoitojaksosta, jossa diagnoosina lonkkamurtuma vähintään 10 vuotta edellisestä hoitojaksosta, jossa diagnoosina lonkkamurtuma kuten edellä, mutta vain 50 vuotta täyttäneet henkilöt

1086

1998

1999

2000

2001

2002

13 219 5 934

12 965 5 972

13 381 6 100

14 142 6 221

14 145 6 195

6 742

6 631

6 702

6 924

6 937

7 246

7 094

7 136

7 411

7 405

5 990 5 551

5 853 5 413

5 932 5 543

6 083 5 644

6 141 5 667

R. Sund

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

KuvA 1. Lonkkamurtumien päivittäiset määrät Suomessa 1998–2002.

osa (72 %) oli naisia. Noin 29 % murtumista tapahtui laitoshoidossa ja 19 % pitkäaikaisessa laitoshoidossa. Liukuvalla keskiarvolla tasoi‑ tetut lonkkamurtumien päivittäiset määrät on esitetty kuvassa 1. Lonkkamurtumia tapahtui keskimäärin 15,2 päivässä, ja havaittavissa oli pienehköä mutta selkeää kausivaihtelua: noin 54 % murtumista tapahtui marras–huhtikuus‑ sa ja noin 46 % touko–lokakuussa. Aineiston mukaan talvikauden aikana murtumia tapahtui muutamina päivinä poikkeuksellisen paljon. On myös huomion arvoista, että pitkäaikaispotilai‑ den murtumissa ei ollut havaittavissa kausivaih‑ telua (kuva 1). Ikä‑ ja sukupuolivakioitu ilmaantuvuus oli vuonna 1998 suurempi kuin muina vuosina, joiden aikana ilmaantuvuus pysyi varsin vakaa‑ na (taulukko 2). Tämä on merkki lonkkamur‑ tumien ilmaantuvuuden tasaantumisesta myös Suomessa. Vakioitu ilmaantuvuus oli naisilla suurempi kuin miehillä ja erityisen suuri pitkä‑ aikaisessa laitoshoidossa olleilla (taulukko 2). Pitkäaikaisessa laitoshoidossa olleiden miesten ja naisten ilmaantuvuusluvuissa ei kuitenkaan ollut merkitsevää eroa. Sairaanhoitopiireittäisissä ikä‑ ja sukupuoliva‑ kioiduissa ilmaantuvuuksissa ainoa muista sel‑

keästi poikkeava alue on Helsinki (taulukko 2). Muuten alueittaiset ilmaantuvuuserot ovat var‑ sin pienet. Keskimääräinen ilmaantuvuus vuosi‑ en 1998–2002 aikana oli suurinta Helsingissä ja Keski‑Suomessa ja pienintä Etelä‑Karjalassa, Ete‑ lä‑Pohjanmaalla ja Kainuussa (kuva 2). Etelä‑ ja Pohjois‑Savossa vakioitu ilmaantuvuus näyttäisi kasvaneen vuosien 1998–2002 aikana lineaarises‑ ti ja Kanta‑Hämeessä sekä Helsingissä taas pie‑ nentyneen varsin lineaarisesti (taulukko 2). Mui‑ den alueiden osalta ilmaantuvuuden trendi vuosi‑ na 1998–2002 oli käyräviivainen tai vakio.

Pohdinta Suomessa käytännössä kaikki lonkkamurtumat hoidetaan sairaalassa, joten on odotettavaa, että lonkkamurtumaan liittyvät hoitojaksot kirjau‑ tuvat valtakunnalliseen hoitoilmoitusrekisteriin. Lonkkamurtumapotilaiden kattava identifiointi rekisteristä on siis periaatteessa mahdollista sillä oletuksella, että tiedot on kirjattu kattavasti ja luotettavasti rekisteriin. Tuoreen validointitutki‑ muksen mukaan kattavuus on lonkkamurtuman osalta erittäin hyvä ja vain noin 2 %:iin kirur‑ gian erikoisalan hoitojaksoista on liitetty vir‑ heellisesti muu kuin lonkkamurtumadiagnoosi 1087

Lonkkamurtumien ilmaantuvuus Suomessa 1998–2002

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

21 Vakioitu ilmaantuvuus > 320 (2) 320–325 (5) 310–320 (5) 305–310 (6) < 305 (3)

20

18

19

17 16

13 15

6

4

11

10 5

3

12

14

7 8

9

2 1

KuvA 2. Lonkkamurtumien sairaanhoitopiireittäinen ikä‑ ja sukupuolivakioitu keskimääräinen ilmaantuvuus Suomessa 1998–2002 sataatuhatta henkilövuotta kohden (50 vuotta täyt‑ täneiden ensimmäiset murtumat).

(Sund ym., julkaisematon käsikirjoitus). Suurempia hankaluuksia kuin tavanomaiset kattavuus‑ ja luotettavuusongelmat aiheuttaa rekisteritietoihin perustuvissa ilmaantuvuus‑ laskelmissa lonkkamurtuman hoitoepisodin koostuminen useasta toisiinsa liittyvästä hoito‑ jaksosta, joista vain ensimmäinen liittyy uuteen lonkkamurtumaan (Langley ym. 2002). Myös pitkäaikaispotilaiden tunnistamiseen liittyy vas‑ taavankaltaisia ongelmia (Sund ja Kauppinen 2005). Tässä tutkimuksessa käytetyt määritel‑ mät perustuvat teoreettisten lähtökohtien lisäk‑ si lukuisiin alustaviin analyyseihin, ja herkkyys‑ analyysien perusteella tulokset ovat luotettavia. On kuitenkin huomattavaa, että tässä tutki‑ muksessa raportoidut lonkkamurtumien määrät 1088

ovat merkittävästi pienempiä kuin aikaisemmis‑ sa samaan rekisteriin perustuvissa tutkimuksissa (Kannus ym. 1999, Lüthje ym. 2001). Pääasial‑ linen syy raportoitujen lonkkamurtumien mää‑ rien eroon on lonkkamurtuman määritelmien erilaisuus. Aikaisemmista tutkimuksista poike‑ ten tässä tutkimuksessa tarkasteltiin kaikkien lonkkamurtumien sijaan ensimmäisiä lonkka‑ murtumia, koska ne pystytään poimimaan re‑ kisteristä huomattavasti yksiselitteisemmin kuin kaikki lonkkamurtumat. Ensimmäisiin lonkka‑ murtumiin perustuvissa ilmaantuvuuslaskelmis‑ sa myöskään saman henkilön toistuvien lonkka‑ murtumien välinen ilmeinen riippuvuus ei pääse vääristämään tuloksia. Vertailun vuoksi tässä tutkimuksessa arvioi‑ tiin lisäksi kaikkien lonkkamurtumien vuosittai‑ set lukumäärät. Tiedetään, että vuoden aikana alkaneiden lonkkamurtumadiagnoosillisten hoi‑ tojaksojen määrä ei vastaa lonkkamurtumien todellista määrää, koska lonkkamurtumapoti‑ laan hoitoepisodi on voinut alkaa edellisen vuo‑ den puolella, sillä se koostuu tyypillisesti useas‑ ta hoitojaksosta, ja samalla potilaalla voi olla useita murtumia saman vuoden kuluessa. Käy‑ tännössä eri hoitojaksoille kirjautuneita lonk‑ kamurtumaleikkauksia on mahdollista pitää merkkinä uudesta lonkkamurtumasta, vaikka pieni määrä uusintaleikkauksia voidaan tällöin virheellisesti tulkita uusiksi murtumiksi ja toi‑ saalta esimerkiksi konservatiivisesti hoidetut kiilautuneet murtumat ja sellaiset murtumat, joiden jälkeen potilas menehtyy ennen leikkaus‑ ta, jäävät havaitsematta. Lisäksi puutteet leikka‑ usten kirjaamisessa rekisteriin voivat aiheuttaa harhaa. Esimerkiksi vuonna 1998 lonkkamur‑ tumaleikkauksia ei muutamissa sairaaloissa kir‑ jattu lainkaan rekisteriin. Tässä tutkimuksessa saatu kaikkien lonkkamurtumien lukumäärä vastaa sellaisten tarkasteluvuonna alkaneiden lonkkamurtumadiagnoosillisten hoitojaksojen lukumäärää, joiden osalta oli merkintä lonkka‑ leikkauksesta tai joita edelsi kahden kuukauden tai kahden vuoden lonkkamurtumadiagnoosi‑ ton aika. Kliinisesti uutena pidettävien lonkka‑ murtumien lukumäärä on hyvin todennäköisesti edellisillä kriteereillä saatujen arvioiden osoitta‑ malla välillä. R. Sund

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

TAulukko 2. Lonkkamurtuman ilmaantuvuus Suomessa vuosina 1998–2002 sataatuhatta henkilövuotta kohden (50 vuotta täyttä‑ neiden ensimmäiset lonkkamurtumat).

50 vuotta täyttäneiden ensimmäiset lonkkamurtumat Ikä‑ ja sukupuolivakioitu ilmaantuvuus Miehet Ikävakioitu ilmaantuvuus Ilmaantuvuus 50–64‑vuotiailla Ilmaantuvuus 65–74‑vuotiailla Ilmaantuvuus 75–84‑vuotiailla Ilmaantuvuus 85 vuotta täyttäneillä Naiset Ikävakioitu ilmaantuvuus Ilmaantuvuus 50–64‑vuotiailla Ilmaantuvuus 65–74‑vuotiailla Ilmaantuvuus 75–84‑vuotiailla Ilmaantuvuus 85 vuotta täyttäneillä Pitkäaikaisessa laitoshoidossa olleet Ikä‑ ja sukupuolivakioitu ilmaantuvuus Muut kuin pitkäaikaisessa laitoshoidossa olleet Ikä‑ ja sukupuolivakioitu ilmaantuvuus Sairaanhoitopiireittäinen ikä‑ ja sukupuolivaikoitu ilmaantuvuus 3 – Varsinais‑Suomi 4 – Satakunta 5 – Kanta‑Häme 6 – Pirkanmaa 7 – Päijät‑Häme 8 – Kymenlaakso 9 – Etelä‑Karjala 10 – Etelä‑Savo 11 – Itä‑Savo 12 – Pohjois‑Karjala 13 – Pohjois‑Savo 14 – Keski‑Suomi 15 – Etelä‑Pohjanmaa 16 – Vaasa 17 – Keski‑Pohjanmaa 18 – Pohjois‑Pohjanmaa 19 – Kainuu 20 – Länsi‑Pohja 21 – Lappi 1a – Helsinki 1b – Jorvi‑Peijas 1c – Hyvinkää‑Porvoo 1d – Lohja‑Länsi‑Uusimaa

Näennäisesti samanlaisista määritelmistä huolimatta tämän tutkimuksen arviot lonkka‑ murtumien lukumääristä ovat selvästi pienempiä kuin aikaisemmissa tutkimuksissa raportoidut (Kannus ym. 1999, Lüthje ym. 2001). Esimer‑

1998

1999

2000

2001

2002

330

315

316

315

311

216 57 228 675 2 114

193 59 200 585 1 796

193 55 178 633 1 957

194 52 187 632 1 910

190 54 192 618 1 707

421 43 271 1 014 2 482

412 42 224 995 2 598

413 41 228 1 008 2 572

413 41 241 988 2 561

408 47 235 959 2 552

1 650

1 582

1 485

1 371

1 747

307

294

293

293

286

320 336 332 341 337 343 313 284 320 342 297 340 333 324 303 305 309 331 301 396 337 336 326

328 329 312 302 308 326 304 297 303 299 295 325 294 313 308 294 294 311 314 361 325 325 322

320 304 302 313 311 317 295 311 316 314 309 320 282 315 312 320 289 317 304 346 305 314 318

318 310 313 321 327 313 294 321 298 309 317 330 285 302 307 315 306 306 310 350 305 317 307

329 327 301 316 306 312 311 312 301 313 311 332 295 324 298 316 310 339 302 338 328 320 321

kiksi vuonna 1998 lonkkamurtumien todellinen määrä oli välillä 6 800–7 200, eli aiemmat arviot ovat noin 6–14 % liian suuria. Aikaisemmissa tutkimuksissa jokaisen henkilön ensimmäinen rekisteristä löytyvä lonkkamurtumadiagnoosilli‑ 1089

Lonkkamurtumien ilmaantuvuus Suomessa 1998–2002

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

nen hoitojakso kunakin vuonna on tulkittu liittyvän uuteen lonkkamurtumaan. Näin laskettu lukumäärä kertoo kyllä, kuinka useaa henkilöä kunakin vuonna on lonkkamurtuman takia hoidettu (kuinka monella henkilöllä hoitojakso on alkanut tai päättynyt kyseisenä vuonna). Itse murtumien määrän todentamiseen laskutapa kuitenkin sopii huonosti, sillä uuden murtuman kriteeri vaihtelee henkilöittäin sen mukaan, mihin aikaan vuotta ensimmäinen lonkkamurtumadiagnoosillinen hoitojakso sijoittuu. Jos se sijoittuu vuoden loppuun, on lonkkamurtumadiagnoositon aika noin vuoden pituinen, kun taas vuoden alussa alkaneen hoitojakson osalta diagnoositonta aikaa ei ole välttämättä päivääkään. Tästä seuraa, että vuoden loppupuolella lonkkansa murtaneet henkilöt lasketaan murtumatapauksiksi sekä todellisena murtumavuonna että seuraavana vuonna, vaikka kyse olisikin kiistatta samasta hoitoepisodista. Tässä tutkimuksessa todettiin, että naisilla oli suurempi lonkkamurtuman riski kuin miehillä paitsi pitkäaikaisessa laitoshoidossa, joka oli sukupuoltakin voimakkaammin yhteydessä lonkkamurtuman riskiin. Pitkäaikaisessa laitoshoidossa olevien osalta ei ollut havaittavissa vuodenaikoihin liittyvää kausivaihtelua lonkkamurtu➤ missa, vaikka muiden osalta talvikausi oli yhteydessä lisääntyneeseen murtumariskiin. Löydöksiä yhdistävä tekijä ➤ näyttäisi olevan toimintakyky. Toimintakyky heikkenee iän myötä ja naisilla nopeammin kuin miehillä (Murtagh ja Hubert 2004). Laitoshoi➤ toon joutuminen heijastelee kyvyttömyyttä selviytyä kotona ja kasvanutta palvelujen tarvetta, joten yhteys pitkäaikaisen laitoshoidon ja heikentyneen ➤ toimintakyvyn välillä on odotettava (Laukkanen ym. 2001). Lonkkamurtu➤ mien ilmaantuvuuksien samanlaisuus miehillä ja naisilla pitkäaikaisessa laitoshoidossa on myös ymmärrettävää, sillä jos laitoshoitoon valikoidutaan hoidon tarpeen perusteella, ei laitoksessa olevien toimintakyvyssä pitäisikään olla eroa sukupuolten välillä. 1090

Myös vuodenaikojen välisen kausivaihtelun ilmeneminen vain muilla kuin pitkäaikaisessa laitoshoidossa olevilla on selitettävissä toimintakyvyn perusteella. Kirjallisuudessa kausivaihtelulle on esitetty syyksi sään aiheuttaman liukkauden lisäksi esimerkiksi kylmyyttä, hitaampia reaktioaikoja ja pimeyttä (Douglas ym. 2000), jotka varmasti myötävaikuttavat kaatumisriskiin ulkona liikkumaan kykenevien osalta.

Lopuksi Ulotettaessa toimintakykytulkinta vakioidussa ilmaantuvuudessa havaittuun trendin katkeamiseen ei ole lainkaan yllättävää, että samoihin aikoihin myös toimintakyvyn heikkenemisen on raportoitu tasaantuneen Suomessa (Sulander ym. 2003). Tasaantuminen on sopusoinnussa myös odotetun eliniän ja toimintakyvyn muutosten yhteisvaikutuksia selittävän, väestön ikääntymistä kuvaavan yleisen teorian kanssa, joka kuitenkin ennustaa tasaantumisen olevan vain väliaikainen ilmiö (Robine ja Michel 2004).

Y D I N A S I AT Suomessa tapahtuu nykyään noin 7 000 lonkkamurtumaa vuodessa. Lonkkamurtumien vakioidun ilmaantuvuuden pitkään kestänyt kasvu on tasaantunut samoihin aikoihin kuin väestön toimintakyvyn heikentyminen on pysähtynyt. Pitkäaikaisessa laitoshoidossa olevien osalta lonkkamurtumien ilmaantuvuudessa ei ole eroa sukupuolten eikä vuodenaikojen välillä. Lonkkamurtumien ilmaantuvuudessa ei ole mer merkitseviä sairaanhoitopiirien välisiä eroja. Rekisteritietojen huolellinen käyttö tarjoaa erinomaisen mahdollisuuden lonkkamurtumien (alueellisen) ilmaantuvuuden jatkuvaan seurantaan ja alueellisten ehkäisyprojektien vaikuttavuuden arviointiin.

R. Sund

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

Jos toimintakykytulkinta pitää paikkansa, ker‑ tovat lonkkamurtumien alueelliset ilmaantuvuu‑ det samalla myös väestön toimintakyvyn alueel‑ lisista eroista. Tässä tutkimuksessa raportoidut vuosien 1998–2002 alueelliset trendit ovat var‑ sin monimuotoisia ja kuvastanevat toimintaky‑ vyn murrosta hiukan kehityksen eri vaiheissa olevien väestöjen osalta. Tässä mielessä on siis

liian varhaista tehdä päätelmiä lonkkamurtumi‑ en ehkäisyn vaikuttavuudesta. Varsinaisten pää‑ telmien tekoon tarvittaisiin pitempiä alueellisia aikasarjoja lonkkamurtumien ilmaantuvuudesta ja toisaalta tietoa alueellisista murtumienehkäi‑ syprojekteista, joissa on systemaattisesti pyritty soveltamaan käytäntöön tehokkaaksi todennet‑ tuja ehkäisykeinoja (Kannus ym. 2005).

*** Tämä tutkimus on osa Suomen Akatemian rahoittamaa projektia.

Kirjallisuutta Boufous S, Finch CF, Lord SR. Incidence of hip fracture in New South Wales: are our efforts having an effect? Med J Aust 2004;180:623– 6. Brody JA, Grant MD. Age‑associated diseases and conditions: implica‑ tions for decreasing late life morbidity. Aging Clin Exp Res 2001;13:64–7. Douglas S, Bunyan A, Chiu KH, Twaddle B, Maffulli N. Seasonal variation of hip fracture at three latitudes. Injury 2000;31:11–9. Greenland S. Principles of multilevel modelling. Int J Epidemiol 2000; 29:158–67. Huusko TM, Karppi P, Avikainen V, Kautiainen H, Sulkava R. The changing picture of hip fractures: dramatic change in age distribution and no change in age‑adjusted incidence within 10 years in Central Finland. Bone 1999;24:257–9. Kanis JA, Johnell O, De Laet C, Jonsson B, Oden A, Ogelsby AK. Interna‑ tional variations in hip fracture probabilities: implications for risk assessment. J Bone Miner Res 2002;17:1237–44. Kannus P, Niemi S, Parkkari J, Palvanen M, Vuori I, Järvinen M. Hip frac‑ tures in Finland between 1970 and 1997 and predictions for the future. Lancet 1999;353:802–5. Kannus P, Sievänen H, Palvanen M, Järvinen T, Parkkari J. Preven‑ tion of falls and consequent injuries in elderly people. Lancet 2005;366:1885–93. Langley J, Stephenson S, Cryer C, Borman B. Traps for the unwary in estimating person based injury incidence using hospital discharge data. Inj Prev 2002;8:332–7.

Laukkanen P, Karppi P, Heikkinen E, Kauppinen M. Coping with activities of daily living in different care settings. Age Ageing 2001;30:489– 94. Lüthje P, Nurmi I, Palvanen M, Kannus P. Reisiluun yläosan murtu‑ mien epidemiologia ja ehkäisymahdollisuudet. Suom Lääkäril 2001;56:1615–9. Melton LJ 3rd. Epidemiology of hip fractures: implications of the expo‑ nential increase with age. Bone 1996;18(3 Suppl):121S–125S. Murtagh KN, Hubert HB. Gender differences in physical disability among an elderly cohort. Am J Public Health 2004;94:1406–11. Robine JM, Michel JP. Looking forward to a general theory on population aging (with discussion). J Gerontol A Biol Sci Med Sci 2004;59: M590–620. Rogmark C, Sernbo I, Johnell O, Nilsson JA. Incidence of hip fractures in Malmo, Sweden, 1992–1995. A trend‑break. Acta Orthop Scand 1999;70:19–22. Sulander TT, Rahkonen OJ, Uutela AK. Functional ability in the elderly Finnish population: time period differences and associations, 1985–99. Scand J Public Health 2003;31:100–6. Sund R, Kauppinen S. Kuinka laskea ikääntyneiden pitkäaikaisasiak‑ kaiden määriä rekisteritietojen perusteella? Sosiaalilääk Aikak 2005;42:137–44.

Reijo SunD, VTM, tilastotieteilijä [email protected] Stakes, vaikuttavuuden ja oikeudenmukaisuuden tutkimusryhmä PL 220, 00531 Helsinki

1091

Lonkkamurtumien ilmaantuvuus Suomessa 1998–2002

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

VI

371

ORIGINAL ARTICLE

Quality effects of operative delay on mortality in hip fracture treatment R Sund, A Liski ............................................................................................................................... Qual Saf Health Care 2005;14:371–377. doi: 10.1136/qshc.2004.012831

See end of article for authors’ affiliations ....................... Correspondence to: R Sund, STAKES, P O Box 220, FIN-00531 Helsinki, Finland; reijo.sund@ stakes.fi Accepted for publication 7 August 2005 .......................

H

Background: Most hip fracture patients undergo surgery, but there is conflicting evidence on the relation between the timing of surgery and the outcome of treatment. There is considerable variation in the length of surgical delays between hospitals, possibly reflecting the quality of care. Aim: To examine the associations between in-hospital surgical delay and the mortality of hip fracture patients from a practical quality assessment perspective. Methods: The effects of operative delay on mortality were estimated using various statistical methods applied to observational data from 16 881 first time hip fracture patients aged 65 or older from 47 hospitals (providers) in Finland in 1998–2001. Results: A prolonged in-hospital operative delay was associated with a higher mortality of hip fracture patients in individual level analyses, but the instrumental variable approach indicated that the individual level effect was not caused by the operative delay but by inappropriate methodological assumptions. There was extensive variation between providers in the proportion of late surgery patients. Provider level analyses showed that the effects of the provider of operative delay on mortality are quite small, but there is a clear association between the proportion of late surgery patients and non-optimal treatment. Conclusions: If provider level heterogeneity is not explicitly taken into account, studies of the effects of surgical delay on outcomes are prone to serious bias. The proportion of patients with prolonged waiting time for surgery at the provider level seems to work as an effective evidence-based quality indicator. Providers should reduce unnecessary delays to surgery and identify more carefully patients not suitable for early surgery.

ip fractures are common injuries among older people and are associated with substantial morbidity and mortality.1–3 Ageing of the population has resulted in an increase in the mean age of hip fracture patients which is likely to cause additional problems in the treatment and rehabilitation of patients in the future.4 5 A surgical operation is performed for most patients during the acute management of hip fracture. A typical hip fracture patient is confined to hospital bed rest before surgery. A fairly short operative delay is suggested in the clinical guidelines,6–10 but it is known that there is considerable variation in the operative delays between providers.11–17 In fact, operative delay is a commonly used process measure in health system performance assessment.18 Minimisation of unnecessary preoperative inpatient care can be used as an economic justification for shorter operative delays. For quality measurement purposes, the operative delay should also have confirmed effects on outcomes otherwise improvements may possibly be targeted to issues which do not improve health.19 The evidence concerning the optimal timing of surgery in relation to the overall outcomes of hip fracture treatment is quite mixed.20–29 In short, the main reasons for variations in the operative delay are provider level system factors (availability of required resources such as operating rooms, support services, or personnel with expertise) and clinical decisions based on patient specific factors (co-morbidity, medical stabilisation). In some studies conflicting results may be due to methodological shortcomings (small numbers of observations, no adjustment for confounding factors), but more sophisticated evidence as to whether operative delay has an independent effect on primary outcomes is also conflicting.30–38 Another open question is the definition of the time period for delayed surgery.18 The impact of provider level heterogeneity on the

effects of operative delay on outcomes has also not received sufficient attention. This study was undertaken to examine the associations between the in-hospital operative delay and the mortality of hip fracture patients in Finland in 1998–2001. Since previous studies concerning the effect of operative delay on mortality had reported conflicting results, special attention was paid to methodological issues. More specific goals for empirical analyses were (1) to identify a suitable definition for early and late surgery groups, (2) to describe the postoperative excess mortality associated with prolonged operative delay, (3) to illustrate in which respect the early and late surgery groups differ from each other and whether adjustment of these observed factors changes the association between operative delay and mortality, (4) to compare the risk adjusted proportion of late surgery patients between providers, and (5) to investigate provider level heterogeneity in terms of 1 year mortality and the proportion of patients with late surgery.

METHODS Data set The total population of hip fracture patients in 1998–2001 was identified in the Finnish Health Care Register. The medical histories (1987–2002) and deaths (1998–2002) of the hip fracture population were extracted from the Finnish Hospital Discharge Register, Finnish Health Care Register, and from the National Causes of Death Register using the unique personal identification numbers of the patient population. Each record in these registers includes data such as patient and provider ID numbers, age, sex, area codes, and diagnosis and operation codes, as well as dates of admission,

www.qshc.com

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

372

Sund, Liski

operation, and discharge (or death). The completeness and accuracy of the registers is known to be good.39–41

0.10 0.09

Proportion of death patients

Statistical analyses Cumulative probabilities of unadjusted mortality were calculated using the product limit estimators. Excess risk of death was defined as difference in cumulative probabilities of mortality between two comparable groups. Hazard ratios were obtained using the proportional hazards model allowing the adjustment of observed confounders. Since only a limited number of covariates were observed, the biasing influence of unobserved heterogeneity was statistically controlled by incorporating the gamma frailty term into the model.45 The significance of the differences between the unadjusted patient characteristics was determined using the z test (binary variables) or x2 test (categorical variables). The logistic regression model was used to calculate odds ratios and significances of differences between adjusted patient characteristics. The same logistic model was also used for predicting the late surgery probability for each patient. These probabilities were then aggregated to the provider level, resulting in the expected number of late surgery patients for each provider.46 The provider specific ratios between observed and expected numbers of late surgery patients were further modelled using the hierarchical gamma-Poisson model to obtain the comparable risk adjusted proportions of late surgery patients for providers.47 The weekday adjusted proportion was calculated using the predictive margin technique.48 In the instrumental variables analysis49 the 1 year adjusted mortality was calculated for each admission day of the week and the null hypothesis of equal mortalities was tested using the x2 goodness of fit test.30 0.4

5+ nights

0.07 0.06 0.05 0.04 0.03 0.02 0.01 0 10 30

60

90 120

180

240

300

365

Days Figure 2 Unadjusted excess risk of death (with 95% confidence interval) for late hip fracture surgery.

Provider level heterogeneity was examined in terms of the proportion of late surgery patients and 1 year mortality. The provider specific overall mortality rates were stratified according to early and late surgery, recognising the fact that the weighted average of these mortalities gives the overall mortality. The provider level associations between these three mortality rates and the proportions of late surgery patients were estimated using Stein estimation with an assumption of linear trends for overall and early surgery mortality rates.50 The associations were also analysed separately according to severity. The five severity groups were established by classifying the patients into five equal sized groups in terms of their predicted 1 year mortality based on the logistic regression model.51

RESULTS Effect of length of operative delay on mortality The cumulative probabilities of unadjusted mortality rates attributable to different lengths of operative delay are shown in fig 1. Waiting times of 0–2 nights had the same effect on mortality, but there was a significant increase in mortality with waiting times of 3–4 nights (p,0.001), and waiting times of 5 or more nights resulted in an even higher mortality rate. Early surgery was defined as a waiting time of 0–2 nights (n = 14 426, 85.5%) and late surgery as a waiting time of at least 3 nights (n = 2455, 14.5%). The unadjusted hazard ratio between late and early surgery was 1.24 (95% confidence interval (CI) 1.15 to 1.34, p,0.0001). Excess risk of death The unadjusted excess risk of death for late surgery compared with early surgery quickly rose to 3%, then slowly increased during follow up to about 5% at 1 year (fig 2). Complementary analyses revealed that the excess risk was quite stable and near to its maximum at 1–3 years and then began to decrease slowly. As the lower confidence limits in fig 2 indicate, the unadjusted excess risk of death became statistically significant 2 weeks after the operation.

3, 4 nights

0.3

0, 1, 2 nights

0.2 0.1 0 10 30 60

Excess mortality

0.08

Data preprocessing Data were preprocessed so that information concerning patients aged 65 or older with a first hip fracture could be accurately identified.42 For the purposes of this study, the data were restricted to patients operated on using an internal fixation, a prosthesis, or a total hip replacement. Patients with incomplete data were excluded from the analyses. A few providers (hospitals) had not reported operation dates for the years 1998–1999, but otherwise no systematic bias was found in the sensitivity analyses. The final study population consisted of 16 881 patients, which represents 83.3% of all first time hip fracture patients aged 65 or older. Forty six providers had at least 50 cases, while the remaining 154 cases from other providers were combined into a single figure. The existence of possible co-morbidities was extracted for each patient from his or her medical history using the diagnosis codes recorded in the data.34 43 The extraction method was adapted and updated from the original Charlson co-morbidity categories44 and applied to the current data set.

90 120

180

240

300

365

Days Figure 1 Cumulative probabilities of unadjusted mortality following hip fracture for operative delays of various lengths.

Characteristics of early and late surgery groups Characteristics of the patients with hip fracture classified into early and late surgery groups are shown in table 1, together with the corresponding odds ratios for late surgery adjusted for the other characteristics in the table. Early surgery was associated with being 85 years or older, of female sex, having a pertrochanteric fracture, and receiving long term inpatient care during the year preceding the admission. Admission on a Wednesday and a medical history of dementia or cataract were also associated with early surgery. The risks for late surgery were likely to be increased by admission from a nursing home, health centre or readmission shortly after

www.qshc.com

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

Effects of operative delay in hip surgery on mortality

373

Table 1 Characteristics of patients with hip fracture Characteristic Age (years) 65–74 75–79 80–84 85–89 90+ Sex Male Female

Early surgery (n = 14 426, 85.5%)

Late surgery (n = 2455, 14.5%)

Unadjusted p value

Adjusted p value

Odds ratio

95% CI

1 0.885 0.954 0.867 0.836

0.770 0.833 0.753 0.706

1 0.844

0.762 to 0.935

1 1.241 1.135

1.027 to 1.500 1.029 to 1.253

,0.05 ,0.05

1.006 1.006 0.863 1.104 1.462 1.110 1.447 1.148 1.135 0.895 0.903 0.940 0.986

0.878 0.873 0.762 0.979 1.329 0.995 1.207 0.991 0.972 0.740 0.816 0.833 0.877

0.935 0.931 ,0.05 0.106 ,0.001 0.062 ,0.001 0.066 0.109 0.257 ,0.05 0.310 0.814

0.13

,0.001 2721 2995 3483 3379 1848

(18.9) (20.8) (24.1) (23.4) (12.8)

504 491 616 556 288

(20.5) (20.0) (25.1) (22.6) (11.7)

to to to to

1.017 1.092 0.998 0.989

,0.001 3443 (23.9) 10983 (76.1)

696 (28.4) 1759 (71.6)

Fracture type Pertrochanteric Subtrochanteric Femoral neck

4529 (31.4) 835 (5.8) 9065 (62.8)

694 (28.3) 165 (6.7) 1596 (65.0)

Medical history Cancer Diabetes Dementia Hypertension Cardiovascular disease Cerebrovascular disease Peripheral vascular disease Chronic pulmonary disease Anaemia Diseases of the nervous system* Eye diseases� Diseases of the digestive system` Other diseases1

1676 (11.6) 1470 (10.2) 2761 (19.1) 2188 (15.2) 5110 (35.4) 2807 (19.5) 632 (4.4) 1205 (8.4) 1161 (8.0) 852 (5.9) 4182 (29.0) 2316 (16.1) 2429 (16.8)

299 (12.2) 290 (11.8) 411 (16.7) 444 (18.1) 1108 (45.1) 562 (22.9) 174 (7.1) 265 (10.8) 236 (9.6) 139 (5.7) 673 (27.4) 396 (16.1) 432 (17.6)

Pre-admission residence Home (.1 week) Home (max 1 week) Nursing home Health centre� Hospital

9444 (65.5) 685 (4.7) 2508 (17.4) 1249 (8.7) 540 (3.7)

1559 (63.5) 153 (6.2) 416 (16.9) 143 (9.9) 84 (3.4)

2569 (17.8)

402 (16.4)

Day of admission Monday Tuesday Wednesday Thursday Friday Saturday Sunday

2479 2289 2248 2091 1898 1744 1677

(17.2) (15.9) (15.6) (14.5) (13.2) (12.1) (11.6)

307 (12.5) 288 (11.7) 220 (9.0) 401 (16.3) 678 (27.6) 358 (14.6) 203 (8.3)

Year** 1998 1999 2000 2001

3275 3325 3869 3957

(83.1) (84.8) (86.1) (87.5)

667 597 625 566

,0.01

,0.001

0.223 ,0.01 ,0.01 ,0.001 ,0.001 ,0.001 ,0.001 ,0.001 ,0.01 0.333 0.058 0.474 0.185

,0.05

to to to to to to to to to to to to to

1.151 1.160 0.976 1.244 1.609 1.238 1.736 1.330 1.319 1.084 1.000 1.060 1.108

,0.01 1 1.300 1.215 1.177 0.888

1.076 1.016 1.000 0.694

0.823

0.691 to 0.979

1 1.026 0.803 1.567 2.970 1.686 0.989

0.864 0.668 1.334 2.559 1.429 0.818

1.218 0.964 1.840 3.447 1.990 1.195

0.770 ,0.05 ,0.001 ,0.001 ,0.001 0.907

1 0.877 0.801 0.700

0.775 to 0.992 0.709 to 0.904 0.619 to 0.793

,0.05 ,0.001 ,0.001

to to to to

1.570 1.451 1.384 1.135

,0.001

,0.01 ,0.05 ,0.05 0.341 ,0.05

,0.001 to to to to to to

,0.001 (16.9) (15.2) (13.9) (12.5)

,0.01 ,0.05

,0.001

.180 days inpatient care during year preceding fracture

0.086 0.494 ,0.05 ,0.05

,0.001

*Includes Parkinson’s disease, hemiplegia or paraplegia, and epilepsy. �Includes glaucoma and cataract. `Includes peptic ulcer disease, abdominal hernia, liver disease, disorders of gallbladder and biliary tract, and irritable colon. 1Includes hypothyroidism, gout, renal disease, arthrosis, and rheumatic disease. �Health centre refers to the inpatient ward of the local primary health care unit **Percentages in parentheses are the proportions of the categories in the current year.

discharge, admission between Thursday and Saturday, and co-morbidities including cardiovascular disease and peripheral vascular disease (and, prior to adjustment, diabetes, cerebrovascular disease, chronic pulmonary disease, hypertension, and anaemia). The proportion of patients having late surgery substantially decreased during the period of the study.

unobserved covariates, the hazard ratio remained significant, indicating an increased mortality for late surgery. However, after the pseudo randomised assignment of patients into early and late surgery groups using admission day of the week as an instrumental variable, the difference in the adjusted 1 year mortality rate was found not to be attributable to operative delay (p = 0.069).

Adjusted effect of operative delay on mortality The hazard ratio for the late surgery reduced to 1.18 (95% CI 1.09 to 1.28, p,0.0001) after adjusting for the provider and the characteristics in table 1. Even after controlling for the

Proportions of late surgery patients There was extensive variation in the proportions of late surgery patients between providers (fig 3). The conservative 95% confidence interval crosses the potentially achievable

www.qshc.com

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

374

Sund, Liski

One year mortality

0.7 0.6 0.5 0.4 0.3 0.2 0.1 0

0

0.1

0.2

0.3

0.4

0.5

Share of patients with late surgery Figure 4 Scatter plot of adjusted mortality for surgically treated hip fracture patients and the share of patients undergoing late surgery for different providers (The triangles represent the overall mortality of operated hip fracture patients for each provider and the crosses the mortality of late surgery patients for each provider. The line with a positive slope is the trend of overall mortality and the dotted curved line indicates the association between the share of late surgery patients and late surgery mortality across providers). 0

10

20

30

40

50

60

% Figure 3 Provider-specific risk adjusted percentages of hip fracture patients undergoing late surgery. The thin parts of the lines correspond to 95% confidence intervals and the thick parts correspond to 50% confidence intervals. The dashed line refers to the potentially achievable percentage (7.7%) of patients undergoing late surgery.

(20th centile)52 92.3% share of patients undergoing early surgery for less than 50% of the providers. Trend analysis at provider level The simultaneous examination of the provider level proportions of late surgery patients and 1 year mortality rates showed that there was a statistically significant positive trend for an association between a larger share of late surgery patients and 1 year overall mortality (p,0.05), and that a smaller proportion of late surgery patients was (non-linearly) associated with a higher mortality rate for these patients (fig 4). Trend analyses for severity groups The associations were further analysed according to severity (measured as predicted 1 year mortality based on the same observed patient characteristics as in table 1) in five severity groups of patients (table 2). The proportion of patients undergoing late surgery was only slightly higher in the more severe patient groups but the 1 year mortality rate increased significantly with severity, as expected. The slope of the trend of overall mortality increased with the severity but was statistically significant only for the most severe patients. The positive slope of early surgery mortality was statistically significant in severity groups 3 and 4, but for the most severe patients this association disappeared. A smaller proportion of late surgery patients was associated with a higher late surgery mortality rate in all groups except the most severe patients.

DISCUSSION In this study the definition of late surgery turned out to be 3 or more nights in hospital after admission, since shorter waiting times were not associated with higher unadjusted mortality. A similar ‘‘long’’ delay is commonly used as the definition of late surgery in other studies.17 18 20 34

The unadjusted excess mortality for late surgery patients increased during the year following the operation. It is interesting that the short term mortality differed only slightly between early and late surgery groups, but the difference became much clearer after the perioperative period. This may be partly due to the fact that almost half the perioperative deaths are unavoidable in an unselected population.53 If patients are going to die shortly after the fracture—regardless of the quality of the hip fracture treatment provided—it is obvious that the treatment effect turns out to be small. The treatment effect becomes observable for long term mortality but is potentially biased because of heterogeneity between the early and late surgery groups in patient characteristics affecting the outcome.20 30 33 36 Several patient characteristics differed between the early and late surgery groups, but the prolonged operative delay seemed to increase the (long term) mortality significantly even after adjustment for these observed patient characteristics. There were no detailed clinical data available which obviously makes the adjustments only partial. The bias attributable to the unobserved covariates was controlled by allowing extra variation in the model, but the treatment effect remained significant. However, this kind of model is still prone to bias if the relationships between the risk factors and mortality are not correctly specified in the model.54 The pseudo randomizing instrumental variable approach was therefore applied and indicated that the increased mortality was not due to operative delay. Since the interpretations based on the instrumental variable approach also require strong and questionable assumptions,30 33 the sizeable difference in the estimates of the effect of operative delay on mortality between methods means that there is a need for hypotheses to explain such differences. For instance, one could try to separate the acceptable delays from the unacceptable ones27 36 or one could record the reason for the delay in the data.17 31 32 In the strictest sense, this kind of approach needs the assumption that the acceptable and unacceptable delays can be defined and measured uniquely.55–57 In practice, an acceptable delay corresponds to a clinical decision to postpone the operation and the assumption is approximately fulfilled if the clinical practice remains constant. Because the data used in this study did not include the reason for the delay, the hypothesis of constant clinical practice was examined indirectly using provider level analyses.

www.qshc.com

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

Effects of operative delay in hip surgery on mortality

375

Table 2 Effect of proportion of late surgery patients on mortality in different severity groups of hip fracture patients One year mortality

All patients (adjusted mortality)� Overall Early surgery Late surgery` Patient group 1 (least severe) Overall Early surgery Late surgery Patient group 2 Overall Early surgery Late surgery Patient group 3 Overall Early surgery Late surgery Patient group 4 Overall Early surgery Late surgery Patient group 5 (most severe) Overall Early surgery Late surgery

n (%)

10% of patients with late surgery

15% of patients with late surgery

20% of patients with late surgery

30% of patients Slope* with late surgery (p value)

16881 14426 (85) 2455 (15)

29.0 28.4 35.0`

29.3 28.8 31.9

29.5 29.3 30.5

30.0 30.2 29.7

0.5 (,0.05) 0.9 (,0.01)

3376 (20) 2931 (87) 445 (13)

9.4 8.8 15.4

9.3 8.6 13.4

9.2 8.4 12.4

9.0 8.1 11.1

20.2 (0.232) 20.3 (0.168)

3376 (20) 2932 (87) 444 (13)

18.0 17.4 23.8

17.9 17.5 20.1

17.8 17.7 18.4

17.6 17.9 16.7

20.2 (0.283) 0.2 (0.310)

3376 (20) 2875 (85) 501 (15)

26.9 26.6 29.7

27.2 27.5 25.4

27.6 28.5 23.8

28.2 30.4 23.1

0.6 (0.101) 1.9 (,0.01)

3377 (20) 2869 (85) 508 (15)

37.9 37.4 43.0

38.3 38.3 38.5

38.7 39.2 36.7

39.5 41.1 35.8

0.8 (0.126) 1.8 (,0.05)

3376 (20) 2819 (84) 557 (16)

51.4 50.7 57.9

51.9 50.6 59.3

52.5 50.6 59.9

53.5 50.5 60.6

1.1 (,0.05) 20.1 (0.455)

*Change in percentage of deaths when the share of patients with late surgey increased 10%. �Corresponds to fig 4. Example: Expected mortality for late surgery patients in provider with 10% of patients with late surgery is 35%. These percentages can be interpreted as coordinates (0.10, 0.35) which hit the dotted curve in fig 4. `Calculated from the equation: ml(X) = (m(X) 2 (12X) * me(X))/X, where X is the proportion of late surgery patients, m(X) refers to overall mortality at X, me(X) to early surgery mortality at X, and ml(X) to late surgery mortality at X. Example: ml(0.1) = (0.2903 2 0.9 * 0.2837)/0.1 = 0.3497.

The provider specific proportion of late surgery patients, adjusted for observed patient characteristics, showed that there was extensive variation between providers, which is a common finding.11–17 A simple performance assessment interpretation is that the percentage of late surgery patients can be reduced to a potentially achievable level52 that can also be interpreted as the upper limit for the proportion of acceptable delayed patients. Correspondingly, the expected proportion of unacceptable delayed patients is the proportion of late surgery patients exceeding this upper limit. This leads to the hypothesis that the overall mortality of hip fracture patients should increase with an increasing proportion of late surgery patients given that the longer operative delay would have an adverse effect on mortality. In fact, a statistically significant trend between the provider specific proportions of late surgery patients and overall 1 year mortality gives empirical evidence for the hypothesis, but the actual volume of the effect was small. Another provider level hypothesis is that the long term mortality of late surgery patients is higher if only the patients unfit for surgery are delayed, since the unfit condition for surgery is also a risk factor for 1 year mortality. This hypothesis was also supported by empirical evidence which revealed a non-linear association between the provider specific proportion of late surgery patients and the 1 year mortality rate for these patients. A more careful examination of the groups of patients with different predicted mortality (severity) gave even more insight into the provider level association between the proportion of late surgery patients and mortality. Intuitively speaking, the least severe patients are young, come from home, and are without severe medical conditions. Correspondingly, the most severe patients are older with much co-morbidity and come from residential care.21 31 53 For patients in the least severe group, mortality was higher in the late surgery patients for all providers. This probably indicates that, in this group, hip fracture related mortality is caused by

the medical problems which require late surgery and the prolonged surgical delay does not itself increase the mortality.28 30 31 36 For patients in the most severe group mortality was also higher for the late surgery patients for all providers. However, in this group the mortality rate in the late surgery patients was lower in the providers with a small proportion of late surgery patients and there were no differences in mortality rates among early surgery patients between providers. This means that, in this group, it is essential to perform early surgery in all patients who can withstand it because the significantly prolonged surgical delay makes the patient’s condition worse and increases mortality.22 23 29 32 33 For groups 2–4 the interpretation is more difficult than for patients in the extreme groups. Since the mortality rate in early surgery patients increases significantly and is higher even than the mortality rate of late surgery patients for providers with a large proportion of late surgery patients, it seems that in these groups an operation performed too early may cause more harm than the prolonged waiting time.21 22 26 Other differences between providers in the proportion of patients with late surgery result mainly from lack of resources such as temporary unavailability of the operating rooms or surgeons and problems with obtaining medical clearance from other specialities in off hours.57 58 The importance of these factors can be illustrated using the effect of the admission day on the in-hospital operation delay; the mean proportion of patients undergoing late surgery would be reduced from 14.5% to 9.0% (p,0.0001) if all patients had been admitted on ‘‘the best day’’. In spite of the large nationwide database and the advanced methods, the observational study design and administrative data require a special orientation to the analyses and caution should be exercised when interpreting the results. Data were available for 1998–2001 and the practices may have disproportionately changed for the different providers during

www.qshc.com

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment

376

that time. The results must therefore be interpreted as the mean situation between the years 1998–2001 for each provider. Non-operated patients were excluded from the analyses since the operative delay was not defined uniquely for these patients. Inclusion of these patients would certainly increase the mortality rate since the typical non-operated patient is in a worse condition and dies before any operation can even take place. The trend analyses are intended to summarise the systematic parts of the variation between the providers. However, there seems to be some unexplained variation, especially in the proportions of late surgery patients, because of the small number of patients for some providers, so the results of the trend analyses need to be confirmed and validated in further studies. Study designs which explicitly take into account the heterogeneity at the provider level are a prerequisite for unbiased results since the selection bias may vary between providers. One could also include the provider level characteristics in the analyses,35 37 but easily observable characteristics such as hospital type or number of surgeons typically allow only indirect interpretations. A more direct approach would be to observe the performance of providers continuously and to ask the providers themselves to clarify the reasons for their performance.12 58 59 In Finland the mean proportion of late surgery patients has decreased during the years of the study, but extensive variation between providers exists. In addition to possible system problems, there also seems to be variation in the clinical practices for judging which patients are suitable for early surgery. Identification of good practices and the release of a national clinical guideline for hip fracture treatment will probably improve the situation. In this sense, the proportion of patients with a prolonged waiting time for a hip fracture operation seems to be an effective evidence based quality indicator, since it clearly indicates an area for improvement. The effects of operative delay on mortality at the patient level are quite small, but at the provider level the association between the proportion of late surgery patients and nonoptimal treatment is clear. .....................

Authors’ affiliations

R Sund, A Liski, National Research and Development Centre for Welfare and Health (STAKES), Outcomes and Equity Research, Helsinki, Finland This research was supported by a research grant from the Academy of Finland.

REFERENCES 1 Zuckerman JD. Hip fracture. N Engl J Med 1996;334:1519–25. 2 Roberts SE, Goldacre MJ. Time trends and demography of mortality after fractured neck of femur in an English population, 1968–98: database study. BMJ 2003;327:771–5. 3 Liporace FA, Egol KA, Tejwani N, et al. What’s new in hip fractures? Current concepts. Am J Orthop 2005;34:66–74. 4 Marks R, Allegrante JP, Ronald MacKenzie C, et al. Hip fractures among the elderly: causes, consequences and control. Ageing Res Rev 2003;2:57–93. 5 Thorngren KG, Hommel A, Norrman PO, et al. Epidemiology of femoral neck fractures. Injury 2002;33:C1–7. 6 Royal College of Physicians. Fractured neck of femur. Prevention and management. Summary and recommendations of a report of the Royal College of Physicians. J R Coll Physicians Lond 1989;23:8–12. 7 Chilov MN, Cameron ID, March LM. Evidence-based guidelines for fixing broken hips: an update. Med J Aust 2003;179:489–93. 8 Scottish Intercollegiate Guidelines Network. Prevention and management of hip fracture in older people. Edinburgh: Scottish Intercollegiate Guidelines Network, 2002. 9 New Zealand Guidelines Group. Acute management and immediate rehabilitation after hip fracture amongst people aged 65 years and over. Wellington, NZ: New Zealand Guidelines Group, 2003. 10 Thorngren KG, Dolk T, Jarnlo GB, et al. National guidelines for care and treatment of hip fractures (in Swedish). Stockholm: The National Board of Health and Welfare, 2003. 11 Audit Commission. United they stand, Co-ordinating care for elderly patients with hip fracture. London: Audit Commission, 2000.

Sund, Liski

12 Freeman C, Todd C, Camilleri-Ferrante C, et al. Quality improvement for patients with hip fracture: experience from a multi-site audit. Qual Saf Health Care 2002;11:239–45. 13 New Zealand Health Information Service (NZHIS). Fracture of neck of femur services in New Zealand hospitals 1999/2000. Wellington: New Zealand Health Information Service (NZHIS), 2002. 14 Bottle A, Jarman B, Aylin P, et al. Some way to go for consistent implementation of guidance on hip fracture. BMJ 2004;328:1097. 15 Scottish Hip Fracture Audit Steering Group (SHFASG). Scottish hip fracture audit report 2002. Edinburgh: NHS Scotland, 2002. 16 Heikkinen T, Willig R, Hanninen A, et al. Hip fractures in Finland: a comparison of patient characteristics and outcomes in six hospitals. Scand J Surg 2004;93:234–40. 17 Todd CJ, Freeman CJ, Camilleri-Ferrante C, et al. Differences in mortality after fracture of hip: the East Anglian audit. BMJ 1995;310:904–8. 18 Fairbank J, Goldacre MJ, Mason A, et al. Health outcome indicators: fractured proximal femur, Report of a Working Group to the Department of Health. Oxford: National Centre for Health Outcomes Development, 1999. 19 Brook RH, McGlynn EA, Cleary PD. Quality of health care. Part 2: measuring quality of care. N Engl J Med 1996;335:966–70. 20 Zuckerman JD, Skovron ML, Koval KJ, et al. Postoperative complications and mortality associated with operative delay in older patients who have a fracture of the hip. J Bone Joint Surg Am 1995;77:1551–6. 21 Mullen JO, Mullen NL. Hip fracture mortality. A prospective, multifactorial study to predict and minimize death risk. Clin Orthop Relat Res 1992;280:214–22. 22 Kenzora JE, McCarthy RE, Lowell JD, et al. Hip fracture mortality. Relation to age, treatment, preoperative illness, time of surgery, and complications. Clin Orthop Relat Res 1984;186:45–56. 23 Casaletto JA, Gatt R. Post-operative mortality related to waiting time for hip fracture surgery. Injury 2004;35:114–20. 24 McGoye PF, Evans J. Fractures of the hip: Immediate versus delayed treatment. Can Med Assoc J 1960;83:260–3. 25 Davis FM, Woolner DF, Frampton C, et al. Prospective, multi-centre trial of mortality following general or spinal anaesthesia for hip fracture surgery in the elderly. Br J Anaesth 1987;59:1080–8. 26 Kern IB. Fractures of the neck of the femur: a review of 308 cases treated at the Prince of Wales Hospital from January 1960 to June 1961. Med J Aust 1962;49:554–7. 27 Bredahl C, Nyholm B, Hindsholm KB, et al. Mortality after hip fracture: results of operation within 12 h of admission. Injury 1992;23:83–6. 28 Holt EM, Evans RA, Hindley CJ, et al. 1000 femoral neck fractures: the effect of pre-injury mobility and surgical experience on outcome. Injury 1994;25:91–5. 29 Hoenig H, Rubenstein LV, Sloane R, et al. What is the role of timing in the surgical and rehabilitative care of community-dwelling older persons with acute hip fracture? Arch Intern Med 1997;157:513–20. 30 Ho V, Hamilton BH, Roos LL. Multiple approaches to assessing the effects of delays for hip fracture patients in the United States and Canada. Health Serv Res 2000;34:1499–518. 31 Grimes JP, Gregory PM, Noveck H, et al. The effects of time-to-surgery on mortality and morbidity in patients following hip fracture. Am J Med 2002;112:702–9. 32 Moran CG, Wenn RT, Sikand M, et al. Early mortality after hip fracture: is delay before surgery important? J Bone Joint Surg Am 2005;87:483–9. 33 McGuire KJ, Bernstein J, Polsky D, et al. The 2004 Marshall Urist award: delays until surgery after hip fracture increases mortality. Clin Orthop Relat Res 2004;428:294–301. 34 Roos LL, Walld RK, Romano PS, et al. Short-term mortality after repair of hip fracture. Do Manitoba elderly do worse? Med Care 1996;34:310–26. 35 Weller I, Wai EK, Jaglal S, et al. The effect of hospital type and surgical delay on mortality after surgery for hip fracture. J Bone Joint Surg Br 2005;87:361–6. 36 Orosz GM, Magaziner J, Hannan EL, et al. Association of timing of surgery for hip fracture and patient outcomes. JAMA 2004;291:1738–43. 37 Hamilton BH, Hamilton VH, Mayo NE. What are the costs of queuing for hip fracture surgery in Canada? J Health Econ 1996;15:161–85. 38 Hamilton BH, Bramley-Harker RE. The impact of the NHS reforms on queues and surgical outcomes in England: Evidence from hip fracture patients. Econ J 1999;109:437–62. 39 Keskima¨ki I, Aro S. Accuracy of data on diagnoses, procedures and accidents in the Finnish Hospital Discharge Register. Int J Health Sci 1991;2:15–21. 40 Lahti RA, Penttila A. The validity of death certificates: routine validation of death certification and its effects on mortality statistics. Forensic Sci Int 2001;115:15–32. 41 Luthje P, Nurmi I, Kataja M, et al. Incidence of pelvic fractures in Finland in 1988. Acta Orthop Scand 1995;66:245–8. 42 Sund R. Utilisation of administrative registers using scientific knowledge discovery. Intelligent Data Analysis 2003;7:501–19. 43 Romano PS, Roos LL, Jollis JG. Adapting a clinical comorbidity index for use with ICD-9-CM administrative data: differing perspectives (with discussion). J Clin Epidemiol 1993;46:1075–90. 44 Charlson ME, Pompei P, Ales KL, et al. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis 1987;40:373–83. 45 Hougaard P. Frailty models for survival data. Lifetime Data Anal 1995;1:255–73. 46 Goldstein H, Spiegelhalter DJ. League tables and their limitations: statistical issues in comparisons of institutional performance (with discussion). J R Stat Soc Ser A Stat Soc 1996;159:385–443.

www.qshc.com

Methodological Perspectives for Register-Based Health System Performance Assessment

Research Report 174 STAKES 2008

Effects of operative delay in hip surgery on mortality

47 Howley PP, Gibberd R. Using hierarchical models to analyse clinical indicators: a comparison of the gamma-Poisson and beta-binomial models. Int J Qual Health Care 2003;15:319–29. 48 Graubard BI, Korn EL. Predictive margins with survey data. Biometrics 1999;55:652–9. 49 Newhouse JP, McClellan M. Econometrics in outcomes research: the use of instrumental variables. Annu Rev Public Health 1998;19:17–34. 50 Greenland S. Principles of multilevel modelling. Int J Epidemiol 2000;29:158–67. 51 Daley J, Iezzoni LI, Shwartz M. Conceptual and practical issues in developing risk-adjustment methods. In: Iezzoni LI, eds. Risk adjustment for measuring health care outcomes. 3rd edn. Chicago: Health Administration Press, 2003:179–205. 52 Gibberd R, Hancock S, Howley P, et al. Using indicators to quantify the potential to improve the quality of health care. Int J Qual Health Care 2004;16:i37–43. 53 Foss NB, Kehlet H. Mortality analysis in hip fracture patients: implications for design of future outcome trials. Br J Anaesth 2005;94:24–9.

377

54 Shwartz M, Ash AS. Estimating the effect of an intervention from observational data. In: Iezzoni LI, eds. Risk adjustment for measuring health care outcomes. 3rd edn. Chicago: Health Administrative Press, 2003:275–95. 55 Schultz RJ, Whitfield GF, LaMura JJ, et al. The role of physiologic monitoring in patients with fractures of the hip. J Trauma 1985;25:309–16. 56 Charalambous CP, Yarwood S, Paschalides C, et al. Factors delaying surgical treatment of hip fractures in elderly patients. Ann R Coll Surg Engl 2003;85:117–9. 57 Orosz GM, Hannan EL, Magaziner J, et al. Hip fracture in the older patient: reasons for delay in hospitalization and timing of surgical repair. J Am Geriatr Soc 2002;50:1336–40. 58 Beupre LA, Johnston DB, Jones CA, et al. Continuum of care and integration of services for treatment of elderly patients with hip fracture. Alberta: Alberta Heritage Foundation for Medical Research (AHFMR), 2003. 59 Parker MJ, Currie CT, Mountain JA, et al. Standardised audit of hip fracture in Europe (SAHFE). Hip Int 1998;8:10–5.

www.qshc.com

Research Report 174 STAKES 2008

Methodological Perspectives for Register-Based Health System Performance Assessment