Abstract
Longitudinal study is type of studies that researchers visit subject in several time. Therefore, there are observations of the same subjects that are correlated. These types of studies are widely used in medical science. On the other hand, in medical studies, we frequently face situations that response mastered by linguistic terms. A new transition model which will be able to handle correlation between fuzzy responses is introduced. In this paper we model the transition possibility by fuzzy logistic regressions, and representing how the covariates relate to changes in response. With p covariates, there are (p + 1) parameters including intercepts, which we estimate by extended least squares method. These possibilities depend on the covariates. By using a real data set, an applied example is provided to explain the applicability of the proposed model in clinical studies. In the clinical studies, the effect of hydro-alcoholic extract of Urtica Dioica on menorrhagia (for which the status is basically expressed by linguistic/fuzzy terms) is investigated also the effect of mental intervention in recovery of patients with Lichen Planus disease.
Introduction
One of the most common medical researches is longitudinal study in which measurements of same subject are conducted repeatedly through time. In longitudinal studies, researchers are able to characterize the development or change in response variable over time [7]. Due to correlation between responses of same subject, ordinary generalized linear models are not appropriated in these cases. There are three main approaches to address the issue longitudinal data: marginal, transition and random effects models. These models are different from correlation handling and coefficient interpretation that due to different goals of inference of these models [26]. Starm et al. used a random effect model for binary correlated response and used EM algorithm for parameters estimate [5]. Stiratelli et al. [24] applied random effects model and using EM algorithm for estimated of parameters. Transition model is an extension of generalized linear regression for describing the conditional distribution of each response as an explicit function of past responses and predictors. Kalbfleisch and Lawless introduced methods for the analysis of panel data under a continuous-time Markov model. They showed the procedures for obtain maximum likelihood estimates and associated asymptotic covariance matrices for transition intensity parameters in time homogeneous models [12]. Muenz and Rubinstein proposed Markov models for covariate dependence of binary repeated response. They assumed that the sequence of response is followed a binary Markov chain. They modeled the transition probabilities for the responses by two logistic regressions, and showed how the covariates related to changes in response [15]. Diggle et al. [23] described transition model for correlative categorical data. Islam and Chowdhury suggested covariate dependence Markov model for modeling repeated responses [16].
The transition models frequently used in medical science. For example, Lee and Daniels used a Markov model for Longitudinal quality-of-life data from a colorectal cancer clinical trial study [13]. On the other hand, in medical studies, we frequently face situations that response mastered by linguistic terms. For example, to measure the severity of disease or pain in patients, using the numerical exact numbers is not usual, so that the results are usually reported as linguistic terms like low, medium, high, and very high [19].
For modeling vague responses, Tanaka et al. [9] introduced a regression model in fuzzy environment, for the first time. Fuzzy linear regression has been developed by a lot of researchers over the past decades. For example, Sekkeli et al. [8] proposed classification models based on Tanaka’s fuzzy linear regression approach and applied their model in customer satisfaction data. Nasrabadi and Nasrabadi considered a fuzzy linear regression models with fuzzy/crisp output, fuzzy/crisp input considered, and proposed an estimated method along with a mathematical programming based approach [17].
The second approach of fuzzy linear regression analysis adopts the fuzzy least squares method for minimizing errors between the given response and the estimated response, proposed by Diamond [22]. Yang and Lin proposed two estimation methods along with a fuzzy least-squares approach for linear regression with fuzzy input-output [21]. Torabi and behboodian introduced a method for least absolutes estimating of fuzzy parameters in a linear model with fuzzy input and fuzzy output, by used “Resolution Identity” [10]. In binary based fuzzy response field, Pourahmad et al. [27] proposed a new term that is named possibilistic odds and extended a possibilistic-based logistic regression in which the covariates were crisp and response variable was fuzzy. Namdari et al. [20] suggested a new possibilistic logistic regression in which the predictors were crisp and response variables were vague and were measured by linguistic terms.
According to what is being said in non-fuzzy situation, these fuzzy regression models are not appropriate for inference about correlated binary fuzzy response too. In this study, we introduced a new method which will be able to handle correlation between fuzzy responses through a fuzzy logistic regression model.
In Section 2 we describe logistic regression and classical transition model for binary crisp response and some fuzzy consepts. In section 3 we defined new fuzzy transition model in possibility case and in section 4 we proposed fuzzy transition model for linguistic term case. In Section 5 and 6, we illustrate proposed transition model for fuzzy binary based longitudinal in two clinical studies. In section 7 is provided a conclusion.
Preliminary
Logistic regression
Logistic regression is a model to analyze the studies that response variable is measured with binary variable. The purpose of logistic regression is to detect the best fitting model to describe the relationship between the binary response and set of predictors. As an example, imagine how coronary heart disease (CHD) can be predicted by the level of serum cholesterol. The probability of CHD increases with the serum cholesterol level [4].
Since logistic regression computes the probability of an event happening over the probability of an event not happening, the impact of predictor variables is usually explained in terms of in which π is probability of an event happening and is called odds.
With logistic regression, the natural log odds is modeled as a linear function of the predictor variables. If π (x) is detected as the probability of success related to predictors, the logistic regression is defined as follow:
This model is applied in a wide variety of studies. It is used in biomedical studies, social science researches, marketing and genetic studies. For instance, Levinson et al. [3] applied logistic regression to analysis of the genotype data of affected sibling pair (ASPs) and their parents from several research centers.
Transition model: the classical approach
A particular case of longitudinal study is called transition (or Markov) models. Let Y
ij
be the response of ith subject in jth flow up time where i = 1, . . . , n and j = 1, . . . , t. The joint distribution of the responses Yi1, Yi2, . . . , Y
it
of the ith subject can be computed as follows:
When the response variable is binary, a first order Markov chain is presented by the transition matrix where π
ab
= P (Y
ij
= b|Yij-1 = a), a, b = 0, 1. Note that, each row of a transition matrix sums to one since P (Y
ij
= 0|Yij-1 = a) + P (Y
ij
= 1|Yij-1 = a) = 1. As its name implies, the transition matrix records the probabilities of making each of the possible transition from one visit to the next. The transition probabilities are modeled as function of predictors. Therefore model is presented by
Fuzzy numbers and fuzzy arithmetic
normality: u (x0) =1 for some x0,
u is a convex fuzzy set,
and x, y ∈ R, 0 ≤ λ ≤ 1; u is upper semicontinuous,
(u) 0 = closure {t|t ∈ R, u (t) >0} is compact.
The space E is called a fuzzy number space and each u ∈ E is called a fuzzy number.
In special case, for L (x) = R (x) = max {0, 1 - |x|}, i called triangular fuzzy number and is denoted by A = (m, α, β) T .
Fuzzy transition model: a) representing responses by degree of memberships
To defined the relationship between a correlated binary biased response variable and a set of predictor variables by transition model, for each subject, let the responses and predictors consist of the observations
In fuzzy longitudinal study, the researchers usually ask the expert to assign a consistent degree (possibility of being patient in medical study) to each case as a number between 0 and 1 at each time. Therefore μ
ij
, i = 1, . . . , n, j = 1, . . . , t is the possibility of success and the possibility transition model proposed as follows:
Based on fuzzy arithmetic, it can be shown that, is a triangular fuzzy number, so that
Based on the fuzzy arithmetic, the membership function of the fuzzy estimated response can be shown as follows:
For estimation of the coefficients, the idea is to minimize the total fuzziness of the obtained model by minimizing the sum of the spreads of the fuzzy outputs, the subject to some constraints by which one can insure about the validity of the obtained model. In this regard, it is assumed that: Each observation, w
ij
has a membership degree as big as h in the function of the fuzzy estimated response, W
ij
,
The fuzzy coefficients are such that the fuzziness of the model is minimized. The determination of fuzzy coefficients leads to a linear programming problem, in which the objective function is the sum of the spreads of the fuzzy outputs,
Finally, one can minimize the objective function by using a linear programming algorithm (such as the Simplex method) to estimate the center, and the left and the right spreads of each coefficient. In this research, we use the Lingo 8.0 [14] software.
Fuzzy transition model: b) representing responses by linguistic terms
In the following, we defined a transition model with fuzzy binary based observations and possibility of success as a linguistic term such as: Very Low, Low, Medium, High, and Very High.
Model with this considerations is represented as follows:
Since, is a one-to-one function, so, there is one and only one x ∈ (0, 1) such that . Therefore, we can rewrite
Note that, this means that by placement x with in the definition of possibility term, the membership function of logarithm transformation of possibilistic odds is obtained. To estimate the parameters of the model, the sum of squared errors between and should be minimized. To do this, we require to define a function which describe well the distance between two fuzzy numbers [22]. A well-known definition, suggested by Xu and Li [25], is as follows:
In the following, we apply the proposed transition models to fuzzy binary based longitudinal data in two clinical studies.
Application in medical study 1: lichen planus data
In this section, we use the fuzzy transition model for longitudinal fuzzy response that was proposed in Section 3 to describe the efficacy of drug therapy of psychiatric disorders in oral lichen planus. In this study, the researchers fill out special examination forms for 45 patients referring to the Oral Medicine Department in Mashhad Dental School from October 2004 to December 2005 and oral lichen planus was confirmed through clinical examination, biopsy and histopathological evaluation. Then the patients were evaluated by a psychologist [30]. Forty five patients with oral Lichen planus completed 6 months of treatment with either conventional treatment or conventional treatment + drug therapy for their diagnosed psychological disorder. The primary outcome measure was response to the treatment which was recorded 3 times (see, Table 1).
The proposed fuzzy transition model for Lichen planus data is as follows:
The coefficients are estimated as follows:
In this model, after controlling for the potential confounders, a positive relation between the group and pain was detected.
Prediction for a new case. To see the applicability of the above model, consider for example, in a new case we have the following information: previous response = 0.35, age = 57 intervention group in 2th visit. Based on such information, the possibility odds predicted by the model would be (-0.064, 1.4)
T
. According to Extension Principle [25], we have
Figure 1 represents the membership function of the predicted possibilistic odds.
Application in medical study 2: Menorrhagia data
Menorrhagia is a common gynecological problem and leading causes of poor quality of life and iron deficiency anemia in women of reproductive age.
As there is no research on the effect of hydroalcoholic extract of Urtica Dioica on menorrhagia, this study is conducted to determine the effect of hydroalcoholic extract of Urtica Dioica on menorrhagia in Babol Azad University students on 2012-13. A randomized triple blind clinical trial was carried out on 100 women affected by menorrhagia, selected by convenience sampling, which had inclusion criteria. Data collection tools were data form, weight, meter and PLBAC chart. All samples after a control cycle of primary bleeding and dividing with random allocation to trial or control group were subjected to mefenamic acid treatment (500 mg every 8 hours) and Urtica Dioica 5 caps. Per day, from first to end of bleeding, up to 7 days, for two consecutive cycles, for trial group and for the control group, mefenamic acid and placebo as the same way of trial group, was prescribed [2]. Usually menorrhagia status was measured by PLBAC chart, but menorrhagia status was basically expressed by verbal reports, linguistic variables and borderlines of categories of linguistic variables [8]. In point of view, the reported borderline between subcategories of menorrhagia status that has been reported is vague (see,Table 2). Because fuzziness must be considered in modeling systems that human estimation is influential [9]. The membership functions of fuzzy numbers are defined as:
The membership functions of responses are shown in Fig. 2.
In order to fit a fuzzy transition model, we should model logarithm transformation of possibility odds, in each time, with linearity dependent to predictors and previous responses as follows:
Using fuzzy least squares method, the model coefficients are estimated and the model is presented as follows:
Positive sign of group coefficient shows that intervention is effective.
Prediction for a new case. To see the applicability of the above model, consider for example, in a new case we have the following information, previous response: medium, control group in first visit. In such a case, the output predicted by the model is:
So, using the Extension Principle, we get
Accordingly, Fig. 3 represents the membership function of the predicted possibilistic odds.
Conclusion
Longitudinal study is an observational research method in which data is gathered for the same subjects repeatedly over a period of time. These studies frequently are used in medical researches. Three types of models are considered for studying the longitudinal data: the marginal or population averaged models, the random-effects or subject-specific models and the transition or response conditional models.
Relation to regression approach makes that the transition model is as the simplest model between them. On the other hand the non-precise observations are usually seen in these studies. Vague observation is one of the situations in which fuzzy modeling methods are suggested. Although there are a lot of studies in fuzzy Markov object, but in a few one of them, considering covariate dependence as a fuzzy Markov model is seen. In this article, we suggested a fuzzy transition (Markov) model in which the possibility of being a special status is not only related to previous responses, but also related to predictor variables. When response variable is reported as the possibility of having disease or linguistic terms, the fuzzy approaches could be provided suitable models. For estimating of coefficients of model, linear programming is used for possibility cases and least square method is the used for the linguistic term in our paper.
We applied our suggested model, in two real clinical studies. In lichen planus study based on fuzzy transition model, it can estimate the possibilitic odds of lichen planuse disease recuperating for each case. And in menorraghia study, the responses collected as linguistic term, therefore, we used least square method for estimation of model coefficients. Hereupon this paper, we show the different appearance of fuzzy transition model in comparison with Markov model in fuzzy environment. Further studies for extending this model for conditions with fuzzy coefficients are recommended.
Acknowledgements
We would like to show our gratitude to the Dr. Dalirsani and Dr. Kariman for allowing us to apply the data of their studies in this paper.
