Abstract
Background:
The Patient Empowerment through Predictive Personalized Decision Support (PEPPER) system provides personalized bolus advice for people with type 1 diabetes. The system incorporates an adaptive insulin recommender system (based on case-based reasoning, an artificial intelligence methodology), coupled with a safety system, which includes predictive glucose alerts and alarms, predictive low-glucose suspend, personalized carbohydrate recommendations, and dynamic bolus insulin constraint. We evaluated the safety and efficacy of the PEPPER system compared to a standard bolus calculator.
Methods:
This was an open-labeled multicenter randomized controlled crossover study. Following 4-week run-in, participants were randomized to PEPPER/Control or Control/PEPPER in a 1:1 ratio for 12 weeks. Participants then crossed over after a washout period. The primary end-point was percentage time in range (TIR, 3.9–10.0 mmol/L [70–180 mg/dL]). Secondary outcomes included glycemic variability, quality of life, and outcomes on the safety system and insulin recommender.
Results:
Fifty-four participants on multiple daily injections (MDI) or insulin pump completed the run-in period, making up the intention-to-treat analysis. Median (interquartile range) age was 41.5 (32.3–49.8) years, diabetes duration 21.0 (11.5–26.0) years, and HbA1c 61.0 (58.0–66.1) mmol/mol. No significant difference was observed for percentage TIR between the PEPPER and Control groups (62.5 [52.1–67.8] % vs. 58.4 [49.6–64.3] %, respectively, P = 0.27). For quality of life, participants reported higher perceived hypoglycemia with the PEPPER system despite no objective difference in time spent in hypoglycemia.
Conclusions:
The PEPPER system was safe, but did not change glycemic outcomes, compared to control. There is wide scope for integrating PEPPER into routine diabetes management for pump and MDI users. Further studies are required to confirm overall effectiveness.
Clinical trial registration:
Introduction
Insulin bolus calculators can assist people with type 1 diabetes (T1D) to calculate the amount of insulin required for meals to achieve optimal glucose levels. 1 Overall, standard bolus calculators have been associated with improved postprandial glucose values, reduced dosing errors, reduction in hypoglycemia fear, and improved confidence in diabetes self-management. 2 –4 However, HbA1c outcomes have been mixed with some studies showing a reduction, 5 –7 while others show no impact with only improved treatment satisfaction. 4 Moreover, some studies have shown improved glycemic control, but at the expense of increased hypoglycemia. 8
While some bolus calculators use a rule-based approach to consider parameters such as exercise, illness, and menstrual cycle, all commercially available bolus calculators lack the ability to automatically adapt over time to respond to an individual's needs or changes in insulin sensitivity, providing real-time adaptive bolus advice to the user. To overcome this limitation, we propose the use of case-based reasoning (CBR), an established artificial intelligence (AI) methodology. 9,10
CBR addresses the task of solving newly encountered problems by applying the solutions learned from solving similar problems encountered in the past (i.e., cases). Related past cases are stored into a case-base, and each case is defined by three components: the problem description, the problem solution, and the outcome. 10 For example, the problem may be described by parameters affecting glucose levels (such as exercise or alcohol consumption), the solution can be defined by the parameters of a bolus calculator (i.e., insulin sensitivity factor [ISF] and insulin to carbohydrate [CHO] ratio [ICR]) and the outcome can be the glycemic metric assessing glucose excursions. 11 A CBR system learns by adding each new case in the case-base, after a revision process, making the system adapt its solutions to changes in the individual.
Following the CBR methodology, an adaptive insulin recommender system has been developed and coupled with a safety system in the Patient Empowerment through Predictive Personalized Decision Support (PEPPER) system to provide personalized bolus advice for people with T1D. The safety system is unique to PEPPER and includes a 30-min glucose forecasting algorithm 12 to provide predictive glucose alerts, as well as predictive low-glucose suspend for pump users (partial and total). In addition, the PEPPER safety system provides personalized carbohydrate recommendations to improve glycemia, as well as a dynamic bolus insulin constraint. The carbohydrate recommendations refer to when the PEPPER system recommends an oral dose of carbohydrates to correct hypoglycemia while minimizing rebound hyperglycemia. This is personalized, based on the participant's weight, and a “carbohydrate correction factor”, which is adaptive. 13 The aim of the dynamic bolus insulin constraint module is to eliminate extreme bolus advice. If the recommended insulin dose is outside the limits of ICR ±30% or ISF ±30%, it is saturated to the corresponding upper or lower bound.
The PEPPER safety system has been previously evaluated and the findings were published by Liu et al. 12,13 The PEPPER adaptive insulin recommender system has also been previously published and validated in silico by Torrent-Fontbona and Lopez. 14 Formal usability assessment of the application has also been performed. 15 In this study, we clinically evaluate the safety and efficacy of the coupled components.
Study Design and Methodology
This is an 8-month multicentered, prospective, randomized controlled study. Participants (on multiple daily injections [MDI] and continuous subcutaneous insulin infusion [CSII]; insulin pump) were recruited from two clinical sites; Imperial College London in the United Kingdom and the Institut d'Investigació Biomèdica de Girona Hospital Universitari Dr. Josep Trueta (IDIBGI) in Spain. Ethics and device approvals were obtained from the relevant regulatory bodies at each of the sites (NCT03849755). All participants provided verbal and written informed consent.
Participants
Adult participants aged 18 years or older with T1D for at least 1 year using an intensified MDI regimen (defined as a basal bolus regime, not using mixed insulin) or CSII treatment for over 6 months were recruited. Participants had an HbA1c between 48 and 86 mmol/mol with intact awareness of hypoglycemia. In addition, all participants had completed structured education (either group or 1:1 sessions) and were competent at carbohydrate counting, using ICR and ISF to calculate mealtime insulin boluses.
Participants were excluded if they had an episode of diabetic ketoacidosis (DKA) or severe hypoglycemia requiring third-party assistance within the preceding 6 months, used regular paracetamol, were pregnant or intending pregnancy, breastfeeding, had active malignancy or endocrinopathy, abnormal renal function or liver cirrhosis, or had a macrovascular complication in the past year.
System architecture
The PEPPER architecture has been previously described. 13,16 To summarize, the PEPPER system offers a dual architecture to cater for people with T1D on MDI or CSII, the latter via the Cellnovo insulin pump (Fig. 1). Participants used the PEPPER user app, which is integrated into a hand-held device. For pump users this was Cellnovo's own handset running an Android OS. For MDI users, this was an Android smartphone (Google Nexus 5x). In addition, users wore real-time continuous glucose monitoring (CGM; Dexcom G5, CA), which communicates to the hand-held device via Bluetooth. The CGM data were directly imported through open source software, xDrip+, into the Android native PEPPER app.

Overview of the components of the PEPPER system architecture.
Physical activity before a meal was determined through an activity monitor (Mi Band 1s; Xiaomi, China). The activity monitor records the step count performed in the 10 h before the meal. The data are coded into four personalized categories based on each individual's mean step count. The categories included are as follows: “none” < 1000 steps, “low” < (2000 + individual mean/2), “mild” < (5500 + individual mean/2), and “extreme” > (5500 + individual mean/2).
Users logged additional lifestyle data such as carbohydrate intake, meal absorption, future physical activity, alcohol consumption, stress, fever, digestive illness, and hormone cycle for women (if present), through the user interface of the PEPPER app. The default setting was “none” if not specifically marked.
All collected data, including insulin information, were remotely and automatically uploaded and stored to a remote server via wireless internet connection or mobile data. The web-based user interface (i.e., the PEPPER clinical platform) enables the supervision of the bolus insulin adjustments proposed by the CBR algorithm. Hence, the user is presented with real-time adaptive insulin bolus recommendations on their handset, taking into account the revised ICR and ISF.
Insulin pen devices with half-unit increments were provided to all participants (Echo pen [Novo Nordisk] for insulin aspart, the Junior Star [Sanofi] for insulin glulisine, or the HumaPen Luxura HD [Lilly] for insulin lispro) as insulin bolus advice was provided to the nearest 0.5 units. When the user accepts insulin recommendations, the data are automatically uploaded into the remote server. If the user rejects the bolus advice, the app requests the user to input the insulin administered. The CBR uses the user insulin dose for revision of cases.
During the run-in and control periods, insulin bolus recommendations were based on a standard bolus calculator using the formula:
where CHO (g) is the amount of estimated carbohydrate, G c (mmol/L) is the blood glucose measurement premeal, G sp (mmol/L) is the blood glucose target, ICR (g/U), ISF [mmol/(L·U)], and IOB (units) is the insulin-on-board.
The IOB was calculated using linear decay expressed as:
where
Trial intervention and study procedures
Visit 1: consent, training, and study enrolment
At study enrolment, the investigators confirmed the participants' eligibility for the study, obtained written informed consent, recorded demographic information, and collected relevant medical and drug history. Study participants underwent electrocardiography, urine analysis, height, and weight. Urine and blood samples were collected for laboratory tests (HbA1c, urinalysis, biochemistry panel with lipids, full blood count and pregnancy testing for women of child-bearing age).
Participants completed validated psychological study questionnaires (Diabetes treatment satisfaction questionnaire [DTSQs], problem areas in diabetes [PAID], and diabetes quality of life [DQOL]).
Participants received the PEPPER study system and completed a 4-week run-in period using real-time CGM (Dexcom G5 transmitter and sensor) to familiarize themselves. For the run-in period, the adaptive insulin recommender and safety system were disabled, and a standard bolus calculator was used. Dexcom G5 sensors were changed every 7 days as per manufacturer's guidance (or sooner in event of sensor failure). Standard CGM alarms were received through xDrip+, which ran in the background of the PEPPER app. Participants were instructed to test their capillary blood glucose every 12 h for calibration or if symptoms of hypo- or hyperglycemia, in the event of sensor failure or if the sensor glucose was out of the desired range.
Participants on insulin pump therapy were switched to the Cellnovo pump (Cellnovo Ltd., United Kingdom) and trained on its use. Differences from their own home pump were discussed, including important aspects such as calculation of insulin on board and correction boluses. Additional components of training included site initiation, cartridge/priming procedures, setting up the pump, navigation through menus, and bolus procedures, including how to stop a bolus delivery. User manuals for the PEPPER system and the Cellnovo insulin pump were provided.
Structured education revision was provided, with basal insulin dose, ICR, and ISF reviewed and adjusted if required. Participants were requested to obtain bolus advise (via the standard bolus calculator) using the PEPPER handset, as well as use it for any dose correction or additional carbohydrate intake.
Visit 2: randomization
At visit 2, enrolled participants were randomized to PEPPER/Control or Control/PEPPER in a 1:1 ratio by an online randomization tool. The groups were stratified by insulin delivery modality (CSII or MDI).
For participants in the PEPPER group, the PEPPER forecasting algorithm and the adaptive insulin recommender were activated remotely by using the PEPPER clinical server. CGM alarms based on the forecasting algorithm 12 and alarms based on actual CGM readings would be received by the user directly through the PEPPER app.
For participants in the Control group, the standard bolus calculator integrated into the PEPPER handset was used, with the adaptive insulin recommender system disabled. The PEPPER safety system was also switched off, but participants in the Control group continued to receive standard CGM alarms (activated through xDrip+ 17,18 ). Alert and alarm settings were initially standardized, but these could subsequently be altered by participants, except for the hypoglycemia alarm, which was hard-coded at 3.9 mmol/L (70 mg/dL) due to safety reasons. Alert thresholds for hypoglycemia and hyperglycemia were 4.4 mmol/L (80 mg/dL) and 14.0 mmol/L (250 mg/dL), respectively.
Target blood glucose on the PEPPER bolus calculator was standardized to 5.5 mmol/L (99 mg/dL), however, this target could subsequently be individualized if needed.
Participants were able to contact the study team for clinical and technical support. Where necessary, glucose data could be reviewed remotely by the PEPPER clinical team.
Visit 3: review
Any technical issues were reviewed at the visit and adjustments to settings made if required.
Visit 4: end of intervention phase 1
Following completion of 3 months in the intervention phase, each group reverted to their standard therapy for 3–4 weeks (washout period). They then crossed over to the other group. Nonfasting venous blood samples were taken for HbA1c, weight, and basal insulin requirements were recorded, as well as study questionnaires completed. If participants were in the PEPPER group first, they were not aware of their ICR and ISF values at the end of the intervention phase, and therefore reverted back to their initial parameters from the run-in period.
Visit 5–7: start of intervention phase 2
Visits 5–7 follow the same structure as visits 2–4. Participants switch on the PEPPER device and are crossed over from PEPPER/Control to Control/PEPPER. For participants crossing over to the PEPPER intervention arm second, the adaptive insulin recommender system with CBR was initialized at the start of the second PEPPER phase. Thus, the case-base was built up over the intervention period (12 weeks) only for each participant.
At the end of the study (i.e., visit 7—end of 8 months), repeat blood tests to assess HbA1c were performed and questionnaires similar to that at baseline and visit 4 were completed.
PEPPER case-base revision
The PEPPER case-base was revised by clinicians twice weekly throughout the intervention period (PEPPER with CBR). This was a semiautomated process completed remotely via the secure web server with all data automatically uploaded. The reviewing clinician rejected case revisions by the CBR algorithm if a meal or insulin bolus was taken within the 5-h postprandial period; if CGM sensor data quality affected the postprandial glucose curve; if IOB was >4 units, or if the carbohydrate content of a meal was <20 g (i.e., snack).
Once cases were accepted, these would be stored in the CBR case-base for future use. When users request an insulin dose recommendation, the CBR algorithm looks for a similar case in the case-base and computes the recommended dose based on the retrieved ICR and ISF, which is then presented on the user's handset. Then, the CBR creates a new case and revises the glycemic outcome corresponding to the recommended dose by analyzing the postprandial CGM curve. If the outcome is suboptimal (i.e., above or below glucose target range), then the new ICR and ISF are revised accordingly. A comprehensive description of the PEPPER CBR process and algorithms, including methods for case retrieval, reuse, and revision, has been previously published. 14
The parameters used by the CBR algorithm to determine similar cases were time of day, meal absorption rate, physical activity, alcohol quantity, tiredness, stress, fever, digestive illness, and menstrual cycle for women. Past physical activity was recorded through the activity monitor and future activity was based on the user log.
Primary and secondary outcomes
The primary outcome was the difference in change in percentage (%) time in range (TIR) (3.9–10.0 mmol/L [70–180 mg/dL]) between the intervention arm that receives the PEPPER safety system with adaptive bolus advice and the Control arm. Endpoint assessment was not blind to the study intervention arm allocated.
Secondary endpoints were % time spent in euglycemia (3.9–7.8 mmol/L; 70–140 mg/dL), hypoglycemia (<3.9 mmol/L; <70mg/dL), and hyperglycemia (>10.0 mmol/L; >180 mg/dL), number of episodes of serious hypoglycemia (defined as a sensor glucose <3.0 mmol/L; <55 mg/dL) for >20 min, episodes of hypoglycemia within 5-h postprandially, severe hypoglycemia (defined as a hypoglycemia event requiring third-party assistance), postprandial mean area under the curve (AUC) at 5 h (expressed as mmol/L min), and glycemia risk and variability measures.
Evaluated glycemic variability measures are standard deviation (SD), coefficient of variation, mean amplitude of glycemic excursions, continuous overall net glycemic action, mean of daily differences, lability index, glycemic variability percentage, mean absolute glucose change per unit time, glycemic risk assessment diabetes equation (GRADE), M-value, average daily risk range, J-index, personal glycemic status, index of glycemic control, risk index, low blood glucose index, and high blood glucose index. GRADE score is also reported as %GRADE hypoglycemia, %GRADE euglycemia, and %GRADE hyperglycemia representing percentages of GRADE scores attributable to glucose values <3.9 mmol/L (<70 mg/dL), and between 3.9–7.8 mmol/L (70–140 mg/dL) and >7.8 mmol/L (>140 mg/dL), respectively. Measures of glycemic variability were computed using EasyGV (v10.0) software. 19
Other secondary endpoints were HbA1c, change in weight (kg), basal insulin dose requirements, and diabetes-specific measures of quality of life (PAID, DQOL, DTSQs). Safety system outcomes were also assessed for incidents of low and high glucose alarms, alerts, carbohydrate recommendations, low-glucose suspend, and bolus insulin saturation. Outcome measures of the insulin recommender included the number of bolus recommendations, and proportions of those that were accepted or rejected by user. The number of cases revised by clinicians were also evaluated.
Statistical considerations
Glycemic outcome measures were analyzed at baseline (for 28 days during the run-in period) and using data from the last 28 days from each intervention period. Changes from baseline and between group differences were assessed. The primary analysis was conducted using the intention-to-treat (ITT) principle. To be included in the ITT analysis, participants had to have completed the full 28-day run-in period. A subanalysis was also done to evaluate differences in outcomes between the pump and MDI groups.
Analysis was also then completed with the per protocol population (PPP), which consisted of those participants in the ITT population who completed the study with no significant deviations from the planned protocol procedures. The study was terminated early for select pump participants due to the withdrawal of pump manufacturer, Cellnovo, withdrawing from market and insufficient supplies. For participants to be included in the PPP analysis, they had to have fully completed the study (both control and intervention phases) and had to have used the PEPPER insulin recommender system. Results from all the subanalysis can be found in the Supplementary Material.
Safety system measures were reported for the PEPPER and Control groups. While the safety system was not enabled in the Control group, the algorithm was running in the background to allow comparisons to be made. Data from the run-in period for the safety system were not used in the analysis because the algorithm was optimized using the run-in data itself.
For outcomes related to the adaptive insulin recommender, these have been reported for the full PEPPER intervention period (i.e., 12 weeks) to reflect the number of cases that contributed to the final glycemic outcome data.
The data were tested for normal distribution using quantile plots and the Shapiro–Wilk test of normality. Differences between the PEPPER and Control groups were tested for significance using the Wilcoxon Rank sum test. For the PPP analysis, the Wilcoxon matched-pairs signed-rank was used for analysis for differences between the intervention groups. Where appropriate, the nonparametric Spearman rank test was performed for correlation.
Data have been presented as medians (interquartile ranges [IQR]), unless otherwise stated. Statistical tests were two-tailed and results were considered statistically significant if P < 0.05. Statistical analysis was performed using Stata version 15 (StataCorp, College Station, TX).
With 50 participants (25 participants on MDI and 25 participants on CSII), a 0.57 SD difference can be demonstrated as significant with α of 0.05% and 80% power (two-tailed). Based on a previous pilot study (unpublished data), which showed a population mean (SD) % TIR (3.9–10 mmol/L) of 61.6 (18.8) %, we aimed to detect a 10.7% (i.e., = 0.57 × 18.8) difference in % TIR to demonstrate significance between the intervention and control in this study. Assuming that about 10% of participants would drop out, we calculated that a sample size of 55 participants would be needed.
This sample size is deemed comparable to other technology transfer studies, and is a realistic number for recruitment, providing robust safety data for a new technology. Glycemic variability and outcome measures on the safety system and insulin recommender were secondary, and therefore, the study was not powered to detect statistical differences in secondary outcomes between groups.
Results
Baseline demographics
Fifty-eight participants were enrolled, of which 54 participants completed the run-in period, making up the ITT study population (3 declined to participate, 1 did not meet inclusion criteria due to high HbA1c; Fig. 2). Twenty-six participants (48.1%) were male and 28 (51.9%) were female. The median (IQR) age was 41.5 (32.3–49.8) years with diabetes duration of 21.0 (11.5–26.0) years and HbA1c of 61.0 (58.0–66.1) mmol/mol. Baseline characteristics of the participants are summarized in Table 1. There was no statistical difference between groups at baseline.

Consort flow diagram. The ITT study population included 54 participants completing the run-in period. For the PPP analysis, 33 participants were included. Reasons for exclusion were as follows: early study termination due to Cellnovo withdrawal (n = 10); participants declining to participate (n = 8); SAE (n = 2); completion of study but nonuse of PEPPER adaptive bolus advisor (n = 1). ITT, intention-to-treat; PPP, per protocol population.
Baseline Demographics for Intention-to-Treat Population
PEPPER/Control group started with the PEPPER intervention and control/PEPPER started with control.
Results are expressed as median (IQR)/n (%).
BMI, body mass index; CGM, continuous glucose monitoring; CSII, continuous subcutaneous insulin infusion; DKA, diabetic ketoacidosis; IQR, interquartile range; ITT, intention-to-treat; MDI, multiple daily injections; PEPPER, Patient Empowerment through Predictive Personalized Decision Support; TSH, thyroid stimulating hormone.
Following enrolment, a further four participants withdrew. Fifty participants were randomized to PEPPER (n = 24) or Control (n = 26). Four participants then withdraw (PEPPER [n = 2] and Control [n = 2]) and one was withdrawn due to a serious adverse event. In total, the first crossover phase was completed by 45 participants (22 PEPPER, 23 Control). Ten participants were withdrawn before the second crossover phase due to loss of Cellnovo device support (PEPPER [n = 5] and Control [n = 5]). One further participant withdrew due to a serious adverse event and 17 participants crossed-over to PEPPER and 17 to Control. All (n = 34) completed the second intervention phase. In total, for the ITT, 39 participants allocated to PEPPER and 40 allocated to Control completed the intervention. For the PPP analysis, one participant was withdrawn for not using the PEPPER adaptive bolus advisor (n = 1). Thus, 33 participants fully completed both intervention phases and used the PEPPER adaptive insulin recommender system, making up the PPP analysis. The flow diagram (Fig. 2) summarizes recruitment and withdrawals.
Glycemic outcomes
For the primary outcome, no difference in % TIR 3.9–10.0 mmol/L (70–180 mg/dL) with the PEPPER system compared to Control (62.5 [52.1–67.8] % vs. 58.4 [49.6–64.3] %) was observed between the two groups (P = 0.27; Table 2). For clinically significant hypoglycemia, as well as % time in hyperglycemia for thresholds >10.0 mmol/L (>180 mg/dL) and >15 mmol/L (>270 mg/dL), no significant differences were observed between the PEPPER and Control groups.
Intention-to-Treat Pooled Analysis of Glycemic Outcomes
ADRR, average daily risk range; AUC, area under the curve; CONGA, continuous overall net glycemic action; CV, coefficient of variation; GRADE, glycemic risk assessment diabetes equation; GVP, glycemic variability percentage; HBGI, high blood glucose index; IGC, index of glycemic control; LBGI, low blood glucose index; LI, lability index; MAG, mean absolute glucose change per unit time; MAGE, mean amplitude of glycemic excursions; MODD, mean of daily differences; PGS, personal glycemic status.
In the PPP analysis, similar observations were noted to the ITT cohort, with no significant differences observed between the PEPPER system and Control. PEPPER demonstrated +4.2 (−4.9 to 10.4) % TIR compared to Control +1.2 (−7.2 to 7.3), P = 0.10 (Supplementary Material Table 1).
In the subanalysis based on modality, the pump participants observed greater increase in % TIR compared to the MDI cohort, although there were no significant between-group differences (Supplementary Material Table 2). No significant differences were observed for glycemic outcomes stratified by day (07.00–23.00 h) and night (23.00–07.00 h), or by country of recruitment (U.K. vs. Spain; Supplementary Material Tables 3 and 4). In addition, the PEPPER system with predictive low glucose suspend was not associated with improved nocturnal hypoglycemia for pump users. Nocturnal % TIR 3.9–10.0 mmol/L (70–180 mg/dL) with the PEPPER intervention was 60.8% (49.8–65.4) % versus Control 55.6% (45.3–67.6) % (P = 0.66). For clinically significant hypoglycemia, nocturnal %time in hypoglycemia <3.0 mmol/L (<54 mg/dL) with the PEPPER intervention was 0.1% (0.0–0.4) % versus Control 0.3% (0.0–1.6) % (P = 0.18).
Quality of Life Questionnaire Scores in the Intention-to-Treat Population
DQOL, diabetes quality of life (based on Likert 5-point scale from 1 to 5 with high scores indicate dissatisfaction, frequent impact, or frequent worry); DTSQs, diabetes treatment satisfaction questionnaire (scored out of 36 indicating very satisfied); DTSQs perceived frequency of hypo- and hyperglycemia are scored from 0 (none of the time) to 6 (most of the time); PAID, problem areas in diabetes (scored out of 100 with higher scores indicating distress).
Run-in n = 53.
Run-in n = 53, control n = 39.
P < 0.05 indicates significance.
Intention-to-Treat Pooled Analysis of Safety System Outcomes for Last 28 Days of Each Intervention Phase
CHO recommendations refer to a personalized oral dose of carbohydrates (based on weight) to revert hypoglycemia and minimize rebound hyperglycemia. % partial and % total suspension reported as proportions of the total time frame (the remaining time was not in suspension). Data presented as median (IQR) per participant.
No safety system data were available for 1 participant from the ITT analysis.
CHO, carbohydrate.
Glycemic variability and other secondary outcomes
For GV measures, there were no significant differences between the PEPPER and Control groups (Table 2). The number of episodes of hypoglycemia declined from baseline for both groups, but between the PEPPER and Control groups, no significant differences were observed.
For AUC, postprandial glucose or HbA1c, there were no differences between the two groups. Body weight and basal insulin did not significantly change between the treatment groups or from baseline.
Quality of life
At baseline, psychological questionnaires demonstrated that participants had a good quality of life without significant diabetes distress. The median PAID score at baseline was 18.8 (11.3–31.3). No significant differences were observed for diabetes distress, overall diabetes treatment satisfaction, the global quality of life scores or its subsections. Within the DTSQs questionnaire, there was an increase in perceived frequency of hypoglycemia with PEPPER in comparison with the Control group (P = 0.03, Table 3). This persisted in the PPP analysis (P < 0.01; Supplementary Material Table 5).
Safety system outcomes
No safety data were obtained from the handset of one participant. For evaluation of the safety system, although the safety system was not enabled in the Control group, the algorithm was running in the background without effect. Safety system outcomes were reported for the last 28 days of each intervention phase.
No significant differences were observed between groups for number of alerts, alarms, and carbohydrate recommendations (Table 4). Similarly, for pump users, the numbers of automated pump suspensions (total and partial) were not significantly different between the two groups. The total bolus insulin per participant per day also remained similar in the two groups: 32.8 (23.9–53.5) units/day in the Control group versus 32.6 (22.1–51.0) units/day in the PEPPER group (P = 0.96).
The root mean square error of the 30-min predictive glucose forecasting PEPPER algorithm 12 was 1.6 (1.4–1.9) mmol/L (29.0 [25.3–34.2] mg/dL). The overall mean absolute relative difference between the forecasting algorithm and the actual sensor glucose was 14.4 (12.6–15.9) %.
A total of 303 revisions were constrained and saturated by the safety system, that is, 5 (2–14) % of all proposed bolus recommendations. Of these, 85% were for hyperglycemia.
Adaptive insulin recommender outcomes
During the full PEPPER intervention period (i.e., 12 weeks), a total of 14,723 cases of meal bolus recommendations were imported to the revision platform, with an average of 4.0 (3.3–5.5) cases per participant/day. There was no decay in the number of bolus recommendations requested per day over the study period. A full analysis of the adaptive insulin recommender outcomes and individual case parameter usage can be found in Supplementary Material Table 6.
Out of all imported bolus advice, 46 (26–58) % of cases (i.e., meal scenarios) were approved by the physician and uploaded to the online platform to be integrated into the learnt case-base. The remaining cases were manually or automatically rejected by the system based on the criteria stated above in the methods.
The majority of bolus recommendations were accepted by each participant, 87 (65–96) %. For bolus recommendations that were rejected by the user, the proportion of doses reduced (from the total recommendations provided) were 59 (28–84) %. The median absolute difference in dose was 1.5 (1.0–2.0) units.
No correlation was observed between the number of cases per participant and the % TIR spent by participants (r = 0.371, P = 0.15).
Adverse events
There were two serious adverse events in the Control group; one participant had an episode of severe nocturnal hypoglycemia and another participant developed mild DKA requiring hospital admission, which was precipitated by denatured insulin with urinary tract infection. All resolved without adverse sequelae. Two other adverse events were reported, including appendicitis requiring appendicectomy and a fall resulting in fractured fifth metatarsal. None was deemed related to trial intervention with PEPPER safety system or AI algorithm.
Discussion
This study demonstrates proof of concept, safety, and feasibility of the PEPPER system with an AI-derived adaptive bolus calculator for people with T1D. While all times in ranges moved toward improvement in the intervention group, none of the differences observed between the PEPPER and Control arms reached statistical significance.
To our knowledge, this is the largest randomized controlled trial reporting clinical outcomes of a personalized, adaptive bolus calculator for use with both MDI and CSII therapy. Unfortunately, the study had to be terminated early due to Cellnovo withdrawing from the market. Ten pump participants in the United Kingdom had to be withdrawn prematurely as no further pump supplies were available, limiting the power and outcomes of the study. While no final conclusions about the impact of the adaptive intervention can be drawn from the study outcomes, the results are encouraging in moving toward improvement in all outcome measures for times in range and glycemic variability.
The change in outcomes compared to baseline may, in part, be attributable to the effect of CGM alone, but this effect is mitigated by the crossover study design. A high early drop-out rate was observed, and one of the key reasons stated was the complexity of the system requiring multiple components. Individuals were required to carry a handset (an additional device to their own mobile handset), CGM, an activity monitor and glucometer, as well as for pump participants requiring additional pump supplies. This reflects the nature of the intervention in a clinical investigation environment while a final implementation can be achieved on a single handset, which may be the user's smartphone. Other reasons for high early dropout rates, which were observed mostly in participants using CSII, were due to challenges with the Cellnovo pump, in particular, frequent cartridge ejection or wireless signal loss.
In terms of quality of life, participants reported higher perceived hypoglycemia with the PEPPER system despite no objective difference in time spent in hypoglycemic ranges. This did not affect overall diabetes treatment satisfaction; however, it may be a result of the additional forecasted hypoglycemia alerts received while using the PEPPER system. Overall, the recruited participants had good baseline glycemia with good quality of life, without significant diabetes distress, which may limit differences emerging between the two groups.
Limitations associated with the PEPPER CBR-based insulin recommender include the potential need for additional time required for the adaptive insulin recommender system to be effective. The algorithm is likely to be most beneficial to individuals maintaining regular work patterns, rather than shift workers. In the study, two shift workers were included, and they may not have derived the benefits of the learning algorithm. The algorithm only adapts for bolus insulin and assumes that the basal insulin has been optimized. Alternative methods in basal insulin adaptation could potentially be implemented. 20 Furthermore, the system is dependent on meal scenarios where the user has not ingested a significant snack or taken an insulin bolus correction within 5 h of a meal for revision. This may particularly be an issue for individuals tending to snack or eat their meal in phases. For pump users, the bolus advisor is unable to provide extended phase or dual wave bolusing, which is otherwise incorporated into most insulin pumps and is a clear development requirement for adaptive bolus calculators.
Currently the PEPPER CBR adaptive insulin recommender system is dependent on manual revisions using the online platform. With clear rejection criteria defined, this process can be automated; something that would be required for adoption at scale for use in people living with T1D.
Other adaptive bolus insulin recommender systems include the Advanced Bolus Calculator for Diabetes (ABC4D) project and also use CBR for the adaptive algorithm. Reddy et al. published a 6-week prospective nonrandomized single-arm pilot study with 10 adult participants using MDI. More than a twofold reduction in the number of postprandial hypoglycemic episodes was observed, however, the study was not sufficiently powered to show significance. 21 Currently, a randomized controlled trial over 8 months with 40 participants is underway (NCT03963219). Tyler et al. evaluated the K-nearest neighbors decision support system (DSS) recommender engine with physician advice within a feasibility study of 16 people with T1D over 4 weeks. 22 An improvement in overnight hypoglycemia was observed, however, this was also associated with a significant reduction in time in target range overnight (P = 0.034). It is interesting to note that for prandial mealtime dosing, a reduction in hypoglycemia during the daytime was not observed. 22 This system is currently being evaluated in a larger clinical trial integrated within a mobile app called DailyDose. 23
Adaptive basal and bolus insulin recommender systems are also being developed. Breton et al. assessed University of Virginia Decision Support System (UVA DSS) in a prospective study of 24 subjects with T1D using either CSII or MDI therapy with carbohydrate counting. 24 Participants completed two randomized crossover 48-h visits with either usual care or UVA DSS, which provided automated insulin titration, bolus calculation, and carbohydrate treatment advice. The study found that use of the DSS significantly reduced hypoglycemia, as well as glycemic variability. 24 Another automated DSS is the DreaMed Advisor Pro (DreaMed Diabetes Ltd., Petah Tikva, Israel), which is the first approved system by the FDA to provide insulin therapy adjustment recommendations to physicians, for people with T1D on insulin pump therapy, and uses MD-Logic. The Advisor Pro has been tested clinically in a single-center feasibility study with 13 participants. No difference was observed for time spent in range or hypoglycemia. 25 The algorithm effectiveness is being further tested in a randomized controlled trial to fully evaluate the system (NCT03003806).
Despite the limitations of the PEPPER study, the data are promising and suggest that, in a powered study over a sufficient period of time, an adaptive bolus advisor may be able to improve glycemic outcomes. There is wide scope for integrating the PEPPER system into routine diabetes management for both insulin pump users and those on MDI, and the recent availability of connected insulin delivery pens and wearable sensor technology may further enable integration of devices. Worldwide, most people with T1D use MDI regimes and for the foreseeable future, closed-loop control may not be an option for everyone. Systems using AI integrated into an app, such as PEPPER, may enable personalized decision support and may be additionally used alongside automated insulin delivery in future systems to improve times in ranges around the challenges of food and activity.
Conclusion
In conclusion, the PEPPER system was safe but did not change glycemic outcomes, compared with control. A larger powered randomized controlled trial is required to confirm overall effectiveness.
Role of the Funding Source
The funder of the study had no role in study design, data collection, data analysis, data interpretation, or writing of the report. The corresponding author had full access to all the data in the study and had final responsibility for the decision to submit for publication.
Footnotes
Acknowledgments
The authors thank all the study participants for their valuable time and commitment. Infrastructure support in the United Kingdom was provided by the NIHR Imperial Biomedical Research Centre and the NIHR Imperial Clinical Research Facility. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR, or the Department of Health and Social Care. Infrastructure support in Spain was provided by the Diabetes, Endocrinology and Nutrition Unit (UDEN), Hospital Universitari Dr Josep Trueta, Institut Català de la Salut. The authors thank Wifredo Ricart, Lidia Sojo, Elisabet Costa, Eduardo Esteve, who helped with the recruitment of participants, as well as the nursing and clinical staff of the UDEN, and particularly to Oscar Rovira (CIBEROBN) and Emili Loshuertos for their work.
The work outlined in this article was developed with the support of the awarded distinction by the Generalitat de Catalunya 2017 SGR 1551, and the European Union's Horizon 2020 research and innovation program under grant agreement No 689810 (PEPPER).
Author Disclosure Statement
M.R. has received honoraria for advisory board participation from Dexcom and Roche Diabetes. N.O. has received honoraria for speaking and advisory board participation from Abbott Diabetes, Dexcom, Medtronic Diabetes, and Roche Diabetes.
Funding Information
This project has received funding from the EU Horizon 2020 program, grant agreement number 689810.
Supplementary Material
Supplementary Material Table 1
Supplementary Material Table 2
Supplementary Material Table 3
Supplementary Material Table 4
Supplementary Material Table 5
Supplementary Material Table 6
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
