Skip to main content

Development of caesarean section prediction models: secondary analysis of a prospective cohort study in two sub-Saharan African countries



Caesarean section is recommended in situations in which vaginal birth presents a greater likelihood of adverse maternal or perinatal outcomes than normal. However, it is associated with a higher risk of complications, especially when performed without a clear medical indication. Since labour attendants have no standardised clinical method to assist in this decision, statistical tools developed based on multiple labour variables may be an alternative. The objective of this paper was to develop and evaluate the accuracy of models for caesarean section prediction using maternal and foetal characteristics collected at admission and through labour.


This is a secondary analysis of the World Health Organization’s Better Outcomes in Labour Difficulty prospective cohort study in two sub-Saharan African countries. Data were collected from women admitted for labour and childbirth in 13 hospitals in Nigeria as well as Uganda between 2014 and 2015. We applied logistic regression to develop different models to predict caesarean section, based on the time when intrapartum assessment was made. To evaluate discriminatory capacity of the various models, we calculated: area under the curve, diagnostic accuracy, positive predictive value, negative predictive value, sensitivity and specificity.


A total of 8957 pregnant women with 12.67% of caesarean births were used for model development. The model based on labour admission characteristics showed an area under the curve of 78.70%, sensitivity of 63.20%, specificity of 78.68% and accuracy of 76.62%. On the other hand, the models that applied intrapartum assessments performed better, with an area under the curve of 93.66%, sensitivity of 80.12%, specificity of 89.26% and accuracy of 88.03%.


It is possible to predict the likelihood of intrapartum caesarean section with high accuracy based on labour characteristics and events. However, the accuracy of this prediction is considerably higher when based on information obtained throughout the course of labour.

Peer Review reports

Plain English summary

Caesarean section (CS) is a surgical procedure indicated in situations in which vaginal delivery presents a higher likelihood of adverse maternal and/or perinatal outcomes than normal. In principle, it should not be performed without a clear medical indication, i.e., a clear likelihood that it will improve outcomes. However, the CS rates have continuously increased, which is a worrying scenario since CS can lead to complications as well. The situation is worse when CS are performed unnecessarily because then, it presents no clear benefit and only potential damages. To improve this scenario, it is important that attendants have tools that support the decision toward the most appropriate mode of birth. Statistical models can be used as part of these tools. Therefore, the aim of this study was to develop statistical models for CS prediction based on maternal and foetal characteristics. We considered admission and labour data from 8957 pregnant women from two African countries (Nigeria and Uganda). We developed a set of prediction models, which showed high performance on predicting CS, mainly when the labour characteristics are considered. Based on these results, we believe these models can be useful adjunct tools in helping medical decision-making.


Since its introduction in clinical practice, the CS rates have continuously increased across the world [1, 2]. Recent estimates suggest that the CS rates have doubled between 2000 and 2015 in some regions of the world [3]. This trend is hardly justified, since CS rates greater than 10% have not been associated with the reduction of maternal and perinatal mortality [4, 5]. In addition, elective CS is associated with higher rates of mortality and complications in the short and long term [6,7,8,9,10]. These risks are greatest in countries with low resources and high fertility rates, such as those in sub-Saharan Africa [11,12,13].

Early identification and appropriate management of women at risk of CS may improve outcomes [14]. However, labour attending teams have no standardized method to help to decide when CS is most appropriate in situations when the medical indication is not clear-cut [15, 16]. Sociodemographic, gestational, peri-partal and cultural factors may influence the choice of mode of childbirth and vary according to the development levels of each country [11, 16]. Therefore, to improve labour outcomes, it is important to develop tools based on multiple variables that contribute to decision-making processes before the woman reaches an increased risk of complications. Proper use of these tools could avoid unnecessary CS and emergency surgeries, reduce rates of severe maternal and perinatal outcomes.

In order to meet these objectives, some researchers have proposed statistical models that take into account the baseline and admission characteristics in the labour of a pregnant woman and the foetus [17,18,19,20,21,22,23,24]. Most of these models considered data from only one country such as Canada [22], the United States [18, 21], Scotland [23], Ireland [17], England [19] and Spain [20]. We identified only one CS prediction model developed with information of pregnant women from several countries including both low- and high-income settings; however, this model also used static information collected at labour room admission [24]. Harper and colleagues proposed a prediction model considering intrapartum information [25] from deliveries at a University Medical Centre in United States. As far as we know, few researches present models built with low-resource country data using labour related characteristics. Considering potential differences in the characteristics of the population and the response capacity of health facilities (among other factors), it would be advisable the development of such models based on information generated in low-resource settings in order to reduce indirectness and improve relevance and applicability of the tools.

In this article, we consider the hypothesis that statistical models can predict the occurrence of CS and that this prediction may be more accurate using intrapartum variables. Therefore, our objective was to develop CS prediction models and evaluate their accuracy in two sub-Saharan African countries.


We conducted a secondary analysis of the database from the Better Outcomes in Labour Difficulty (BOLD) project, a World Health Organization’s multicentre study aimed at accelerating the reduction of maternal, foetal and neonatal mortality and morbidity related to intrapartum. The researchers collected this database from the prospective cohort performed as part of the BOLD project. A methods paper presents a detailed description of the project study protocol [26]. In summary, BOLD researchers collected data from women admitted for labour care in 9 and 4 hospitals in Nigeria and Uganda, respectively, from 2014 to 2015. To be selected, hospitals must have a minimum of 1000 births per year, be the major health care facility in its region, and not a primary care unit. Intrapartum care was provided by skilled birth attendants, with stable access to CS, augmentation of labour, assisted vaginal delivery and good obstetric care practices [27, 28].

BOLD cohort considered eligible women admitted for spontaneous or induced vaginal delivery, with a single foetus, during the first stage of labour (both in the latent phase and in the active phase), with cervical dilatation less than 7 cm. The following women were excluded: pregnant women diagnosed with foetal death, cervical dilatation ≥7 cm, multiple gestation, gestational age less than 34 weeks, elective or pre-labour CS, with an indication of emergency CS or laparotomy on admission, failed induction of labour, false labour, unemancipated minors without a legal guardian, and women who were not able to give consent. Trained nurses carried out the recruitment process.

The main outcome of the present analysis was the occurrence of CS and the predictors are the maternal characteristics of admission and intrapartum variables evaluated in the first and second stages of labour. As the CS can be objectively measured, we reduce the potential detection bias in the context of a multicentre study. The BOLD project also recorded the dates and times of the interventions performed, as well as maternal and perinatal outcomes. The research team used a standardized collection form developed for the BOLD project and they collected the records during childbirth according to routine obstetric care protocols of the hospitals. They calculated the sample size based on the set of maternal and perinatal outcomes defined in the BOLD and considered 20 possible predictors. Based on initial assumptions, a minimum required sample size of 7812 women was calculated. The BOLD research team was concerned with avoiding potential biases throughout the development of the project, in steps such as: choosing the study design, developing the data collection instrument and managing the data to ensure its quality [26].

To describe the demographic characteristics of the women in the study, we present mean and standard deviation of quantitative variables and percentage and absolute frequencies for qualitative variables. For the study of the main hypothesis, we developed logistic regression models in which the dependent variable was CS, and the independent variables were baseline (fixed) and intrapartum (dynamic) measurements of the pregnant women. The logistic regression equation based in one independent variable is given by

$$ CSProbability=\frac{e^{\beta_0+{\beta}_1{X}_1}}{1+{e}^{\beta_0+{\beta}_1{X}_1}}, $$

where β0 is the equation intercept and β1 is the coefficient related to the independent variable X1 [29]. To calculate the probability of CS for a specific woman, when the independent variable is continuous, simply replace X1 with the value observed for that woman. When the independent variable is categorical, X1 represents the presence of a characteristic and must be replaced by 1 or 0 if the characteristic is present or absent, respectively. If the prediction model has more than one independent variable, the equation is analogous:

$$ CSProbability=\frac{e^{\beta_0+{\beta}_1{X}_1+{\beta}_2{X}_2+{\beta}_3{X}_3+\dots }}{1+{e}^{\beta_0+{\beta}_1{X}_1+{\beta}_2{X}_2+{\beta}_3{X}_3+\dots }}. $$

We present three types of models, which differ from one another in terms of the moment of recording the intrapartum variables used: the labour admission model, the interval models and the maximum score model. To obtain the labour admission model, we first selected variables that presented p-values lower than 5% in the chi-square and Student t-tests in the complete sample (bivariate analysis). We considered the admission records of the following variables in bivariate analysis:

  • Fixed: Maternal height, Symphysis-fundal height, Foot size (mother), BMI, Parity and previous caesarean section (CS), Previous uterine surgery, Previous abortions, History of prolonged labour, Outcome of last pregnancy, Complications with current pregnancy, Chronic health conditions prior to pregnancy, Gestational age, Mode of onset of labour, Cervix position.

  • Dynamic: Cervical dilatation, Maternal heart rate (MHR), Systolic blood pressure (SBP), Diastolic blood pressure (DBP), Number of uterine contractions in 10 min, Suspected foetal distress, Foetal movements, Foetal station, Foetal presentation, Amniotic membrane status, Abnormal axillary temperature, Moderate to extreme pain, Moulding status, Caput succedaneum.

We categorized the following quantitative predictors: Maternal BMI, SBP, DBP, MHR, Axillary temperature, Number of uterine contractions in 10 min. For Maternal BMI, we considered the categorization according to the gestational age [30]. The MHR and blood pressure were categorized according to the Modified Early Obstetric Warning Score [31]. We used the foetal heart rate (FHR) values to define the suspected foetal distress variable (i.e. values above 160 beats or below 120 beats were considered as suspected foetal distress [32]). We considered as abnormal axillary temperature, values below 35.5 °C and equal or greater than 37.5 °C. Additional file 1: Table S1 and Additional file 1: Table S2, respectively, present more details about the fixed and dynamic variables considered in the bivariate analysis step. The final admission model (Model 1) included those independent variables selected using a stepwise selection method among the significant variables in bivariate analysis. The stepwise is a method that sequentially adds variables into the model that most improves the fit [33]. To evaluate the model improvement at each step, we considered de Akaike information criterion [34].

Interval models include the same predictors of final admission model, however, we considered updated values for the last measured measure of the dynamic variables, at three time intervals, from 4 cm of dilatation: 0 to 2 h (Model 2), 2 to 4 h (Model 3) and 4 to 6 h (Model 4). In these models, the elapsed time between admission and the first record of dilatation equal to 4 cm was inserted as an additional independent variable. For the interval models, only women with a record of a 4 cm dilatation were included.

To obtain the maximum score model (Model 5), we considered the same fixed variables from final admission model. Regarding dynamic variables, we used bivariate analysis to select records related to conditions presented throughout the labour among the described below:

  • Final cervical dilatation, Most extreme MHR, Most extreme SBP, Most extreme DBP, Highest number of uterine contractions observed (10 min), At least one suspected foetal distress, Presence of foetal movements throughout intrapartum, Lower foetal station achieved, Final foetal presentation, Final amniotic membrane status, Some occurrence of abnormal axillary temperature, Some occurrence of moderate to extreme pain, Highest Moulding status, Most severe level of Caput succedaneum.

Additional file 1: Table S3 presents more details about the variables listed above. We also present an abridged version of each interval model and of maximum score model, that was obtained using a stepwise selection method. Figure 1 presents a schematic flow diagram including a summary of the models studied. In summary, Model 1 returns a probability of CS at admission, Models 2 to 4 return the probability of CS at 2, 4 and 6 hours after the start of the active phase and Model 5 returns the probability of CS throughout the labour.

Fig. 1

Schematic flow diagram of model building

For the model estimation, we used random samples containing 70% of the data set that did not present missing data. This segment was called the training sample, while the remaining 30% (test sample) was used in the validation step. It is important to divide the dataset, because it allows us to evaluate the model discriminatory capacity using a sample that was not the same used to fit the model.

To evaluate the discriminatory capacity, we present the ROC Curve and the area under the ROC curve (AUC). The ROC curve graphically represents the values of sensitivity and specificity for an entire range of prediction cut-off points. Sensitivity reflects the proportion of individuals for whom the model correctly indicated by performing CS among all those who actually undergone to CS. Specificity is the proportion of individuals that the model indicates for not performing CS among all those who actually did not. The higher the discriminatory capacity of a model, the higher its AUC, which can reach a maximum of 1 (or 100%).

We also presented the diagnostic accuracy (DA), positive predictive value (PPV) and negative predictive value (NPV) related to the cut-off point that maximizes sensibility and specificity. DA is the proportion of individuals in which the model is correct among all individuals. PPV and NPV are the proportion of observations in which the model is correct among that one the model indicates to perform and to not perform the CS, respectively. To evaluate the adequacy of fitted models to the data, we used the Hosmer-Lemeshow test, which evaluates the null hypothesis that the logistic model is the correct choice. The analyses were performed using the R software version 3.5.1 [35].


There were 9995 women in the database, out of which 8957 women without any inconsistency in the time of intrapartum record were used. Thirty percent of women (n = 2684) had at least one record taken at 4 cm of dilatation. From these, 2683 have at least one record between 0 and 2 h after dilatation of 4 cm, 1604, between 2 and 4 h and 714, between 4 and 6 h. Additional file 1: Figure S1 presents the analysis flow diagram. Table 1 presents the description of the sociodemographic data and the outcomes. The CS represented 12.7% of the deliveries performed. Among all women who had CS, the main indications were: suspected acute foetal distress (31.3%), cephalopelvic disproportion (16.3%) and prolonged labour (15.7%).

Table 1 Socio-demographic data of outcomes for the study population from the BOLD project

Table 2 presents the performance measures for all the estimated models and Additional file 1: Figure S2, the ROC curve of each of the models when applied to the training and test samples. Table 3 presents the estimated coefficients for the admission model and for the complete and abridged interval models and Table 4 presents the coefficients estimated for the maximum score models. From a practical point of view, positive coefficients indicate that this variable increases the probability of performing CS. On the other hand, negative coefficients indicate that a certain variable reduces the probability of CS.

Table 2 Performance measures for the complete and abridged models in the training and test samples
Table 3 Coefficients of admission and interval models to predict CS
Table 4 Coefficients and p-values of complete and abridged version of maximum score models to predict CS

By and large, the performance measures are higher in the maximum score model, followed by the interval models (Table 2). The AUC for the test samples ranged from 78.70 to 93.66% in the full models and 74.73 and 93.72% in the abridged models. The results are similar in the sample training. The sensitivity and specificity values are higher than 65% in all the cases studied, reaching values above 85% in the abridged maximum score model. The accuracy and negative predictive values are greater than 65 and 85%, respectively, for all models. In contrast, the highest positive predictive values observed in the test sample were 53.58 and 58.54%. Additional file 1: Table S4 presents the results of the Hosmer-Lemeshow test to verify the quality of the adjustments. All models have a p-value in this test higher than 0.05, which indicates that the models have a good fit to the data.

The variables with greater weight in the increase of the probability of CS of the maximum score model (Table 4) that is, higher estimated positive coefficients are: occurrence of third-degree moulding, moderate/severe caput succedaneum, amniotic membrane ruptured with meconium and suspected acute foetal distress. On the other hand, the greatest negative coefficients occurred for: multiparous without previous CS, foetal descent in/below the ischial spines and level of cervical dilatation. Particularly for the obstetric variables used in Robson’s classification [36], the estimated coefficients indicate that the following characteristics are associated with an increased probability of CS: induced labour, multiparous with previous CS, post-term gestational age and cephalic or non-cephalic transverse/posterior foetal presentations. However, the coefficients for multiparous pregnant women without CS and pre-term gestation indicate lower probability of CS.


In this manuscript, we present statistical models for the prediction of the occurrence of CS during labour, using baseline and intrapartum characteristics of 8957 pregnant women and their foetuses in Nigeria and Uganda. We present models obtained from variables recorded during the admission of women to hospitals and throughout labour. Our findings show that it is possible to predict CS and that the performance of this prediction increases considerably when we use information obtained during labour. Among the model versions presented, the maximum score model, which considers the situation of higher risk of women during their care, presented a discriminatory performance higher than 90%.

Our models have a performance equal to or greater than other logistic regression models in the literature, in which the AUC ranged from 70% (16,20–22), 84% [20] and 88% [24]. It should be noted that the previous models used only measures recorded at the hospital admission. Another model has a 75% AUC [25] when also considering information collected during labour. In addition, a study [20] presented AUC close to 94% using regression tree and random forest as an alternative to logistic regression. However, the authors used the hospital in which CS occurred as an independent variable, which made it impossible to apply outside these places.

Comparing the results of the present study with the AUC values ​​observed in the models already proposed [17,18,19,20,21,22,23,24], we observed that the discriminatory capacity of the maximum score model (AUC: 93.66 and 93.72%) is higher. Nevertheless, those of the interval models (2 to 4 h and 4 to 6 h after the start of the active phase) are close to those of the best performance models. On the other hand, our admission and interval models (0 to 2 h) present intermediate discriminatory capacity in relation to what has already been published in the literature. It should be emphasized that the characterization of the samples considered in the models already proposed can limit their application and comparison with the models presented here. This is because, in some cases, the samples are restricted and consider only women with specific characteristics, such as: induced labour [19, 21], spontaneous labour [22], absence of previous CS [21] or previous labour [17, 22], at least one prior CS [23], cephalic presentation [17, 19, 22] and term/post term gestational age [17, 19, 21, 25].

Regarding the practical applicability of the coefficients estimated in each of the models, it can be observed that most of the results are consistent with clinical variables associated with CS already described in the literature. Maternal age was the most used variable in the models already proposed for CS prediction and, as observed in the present study, presents positive coefficients in all studies [17, 18, 20, 22,23,24], therefore older women present a higher risk of being submitted to CS. Concerning the variables used in the classification of Robson [36], our results are in agreement with other studies that show the increase in the probability of CS for pregnant with induced labour and previous CS [20, 24] and reduction for multiparous women [19, 21, 24]. Regarding gestational age, our models indicated a lower probability of CS for preterm and a higher one for post-term pregnancies when compared to full-term pregnancies. This result contradicts the one observed in the model proposed in 2016 [24], in which pre-term gestation appears positively associated with CS. This difference could have occurred because, in the BOLD project database, preterm pregnancies are older than 34 weeks. Specifically, more than 80% of registered preterm pregnancies are older than 36 weeks, and therefore are associated with obstetric outcomes similar to those of full-term pregnancies.

The models presented here are innovative in relation to what has already been proposed in the literature for using intrapartum variables for CS prediction. This allows the estimation of the probability of CS to be updated throughout labour and not only calculated when women are admitted to hospital. On the other hand, a limitation is the fact that some of the models are somewhat extensive, which can make it difficult to estimate the CS probabilities. To mitigate this problem, each model has an abridged version, with predictive capabilities similar to those of larger models. In addition, the clinical assessments of cervical dilatation were made at irregular intervals in the participating hospitals – few women were rigorously assessed at regular intervals (e.g. every 4 h), which affected negatively the available information for developing interval models. This is because, for the construction of these models, there must be at least one four-centimetre dilatation record, as well as the records of the interest intervals. As many of the intrapartum BOLD records were made according to clinical needs, it was not possible to ensure that all the pregnant women had a four-centimetre dilatation record. In order to reduce the impact of this characteristic of our sample, we developed the maximum score model which was constructed in order to consider all the possible variabilities of the intrapartum measurements, regardless of when they were recorded.

Given the overall discriminatory capacity of the CS prediction models presented in this paper – very good – we hypothesize that such models could be used to support decision making during intrapartum care. The human component should remain as the substantive and final element of decision making during labour and childbirth. However, in certain settings, health care practitioners may lack the necessary experience to make a decision at an optimal timing, and an accurate CS prediction model could function as a decision support tool. Another envisaged application of these models is their integration into obstetric early warning systems, which could reduce delays during intrapartum care. In summary, these models could contribute to maximizing health effects of CS, which can be done by ensuring that all women who need a CS have it, unnecessary CS are avoided and access to safe surgery is strengthened. These goals are particularly important in sub-Saharan Africa, as women who undergo a CS have a greater risk of developing a serious complication or dying [37].

On top of the possible applications at individual level, CS prediction models could be used retrospectively to generate a cluster level predicted CS rate. The average individual predictions of CS risk of an obstetric population equates to the CS risk of that population, which is the same of a predicted CS rate for that population. By comparing the predicted to the observed CS rate, one can identify deviations from what would be expected for that population: CS models could thus be used to benchmark the CS rate of a certain population [24]. Our models were built based on women from Nigeria and Uganda, with single pregnancies, and it is necessary to carry out validation studies for their application in other populations. In addition, the construction of new models based on other populations is also encouraged.


We conclude that it is possible to predict CS with high accuracy and that intrapartum variables should be considered in this process. Thus, considering the evolution of the computational methods and the improvement in the information infrastructure of maternity settings, we believe that models such as those presented in this work could support clinical decision making and facilitate the analysis of CS rates at the health facility.

Availability of data and materials

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.



Area Under the Curve


Body Mass Index


Better Outcomes in Labour Difficulty


Confidence Interval


Caesarean Section


Diagnostic Accuracy


Diastolic Blood Pressure


Foetal Heart Rate


Maternal Heart Rate


Negative Predictive Value


Positive Predictive value


Receiver Operate Characteristic


Systolic Blood Pressure






  1. 1.

    Betrán AP, Ye J, Moller A-B, Zhang J, Gülmezoglu AM, Torloni MR. The increasing trend in caesarean section rates: global, regional and national estimates: 1990-2014. PLoS One. 2016;11:e0148343.

    Article  Google Scholar 

  2. 2.

    Marcolin AC. Até quando o Brasil será conhecido como o país da cesárea? Rev Bras Ginecol Obstet. 2014;36:283–9.

    Article  Google Scholar 

  3. 3.

    Boerma T, Ronsmans C, Melesse DY, Barros AJ, Barros FC, Juan L, et al. Global epidemiology of use of and disparities in caesarean sections. Lancet. 2018;392:1341–8.

    Article  Google Scholar 

  4. 4.

    Betran A, Torloni M, Zhang J, Gülmezoglu A. Section WWG on C, Aleem H, et al. WHO statement on caesarean section rates. BJOG Int J Obstet Gynaecol. 2016;123:667–70.

    CAS  Article  Google Scholar 

  5. 5.

    Ye J, Zhang J, Mikolajczyk R, Torloni M, Gülmezoglu A, Betran A. Association between rates of caesarean section and maternal and neonatal mortality in the 21st century: a worldwide population-based ecological study with longitudinal data. BJOG Int J Obstet Gynaecol. 2016;123:745–53.

    CAS  Article  Google Scholar 

  6. 6.

    Liu S, Liston RM, Joseph K, Heaman M, Sauve R, Kramer MS, et al. Maternal mortality and severe morbidity associated with low-risk planned cesarean delivery versus planned vaginal delivery at term. Can Med Assoc J. 2007;176:455–60.

    Article  Google Scholar 

  7. 7.

    Baskett T, O’Connell C. Severe obstetric maternal morbidity: a 15-year population-based study. J Obstet Gynaecol. 2005;25:7–9.

    CAS  Article  Google Scholar 

  8. 8.

    Deneux-Tharaux C, Carmona E, Bouvier-Colle M-H, Bréart G. Postpartum maternal mortality and cesarean delivery. Obstet Gynecol. 2006;108:541–8.

    Article  Google Scholar 

  9. 9.

    MacDorman MF, Declercq E, Menacker F, Malloy MH. Infant and neonatal mortality for primary cesarean and vaginal births to women with no indicated risk, United States, 1998–2001 birth cohorts. Birth. 2006;33:175–82.

    Article  Google Scholar 

  10. 10.

    Villar J, Carroli G, Zavaleta N, Donner A, Wojdyla D, Faundes A, et al. Maternal and neonatal individual risks and benefits associated with caesarean delivery: multicentre prospective study. Bmj. 2007;335:1025.

    Article  Google Scholar 

  11. 11.

    Harrison MS, Goldenberg RL. Cesarean section in sub-Saharan Africa. Matern Health Neonatol Perinatol. 2016;2:6.

    Article  Google Scholar 

  12. 12.

    Ngonzi J, Bebell LM, Fajardo Y, Boatin AA, Siedner MJ, Bassett IV, et al. Incidence of postpartum infection, outcomes and associated risk factors at Mbarara regional referral hospital in Uganda. BMC Pregnancy Childbirth. 2018;18:270.

    Article  Google Scholar 

  13. 13.

    Harrison MS, Pasha O, Saleem S, Ali S, Chomba E, Carlo WA, et al. A prospective study of maternal, fetal and neonatal outcomes in the setting of cesarean section in low-and middle-income countries. Acta Obstet Gynecol Scand. 2017;96:410–20.

    Article  Google Scholar 

  14. 14.

    Vogel J, Souza J, Mori R, Morisaki N, Lumbiganon P, Laopaiboon M, et al. Maternal complications and perinatal mortality: findings of the World Health Organization multicountry survey on maternal and newborn health. BJOG Int J Obstet Gynaecol. 2014;121:76–88.

    Article  Google Scholar 

  15. 15.

    Torloni MR, Betran AP, Souza JP, Widmer M, Allen T, Gulmezoglu M, et al. Classifications for cesarean section: a systematic review. PLoS One. 2011;6:e14566.

    CAS  Article  Google Scholar 

  16. 16.

    Chu K, Cortier H, Maldonado F, Mashant T, Ford N, Trelles M. Cesarean section rates and indications in sub-Saharan Africa: a multi-country study from Medecins sans Frontieres. PLoS One. 2012;7:e44484.

    CAS  Article  Google Scholar 

  17. 17.

    Burke N, Burke G, Breathnach F, McAuliffe F, Morrison JJ, Turner M, et al. Prediction of cesarean delivery in the term nulliparous woman: results from the prospective, multicenter genesis study. Am J Obstet Gynecol. 2017;216:598–e1.

    Article  Google Scholar 

  18. 18.

    Chen G, Uryasev S, Young TK. On prediction of the cesarean delivery risk in a large private practice. Am J Obstet Gynecol. 2004;191:616–23.

    Article  Google Scholar 

  19. 19.

    Peregrine E, O’brien P, Omar R, Jauniaux E. Clinical and ultrasound parameters to predict the risk of cesarean delivery after induction of labor. Obstet Gynecol. 2006;107:227–33.

    Article  Google Scholar 

  20. 20.

    Campillo-Artero C, Serra-Burriel M, Calvo-Pérez A. Predictive modeling of emergency cesarean delivery. PLoS One. 2018;13:e0191248.

    Article  Google Scholar 

  21. 21.

    Levine LD, Downes KL, Parry S, Elovitz MA, Sammel MD, Srinivas SK. A validated calculator to estimate risk of cesarean after an induction of labor with an unfavorable cervix. Am J Obstet Gynecol. 2018;218:254–e1.

    PubMed  Google Scholar 

  22. 22.

    Janssen PA, Stienen JJ, Brant R, Hanley GE. A predictive model for cesarean among low-risk nulliparous women in spontaneous labor at hospital admission. Birth. 2017;44:21–8.

    Article  Google Scholar 

  23. 23.

    Smith GC, White IR, Pell JP, Dobbie R. Predicting cesarean section and uterine rupture among women attempting vaginal birth after prior cesarean section. PLoS Med. 2005;2:e252.

    Article  Google Scholar 

  24. 24.

    Souza J, Betran A, Dumont A, De Mucio B, Gibbs Pickens C, Deneux-Tharaux C, et al. A global reference for caesarean section rates (C-model): a multicountry cross-sectional study. BJOG Int J Obstet Gynaecol. 2016;123:427–36.

    CAS  Article  Google Scholar 

  25. 25.

    Harper LM, Odibo AO, Macones GA, Cahill AG. Predicting cesarean in the second stage of labor. Am J Perinatol. 2013;30:827–32.

    Article  Google Scholar 

  26. 26.

    Souza JP, Oladapo OT, Bohren MA, Mugerwa K, Fawole B, Moscovici L, et al. The development of a simplified, effective, labour monitoring-to-action (SELMA) tool for better outcomes in labour difficulty (BOLD): study protocol. Reprod Health. 2015;12:49.

    Article  Google Scholar 

  27. 27.

    National Institute for Health and Clinical Excellence (NICE). Intrapartum care: care of healthy women and their babies during childbirth. NICE clinical guideline 55, 2007. Available from

  28. 28.

    Organization WH. WHO recommendations for augmentation of labour: World Health Organization; 2014.

  29. 29.

    Hosmer Jr DW, Lemeshow S, Sturdivant RX. Applied logistic regression. 3rd ed. New Jersey: Wiley; 2013.

  30. 30.

    Atalah S. Propuesta de um nuevo estándar de evaluación nutricional em embarazadas. Rev Med Chile. 1997;125:1429–36.

    CAS  PubMed  Google Scholar 

  31. 31.

    Bircher C. The use of the modified early obstetric warning score (MEOWS). Norfolk Norwich Univ Hosp NHS Found Trust MID33AO13–817 [Internet]. 2018; Available from:

  32. 32.

    von Steinburg SP, Boulesteix A-L, Lederer C, Grunow S, Schiermeier S, Hatzmann W, et al. What is the “normal” fetal heart rate? PeerJ. 2013;1:e82.

    Article  Google Scholar 

  33. 33.

    Hastie T, Tibshirani R, Friedman J, Franklin J. The elements of statistical learning: data mining, inference and prediction. Math Intell. 2005;27:83–5.

    Google Scholar 

  34. 34.

    Akaike H. A new look at the statistical model identification. Sel Pap Hirotugu Akaike. Springer. 1974:215–22.

  35. 35.

    Core R. Team. R: a language and environment for statistical computing [internet]. Vienna, Austria: R Foundation for statistical. Computing. 2018; Available from:

  36. 36.

    Robson MS. Classification of caesarean sections. Fetal Matern Med Rev. 2001;12:23–39.

    Article  Google Scholar 

  37. 37.

    Sobhy S, Arroyo-Manzano D, Murugesu N, Karthikeyan G, Kumar V, Kaur I, et al. Maternal and perinatal mortality and complications associated with caesarean section in low-income and middle-income countries: a systematic review and meta-analysis. Lancet. 2019;393:1973–82.

    Article  Google Scholar 

Download references


We acknowledge the immense contributions of Professor Bukola Fawole, in memoriam, the Nigeria country principal investigator for the BOLD project. He inspired us with his example and kindness, and he will always be remembered for his contributions not just to the BOLD project but to global maternal health research. The authors would like to thank the local investigators from Nigeria (Adesina Akintan, Amos Adebayo, Francis E. Alu, Hadiza Idris, Halima Bello, Ola Okike, Olubunmi Alabi, Sanni Wilfred) and Uganda (Josaphat Byamugisha, Michael Bukenya, Miriam Nakalembe, Musana Othiniel Mambya, Salim Bataale). They are also grateful to the members of World Health Organization coordination unit: Ahmet M. Gülmezoglu, Meghan A. Bohren and Özge Tunçalp, Joshua P. Vogel.


This paper reports a secondary analysis of the BOLD project database, which was funded by the Bill & Melinda Gates Foundation (Grant #OPP1084318), United States Agency for International Development, and the UNDP-UNFPA-UNICEF-WHO-World Bank Special Programme of Research, Development and Research Training in Human Reproduction (HRP). This analysis is partially supported by World Health Organization, the Public Health Graduate Program, Department of Social Medicine, Ribeirão Preto Medical School, University of São Paulo (Brazil) and the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior – Brasil (CAPES).

Author information




GSCP, JPS, OTO and KM conceptualized the analysis plan. GSCP and HCCS performed the statistical analysis. HCCS and ACM drafted the manuscript with substantial inputs from JPS, GSCP, LOO and OTO. ACM, GSCP, HCCS, JPS, LOO, OTO and KM interpreted and revised the results and approved the manuscript for publication.

Corresponding author

Correspondence to Hayala C. C. de Souza.

Ethics declarations

Ethics approval and consent to participate

Scientific and technical approval for BOLD cohort was obtained from the Review Panel on Research Projects (RP2) of the UNDP/UNFPA/UNICEF/WHO/World Bank Special Program of Research, Development and Research Training in Human Reproduction (HRP), Department of Reproductive Health and Research, World Health Organization. Ethical approval was obtained from the World Health Organization Ethical Review Committee (protocol A65879, approval date 25 August 2014), the Makerere University School of Health Sciences Research and Ethics Committee, Uganda (protocol #SHSREC REF 2014–058), University of Ibadan/University College Hospital Ethics Committee (UI/EC/14/0223), Federal Capital Territory Health Research Ethics Committee, Nigeria (protocol FHREC/2014/01/42/27–08-14), and Ondo State Government Ministry of Health Research Ethics Review Committee, Nigeria (AD 4693/160). The study was conducted according to the Declaration of Helsinki of the World Medical Association. All study participants provided an informed consent at study enrolment.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests. The manuscript represents the views of the named authors only.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Table S1.

Description of the fixed variables considered in the bivariate analysis step Table S2. Description of the dynamic variables considered in the bivariate analysis Table S3. Description of variables recorded throughout the labour considered in the univariate analysis Table S4. P-value for Hosmer-Lemeshow test in each of the estimated models Figure S1. Analysis flow diagram Figure S2. ROC curves for models in the training sample (left column) and test sample (right column).

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

de Souza, H.C.C., Perdoná, G.S.C., Marcolin, A.C. et al. Development of caesarean section prediction models: secondary analysis of a prospective cohort study in two sub-Saharan African countries. Reprod Health 16, 165 (2019).

Download citation


  • Caesarean section
  • Logistic regression
  • Prediction models
  • Decision support models