Skip to main content

Comparing human milk macronutrients measured using analyzers based on mid-infrared spectroscopy and ultrasound and the application of machine learning in data fitting



Fat, carbohydrates (mainly lactose) and protein in breast milk all provide indispensable benefits for the growth of newborns. The only source of nutrition in early infancy is breast milk, so the energy of breast milk is also crucial to the growth of infants. Some macronutrients composition in human breast milk varies greatly, which could affect its nutritional fulfillment to preterm infant needs. Therefore, rapid analysis of macronutrients (including lactose, fat and protein) and milk energy in breast milk is of clinical importance. This study compared the macronutrients results of a mid-infrared (MIR) analyzer and an ultrasound-based breast milk analyzer and unified the results by machine learning.


This cross-sectional study included breastfeeding mothers aged 22–40 enrolled between November 2019 and February 2021. Breast milk samples (n = 546) were collected from 244 mothers (from Day 1 to Day 1086 postpartum). A MIR milk analyzer (BETTERREN Co., HMIR-05, SH, CHINA) and an ultrasonic milk analyzer (Honɡyanɡ Co,. HMA 3000, Hebei, CHINA) were used to determine the human milk macronutrient composition. A total of 465 samples completed the tests in both analyzers. The results of the ultrasonic method were mathematically converted using machine learning, while the Bland-Altman method was used to determine the limits of agreement (LOA) between the adjusted results of the ultrasonic method and MIR results.


The MIR and ultrasonic milk analyzer results were significantly different. The protein, fat, and energy determined using the MIR method were higher than those determined by the ultrasonic method, while lactose determined by the MIR method were lower (all p < 0.05). The consistency between the measured MIR and the adjusted ultrasound values was evaluated using the Bland-Altman analysis and the scatter diagram was generated to calculate the 95% LOA. After adjustments, 93.96% protein points (436 out of 465), 94.41% fat points (439 out of 465), 95.91% lactose points (446 out of 465) and 94.62% energy points (440 out of 465) were within the LOA range. The 95% LOA of protein, fat, lactose and energy were - 0.6 to 0.6 g/dl, -0.92 to 0.92 g/dl, -0.88 to 0.88 g/dl and - 40.2 to 40.4 kj/dl, respectively and clinically acceptable. The adjusted ultrasonic results were consistent with the MIR results, and LOA results were high (close to 95%).


While the results of the breast milk rapid analyzers using the two methods varied significantly, they could still be considered comparable after data adjustments using linear regression algorithm in machine learning. Machine learning methods can play a role in data fitting using different analyzers.

Peer Review reports


Breast milk macronutrients, including fat, carbohydrates (mainly lactose) and protein provide indispensable benefits for the growth of newborns. The only source of nutrition in early infancy is breast milk, so the energy of breast milk is also crucial to the growth of infants. Due to individual differences, the nutrient composition of human breast milk (such as protein and fat) can sometimes be insufficient to meet the needs of infants with special health conditions (for example, premature infants) [1,2,3]. Thus, it has become a common practice in nutritional support for preterm infants to strengthen human milk nutrition [4, 5]. Detecting the macronutrient content of human milk and adjusting its quantity and fortification can make fortified breast milk more suitable for the individual needs of preterm infants [1,2,3]. The development of metabolomics has allowed the understanding of the composition and the dynamic changes of breast milk and metabolic pathways [6,7,8,9,10]. Numerous chemical methods have been established to determine fat, protein, and lactose in human milk, and new techniques have recently been developed and reviewed by various researchers [3, 11, 12]. However, most methods with high accuracies, such as mass spectrometry, are expensive and thus can be challenging to popularize.

Because of the need for quick and cost-friendly analysis in clinical use, mid-infrared (MIR) and ultrasound methods have been rapidly developed and applied in neonatal wards, neonatal intensive care units, and maternal and child health care institutions [13, 14]. MIR is a method based on infrared chromatography (IRS) and has been widely used in quickly analyzing human milk macronutrients [12, 13, 15,16,17,18,19,20,21,22]. The technology uses the different wavelengths of infrared energy absorption of protein, fat, and carbohydrate to determine the contents [20]. Another popular rapid analysis technique is the ultrasonic method, based on high-frequency ultrasound radiation passing through the sample material [23,24,25,26]. However, there have only been a few reports analyzing the accuracy of ultrasonic methods, and the accuracy of the ultrasonic method needs further research [15, 27].

Rapid analysis techniques of breast milk components have been found to be particularly useful in various clinical applications [11, 13, 14, 16, 28]. However, other experts hold contrasting views. One reason is that unifying the results obtained from different methods can be challenging [16, 17]. Previous studies have found that rapid milk analyzers may be used in clinical routine to measure milk macronutrient but will require major adjustments [16, 17]. Another study found that routine clinical use of bedside or rapid measurement of human milk contents still lacks evidence and should not be recommended [18].

Up to now, machine learning has numerous applications in nutrition (e.g., automatic diagnosis and model fitting) [29,30,31,32,33]. Machine learning, more specifically the field of predictive modeling is primarily concerned with minimizing the error of a model or making the most accurate predictions possible, at the expense of explainability [34]. In applied machine learning, algorithms from many different fields, including statistics are borrowed and reused [34]. Linear regression is one of the most well-known and well understood algorithms in statistics and machine learning. Linear regression was developed in the field of statistics and is studied as a model for understanding the relationship between input and output numerical variables, but has been borrowed by machine learning.

Therefore, this study aims to compare the results of the MIR and ultrasound methods in rapid milk analysis and provide alternative approaches for rapid breast milk detection. These goals were achieved using linear regression algorithm in machine learning.

Materials and methods

Objects of study

From November 2019 to February 2021, volunteers were recruited from the Shanghai Xinhua Hospital for the study. All recruits were Chinese women who had lived in Shanghai for at least six months.

Inclusion criteria

Chinese mothers who have been breastfeeding (including all lactation stages); living in Shanghai for more than six months [35].

Exclusion criteria

Lack of breast milk led to the cessation of breastfeeding; inability or unwillingness to provide milk; unable to communicate due to language barriers or mental problems; had severe medical condition(s) requiring medication; lost contact before collecting breast milk; failure to collect breast milk as required; failure to store breast milk as required [35].

After getting briefed on the screening and inclusion criteria, the volunteer mothers were asked to provide one or more breast milk samples at their discretion. Multiple milk samples were provided at intervals of more than one month. The milk sample collection was accomplished by each volunteer at home, and the samples were then transported to the Xinhua Hospital via cold chain express for treatment and storage.

Chinese mothers aged 22–40 years old and who have been breastfeeding were enrolled in the study. Breast milk samples (n = 546) were collected from 244 Chinese mothers (from Day 1 to Day 1086 postpartum). In total, 546 milk samples were analyzed using the MIR method, while 465 were evaluated using the ultrasonic method. Eighty-one samples were not able to undergo ultrasonic analysis because the sample volume was insufficient.

This study had been approved by the Ethics Committee of the Xinhua Hospital with approval number XHEC-C-2020-081, and each patient’s written informed consent was obtained prior to inclusion. The study design flow chart is shown in Fig. 1.

Fig. 1
figure 1

Study design flow chart. MIR: mid-infrared

Breast milk collection

The volunteers were first given a detailed explanation by face-to-face or via WeChat regarding the unified sample collection procedure. According to the collection protocol, the breast milk collection was to be completed at home, and the volunteer would have to collect the sample after fasting (not eating energy containing foods and drinks) for more than 8 h and before feeding the baby in the morning. Each volunteer would use an electric breast pump to suck the milk from one side of the breast until it was empty [15, 36]. The breast for milk collection was chosen at random by the volunteer. The following electric breast pump was recommended for milk sample collection, including Medela Co., Sonata Flex, USA; Medela Co., Freestyle Flex, Switzerland; Medela Co., Swing Maxi Flex, China; Medela Co., Swing maxi Flex, Switzerland; Medela Co., Swing Flex, Switzerland; Philips Co., Avent SCF303, China; Philips Co., Avent SCF316, China. The volunteer mothers must reconfirm with the researcher before collecting breast milk if the above model of breast pump wasn’t available. Manual breast pump was not allowed in this study. The mother would then have to homogenize the milk manually by carefully shaking the container in which the milk was pumped into, take 15ml of the mixture, and put it into the unified breast milk collection bag, stored at − 20 ℃. The remaining milk that was pumped was offered to the infant. The milk sample was then transferred via the cold chain within two days after collection.

After each sample was received, the milk properties were evaluated by the research group. If the samples were agglutinated, stratified, or had odor or other conditions, milk would have to be recollected within three days. Milk collection methods can be obtained from a previous literature [35]. Qualified samples were then manually homoginized by shaking the breast milk collection bag gently for 30 s until mix thoroughly and repacked [35]. All breast milk samples were divided into one 0.5 mL (for the MIR method) and 10ml vials (for the ultrasonic method) [17, 22]. The collected and encapsulated milk samples were stored in an exclusive freezer at − 80 ℃ until analysis [15, 37]. The researchers analyzed the collected samples every 60 days. The above process was performed for each breast milk collection.

Milk sample treatment

Before analysis, the stored milk samples were put in a 37℃ water bath until it was completely dissolved, checked to ensure that the sample temperature can meet the working requirements of the analyzer (the MIR analyzer will automatically measure the temperature of samples before detection), and manually homogenized to ensure constant mass [17, 28]. Each sample was thoroughly mixed by turning the sample tube upside down gently for several times for 30s [17, 22]. To exclude human factors, all sample treatments prior to analysis were conducted by one researcher.

Sample analysis

In this study, two rapid breast milk analyzers based on different methodologies were used to detect milk samples repeatedly. One researcher conducted all sample analyses.

MIR milk analyzer

The MIR spectroscopy method is considered reliable and has been widely used in previous studies [14, 16, 19, 21, 22, 38, 39]. For this study, we used the MIR-based milk analyzer (BETTERREN Co., HMIR-05, SH, CHINA) to analyze milk composition (lactose, protein, fat, energy). This analyzer is an improved version of the classic instrument (MIRIS, Uppsala, Sweden) and was patented by the State Intellectual Property Office of the people’s Republic of China (No.: CN 108,760,671 A) in 2018.

According to manufacturer’s recommendations, the ambient temperature of the instrument should be maintained between 25 and 35 ℃. Before the daily sample testing, the built-in manufacturer’s quality control material was used for calibration. A special straw provided by the instrument supplier was used to take 0.5mL of homogenized milk sample, which was then transferred to a transparent covered cuvette, ensuring that there were no bubbles. The researcher handled the whole process carefully to avoid milk splashing or bubbles. The cuvette was then placed on the instrument and scanned for 10 s. The sample temperature detected by the analyzer should between 20 and 25 ℃. If it is lower or higher than the temperature range, the analyzer will warn and refuse to analyze the sample. Each milk sample had its ID number and was without sensitive information. In this study, we used raw data obtained by optical analysis without digital modeling (including lactation time), and the initial test results were stored in a password-protected database.

Digital ultrasound milk analyzer

Another milk analyzer (Honɡyanɡ Co,. HMA 3000, Hebei, CHINA) used in this study is based on the ultrasonic method. In one published article, a breast milk sample was assessed using an ultrasonic analyzer (the same brand as the present study) six times which resulted in a relative standard deviation (RSD) of 0.08–3.79% [26]. This suggests that the instrument has good reproducibility and high recovery rating (95–99%). The macronutrient content error using the digital ultrasound milk analyzer was less than 2% compared to the results from traditional methods [26]. The analyzer was stored in a cool and dry environment to avoid strong light and direct sunlight. The homogenized milk samples were placed on the test tube rack of the analyzer, and the test button was pressed, which started the testing process. The analyzer automatically moved the sample to the front of the detector near the sampling tube and inserted the sampling tube into the sampling tube at the bottom. The measurement time of each sample was about 20 s, and a maximum of 23 samples could be detected each round. After each analysis round, the analyzer will start the automatic flushing procedure. The results were automatically displayed on the computer connected to the analyzer when each sample test was completed. After completing the one-day test, the researcher used acid and alkaline cleaning solutions to complete the final cleaning procedure. The equipment manufacturer provided the neutral, acid, and alkaline cleaning solutions mentioned. The results were then stored in PDF format.

Statistical analysis

Statistical analysis was performed using SPSS Statistics 25.0 (IBM Co., Armonk, NY, USA). Continuous variables were presented as mean ± SD. The results between the two groups were compared using paired t-test. Machine learning algorithm used in the present study was linear regression, and the machine learning model’s performance was evaluated by variance score [17]. Python 3.6 (Python Software Foundation ) was used to train and test the data set for fat and energy, and the following python packages (NumPy 1.19.5; Matplotlib 3.3.4; Scikit-learn 0.24.2; Pandas 1.1.5; SciPy 1.5.4) were used. Excel (Microsoft®Excel®2016MSO) was used to develop the model for lactose and protein. Bland–Altman analysis was used to measure consistency between the measured MIR and the adjusted ultrasonic results using MedCalc 19.0.7 (MedCalc software Ltd, Ostend, Belgium). Bland–Altman scatters plot displayed the mean difference and limits of agreement (LOA). If the difference between the measurements (bias) was close to zero and the 95% LOA was within the clinically acceptable range, the measurements would be considered to have good consistency. A p-value less than 0.05 indicates statistically significant differences.


Human milk composition of two analyzers

Table 1 shows the results of the samples measured by both MIR and ultrasound methods. Our results showed statistically significant differences between groups (all p < 0.001). After applying some adjustments, the ultrasound results were again compared with the MIR results using paired t-test (see Table 1). The results showed no significant difference between the adjusted ultrasound results and the MIR results (all p > 0.05).

Table 1 Main measurement results of two milk analyzers (mean ± SD)

Adjustment of the ultrasonic method results using machine learning

Since we found significant differences in the macronutrient analyses using different instrument, we used machine learning to generate mathematical models for three macronutrients and energy of human milk and converted the values of the ultrasonic measurements. The purpose of this step is to adjust the ultrasonic results and make them as close as possible to the MIR values.

The random algorithm was used to generate the data model between MIR and the initial ultrasound results. First, we established the data set according to Mersenne Twister(MT), which can quickly generate high-quality pseudo-random numbers and correct defects in the classical random number generation algorithm [40]. The machine learning model was produced by randomly sampling the data from the database. Training and test dataset were created by using the following code: klearn.model_selection.train_test_split (X, Y, test_size = 0.20, random_state = 1), X refers to ultrasonic, Y refers to MIR, and then sklearn.linear_model.LinearRegression (scikit-learn 0.24.2) was used for model training. We randomly selected 80% of the dataset to be used in generating the working model (called “training data”) and the other 20% to test the model (called “test data”), as recommended in previous literature [41,42,43]. These steps were used in creating the correction models for protein, fat, lactose, and energy content. Since the results for protein and lactose did not conform to the linear regression model after observation, they were translated using the mean value. The training data and the test data of fat and energy as well as the related statistical parameters are shown in Table 2. Figure 2 shows the training and test data and the linear equation for fat and energy.

Table 2 The training data and the test data of fat and energy of two milk analyzers (mean ± SD)
Fig. 2
figure 2

XY-plots for fat and energy. Y referred to the MIR method, x refers to the observed ultrasound method

Finally, the linear regression equation (y = ax + b) between observed and adjusted ultrasound values was established, where y refers to the adjusted ultrasound values, and x refers to the observed ultrasound values.

We obtained four adjustment equations for protein, fat, lactose and energy:

  • Adjusted fat: y = 0.453x + 1.873, mean squared error: 0.231, variance score: 0.670;

  • Adjusted energy: y = 0.447x + 153.988, mean squared error: 329.751; variance score: 0.465;

  • Adjusted protein: y = x + 0.98447; mean squared error: 0.095; variance score: not avalible;

  • Adjusted lactose: y = x-1.34567; mean squared error: 0.201; variance score: not avalible;

The adjusted ultrasonic values were obtained using the above steps. The training data and the test data of the adjusted ultrasound results and the related model fitting parameters can be seen in Table 3. The consistency between the measured MIR (test dataset) and the adjusted ultrasound values (test dataset) was evaluated using the Bland-Altman analysis and the scatter diagram was generated to calculate the 95% LOA, as shown in Fig. 3. The 95%LOA of fat and energy were - 0.93 to 0.97 g/dl and - 34.9 to 36.6 kj/dl, respectively. The results suggest that after adjustments, the test data values generated by the two methods are more comparable than the results before adjustment (as shown in Tables 2 and 3).

Table 3 The training data and the test data of the adjusted ultrasound results and the model fitting parameters
Fig. 3
figure 3

Bland–Altman scatter plots for fat, and energy of MIR and adjusted ultrasound values (test dataset).* refers to the adjusted results of the ultrasonic milk analyzer (test dataset)

The consistency and LOA of the MIR method and adjusted ultrasound values

The Bland Altman LOA analysis was proposed by Bland JM and Altman DG in 1986 to calculate the limits of agreement between two measurement results. Consistency is assessed by generating a scatter plot visualizing the deviation between each pair. If the values of one method are considerably greater (or less) than that of the other, the difference is called bias. Bias can be calculated using the mean difference between the two methods, and the variation is expressed by the standard deviation (SD) of the difference. If the difference follows a normal distribution, 95% of the points should be within the mean ± 1.96 SD. This interval is called the 95% LOA [44,45,46].

After building the model, the consistency between the measured MIR and the adjusted ultrasound values was evaluated using the Bland-Altman analysis and the scatter diagram was generated to calculate the 95% LOA. After complete dataset adjustments, 93.96% protein points (436 out of 465), 94.41% fat points (439 out of 465), 95.91% lactose points (446 out of 465) and 94.62% energy points (440 out of 465) were within the LOA range. The 95% LOA of protein, fat, lactose and energy were - 0.6 to 0.6 g/dl, -0.92 to 0.92 g/dl, -0.88 to 0.88 g/dl and - 40.2 to 40.4 kj/dl, respectively, and clinically acceptable, as shown in Fig. 4. The results suggest that after adjustments, the values generated by the two methods are comparable.

Fig. 4
figure 4

Bland–Altman scatter plots for protein, fat, lactose, and energy of MIR and adjusted ultrasound values. * refers to the adjusted results of the ultrasonic milk analyzer


We evaluated the difference between the measured values from milk analyzers based on the MIR and the ultrasound methods and used linear regression algorithm in machine learning to adjust the data and verify the consistency. The results of this study aid an understanding of the differences in measured values between popular milk analyzing approaches and can increased the practicability of commercially available rapid human milk analyzers.

The MIR analyzer is based on the principle of spectroscopy. Since the 1990s, the accuracy of the MIR analyzer has been continuously studied for different populations and their milk samples [11, 13, 17]. Compared with chemical approaches, the MIR method is often considered to have high accuracy in human milk composition analysis, although some studies have found that the results are not always accurate [12, 19, 39]. One study found that the accuracy of using MIR to determine protein and fat in human milk is about 60 − 70% [17]. Another study reported that the MIR method could underestimate protein content in human breast milk [47]. Other studies have shown that the MIR method could be used to determine the macronutrients in cow’s milk and predict other characteristics of dairy products (such as various fatty acids, milk protein composition and structure, curd characteristics, titrated acidity, and potential diseases) [48,49,50,51]. Digital modeling of the MIR analyzer is difficult, and the optical system is complex. This causes the overall price of MIR instruments to soar, limiting their use and access.

One popular alternative to the MIR is the ultrasonic analyzer. It uses digital ultrasonic technology to establish the mathematical model by measuring the sound velocity, attenuation, and impedance and the adiabatic compression coefficient of different human milk components to detect the milk’s protein, fat, lactose, minerals, water and energy contents [12, 27]. The main advantages of the ultrasonic instrument are that the equipment is cheap and easy to obtain and that the algorithm is simple. However, this approach has some major disadvantages, such as requiring more milk (5-10ml) than other methods, having longer testing time, and being easy to block. While the accuracies of these two methods still require further verification, considering that these two are the widely used methods for rapid breast milk analysis, it is necessary to use mathematical techniques to adjust the results to make them comparable.

In previous literature, the measurements of human breast milk from ultrasonic milk analyzer and MIR were found to be consistent [26]. However, in our study, the measured values using the two analyzers were not comparable. Although MIR and NIR methods have been used in clinical setting in some areas, some researchers have found that the results of existing milk analyzers require significant adjustments [17]. The main reasons include different methodologies and features, the absence of breast milk standards, and differences in sample preparation procedures [14, 19].

In this study, we used machine learning to convert the measured values of the ultrasonic method and verify the consistency. The results show that with the aid of machine learning, the adjusted ultrasound values could be consistent with the MIR results. Machine learning (linear regression algorithm) was used to establish the correction models for fat and energy. For protein and lactose, since the values did not conform to the linear regression model, they were converted based on the mean value. A possible reason for this is that other variables beyond the scope of this study could have affected the values for lactose and protein, such as maternal and infant information, details of breast milk (e.g., which breast the milk is collected from) and detection environment (e.g., temperature) [17, 25]. Another possible reason is that the measurement error of the instrument itself could have led to the above results. As for the fat and energy values obtained through the linear regression algorithm of machine learning, if more information can be included, the reliability of the model may be increased [34]. Future research can be carried out to develop more accurate models.

One major limitation of this study is its small sample size. The sample size was considerably affected by data availability, which could limit the applicability of the results. Another limitation is that the study focuses only on two milk analyzers, which could weaken the applicability of the research results. The selection of analyzers is another limitation; only two analyzers were used for comparison. Future studies may test other commercially available analyzers to better understand whether the model obtained by machine learning may be applicable. In addition, although the use of multiple samples from one mother for analysis and statistics has been widely used in previous studies, it is still necessary to consider the fact that the composition of human breast milk is highly variable inter- and intra-individually, which has not been fully considered in this study. The above deficiencies should be fully considered in future research.


The present study evaluated the results of rapid milk analyzers based on MIR and ultrasonic methods. The results showed significant differences between the two methods in determining the macronutrients and energy in breast milk. We then used the linear regression algorithm in machine learning to adjust the measurements of the ultrasonic analyzer, and the adjusted results were consistent with the MIR results. The results suggest that machine learning can play a role in data fitting for different methodological analyzers.

Availability of data and materials

All data generated or analyzed during this study are included in this published article.





infrared chromatography


limits of agreement




relative standard deviation


Mersenne Twister


  1. Lemons JA, Moye L, Hall D, Simmons M. Differences in the Composition of Preterm and Term Human Milk during Early Lactation. Pediatr Res 1982; 16: 113–117.

    CAS  PubMed  Article  Google Scholar 

  2. Seliga-Siwecka J, Chmielewska A, Jasińska K. Effect of targeted vs standard fortification of breast milk on growth and development of preterm infants (≤ 32 weeks): study protocol for a randomized controlled trial. Trials 2020; 21: 946.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  3. Boyce C, Watson M, Lazidis G, et al. Preterm human milk composition: a systematic literature review. Br J Nutr 2016; 116: 1033–1045.

    CAS  PubMed  Article  Google Scholar 

  4. Mangili G, Garzoli E. Feeding of preterm infants and fortification of breast milk. Pediatr Med Chir 2017; 39: 158.

    PubMed  Article  Google Scholar 

  5. Brown JVE, Embleton ND, Harding JE, McGuire W. Multi‐nutrient fortification of human milk for preterm infants. Cochrane Database Syst Rev. 2016;(5). Art. No.: CD000343. Accessed 11 July 2022.

  6. Garwolińska D, Hewelt-Belka W, Kot-Wasik A, Sundekilde UK. Nuclear magnetic resonance metabolomics reveals qualitative and quantitative differences in the composition of human breast milk and milk formulas. Nutrients. 2020;12:921.

    PubMed Central  Article  CAS  Google Scholar 

  7. Garwolińska D, Namieśnik J, Kot-Wasik A, Hewelt-Belka W. State of the art in sample preparation for human breast milk metabolomics—merits and limitations. Trac Trends Anal Chem. 2019;114:1–10.

    Article  CAS  Google Scholar 

  8. Praticò G, Capuani G, Tomassini A, et al. Exploring human breast milk composition by NMR-based metabolomics. Nat Prod Res. 2014;28:95–101.

    PubMed  Article  CAS  Google Scholar 

  9. Bardanzellu F, Peila C, Fanos V, Coscia A. Clinical insights gained through metabolomic analysis of human breast milk. Expert Rev Proteomics 2019; 16: 909–932.

    CAS  PubMed  Article  Google Scholar 

  10. Cesare Marincola F, Dessì A, Corbu S, et al. Clinical impact of human breast milk metabolomics. Clin Chim Acta 2015; 451: 103–106.

    CAS  PubMed  Article  Google Scholar 

  11. Choi A, Fusch G, Rochow N, et al. Establishment of micromethods for macronutrient contents analysis in breast milk. Matern Child Nutr. 2015;11:761–72.

    PubMed  Article  Google Scholar 

  12. Kala R, Samková E, Hanu O, et al. Milk protein analysis: An overview of the methods – development and application. Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis 2019; 67: 345–359.

    CAS  Article  Google Scholar 

  13. Smilowitz JT, Gho DS, Mirmiran M, et al. Rapid measurement of human milk macronutrients in the neonatal intensive care unit: accuracy and precision of fourier transform mid-infrared spectroscopy. J Hum Lac. 2014;30:180–9.

    Article  Google Scholar 

  14. Fusch G, Kwan C, Kotrri G, Fusch C. "Bed Side” Human Milk Analysis in the Neonatal Intensive Care Unit. Clin Perinatol. 2017;44:209–67.

    PubMed  Article  Google Scholar 

  15. Miller EM, Aiello MO, Fujita M, et al. Field and laboratory methods in human milk research. Am J Hum Biol 2013; 25: 1–11.

    PubMed  Article  Google Scholar 

  16. Kwan C, Fusch G, Rochow N, Fusch C. Milk analysis using milk analyzers in a standardized setting (MAMAS) study: A multicentre quality initiative. Clin Nutr 2020; 39: 2121–2128.

    CAS  PubMed  Article  Google Scholar 

  17. Fusch G, Rochow N, Choi A, et al. Rapid measurement of macronutrients in breast milk: How reliable are infrared milk analyzers? Clin Nutr 2015; 34: 465–476.

    CAS  PubMed  Article  Google Scholar 

  18. Jianxing Z. Limitations in clinical application of rapid measurements of macronutrients in human milk. Chin J Perinat Med. 2019;22:441–4.

    Google Scholar 

  19. Billard H, Simon L, Desnots E, et al. Calibration Adjustment of the Mid-infrared Analyzer for an Accurate Determination of the Macronutrient Composition of Human Milk. J Hum Lact 2016; 32: Np19-27.

    PubMed  Article  Google Scholar 

  20. Biggs DA. Infrared milk analyzer. J Dairy Sci 1972; 55: 650–651.

    CAS  PubMed  Article  Google Scholar 

  21. de Fluiter KS, Kerkhof GF, van Beijsterveldt I, et al. Longitudinal human milk macronutrients, body composition and infant appetite during early life. Clin Nutr 2021; 40: 3401–3408.

    PubMed  Article  CAS  Google Scholar 

  22. García-Lara NR, Escuder-Vieco D, García-Algar O, et al. Effect of freezing time on macronutrients and energy content of breastmilk. Breastfeed Med 2012; 7: 295–301.

    PubMed  PubMed Central  Article  Google Scholar 

  23. He Lianghua zL Li, Nan Deng, Chao. Application Study on Ultrasonic Breast Milk Quality System Detection Method. Comper Meas Control. 2013;21:2648–50.

    Google Scholar 

  24. Wang Jing LX, Guixiong GU. Composition of breast milk and its determination method. Chin J Woman Child Health Res. 2015;26:641–56.

    Google Scholar 

  25. Peng Dan XK, Xin Yin, Wang Hui. Study on Effects of Main Components and Temperature on Ultrasonic Properties in Milk. Food Sci. 2007;28:66–70.

    Google Scholar 

  26. Luo Xiaoming BD, Feng Lijuan. Application of milk analyzer in determining human milk nutritional composition. Chin J Woman Child Health Res. 2014;25:11–3.

    Google Scholar 

  27. Oto Hanuš JŘ, Eva Samková, David Ledvina, Jaroslav Kopecký. A Comparison of Result Reliability for Investigation of Milk Composition by Alternative Analytical Methods in Czech Republic. Acta Universitatis Agriculturae Et Silviculturae Mendelianae Brunensis 2014; 62: 929–937.

    Article  CAS  Google Scholar 

  28. Menjo A, Mizuno K, Murase M, et al. Bedside analysis of human milk for adjustable nutrition strategy. Acta Paediatr 2009; 98: 380–384.

    PubMed  Article  Google Scholar 

  29. Morgenstern JD, Rosella LC, Costa AP, et al. Perspective: Big Data and Machine Learning Could Help Advance Nutritional Epidemiology. Adv Nutr 2021; 12: 621–631.

    PubMed  PubMed Central  Article  Google Scholar 

  30. Buchlak QD, Esmaili N, Leveque JC, et al. Machine learning applications to neuroimaging for glioma detection and classification: An artificial intelligence augmented systematic review. J Clin Neurosci 2021; 89: 177–198.

    PubMed  Article  Google Scholar 

  31. Ikemura K, Bellin E, Yagi Y, et al. Using Automated Machine Learning to Predict the Mortality of Patients With COVID-19: Prediction Model Development Study. J Med Internet Res 2021; 23: e23458.

    PubMed  PubMed Central  Article  Google Scholar 

  32. Hanko M, Grendár M, Snopko P, et al. Random Forest-Based Prediction of Outcome and Mortality in Patients with Traumatic Brain Injury Undergoing Primary Decompressive Craniectomy. World Neurosurg 2021; 148: e450-e458.

    PubMed  Article  Google Scholar 

  33. Yang L, Wu H, Jin X, et al. Study of cardiovascular disease prediction model based on random forest in eastern China. Sci Rep 2020; 10: 5245.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  34. Handelman GS, Kok HK, Chandra RV, et al. eDoctor: machine learning and the future of medicine. J Intern Med 2018; 284: 603–619.

    CAS  PubMed  Article  Google Scholar 

  35. Ruan H, Tang Q, Zhao X, et al. The levels of osteopontin in human milk of chinese mothers and its associations with maternal body composition. Food Sci Hum Wellness. 2022;11:1419–27.

    Article  Google Scholar 

  36. Villalpando S, del Prado M. Interrelation among dietary energy and fat intakes, maternal body fatness, and milk total lipid in humans. J Mammary Gland Biol Neoplasia 1999; 4: 285–295.

    CAS  PubMed  Article  Google Scholar 

  37. Slutzah M, Codipilly CN, Potak D, et al. Refrigerator storage of expressed human milk in the neonatal intensive care unit. J Pediatr 2010; 156: 26–28.

    PubMed  Article  Google Scholar 

  38. Yang T, Zhang Y, Ning Y, et al. Breast milk macronutrient composition and the associated factors in urban Chinese mothers. Chin Med J. 2014;127:1721–5.

    PubMed  Article  Google Scholar 

  39. Zhu M, Yang Z, Ren Y, et al. Comparison of macronutrient contents in human milk measured using mid-infrared human milk analyser in a field study vs. chemical reference methods. Matern Child Nutr. 2017;13.

  40. Hongo K, Maezono R, Miura K. Random number generators tested on quantum Monte Carlo simulations. J Comput Chem 2010; 31: 2186–2194.

    CAS  PubMed  Article  Google Scholar 

  41. Sotomayor CG, Benjamens S, Dijkstra H, et al. Introduction of the grayscale median for ultrasound tissue characterization of the transplanted kidney. Diagnostics (Basel). 2021;11:390.

  42. Stamenovic D, Schneider T, Messerschmidt A. Aspirin for patients undergoing major lung resections: hazardous or harmless?†. Interact Cardiovasc Thorac Surg 2019; 28: 535–541.

    PubMed  Article  Google Scholar 

  43. Gwon H, Ahn I, Kim Y, et al. Self-Training With Quantile Errors for Multivariate Missing Data Imputation for Regression Problems in Electronic Medical Records: Algorithm Development Study. JMIR Public Health Surveill 2021; 7: e30824.

    PubMed  PubMed Central  Article  Google Scholar 

  44. Ruan H, Tang Q, Yang Q, et al. Resting energy expenditure early after cardiac surgery and validity of predictive equations: a prospective observational study. Ann Nutr Metab. 2021;77:271–8.

    CAS  PubMed  Article  Google Scholar 

  45. Flegal KM, Graubard B, Ioannidis JPA. Use and reporting of Bland-Altman analyses in studies of self-reported versus measured weight and height. Int J Obes (Lond) 2020; 44: 1311–1318.

    Article  Google Scholar 

  46. Cao X, Bai Z, Sun C, et al. Bland-Altman Analysis of Different Radiographic Measurements of the Hallux Valgus Angle and the Intermetatarsal Angle After Distal Osteotomy. Orthop Surg 2020; 12: 1621–1626.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  47. Thajer A, Fusch G, Binder C, et al. Human milk analyser underestimated protein content of unfortified and fortified samples compared to elemental analysis. Acta Paediatr 2019; 108: 2298–2300.

    PubMed  PubMed Central  Article  Google Scholar 

  48. Toffanin V, Marchi MD, Lopez-Villalobos N, Cassandro M. Effectiveness of mid-infrared spectroscopy for prediction of the contents of calcium and phosphorus, and titratable acidity of milk and their relationship with milk quality and coagulation properties. Int Dairy J. 2015;41:68–73.

    CAS  Article  Google Scholar 

  49. De Marchi M, Toffanin V, Cassandro M, Penasa M. Prediction of coagulating and noncoagulating milk samples using mid-infrared spectroscopy. J Dairy Sci 2013; 96: 4707–4715.

    PubMed  Article  CAS  Google Scholar 

  50. McParland S, Banos G, McCarthy B, et al. Validation of mid-infrared spectrometry in milk for predicting body energy status in Holstein-Friesian cows. J Dairy Sci 2012; 95: 7225–7235.

    CAS  PubMed  Article  Google Scholar 

  51. McParland S, Kennedy E, Lewis E, et al. Genetic parameters of dairy cow energy intake and body energy status predicted using mid-infrared spectrometry of milk. J Dairy Sci 2015; 98: 1310–1320.

    CAS  PubMed  Article  Google Scholar 

Download references


We thank all participants in this study. We thank the Department of Nutrition, School of Medicine, Shanghai Jiao Tong University, for providing the equipment. We thank Zhongliang Jin for his technical support with this manuscript. We also thank all colleagues in the Department of Obstetrics and Gynecology of Xinhua Hospital, Ms. Yang Jing of the Brand Development Department, and Ms. Zou Heng of the Outpatient Department for their support in the recruitment process. 


This study was supported by grants from the Shanghai Key Laboratory of Pediatric Gastroenterology and Nutrition (17dz2272000), Foundation of Shanghai Municipal Health Commission (Key weak discipline construction project 2019ZB0101), and Scientific Research Fund of China Nutrition Society (CNS- HPNK2021-16).

Author information

Authors and Affiliations



Huijuan Ruan, Qingya Tang, Wei Cai contributed to the conception of the study; Huijuan Ruan, Xuelin Zhao, Yi Xiang performed the experiment; Huijuan Ruan contributed significantly to analysis and manuscript preparation; Huijuan Ruan performed the data analysis and wrote the manuscript; Qingya Tang, Yajie Zhang, Yi Feng, Wei Cai helped perform the analysis with constructive discussions. All  authors have read and approved the final manuscript.

Corresponding author

Correspondence to Wei Cai.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the Ethics Committee of Xinhua Hospital with the approval number XHEC-C-2020-081. In this study, all participants provided written informed consent. This study is in compliance with the Helsinki declaration, relevant guidelines, and regulations.

Consent for publication

Not applicable.

Competing interests

The authors have no conflicts of interest relevant to this article.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Ruan, H., Tang, Q., Zhang, Y. et al. Comparing human milk macronutrients measured using analyzers based on mid-infrared spectroscopy and ultrasound and the application of machine learning in data fitting. BMC Pregnancy Childbirth 22, 562 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Human milk analyzer
  • Mid-infrared spectroscopy
  • Ultrasound
  • Bland–Altman method
  • Machine learning