Evaluating chemometric strategies and machine learning approaches for a miniaturized near-infrared spectrometer in plastic waste classification ACTA IMEKO ISSN: 2221-870X June 2023, Volume 12, Number 2, 1 - 7 ACTA IMEKO | www.imeko.org June 2023 | Volume 12 | Number 2 | 1 Evaluating chemometric strategies and machine learning approaches for a miniaturized near-infrared spectrometer in plastic waste classification Claudio Marchesi1 , Monika Rani1, Stefania Federici1, Matteo Lancini2, Laura E. Depero1 1 Department of Mechanical and Industrial Engineering, University of Brescia & UdR INSTM of Brescia, via Branze 38, 25123 Brescia, Italy 2 Department of Medical and Surgical Specialties, Radiological Sciences, and Public Health, University of Brescia, viale Europa 11, 25123 Brescia, Italy Section: RESEARCH PAPER Keywords: Plastic waste sorting; Near-Infrared Spectroscopy (NIRS); circular economy; chemometrics; machine learning Citation: Claudio Marchesi, Monika Rani, Stefania Federici, Matteo Lancini, Laura E. Depero, Evaluating chemometric strategies and machine learning approaches for a miniaturized near-infrared spectrometer in plastic waste classification, Acta IMEKO, vol. 12, no. 2, article 40, June 2023, identifier: IMEKO- ACTA-12 (2023)-02-40 Section Editor: Leonardo Iannucci, Politecnico di Torino, Italy Received March 31, 2023; In final form June 19, 2023; Published June 2023 Copyright: This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Funding: The research was funded by the project PON “R&I” 2014-2020: SIRIMAP—SIstemi di Rilevamento dell’Inquinamento MArino da Plastiche e successivo recupero-riciclo (No. ARS01_01183, CUP D86C18000520008) and partially based upon work from COST Action CA20101 Plastics monitoring detection Remediation recovery - PRIORITY, supported by COST (European Cooperation in Science and Technology), www.cost.eu. Corresponding authors: Stefania Federici, e-mail: stefania.federici@unibs.it; Matteo Lancini, e-mail: matteo.lancini@unibs.it 1. INTRODUCTION In the perspective of whole-system economic sustainability, the enormous volume of urban plastic waste and the constant increase in human plastic consumption require a high level of waste valorisation. By the numbers, global plastic production reached 367 million tons in 2021, with Europe accounting for 16 % of the total [1]. 9 % of plastic was recycled, 12 % was incinerated, and 79 % ended up in landfills or natural compartments [2]. The recycling of polymer waste has significant environmental advantages owing to the replacement of primary manufacturing, and waste sorting optimization plays a critical role in the development of the recycling process [3], [4]. Recycling is a technique for plastic product end-of-life waste management [5]. Basically, two types of recycling processes can be distinguished: mechanical and chemical processes [3], [6]. In both, sorting is the most critical stage in the recycling process, and this is true regardless of how effective the recycling program is [3], [4]. The use of automated sorting equipment makes the process more efficient [7]. Usually, these devices rely on vibrational spectroscopic techniques [8]-[11], and camera ABSTRACT Optimizing the sorting of plastic waste plays a crucial role in improving the recycling process. In this contribution, we report on a comparative study of multiple machine learning and chemometric approaches to categorize a data set derived from the analysis of plastic waste performed with a handheld spectrometer working in the Near-Infrared (NIR) spectral range. Conducting a cost-effective NIR study requires identifying appropriate techniques to improve commodity identification and categorization. Chemometric techniques, such as Principal Component Analysis (PCA) and Partial Least Squares - Discriminant Analysis (PLS - DA), and machine learning techniques such as Support- Vector Machines (SVM), fine tree, bagged tree, and ensemble learning were compared. Various pre- treatments were tested on the collected NIR spectra. In particular, Standard Normal Variate (SNV) and Savitzky-Golay derivatives as signal pre-processing tools were compared with feature selection techniques such as multiple Gaussian Curve Fit based on Radial Basis Functions (RBF). Furthermore, results were combined into a single predictor by using a likelihood-based aggregation formula. Predictive performances of the tested models were compared in terms of classification parameters such as Non-Error Rate (NER) and Sensitivity (Sn) with the analysis of the confusion matrices, giving a broad overview and a rational means for the selection of the approach in the analysis of NIR data for plastic waste sorting. mailto:stefania.federici@unibs.it mailto:matteo.lancini@unibs.it ACTA IMEKO | www.imeko.org June 2023 | Volume 12 | Number 2 | 2 systems for the polymer identification of clear and coloured products [5], [12]. Other techniques are based on ultraviolet (UV) spectroscopy [13], [14], X-ray [15], and hyperspectral imaging [16]-[18]. Over the years, this strategy has increased the purity of the output plastic, achieving a high percentage of recyclates in the production of secondary materials. However, these systems reach their limits with mixed plastics that require additional sorting elsewhere and can affect the quality of the recyclate if not appropriately allocated. A positive cost-benefit analysis is only possible if the separated polymer fractions have a high purity grade and satisfy the market demand for high-quality recyclates. Therefore, post-consumer recycling consists of many essential steps: collection, sorting, cleaning, size reduction and separation, and/or compatibilization to reduce polymer contamination [5]. In this scenario, the prospect of combining a well-established polymer identification technology with a small, portable, low- cost, real-time spectrometer for local and intermittent semi- automatic sorting is highly desirable, accompanied by robust data analysis [19], [20]. In recent years, chemometric analysis of non- destructive spectroscopic data has been widely investigated as an automated method for improving plastic sorting systems [21]- [24]. This improvement has been driven by the need to reduce the environmental impact [25]. Recently, machine learning has attracted considerable attention in plastic waste recognition using spectroscopic techniques [26]-[32]. In this study, we compared machine learning and chemometric techniques for classifying plastic waste data acquired with a portable Near-Infrared (NIR) spectrometer (see Figure 1 for the scheme of the work). Comparisons were made between chemometric approaches, Principal Component Analysis (PCA) and Partial Least Squares – Discriminant Analysis (PLS-DA), and machine learning techniques, Support-Vector Machines (SVM), Fine Tree, Bagged Tree, and Ensemble Learning. A comparison was also made in terms of pre-processing: traditional techniques, such as Standard Normal Variate (SNV) and Savitzky-Golay derivatives were examined in contrast to feature reduction techniques, such as multiple Gaussian Curve Fit based on Radial Basis Functions (RBF). The predictive performances of the tested models were compared in terms of classification parameters, such as Non- Error Rate (NER) and Sensitivity (Sn) with the analysis of confusion matrices, providing a comprehensive overview and a rational means of selecting the approach for the analysis of NIR data for plastic waste sorting. 2. MATERIALS AND METHODS 2.1. Samples collection The first batch of plastic samples was collected in the Selection Division of the Montello SpA recovery and recycling plant (Bergamo, Italy), which accepts post-consumer plastic in the form of municipal waste for recycling [20]. Subsequently, the dataset was expanded to include new samples from municipal waste collected before ending up in landfills. A total of 325 samples from a variety of polymer classes were used in this study. Specifically, the products studied were: 75 samples of poly(ethylene terephthalate) (PET), 100 samples of polyethylene (PE), 75 samples of polypropylene (PP), and 75 samples of poly(styrene) (PS). The assortment included bottles, containers, and packaging of various sizes, shapes, and colours. 2.2. NIR analysis Plastic samples were analysed using the MicroNIR On-site spectrometer (Viavi Solutions Inc., CA, United States) in reflectance mode without pre-treatment of the samples. The instrument is a palm-sized, portable spectrometer weighing approximately 250 g and measuring less than 200 mm in length and 50 mm in diameter. The instrument is equipped with a Linear Variable Filter (LVF), coupled to a linear detector array, which operates in the wavelength range 950-1650 nm. Control settings for spectral data acquisition were set to 10 milliseconds integration time and 50 scans, resulting in a short measurement time of 0.25 seconds. A point-and-shoot technique was used to perform 5 replicates for each sample to reduce the effects caused by sample non-uniformity. A total of 1625 spectra were acquired, and acquisition was performed using MicroNIRTM Pro v3.0 software (Viavi Solutions Inc., CA, United States). 2.3. Spectral pre-processing and chemometrics Pre-processing NIR spectral data has become an essential aspect of chemometric modelling. The goal is to eliminate physical events from the spectra to improve subsequent multivariate regression, classification model, or exploratory analysis [33]. In this study, the spectra were retrieved in a single matrix of 1625 × 125 (samples × wavenumbers) and pre- processing was applied using the Savitzky-Golay second derivative method with seven data points and a second order polynomial followed by Standard Normal Variate (SNV). The second derivative was applied to correct the drift effect [34], [35] Figure 1. Scheme of the work. ACTA IMEKO | www.imeko.org June 2023 | Volume 12 | Number 2 | 3 in the NIR spectra, while SNV corrects the baseline shift [36]. SNV was calculated as follows [36]: 𝑋corr = 𝑋org − 𝑎0 𝑎1 (1) where 𝑋corr is the spectrum corrected, 𝑋org is the raw spectrum collected by the instrument, 𝑎0 is the value of the mean of the spectrum to be corrected, and 𝑎1 is the standard deviation. In addition, normalization was performed by mean centering. Different chemometric methods were used for the correct evaluation of the data of all analysed samples. PCA was initially applied as an exploratory analysis to investigate the data structure and was performed on 1625 NIR spectra from all polymer classes. Then, PLS-DA was applied as a supervised pattern recognition tool to separate the different commodities. Prior to using PLS-DA, data were split into a training set and a test set using a MATLAB proprietary function. The process was repeated 500 times, generating a different training and test set each time (75 % of the samples belonged to the training set and 25 % to the test set). All chemometric analyses were performed with MATLAB 2021b (The MathWorks, Inc, Natick, MA, USA) using the PLS-Toolbox (Eigenvector Research, Inc. Manson, Washington, USA). 2.4. Machine learning and pre-processing Various machine learning algorithms were applied for classification purposes; SVM, Fine Tree, Ensemble Learning, and Bagged Tree. In addition, a likelihood-based aggregation procedure (here called Combo) was used to integrate the data into a single predictor, and the same procedure was applied with a Monte Carlo Method (MCM) to make a perturbation on raw data, to improve the generalization performance. The chosen hyperparameters are the following: for Fine Tree Gini's diversity index (gdi) was used as split criterion with 100 maximum number of splits; SVM was performed with a linear kernel function with kernel scale equal to 3. Lastly, Ensemble Learning was performed with the Bagged Tree method with 30 cycles of learning. To test the reliability of the system, 200 random extractions were performed for splitting the training and testing set. Again, 75 % of the samples were used for training and the rest for testing. Machine learning methods were performed on three different datasets: the raw data collected as specified in the previous paragraph (2.2), data reduced using the Gaussian RBF curve fit [37], and a dataset obtained combining raw and pre-processed data. Each curve of the dataset has been fitted using a combination of 12 gaussian functions and a linear interpolation with a second- degree function, thus reducing the dataset dimension to 12 RBF centres and 12 sigma values. The procedure is as follows: 1. The second order derivative is computed and fed to find detection algorithm for the initial guesses of the RBF centres (here the MATLAB function “Findpeaks” was used with a limitation of 12 peaks maximum and excluding the first and last 20 samples of the spectrum). 2. A linear regression with a second-degree equation is used to remove offset and second-order trends. 3. The RBF centres are used as initial guess to an optimization procedure based on a Sequential Quadratic Programming constrained minimization function [38]. The cost function 𝜀 used is reported in (2) where 𝐴𝑖 is the frequency of the 𝑖-th sample, 𝑦𝑖 is its raw values, and 𝜇𝑗 , 𝜎𝑗 , and 𝐴𝑗 are the centre, sigma and amplitude of the 𝑗-th RBF function respectively. 𝜀 = ∑ ∑ 𝜀𝑖,𝑗 𝑗𝑖 𝜀𝑖,𝑗 = { 0, 𝜎𝑗 < 0 𝐴𝑗 𝑒 − (𝑓𝑖−𝜇𝑗) 2 2 𝜎𝑗 2 , 𝜎𝑗 ≥ 0 (2) 4. The centres and sigmas found are collected as features of the new dataset. The condition posed in (2) on the positive value allows to reduce dynamically the number of RBF functions actually used, while the interpolation removes trends that could hide peaks. A third dataset combining the two previous dataset (raw and RBF Gaussian fit) is also created simply joining the two tables. All calculations were performed using MATLAB and Statistics Toolbox release 2021b (The MathWorks, Inc, Natick, MA, USA). Automation of the procedure was implemented using MATLAB functions created in-house. In Figure 2 the data analysis approach starting from raw data is reported, both for chemometrics and machine learning modelling. 3. RESULTS ANN DISCUSSION 3.1. NIR spectra The main advantage of NIR spectroscopy is that it is a fast- response analytical technique capable of collecting spectra without prior processing and predicting physical and chemical properties from a single spectrum [39]. The absorption bands in the NIR region are caused by overtones and/or combination bands of primarily carbon-hydrogen vibrations and oxygen- hydrogen vibrations. Correct band assignment is difficult since it may be caused by various combinations of fundamental vibrations. Also, overtone vibrations are highly overlapping [40]. Representative NIR reflectance spectra of the four polymers (PE, PET, PP, and PS) are shown in Figure 3. Figure 2. Data analysis approach. ACTA IMEKO | www.imeko.org June 2023 | Volume 12 | Number 2 | 4 The main absorbance band for PET was found at 1660 nm, which is related to the 1st overtone of C-H stretching [41], with other two peaks at about 1130 nm and 1415 nm. For PE the peak around 1211 nm is related to 2nd overtone of methylene C-H group, while the peak at about 1217 nm is related to the C-H stretch [42]. Peaks at 1391 nm and 1168 nm, correspond, respectively, to C-H combination band and 2nd overtone of CH2 symmetric stretch. Regarding PP, the 2nd overtone of the asymmetric methyl C-H stretch is around 1193 nm, while the asymmetric methylene C-H stretch occurs at about 1211 nm [43]. The two peaks at 1391 nm and 1397 nm are related to methyl and methylene (C-H) combination. Lastly, for PS the peak at 1205 nm corresponds to the 2nd overtone of the aromatic C-H stretch; the stretching vibrational mode of C-H which occurs around 1639 nm, and the 1st overtone of aromatic C-H stretch overlaps with C-H combination band, which occurs at about 1391 nm [42]. To allow comparison between the raw spectra and the same spectra after applying the Savitzky-Golay 2nd derivative and SNV, Figure 4 shows the representative spectra of the four commodities after pre-processing. 3.2. Principal component analysis The PCA calculation was performed after the pre-processing described above for the entire spectral range. For data structure analysis, PCA is a useful chemometric method. The goal of PCA is to extract the information stored in many variables into a smaller number of variables, called Principal Components [44]. Figure 5 shows the score plot of the first two components (73.88 % of the total explained variability), in which a clear separation between the polymer classes can be seen. Along PC1 PET is distinguished from the other commodities. PET samples show very negative score values, while the other samples show positive score values. On the other hand, along PC2, PS is clearly separated from the other plastics. A clear separation between PP and PE can be noticed in the score plot of PC1 vs PC3 in Figure 6, where PC3 accounts for 15.83 % of the total information and explains the difference of PP from the other class of polymers. 3.3. Partial least squares discriminant analysis Following the exploratory PCA analysis, a supervised classification technique was used to distinguish the different plastic groups. In PLS-DA, a classification objective is added to Figure 3. Representative near-infrared (NIR) spectra of the four classes of polymers. Figure 4. NIR spectra of the four classes of the polymers after the typical pre- processing for chemometric analysis: Savitzky-Golay 2nd derivative and Standard Normal Variate. Figure 5. Results of PCA performed with spectral data of different commodities. The score plot of PC1 vs PC2 is presented. Figure 6. Results of PCA performed with spectral data of different commodities. The score plot of PC1 vs PC3 is presented. ACTA IMEKO | www.imeko.org June 2023 | Volume 12 | Number 2 | 5 the PLS regression technique. The response variable is categorical and reflects the class to which the statistical units belong. PLS-DA returns the prediction as a vector with values between 0 and 1 and a length equal to the number of classes in the predictor variables [45], [46]. Each time PLS-DA was performed, the parameters such as NER and sensitivity were calculated in fitting, in cross-validation (CV), and for the test set. The cross-validation procedure was based on venetian blind approach with 5 groups. CV was also used to determine the optimal number of Latent Variables (LVs) for each PLS-DA model. Figure 7 shows all sensitivities for each class, calculated for training set, CV, and for test set. The values are close to 1, indicating a very high classification performance. Moreover, the results are very balanced between training, CV, and test set; therefore, overfitting is completely avoided, and the model can be considered reliable and stable. Table 1 shows the NER defined as mean class sensitivity [47], calculated for all the training set, cross-validation, and test set. Overall, 99 % of the samples were correctly classified for each of the 500 iterations. 3.4. Machine learning Due to the complexity and the large number of results, for the machine learning analysis the classification parameters are presented only for the test set. Figure 8 shows the NER of the classes for each computed model and for each treatment of the data. It is noticeable that the models run on raw data have the worst performances. The NER ranges from 0.74 (Fine Tree) to 0.9 (SVM), indicating a high variability in the results. For raw data only SVM can be considered as a satisfactory model for pattern recognition. Lower variability in the results is observed for pre- treated data and for a mixture of pre-treated and raw data, where the NER ranges from 0.96 to 0.99 and from 0.96 to 0.98, respectively. Thus, there is no difference in the results between pre-processed data and the combination of raw and pre-treated data. These results confirm that feature reduction based on the Gaussian curve with RBF gives high performances for pattern recognition in machine learning analysis. In general, the model performance is comparable between machine learning and multivariate analysis methods. After random extraction of training and test data repeated 500 and 200 times for chemometrics and machine learning, respectively, the NER calculated for the test set is above 0.95 for both methods. However, the use of chemometrics reduces the computational time, compared to the computationally intensive machine learning algorithms. 4. CONCLUSION This paper included a side-by-side comparison between conventional chemometric methods and machine learning algorithms for the classification of a dataset obtained from the study of plastic waste with a portable Near-Infrared (NIR) spectrometer. Multivariate methods such as Principal Component Analysis (PCA) and Partial Least Squares - Discriminant Analysis (PLS - DA) were investigated, as well as machine learning methods such as Support Vector Machines (SVM), Fine Tree, Bagged Tree and Ensemble Learning. Results were also compared in terms of data processing: signal pre- processing tools, SNV, and Savitzky-Golay derivatives were compared with feature reduction approaches such as Multiple Gaussian Curve Fit based on Radial Basis Functions (RBF). In addition, the machine learning algorithms were run on raw data, pre-processed data, and the combination of the two approaches. The results from PLS-DA showed very high performances for pattern recognition; in fact, the NER for the training set, in CV, and for the test set are all equal to 0.99. In contrast, for machine learning, the NER for raw data ranges from 0.74 for Fine Tree to 0.90 for SVM, indicating high variability in the results. The results for the pre-processed data show lower variability with NER value ranging from 0.96 to 0.99, which is also valid for the combination of raw data and pre-processed data. This confirms that RBF-based variable reduction is the most crucial point to improve classification performances. Contrarily to some results found in the literature regarding the pre-treatment of data having a negative effect in accuracy using chemometrics [48], the pre- treatment of data is generally an improvement in the detection accuracy using machine learning techniques. We can conclude that the multivariate and machine learning approaches produce comparable results in terms of model performance. The NER estimated for the test set is above 0.95 for both chemometrics and machine learning after randomly extracting the training and Figure 7. PLS-DA Model. Class sensitivities (Sn) calculated for training set, in cross-validation and for test set. Table 1. PLS-DA Model. Non-Error Rate calculated for training set, in CV and for test set. NER Training 0.99 CV 0.99 Test 0.99 Figure 8. Machine Learning. Comparison of the Non-Error Rate (NER) calculated from the confusion matrices for each model. Results are presented for raw data, pre-treated data, and the combination of raw and pre-treated data. ACTA IMEKO | www.imeko.org June 2023 | Volume 12 | Number 2 | 6 test data and repeating them 500 and 200 times, respectively. On the other hand, chemometrics is characterised by a lower computation time compared to machine learning algorithms and it can therefore be considered more advantageous. REFERENCES [1] Plastics Europe. Plastics Europe website. Online [Accessed 25 June 2023] http://www.plasticseurope.org [2] R. Geyer, J. R. Jambeck, K. L. Law, Production, use, and fate of all plastics ever made, Science Advances vol. 3 (2017) no. 7. DOI: 10.1126/sciadv.1700782 [3] S. M. Al-Salem, P. Lettieri, J. Baeyens, Recycling and recovery routes of plastic solid waste (PSW): A review, Waste Management. vol. 29 (2009) no.10, pp. 2625–2643. DOI: 10.1016/j.wasman.2009.06.004 [4] R. Siddique, J. Khatib, I. Kaur, Use of recycled plastic in concrete: A review, Waste Management vol.28 (2008) no.10, pp. 1835-1852. DOI: 10.1016/j.wasman.2007.09.011 [5] J. Hopewell, R. Dvorak, E. Kosior, Plastics recycling: Challenges and opportunities, Philosophical Transactions of the Royal Society B: Biological Sciences vol. 364 (2009) no. 1526, pp. 2115-2126. DOI: 10.1098/rstb.2008.0311 [6] K. Ragaert, L. Delva, K. Van Geem, Mechanical, and chemical recycling of solid plastic waste, Waste Management vol. 69 (2017), pp. 24-58. DOI: 10.1016/j.wasman.2017.07.044 [7] S. P. Gundupalli, S. Hait, A. A. Thakur, A review on automated sorting of source-separated municipal solid waste for recycling, Waste Management vol. 60 (2017), pp. 56-74. DOI: 10.1016/j.wasman.2016.09.015 [8] V. Allen, J. H. Kalivas, R. G. Rodriguez, Post-consumer plastic identification using Raman spectroscopy, Applied Spectroscopy vol. 53 (1999) no. 6, pp. 672–681. DOI: 10.1366/000370299194732 [9] V. Ludwig, Z. Da Costa Ludwig, M. M. Rodrigues, V. Anjos, C. Batesttin Costa, D. R. Sant’Anna das Dores, V. R. Da Silva, F. Soares, Analysis by Raman and infrared spectroscopy combined with theoretical studies on the identification of plasticizer in PVC films, Vibrational Spectroscopy vol. 98 (2018), pp. 134-138. DOI: 10.1016/j.vibspec.2018.08.004 [10] O. Rozenstein, E. Puckrin, J. Adamowski, Development of a new approach based on midwave infrared spectroscopy for post- consumer black plastic waste sorting in the recycling industry, Waste Management vol. 68 (2017), pp. 38–44. DOI: 10.1016/j.wasman.2017.07.023 [11] A. Vázquez-Guardado, M. Money, N. McKinney, D. Chanda, Multi-spectral infrared spectroscopy for robust plastic identification, Appl. Optics vol. 54 (2015) no. 24, pp. 7396-7405. DOI: 10.1364/AO.54.007396 [12] Y. Tachwali, Y. Al-Assaf, Y. & A. R. Al-Ali, Automatic multistage classification system for plastic bottles recycling, Resources, Conseration and Recycling vol. 52 (2007) no. 2, pp. 266–285. DOI: 10.1016/j.resconrec.2007.03.008 [13] E. Maris, A. Aoussat, E. Naffrechoux, D. Froelich, Polymer tracer detection systems with UV fluorescence spectrometry to improve product recyclability, Minerals Engineering vol. 29 (2012), pp. 77– 88. DOI: 10.1016/j.mineng.2011.09.016 [14] S. M. Safavi, H. Masoumi, S. Mirian, M. Tabrizchi, Sorting of polypropylene resins by color in MSW using visible reflectance spectroscopy, Waste Management vol. 30 (2010) no. 11, pp. 2216– 2222. DOI: 10.1016/j.wasman.2010.06.023 [15] S. Brunner, P. Fomin, C. Kargel, Automated sorting of polymer flakes: Fluorescence labeling and development of a measurement system prototype, Waste Management vol. 38 (2015), pp. 49–60. DOI: 10.1016/j.wasman.2014.12.006 [16] Y. Zheng, J. Bai, J. Xu, X. Li, Y. Zhang, A discrimination model in waste plastics sorting using NIR hyperspectral imaging system, Waste Management vol. 72 (2018), pp. 87–98. DOI: 10.1016/j.wasman.2017.10.015 [17] M. Vidal, A. Gowen, J. M. Amigo, NIR Hyperspectral Imaging for Plastics Classification, NIR news vol. 23 (2012) no.1, pp. 13–15. DOI: 10.1255/nirn.1285 [18] M. Moroni, A. Mei, A. Leonardi, E. Lupo, F. La Marca, PET and PVC separation with hyperspectral imagery, Sensors vol. 15 (2015) no. 1, pp. 2205–2227. DOI: 10.3390/s150102205 [19] I. Vollmer, M. J. F. Jenks, M. C. P. Roelands, R. J. White, T. van Harmelen, G. P. van der Laan, F. Meirer, J. T. F. Keurenjes, B. M. Weckhuysen, Beyond Mechanical Recycling: Giving New Life to Plastic Waste, Angewandte Chemie - International Edition vol. 59 (2020) no.36, pp. 15402–15423. DOI: 10.1002/anie.201915651 [20] M. Rani, C. Marchesi, S. Federici, G. Rovelli, I. Alessandri, I. Vassalini, S. Ducoli, L. Borgese, A. Zacco, F. Bilo, E. Bontempi, L. E. Depero, Miniaturized Near-Infrared (MicroNIR) Spectrometer in Plastic Waste Sorting, Materials vol. 12 (2019) no. 17. DOI: 10.3390/ma12172740 [21] R. Junjuri, M. K. Gundawar, A low-cost LIBS detection system combined with chemometrics for rapid identification of plastic waste, Waste Management vol. 117 (2021), pp. 48–57. DOI: 10.1016/j.wasman.2020.07.046 [22] V. C. Costa, F. W. B. Aquino, C. M. Paranhos, E. R. Pereira-Filho, Identification and classification of polymer e-waste using laser- induced breakdown spectroscopy (LIBS) and chemometric tools, Polymer Testing vol. 59 (2017), pp. 390–395. DOI: 10.1016/j.polymertesting.2017.02.017 [23] G. Bonifazi, L. Fiore, R. Gasbarrone, P. Hennebert, P. S. Serranti, Detection of brominated plastics from e-waste by short-wave infrared spectroscopy, Recycling vol. 6 (2021) no. 3. DOI: 10.3390/recycling6030054 [24] E. R. K. Neo, Z. Yeo, J. S. C. Low, V. Goodship, K. Debattista, A review on chemometric techniques with infrared, Raman and laser-induced breakdown spectroscopy for sorting plastic waste in the recycling industry, Resources, Conservation and Recycling vol. 180 (2022) 106217. DOI: 10.1016/j.resconrec.2022.106217 [25] C. Araujo-Andrade, E. Bugnicourt, L. Philippet, L. Rodriguez- Turienzo, D. Nettleton, L. Hoffmann, M. Schlummer, Review on the photonic techniques suitable for automatic monitoring of the composition of multi-materials wastes in view of their posterior recycling, Waste Management & Research vol. 39 (2021) no. 5, pp. 631–651. DOI: 10.1177/0734242X21997908 [26] V. Da Silva, H. Murphy, J. M. Amigo, C. Stedmon, J. Strand, Classification and Quantification of Microplastics (<100 μm) Using a Focal Plane Array-Fourier Transform Infrared Imaging System and Machine Learning, Analytical Chemistry vol. 92 (2020) no. 20, pp. 13724–13733. DOI: 10.1021/acs.analchem.0c01324 [27] S. Zhu, H. Chen, M. Wang, X. Guo, Y. Lei, G. Jin, Plastic solid waste identification system based on near infrared spectroscopy in combination with support vector machine, Advanced Industrial and Engineering Polymer Research vol. 2 (2019) no.2, pp. 77–81. DOI: 10.1016/j.aiepr.2019.04.001 [28] A. P. M. Michel, A. E. Morrison, V. L. Preston, C. T. Marx, B. C. Colson, H. K. White, Rapid Identification of Marine Plastic Debris via Spectroscopic Techniques and Machine Learning Classifiers, Environmental Science &Technologies vol. 54 (2020) no. 17, pp. 10630–10637. DOI: 10.1021/acs.est.0c02099 [29] Y. Yang, W. Zhang, Z. Wang, Y. Li, Differentiation of Plastics by Combining Raman Spectroscopy and Machine Learning, Journal of Applied Spectroscopy vol. 89 (2022) no.4, pp. 790–798. DOI: 10.1007/s10812-022-01426-1 http://www.plasticseurope.org/ https://doi.org/10.1126/sciadv.1700782 https://doi.org/10.1016/j.wasman.2009.06.004 https://doi.org/10.1016/j.wasman.2007.09.011 https://doi.org/10.1098/rstb.2008.0311 https://doi.org/10.1016/j.wasman.2017.07.044 https://doi.org/10.1016/j.wasman.2016.09.015 https://doi.org/10.1366/0003702991947324 https://doi.org/10.1016/j.vibspec.2018.08.004 https://doi.org/10.1016/j.wasman.2017.07.023 https://doi.org/10.1364/AO.54.007396 https://doi.org/10.1016/j.mineng.2011.09.016 https://doi.org/10.1016/j.wasman.2010.06.023 https://doi.org/10.1016/j.wasman.2014.12.006 https://doi.org/10.1016/j.wasman.2017.10.015 https://doi.org/10.1255/nirn.1285 https://doi.org/10.3390/s150102205 https://doi.org/10.1002/anie.201915651 https://doi.org/10.3390/ma12172740 https://doi.org/10.1016/j.wasman.2020.07.046 https://doi.org/10.1016/j.polymertesting.2017.02.017 https://doi.org/10.3390/recycling6030054 https://doi.org/10.1016/j.resconrec.2022.106217 https://doi.org/10.1177/0734242X21997908 https://doi.org/10.1021/acs.analchem.0c01324 https://doi.org/10.1016/j.aiepr.2019.04.001 https://doi.org/10.1021/acs.est.0c02099 https://doi.org/10.1007/s10812-022-01426- ACTA IMEKO | www.imeko.org June 2023 | Volume 12 | Number 2 | 7 [30] B. Carrera, V. L. Piñol, J. B. Mata, K. Kim, A machine learning based classification models for plastic recycling using different wavelength range spectrums, Journal of Cleaner Production vol.374 (2022) 133883. DOI: 10.1016/j.jclepro.2022.133883 [31] D. Covarrubias-Martínez, H. Lobato-Morales, J. M. Ramírez- Cortés, G. A. Álvarez-Botero, Classification of plastic materials using machine-learning algorithms and microwave resonant sensor, Journal of Electromagnetic Waves and Applications vol. 36 (2022) no. 12, pp. 1760–1775. DOI: 10.1080/09205071.2022.2043192 [32] S. Zinchik, S. Jiang, S. Friis, F. Long, L. Høgstedt, V. M. Zavala, E. Bar-Ziv, Accurate Characterization of Mixed Plastic Waste Using Machine Learning and Fast Infrared Spectroscopy, ACS Sustainable Chemistry & Engineering vol. 9 (2021) no. 42, pp. 14143-14151. DOI: 10.1021/acssuschemeng.1c04281 [33] A, Rinnan, F. van den Berg, S. B. Engelsen, Review of the most common pre-processing techniques for near-infrared spectra, TrAC Trends in Analytical Chemistry vol. 28 (2009) no. 10, pp. 1201–1222. DOI: 10.1016/j.trac.2009.07.007 [34] P. Oliveri, C. Malegori, R. Simonetti, M. Casale, The impact of signal pre-processing on the final interpretation of analytical outcomes – A tutorial, Analytica Chimica Acta vol. 1058 (2019), pp. 9–17. DOI: 10.1016/j.aca.2018.10.055 [35] V. M. Taavitsainen, Denoising and Signal-to-Noise Ratio Enhancement: Derivatives. in Comprehensive Chemometrics, eds. Brown, S. D., Tauler, R. & Walczak, B., pp. 57–66, Elsevier, 2009, ISBN 978-0-444-52701-1. DOI: 10.1016/B978-044452701-1.00101-0 [36] R. J. Barnes, M. S. Dhanoa, S. J. Lister, Standard normal variate transformation and de-trending of near-infrared diffuse reflectance spectra, Applied Spectroscopy vol. 43 (1989) no. 5, pp. 772–777. DOI: 10.1366/000370289420220 [37] M. M. Li, The development of a nonlinear curve fitter using RBF neural networks with hybrid neurons. Lecture Notes in Computer Science, part of 13th International Symposium on Neural Networks, ISNN 2016, St. Petersburg, Russia, July 6-8, 2016, Proceedings vol. 9719, 2016, pp. 434–443. DOI: 10.1007/978-3-319-40663-3_50 [38] J. Frédéric Bonnans, J. Charles Gilbert, C. Lemaréchal, C. A. Sagastizábal, Numerical Optimization: Theoretical and Practical Aspects, Springer Berlin Heidelberg, 2006, ISBN 978-3-540- 35447-5. DOI: 10.1007/978-3-540-35447-5 [39] M. Blanco, M. I. Villarroya, NIR spectroscopy: a rapid-response analytical tool, TrAC Trends in Analytical Chemistry vol. 21 (2002) no. 4, pp. 240–250. DOI: 10.1016/S0165-9936(02)00404-1 [40] J. Workman Jr., L. Weyer, Practical Guide and Spectral Atlas for Interpretive Near-Infrared Spectroscopy, CRC Press, Boca Raton, 2006, ISBN 9781439875254. DOI: 10.1201/b11894 [41] E. W. Crandall, A. N. Jagtap, The near‐infrared spectra of polymers. Journal of Applied Polymer Science vol. 21 (1977), pp. 449–454. DOI: 10.1002/app.1977.070210211 [42] J. Workman Jr., The Handbook of Organic Compounds, NIR, IR, R, and UV-Vis Spectra Featuring Polymers and Surfactants, Academic Press, 2001, ISBN 13: 9780127635613. DOI: 10.1016/B978-0-12-763560-6.X5000-4 [43] S. Zhu, Z. Song, S. Shi, M. Wang, M. G. Jin, Fusion of Near- Infrared and Raman Spectroscopy for In-Line Measurement of Component Content of Molten Polymer Blends, Sensors vol. 19 (2019) no.16, 3643. DOI: 10.3390/s19163463 [44] D. Ballabio, A MATLAB toolbox for Principal Component Analysis and unsupervised exploration of data structure, Chemometrics and Intelligent Laboratory Systems vol. 149 (2015), pp. 1–9. DOI: 10.1016/j.chemolab.2015.10.003 [45] D. Ballabio, V. Consonni, Classification tools in chemistry. Part 1: Linear models. PLS-DA, Analytical Methods vol. 5 (2013), pp. 3790–3798. DOI: 10.1039/C3AY40582F [46] R. G. Brereton, G.R. Lloyd, Partial least squares discriminant analysis: Taking the magic away, Journal of Chemometrics vol. 28 (2014), pp. 213–225. DOI: 10.1002/cem.2609 [47] D. Ballabio, F. Grisoni, R. Todeschini, Multivariate comparison of classification performance measures, Chemometrics and Intelligent Laboratory Systems vol. 174 (2018), pp. 33–44. DOI: 10.1016/j.chemolab.2017.12.004 [48] P. Mishra, D. N. Rutledge, J. M. Roger, K. Wali, H. A. Khan, Chemometric pre-processing can negatively affect the performance of near-infrared spectroscopy models for fruit quality prediction, Talanta vol. 229 (2021) 122303. DOI: 10.1016/j.talanta.2021.122303 https://doi.org/10.1016/j.jclepro.2022.133883 https://doi.org/10.1080/09205071.2022.2043192 https://doi.org/10.1021/acssuschemeng.1c04281 https://doi.org/10.1016/j.trac.2009.07.007 https://doi.org/10.1016/j.aca.2018.10.055 http://dx.doi.org/10.1016/B978-044452701-1.00101-0 https://doi.org/10.1366/000370289420220 https://doi.org/10.1007/978-3-319-40663-3_50 https://doi.org/10.1007/978-3-540-35447-5 https://doi.org/10.1016/S0165-9936(02)00404-1 http://dx.doi.org/10.1201/b11894 https://doi.org/10.1002/app.1977.070210211 https://doi.org/10.1016/B978-0-12-763560-6.X5000-4 https://doi.org/10.3390/s19163463 https://doi.org/10.1016/j.chemolab.2015.10.003 https://doi.org/10.1039/C3AY40582F https://doi.org/10.1002/cem.2609 https://doi.org/10.1016/j.chemolab.2017.12.004 https://doi.org/10.1016/j.talanta.2021.122303