Скачать книгу

1.21. The generic algorithms of data analysis, including statistical data processing and classification roles, are classified as computer science and data mining algorithms; other high‐level algorithms are included in AI algorithms. The main function of the engine processor enables the managing of big data and of data processing. As previously mentioned, a basic concept of algorithm classification is in the learning supervision: in a supervised learning model, the algorithm learns on a pre‐selected dataset with specific labeled attributes filtered by the user; in an unsupervised model all the attributes are unlabeled, and the algorithm tries to extract features and patterns without a guideline. The supervised algorithms mainly support the user to find a solution for a specific problem such as finding a specific defect category or a specific failure system.

Schematic illustration of algorithm classification and Industry 5.0 facilities. Schematic illustration of (a) regression analysis, (b) data classification, and (c) data clustering.

      The ensemble approach is an alternative method for data classification. An ensemble is a set of classifiers that learn a target function. By combining different outputs of several classifiers, the risk of selecting a poorly performing classifier is reduced. The typical ensemble procedure is provided by the following pseudocode where T denotes the original training dataset, κ is the number of base classifiers, and B is the test data:

Schematic illustration of ensemble method and classification.

       F input features are randomly selected to split at each node (step 1 of creation of random vectors).

       A linear combination of the input features is created to split at each node (step 2 of using a random vector to build multiple DTs).

       A combination of DTs is created (step 3).

Schematic illustration of ensemble method and classification.

      The RFo classification technique is also applied in image processing detecting defect features. The logic of the DT algorithm is reported by the following pseudocode:

      Decision_Tree Function. 1. Compute Gain values for all attributes and select an attribute having the highest value creating a node for that attribute. 2. Make a branch from this node for every value of the attribute. 3. Assign all possible values of the attributes to branches.

      Follow each branch partitioning the dataset to be only instances whereby the value of the branch is present (or for similar values) and then go back to 1.

Schematic illustration of (a) LSTM unit cell. (b) LSTM network and its memory.

      (1.32)equation

v 1 v 2 v

Скачать книгу