When optimizing the performance of neural network-based chatbots, the choice of optimizer is one of the most important decisions. Optimizers control how model parameters such as weights and biases are adjusted to minimize a loss function during training. Adaptive optimizers such as Adam have become a standard choice and are widely used because the magnitude of their parameter updates is invariant to rescaling of the gradient, but they often generalize poorly. Alternatively, Stochastic Gradient Descent (SGD) with momentum and AdamW, an extension of Adam, offer several advantages. This study compares and examines the effects of these optimizers on the chatbot CST dataset. The effectiveness of each optimizer is evaluated by its sparse categorical loss during training and its BLEU score at inference, using a neural generative model with attention based on an additive scoring function. Despite memory constraints that limited AdamW to ten epochs, this optimizer showed promising results compared to configurations using early stopping. SGD provided higher BLEU scores, indicating better generalization, but was very time-consuming. The results highlight the importance of balancing optimization performance against computational efficiency, positioning AdamW as a promising alternative when training efficiency and generalization are primary concerns.
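To make the contrast between the three optimizers concrete, the following is a minimal sketch of their textbook update rules for a single scalar parameter. It is not the study's training code: the function names, the `state` dictionary, and all hyperparameter defaults are illustrative assumptions.

```python
import math


def sgd_momentum_step(w, g, state, lr=0.01, momentum=0.9):
    """One SGD-with-momentum update for a scalar weight w with gradient g.

    `state` is a hypothetical dict holding the velocity between calls.
    """
    v = momentum * state.get("v", 0.0) + g
    state["v"] = v
    return w - lr * v


def adam_step(w, g, state, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8,
              weight_decay=0.0, decoupled=False):
    """One Adam update; with decoupled=True the decay is applied AdamW-style.

    All defaults are illustrative assumptions, not values from the study.
    """
    t = state.get("t", 0) + 1
    if weight_decay and not decoupled:
        # Adam with L2 regularization: the penalty is folded into the gradient
        # and is therefore rescaled by the adaptive denominator below.
        g = g + weight_decay * w
    m = beta1 * state.get("m", 0.0) + (1 - beta1) * g      # first moment
    v = beta2 * state.get("v", 0.0) + (1 - beta2) * g * g  # second moment
    state.update(t=t, m=m, v=v)
    m_hat = m / (1 - beta1 ** t)  # bias correction
    v_hat = v / (1 - beta2 ** t)
    w_new = w - lr * m_hat / (math.sqrt(v_hat) + eps)
    if weight_decay and decoupled:
        # AdamW: the decay acts directly on the weight, outside the
        # adaptive scaling.
        w_new -= lr * weight_decay * w
    return w_new


if __name__ == "__main__":
    # Scaling the gradient by 100 barely changes Adam's first step,
    # while the SGD step grows in proportion to the gradient.
    for grad in (5.0, 500.0):
        print("grad", grad,
              "adam", adam_step(1.0, grad, {}),
              "sgd", sgd_momentum_step(1.0, grad, {}))
```

In this sketch the only difference between Adam with an L2 penalty and AdamW is where the decay term enters: inside the gradient before the adaptive scaling, or directly on the weight afterwards.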
Abstract
The research attempted to find an explanation for, and a solution to, the fluctuation and decrease in the rate of return on assets of the sample banks during the research period. The research started from the hypothesis that salary domiciliation affects the banking profitability of a sample of Iraqi banks participating in the salary settlement system for the period 2016-2019. The research used the descriptive historical approach, the quantitative analytical approach, and the statistical approach. The research reached a set of conclusions, the most important of which is that the effect of salary domiciliation on banking profitability was achieved in three banks...
The experiment was carried out in the greenhouse of the botanical garden of the Department of Biology, College of Education for Pure Science Ibn Al-Haitham, University of Baghdad, during the 2017-2018 growing season to evaluate the effect of lead stress at concentrations of 0, 50, 100, and 150 mg.L-1 and selenium at concentrations of 0, 15, and 30 mg.L-1 on the growth of dill plants grown in pots. The experiment followed a completely randomized design (CRD) with three replications. Results indicated that subjecting dill plants to lead stress at high concentrations caused a decrease in plant parameters (plant height, no. of branches.plant-1, root length, shoot dry weight, the content of nitrogen, phosphorus and potassium, protein concentration, no. of umbe...
The research problem arose from the researcher's experience in gymnastics and from the lack of educational models that give the student an important role in the educational process. It therefore became necessary to identify the prevailing style among students and to diversify the use of educational models based on scientific theories, including the Daniel Document model, which is based on three theories of learning: structural, behavioral, and meaningful learning. The research aimed to identify the effect of using the Daniel model for people with two types of brain control (left and right) on learning the cartwheel skill in artistic gymnastics for second-stage students. The researcher used the experi...
The research problem lies in the fundamental questions that revolve around the role of each of the tools of promotion, namely advertising, personal selling, public relations, sales promotion, and direct marketing, in achieving leadership for business organizations. The research seeks to determine the role of service promotion in the researched company and whether that promotion is capable of leading the company to leadership. For this purpose, three research hypotheses were formulated. The first hypothesis states that there is a significant impact relationship between promotion and entrepreneurship. The second hypothesis aimed to determine the role played by promotion in the researched company to achieve unique...
The research paper aims to highlight the impact of electronic governance on improving the quality of auditing through accounting disclosure, and how to use it to resolve many of the problems faced by economic units in general and financial problems in particular. It focuses on the loss of confidence and credibility in the financial information of economic units. The study was carried out by applying many of the principles and rules of electronic governance, the most important of which is accounting disclosure, and hence the accounting dimensions of electronic governance through the achievement of ac...
The paper presents the design of a control structure that enables integration of a kinematic neural controller for trajectory tracking of a nonholonomic differential two-wheeled mobile robot, then proposes a kinematic neural controller to direct a National Instruments mobile robot (NI Mobile Robot). The controller drives the actual velocity of the wheeled mobile robot toward the required velocity by guaranteeing that the trajectory-tracking mean square error converges to a minimum. The proposed tracking control system consists of two layers: the first is a multi-layer perceptron neural network that controls the mobile robot to track the required path, and the second is an optimization layer, which is impleme...
An adaptive nonlinear neural controller to reduce nonlinear flutter in a 2-D wing is proposed in this paper. The nonlinearities in the system come from the quasi-steady aerodynamic model and the torsional spring in the pitch direction. Time-domain simulations are used to examine the dynamic aeroelastic instabilities of the system (e.g., the onset of flutter and limit cycle oscillation, LCO). The controller consists of two models: the modified Elman neural network (MENN) and the feedforward multi-layer perceptron (MLP). The MENN model is trained in off-line and on-line stages to guarantee that its outputs accurately represent the plunge and pitch motion of the wing; this neural model acts as the identifier. Th...
A sloped solar chimney system is a solar chimney power plant with a sloped collector. In practice, the sloped collector can function as a chimney, so the chimney height and the construction cost can be reduced. The continuity, Navier-Stokes, energy, and radiation transfer equations have been solved using Fluent software. The governing equations are solved for incompressible, 3-D, steady, turbulent standard model with Boussinesq approximation to model the sloped solar chimney system in this study and to evaluate numerically, using Fluent (14) software, the performance of a solar chimney power plant in Baghdad, Iraq, under working conditions such as solar radiation intensity (30...
Fiber reinforced polymer (FRP) composite is an important material for structural applications. The diversified application of FRP composites has made them a focus of interdisciplinary research. However, improvements to the mechanical properties of this class of materials are still under research for different applications. In this paper we have modified the epoxy matrix with Al2O3, SiO2, and TiO2 nanoparticles in a glass fiber/epoxy composite to improve its mechanical and physical properties. The composites are fabricated by the hand lay-up method. It is observed that mechanical properties such as flexural strength and hardness are higher for the SiO2-modified epoxy composite compared to other nano...
This study seeks to identify the possibility of achieving the property of faithful representation of accounting information and to measure it using the standard approach based on mathematical and statistical equations, by comparing two financial periods before and after the application of IFRS 15 (Revenue from Contracts with Customers) during the period 2014-2018, for the financial statements of the mixed joint stock companies listed on the Iraq Stock Exchange. These companies are one of the main pillars of the economic structure of the country, as a joint investment between the state and the private sector, and are important in many respects, including support for projects of public companies, absorption and employment of labor, as well as ra...