0
RESEARCH PAPERS

Using Q-Learning and Genetic Algorithms to Improve the Efficiency of Weight Adjustments for Optimal Control and Design Problems

[+] Author and Article Information
Kaivan Kamali1

Laboratory for Intelligent Agents, College of Information Sciences and Technology, The Pennsylvania State University, University Park, PA 16802kxk302@psu.edu

L. J. Jiang

Structural Dynamics and Control Laboratory, Department of Mechanical and Nuclear Engineering, The Pennsylvania State University, University Park, PA 16802lxj148@psu.edu

John Yen

Laboratory for Intelligent Agents, College of Information Sciences and Technology, The Pennsylvania State University, University Park, PA 16802juy1@psu.edu

K. W. Wang

Structural Dynamics and Control Laboratory, Department of Mechanical and Nuclear Engineering, The Pennsylvania State University, University Park, PA 16802kxw2@psu.edu

1

Corresponding author.

J. Comput. Inf. Sci. Eng 7(4), 302-308 (Apr 08, 2007) (7 pages) doi:10.1115/1.2739502 History: Received August 11, 2005; Revised April 08, 2007

In traditional optimal control and design problems, the control gains and design parameters are usually derived to minimize a cost function reflecting the system performance and control effort. One major challenge of such approaches is the selection of weighting matrices in the cost function, which are usually determined via trial-and-error and human intuition. While various techniques have been proposed to automate the weight selection process, they either can not address complex design problems or suffer from slow convergence rate and high computational costs. We propose a layered approach based on Q-learning, a reinforcement learning technique, on top of genetic algorithms (GA) to determine the best weightings for optimal control and design problems. The layered approach allows for reuse of knowledge. Knowledge obtained via Q-learning in a design problem can be used to speed up the convergence rate of a similar design problem. Moreover, the layered approach allows for solving optimizations that cannot be solved by GA alone. To test the proposed method, we perform numerical experiments on a sample active-passive hybrid vibration control problem, namely adaptive structures with active-passive hybrid piezoelectric networks. These numerical experiments show that the proposed Q-learning scheme is a promising approach for automation of weight selection for complex design problems.

FIGURES IN THIS ARTICLE
<>
Copyright © 2007 by American Society of Mechanical Engineers
Your Session has timed out. Please sign back in to continue.

References

Figures

Grahic Jump Location
Figure 1

Overall view of the proposed methodology

Grahic Jump Location
Figure 2

GA Computation of optimal passive design variables

Grahic Jump Location
Figure 3

Schematic of the system with APPN

Tables

Errata

Discussions

Some tools below are only available to our subscribers or users with an online account.

Related Content

Customize your page view by dragging and repositioning the boxes below.

Related Journal Articles
Related eBook Content
Topic Collections

Sorry! You do not have access to this content. For assistance or to subscribe, please contact us:

  • TELEPHONE: 1-800-843-2763 (Toll-free in the USA)
  • EMAIL: asmedigitalcollection@asme.org
Sign In