The IEEE Transactions on Neural Networks and Learning Systems publishes technical articles that deal with the theory, design, and applications of neural networks and related learning systems.
Journal Citation Metrics Journal Citation Metrics such as Impact Factor, Eigenfactor Score™ and Article Influence Score™ are available where applicable. Each year, Journal Citation Reports© (JCR) from Thomson Reuters examines the influence and impact of scholarly research journals. JCR reveals the relationship between citing and cited journals, offering a systematic, objective means to evaluate the world's leading journals. Find out more about IEEE Journal Rankings.
In order to support the world-wide efforts in flighting the COVID-19, the IEEE Computational Intelligence Society (IEEE CIS) has set up a program, the COVID 19 Initiative. Under this initiative, the IEEE TNNLS will expedite, to the extent possible, the processing of all articles submitted to TNNLS with primary focus on COVID 19. Here are the important information:
- We have set-up a special Fast-Track under IEEE TNNLS to process COVID-19 focused manuscripts. All papers submitted to this Fast Track will be undergone a fast review process, with the targeted first decision within 4 weeks. If the paper can go to the revision stage, the author(s) then have 2 weeks of revision time, followed by another round of review within 3 weeks to reach a final decision. That is to say, we target to reach a final decision for all the Fast Track manuscripts within 9 weeks.
- When you decide to submit to this special Fast Track, please kindly make sure you select the Paper type "Fast Track: COVID-19 Focused Papers". Also, please make sure that your manuscript must be within the scope of IEEE TNNLS, as well as with a research focus on COVID-19.
- If accepted, TNNLS will arrange to publish and print such articles immediately. Furthermore, all such articles will be published, free-of-charge to authors and readers, as free access for one year from the date of the publication to enable the research findings to be disseminated widely and freely to other researchers and the community at large.
We look forward to your submissions and support to TNNLS!
Call for Special Issues
IEEE TNNLS Special Issue on "Deep Neural Networks for Graphs: Theory, Models, Algorithms and Applications," Guest Editors: Ming Li, Zhejiang Normal University, China; Alessio Micheli, University of Pisa, Italy; Yu Guang Wang, Max Planck Institute for Mathematics in the Sciences, Germany; Shirui Pan, Monash University, Australia; Pietro Liò, University of Cambridge, UK; Giorgio Stefano Gnecco, IMT School for Advanced Studies, AXES Research Unit, Italy; Marcello Sanguineti, University of Genoa, Italy. Submission Deadline: July 31, 2021. [Call for Papers]
IEEE TNNLS Special Issue on "Deep Learning for Earth and Planetary Geosciences," Guest Editors: Antonio Paiva, ExxonMobil Research and Engineering, USA; Weichang Li, Aramco Research Center, USA; Maarten V. de Hoop, Rice University, USA; Chris A. Mattmann, NASA/JPL, USA; Youzuo Lin, Los Alamos National Laboratory, USA. Submission Deadline: March 12, 2021. [Call for Papers]
The Boundedness Conditions for Model-Free HDP( λ )
Authors: Seaar Al-Dabooni, Donald Wunsch
Publication: IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
Issue: Volume 30, Issue 7 – July 2019
Abstract: This paper provides the stability analysis for a model-free action-dependent heuristic dynamic programing (HDP) approach with an eligibility trace long-term prediction parameter (λ). HDP(λ) learns from more than one future reward. Eligibility traces have long been popular in Q-learning. This paper proves and demonstrates that they are worthwhile to use with HDP. In this paper, we prove its uniformly ultimately bounded (UUB) property under certain conditions. Previous works present a UUB proof for traditional HDP [HDP(λ = 0)], but we extend the proof with the λ parameter. By using Lyapunov stability, we demonstrate the boundedness of the estimated error for the critic and actor neural networks as well as learning rate parameters. Three case studies demonstrate the effectiveness of HDP(λ). The trajectories of the internal reinforcement signal nonlinear system are considered as the first case. We compare the results with the performance of HDP and traditional temporal difference [TD(λ)] with different λ values. The second case study is a single-link inverted pendulum. We investigate the performance of the inverted pendulum by comparing HDP(λ) with regular HDP, with different levels of noise. The third case study is a 3-D maze navigation benchmark, which is compared with state action reward state action, Q(λ), HDP, and HDP(λ). All these simulation results illustrate that HDP(λ) has a competitive performance; thus this contribution is not only UUB but also useful in comparison with traditional HDP.
Index Terms: λ-return, action dependent (AD), approximate dynamic programing (ADP), heuristic dynamic programing (HDP), Lyapunov stability, model free, uniformly ultimately bounded (UUB)
IEEE Xplore Link: https://ieeexplore.ieee.org/document/8528554