References for: SAC-TD3Hybrid

Full identifier: https://w3id.org/np/RAgOeYsOZ9JMrf7eZXm9LfwjGMukSuzSpk6eeHFRpRlNo#SAC-TD3Hybrid

Nanopublication Part Subject Predicate Object Published By Published On
links a nanopublication to its assertion http://www.nanopub.org/nschema#hasAssertion assertion
SAC-TD3Hybrid
Tobias Kuhn
2024-10-04T09:20:40.120Z
links a nanopublication to its assertion http://www.nanopub.org/nschema#hasAssertion assertion
SAC-TD3Hybrid
(unknown)
2024-10-02T09:51:13.183Z
links a nanopublication to its assertion http://www.nanopub.org/nschema#hasAssertion assertion
SAC-TD3Hybrid
SAC-TD3 Hybrid
(unknown)
2024-10-02T09:51:13.183Z
links a nanopublication to its assertion http://www.nanopub.org/nschema#hasAssertion assertion
SAC-TD3Hybrid
(unknown)
2024-10-02T09:51:13.183Z
links a nanopublication to its assertion http://www.nanopub.org/nschema#hasAssertion assertion
SAC-TD3Hybrid
SAC-TD3 is a reinforcement learning algorithm that combines elements of the SAC (Soft Actor-Critic) and TD3 (Twin Delayed Deep Deterministic Policy Gradient) algorithms. It adopts entropy regularization from SAC and borrows target policy smoothing and delayed policy updates from TD3. It has been used to estimate the reaction barrier on a given potential energy surface. Stable states in complex systems correspond to local minima on the associated potential energy surface, and transitions between these minima govern the dynamics of the system. Precisely determining the transition pathways in complex, high-dimensional systems is challenging because these transitions are rare events: the system remains near a local minimum most of the time. The probability of such a transition decreases exponentially with the height of the energy barrier, which makes the system's dynamics highly sensitive to the calculated energy barriers. The barrier-estimation problem is formulated as a cost-minimization problem and solved with the SAC-TD3 algorithm. The exploratory nature of the algorithm enables efficient sampling and a better estimate of the minimum energy barrier for transitions.
(unknown)
2024-10-02T09:51:13.183Z
links a nanopublication to its assertion http://www.nanopub.org/nschema#hasAssertion assertion
SAC-TD3Hybrid
(unknown)
2024-10-02T09:51:13.183Z
links a nanopublication to its assertion http://www.nanopub.org/nschema#hasAssertion assertion
SAC-TD3Hybrid
(unknown)
2024-10-02T09:51:13.183Z
links a nanopublication to its pubinfo http://www.nanopub.org/nschema#hasPublicationInfo pubinfo
SAC-TD3Hybrid
(unknown)
2024-10-02T09:51:13.183Z
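
The SAC-TD3Hybrid description above notes that the transition probability decreases exponentially with barrier height, so small errors in a calculated barrier have a large effect. The sketch below illustrates that sensitivity. It assumes a Boltzmann factor of the form exp(-barrier / (k_B * T)), which the description does not state explicitly. The function names, the 300 K default temperature, and the example barrier values are hypothetical choices made for illustration only.

```python
import math

# Boltzmann constant in eV/K (CODATA value).
K_B_EV_PER_K = 8.617333262e-5


def relative_transition_probability(barrier_ev, temperature_k=300.0):
    """Boltzmann factor exp(-barrier / (k_B * T)).

    Assumed functional form for illustration; the source only says the
    probability decreases exponentially with barrier height.
    """
    if temperature_k <= 0:
        raise ValueError("temperature_k must be positive")
    return math.exp(-barrier_ev / (K_B_EV_PER_K * temperature_k))


def sensitivity_ratio(barrier_ev, barrier_error_ev, temperature_k=300.0):
    """Factor by which the probability is underestimated when the
    barrier is overestimated by barrier_error_ev."""
    exact = relative_transition_probability(barrier_ev, temperature_k)
    biased = relative_transition_probability(
        barrier_ev + barrier_error_ev, temperature_k
    )
    return exact / biased


if __name__ == "__main__":
    # Hypothetical numbers: a 0.5 eV barrier overestimated by 0.1 eV.
    print(sensitivity_ratio(0.5, 0.1))
```

Under this assumed form the ratio depends only on the barrier error and the temperature, not on the barrier itself, which is one way to see why an accurate estimate of the minimum energy barrier matters.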