DK3552156T3 - Neural episodic control - Google Patents

Neural episodic control

Publication number
DK3552156T3
Authority
DK
Denmark
Prior art keywords
neural
episodic control
episodic
control
neural episodic
Prior art date
Application number
DK18707703.7T
Other languages
English (en)
Inventor
Alexander Pritzel
Charles Blundell
Adria Puigdomenech Badia
Benigno Uria-Martínez
Original Assignee
Deepmind Tech Ltd
Priority date
2017-02-24
Filing date
2018-02-26
Publication date
2022-08-22
Application filed by Deepmind Tech Ltd
Application granted
Publication of DK3552156T3

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/084 Backpropagation, e.g. using gradient descent
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/004 Artificial life, i.e. computing arrangements simulating life
    • G06N3/006 Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/044 Recurrent networks, e.g. Hopfield networks
    • G06N3/0442 Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/0464 Convolutional networks [CNN, ConvNet]
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/092 Reinforcement learning
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/044 Recurrent networks, e.g. Hopfield networks
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00 Computing arrangements using knowledge-based models
    • G06N5/01 Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Image Analysis (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
DK18707703.7T 2017-02-24 2018-02-26 Neural episodic control DK3552156T3 (da)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201762463558P 2017-02-24 2017-02-24
PCT/EP2018/054624 WO2018154100A1 (en) 2017-02-24 2018-02-26 Neural episodic control

Publications (1)

Publication Number Publication Date
DK3552156T3 (da) 2022-08-22

Family

ID=61386852

Family Applications (1)

Application Number Title Priority Date Filing Date
DK18707703.7T DK3552156T3 (da) 2017-02-24 2018-02-26 Neural episodic control

Country Status (6)

Country Link
US (2) US10664753B2 (da)
EP (2) EP3552156B8 (da)
JP (2) JP6817456B2 (da)
CN (1) CN110235149B (da)
DK (1) DK3552156T3 (da)
WO (1) WO2018154100A1 (da)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3523760B1 (en) * 2016-11-04 2024-01-24 DeepMind Technologies Limited Reinforcement learning systems
CA3103470A1 (en) 2018-06-12 2019-12-19 Intergraph Corporation Artificial intelligence applications for computer-aided dispatch systems
US11455530B2 (en) * 2018-11-20 2022-09-27 Google Llc Controlling agents using scene memory data
WO2020132693A1 (en) * 2018-12-21 2020-06-25 Waymo Llc Searching an autonomous vehicle sensor data repository
WO2022069747A1 (en) * 2020-10-02 2022-04-07 Deepmind Technologies Limited Training reinforcement learning agents using augmented temporal difference learning
CN112476424B (zh) 2020-11-13 2025-05-09 腾讯科技(深圳)有限公司 Robot control method, apparatus, device, and computer storage medium
US12197214B2 (en) * 2021-03-23 2025-01-14 Honda Motor Co., Ltd. System and method for completing continual multi-agent trajectory forecasting
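KR102821722B1 (ko) * 2021-08-09 2025-06-17 한국전자통신연구원 Apparatus and method for artificial intelligence learning based on episodic memory
CN117274732B (zh) * 2023-09-18 2024-07-16 广东石油化工学院 Method and system for constructing an optimized diffusion model driven by episodic memory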

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9189409B2 (en) * 2013-02-19 2015-11-17 Avago Technologies General Ip (Singapore) Pte. Ltd. Reducing writes to solid state drive cache memories of storage controllers
US9679258B2 (en) * 2013-10-08 2017-06-13 Google Inc. Methods and apparatus for reinforcement learning
US10628733B2 (en) * 2015-04-06 2020-04-21 Deepmind Technologies Limited Selecting reinforcement learning actions using goals and observations
US20180165602A1 (en) * 2016-12-14 2018-06-14 Microsoft Technology Licensing, Llc Scalability of reinforcement learning by separation of concerns

Also Published As

Publication number Publication date
US20190303764A1 (en) 2019-10-03
WO2018154100A1 (en) 2018-08-30
CN110235149A (zh) 2019-09-13
JP2020508527A (ja) 2020-03-19
US10664753B2 (en) 2020-05-26
EP3552156B8 (en) 2022-08-03
JP7038790B2 (ja) 2022-03-18
EP4057189A1 (en) 2022-09-14
EP3552156A1 (en) 2019-10-16
CN110235149B (zh) 2023-07-07
EP3552156B1 (en) 2022-06-22
US20200265317A1 (en) 2020-08-20
JP6817456B2 (ja) 2021-01-20
US11720796B2 (en) 2023-08-08
JP2021064387A (ja) 2021-04-22

Similar Documents

Publication Publication Date Title
PL3827381T3 (pl) Multi-qubit control
DK3552156T3 (da) Neural episodic control
EP3570139A4 (en) CONTROL DEVICE
EP3677983A4 (en) REMOTE CONTROL
EP3587053A4 (en) ROBOT CONTROL DEVICE
EP3582036A4 (en) CONTROL DEVICE
DK3502823T3 (da) Control valve
EP3614216A4 (en) CONTROL UNIT
PL2993964T5 (pl) Lighting control
EP3418851C0 (en) CONTROL BODY
EP3336820A4 (en) BLUETOOTH REMOTE CONTROL
LT3478563T (lt) Tractor approach control
EP3722906A4 (en) DEVICE MOTION CONTROL
EP3592990C0 (de) Control device
LT3393753T (lt) Monolithic remote control
EP3648602A4 (en) PEST CONTROL SYSTEM
EP3588213A4 (en) CONTROL DEVICE
GB2581751B (en) Weed control
EP3575895A4 (en) CONTROL DEVICE
DK3510465T3 (da) Operating device
EP3185657C0 (en) CONTROLLER
DK3345171T3 (da) Remote control device
EP3639248C0 (en) LOW COST FLOW CONTROL
EP3557368A4 (en) CONTROL DEVICE
EP3553881A4 (en) Remote controller