PL3399418T3 - Wykonywanie drobnoziarnistej komunikacji obliczeniowej do struktur uczenia głębokiego - Google Patents

Wykonywanie drobnoziarnistej komunikacji obliczeniowej do struktur uczenia głębokiego

Info

Publication number
PL3399418T3
PL3399418T3 PL18170151.7T PL18170151T PL3399418T3 PL 3399418 T3 PL3399418 T3 PL 3399418T3 PL 18170151 T PL18170151 T PL 18170151T PL 3399418 T3 PL3399418 T3 PL 3399418T3
Authority
PL
Poland
Prior art keywords
grain
fine
deep learning
communication execution
learning frameworks
Prior art date
Application number
PL18170151.7T
Other languages
English (en)
Inventor
Srinivas Sridharan
Dheevatsa Mudigere
Original Assignee
Intel Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Intel Corporation filed Critical Intel Corporation
Publication of PL3399418T3 publication Critical patent/PL3399418T3/pl

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/54Interprogram communication
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00General purpose image data processing
    • G06T1/20Processor architectures; Processor configuration, e.g. pipelining
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/098Distributed learning, e.g. federated learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • G06N3/0442Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/06Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
    • G06N3/063Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/09Supervised learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T15/00Three-dimensional [3D] image rendering
    • G06T15/005General purpose rendering architectures
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T15/00Three-dimensional [3D] image rendering
    • G06T15/04Texture mapping
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T15/00Three-dimensional [3D] image rendering
    • G06T15/50Lighting effects
    • G06T15/80Shading
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T17/00Three-dimensional [3D] modelling for computer graphics
    • G06T17/10Constructive solid geometry [CSG] using solid primitives, e.g. cylinders, cubes
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T17/00Three-dimensional [3D] modelling for computer graphics
    • G06T17/20Finite element generation, e.g. wire-frame surface description, tesselation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • General Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Neurology (AREA)
  • Computer Graphics (AREA)
  • Image Processing (AREA)
  • Image Generation (AREA)
  • Combined Controls Of Internal Combustion Engines (AREA)
  • Electrically Operated Instructional Devices (AREA)
PL18170151.7T 2017-05-05 2018-04-30 Wykonywanie drobnoziarnistej komunikacji obliczeniowej do struktur uczenia głębokiego PL3399418T3 (pl)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201762502453P 2017-05-05 2017-05-05
US15/869,502 US12154028B2 (en) 2017-05-05 2018-01-12 Fine-grain compute communication execution for deep learning frameworks via hardware accelerated point-to-point primitives

Publications (1)

Publication Number Publication Date
PL3399418T3 true PL3399418T3 (pl) 2023-04-17

Family

ID=62196323

Family Applications (1)

Application Number Title Priority Date Filing Date
PL18170151.7T PL3399418T3 (pl) 2017-05-05 2018-04-30 Wykonywanie drobnoziarnistej komunikacji obliczeniowej do struktur uczenia głębokiego

Country Status (5)

Country Link
US (1) US12154028B2 (pl)
EP (2) EP4089537B1 (pl)
CN (1) CN108805798B (pl)
ES (1) ES2939271T3 (pl)
PL (1) PL3399418T3 (pl)

Families Citing this family (142)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA3108151C (en) * 2017-02-23 2024-02-20 Cerebras Systems Inc. Accelerated deep learning
WO2018183553A1 (en) 2017-03-29 2018-10-04 Fungible, Inc. Non-blocking any-to-any data center network having multiplexed packet spraying within access node groups
WO2018183526A1 (en) 2017-03-29 2018-10-04 Fungible, Inc. Non-blocking, full-mesh data center network having optical permutors
WO2018183542A1 (en) 2017-03-29 2018-10-04 Fungible, Inc. Non-blocking any-to-any data center network with packet spraying over multiple alternate data paths
US11037330B2 (en) * 2017-04-08 2021-06-15 Intel Corporation Low rank matrix compression
CN117971715A (zh) 2017-04-10 2024-05-03 微软技术许可有限责任公司 多处理器系统中的中继一致存储器管理
US12154028B2 (en) 2017-05-05 2024-11-26 Intel Corporation Fine-grain compute communication execution for deep learning frameworks via hardware accelerated point-to-point primitives
US10579762B2 (en) * 2017-05-15 2020-03-03 LegUp Computing Inc. High-level synthesis (HLS) method and apparatus to specify pipeline and spatial parallelism in computer hardware
CN117348976A (zh) 2017-07-10 2024-01-05 微软技术许可有限责任公司 用于流处理的数据处理单元
US10659254B2 (en) 2017-07-10 2020-05-19 Fungible, Inc. Access node integrated circuit for data centers which includes a networking unit, a plurality of host units, processing clusters, a data network fabric, and a control network fabric
US12341687B2 (en) 2017-09-29 2025-06-24 Microsoft Technology Licensing, Llc Reliable fabric control protocol extensions for data center networks with failure resilience
US12278763B2 (en) 2017-09-29 2025-04-15 Microsoft Technology Licensing, Llc Fabric control protocol with congestion control for data center networks
US12212495B2 (en) 2017-09-29 2025-01-28 Microsoft Technology Licensing, Llc Reliable fabric control protocol extensions for data center networks with unsolicited packet spraying over multiple alternate data paths
US10965586B2 (en) 2017-09-29 2021-03-30 Fungible, Inc. Resilient network communication using selective multipath packet flow spraying
US11178262B2 (en) 2017-09-29 2021-11-16 Fungible, Inc. Fabric control protocol for data center networks with packet spraying over multiple alternate data paths
US12231353B2 (en) 2017-09-29 2025-02-18 Microsoft Technology Licensing, Llc Fabric control protocol for data center networks with packet spraying over multiple alternate data paths
US12294470B2 (en) 2017-09-29 2025-05-06 Microsoft Technology Licensing, Llc Fabric control protocol for large-scale multi-stage data center networks
WO2019104090A1 (en) 2017-11-21 2019-05-31 Fungible, Inc. Work unit stack data structures in multiple core processor system for stream data processing
WO2019152063A1 (en) 2018-02-02 2019-08-08 Fungible, Inc. Efficient work unit processing in a multicore system
US12154025B1 (en) * 2018-02-13 2024-11-26 EMC IP Holding Company LLC Optimization of graphics processing unit memory for deep learning computing
US11630994B2 (en) * 2018-02-17 2023-04-18 Advanced Micro Devices, Inc. Optimized asynchronous training of neural networks using a distributed parameter server with eager updates
US10713021B2 (en) * 2018-03-05 2020-07-14 Apple Inc. Geometric 64-bit capability pointer
US11514371B2 (en) * 2018-03-13 2022-11-29 Woven Planet North America, Inc. Low latency image processing using byproduct decompressed images
US11361213B1 (en) 2018-04-20 2022-06-14 Perceive Corporation Using lookup table to represent neural network activation function
US11568227B1 (en) 2018-04-20 2023-01-31 Perceive Corporation Neural network inference circuit read controller with multiple operational modes
US11783167B1 (en) 2018-04-20 2023-10-10 Perceive Corporation Data transfer for non-dot product computations on neural network inference circuit
US11586910B1 (en) 2018-04-20 2023-02-21 Perceive Corporation Write cache for neural network inference circuit
US12518146B1 (en) 2018-04-20 2026-01-06 Amazon Technologies, Inc. Address decoding by neural network inference circuit read controller
US10977338B1 (en) 2018-04-20 2021-04-13 Perceive Corporation Reduced-area circuit for dot product computation
US11481612B1 (en) 2018-04-20 2022-10-25 Perceive Corporation Storage of input values across multiple cores of neural network inference circuit
US12093696B1 (en) 2018-04-20 2024-09-17 Perceive Corporation Bus for transporting output values of a neural network layer to cores specified by configuration data
US20200125958A1 (en) * 2018-10-19 2020-04-23 Preferred Networks, Inc. Training apparatus, training method, inference apparatus, inference method, and non-transitory computer readable medium
CN111078291B (zh) * 2018-10-19 2021-02-09 中科寒武纪科技股份有限公司 运算方法、系统及相关产品
US10929175B2 (en) 2018-11-21 2021-02-23 Fungible, Inc. Service chaining hardware accelerators within a data stream processing integrated circuit
US11922314B1 (en) * 2018-11-30 2024-03-05 Ansys, Inc. Systems and methods for building dynamic reduced order physical models
US11521067B2 (en) * 2018-11-30 2022-12-06 International Business Machines Corporation Decentralized distributed deep learning
US11995533B1 (en) 2018-12-05 2024-05-28 Perceive Corporation Executing replicated neural network layers on inference circuit
CN113841165B (zh) * 2018-12-17 2025-02-14 芯成半导体(开曼)有限公司 用于训练人工神经网络的系统和方法
CN111381979B (zh) * 2018-12-29 2023-05-23 杭州海康威视数字技术股份有限公司 神经网络的开发验证方法、装置、系统及存储介质
US11093438B2 (en) * 2019-01-07 2021-08-17 International Business Machines Corporation Pipelining multi-directional reduction
EP3881245B1 (en) * 2019-01-14 2024-09-18 Siemens Aktiengesellschaft Hardware accelerator extension to transfer learning - extending/finishing training to the edge
EP3889846A4 (en) * 2019-01-16 2022-06-01 Huawei Cloud Computing Technologies Co., Ltd. METHOD AND SYSTEM FOR TRAINING DEEP LEARNING MODELS
CN109783412B (zh) * 2019-01-18 2022-04-22 电子科技大学 一种深度强化学习加速训练的方法
CN109919322B (zh) * 2019-02-01 2022-01-28 京微齐力(北京)科技有限公司 一种测试系统芯片上的人工智能模块的方法和系统芯片
CN111526169B (zh) * 2019-02-01 2022-06-14 阿里巴巴集团控股有限公司 通过网络发送数据的方法、介质、服务器和计算机设备
US11748599B2 (en) * 2019-02-21 2023-09-05 Texas Instruments Incorporated Super-tiling in neural network processing to enable analytics at lower memory speed
DE112020001253T5 (de) * 2019-03-15 2021-12-09 Nvidia Corporation Techniken zum Trainieren eines neuronalen Netzes unter Verwendung von Transformationen
US11036545B2 (en) * 2019-03-15 2021-06-15 Intel Corporation Graphics systems and methods for accelerating synchronization using fine grain dependency check and scheduling optimizations based on available shared memory space
CN111722937B (zh) * 2019-03-21 2024-05-10 阿里巴巴集团控股有限公司 深度学习权重更新方法、装置
CN110096356B (zh) * 2019-03-22 2022-06-03 北京达佳互联信息技术有限公司 资源调度方法、装置、电子设备及存储介质
US11783176B2 (en) * 2019-03-25 2023-10-10 Western Digital Technologies, Inc. Enhanced storage device memory architecture for machine learning
US10996976B2 (en) * 2019-04-05 2021-05-04 Alibaba Group Holding Limited Systems and methods for scheduling neural networks by varying batch sizes
CN110008028B (zh) * 2019-04-10 2021-08-06 北京旷视科技有限公司 计算资源分配方法、装置、计算机设备和存储介质
US11868901B1 (en) 2019-05-21 2024-01-09 Percieve Corporation Compiler for optimizing memory allocations within cores
US12436804B2 (en) 2019-05-28 2025-10-07 Micron Technology, Inc. Memory as a service for artificial neural network (ANN) applications
US11061819B2 (en) 2019-05-28 2021-07-13 Micron Technology, Inc. Distributed computing based on memory as a service
US11334387B2 (en) 2019-05-28 2022-05-17 Micron Technology, Inc. Throttle memory as a service based on connectivity bandwidth
US11256624B2 (en) 2019-05-28 2022-02-22 Micron Technology, Inc. Intelligent content migration with borrowed memory
KR102351087B1 (ko) * 2019-06-04 2022-01-14 주식회사 딥엑스 인공신경망의 데이터 로컬리티 기반의 데이터 캐슁을 이용하여 고속의 인공신경망 오퍼레이션을 지원하는 데이터 관리 장치
US11016775B2 (en) * 2019-06-26 2021-05-25 Amazon Technologies, Inc. Neural network operation reordering for parallel execution
US12190225B2 (en) * 2019-06-27 2025-01-07 Advanced Micro Devices, Inc. Composable neural network kernels
US12265915B2 (en) * 2019-06-27 2025-04-01 Advanced Micro Devices, Inc. Composable neural network kernels
US11054997B2 (en) * 2019-08-12 2021-07-06 Micron Technology, Inc. Artificial neural networks in memory
US12061971B2 (en) 2019-08-12 2024-08-13 Micron Technology, Inc. Predictive maintenance of automotive engines
US12249189B2 (en) 2019-08-12 2025-03-11 Micron Technology, Inc. Predictive maintenance of automotive lighting
US11042350B2 (en) 2019-08-21 2021-06-22 Micron Technology, Inc. Intelligent audio control in vehicles
US12497055B2 (en) 2019-08-21 2025-12-16 Micron Technology, Inc. Monitoring controller area network bus for vehicle control
US12210401B2 (en) 2019-09-05 2025-01-28 Micron Technology, Inc. Temperature based optimization of data storage operations
CN110618870B (zh) * 2019-09-20 2021-11-19 广东浪潮大数据研究有限公司 一种深度学习训练任务的工作方法及装置
US11443243B2 (en) * 2019-10-10 2022-09-13 Baidu Usa Llc Method and system for artificial intelligence model training using a watermark-enabled kernel for a data processing accelerator
CN110990323B (zh) * 2019-10-17 2023-09-15 尧芯微半导体(重庆)有限公司 一种优化的xhci调度方法
CN110837891B (zh) * 2019-10-23 2022-05-17 南京大学 基于simd架构的自组织映射方法及系统
US12020149B2 (en) * 2019-10-28 2024-06-25 Micron Technology, Inc. Distributed neural network processing on an intelligent image sensor stack
CN110826609B (zh) * 2019-10-29 2023-03-24 华中科技大学 一种基于强化学习的双流特征融合图像识别方法
US12511543B2 (en) * 2019-11-05 2025-12-30 Nvidia Corporation Distributed weight update for backpropagation of a neural network
CN111027671B (zh) * 2019-11-12 2023-07-04 华中科技大学 一种基于模型结构特性的分布式深度学习通信方法和系统
CN110866610A (zh) * 2019-11-20 2020-03-06 苏州浪潮智能科技有限公司 一种深度学习模型分布式运算的方法及装置
WO2021097784A1 (en) * 2019-11-22 2021-05-27 Huawei Technologies Co., Ltd. Method and system for constructing compiler intermediate representations from tensorflow graph
US12547934B2 (en) 2019-12-03 2026-02-10 Visa International Service Association Techniques for providing secure federated machine-learning
CN112949844B (zh) * 2019-12-10 2026-04-14 华为技术有限公司 神经网络计算方法和神经网络计算装置
CN111062473B (zh) * 2019-12-16 2023-05-23 腾讯科技(深圳)有限公司 神经网络模型中的数据计算方法、图像处理方法及装置
GB2604271B (en) * 2019-12-18 2025-04-09 Nvidia Corp Master transform architecture for deep learning
US11250648B2 (en) 2019-12-18 2022-02-15 Micron Technology, Inc. Predictive maintenance of automotive transmission
US11442631B2 (en) * 2019-12-26 2022-09-13 Micron Technology, Inc. Memory operations with consideration for wear leveling
GB2591106B (en) * 2020-01-15 2022-02-23 Graphcore Ltd Control of data transfer between processors
US11521007B2 (en) 2020-02-17 2022-12-06 International Business Machines Corporation Accelerator resource utilization by neural networks
CN110955530A (zh) * 2020-02-25 2020-04-03 深圳鲲云信息科技有限公司 深度学习引擎并行处理数据方法、装置、设备及储存介质
WO2021174370A1 (en) * 2020-03-05 2021-09-10 Huawei Technologies Co., Ltd. Method and system for splitting and bit-width assignment of deep learning models for inference on distributed systems
WO2021183135A1 (en) * 2020-03-13 2021-09-16 Hewlett-Packard Development Company, L.P. Transmitting node instructions
US11681905B2 (en) * 2020-03-23 2023-06-20 Microsoft Technology Licensing, Llc Hardware-assisted gradient optimization using streamed gradients
US12223230B2 (en) * 2020-03-24 2025-02-11 Protolabs, Inc. Methods and systems for generating an instant design for manufacturability of a part at a computing device
US11610128B2 (en) * 2020-03-31 2023-03-21 Amazon Technologies, Inc. Neural network training under memory restraint
GB2593756B (en) * 2020-04-02 2022-03-30 Graphcore Ltd Control of data transfer between processing nodes
CN111558937B (zh) * 2020-04-07 2023-03-24 向仲宇 基于深度学习的机器人运动控制方法
US11355175B2 (en) * 2020-04-09 2022-06-07 Micron Technology, Inc. Deep learning accelerator and random access memory with a camera interface
CN111488211A (zh) * 2020-04-09 2020-08-04 北京嘀嘀无限科技发展有限公司 基于深度学习框架的任务处理方法、装置、设备及介质
CN113556242B (zh) * 2020-04-24 2023-01-17 中科寒武纪科技股份有限公司 一种基于多处理节点来进行节点间通信的方法和设备
US11605228B2 (en) 2020-06-26 2023-03-14 Nxp Usa, Inc. System and method for sensor fusion system having distributed convolutional neural network
CN111770173B (zh) * 2020-06-29 2022-09-06 中国人民解放军国防科技大学 一种基于网络控制器的归约方法及系统
CN111782905B (zh) * 2020-06-29 2024-02-09 中国工商银行股份有限公司 一种数据组包方法和装置、终端设备和可读存储介质
CN111984679B (zh) * 2020-07-02 2021-06-04 中科驭数(北京)科技有限公司 硬件加速数据库的访问方法、装置、主机、系统及介质
EP3944153A1 (en) * 2020-07-24 2022-01-26 GrAl Matter Labs S.A.S. Message based multi-processor system and method of operating the same
US12246736B2 (en) * 2020-07-29 2025-03-11 Micron Technology, Inc. Image sensor for processing sensor data to reduce data traffic to host system
KR20230051491A (ko) * 2020-08-18 2023-04-18 퀄컴 인코포레이티드 채널 상태 정보에 대한 구성 고려사항들
JP7648638B2 (ja) * 2020-08-28 2025-03-18 富士フイルム株式会社 学習装置、学習方法、プログラム、学習済みモデル、及び内視鏡システム
CN114359767B (zh) * 2020-09-30 2025-03-28 阿里巴巴集团控股有限公司 产品数据的处理方法、装置、存储介质和处理器
US12361318B2 (en) * 2020-10-15 2025-07-15 The Boeing Company Computing platform to architect a machine learning pipeline
US12229078B2 (en) 2020-11-02 2025-02-18 T-Head (Shanghai) Semiconductor Co., Ltd. Neural processing unit synchronization systems and methods
CN112395272B (zh) * 2021-01-20 2021-07-13 鹏城实验室 通信算法数据库构建方法、分布式机器装置和存储介质
US12430581B2 (en) 2021-02-15 2025-09-30 Bank Of America Corporation Machine learning training device
CN113157953B (zh) * 2021-02-24 2022-04-29 山东大学 一种跨终端图片传输方法及系统
US11675965B2 (en) * 2021-04-07 2023-06-13 At&T Intellectual Property I, L.P. Converting text to a numerical vector by mapping to a hypercube
US12217160B1 (en) 2021-04-23 2025-02-04 Amazon Technologies, Inc. Allocating blocks of unified memory for integrated circuit executing neural network
US11797270B2 (en) 2021-06-17 2023-10-24 International Business Machines Corporation Single function to perform multiple operations with distinct operation parameter validation
US12079658B2 (en) 2021-06-17 2024-09-03 International Business Machines Corporation Detection of invalid machine-specific data types during data conversion
US11669331B2 (en) 2021-06-17 2023-06-06 International Business Machines Corporation Neural network processing assist instruction
US11675592B2 (en) 2021-06-17 2023-06-13 International Business Machines Corporation Instruction to query for model-dependent information
US12236338B2 (en) 2021-06-17 2025-02-25 International Business Machines Corporation Single function to perform combined matrix multiplication and bias add operations
US11693692B2 (en) 2021-06-17 2023-07-04 International Business Machines Corporation Program event recording storage alteration processing for a neural network accelerator instruction
US11269632B1 (en) 2021-06-17 2022-03-08 International Business Machines Corporation Data conversion to/from selected data type with implied rounding mode
US11734013B2 (en) 2021-06-17 2023-08-22 International Business Machines Corporation Exception summary for invalid values detected during instruction execution
US20230033075A1 (en) * 2021-07-13 2023-02-02 Nvidia Corporation Image annotation using one or more neural networks
DE102022119217A1 (de) 2021-08-04 2023-02-09 Motional Ad Llc Trainieren eines neuronalen Netzwerks unter Verwendung einer Datenmenge mit Labeln mehrerer Granularitäten
US12333828B2 (en) 2021-08-04 2025-06-17 Motional Ad Llc Scalable and realistic camera blockage dataset generation
US12548311B2 (en) * 2021-08-04 2026-02-10 Motional Ad Llc Training a neural network using a data set with labels of multiple granularities
US12579416B1 (en) 2021-09-13 2026-03-17 Amazon Technologies, Inc. Neural network inference circuit with piecewise linear activation circuit
US12294368B2 (en) * 2021-09-24 2025-05-06 Intel Corporation Three-dimensional stacked programmable logic fabric and processor design architecture
US12282790B2 (en) 2021-11-17 2025-04-22 Bank Of America Corporation Cloud-based parallel processing and cognitive learning computing platform
TWI820704B (zh) * 2022-05-12 2023-11-01 財團法人工業技術研究院 聲音訊號的分析方法及裝置、晶片的設計方法及裝置
US20240036834A1 (en) * 2022-08-01 2024-02-01 Bank Of America Corporation Unified Desktop Computing System
GB2621195B (en) * 2022-08-01 2024-09-18 Advanced Risc Mach Ltd Complex rendering using tile buffers
US20260023981A1 (en) * 2022-09-30 2026-01-22 Intel Corporation Accelerate deep learning with inter-iteration scheduling
CN118041927A (zh) * 2022-11-11 2024-05-14 华为技术有限公司 一种通信方法及装置
CN115994567B (zh) * 2022-12-28 2024-03-22 兰州交通大学 一种深度神经网络模型并行计算任务异步调度方法
US12260253B2 (en) 2023-01-23 2025-03-25 SiMa Technologies, Inc. Layout-based data transfer between synchronized, interconnected processing elements for implementing machine learning networks
WO2024158588A1 (en) * 2023-01-23 2024-08-02 SiMa Technologies, Inc. Allocation of computations to synchronized, interconnected processing elements for implementing machine learning networks
CN115809092B (zh) * 2023-02-13 2023-04-28 湖南大学 基于mt3000异构处理器的深度学习计算库实现方法
EP4462230A1 (en) * 2023-05-12 2024-11-13 Deer IT Company Artificial intelligence inferencing machine and method of use thereof
US12506800B2 (en) * 2023-05-16 2025-12-23 Hexagon Technology Center Gmbh Precision geometry service for thin client applications
US12554557B2 (en) * 2023-05-16 2026-02-17 Hexagon Technology Center Gmbh Precision geometry client for thin client applications
US20260032092A1 (en) * 2024-07-24 2026-01-29 Advanced Micro Devices, Inc. Prioritize the earlier step messages for collective algorithms
CN120045337B (zh) * 2025-04-24 2025-11-14 清枫(北京)科技有限公司 基于动态资源管理的多人语音游戏性能优化方法及装置

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5909681A (en) * 1996-03-25 1999-06-01 Torrent Systems, Inc. Computer system and computerized method for partitioning data for parallel processing
US6005583A (en) * 1997-04-30 1999-12-21 Hewlett-Packard Company Precise gradient calculation system and method for a texture mapping system of a computer graphics system
US7219085B2 (en) 2003-12-09 2007-05-15 Microsoft Corporation System and method for accelerating and optimizing the processing of machine learning techniques using a graphics processing unit
US7873812B1 (en) 2004-04-05 2011-01-18 Tibet MIMAR Method and system for efficient matrix multiplication in a SIMD processor architecture
US7747070B2 (en) * 2005-08-31 2010-06-29 Microsoft Corporation Training convolutional neural networks on graphics processing units
US8319781B2 (en) 2007-11-23 2012-11-27 Pme Ip Australia Pty Ltd Multi-user multi-GPU render server apparatus and methods
US8965819B2 (en) * 2010-08-16 2015-02-24 Oracle International Corporation System and method for effective caching using neural networks
US20130294519A1 (en) * 2011-12-22 2013-11-07 Marat Gilmutdinov Complexity scalable frame rate-up conversion
US9390461B1 (en) 2012-05-08 2016-07-12 Apple Inc. Graphics hardware mode controls
CN102841773A (zh) * 2012-06-29 2012-12-26 上海大学 基于FPGA的Roberts边沿检测器
US9137156B2 (en) * 2013-04-24 2015-09-15 Brocade Communications Systems, Inc. Scalable and efficient flow-aware packet distribution
US9924490B2 (en) 2013-10-09 2018-03-20 International Business Machines Corporation Scaling multi-core neurosynaptic networks across chip boundaries
US10031857B2 (en) * 2014-05-27 2018-07-24 Mellanox Technologies, Ltd. Address translation services for direct accessing of local memory over a network fabric
CN104036451B (zh) * 2014-06-20 2018-12-11 深圳市腾讯计算机系统有限公司 基于多图形处理器的模型并行处理方法及装置
US10223333B2 (en) 2014-08-29 2019-03-05 Nvidia Corporation Performing multi-convolution operations in a parallel processing system
CN106297774B (zh) * 2015-05-29 2019-07-09 中国科学院声学研究所 一种神经网络声学模型的分布式并行训练方法及系统
US10229468B2 (en) * 2015-06-03 2019-03-12 Intel Corporation Automated conversion of GPGPU workloads to 3D pipeline workloads
US10796397B2 (en) * 2015-06-12 2020-10-06 Intel Corporation Facilitating dynamic runtime transformation of graphics processing commands for improved graphics performance at computing devices
US10891538B2 (en) 2016-08-11 2021-01-12 Nvidia Corporation Sparse convolutional neural network accelerator
US10997496B2 (en) 2016-08-11 2021-05-04 Nvidia Corporation Sparse convolutional neural network accelerator
US10783437B2 (en) * 2017-03-05 2020-09-22 International Business Machines Corporation Hybrid aggregation for deep learning neural networks
US12154028B2 (en) 2017-05-05 2024-11-26 Intel Corporation Fine-grain compute communication execution for deep learning frameworks via hardware accelerated point-to-point primitives
US11501152B2 (en) 2017-05-05 2022-11-15 Intel Corporation Efficient learning and using of topologies of neural networks in machine learning

Also Published As

Publication number Publication date
US20180322386A1 (en) 2018-11-08
EP4089537B1 (en) 2025-03-12
EP4089537A1 (en) 2022-11-16
CN108805798A (zh) 2018-11-13
US12154028B2 (en) 2024-11-26
EP3399418B1 (en) 2022-12-07
CN108805798B (zh) 2026-02-06
EP3399418A1 (en) 2018-11-07
ES2939271T3 (es) 2023-04-20

Similar Documents

Publication Publication Date Title
PL3399418T3 (pl) Wykonywanie drobnoziarnistej komunikacji obliczeniowej do struktur uczenia głębokiego
PL3783479T3 (pl) Zoptymalizowany sprzęt obliczeniowy do operacji uczenia maszynowego
GB2550864B (en) Well
AU201614763S (en) Watchcase
AU364424S (en) Watchcase
GB201717299D0 (en) Instruction set
AU364423S (en) Watchcase
GB201621776D0 (en) Debugging Method
GB201610168D0 (en) Subsea foundations
GB201502581D0 (en) Subsea pipe-in-pie structures
PL3396623T3 (pl) Uczenie głębokie zależne od kontekstu w czasie rzeczywistym
GB201603639D0 (en) Customisable jewellery
GB201805428D0 (en) Training aid
GB201718744D0 (en) Structural member
GB2555188B (en) Building strap
GB201718746D0 (en) Structural member
GB201517240D0 (en) Rig
GB201721734D0 (en) Inter-processor communication
GB201518781D0 (en) Training aid
GB201718327D0 (en) Training Aid
GB201600298D0 (en) Training aid
GB201720952D0 (en) Swimming aid
EM38279550001S (pl) Sprzęt treningowy
HUP1700144A1 (hu) Úszást elõsegítõ eszköz - uszony
GB201611811D0 (en) Training System