Method and system for partitioning a deep convolutional network for execution on computationally constrained devices
Abstract:
There is a growing need to run inferencing on fog devices in order to reduce upstream network traffic. However, because such devices are computationally constrained, executing complex deep inference models on them has proven difficult. A system and method are provided for partitioning a deep convolutional neural network (DCNN) for execution on computationally constrained devices at the network edge. The system uses depth-wise input partitioning of the convolution operations in the DCNN. The partitioning parameters are determined from the input filter depth and the number of filters using an inference speedup method. The system uses a master-slave network to partition the input. Depth-wise partitioning of the input speeds up inference of the convolution operations by reducing pixel overlaps.
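A minimal sketch (NumPy) of the depth-wise input partitioning idea, under assumed shapes and function names that are not taken from the patent text: the master splits the input along the channel (depth) axis, each slave device convolves its channel slice with the matching slice of every filter, and the master sums the partial outputs to recover the full convolution result.

```python
import numpy as np

def conv2d(x, w):
    """Valid convolution: x is (C, H, W), w is (K, C, R, S) -> (K, H-R+1, W-S+1)."""
    C, H, W = x.shape
    K, _, R, S = w.shape
    out = np.zeros((K, H - R + 1, W - S + 1))
    for k in range(K):
        for i in range(H - R + 1):
            for j in range(W - S + 1):
                out[k, i, j] = np.sum(x[:, i:i + R, j:j + S] * w[k])
    return out

def depthwise_partitioned_conv2d(x, w, num_slaves):
    """Split input and filters along the depth (channel) axis, compute each
    slice independently (one per slave device), then sum the partial outputs."""
    channel_chunks = np.array_split(np.arange(x.shape[0]), num_slaves)
    partials = [conv2d(x[idx], w[:, idx]) for idx in channel_chunks]  # per-slave work
    return np.sum(partials, axis=0)  # master aggregates the partial sums

# Sanity check: the partitioned result matches the monolithic convolution.
x = np.random.rand(16, 32, 32)      # input with 16 channels
w = np.random.rand(8, 16, 3, 3)     # 8 filters over the full depth
assert np.allclose(conv2d(x, w), depthwise_partitioned_conv2d(x, w, num_slaves=4))
```

Because the channel dimension is partitioned rather than the spatial dimensions, no pixel rows or columns need to be duplicated across devices, which is the overlap reduction the abstract refers to; how the patent chooses the number of partitions from the filter depth and filter count is not detailed here.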