.. _ch_gpu_schedules: Operator Optimizations on GPUs ============================== The chapter talks about the operator optimization on Nvidia GPUs. Basically, we follow the some logic of last chapter, starting from introducing the architecture of GPUs, followed by the optimization of some typical operators. .. toctree:: :maxdepth: 2 :numbered: arch vector_add broadcast_add matmul conv depthwise_conv pooling batch_norm