测试代码:importtimeimporttorchfromloguruimportloggerdevice='cuda'batch_size=1000image_channel=3image_size=224count=int(100000/batch_size)logger.debug(f'readytoinputdata')input_data=torch.randn(batch_size,image_channelsize,image_image_size)total_bytes=input_data.numel()*input_data.element_size()print('total_MB',total_bytes/1024/1024)logger.debug(f'开始计数')started_at=time.time()foriinrange(count):input_data_with_cuda=input_data.to(device)ended_at=time.time()print('paytime',ended_at-started_at)测试不同平台下的运行速度,因为这个肯定和内存速度,显存带宽,显存速度等等等等都是相关的测试平台1:intelXeonE5-2690CPU+tesla-m60GPUCPU:IntelXeonE5-2690RAM:DDR42400MHzGPU:NVIDIATeslaM60运行结果2023-03-1507:18:28.542|调试|__main__:
