Amazon engineers discuss the migration of 80 percent of Alexa’s workload to Inferentia ASICs in this three-minute clip. On Thursday, an Amazon AWS blogpost announced that the company has moved most of the cloud processing for its Alexa personal assistant off of Nvidia GPUs and onto its own Inferentia Application Specific Integrated Circuit (ASIC). Amazon dev Sebastien […]