In order to fit today’s neural networks onto hardware, some practitioners utilize some type of weight-pruning method to compress the size of the model and reduce the size and computational cost of running it. Today’s weight-pruning methods can reach up to 50× data compression without major accuracy loss This is on top of ephemeral activation […] The post Can a Sparse-AI Hardware Architecture for Data Centers Work? appeared first on HPCwire .

Source: HPCwire — read the full report at the original publisher.

This is a curated wire item. The Continuum Brief does not republish full third-party articles; this entry links to the original source.