25 Apr Data parallelism vs. model parallelism – How do they differ in distributed training? Poulomi Chatterjee Data parallelism Model parallelism seemed more apt for DNN models as a bigger number of GPUs was added.