Published onApril 11, 2024Mixture of Depth is VibeMLGPTlanguage-modelmixture-of-depthpytorchbitnetMoDThought and code about mixture of depth
Published onMarch 23, 2024Expriments with Bitnet 1.5 (~ngmi~)MLGPTlanguage-modeltinytorchpytorchbitnetThe blog explores implementing Bitnet 1.5, highlighting its efficiency gains and challenges in quantized neural network training.