BitNet

lee/BitNet

mirror of https://github.com/microsoft/BitNet.git synced 2026-07-09 11:41:52 +00:00

Author	SHA1	Message	Date
Yan Xia	01eb415772	Update demo link in README.md	2026-03-10 15:49:46 +08:00
tsong-ms	0fdaa16ae3	Merge pull request #421 from microsoft/fix/unsafe-deserialization-gpu-pipeline fix: add weights_only=True to torch.load in GPU inference pipeline	2026-03-09 20:20:21 +08:00
Ubuntu	eb60fc39cb	fix: add weights_only=True to torch.load in GPU inference pipeline Mitigate unsafe deserialization vulnerability (CWE-502) in the GPU inference pipeline. torch.load without weights_only=True allows arbitrary code execution via malicious pickle payloads in checkpoint files. Affected locations: - gpu/convert_checkpoint.py:37 (checkpoint conversion utility) - gpu/generate.py:67,69 (fp16 and int2 checkpoint loading) The utils/ scripts already applied this parameter correctly; this commit brings the GPU pipeline to the same safety standard. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-09 12:09:19 +00:00
tsong-ms	8fd3412fbc	Merge pull request #406 from XsquirrelC/main [fix] convert pt to gguf	2026-02-03 13:30:42 +08:00
XSquirrelC	3987a503fd	[fix] convert pt to gguf	2026-02-03 05:24:24 +00:00
tsong-ms	ade47a535c	Merge pull request #380 from XsquirrelC/main [fix] change README link	2026-01-27 13:48:51 +08:00
XSquirrelC	77e136fb08	[fix] change README link	2026-01-27 03:39:11 +00:00
tsong-ms	cc3c9e4c80	Merge pull request #379 from XsquirrelC/main BitNet CPU Inference Optimization	2026-01-27 11:24:02 +08:00
XSquirrelC	1876a3e889	[merge] submodule llama.cpp	2026-01-27 03:09:32 +00:00
XSquirrelC	e8c8107dcf	[modify] some test picture and add power test script	2026-01-25 06:51:33 +00:00
XSquirrelC	7b2c52b9d5	[modify] some utils test script	2026-01-24 08:40:36 +00:00
XSquirrelC	2fed9af730	[fix] setup_env.py bug COMPILER_EXTRA_ARGS	2026-01-22 11:11:14 +00:00
deva100	7e6f0e14f1	[modify] update README; [feat] some test script in utils	2026-01-22 06:33:03 +00:00
deva100	7ea1f2601f	[modify] fine_tuning_result.png	2026-01-20 07:40:37 +00:00
deva100	b68802ff17	[fix] embed-quant q6_k; [modify] README update	2026-01-20 04:56:50 +00:00
deva100	35b1c28585	[fix] correct README	2026-01-15 03:44:50 +00:00
deva100	53ffe5e92b	[chore] update README	2026-01-15 03:37:16 +00:00
deva100	43da5e5f76	[fix] make demo_benchmark.sh more fast	2025-12-23 07:23:14 +00:00
deva100	41cc304868	[chore] add some automation bash script for BitNet Tech Report	2025-12-23 06:48:33 +00:00
deva100	112f853414	[feat] I2S kernels for weight & activation parallel on Intel & ARM machine; [feat] I2S GEMV & GEMM(llama.cpp); [feat] quantize activation & dequantize embedding(llama.cpp); [fix] compile bug: cannot define __ARM_FEATURE_DOTPROD(llama.cpp)	2025-11-19 07:35:05 +00:00
Junhui He	404980eeca	Merge pull request #290 from microsoft/gpu-readme-dev Update readme for gpu kernels	2025-06-03 14:14:20 +08:00
Junhui He	088e607b25	Merge pull request #280 from microsoft/fix-convert-dev Enable conversion from .safetensors checkpoints to gguf files	2025-06-03 13:59:47 +08:00
ZeonfaiHo	c1e9a9a237	Update readme for gpu kernels	2025-05-31 21:41:41 +08:00
junhuihe	43e9b2d4a0	Enable conversion from .safetensors checkpoints to gguf files	2025-05-23 16:19:29 +08:00
tsong-ms	69a20459f5	Merge pull request #268 from younesbelkada/add-falcon-e-final Add falcon-e support	2025-05-21 16:28:05 +08:00
younesbelkada	5c12850ed9	Merge branch 'add-falcon-e-final' of github.com:younesbelkada/BitNet into add-falcon-e-final	2025-05-21 11:53:40 +04:00
younesbelkada	765741d80b	update submodule	2025-05-21 11:52:30 +04:00
Younes Belkada	f314d18863	feat: add also base models	2025-05-21 04:11:07 +04:00
Younes Belkada	9e9575665e	Merge branch 'microsoft:main' into add-falcon-e-final	2025-05-20 17:05:11 +04:00
tsong-ms	70285e0154	Merge pull request #276 from microsoft/readme-dev refine readme for gpu kernel	2025-05-20 16:14:18 +08:00
tsong-ms	6197e9feb0	refine readme for gpu kernel	2025-05-20 12:29:56 +08:00
Junhui He	6c2c08f67e	Merge pull request #266 from microsoft/gpu-dev	2025-05-19 12:46:20 +08:00
Junhui He	154c92b704	Init gpu branch	2025-05-19 04:34:00 +00:00
Younes Belkada	0015ad5201	Update README.md	2025-05-15 18:49:28 +04:00
younesbelkada	de371b708d	add falcon-e support	2025-05-14 17:07:05 +04:00
Benjamin Wegener	c9e752c9d7	Fix build error with GCC by forcing Clang compiler in CMake on android/aarch64 (#242 ) GCC does not recognize Clang-specific warning flags like -Wunreachable-code-break and -Wunreachable-code-return, which are passed by upstream submodules (e.g., ggml). This patch forces CMake to use Clang via command-line arguments, avoiding the need to patch nested submodules. This resolves compiler errors without modifying submodule source code.	2025-05-08 16:22:45 +08:00
Benjamin Wegener	1792346223	Add run_inference_server.py for Running llama.cpp Built-in Server (#204 ) * Update CMakeLists.txt I added a CMake option to compile the Llama.cpp server. This update allows us to easily build and deploy the server using BitNet * Create run_inference_server.py same as run_inference, but for use with llama.cpp's built in server, for some extra comfort In particular: - The build directory is determined based on whether the system is running on Windows or not. - A list of arguments (`--model`, `-m` etc.) is created. - The main argument list is parsed and passed to the `subprocess.run()` method to execute the system command.	2025-05-08 16:22:12 +08:00
Junhui He	c17d1c5d77	Merge pull request #212 from microsoft/arch-name-dev Fix model architecture name	2025-04-23 11:20:15 +08:00
junhuihe	488dc1e876	Fix model architecture name	2025-04-22 17:28:59 +08:00
tsong-ms	fd9f1d6e46	Merge pull request #176 from microsoft/readme-dev refine readme	2025-04-16 12:35:53 +08:00
tsong	874e6bd5fb	refine readme	2025-04-16 04:34:59 +00:00
tsong-ms	034b34cb70	Merge pull request #175 from microsoft/readme-dev add third-party demo	2025-04-15 22:42:12 +08:00
tsong	71fdd9472f	add third-party demo	2025-04-15 14:36:05 +00:00
Yan Xia	1c77bd8966	Update README.md	2025-04-15 17:11:23 +08:00
Yan Xia	8f75f99c72	Update README.md (#172 ) add two FAQs for windows build requestions.	2025-04-15 17:07:20 +08:00
Yan Xia	0e7dadba1e	Update README.md	2025-04-15 15:24:42 +08:00
Yan Xia	fd3f355a0b	update readme and setup script to support official BitNet b1.58 model (#171 ) * update readme and setup file for new model. * update model file name --------- Co-authored-by: Yan Xia <yanxia@microsoft.com>	2025-04-15 14:53:56 +08:00
tsong-ms	fa854cf8f8	Merge pull request #167 from potassiummmm/bitnet-25 add support for bitnet2b_2501 model	2025-04-15 14:27:46 +08:00
potassiummmm	09f91066d6	add conversion logic for new model	2025-03-12 18:34:05 +08:00
potassiummmm	4f2e41a514	add support for bitnet2b_2501 model	2025-03-12 18:16:45 +08:00

1 2

96 Commits