MaxView

Log Summary

XPK Start: Sat Apr 18 20:15:19 UTC 2026
PyTorch was not found. Models won't be available and only tokenizers, configuration and file/data utilities can be used.
2026-04-18 20:15:43.041994: E external/local_xla/xla/stream_executor/cuda/cuda_platform.cc:51] failed call to cuInit: INTERNAL: CUDA error: Failed call to cuInit: UNKNOWN ERROR (303)
I0418 20:15:43.573025 133547157423936 max_utils.py:273] Attempting to initialize the jax distributed system...
I0418 20:15:52.613421 133547157423936 distributed.py:149] Starting JAX distributed service on [::]:8482
I0418 20:15:52.615740 133547157423936 distributed.py:172] Connecting to JAX distributed service on mt-10-shardy-true-zmpp7-slice-job-0-0.mt-10-shardy-true-zmpp7:8482
I0418 20:15:54.496875 133547157423936 max_utils.py:284] Jax distributed system initialized!
I0418 20:16:00.628384 133547157423936 max_utils.py:800] System Information: Jax Version: 0.9.2
I0418 20:16:00.628483 133547157423936 max_utils.py:801] System Information: Jaxlib Version: 0.9.2
I0418 20:16:00.628522 133547157423936 max_utils.py:802] System Information: Jax Backend: PJRT C API
TFRT TPU v6 lite
Built on Mar 4 2026 11:32:08 (1772652728) cl/878335365
I0418 20:16:00.628558 133547157423936 train_utils.py:348] WARNING: Sequence packing is essentially ignored for synthetic data. Please use a real dataset to use sequence packing.
Traceback (most recent call last):
  File "<frozen runpy>", line 198, in _run_module_as_main
  File "<frozen runpy>", line 88, in _run_code
  File "/deps/src/maxtext/trainers/pre_train/train.py", line 744, in <module>
    app.run(main)
  File "/usr/local/lib/python3.12/site-packages/absl/app.py", line 367, in run
    _run_main(main, args)
  File "/usr/local/lib/python3.12/site-packages/absl/app.py", line 312, in _run_main
    sys.exit(main(argv))
             ^^^^^^^^^^
  File "/deps/src/maxtext/trainers/pre_train/train.py", line 740, in main
    train_func()
  File "/deps/src/maxtext/trainers/pre_train/train.py", line 730, in train_func
    run(config, recorder, diagnostic_config)
  File "/deps/src/maxtext/trainers/pre_train/train.py", line 709, in run
    train_loop(config, recorder)
  File "/deps/src/maxtext/trainers/pre_train/train.py", line 536, in train_loop
    ) = train_utils.setup_train_loop(config, recorder)
        ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/deps/src/maxtext/utils/train_utils.py", line 217, in setup_train_loop
    raise NotImplementedError("Pure NNX support has not been implemented yet.")
NotImplementedError: Pure NNX support has not been implemented yet.
XPK End: Sat Apr 18 20:16:09 UTC 2026
EXIT_CODE=1