Discussion about this post

Scott Hrastar

Any idea why the Mac with 128 GB of memory, the same as the GX10, would struggle with the larger context window? Is the model implementation somehow different in the MLX environment versus CUDA?
