145 2 weeks ago

Sweep Next-Edit a 1.5B parameter model for next-edit autocomplete, quantized to Q8_0 GGUF format. (Template supports tools)

tools
ollama run maternion/sweep-next-edit-1.5B

Details

2 weeks ago

6a6bbfafb2cf · 1.5GB ·

qwen2
·
1.45B
·
Q8_0
{{- if .Messages }} {{- if or .System .Tools }}<|im_start|>system {{- if .System }} {{ .System }} {{

Readme

Model Description

Sweep Next-Edit predicts your next code edit before you make it. It runs locally on your laptop in under 500ms (with speculative decoding) and outperforms models over 4x its size on next-edit benchmarks. More details here.

Usage

Download ollama and then:

ollama run maternion/sweep-next-edit-1.5B

Model Details

  • Format: GGUF (Q8_0 quantization)
  • Parameters: 1.5B
  • Context Length: 8192 tokens
  • Base Model: Qwen2.5-Coder

Example

The model uses a specific prompt format with file context, recent diffs, and current state to predict the next edit.

Links

License

Apache 2.0