You're right: there is no way to specifically target the Neural Engine. You have to go through Core ML, which abstracts away where the model actually executes; the most you can do is restrict the compute units it is allowed to use and let Core ML decide per layer.
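A minimal sketch of that restriction, assuming a compiled model file named "MyModel.mlmodelc" bundled with the app (the file name is just a placeholder). `MLModelConfiguration.computeUnits` is the real Core ML knob; `.cpuAndNeuralEngine` keeps the GPU out of the picture, but Core ML can still fall back to the CPU for unsupported layers:

```swift
import CoreML

// Restrict Core ML to CPU + Neural Engine; use .all to also allow the GPU.
let config = MLModelConfiguration()
config.computeUnits = .cpuAndNeuralEngine

// "MyModel.mlmodelc" is a hypothetical compiled model in the app bundle.
let url = Bundle.main.url(forResource: "MyModel", withExtension: "mlmodelc")!
let model = try MLModel(contentsOf: url, configuration: config)
```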
If you use Metal / GPU compute shaders instead, the work runs exclusively on the GPU. Some inference libraries do exactly this, e.g. TensorFlow Lite / LiteRT when you select the GPU backend.
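For illustration, a sketch with the TensorFlowLiteSwift pod (assuming its Metal delegate subspec is included and a bundled "model.tflite", both placeholders). The `MetalDelegate` routes ops to GPU compute shaders, so nothing here ever reaches the Neural Engine:

```swift
import TensorFlowLite

// "model.tflite" is a hypothetical model file in the app bundle.
let modelPath = Bundle.main.path(forResource: "model", ofType: "tflite")!

// MetalDelegate executes supported ops on the GPU via Metal shaders.
let metalDelegate = MetalDelegate()
let interpreter = try Interpreter(modelPath: modelPath, delegates: [metalDelegate])
try interpreter.allocateTensors()
```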