infer

suspend fun infer(module: Module<FP32, Float>, input: YoloInput, executionContext: ExecutionContext, reportProgress: suspend (current: Int, total: Int, message: String?) -> Unit = { _, _, _ -> }): List<Detection>(source)

Full inference: forward pass + decode + NMS. Expects the input tensor to be preprocessed to the configured model size (see YoloPreprocess).