Wideband autonomous SDR analysis engine forked from sdr-visual-suite

Jan Svabenik f27080b8b5 Update web audio player and streamer diagnostics		пре 21 часа
..
build	Update web audio player and streamer diagnostics	пре 21 часа
native	Improve GPU phase precision	пре 22 часа
README.md	docs: split CUDA build paths by platform	пре 5 дана
batch.go	Implement streaming recording redesign	пре 3 дана
batch_runner.go	Implement streaming recording redesign	пре 3 дана
batch_runner_other.go	Checkpoint current working SDR pipeline state	пре 3 дана
batch_runner_test.go	chore: rename module to sdr-wideband-suite	пре 2 дана
batch_runner_windows.go	chore: rename module to sdr-wideband-suite	пре 2 дана
doc.go	docs: add initial CUDA demod kernel source	пре 5 дана
errors.go	Introduce reusable gpudemod batch runner	пре 5 дана
gpudemod.go	chore: rename module to sdr-wideband-suite	пре 2 дана
gpudemod_cufft_test.go	build: wire CUDA demod package through nvcc and MSVC	пре 5 дана
gpudemod_stub.go	Add GPU shift-filter-decimate path for signal extraction	пре 5 дана
gpudemod_test.go	feat: prepare CUDA demod launch boundary	пре 5 дана
gpudemod_windows.go	chore: rename module to sdr-wideband-suite	пре 2 дана
kernels.cu	Improve GPU phase precision	пре 22 часа
validation.go	chore: rename module to sdr-wideband-suite	пре 2 дана
validation_extra.go	chore: rename module to sdr-wideband-suite	пре 2 дана
validation_extra_test.go	feat: add demod validation and GPU mode telemetry	пре 5 дана
validation_runtime.go	Disable GPU validation by default in production	пре 5 дана
validation_test.go	chore: rename module to sdr-wideband-suite	пре 2 дана
windows_bridge.go	feat: parallelize mixed-bandwidth GPU batch demod	пре 4 дана

README.md

gpudemod

Phase 1 CUDA demod scaffolding.

Current state

Standard Go builds use gpudemod_stub.go (!cufft).
cufft builds allocate GPU buffers and cross the CGO/CUDA launch boundary.
If CUDA launch wrappers are not backed by compiled kernels yet, the code falls back to CPU DSP.
The shifted IQ path is already wired so a successful GPU freq-shift result can be copied back and reused immediately.
Build orchestration should now be considered OS-specific; see docs/build-cuda.md.

First real kernel

kernels.cu contains the first candidate implementation:

gpud_freq_shift_kernel

This is not compiled automatically yet in the current environment because the machine currently lacks a CUDA compiler toolchain in PATH (nvcc not found).

Next machine-side step

On a CUDA-capable dev machine with toolchain installed:

Compile kernels.cu into an object file and archive it into a linkable library
- helper script: tools/build-gpudemod-kernel.ps1
On Jan's Windows machine, the working kernel-build path currently relies on nvcc + MSVC cl.exe in PATH
Link gpudemod_kernels.lib into the cufft build
Replace gpud_launch_freq_shift(...) stub body with the real kernel launch
Validate copied-back shifted IQ against dsp.FreqShift
Only then move the next stage (FM discriminator) onto the GPU

Why this is still useful

The runtime/buffer/recorder/fallback structure is already in place, so once kernel compilation is available, real acceleration can be inserted without another architecture rewrite.