Kelsidavis-WoWee

mirror of https://github.com/Kelsidavis/WoWee.git synced 2026-05-08 01:53:52 +00:00

Author	SHA1	Message	Date
Kelsi	61cb2df400	Add ABI v2 external-handle plumbing for FSR3 bridge dispatch	2026-03-09 01:31:01 -07:00
Kelsi	a1c4244a27	Implement clean Path-B FSR3 wrapper ABI for framegen	2026-03-09 00:08:22 -07:00
Kelsi	93850ac6dc	Add Path A/B/C FSR3 runtime detection with clear FG fallback status	2026-03-09 00:01:45 -07:00
Kelsi	5ad4b9be2d	Add FSR3 frame generation runtime stats to performance HUD	2026-03-08 23:35:39 -07:00
Kelsi	aa43aa6fc8	Bridge FSR3 Vulkan framegen dispatch and route sharpen to interpolated output	2026-03-08 23:20:50 -07:00
Kelsi	538a1db866	Fix FSR3 runtime wrapper for local SDK API and real Vulkan resource dispatch	2026-03-08 23:13:08 -07:00
Kelsi	f1099f5940	Add FSR3 runtime library probing and readiness status	2026-03-08 23:03:45 -07:00
Kelsi	e1c93c47be	Stage FSR3 framegen dispatch hook in frame pipeline	2026-03-08 22:57:35 -07:00
Kelsi	bdfec103ac	Add persisted AMD FSR3 framegen runtime toggle plumbing	2026-03-08 22:53:21 -07:00
Kelsi	a49decd9a6	Add AMD FSR3 framegen interface probe and CI validation	2026-03-08 22:47:46 -07:00
Kelsi	94ad89c764	Set native-focused FSR defaults and reorder quality UI	2026-03-08 21:34:31 -07:00
Kelsi	e6d373df3e	Remove FSR performance preset and add native quality mode	2026-03-08 21:17:04 -07:00
Kelsi	eccbfb6a5f	Tune FSR2 defaults and simplify jitter controls	2026-03-08 21:08:17 -07:00
Kelsi	2e71c768db	Add live FSR2 motion/jitter tuning controls and HUD readout	2026-03-08 20:56:22 -07:00
Kelsi	a12126cc7e	Persist upscaling mode and refine FSR2 jitter behavior	2026-03-08 20:22:11 -07:00
Kelsi	51a8cf565f	Integrate AMD FSR2 backend and document SDK bootstrap	2026-03-08 19:56:52 -07:00
Kelsi	a24ff375fb	Add AMD FSR2 SDK detection and backend integration scaffolding	2026-03-08 19:33:07 -07:00
Kelsi	e2a2316038	Stabilize FSR2 path and refine temporal pipeline groundwork	2026-03-08 18:52:04 -07:00
Kelsi	e94eb7f2d1	FSR2 temporal upscaling fixes: unjittered reprojection, sharpen Y-flip, MSAA guard, descriptor double-buffering Some checks are pending Build / Build (arm64) (push) Waiting to run Details Build / Build (x86-64) (push) Waiting to run Details Build / Build (macOS arm64) (push) Waiting to run Details Build / Build (windows-arm64) (push) Waiting to run Details Build / Build (windows-x86-64) (push) Waiting to run Details Security / CodeQL (C/C++) (push) Waiting to run Details Security / Semgrep (push) Waiting to run Details Security / Sanitizer Build (ASan/UBSan) (push) Waiting to run Details - Motion vectors: single unjittered reprojection matrix (80 bytes) instead of two jittered matrices (160 bytes), eliminating numerical instability from jitter amplification through large world coordinates - Sharpen pass: fix Y-flip for correct UV sampling, double-buffer descriptor sets to avoid race with in-flight command buffers - MSAA: auto-disable when FSR2 enabled, grey out AA setting in UI - Accumulation: variance-based neighborhood clamping in YCoCg space, correct history layout transitions - Frame index: wrap at 256 for stable Halton sequence	2026-03-08 01:22:15 -08:00
Kelsi	52317d1edd	Implement FSR 2.2 temporal upscaling Full FSR 2.2 pipeline with depth-based motion vector reprojection, temporal accumulation with YCoCg neighborhood clamping, and RCAS contrast-adaptive sharpening. Architecture (designed for FSR 3.x frame generation readiness): - Camera: Halton(2,3) sub-pixel jitter with unjittered projection stored separately for motion vector computation - Motion vectors: compute shader reconstructs world position from depth + inverse VP, reprojects with previous frame's VP - Temporal accumulation: compute shader blends 5-10% current frame with 90-95% clamped history, adaptive blend for disocclusion - History: ping-pong R16G16B16A16 buffers at display resolution - Sharpening: RCAS fragment pass with contrast-adaptive weights Integration: - FSR2 replaces both FSR1 and MSAA when enabled - Scene renders to internal resolution framebuffer (no MSAA) - Compute passes run between scene and swapchain render passes - Camera cut detection resets history on teleport - Quality presets shared with FSR1 (0.50-0.77 scale factors) - UI: "Upscaling" combo with Off/FSR 1.0/FSR 2.2 options	2026-03-07 23:13:01 -08:00
Kelsi	0ffeabd4ed	Revert "Further reduce tile streaming aggressiveness" This reverts commit `f681a8b361`.	2026-03-07 23:02:25 -08:00
Kelsi	f681a8b361	Further reduce tile streaming aggressiveness - Load radius: 4→3 (normal), 6→5 (taxi) - Terrain chunks per step: 16→8 - M2 models per step: 6→2 (removed idle boost) - WMO models per step: 2→1 (removed idle boost) - WMO doodads per step: 4→2 - All budgets now constant (no idle-vs-busy branching)	2026-03-07 22:55:02 -08:00
Kelsi	ac3c90dd75	Fix M2 animated instance flashing (deer/bird/critter pop-in) Root cause: bonesDirty was a single bool shared across both double-buffered frame indices. When bones were copied to frame 0's SSBO and bonesDirty cleared, frame 1's newly-allocated SSBO would contain garbage/zeros and never get populated — causing animated M2 instances to flash invisible on alternating frames. Fix: Make bonesDirty per-frame-index (bool[2]) so each buffer independently tracks whether it needs bone data uploaded. When bones are recomputed, both indices are marked dirty. When uploaded during render, only the current frame index is cleared. New buffer allocations in prepareRender force their frame index dirty.	2026-03-07 22:47:07 -08:00
Kelsi	6cf08fbaa6	Throttle proactive tile streaming to reduce post-load hitching Add 2-second cooldown timer before re-checking for unloaded tiles when workers are idle, preventing excessive streamTiles() calls that caused frame hitches right after world load.	2026-03-07 22:40:07 -08:00
Kelsi	4cb03c38fe	Parallel animation updates, thread-safe collision, M2 pop-in fix, shadow stabilization - Overlap M2 and character animation updates via std::async (~2-5ms saved) - Thread-local collision scratch buffers for concurrent floor queries - Parallel terrain/WMO/M2 floor queries in camera controller - Seed new M2 instance bones from existing siblings to eliminate pop-in flash - Fix shadow flicker: snap center along stable light axes instead of in view space - Increase shadow distance default to 300 units (slider max 500)	2026-03-07 22:29:06 -08:00
Kelsi	a4966e486f	Fix WMO wall collision, normal mapping, POM backfill, and M2/WMO rendering performance - Fix MOPY flag check (0x08 not 0x01) for proper wall collision detection - Cap MAX_PUSH to PLAYER_RADIUS to prevent gradual clip-through - Fix WMO doodad quaternion component ordering (X/Y swap) - Linear normal map strength blend in shader for smooth slider control - Enable shadow sampling for interior WMO groups (covered outdoor areas) - Backfill deferred normal/height maps after streaming with descriptor rebind - M2: prepareRender only iterates animated instances, bone dirty flag - M2: remove worker thread VMA allocation, skip unready bone instances - WMO: persistent visibility vectors, sequential culling - Add FSR EASU/RCAS shaders	2026-03-07 22:03:28 -08:00
Kelsi	02cf0e4df3	Background normal map generation, queue-draining load screen warmup - Normal map CPU work (luminance→blur→Sobel) moved to background threads, main thread only does GPU upload (~1-2ms vs 15-22ms per texture) - Load screen warmup now waits until ALL spawn/equipment/gameobject queues are drained before transitioning (prevents naked character, NPC pop-in) - Exit condition: min 2s + 5 consecutive empty iterations, hard cap 15s - Equipment queue processes 8 items per warmup iteration instead of 1 - Added LoadingScreen::renderOverlay() for future world-behind-loading use	2026-03-07 18:40:24 -08:00
Kelsi	63efac9fa6	Unlimited creature model uploads during load screen, remove duplicate code Loading screen now calls processCreatureSpawnQueue(unlimited=true) which removes the 1-upload-per-frame cap and 2ms time budget, allowing all pending creature models to upload to GPU in bulk. Also increases concurrent async background loads from 4 to 16 during load screen. Replaces 40-line inline duplicate of processAsyncCreatureResults with the shared function.	2026-03-07 17:31:47 -08:00
Kelsi	24f2ec75ec	Defer normal map generation to reduce GPU model upload stalls by ~50% Some checks are pending Build / Build (arm64) (push) Waiting to run Details Build / Build (x86-64) (push) Waiting to run Details Build / Build (macOS arm64) (push) Waiting to run Details Build / Build (windows-arm64) (push) Waiting to run Details Build / Build (windows-x86-64) (push) Waiting to run Details Security / CodeQL (C/C++) (push) Waiting to run Details Security / Semgrep (push) Waiting to run Details Security / Sanitizer Build (ASan/UBSan) (push) Waiting to run Details Each loadTexture call was generating a normal/height map inline (3 full-image passes: luminance + blur + Sobel). For models with 15-20 textures this added 30-40ms to the 70ms model upload. Now deferred to a per-frame budget (2/frame in-game, 10/frame during load screen). Models render without POM until their normal maps are ready.	2026-03-07 17:16:38 -08:00
Kelsi	faca22ac5f	Async humanoid NPC texture pipeline to eliminate 30-150ms main-thread stalls Move all DBC lookups (CharSections, ItemDisplayInfo), texture path resolution, and BLP decoding for humanoid NPCs to background threads. Only GPU texture uploads remain on the main thread via pre-decoded BLP cache.	2026-03-07 16:54:58 -08:00
Kelsi	7ac990cff4	Background BLP texture pre-decoding + deferred WMO normal maps (12x streaming perf) Move CPU-heavy BLP texture decoding from main thread to background worker threads for all hot paths: terrain M2 models, WMO doodad M2s, WMO textures, creature models, and gameobject WMOs. Each renderer (M2, WMO, Character) now accepts a pre-decoded BLP cache that loadTexture() checks before falling back to synchronous decode. Defer WMO normal/height map generation (3 per-pixel passes: luminance, box blur, Sobel) during terrain streaming finalization — this was the dominant remaining bottleneck after BLP pre-decoding. Terrain streaming stalls: 1576ms → 124ms worst case.	2026-03-07 15:46:56 -08:00
Kelsi	0313bd8692	Performance: ring buffer UBOs, batched load screen uploads, background world preloader - Replace per-frame VMA alloc/free of material UBOs with a ring buffer in CharacterRenderer (~500 allocations/frame eliminated) - Batch all ready terrain tiles into a single GPU upload during load screen (processAllReadyTiles instead of one-at-a-time with individual fence waits) - Lift per-frame creature/GO spawn budgets during load screen warmup phase - Add background world preloader: saves last world position to disk, pre-warms AssetManager file cache with ADT files starting at app init (login screen) so terrain workers get instant cache hits when Enter World is clicked - Distance-filter expensive collision guard to 8-unit melee range - Merge 3 CharacterRenderer update loops into single pass - Time-budget instrumentation for slow update stages (>3ms threshold) - Count-based async creature model upload budget (max 3/frame in-game) - 1-per-frame game object spawn + per-doodad time budget for transport loading - Use deque for creature spawn queue to avoid O(n) front-erase	2026-03-07 13:44:09 -08:00
Kelsi	25bb63c50a	Faster terrain/model loading: more workers, batched finalization, skip redundant I/O - Worker threads: use (cores - 1-2) instead of cores/2, minimum 4 - Outer upload batch in processReadyTiles: ALL model/texture uploads per frame share a single command buffer submission + fence wait - Upload multiple models per finalization step: 8 M2s, 4 WMOs, 16 doodads per call instead of 1 each (all within same GPU batch) - Terrain chunks: 64 per step instead of 16 - Skip redundant M2 file I/O: thread-safe uploadedM2Ids_ set lets background workers skip re-reading+parsing models already on GPU - processAllReadyTiles (loading screen) and processOneReadyTile also wrapped in outer upload batches	2026-03-07 12:32:39 -08:00
Kelsi	16b4336700	Batch GPU uploads to eliminate per-upload fence waits (stutter fix) Every uploadBuffer/VkTexture::upload called immediateSubmit which did a separate vkQueueSubmit + vkWaitForFences. Loading a single creature model with textures caused 4-8+ fence waits; terrain chunks caused 80+ per batch. Added beginUploadBatch/endUploadBatch to VkContext: records all upload commands into a single command buffer, submits once with one fence wait. Staging buffers are deferred for cleanup after the batch completes. Wrapped in batch mode: - CharacterRenderer::loadModel (creature VB/IB + textures) - M2Renderer::loadModel (doodad VB/IB + textures) - TerrainRenderer::loadTerrain/loadTerrainIncremental (chunk geometry + textures) - TerrainRenderer::uploadPreloadedTextures - WMORenderer::loadModel (group geometry + textures)	2026-03-07 12:19:59 -08:00
Kelsi	884b72bc1c	Incremental terrain upload + M2 instance dedup hash for city stutter Terrain finalization was uploading all 256 chunks (GPU fence waits) in one atomic advanceFinalization call that couldn't be interrupted by the 5ms time budget. Now split into incremental batches of 16 chunks per call, allowing the time budget to yield between batches. M2 instance creation had O(N) dedup scans iterating ALL instances to check for duplicates. In cities with 5000+ doodads, this caused O(N²) total work during tile loading. Replaced with hash-based DedupKey map for O(1) lookups. Changes: - TerrainRenderer::loadTerrainIncremental: uploads N chunks per call - FinalizingTile tracks terrainChunkNext for cross-frame progress - TERRAIN phase yields after preload and after each chunk batch - M2Renderer::DedupKey hash map replaces linear scan in createInstance and createInstanceWithMatrix - Dedup map maintained through rebuildSpatialIndex and clear paths	2026-03-07 11:59:19 -08:00
Kelsi	f9410cc4bd	Fix city NPC stuttering: async model loading, CharSections cache, frame budgets - Async creature model loading: M2 file I/O and parsing on background threads via std::async, GPU upload on main thread when ready (MAX_ASYNC_CREATURE_LOADS=4) - CharSections.dbc lookup cache: O(1) hash lookup instead of O(N) full DBC scan per humanoid NPC spawn (was scanning thousands of records twice per spawn) - Frame time budget: 4ms cap on creature spawn processing per frame - Wolf/worg model name check cached per modelId (was doing tolower+find per hostile creature per frame) - Weapon attach throttle: max 2 per 1s tick (was attempting all unweaponized NPCs) - Separate texture application tracking (displayIdTexturesApplied_) so async-loaded models still get skin/equipment textures applied correctly	2026-03-07 11:44:14 -08:00
Kelsi	a24fe4cc45	Ironforge Great Forge lava, magma water rendering, LavaSteam particle effects - Add magma/slime rendering path to water shader (fbm noise, crust/molten/core coloring) - Fix WMO liquid height filter rejecting high-altitude zones like Ironforge (Z>300) - Allow interior WMO magma/slime MLIQ groups to load (skip only water/ocean) - Mark LAVASTEAM.m2 as spell effect for proper additive blend, hide emission mesh - Add isLavaModel flag for M2 ForgeLava/LavaPots UV scroll fallback - Add isLava material detection in WMO renderer for lava texture UV animation - Fix WMO material UBO colors for magma (was blue, now orange-red)	2026-03-07 00:48:04 -08:00
Kelsi	2c5b7cd368	WMO glass transparency for instances, disable interior shadows - Add case-insensitive "glass" detection for WMO window materials - Make instance (WMO-only) glass highly transparent (12-35% alpha) so underwater scenes are visible through Deeprun Tram windows - Keep normal world windows at existing opacity (40-95% alpha) - Disable shadow mapping for interior WMO groups to fix dark indoor areas like Ironforge	2026-03-06 23:48:35 -08:00
Kelsi	f4c115ade9	Fix Deeprun Tram: visual movement, direction, and player riding - Fix NULL renderer pointers by moving TransportManager connection after initializeRenderers for WMO-only maps - Fix tram direction by negating DBC TransportAnimation X/Y local offsets before serverToCanonical conversion - Implement client-side M2 transport boarding via proximity detection (server doesn't send transport attachment for trams) - Use position-delta approach: player keeps normal movement while transport's frame-to-frame motion is applied on top - Prevent server movement packets from clearing client-side M2 transport state (isClientM2Transport guard) - Fix getPlayerWorldPosition for M2 transports: simple canonical addition instead of render-space matrix multiplication	2026-03-06 23:01:11 -08:00
Kelsi	e001aaa2b6	Suppress movement after teleport/portal, add shadow distance slider - Add movementSuppressTimer to camera controller that forces all movement keys to read as false, preventing held W key from carrying through loading screens (fixes always-running-forward after instance portals) - Increase shadow frustum default from 60 to 72 units (+20%) - Make shadow distance configurable via setShadowDistance() (40-200 range) - Add shadow distance slider in Video settings tab (persisted to config)	2026-03-06 20:38:58 -08:00
Kelsi	8014dde29b	Improve WMO wall collision, unstuck, interior zoom, and chat focus - Stronger wall collision push (0.35/0.15) and swept push (0.45/0.25) for interior/exterior WMOs to reduce clipping through tunnel walls - Use all triangles (not just pre-classified walls) for collision checks - Allow invisible collidable triangles (MOPY 0x01 without 0x20) to block - Pass insideWMO flag to all collision callers, match swim sweep to ground - Widen swept hit detection radius from 0.15 to 0.25 - Restrict camera zoom to 12 units inside WMO interiors - Fix /unstuck launching player above WMOs: remove +20 fallback, use gravity when no floor found - Slash and Enter keys always focus chat unless already typing	2026-03-06 20:00:27 -08:00
Kelsi	5a227c0376	Add water refraction toggle with per-frame scene history Fix VK_ERROR_DEVICE_LOST crash by allocating per-frame scene history images (color + depth) instead of a single shared image that raced between frames in flight. Water refraction can now be toggled via Settings > Video > Water Refraction. Without refraction: richer blue base colors, animated caustic shimmer, and normal-based color shifts give the water visible life. With refraction: clean screen-space refraction with Beer-Lambert absorption. Disabling clears scene history to black for immediate fallback.	2026-03-06 19:15:34 -08:00
Kelsi	585d0bf50e	Instance portal glow, spin, and transparent additive rendering	2026-03-06 18:03:08 -08:00
Kelsi	4cbceced67	Fix invisible walls from WMO doodad M2 collision and MOPY filtering WMO interior doodads (gears, decorations) were blocking player movement via M2 collision. Skip collision for all WMO doodad M2 instances since the WMO itself handles wall collision. Also filter WMO wall collision using MOPY per-triangle flags: only rendered+collidable triangles block the player, skipping invisible collision hulls. Revert tram portal extended range (no longer needed with collision fix).	2026-03-06 12:26:17 -08:00
Kelsi	ee4e6a31ce	Filter WMO decorative geometry from collision, fix tram portal trigger IDs Parse MOPY per-triangle flags in WMO groups and exclude detail/decorative triangles (flag 0x04) from collision detection. This prevents invisible walls from objects like gears and railings in WMO interiors. Add WotLK area trigger IDs 2173/2175 to extended-range tram triggers.	2026-03-06 10:37:32 -08:00
Kelsi	16d44c5bb3	Fix interior WMO floor gaps, water exit stair clipping, NPC equipment, portal animation - Fix Stormwind barracks floor: interior WMO groups named "facade" were incorrectly marked as LOD shells and hidden when close. Add !isIndoor guard to all LOD detection conditions so interior groups always render. - Fix water exit stair clipping: anchor lastGroundZ to current position on swim exit, set grounded=true for full step-up budget, add upward velocity boost to clear stair lip geometry. - Re-enable NPC humanoid equipment geosets (kEnableNpcHumanoidOverrides) so guards render with proper armor instead of underwear. - Keep instance portal GameObjects animated (spinning/glowing) instead of freezing all GO animations indiscriminately. - Fix equipment disappearing after instance round-trip by resetting dirty tracking on world reload. - Fix multi-doodad-set loading: load both set 0 (global) and placement- specific doodad set, with dedup to avoid double-loading. - Clear placedWmoIds in softReset/unloadAll to prevent stale dedup. - Apply MODF rotation to instance WMOs, snap player to WMO floor. - Re-enable rebuildSpatialIndex in setInstanceTransform. - Store precomputeFloorCache results in precomputed grid. - Add F8 debug key for WMO floor diagnostics at player position. - Expand mapIdToName with all Classic/TBC/WotLK instance map IDs.	2026-03-04 19:47:01 -08:00
Kelsi	2e432fc123	Eliminate per-instance hash lookups in M2 render/shadow culling loops Use cached model flags (isValid, isSmoke, isInvisibleTrap, isGroundDetail, disableAnimation, boundRadius) on M2Instance instead of models.find() in the hot culling paths. Also complete cached flag initialization in createInstanceWithMatrix().	2026-03-04 08:28:21 -08:00
Kelsi	30fa9836d9	Fix glow sprite flashing, move fadeAlpha to push constants, throttle character bones - Glow sprites now use dedicated vertex buffer (glowVB_) separate from M2 particle buffer to prevent data race when renderM2Particles() overwrites glow data mid-flight - Move fadeAlpha from shared material UBO to per-draw push constants, eliminating cross-instance alpha race on non-double-buffered UBOs - Smooth adaptive render distance transitions to prevent pop-in/out at instance count thresholds (1000/2000) - Distance-tiered character bone throttling: near (<30u) every frame, mid (30-60u) every 3rd, far (60-120u) every 6th frame - Skip weapon instance animation updates (transforms set by parent bones)	2026-03-04 08:17:32 -08:00
Kelsi	3482dacea8	Optimize M2 update loop: skip static doodads, incremental spatial index Some checks failed Build / Build (arm64) (push) Has been cancelled Details Build / Build (x86-64) (push) Has been cancelled Details Build / Build (macOS arm64) (push) Has been cancelled Details Build / Build (windows-arm64) (push) Has been cancelled Details Build / Build (windows-x86-64) (push) Has been cancelled Details Security / CodeQL (C/C++) (push) Has been cancelled Details Security / Semgrep (push) Has been cancelled Details Security / Sanitizer Build (ASan/UBSan) (push) Has been cancelled Details - Split M2 instances into fast-path index lists (animated, particle-only, particle-all, smoke) to avoid iterating all 46K instances per frame - Cache model flags (hasAnimation, disableAnimation, isSmoke, etc.) on M2Instance struct to eliminate per-frame hash lookups - Replace full rebuildSpatialIndex on position/transform updates with incremental grid cell remove+add, preventing 8.5ms/frame rebuild cost - Advance animTime for all instances (texture UV animation) but only compute bones and particles for the ~3K that need it M2_UPDATE: 10.7ms → 2.0ms, FPS: 35 → 55-59	2026-03-02 14:45:49 -08:00
Kelsi	b948720ec3	Fix NPC chat showing only name without message text Two bugs in SMSG_MESSAGECHAT parser for MONSTER_SAY/YELL/EMOTE: 1. Sender name included trailing null byte from server (nameLen includes null terminator). The embedded null in std::string caused ImGui to truncate the concatenated display string at the NPC name, hiding " says: <message>" entirely. 2. Missing NamedGuid receiver name for non-player/non-pet targets. When the receiver GUID is a creature, the server writes an additional SizedCString (target name) that we weren't reading, shifting all subsequent field reads. Also adds MONSTER_WHISPER, MONSTER_PARTY, RAID_BOSS_EMOTE, RAID_BOSS_WHISPER chat types with proper parsing and display formatting (says/yells/whispers).	2026-03-02 08:31:34 -08:00

1 2 3 4 5 ...

553 commits