Kelsidavis-WoWee

mirror of https://github.com/Kelsidavis/WoWee.git synced 2026-04-13 16:13:51 +00:00

Author	SHA1	Message	Date
Kelsi	1feb6ea63f	fix(rendering): sync async upload batches before rendering Wait on in-flight upload batch fences at the start of each frame and insert a memory barrier (transfer→fragment shader) so the graphics queue sees completed layout transitions from the transfer queue. Fixes VK_IMAGE_LAYOUT_UNDEFINED validation errors for freshly loaded textures.	2026-04-03 22:09:41 -07:00
Kelsi	23bda2d476	fix(vulkan): enable missing device features for FSR2 compute shaders AMD RADV validation flagged missing shaderStorageImageWriteWithoutFormat, shaderInt16, shaderFloat16, and deviceCoherentMemory. The first two are now required device features; shaderFloat16 is optionally enabled via Vulkan 1.2 feature query; AMD device coherent memory extension and feature are enabled when available to prevent VMA memory type errors.	2026-04-03 21:20:37 -07:00
Kelsi	17c16150d6	fix(vulkan): MSAA crash on AMD RADV due to vkCreateRenderPass2 null dispatch Instance was created with Vulkan 1.1 but depthResolveSupported_ was gated on the physical device's API version (1.2+ on RADV). This caused vkCreateRenderPass2 (core 1.2) to dispatch through a null function pointer when MSAA was enabled. Now requests 1.2 instance with 1.1 minimum fallback and gates depth resolve on the actual instance API version. Also removes all diagnostic crash-phase instrumentation from the previous investigation.	2026-04-03 20:58:32 -07:00
Kelsi	3ac8c4d95f	fix(rendering): wait all frame fences before freeing shared descriptor sets deferAfterFrameFence only waits for one frame slot's fence, but shared resources (material descriptor sets, vertex/index buffers) are bound by both in-flight frames' command buffers. On AMD RADV this caused vkFreeDescriptorSets errors and eventual SIGSEGV. Add deferAfterAllFrameFences: queues to every frame slot with a shared counter so cleanup runs exactly once, after the last slot is fenced. Use it for WMO, terrain, water, and character model shared resources. Per-frame bone sets keep using deferAfterFrameFence (already correct). Also fix character renderer vertex format: R8G8B8A8_UINT -> _SINT to match shader's ivec4 input (RADV validation rejects the mismatch).	2026-04-03 19:48:43 -07:00
Kelsi	ac5c61203d	fix(rendering): defer descriptor set destruction during streaming unload M2 destroyInstanceBones and WMO destroyGroupGPU freed descriptor sets and buffers immediately during tile streaming, while in-flight command buffers still referenced them — causing DEVICE_LOST on AMD RADV. Now defers GPU resource destruction via deferAfterFrameFence in streaming paths (removeInstance, removeInstances, unloadModel). Immediate destruction preserved for shutdown/clear paths that vkDeviceWaitIdle first. Also: vkDeviceWaitIdle before WMO backfillNormalMaps descriptor rebinds, and fillModeNonSolid added to required device features for wireframe pipelines on AMD.	2026-04-03 18:30:52 -07:00
Kelsi	8e1addf7a6	fix(rendering): increase ImGui descriptor pool from 100 to 2048 The pool was exhausted by cached spell/item/talent icon textures, causing vkAllocateDescriptorSets to fail inside ImGui_ImplVulkan_AddTexture. The NVIDIA driver crashed on the subsequent invalid descriptor write. Also add a null-check on the returned descriptor set so pool exhaustion gracefully returns VK_NULL_HANDLE instead of crashing.	2026-04-03 18:14:46 -07:00
Kelsi	2096e67bf9	fix(rendering): prevent shutdown crash from deferred cleanup use-after-free During shutdown, VkContext::runDeferredCleanup() was executing lambdas that called vkFreeDescriptorSets on descriptor pools already destroyed by Renderer::shutdown(). This corrupted the validation layer's internal state, causing a SIGSEGV during process exit on AMD RADV. Clear the deferred queues without executing them — vkDestroyDevice reclaims all device-child resources anyway. Also guard against the double shutdown() call (explicit + destructor).	2026-04-03 18:02:24 -07:00
Kelsi	b2cb98e969	fix(rendering): per-image semaphores and depth-format shadow placeholder Avoid semaphore reuse while the presentation engine still holds a reference by switching from per-frame-slot to per-swapchain-image semaphores with a rotating free semaphore for acquire. Replace the R8G8B8A8_UNORM dummy white texture in CharacterPreview with a proper D16_UNORM depth texture cleared to 1.0, matching the sampler2DShadow expectation in shaders. AMD RADV enforces strict format/sampler type compatibility.	2026-04-03 17:52:48 -07:00
Kelsi	4f7912cf45	fix(rendering): water reflection render pass compat, anisotropy feature, shadow pool race Three bugs found via AMD RADV crash log: 1. Water reflection render pass used BOTTOM_OF_PIPE as srcStageMask but pipelines were created against the main pass (EARLY_FRAGMENT_TESTS | COLOR_ATTACHMENT_OUTPUT). AMD enforces strict render pass compatibility → SIGSEGV when scene renders into reflection texture. 2. samplerAnisotropy was never enabled during device creation despite being used in sampler creation; now requested via PhysicalDeviceSelector. 3. Shadow texture descriptor pool was reset each frame while prior frame's command buffers might still reference it. Split into per-frame-slot pools so each reset is fence-guarded.	2026-04-03 17:41:14 -07:00
Kelsi	62b8a757a3	fix(rendering): skip TRANSIENT_ATTACHMENT for MSAA on GPUs without lazily allocated memory AMD RDNA4 (9070XT) crashes with SIGSEGV when MSAA is enabled because the driver optimizes TRANSIENT images for tile-only storage. Without lazily allocated memory backing, the MSAA resolve reads unbacked memory. Now we only set TRANSIENT+LAZILY_ALLOCATED when the device actually exposes that memory type.	2026-04-03 17:23:52 -07:00
Kelsi	8c7db3e6c8	refactor: name FNV-1a/transport constants, fix dead code, add comments - vk_context: name FNV-1a hash constants (kFnv1aOffsetBasis/kFnv1aPrime) with why-comment on algorithm choice for sampler cache - transport_manager: collapse redundant if/else that both set looping=false into single unconditional assignment, add why-comment explaining the time-closed path design - transport_manager: hoist duplicate kMinFallbackZOffset constants out of separate if-blocks, add why-comment on icebreaker Z clamping - entity: expand velocity smoothing comment — explain 65/35 EMA ratio and its tradeoff (jitter suppression vs direction change lag)	2026-03-30 14:48:06 -07:00
Kelsi	5b91ef398e	fix: return UINT32_MAX from findMemType on failure, add [[nodiscard]] The findMemType/findMemoryType helper in auth_screen, loading_screen, and vk_context returned 0 on failure — a valid memory type index. Changed to return UINT32_MAX and log an error, so vkAllocateMemory receives an invalid index and fails cleanly rather than silently using the wrong memory type. Add [[nodiscard]] to VkBuffer::uploadToGPU/createMapped and VkContext::initialize/recreateSwapchain so callers that ignore failure are flagged at compile time. Suppress with (void) cast at 3 call sites where failure is non-actionable (resize best-effort).	2026-03-27 14:53:29 -07:00
Kelsi	ba99d505dd	refactor: remaining C-style casts, color constants, and header guard cleanup Replace ~37 remaining C-style casts with static_cast across 16 files. Extract named color constants (kColorRed/Green/Yellow/Gray) and dialog window flags (kDialogFlags) in game_screen.cpp, replacing 72 inline literals. Normalize keybinding_manager.hpp to #pragma once.	2026-03-25 11:57:22 -07:00
Kelsi	1dd3823013	perf: use second GPU queue for parallel texture/buffer uploads Request 2 queues from the graphics family when available (NVIDIA exposes 16, AMD 2+). Upload batches now submit to queue[1] while rendering uses queue[0], enabling parallel GPU transfers without queue-family ownership transfer barriers (same family). Falls back to single-queue path on GPUs with only 1 queue in the graphics family. Transfer command pool is separate to avoid contention.	2026-03-24 14:09:16 -07:00
Kelsi	a152023e5e	fix: add VkSampler cache to prevent sampler exhaustion crash Validation layers revealed 9965 VkSamplers allocated against a device limit of 4000 — every VkTexture created its own sampler even when configurations were identical. This exhausted NVIDIA's sampler pool and caused intermittent SIGSEGV in vkCmdBeginRenderPass. Add a thread-safe sampler cache in VkContext that deduplicates samplers by FNV-1a hash of all 14 VkSamplerCreateInfo fields. All texture, render target, renderer, water, and loading screen sampler creation now goes through getOrCreateSampler(). Textures set ownsSampler_=false so shared samplers aren't double-freed. Also auto-disable anisotropy in the cache when the physical device doesn't support the samplerAnisotropy feature, fixing the validation error VUID-VkSamplerCreateInfo-anisotropyEnable-01070.	2026-03-24 11:44:54 -07:00
Kelsi	1556559211	fix: skip VkPipelineCache on NVIDIA to prevent driver crash VkPipelineCache causes vkCmdBeginRenderPass to SIGSEGV inside libnvidia-glcore.so on NVIDIA 590.x drivers. Skip pipeline cache creation on NVIDIA GPUs — NVIDIA drivers already provide built-in shader disk caching, so the Vulkan-level cache is redundant. Pipeline cache still works on AMD and other vendors.	2026-03-24 10:30:25 -07:00
Kelsi	d2a396df11	feat: log GPU vendor/name at init, add PLAY_SOUND diagnostics Log GPU name and vendor ID during VkContext initialization for easier debugging of GPU-specific issues (FSR3, driver compat, etc.). Add isAmdGpu()/isNvidiaGpu() accessors. Temporarily log SMSG_PLAY_SOUND and SMSG_PLAY_OBJECT_SOUND at WARN level (sound ID, name, file path) to diagnose unidentified ambient NPC sounds reported by the user.	2026-03-24 09:56:54 -07:00
Kelsi	c8c01f8ac0	perf: add Vulkan pipeline cache persistence for faster startup Create a VkPipelineCache at device init, loaded from disk if available. All 65 pipeline creation calls across 19 renderer files now use the shared cache. On shutdown, the cache is serialized to disk so subsequent launches skip redundant shader compilation. Cache path: ~/.local/share/wowee/pipeline_cache.bin (Linux), ~/Library/Caches/wowee/ (macOS), %APPDATA%\wowee\ (Windows). Stale/corrupt caches are handled gracefully (fallback to empty cache).	2026-03-24 09:47:03 -07:00
Kelsi	6cfb439fd6	fix(vulkan): defer resource frees until frame fence	2026-03-14 03:32:31 -07:00
Kelsi	19eb7a1fb7	fix: animation stutter, resolution crash, memory cap, spell tooltip hints, GO collision - Animation stutter: skip playAnimation(Run) for the local player in the server movement callback — the player renderer state machine already manages it; resetting animTime on every movement packet caused visible stutter - Resolution crash: reorder swapchain recreation so old swapchain is only destroyed after confirming the new build succeeded; add null-swapchain guard in beginFrame to survive the retry window - Memory cap: reduce cache budget from 80% uncapped to 50% hard-capped at 16 GB to prevent excessive RAM use on high-memory systems - Spell tooltip: suppress "Drag to action bar / Double-click to cast" hints when the tooltip is shown from the action bar (showUsageHints=false) - M2 collision: add watermelon/melon/squash/gourd to foliage (no-collision); exclude chair/bench/stool/seat/throne from smallSolidProp so invisible chair bounding boxes no longer trap the player	2026-03-10 22:26:50 -07:00
Kelsi	a4966e486f	Fix WMO wall collision, normal mapping, POM backfill, and M2/WMO rendering performance - Fix MOPY flag check (0x08 not 0x01) for proper wall collision detection - Cap MAX_PUSH to PLAYER_RADIUS to prevent gradual clip-through - Fix WMO doodad quaternion component ordering (X/Y swap) - Linear normal map strength blend in shader for smooth slider control - Enable shadow sampling for interior WMO groups (covered outdoor areas) - Backfill deferred normal/height maps after streaming with descriptor rebind - M2: prepareRender only iterates animated instances, bone dirty flag - M2: remove worker thread VMA allocation, skip unready bone instances - WMO: persistent visibility vectors, sequential culling - Add FSR EASU/RCAS shaders	2026-03-07 22:03:28 -08:00
Kelsi	7ac990cff4	Background BLP texture pre-decoding + deferred WMO normal maps (12x streaming perf) Move CPU-heavy BLP texture decoding from main thread to background worker threads for all hot paths: terrain M2 models, WMO doodad M2s, WMO textures, creature models, and gameobject WMOs. Each renderer (M2, WMO, Character) now accepts a pre-decoded BLP cache that loadTexture() checks before falling back to synchronous decode. Defer WMO normal/height map generation (3 per-pixel passes: luminance, box blur, Sobel) during terrain streaming finalization — this was the dominant remaining bottleneck after BLP pre-decoding. Terrain streaming stalls: 1576ms → 124ms worst case.	2026-03-07 15:46:56 -08:00
Kelsi	16b4336700	Batch GPU uploads to eliminate per-upload fence waits (stutter fix) Every uploadBuffer/VkTexture::upload called immediateSubmit which did a separate vkQueueSubmit + vkWaitForFences. Loading a single creature model with textures caused 4-8+ fence waits; terrain chunks caused 80+ per batch. Added beginUploadBatch/endUploadBatch to VkContext: records all upload commands into a single command buffer, submits once with one fence wait. Staging buffers are deferred for cleanup after the batch completes. Wrapped in batch mode: - CharacterRenderer::loadModel (creature VB/IB + textures) - M2Renderer::loadModel (doodad VB/IB + textures) - TerrainRenderer::loadTerrain/loadTerrainIncremental (chunk geometry + textures) - TerrainRenderer::uploadPreloadedTextures - WMORenderer::loadModel (group geometry + textures)	2026-03-07 12:19:59 -08:00
Kelsi	f1caf8c03e	Fix Stockades crash: suppress area triggers on initial login, handle VK_ERROR_DEVICE_LOST Root cause: LOGIN_VERIFY_WORLD path did not set areaTriggerCheckTimer_ or areaTriggerSuppressFirst_, so the Stockades exit portal (AT 503) fired immediately on login, teleporting the player back to Stormwind and crashing the GPU during the unexpected map transition. Fixes: - Set 5s area trigger cooldown + suppress-first in handleLoginVerifyWorld (same as SMSG_NEW_WORLD handler already did for teleports) - Add deviceLost_ flag to VkContext so beginFrame returns immediately once VK_ERROR_DEVICE_LOST is detected, preventing infinite retry loops - Track device lost from both fence wait and queue submit paths	2026-03-02 08:19:14 -08:00
Kelsi	a559d5944b	Fix shutdown hangs, bank bag icons/drag-drop, loading screen progress, and login spawn - Fix shutdown hang: skip vmaDestroyAllocator (walked thousands of allocations), replace unsafe pthread_timedjoin_np with plain join + early-exit checks in workers - Bank window: full icon rendering, click-and-hold pickup (0.10s), drag-drop for all bank slots including bank bag equip slots, same-slot drop detection - Loading screen: process one tile per frame for live progress updates - Camera reset: trust server position in online mode to avoid spawning under WMOs - Fix PLAYER_BYTES/PLAYER_BYTES_2 field indices, preserve purchasedBankBagSlots across inventory rebuilds, fix bank slot purchase result codes	2026-02-26 13:38:29 -08:00
Kelsi	bd0305f6dd	Stabilize Vulkan rendering state for minimap, foliage, and water	2026-02-22 09:34:27 -08:00
Kelsi	7dd1dada5f	Work on character rendering and frustum culling etc	2026-02-22 05:58:45 -08:00
Kelsi	325254dfcb	Port UI icon textures from OpenGL to Vulkan, fix loading screen clear values Replace all glGenTextures/glTexImage2D calls in UI code with Vulkan texture uploads via new VkContext::uploadImGuiTexture() helper. This fixes segfaults from calling OpenGL functions without a GL context (null GLEW function pointers). - Add uploadImGuiTexture() to VkContext with staging buffer upload pattern - Convert game_screen, inventory_screen, spellbook_screen, talent_screen from GLuint/GL calls to VkDescriptorSet/Vulkan uploads - Fix loading_screen clearValueCount (was 1, needs 2 or 3 for MSAA)	2026-02-22 03:32:08 -08:00
Kelsi	fa1867cf2f	Fix MSAA 8x crash and eliminate redundant GPU stalls - Add error handling: revert to 1x if recreateSwapchain fails - Clamp requested MSAA to device maximum before applying - Retry MSAA color image allocation without TRANSIENT on failure - Remove redundant vkDeviceWaitIdle from WMO/M2/Character recreatePipelines (caller already waits once, was causing ~13 stalls instead of 1)	2026-02-22 03:05:55 -08:00
Kelsi	e12141a673	Add configurable MSAA anti-aliasing, update auth screen and terrain shader - MSAA: conditional 2-att (off) vs 3-att (on) render pass with auto-resolve - MSAA: multisampled color+depth images, query max supported sample count - MSAA: .setMultisample() on all 25+ main-pass pipelines across 17 renderers - MSAA: recreatePipelines() on every sub-renderer for runtime MSAA changes - MSAA: Renderer::setMsaaSamples() orchestrates swapchain+pipeline+ImGui rebuild - MSAA: Anti-Aliasing combo (Off/2x/4x/8x) in Video settings, persisted - Update auth screen assets and terrain fragment shader	2026-02-22 02:59:24 -08:00
Kelsi	83b576e8d9	Vulkan Nightmare Experimentally bringing up Vulkan support	2026-02-21 22:04:17 -08:00

31 commits
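
Commit 3ac8c4d95f describes deferAfterAllFrameFences as queuing work to every frame slot with a shared counter, so that cleanup runs exactly once after the last slot is fenced. The following is a minimal sketch of that idea, not the repository's actual code: the class name DeferredCleanupQueue, the method onFrameFenceSignaled, and the two-slot kFramesInFlight constant are hypothetical, no real Vulkan calls are made, and the sketch assumes all calls happen on a single thread.

```cpp
#include <array>
#include <cstddef>
#include <functional>
#include <memory>
#include <utility>
#include <vector>

// Hypothetical stand-in for the engine's Vulkan context. It only models the
// bookkeeping; the callbacks would free descriptor sets or buffers in practice.
class DeferredCleanupQueue {
public:
    static constexpr std::size_t kFramesInFlight = 2;  // assumed slot count

    // Queue `fn` to run the next time `slot`'s fence is reported as signaled.
    void deferAfterFrameFence(std::size_t slot, std::function<void()> fn) {
        pending_[slot].push_back(std::move(fn));
    }

    // Queue `fn` to run exactly once, after every frame slot has been fenced.
    // Each slot receives a small wrapper that decrements a shared counter;
    // only the wrapper that brings the counter to zero invokes `fn`.
    void deferAfterAllFrameFences(std::function<void()> fn) {
        auto remaining = std::make_shared<std::size_t>(kFramesInFlight);
        auto cleanup = std::make_shared<std::function<void()>>(std::move(fn));
        for (std::size_t slot = 0; slot < kFramesInFlight; ++slot) {
            deferAfterFrameFence(slot, [remaining, cleanup] {
                if (--*remaining == 0) {
                    (*cleanup)();
                }
            });
        }
    }

    // Call after waiting on `slot`'s fence, e.g. at the start of a frame.
    void onFrameFenceSignaled(std::size_t slot) {
        // Swap the list out first so callbacks may safely queue new work.
        std::vector<std::function<void()>> ready;
        ready.swap(pending_[slot]);
        for (auto& fn : ready) {
            fn();
        }
    }

private:
    std::array<std::vector<std::function<void()>>, kFramesInFlight> pending_;
};
```

In this sketch a resource shared by both in-flight frames would be released with deferAfterAllFrameFences, while per-frame resources such as the bone sets mentioned in the commit would keep using deferAfterFrameFence on their own slot.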