brkho/mesa - mesa - Brian's Gitea

Commit Graph

Author	SHA1	Message	Date
Kenneth Graunke	f3bdffc33d	iris: Only enable GL_AMD_depth_clamp_separate on Gen9+ The hardware feature is new as of Gen9+. I accidentally enabled it on Gen8.	7 years ago
Kenneth Graunke	dcfca0af7c	iris: Set XY Clipping correctly. I was setting it based off a pipe_rasterizer_state field that appears to be entirely dead outside of the draw module respecting it. I should be setting it when the primitive type reaching the SF is neither points nor lines. This is, unfortunately, rather dirty, as we have to look at the rasterizer state, the geometry shader state, the tessellation evaluation shader state, and the primitive type...	7 years ago
Rhys Perry	bd4c661ad0	ac,ac/nir: use a better sync scope for shared atomics https://reviews.llvm.org/rL356946 (present in LLVM 9 and later) changed the meaning of the "system" sync scope, making it no longer restricted to the memory operation's address space. So a single address space sync scope is needed for shared atomic operations (such as "system-one-as" or "workgroup-one-as") otherwise buffer_wbinvl1 and s_waitcnt instructions can be created at each shared atomic operation. This mostly reimplements LLVMBuildAtomicRMW and LLVMBuildAtomicCmpXchg to allow for more sync scopes and uses the new functions in ac->nir with the "workgroup-one-as" or "workgroup" sync scopes. F1 2017 (4K, Ultra High settings, TAA), avg FPS : 59 -> 59.67 (+1.14%) Strange Brigade (4K, ~highest settings), avg FPS : 51.5 -> 51.6 (+0.19%) RotTR/mountain (4K, VeryHigh settings, FXAA), avg FPS : 57.2 -> 57.2 (+0.0%) RotTR/tomb (4K, VeryHigh settings, FXAA), avg FPS : 42.5 -> 43.0 (+1.17%) RotTR/valley (4K, VeryHigh settings, FXAA), avg FPS : 40.7 -> 41.6 (+2.21%) Warhammer II/fallen, avg FPS : 31.63 -> 31.83 (+0.63%) Warhammer II/skaven, avg FPS : 37.77 -> 38.07 (+0.79%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	7 years ago
Hal Gentz	e91ee763c3	glx: Fix synthetic error generation in __glXSendError To quote Uli Schlachter, who understands this stuff more than I do: > The function __glXSendError() in mesa's src/glx/glx_error.c invents an X11 > protocol error out of thin air. For the sequence number it uses dpy->request. > This is the sequence number of the last request that was sent. _XError() will > then update dpy->last_request_read based on the sequence number of the error > that just "came in". > > If now another something comes in with a sequence number less than > dpy->last_request_read, since sequence numbers are monotonically increasing, > widen() will incorrectly add 1<<32 to the sequence number and things might go > downhill afterwards. `__glXSendErrorForXcb` was also patched, as that's the function that `glXCreateContextAttribsARB` actually uses. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=99781 Cc: mesa-stable@lists.freedesktop.org Fixes: `ad503c41` 'apple: Initial import of libGL for OSX from AppleSGLX svn repository' Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Hal Gentz <zegentzy@protonmail.com>	7 years ago
Lionel Landwerlin	9628631a38	Revert "anv: limit URB reconfigurations when using blorp" In commit `0d46e404` ("anv: limit URB reconfigurations when using blorp") we tried to limit the number of URB reconfiguration by checking if the last allocation is large enough to fit the blorp dispatch. We used the last bound pipeline to compare the allocation. The problem with this is that the pipeline is bound but its commands might not have been emitted into the command buffer yet. Let's just revert commit `0d46e40467` since it didn't seem to yield any performance improvement. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `0d46e404` ("anv: limit URB reconfigurations when using blorp") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110535 Acked-by: Jason Ekstrand <jason@jlekstrand.net>	7 years ago
Erik Faye-Lund	cc5b8a938a	mesa/st: remove always-false state This code is essentially dead now. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	7 years ago
Erik Faye-Lund	be110ba2e4	mesa/st: accept NULL and empty buffer objects It's prefectly legal and well-defined to render using a non-existing or empty buffer object. The data coming out of the buffer object isn't well defined unless we have the robustness flag set on the context, but that's a different matter, and up to the shader hardware; it's the same as out-of-bounds reads. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	7 years ago
Erik Faye-Lund	ef13691e0c	swr: support NULL-resources It's legal for a buffer-object to have a NULL-resource, but let's just skip over it, as there's nothing to do. This patch switches the order of the conditionals in swr_update_derived, so the logic becomes a bit more straight forward: if (is_user_buffer) ... else if (resource) ... else ... ...instead of this: if (!is_user_buffer) if (resource) ... else ... else ... Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Alok Hota <alok.hota@intel.com>	7 years ago
Erik Faye-Lund	04b0c6e9df	nouveau: support NULL-resources It's legal for a buffer-object to have a NULL-resource, but let's just skip over it, as there's nothing to do. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: Karol Herbst <kherbst@redhat.com>	7 years ago
Erik Faye-Lund	a11945d179	i915: support NULL-resources It's legal for a buffer-object to have a NULL-resource, but let's just skip over it, as there's nothing to do. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	7 years ago
Erik Faye-Lund	a8e8204b18	gallium/u_vbuf: support NULL-resources It's legal for a buffer-object to have a NULL-resource, but let's just skip over it, as there's nothing to do. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	7 years ago
Erik Faye-Lund	0607ceb655	mesa/st: remove impossible error-check st_setup_current never sets this flag, and it's already checked against right before. So let's remove this pointless check. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	7 years ago
Andres Gomez	c81fbb42d9	glsl/linker: check for xfb_offset aliasing From page 76 (page 80 of the PDF) of the GLSL 4.60 v.5 spec: " No aliasing in output buffers is allowed: It is a compile-time or link-time error to specify variables with overlapping transform feedback offsets." Currently, this is expected to fail, but it succeeds: " ... layout (xfb_offset = 0) out vec2 a; layout (xfb_offset = 0) out vec4 b; ... " Fixes the following piglit test: tests/spec/arb_enhanced_layouts/compiler/transform-feedback-layout-qualifiers/xfb_offset/invalid-overlap.vert Fixes the following test: KHR-GL44.enhanced_layouts.xfb_output_overlapping v2: - Use a data structure to track the used components instead of a nested loop (Ilia). v3: - Take the BITSET_WORD array out from the gl_transform_feedback_buffer struct and make it local to the validation process (Timothy). - Do not use a nested scope for the validation (Timothy). v4: - Add reference to the fixed piglit test in the commit log. - Add reference to the fixed VK-GL-CTS test in the commit log (Tapani). - Empty initialize the BITSET_WORD pointers array (Tapani). Cc: Timothy Arceri <tarceri@itsqueeze.com> Cc: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	7 years ago
Patrick Lerda	812288bf0f	lima/ppir: fix pointer referenced after a free Issue detected by valgrind. Fixes: `92d7ca4b1c` ("gallium: add lima driver") Signed-off-by: Patrick Lerda <patrick9876@free.fr> Reviewed-by: Qiang Yu <yuq825@gmail.com>	7 years ago
Eleni Maria Stea	bb953de96c	radv: consider MESA_VK_VERSION_OVERRIDE when setting the api version Before setting the physical device API version, we should check if the MESA_VK_VERSION_OVERRIDE environment variable is set and take it into account. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	7 years ago
Kenneth Graunke	9dcf90d7ba	intel/fs: Don't emit empty ELSE blocks. While we can clean this up later, it's trivial to not generate the stupid code in the first place, which saves some optimization work. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Matt Turner <mattst88@gmail.com>	7 years ago
Kenneth Graunke	2b44b27dbe	nir: Add a new nir_cf_list_is_empty_block() helper. Helper and name suggested by Eric Anholt. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Matt Turner <mattst88@gmail.com>	7 years ago
Kenneth Graunke	08dc93c67c	glsl/list: Add an exec_list_is_singular() helper. Similar to list_is_singular() in util/list.h. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Matt Turner <mattst88@gmail.com>	7 years ago
Tapani Pälli	376c3e8f87	anv: expose VK_EXT_queue_family_foreign on Android VK_ANDROID_external_memory_android_hardware_buffer requires this extension. It is safe to enable it since currently aux usage is disabled for ahw buffers. Fixes following dEQP extension dependency test on Android: dEQP-VK.api.info.device#extensions Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	7 years ago
Andreas Baierl	c960323a81	lima/ppir: Add gl_FragCoord handling Treat gl_FragCoord variable as a system value and lower the w component with a nir pass. Add the necessary bits for correct codegen. Signed-off-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Qiang Yu <yuq825@gmail.com>	7 years ago
Andreas Baierl	b82de2b4d7	nir: add rcp(w) lowering for gl_FragCoord On some hardware (e.g. Mali400) the shader needs to apply some transformations for correct gl_FragCoord handling. The lowering actions look like the following in pseudocode: gl_FragCoord.xyz = gl_FragCoord_orig.xyz gl_FragCoord.w = 1.0 / gl_FragCoord_orig.w Add this lowering as a nir pass in preparation for using it in the driver. Signed-off-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	7 years ago
Romain Failliot	7050eccd77	docs: changed "Done" to "DONE" in features.txt Mesamatrix.net expects uppercase. Acked-by: Timothy Arceri <tarceri@itsqueeze.com>	7 years ago
Alyssa Rosenzweig	ec65e1b763	panfrost: Workaround -bshadow regression I have no idea what's happening here, but let's not regress an app that used to work in the mean time while we're figuring it out.. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	7 years ago
Alyssa Rosenzweig	3978614d88	panfrost/midgard: Safety check immediate precision degradations Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	7 years ago
Alyssa Rosenzweig	0ebf1047a4	panfrost: Use fp32 (not fp16) varyings In a perfect world, we'd use fp16 varyings for mediump and fp32 for highp, allowing us to get a performance win without sacrificing conformance. Unfortunately, we're not there (yet), so it's better we assume always fp32 than always fp16 to avoid artefacts / breaking a lot of deqp. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	7 years ago
Alyssa Rosenzweig	a81267f228	panfrost/midgard: imov workaround Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	7 years ago
Alyssa Rosenzweig	53d6e11393	panfrost/midgard: Fix tex propogation Unbreaks mpv. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	7 years ago
Alyssa Rosenzweig	68a1508dc9	panfrost/midgard: Fix regressions in -bjellyfish Two fixes here, one is that we tried to copyprop non-strictly-SSA values which was bound to fly in our face. The other was peeling back the imov workaround.. Turns out we still need that. More research is needed still, but let's not regress real apps. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	7 years ago
Alyssa Rosenzweig	bdaa23b32b	panfrost/midgard: Only copyprop without an outmod With an outmod, we would need to propagate that through, which is for future work. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	7 years ago
Alyssa Rosenzweig	a3d6a3dfc4	Revert "panfrost/midgard: Extend copy propagation pass" Fixes: commit `b53b4573c3`. Optimization gone wrong. In the future, we should try this again (it's a net win if implemented right), but at the moment this just regresses. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	7 years ago
Samuel Pitoiset	07745f9494	radv: add missing VEGA20 chip in radv_get_device_name() Otherwise it returns "AMD RADV unknown". Cc: 19.0 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	7 years ago
Kenneth Graunke	6bd4cb920e	iris: Fix zeroing of transform feedback offsets in strange cases. Some of the dEQP.functional.transform_feedback tests end up doing the following sequence of operations: 1. BeginTransformFeedback 2. PauseTransformFeedback 3. Draw 4. ResumeTransformFeedback At step 1, we'd pack 3DSTATE_SO_BUFFER commands saying to zero the SO_WRITE_OFFSET registers. At step 2, we disable streamout, so step 3 doesn't bother emitting those commands. Then, step 4 re-packs new 3DSTATE_SO_BUFFER commands with offset = 0xFFFFFFFF, saying to continue appending at the existing offset. This loads the value from the BO as the offsets - but we never actually zeroed it. So, just maintain a flag saying "we actually emitted the commands", and stomp offset back to zero until we emit some.	7 years ago
Eric Anholt	edb04953c8	vc4: Fall back to renderonly if the vc4 driver doesn't have v3d. I have a platform with vc4 display but V3D 4.x. We can fall back on kmsro's probing to bring up the v3d gallium driver. Acked-by: Rob Clark <robdclark@chromium.org>	7 years ago
Eric Anholt	7e069832a0	kmsro: Add support for V3D. Like vc4, we expect to have SOCs with various displays that have a single V3D instance for rendering. v2: Add v3d to the list of drivers that make enabling kmsro valid. Acked-by: Rob Clark <robdclark@chromium.org>	7 years ago
Marek Olšák	a8a0e5c03c	radeonsi: don't ignore PIPE_FLUSH_ASYNC Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	7 years ago
Eric Anholt	fb0611df3d	v3d: Fix detection of TMU write sequences in register spilling. We can't use the QPU functions to detect this until register allocation is done and we've moved inst->dst into inst->qpu. Fixes bad TMU sequences from register spilling in KHR-GLES31.core.compute_shader.shared-max.	7 years ago
Eric Anholt	18894a5e5a	v3d: Fix detection of the last ldtmu before a new TMU op. We were looking at the start instruction, instead of scanning through the list of following instructions to find any more ldtmus.	7 years ago
Eric Anholt	575caab895	v3d: Re-add support for memory_barrier_shared. Looks like I lost it in a rebase conflict resolution. We'd hit the unknown intrinsic assertion in KHR-GLES31.core.compute_shader.shared-struct. Fixes: `6b1c659825` ("v3d: Add Compute Shader compilation support.")	7 years ago
Eric Anholt	971a13d805	Revert "v3d: Disable PIPE_CAP_BLIT_BASED_TEXTURE_TRANSFER." This reverts commit ccce9409470c1053c40c822d759b9bd417062bc0, leaving a note as to why we had to (corruption in chromium, breaking some GLES3.1 tests).	7 years ago
Eric Anholt	49071b2e3f	v3d: Don't try to update the shadow texture for separate stencil. There are two cases where v3d's sampler view's resource doesn't match the base's: shadow textures for sampling from raster, and pointing at the separate depth texture for z32f_s8x24. We only want to update shadow for the first case. Fixes dEQP-GLES31.functional.stencil_texturing.render.depth32f_stencil8_draw when run after the previous testcase.	7 years ago
Eric Anholt	4358904c06	v3d: Add a note about i/o indirection for future performance work.	7 years ago
Eric Anholt	c74d0e7f62	vc4: Use _mesa_hash_table_remove_key() where appropriate.	7 years ago
Eric Anholt	d8486c2ad7	v3d: Use _mesa_hash_table_remove_key() where appropriate.	7 years ago
Eric Anholt	24587ae8ae	v3d: Assert that we do request the normal texturing return data. An unused tex should be DCEed, but if it wasn't we'd run into trouble with not doing a TMUWT.	7 years ago
Eric Anholt	42210a4351	v3d: Apply the GFXH-930 workaround to the case where the VS loads attrs. We were emitting a dummy load for when the VS doesn't load any attributes, but we also need to emit a dummy load for when the render VS loads attributes but the binner VS doesn't. Fixes simulator assertion failures and GPU hangs on KHR-GLES31.core.texture_gather.\*	7 years ago
Eric Anholt	448fc3ea42	v3d: Fill in the ignored segment size fields to appease new simulator. We are assured that the input segment size field is ignored for !separate_segs mode, and now the simulator wants an in-range value set regardless of whether it's functionally ignored or not.	7 years ago
Tapani Pälli	af06963d24	glsl: use empty brace initializer fixes following warning with clang: warning: suggest braces around initialization of subobject Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	7 years ago
coypu	976004d0e7	gbm: don't return void Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	7 years ago
Tapani Pälli	7a7f182dac	nir: use braces around subobject in initializer Used same syntax as elsewhere with Mesa sources, verified result against MSVC with godbolt.org. fixes following warning with clang: warning: suggest braces around initialization of subobject v2: empty braces -> braces around subobject (Caio, Kristian) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	7 years ago
Kristian H. Kristensen	a7c70bb2a1	freedreno/drm: Quiet pointer to u64 conversion warning	7 years ago

1 2 3 4 5 ...

110456 Commits (f3bdffc33db3b95b87d3f1c29bbf52439ef7d8c4) All Branches Search

110456 Commits (f3bdffc33db3b95b87d3f1c29bbf52439ef7d8c4)

All Branches