brkho/mesa - mesa - Brian's Gitea

Commit graph

Author	SHA1	Message	Date
Paul Berry	bdf13dc832	i965: Stop passing num_samples to intel_miptree_alloc_hiz(). The number of samples is already available in the miptree data structure, so there's no need to pass it in. I suspect this may fix a subtle bug because in one case (intel_renderbuffer_update_wrapper) we were always passing zero for num_samples, even though the buffer in question was not guaranteed to be single-sampled. But I wasn't able to find a failing test case. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	12 years ago
Zack Rusin	d48054ff22	draw: don't crash if GS doesn't emit anything Technically it's legal for geometry shader to not emit any vertices. It's silly, but perfectly legal, so let's make draw stop crashing if it happens. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: José Fonseca <jfonseca@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	12 years ago
Eric Anholt	e56095dc2e	i965: Implement color clears using a simple shader in blorp. The upside is less CPU overhead in fiddling with GL error handling, the ability to use the constant color write message in most cases, and no GLSL clear shaders appearing in MESA_GLSL=dump output. The downside is more batch flushing and a total recompute of GL state at the end of blorp. However, if we're ever going to use the fast color clear feature of CMS surfaces, we'll need this anyway since it requires very special state setup. This increases the fail rate of some of the GLES3conform ARB_sync tests, because of the initial flush at the start of blorp. The tests already intermittently failed (because it's just a bad testing procedure), and we can return it to its previous fail rate by fixing the initial flush. Improves GLB2.7 performance 0.37% +/- 0.11% (n=71/70, outlier removed). v2: Rename the key member, use the core helper for sRGB, and use BRW_MASK_* enums, fix comment and indentation (review by Paul). v3: Rewrite a comment, drop a silly temporary variable (review by Ken) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	12 years ago
Eric Anholt	e34c857639	mesa: Make a Mesa core function for sRGB render encoding handling. v2: const-qualify ctx, and add a comment about the function (recommended by Brian and Kenneth). Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (v1)	12 years ago
Eric Anholt	db31bc5cfb	i965: Don't flush the batch at the end of blorp. Improves GLB2.7 performance 0.13% +/- 0.09% (n=104/105, outliers removed). More importantly, once color glClear()s are done through blorp in the next commit, this reduces regression in GLES3 conformance tests that rely on queueing up many glClear()s and having the GPU report being still busy in an ARB_sync query after that. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	12 years ago
Vadim Girlin	fb1eed9ec5	r600g/sb: remove unused code Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	12 years ago
Vadim Girlin	3f18dd818f	r600g/sb: collect shader statistics Collects various statistical information for each shader and total stats for contexts. Printed with R600_DEBUG=sb,sbstat Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	12 years ago
Vadim Girlin	6ba7a162b6	r600g/sb: don't propagate dead values in GVN pass In some cases we use value::gvn_source field to link values that are known to be equal before gvn pass (e.g. results of DOT4 in different slots of the same alu group), but then source value may become dead later and this confuses further passes. This patch resets value::gvn_source to NULL in the dce_cleanup pass if it points to dead value. Fixes segfault during shader optimization with ETQW. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	12 years ago
Vadim Girlin	3e476c311f	r600g/sb: use simple heuristic to limit register pressure It's not a complete register pressure tracking, yet it helps to prevent register allocation problems in some cases where they were observed. The problems are uncovered by false dependencies between fetch instructions introduced by some recent changes in TGSI and/or default backend. Sometimes we have code like this: ... SAMPLE R5.xyzw, R5.xyzw ... store R5.xyzw somewhere MOV R5.x, <next x coord> MOV R5.y, <next y coord> SAMPLE R5.xyzw, R5.xyzw ... <may be repeated a lot of times> With 2D resources, z and w in SAMPLE src reg aren't used and can be simply masked, but shader backend doesn't have this information, so it's considered as data dependency by optimization algorithms.	12 years ago
Vadim Girlin	6d6c8c88a3	r600g/sb: improve error checking in ra_coalesce pass	12 years ago
Vadim Girlin	188c893e65	r600g/sb: use source bytecode in case of optimization errors	12 years ago
Vadim Girlin	ad1df471d0	r600g: plug in optimizing backend Optimization is enabled with "R600_DEBUG=sb". Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	12 years ago
Vadim Girlin	2cd7691793	r600g/sb: initial commit of the optimizing shader backend	12 years ago
Vadim Girlin	fbb065d629	r600g: use enum type for domains field in struct r600_resource This prevents the problems when the header is included in C++ code.	12 years ago
Vadim Girlin	d5b30fd036	r600g: add new flags to isa instruction tables	12 years ago
Vadim Girlin	a919424215	r600g: always create reverse lookup isa tables	12 years ago
Vadim Girlin	7d555f2f4c	r600g: mask unused source components for SAMPLE This results in more clean shader code and may improve the quality of optimized code produced by r600-sb due to eliminated false dependencies in some cases. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	12 years ago
Eric Anholt	df410863d7	intel: Remove the last spans code! The remaining bits happen to do nothing that _swrast_span_render_start()/finish() don't do. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	12 years ago
Eric Anholt	526cf46666	intel: Move the S8 offset calc function near its remaining usage. It's not really span code ever since we stopped using spans for S8. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	12 years ago
Eric Anholt	e7c5e9949b	intel: Ensure renderbuffers are current when mapping them. In the case of rendering to windows in X, we would render to stale buffers (or not render at all!) if you hit a MapRenderbuffer as the first thing done to your window after new buffers are ready to be collected in DRI2. I think this also covers the weird comment about irb->mt being missing sometimes. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	12 years ago
Eric Anholt	0e8ef74c5f	mesa: Add a clarifying comment about rowStride of compressed textures. I always forget how we do this for compressed textures. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	12 years ago
Eric Anholt	3750ff9e5f	mesa: Remove the Map field from texture images. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	12 years ago
Eric Anholt	adf958d9c2	swrast: Always use MapTextureImage for mapping textures for swrast. Now that everything goes through ImageSlices[], we can rely on the driver's existing texture mapping function. A big block of code goes away on Radeon that looks like it was to deal with the validate that happened at SpanRenderStart, which no longer occurs since we don't need validation for the MapTextureImage hook. v2: Rewrite comment about ImageSlices, fix duplicated swImages, touch up unmap loop. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (v1) Reviewed-by: Brian Paul <brianp@vmware.com>	12 years ago
Eric Anholt	ea05e259c9	nouveau: Replace swrast_texture_image->Map usage with ->Buffer. This code is trying to deal with providing a map in the case that AllocTexImageBuffer was called, which is hooked up to the swrast variant. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	12 years ago
Eric Anholt	b78e48289f	nouveau: Just use MapTextureImage instead of duplicating the logic. MapTextureImage has the exact same logic, except it can also handle swrast-allocated buffers. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	12 years ago
Eric Anholt	f91823f026	swrast: Make a teximage's stored RowStride be in terms of bytes per row. For hardware drivers with pitch alignment requirements, a non-power-of-two-sized texture format won't end up being an integer number of pixels per row. Also, avoids having to change our units between MapTextureImage's rowStride and swrast's RowStride. This doesn't fully convert the compressed texel fetch path, but does make sure we don't drop any bits (not that we'd expect to). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	12 years ago
Eric Anholt	35e179b18c	swrast: Replace use of teximage Map in 1D/2D paths with ImageSlices[0]. This gets us ready for the Map field to die. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	12 years ago
Eric Anholt	0c883e46d8	swrast: Replace ImageOffsets with an ImageSlices pointer. This is a step toward allowing drivers to use their normal mapping paths, instead of requiring that all slice mappings come from an aligned offset from the first slice's map. This incidentally fixes missing slice handling in FXT1 swrast. v2: Use slice height helper function. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	12 years ago
Eric Anholt	e7ecc11311	swrast: Reuse _swrast_free_texture_image_buffer from drivers. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	12 years ago
Eric Anholt	0a484f1006	swrast: Move ImageOffsets allocation to shared code. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	12 years ago
Eric Anholt	f709c31c67	swrast: Clean up and explain the mapping process. v2: Move slice height calculation to a helper function (recommended by Brian). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (v1) Reviewed-by: Brian Paul <brianp@vmware.com>	12 years ago
Eric Anholt	741e540055	swrast: Factor out texture slice counting. This function is going to get used a lot more in upcoming patches. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	12 years ago
Eric Anholt	dca4178130	radeon: Remove some dead teximage mapping code. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	12 years ago
Eric Anholt	0de08fb594	radeon: Add missing swrast field initialization. This is the equivalent of intel's `80513ec8b4`. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	12 years ago
Vincent Lejeune	a6a4b70e2d	r600g/llvm: Fix opencl build	12 years ago
Alexander von Gluck IV	f1361ed084	Gallium: Use mmap on Haiku for executable memory vs malloc * Haiku now has DEP enabled by default.	12 years ago
Alexander von Gluck IV	60cc73c333	Mapi: Use mmap on Haiku for executable memory vs malloc * Haiku now has DEP enabled by default.	12 years ago
Alexander von Gluck IV	39bdf08628	Mesa: Use mmap on Haiku for executable memory vs malloc * Haiku now has DEP enabled by default.	12 years ago
Vincent Lejeune	51e9bfdc48	r600g/llvm: get use_kill from compiler shader	12 years ago
Eric Anholt	a79786af64	i965/fs: Print out the estimated cycle count in INTEL_DEBUG=wm This could be used by shader-db for hopefully more accurate regression testing. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	12 years ago
Eric Anholt	61ca2c4f73	i965/fs: Allow LRPs with uniform registers. Improves GLB2.7 performance on my HSW by 0.671455% +/- 0.225037% (n=62). v2: Make is_valid_3src() a method of the fs_reg. (recommended by Ken) Reviewed-by: Matt Turner <mattst88@gmail.com> (v1) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (v1)	12 years ago
Eric Anholt	de7e8b1d01	intel: Be more conservative in disabling tiling to save memory. Improves GLB2.7 trex performance 1.01985% +/- 0.721366% on my IVB (n=10) and by 3.38771% +/- 0.584241% (n=15) on my HSW, due to a 32x32 ARGB8888 cubemap going from untiled to tiled. Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch>	12 years ago
Eric Anholt	73bc6061f5	i965: Disable Z16 on contexts that don't require it. It appears that Z16 on Intel hardware is in fact slower than Z24, so people are getting surprisingly hurt when trying to use Z16 as a performance-versus-precision tradeoff, or when they're targeting GLES2 and that's all you get. GL 3.0+ have Z16 on the list of required exact format sizes, but GLES doesn't, so choose the better-performing layout in that case. Improves GLB 2.7 trex performance at 1920x1080 by 10.7% +/- 1.1% (n=3) on my IVB system. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	12 years ago
Eric Anholt	e409889213	intel: Report FBO incompleteness causes through GL_ARB_debug_output. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	12 years ago
Eric Anholt	6ae473221a	intel: Fold the one last function intel_tex_format.c into the caller. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	12 years ago
Eric Anholt	40b207b62f	mesa: Fix error checking for GS UBO getters. These are supposed to be present if both things are available, but we were enabling them if either one was.	12 years ago
Eric Anholt	072709da91	mesa: Add a clarifying comment about EXTRA_ error checking.	12 years ago
Eric Anholt	eac1199604	mesa: Add an extra clarifying set of braces to getter checking. For this multi-page single statement, my thought at the end was that the next block was mis-indented, rather than that the dropped indentation actually indicated the end of the loop.	12 years ago
Eric Anholt	2534f0a57d	mesa: Fix error checking for getters consisting of only API versions. In almost all of our cases, getters that are turned on for only some API variants will have an extension listed as one of the things that can enable it, and thus api_check gets set. For extra_gl30_es3 (used for NUM_EXTENSIONS, MAJOR_VERSION, MINOR_VERSION) on a GL 2.1 context, though, we would check twice, not find either one, but never actually throw the error.	12 years ago
Eric Anholt	d63a10afcc	mesa: Clarify the names of error checking variables for glGet. There's no reason to actually count these things, so the integer ++ behavior was just confusing.	12 years ago
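
Commit f91823f026 in the table above says that, with pitch alignment, a row of a non-power-of-two-sized format may not be a whole number of pixels, which is why the stride is kept in bytes. The following is a minimal sketch of that arithmetic, not Mesa's code: `align_up`, `row_stride_bytes`, and `texel_offset` are hypothetical helper names, and the 64-byte alignment in the usage note is an assumed value chosen for illustration.

```c
#include <assert.h>
#include <stddef.h>

/* Round value up to the next multiple of align.
 * Assumption: align is a non-zero power of two. */
static size_t align_up(size_t value, size_t align)
{
    assert(align != 0 && (align & (align - 1)) == 0);
    return (value + align - 1) & ~(align - 1);
}

/* Hypothetical helper: bytes per row of a linear image whose rows
 * must start on a pitch_align-byte boundary. */
static size_t row_stride_bytes(size_t width_px, size_t bytes_per_px,
                               size_t pitch_align)
{
    return align_up(width_px * bytes_per_px, pitch_align);
}

/* Hypothetical helper: byte offset of texel (x, y) when the stride is
 * stored in bytes, so no pixels-per-row conversion is needed. */
static size_t texel_offset(size_t stride_bytes, size_t bytes_per_px,
                           size_t x, size_t y)
{
    return y * stride_bytes + x * bytes_per_px;
}
```

For a 5-pixel-wide image at 3 bytes per pixel with an assumed 64-byte pitch alignment, `row_stride_bytes(5, 3, 64)` gives 64 bytes, which is not a multiple of 3, so a stride expressed in pixels could not represent it exactly.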

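Commit 40b207b62f in the table above describes getters that should be available only when both features are present but were being enabled when either one was. The sketch below shows that difference in isolation. It is an illustration under assumptions, not the actual glGet checking code: both function names and the two boolean parameters are hypothetical.

```c
#include <stdbool.h>

/* Behavior described as the bug: the getter is accepted when either
 * feature is available. */
static bool gs_ubo_getter_allowed_either(bool has_geometry_shader,
                                         bool has_uniform_buffers)
{
    return has_geometry_shader || has_uniform_buffers;
}

/* Behavior described as the fix: the getter is accepted only when
 * both features are available. */
static bool gs_ubo_getter_allowed_both(bool has_geometry_shader,
                                       bool has_uniform_buffers)
{
    return has_geometry_shader && has_uniform_buffers;
}
```

The two functions disagree exactly when one feature is present without the other, which is the case the commit message says was handled incorrectly.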

56381 commits (bdf13dc8324c391b7d34f8bdaea72c4452ab7edb)