[Intel-gfx] [ANNOUNCE] xf86-video-intel 2.16.901

Sun Oct 30 17:57:23 CET 2011

This is the first release candidate in preparation for the upcoming
2.17.0 release. We will appreciate any feedback we can get from
testing of this snapshot.

The basis of this release are to push out a couple of critical stability
fixes for SandyBridge and IvyBridge. There is very little else to see
here. Those 200+ other commits do not exist...
-Chris

Bugs fixed in this snapshot (compared to 2.16.0)
--------------------------------------------------

 * Workaround blitter hang on SandyBridge and IvyBridge
   https://bugzilla.kernel.org/show_bug.cgi?id=27892
   https://bugs.freedesktop.org/show_bug.cgi

 * Workaround pipe control issues on SandyBridge

 * Use correct maximum PS thread count on IvyBridge

 * Protect against failed pixmap allocation for XV
   https://bugs.freedesktop.org/show_bug.cgi?id=40439

Complete list of changes since 2.16.0
-------------------------------------

Chris Wilson (252):
      sna/damage: Take advantage of marking all-damaged
      sna/gradient: Use a high-precision ramp for a color step rather than fallback
      sna/trapezoids: Speedup tor rasteriser
      sna/render: allow CLAMP_TO_EDGE for outside samples of extract regions as well
      sna/gen3: reset blend state after applying CA pass
      sna/display: Destroy shadow data
      sna/dri2: Add some debug around the use of the Resource database
      sna: Clear structures across server reset
      sna/video: Flush the video state at the end of the operation
      sna/accel: Simplify single pixel read-back
      sna/trapezoids: Reduce imprecise sampling to 4x4
      sna/trapezoids: Use the tor scan converter to compute the low precision mask
      video: check that the pixmap exists before use
      sna/video: Defend against PutImage to a broken screen
      sna: Cleanup up the cache upon close
      sna: Free the buffers immediately upon release
      sna: Retain the GTT space used for an upload buffer
      sna/trapezoids: Allocate sufficient space for a8 mask for mono traps
      sna/trapezoids: Reject invalid traps after projecting onto the sample grid
      sna/trapezoids: Refactor to project the trapezoid only once
      sna/gen6: Prefer the BLT ring, except for copies on behalf of DRI
      sna: Use the shadow buffer for PutImage
      sna/trapezoids: Edges may lie out of bounds
      sna: Add missing implementation for Triangles
      sna/trapezoids: Check for alignment after projection
      sna/gen2: Improve batch decoder.
      sna/gen2: Flush the batch when we run out of vertex space
      sna/gen3: Do not assume video updates are always vsync'ed
      sna/traps: Use the trapezoid path for AddTraps
      sna/trapezoids: Fast upload path for gpu busy bo
      sna/trapezoids: Hook up Imprecise AddTraps in lieu of spans
      sna: Record git-tree used for compilation
      sna: Prefer memcpy_blt over fbBlt
      sna/gen5: Prefer BLT for solids
      sna/gen3: Use a clear pattern for ill-defined radial gradients
      sna: Mark the bo as reusable after extracting the handle from the buffer
      gitignore: add git_version.h
      sna/trapezoids: Prevent a memleak if feed lots of degenerate boxes
      sna: Fix compilation without timerfd()
      sna: Provide a direct lookup cache of alpha values
      sna: Use single precision for computing the texcoord scale factor
      sna/gen6: Prefer RENDER for copies as it compacts better
      sna: Avoid the call overhead for the trivial clip case
      sna/gen5: Avoid bitfields for simple assignments
      sna/gen5: Fix debug compilation
      sna/accel: Use correct enum for requesting a write-buffer
      sna: Don't clear the kernel-needs-flush flag if we emit a MI_FLUSH
      sna: Reuse the smallest bo that fits the request
      sna/trapezoids: Implement a mono-rasteriser to skip opacity spans
      sna/video: Stop advertising unsupported Xv attributes
      video: Stop advertising unsupported Xv attributes
      sna: Add a couple of debug statements for why we may not flush the scanout
      sna/debug: Assert that BLT commands on gen2/3 are properly fenced
      sna: Clear the list of buffers upon server regen
      sna/gen3: Reset the shader after playing video
      sna: assert that the memcpy upload path points to valid regions
      sna: Yet another s/x/y/ typo
      sna/trapezoids: Amalgamate adjacent mono spans
      configure: Introduce --with-builderstring
      sna: Remove bad assert
      sna: Debug option to disable tiling
      sna: Can't free bo from the active list
      sna/glyphs: Correct offset of fallback-via-mask glyphs
      sna/glyphs: Disable the glyph cache for debugging
      sna: Add debug option to force allocaiton of gpu-only pixmaps
      sna/trapezoids: Ensure we do not overrun the vertical edges for mono
      sna: Apply the screen offset for direct upload of a composited pixmap
      sna: perform a warnings reduction pass
      sna: compile fixes for debugging
      sna/trapezoids: Fix overflow during sorting of mono edge step
      sna: Move an assertion to the point of import
      sna: Use pot buckets for active cache
      sna/dri: Play with marking bo reusable
      sna/dri: Bump DRI2INFOREC_VERSION
      sna: move an assert
      sna/accel: fix assert to include the offset of copy
      sna/accel: Only skip undamaging the GPU for reads
      sna/accel: Always subtract the enlarged region from the outstanding GPU damage
      sna/accel: Use the PolyFillRect to handle tiled spans
      sna: Add a debug option to disable caching
      sna/glyphs: Convert all sub-8bpp masks to a8
      sna/gen5: Debug option to disable state caching
      sna: Paranoid debug flush after every op (as well as before)
      sna/kgem: Check all operation bo in a single amalgamation
      sna/glyph: Avoid useless attempt at GPU glyph rendering to a1 destinations
      sna/glyphs: Add glyphs directly onto a client temporary buffer
      sna/accel: Debug option to force CPU fallbacks
      sna/accel: Fix s/x/y/ typo in computing relative drawing coordinates
      sna/gen7: Fix up a couple instances of my inability to count
      sna/gen7: Confusion reigns as trying to fix errors found by an outdated checker
      sna: Check for request retires after every batch
      sna/gen6: Fix offset of Scan-Line-Compare register
      sna/io: Only mark the buffer as LAST if we know we will flush the IO
      sna: Sort partials by remaining space
      sna/accel: Use the mi*Line routines to convert the line into spans for gpu bo
      sna/accel: Convert segments into spans similarly to PolyLine
      sna/accel: Use miPolyArc to convert arcs into spans for gpu bo
      sna/accel: Don't attempt converting to spans if we will only fallback
      sna/accel: Micro-optimise sna_fill_spans_blt
      sna/accel: Only disable the flush migitration if nothing changed
      sna/accel: Add an compile option to disable use of spans
      sna/accel: Add a compile option to force flushing on every blockhandler
      sna/accel: Only throttle after flushing
      sna/accel: Correct syntax for constifying BoxPtr
      sna/blt: Use SETUP_MONO to reduce the number of fill relocations
      sna/blt: SETUP_BLT needs 9 dwords of batch space, not 3!
      sna/blt: Suppress repeated SETUP_BLT
      sna: Move the source to the GPU if it is reused
      sna/gen[23]: Fix compilation with debugging enabled
      sna: Increase reserved space in batch to accommodate gen5 workaround
      sna: Use BLT operations to avoid fallbacks in core glyph rendering
      sna: Support a fast composite hook for solitary boxes
      sna/gen6: Add render support for fill-one-box
      sna/gen7: Add render support for fill one
      sna/gen3: avoid applying zero offset to common spans
      sna/trapezoid: Perform the NULL check for damage in the caller
      sna: Mark the spans render functions as fastcall
      sna: use correct insertion point for sorting partials
      sna/accel: If the data is already on the GPU, use it for the source
      sna: Enable single fill fast path for PolySegment
      sna/gen3: Avoid RENDER/BLT context switch for fill boxes
      sna: Micro-optimise checking for singular clip boxes
      sna: More micro-optimisation of messing around with clip regions
      sna: And free the clip after creation for PolySegments
      sna: compare against the right array of cache buckets
      Remove vestigial includes from DRI1
      sna/gen2: Trim some surplus state commands
      sna/blt: Check for reloc overflow when initialising solid fills
      sna/accel: Check for reloc overflow when adding a new glyph run
      sna/accel: Correctly offset the damage intersection for glyph runs
      sna/accel: Actually apply the clip to the glyph extents
      sna/accel: Fix sense of nothing-to-do due to clipping
      sna: Ensure that the scanout is flushed before we sleep
      sna/tiling: Hook up composite_box
      sna/trapezoids: Fallback if the composite operation is not supported
      sna/blt: Add a pair of operator reductions for opaque fills
      Remove a couple of trivial compile warnings for unused variables
      sna/gen7: Add missing tail to 3DSTATE_SBE
      sna/gen2: Fix fill-one-box
      sna/gen2: Eliminate redundant diffuse and rectlist emission
      sna/accel: Fall-forward for handling a non-copy CopyArea to a dst gpu bo
      sna/accel: Upload source to GPU for !copy CopyAreas
      sna: Simplify busy tracking by trusting the bo->gpu flag
      sna: Prefer to accelerate non-RENDER operations if already using the GPU
      sna: Defer CPU fallback if fb calls into mi
      configure: Add a check for pixman version
      sna/gen3: Improve reduction of render operator to blt
      sna/gen3: Store floats_per_rect alongside floats_per_vertex
      sna/gen3: Micro-optimise gen3_rectangle_begin()
      snb,ivb: Workaround unknown blitter death
      sna/damage: Clear damage->all when subtracting
      sna/gen3: Remove memset(0) from fill_boxes
      sna: Improve debug output for mi/fb fallbacks
      sna/accel: Implement a simpler path for CopyArea between the same pixmaps
      sna: Fast path common colour conversions
      sna/gen5: Rearrange the BLT->RENDER workaround
      sna/gen3: More removal of memset(0)
      sna/gen3: Use immediates for black/white solid sources
      sna: Only retire the flush handler if nothing was submitted within the period
      sna: Some more debug output for request/bo retirement
      sna/gen3: Prefer to use the BLT to clear the scratch glyph pixmaps
      sna/gen3: Prefer to use the BLT where possible for composite ops
      sna/gen2: Prefer to use the BLT
      sna/gen2: Precompute floats-per-rect
      sna/gen6: Micro-optimise gen6_rectangle_begin
      sna/gen6: Try continuing with the BLT if the last batch was also BLT
      sna/gen5: Add fill-one implementation
      sna/gen6: Precompute floats_per_rect
      sna/gen4: Add fill-one
      sna/blt: Move the conditional out of the fill-many hotpath
      sna/gen6: Don't modify composite arguments whilst checking for BLT
      sna: Micro-optimise fill-spans
      sna: Use the unlikely wedged() throughout
      sna: Compute region extents in place to eliminate redundant stack space
      sna/gen6: Apply the unknown blitter death workaround
      sna/damage: Avoid reducing the damage for simple tests
      sna: Enlarge the minimum pixmap size to migrate for Render
      sna: Don't rewrite the solid colour cache if it hasn't changed
      sna: Reset the ring flag upon idling
      sna: The initial aperture check for a set of bo is unlikely to fail
      sna/blt: Use SCANLINE_BLT for multiple fill boxes
      sna: Clean up some debug messages for 64bit
      sna: Add some debug to discern the nature of the Cr fallback
      sna: Reuse any partial write buffer for readback
      sna/dri: Perform a little dance to restore the gpu status on release of the bo
      sna: Actually apply the composite offset for the self-copy
      sna/damage: Only track the mode globally
      sna/damage: Stash the last freed damage for quick reallocation
      sna: Speed-up blitting of unclipped lines
      sna: Check whether we can blt whilst determining the PolyLine extents
      sna: Pass a BoxRec to the fill op
      sna: Remove the memset(0) of the fill op
      sna: Micro-optimise PolyLine blt
      sna: Short-circuit GetWindowPixmap() to speed-up pixmap retrieval
      sna: Fast-path unclipped fill spans
      sna: Optimise reduce_damage() to handle all-damaged pixmaps
      sna: Fast path for unclipped rectangles
      sna: Fast path unclipped points
      sna: Treat a bo marked as being last seen on the GPU as requiring a flush
      sna: Fix debug compilation
      sna: Execute blits directly for PolyRectangle
      sna/blt: Upload the box using a single 64-bit instruction
      sna: Flatten the branching for fill-spans
      sna: Convert diagonal zero-width lines into blits
      self-copy-damage
      sna: Use the new fill->boxes for PolyRectangle
      sna: Use fill->boxes to unroll PolyPoint
      sna: Use fill->boxes to unroll FillSpans
      sna: Inline box_intersect()
      sna/blt: Share the common fill->boxes between blt and render paths
      sna/blt: Rename the composite paths
      sna: Handle degenerate copy area whilst wedged
      sna: Fix debug compilation, again.
      sna/composite: Discard opaque masks
      sna/gen2: Cache the last solid colour for spans
      sna: Suppress an overwritten fill
      sna/blt: Convert Copy with color 0 into a Clear
      sna/blt: After submitting the batch, it will be empty
      sna: Further improve use of fill->boxes for PolyRectangle
      sna/io: Update batch mode after submitting partial requests
      sna: Handle singular clip regions for PolyRectangles
      sna: Clip PolyLines to the current clip box
      sna: Use fill->boxes for rectilinear blits
      sna: Use fill->boxes to pack PolyFillRectangle
      sna: Fix PolyPoint not to always send 512 boxes
      sna: Fix advance through clip boxes for fill->boxes
      sna: Tweak zero line spans to include missing last segment
      sna: Create scratch pixmap on the CPU
      sna: Use private identifier for internal scratch pixmaps
      sna: Fast path typically unclipped tiled rects
      sna/gen2: Reset BLT registers across 2D/3D context switches
      sna: Faster unclipped spans
      sna: Call directly into tiled rects for tiled spans.
      sna: Faster unclipped PolyPoint
      sna: Faster unclipped PolyFillRect
      sna: Call miZeroArcLine where appropriate for GPU paths
      sna: Faster unclipped rectilinear segments
      sna: Don't flush the render caches if in the process of writing again
      sna: Retire and search inactive again before creating new buffer
      sna: Increment clip box when short-circuiting the iteration
      sna: Further clipping bugs
      sna: Tweak PolyRectangle blitting
      sna/genX: Improve reduction of Render operator to BLT alu
      sna/gen3: Set the src flags for fill
      sna/composite: Fix incorrect operator reduction for RenderFillRectangles
      Revert "sna: Don't flush the render caches if in the process of writing again"
      sna: Push overflow detection to the end of extents
      sna: Rewrite box32_add_rect() to enable int32_t promotion
      sna: Fix compilation
      sna: Convert PolyRectangles to use Box32
      NEWS: Add release notes for 2.16.901
      configure: Bump version to 2.16.901

Daniel Vetter (5):
      snb: implement PIPE_CONTROL workaround
      i810: kill dead i830 defines
      i810: rip out PIO access functions
      i810: use vgahw functions instead of roll-your-own
      i810: kill mmio vga access helpers

Jeremy Huddleston (1):
      Use malloc/calloc/realloc/free directly

Kenneth Graunke (1):
      Fix incorrect maximum PS thread count on IvyBridge

Paulo Zanoni (2):
      Fix "always false" conditionals
      Remove useless assertion

Stefan Dirsch (1):
      Fix array size calculation for intel_pci_probe().

git tag: 2.16.901

http://xorg.freedesktop.org/archive/individual/driver/xf86-video-intel-2.16.901.tar.bz2
MD5:  ae2c2e7493b1e8caa4926b4b7692fd6c  xf86-video-intel-2.16.901.tar.bz2
SHA1: 2ff7b124029f25b67394e98c72e9a9ecc618f55e  xf86-video-intel-2.16.901.tar.bz2

http://xorg.freedesktop.org/archive/individual/driver/xf86-video-intel-2.16.901.tar.gz
MD5:  31a673a7eec88f612b2e74c1196637bc  xf86-video-intel-2.16.901.tar.gz
SHA1: 9db74ef6246e8a59d95847a0b7387a7b150c2ef3  xf86-video-intel-2.16.901.tar.gz

-- 
Chris Wilson, Intel Open Source Technology Centre
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 198 bytes
Desc: Digital signature
URL: <http://lists.freedesktop.org/archives/intel-gfx/attachments/20111030/9be3dc14/attachment.sig>