[Intel-gfx] Question regarding forcewake in i915
Dave Gordon
david.s.gordon at intel.com
Tue Jan 6 07:19:12 PST 2015
On 22/12/14 12:26, 유재용 wrote:
> Hello intel-gfx,
>
> I'm reading i915 gpu drivers and find myself quite hard to understand
> about forcewake concepts.
>
> I understand that it is something with the energy efficiency so related
> to ACPI. And it looks like forcewake is working as a pair (get and put).
> In the "get" part, what it first does it waiting on FORCEWAKE_ACK_HSW
> register (in case of haswell).
> And then, it writes something to FORCEWAKE_MT register, read from ECOBUS.
> And again, it waits on FORCEWAKE_ACK_HSW again!
> It becomes more confusing when it comes to put.
> In the "put" part, what it does it writing to FORCEWAKE_MT register and
> read from ECOBUS.
>
> I tried to find some good reading materials about this forcewake, but
> what I found was a series of patches in this mailing list. (which are
> quite hard to follow from the begining)
> Could you explain about the concept of FORCEWAKE and possibly the magic
> tricks on these get and put?
>
> Thanks,
> Jaeyong
Hi Jaeyong,
FORCEWAKE details vary a little from one chip to another, so this is
only a general description, but essentially setting one or more bits in
the FORCEWAKE register(s) prevents some or all of the power domains from
going into the deeper idle (sleep) states (and forces them out of the
sleep state if they're already asleep). Clearing the bit(s) allows the
affected parts to go to sleep again.
The FORCEWAKE_ACK register(s) contain one or more bits which reflect the
internal state, and so acknowledge that the most recent write to the
corresponding FORCEWAKE register has been accepted and acted upon. It
can take a while for a portion of the chip to wake up, so after setting
a FORCEWAKE bit we have to spin-wait until it's taken effect.
So, the general algorithm for accessing some part of the chip that may
be asleep is:
1) set the relevant bit of (a) FORCEWAKE register
2) poll (matching) FORCEWAKE_ACK until the write is acknowledged
3) access the chip (this can encompass several reads and writes)
4) clear the FORCEWAKE bit that we set earlier
5) poll FORCEWAKE_ACK again until this write is acknowledged
Now for extra confusion, there are a few more details:
* because reads and writes can in some cases be reordered, we
need to force the write to FORCEWAKE to complete before the
busy-polling of FORCEWAKE_ACK. This is the sole purpose of the
read of the ECOBUS register, which is used just because it
happens to lie in the same cacheline as FORCEWAKE.
* we can choose not to poll for FORCEWAKE_ACK clear in step (5).
Instead, we can just leave the chip to go back to sleep while
we get on with other things. But in that case, we might come
back and try to wake the chip again before it's finished
responding to the write in step (4). So if we don't poll at
the end of the sequence, we have to poll at the beginning
instead; in other words, move step (5) to before step (1).
IIRC, gen6 had a single FORCEWAKE register containing a single effective
bit, gen7 has a single register containing multiple bits (so that they
can be controlled by different agents) which are OR-ed together to
produce a combined wakeup signal (this also applies to HSW, although the
FORCEWAKE_ACK is in a different place from earlier chips); and VLV has
multiple registers for different power domains (e.g. MEDIA vs RENDER).
Hope this helps!
Dave
More information about the Intel-gfx
mailing list