How do I interpret the output of intel-gpu-top and intel-gpu-overlay?

9,357

Taken from the link given in the comments in OP.

I was curious as well, so here are just a few things I could grab from the reference manuals. Also of interest is the intel-gpu-tools source, and especially lib/instdone.c which describes what can appear in all Intel GPU models. This patch was also hugely helpful in translating all those acronyms!

Some may be wrong, I'd love it if somebody more knowledgeable could chime in! I'll come back to update the answer with more as I learn this stuff.

First, the three lines on the right :

  • The render space is probably used by regular 3D operations.
  • From googling, bitstream seems to be about audio decoding? This is quite a generic term, so hard to find with a query. It does not appear on my GPU though (Skylake HD 530), so it might not be everywhere.
  • The blitter is described in vol. 11 and seems responsible for hardware acceleration of 2D operations (blitting).

Fixed function (FF) pipeline units (old-school GPU features) :

  • VF: Vertex Fetcher (vol. 1), the first FF unit in the 3D Pipeline responsible for fetching vertex data from memory.
  • VS: Vertex Shader (vol.1), computes things on the vertices of each primitive drawn by the GPU. Pretty standard operation on GPUs.
  • HS: Hull Shader
  • TE: Tessellation Engine
  • DS: Domain Shader
  • GS: Geometry Shader
  • SOL: Stream Output Logic
  • CL: Clip Unit
  • SF: Strips and Fans (vol.1), FF unit whose main function is to decompose primitive topologies such as strips and fans into primitives or objects.

Units used for thread and pipeline management, for both FF units and GPGPU (see Intel Open Source HD Graphics Programmers Manual for a lot of info on how this all works) :

  • CS: Command Streamer (vol.1), functional unit of the Graphics Processing Engine that fetches commands, parses them, and routes them to the appropriate pipeline.
  • TDG: Thread Dispatcher
  • VFE: Video Front-End
  • TSG: Thread Spawner
  • URBM: Unified Return Buffer Manager

Other stuff :

  • GAM: see GFX Page Walker (vol. 5), also called Memory Arbiter, has to do with how the GPU keeps track of its memory pages, seems quite similar to what the TLB (see also SLAT) does for your RAM.
  • SDE: South Display Engine ; according to vol. 12, "the South Display Engine supports Hot Plug Detection, GPIO, GMBUS, Panel Power Sequencing, and Backlight Modulation".

Credits:

StackOverflow User F.X.

Share:
9,357

Related videos on Youtube

mpr
Author by

mpr

Programmer

Updated on September 18, 2022

Comments

  • mpr
    mpr over 1 year

    I have these commands compiled and running but their contents are a bit of a mystery to me.

    The processes from intel-gpu-overlay read something like: 15R, 16B, 41ms waits. What is an R, what is a B, what does that wait time indicate?

    It has CPU: 152% (I'd guess this is the same as what I get from top). render: 32%, bitstream: 6%, blt: 6%. What kinds of code would cause these values to bottle neck and what would be the behavior of the system when they did?

    Here is a sample of intel-gpu-top:

                   render busy:  23%: ████▋                                  render space: 12/16384
    
                          task  percent busy
                           GAM:  29%: █████▉                  vert fetch: 1380772913 (5386667/sec)
                            CS:  23%: ████▋                   prim fetch: 350972637 (1368891/sec)
                          GAFS:   9%: █▉                   VS invocations: 1375586768 (5385212/sec)
                           TSG:   8%: █▋                   GS invocations: 0 (0/sec)
                           VFE:   7%: █▌                        GS prims: 0 (0/sec)
                           SVG:   3%: ▋                    CL invocations: 677098924 (2648400/sec)
                            VS:   3%: ▋                         CL prims: 682224019 (2663834/sec)
                          URBM:   2%: ▌                    PS invocations: 9708568482932 (34396218804/sec)
                            VF:   2%: ▌                    PS depth pass: 15549624948405 (58732230331/sec)
                           SDE:   0%:                      
                            CL:   0%:                      
                            SF:   0%:                      
                           TDG:   0%:                      
                            RS:   0%:                      
                          GAFM:   0%:                      
                           SOL:   0%:
    
    • mpr
      mpr about 7 years
      Here is a pretty good answer on stackoverflow, may just close this: stackoverflow.com/questions/28876242/…
    • George Udosen
      George Udosen over 6 years
      Why would you close this, are here not worthy of such a greate answer?
  • Sahil Chaudhary
    Sahil Chaudhary about 3 years