Skip to content

Commit 5d712a8

Browse files
jmentiVeeruPrudhvi
authored andcommitted
fix(linux): Fix headers and whitespace in AM62P performance rst
The performance guides use incorrect header hierarchy, fix header hierarchy and whitespace to establish an easier to read/parse html. Signed-off-by: Judith Mendez <jm@ti.com>
1 parent 8e30b50 commit 5d712a8

2 files changed

Lines changed: 94 additions & 73 deletions

File tree

source/devices/AM62PX/linux/Linux_Performance_Guide.rst

Lines changed: 79 additions & 63 deletions
Original file line numberDiff line numberDiff line change
@@ -1,10 +1,10 @@
1+
#################################
2+
Linux 12.00.00 Performance Guide
3+
#################################
14

2-
===================================
3-
Linux 12.00.00 Performance Guide
4-
===================================
5-
6-
.. rubric:: **Read This First**
7-
:name: read-this-first-kernel-perf-guide
5+
***************
6+
Read This First
7+
***************
88

99
**All performance numbers provided in this document are gathered using
1010
following Evaluation Modules unless otherwise specified.**
@@ -17,26 +17,30 @@ following Evaluation Modules unless otherwise specified.**
1717

1818
Table: Evaluation Modules
1919

20-
.. rubric:: About This Manual
21-
:name: about-this-manual-kernel-perf-guide
20+
*****************
21+
About This Manual
22+
*****************
2223

2324
This document provides performance data for each of the device drivers
2425
which are part of the Processor SDK Linux package. This document should be
2526
used in conjunction with release notes and user guides provided with the
2627
Processor SDK Linux package for information on specific issues present
2728
with drivers included in a particular release.
2829

29-
.. rubric:: If You Need Assistance
30-
:name: if-you-need-assistance-kernel-perf-guide
31-
3230
For further information or to report any problems, contact
3331
https://e2e.ti.com/ or https://support.ti.com/
3432

33+
|
34+
35+
*****************
3536
System Benchmarks
36-
-----------------
37+
*****************
38+
39+
|
3740
3841
LMBench
39-
^^^^^^^
42+
=======
43+
4044
LMBench is a collection of microbenchmarks of which the memory bandwidth
4145
and latency related ones are typically used to estimate processor
4246
memory system performance. More information about lmbench at
@@ -180,7 +184,7 @@ Execute the LMBench with the following:
180184
"tcp_latency_using_localhost (microsec)","0.92 (min 0.76, max 1.14)"
181185

182186
Dhrystone
183-
^^^^^^^^^
187+
=========
184188
Dhrystone is a core only benchmark that runs from warm L1 caches in all
185189
modern processors. It scales linearly with clock speed.
186190

@@ -202,7 +206,8 @@ Execute the benchmark with the following:
202206
"dhrystone_per_second (dhrystonep)","6323683.00 (min 5882353.00, max 6896551.50)"
203207

204208
Whetstone
205-
^^^^^^^^^
209+
=========
210+
206211
Whetstone is a benchmark primarily measuring floating-point arithmetic performance.
207212

208213
Execute the benchmark with the following:
@@ -217,7 +222,8 @@ Execute the benchmark with the following:
217222
"whetstone (mips)","5000.00"
218223

219224
Linpack
220-
^^^^^^^
225+
=======
226+
221227
Linpack measures peak double precision (64 bit) floating point performance in
222228
solving a dense linear system.
223229

@@ -227,7 +233,8 @@ solving a dense linear system.
227233
"linpack (kflops)","559402.33 (min 518395.00, max 580053.00)"
228234

229235
NBench
230-
^^^^^^
236+
======
237+
231238
NBench which stands for Native Benchmark is used to measure macro benchmarks
232239
for commonly used operations such as sorting and analysis algorithms.
233240
More information about NBench at
@@ -248,7 +255,8 @@ https://nbench.io/articles/index.html
248255
"string_sort (iterations)","157.42 (min 150.17, max 168.31)"
249256

250257
Stream
251-
^^^^^^
258+
======
259+
252260
STREAM is a microbenchmark for measuring data memory system performance without
253261
any data reuse. It is designed to miss on caches and exercise data prefetcher
254262
and speculative accesses.
@@ -274,7 +282,8 @@ Execute the benchmark with the following:
274282
"triad (mb/s)","2328.48 (min 2274.80, max 2413.50)"
275283

276284
CoreMarkPro
277-
^^^^^^^^^^^
285+
===========
286+
278287
CoreMark®-Pro is a comprehensive, advanced processor benchmark that works with
279288
and enhances the market-proven industry-standard EEMBC CoreMark® benchmark.
280289
While CoreMark stresses the CPU pipeline, CoreMark-Pro tests the entire processor,
@@ -310,7 +319,8 @@ and floating-point workloads, and data sets for utilizing larger memory subsyste
310319
"zip-test (workloads/)","41.87 (min 38.46, max 46.51)"
311320

312321
MultiBench
313-
^^^^^^^^^^
322+
==========
323+
314324
MultiBench™ is a suite of benchmarks that allows processor and system designers to
315325
analyze, test, and improve multicore processors. It uses three forms of concurrency:
316326
Data decomposition: multiple threads cooperating on achieving a unified goal and
@@ -357,11 +367,13 @@ thread-enabled workloads to be tested.
357367
"x264-4mq (workloads/)","0.54 (min 0.51, max 0.58)"
358368
"x264-4mqw1 (workloads/)","0.54 (min 0.52, max 0.58)"
359369

370+
|
371+
360372
Boot-time Measurement
361-
---------------------
373+
=====================
362374

363375
Boot media: MMCSD
364-
^^^^^^^^^^^^^^^^^
376+
-----------------
365377

366378
.. csv-table:: Linux boot time MMCSD
367379
:header: "Boot Configuration","am62pxx_sk-fs: Boot time in seconds: avg(min,max)"
@@ -372,8 +384,8 @@ Boot time numbers [avg, min, max] are measured from "Starting kernel" to Linux p
372384

373385
|
374386
375-
ALSA SoC Audio Driver
376-
---------------------
387+
ALSA SoC Audio
388+
==============
377389

378390
#. Access type - RW\_INTERLEAVED
379391
#. Channels - 2
@@ -408,11 +420,12 @@ ALSA SoC Audio Driver
408420

409421
|
410422
411-
Graphics SGX/RGX Driver
412-
-----------------------
423+
Graphics SGX/RGX
424+
================
413425

414426
GFXBench
415-
^^^^^^^^
427+
--------
428+
416429
Run GFXBench and capture performance reported (Score and Display rate in fps). All display outputs (HDMI, Displayport and/or LCD) are connected when running these tests
417430

418431
.. csv-table:: GFXBench Performance
@@ -423,7 +436,7 @@ Run GFXBench and capture performance reported (Score and Display rate in fps). A
423436
" GFXBench 5.x gl_5_high_off","108.08 (min 104.61, max 115.03)","1.68 (min 1.63, max 1.79)"
424437

425438
Glmark2
426-
^^^^^^^
439+
-------
427440

428441
Run Glmark2 and capture performance reported (Score). All display outputs (HDMI, Displayport and/or LCD) are connected when running these tests
429442

@@ -436,7 +449,8 @@ Run Glmark2 and capture performance reported (Score). All display outputs (HDMI,
436449
|
437450
438451
Ethernet
439-
--------
452+
========
453+
440454
Ethernet performance benchmarks were measured using :command:`netperf` 2.7.1 https://hewlettpackard.github.io/netperf/doc/netperf.html
441455
Test procedures were modeled after those defined in RFC-2544:
442456
https://tools.ietf.org/html/rfc2544, where the DUT is the TI device
@@ -502,29 +516,29 @@ Running the following commands will trigger :command:`netperf` clients to measur
502516
netperf -H <DUT ip> -j -C -l 60 -t UDP_STREAM -b <burst_size> -w <wait_time> -- -m <UDP datagram size>
503517
-k DIRECTION,THROUGHPUT,MEAN_LATENCY,LOCAL_CPU_UTIL,REMOTE_CPU_UTIL,LOCAL_BYTES_SENT,REMOTE_BYTES_RECVD,LOCAL_SEND_SIZE
504518
505-
CPSW/CPSW2g/CPSW3g Ethernet Driver
506-
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
519+
CPSW/CPSW2g/CPSW3g Ethernet
520+
---------------------------
507521

508-
- CPSW3g: AM64x, AM62x, AM62ax, AM62px
522+
- CPSW3g: AM62px
509523

510-
.. rubric:: TCP Bidirectional Throughput
511-
:name: CPSW2g-tcp-bidirectional-throughput
524+
TCP Bidirectional Throughput
525+
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
512526

513527
.. csv-table:: CPSW2g TCP Bidirectional Throughput
514528
:header: "Command Used","am62pxx_sk-fs: THROUGHPUT (Mbits/sec)","am62pxx_sk-fs: CPU Load % (LOCAL_CPU_UTIL)"
515529

516530
"netperf -H 192.168.0.1 -j -c -C -l 60 -t TCP_STREAM; netperf -H 192.168.0.1 -j -c -C -l 60 -t TCP_MAERTS","1553.15 (min 1176.25, max 1811.29)","54.49 (min 39.05, max 70.68)"
517531

518-
.. rubric:: TCP Bidirectional Throughput Interrupt Pacing
519-
:name: CPSW2g-tcp-bidirectional-throughput-interrupt-pacing
532+
TCP Bidirectional Throughput Interrupt Pacing
533+
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
520534

521535
.. csv-table:: CPSW2g TCP Bidirectional Throughput Interrupt Pacing
522536
:header: "Command Used","am62pxx_sk-fs: THROUGHPUT (Mbits/sec)","am62pxx_sk-fs: CPU Load % (LOCAL_CPU_UTIL)"
523537

524538
"netperf -H 192.168.0.1 -j -c -C -l 60 -t TCP_STREAM; netperf -H 192.168.0.1 -j -c -C -l 60 -t TCP_MAERTS","1849.66 (min 1825.39, max 1875.16)","38.86 (min 32.01, max 45.30)"
525539

526-
.. rubric:: UDP Throughput
527-
:name: CPSW2g-udp-throughput-0-loss
540+
UDP Throughput
541+
^^^^^^^^^^^^^^
528542

529543
.. csv-table:: CPSW2g UDP Egress Throughput 0 loss
530544
:header: "Frame Size(bytes)","am62pxx_sk-fs: UDP Datagram Size(bytes) (LOCAL_SEND_SIZE)","am62pxx_sk-fs: THROUGHPUT (Mbits/sec)","am62pxx_sk-fs: Packets Per Second (kPPS)","am62pxx_sk-fs: CPU Load % (LOCAL_CPU_UTIL)"
@@ -547,14 +561,11 @@ CPSW/CPSW2g/CPSW3g Ethernet Driver
547561

548562
|
549563
550-
Linux OSPI Flash Driver
551-
-----------------------
564+
OSPI
565+
====
552566

553-
.. rubric:: AM62PXX-SK
554-
:name: am62pxx-sk-ospi
555-
556-
.. rubric:: UBIFS
557-
:name: am62pxx-sk-ospi-ubifs
567+
UBIFS
568+
-----
558569

559570
.. csv-table:: OSPI Flash Driver
560571
:header: "Buffer size (bytes)","am62pxx_sk-fs: Write UBIFS Throughput (Mbytes/sec)","am62pxx_sk-fs: Write UBIFS CPU Load (%)","am62pxx_sk-fs: Read UBIFS Throughput (Mbytes/sec)","am62pxx_sk-fs: Read UBIFS CPU Load (%)"
@@ -564,16 +575,19 @@ Linux OSPI Flash Driver
564575
"524288","0.13 (min 0.10, max 0.18)","29.15 (min 25.26, max 32.64)","27.37 (min 27.03, max 27.85)","6.63 (min 3.45, max 9.68)"
565576
"1048576","0.13 (min 0.10, max 0.18)","30.02 (min 27.22, max 33.36)","27.16 (min 26.74, max 27.82)","9.50 (min 3.33, max 12.90)"
566577

567-
.. rubric:: RAW
568-
:name: am62pxx-sk-ospi-raw
578+
OSPI RAW
579+
--------
569580

570581
.. csv-table:: OSPI Raw Flash Driver
571582
:header: "File size (Mbytes)","am62pxx_sk-fs: Raw Read Throughput (Mbytes/sec)"
572583

573584
"50","37.88"
574585

575-
EMMC Driver
576-
-----------
586+
|
587+
588+
EMMC
589+
====
590+
577591
.. warning::
578592

579593
**IMPORTANT**: The performance numbers can be severely affected if the media is
@@ -583,7 +597,7 @@ EMMC Driver
583597
re-mount in async mode.
584598

585599
EMMC EXT4 FIO 1G
586-
^^^^^^^^^^^^^^^^
600+
----------------
587601

588602
.. csv-table:: EMMC EXT4 FIO 1G
589603
:header: "Buffer size (bytes)","am62pxx_sk-fs: Write EXT4 Throughput (Mbytes/sec)","am62pxx_sk-fs: Write EXT4 CPU Load (%)","am62pxx_sk-fs: Read EXT4 Throughput (Mbytes/sec)","am62pxx_sk-fs: Read EXT4 CPU Load (%)"
@@ -594,7 +608,7 @@ EMMC EXT4 FIO 1G
594608
"256k","91.10 (min 90.60, max 91.70)","2.08 (min 1.91, max 2.26)","267.80 (min 173.00, max 294.00)","4.07 (min 2.71, max 4.75)"
595609

596610
EMMC EXT4
597-
^^^^^^^^^
611+
---------
598612

599613
.. csv-table:: EMMC EXT4
600614
:header: "Buffer size (bytes)","am62pxx_sk-fs: Write EXT4 Throughput (Mbytes/sec)","am62pxx_sk-fs: Write EXT4 CPU Load (%)","am62pxx_sk-fs: Read EXT4 Throughput (Mbytes/sec)","am62pxx_sk-fs: Read EXT4 CPU Load (%)"
@@ -606,7 +620,7 @@ EMMC EXT4
606620
"5242880","75.53 (min 47.37, max 95.20)","2.43 (min 1.12, max 4.32)","181.68 (min 181.18, max 182.02)","5.64 (min 5.19, max 6.49)"
607621

608622
EMMC VFAT
609-
^^^^^^^^^
623+
---------
610624

611625
.. csv-table:: EMMC VFAT
612626
:header: "Buffer size (bytes)","am62pxx_sk-fs: Write VFAT Throughput (Mbytes/sec)","am62pxx_sk-fs: Write VFAT CPU Load (%)","am62pxx_sk-fs: Read VFAT Throughput (Mbytes/sec)","am62pxx_sk-fs: Read VFAT CPU Load (%)"
@@ -617,8 +631,8 @@ EMMC VFAT
617631
"1048576","52.85 (min 12.24, max 75.55)","5.12 (min 3.58, max 6.94)","262.95 (min 176.25, max 285.29)","14.56 (min 8.05, max 17.57)"
618632
"5242880","55.67 (min 12.34, max 82.40)","5.28 (min 3.61, max 7.06)","262.72 (min 175.95, max 284.89)","14.16 (min 9.62, max 16.44)"
619633

620-
UBoot EMMC Driver
621-
-----------------
634+
UBoot EMMC
635+
----------
622636

623637
.. csv-table:: UBOOT EMMC RAW
624638
:header: "File size (bytes in hex)","am62pxx_sk-fs: Write Throughput (Kbytes/sec)","am62pxx_sk-fs: Read Throughput (Kbytes/sec)"
@@ -627,7 +641,7 @@ UBoot EMMC Driver
627641
"4000000","97568.90 (min 95672.99, max 99598.78)","247676.85 (min 193893.49, max 302009.22)"
628642

629643
MMCSD
630-
-----
644+
=====
631645

632646
.. warning::
633647

@@ -638,7 +652,7 @@ MMCSD
638652
re-mount in async mode.
639653

640654
MMC EXT4 FIO 1G
641-
^^^^^^^^^^^^^^^
655+
---------------
642656

643657
.. csv-table:: MMC EXT4 FIO 1G
644658
:header: "Buffer size (bytes)","am62pxx_sk-fs: Write EXT4 Throughput (Mbytes/sec)","am62pxx_sk-fs: Write EXT4 CPU Load (%)","am62pxx_sk-fs: Read EXT4 Throughput (Mbytes/sec)","am62pxx_sk-fs: Read EXT4 CPU Load (%)"
@@ -649,7 +663,7 @@ MMC EXT4 FIO 1G
649663
"256k","38.92 (min 38.50, max 39.50)","1.30 (min 1.13, max 1.44)","83.68 (min 83.30, max 84.40)","1.59 (min 1.47, max 1.72)"
650664

651665
MMC EXT4
652-
^^^^^^^^
666+
--------
653667

654668
.. csv-table:: MMC EXT4
655669
:header: "Buffer size (bytes)","am62pxx_sk-fs: Write Raw Throughput (Mbytes/sec)","am62pxx_sk-fs: Write Raw CPU Load (%)","am62pxx_sk-fs: Read Raw Throughput (Mbytes/sec)","am62pxx_sk-fs: Read Raw CPU Load (%)"
@@ -684,11 +698,11 @@ The performance numbers were captured using the following:
684698

685699
|
686700
687-
USB Driver
688-
----------
701+
USB
702+
===
689703

690704
USB Device Controller
691-
^^^^^^^^^^^^^^^^^^^^^
705+
---------------------
692706

693707
.. csv-table:: USBDEVICE HIGHSPEED SLAVE_READ_THROUGHPUT
694708
:header: "Number of Blocks","am62pxx_sk-fs: Throughput (MB/sec)"
@@ -702,11 +716,11 @@ USB Device Controller
702716

703717
|
704718
705-
CRYPTO Driver
706-
-------------
719+
CRYPTO
720+
======
707721

708722
OpenSSL Performance
709-
^^^^^^^^^^^^^^^^^^^
723+
-------------------
710724

711725
.. csv-table:: OpenSSL Performance
712726
:header: "Algorithm","Buffer Size (in bytes)","am62pxx_sk-fs: throughput (KBytes/Sec)"
@@ -778,3 +792,5 @@ Listed for each algorithm are the code snippets used to run each
778792
::
779793

780794
time -v openssl speed -elapsed -evp aes-128-cbc
795+
796+
|

0 commit comments

Comments
 (0)