Qualcomm Atheros QCA9533 (MIPS 24Kc), 650MHz, 32 MB (16-bit DDR2 195MHz CL3 x 2). TP-Link TL-WR841N.
32 K 3 TLB + L1 64 K 5 + 90 ns 2 90 ns + 4 DTLB miss + 170 ns RAM 128 K 7 + 130 ns 2 40 ns 256 K 29 + 150 ns 22 20 ns + 42 (JTLB miss) 512 K 39 + 160 ns 10 + 10 ns 1 M 44 + 164 ns 5 + 5 ns 2 M 47 + 166 ns 3 + 2 ns 4 M 48 + 168 ns 1 + 2 ns 8 M 49 + 175 ns 1 + 7 ns 16 M 49 + 210 ns 1 + 35 ns + 170 ns RAM (Page walk to RAM)
Branch misprediction penalty = 4 cycles.
| # | Stage | Name | Description |
|---|---|---|---|
| 1 | IF | Instruction Fetch First |
I-cache tag/data arrays accessed Branch History Table accessed ITLB address translation performed EJTAG break/watch compares done |
| 2 | IS | Instruction Fetch Second |
Detect I-cache hit Way select MIPS32 Branch prediction |
| 3 | RF | Register File Access |
Register File access Instruction decoding/dispatch logic Bypass muxes |
| 4 | AG | Address Generation |
D-cache Address Generation Bypass muxes |
| 5 | EX | Execute/Memory Access |
Skewed ALU DTLB Start DCache access Branch Resolution |
| 6 | MS | Memory Access Second |
Complete DCache access DCache hit detection Way select mux Load align |
| 7 | ER | Exception Resolution |
Instruction completion Register file write setup Exception processing |
| 8 | WB | Writeback | Register file writeback occurs on rising edge of this cycle |
There are some decoding or execution problems for MIPS16 code. The execution rate for sequence of simple instructions is only about 0.75 IPC (3 instructions per 4 cycles) for MIPS16 code. Why?
7z b -mm=* : MIPS and Effectiveness values are normalized with AMD K8 cpu.
LZMA Dictionary is reduced, so some rating values are not accurate.
# gcc-6 -O2 -march=24kc
7-Zip (a) 16.02 : Copyright (c) 1999-2016 Igor Pavlov : 2016-05-21
p7zip Version 16.02 (locale=C.UTF-8,Utf16=on,HugeFiles=on,32 bits,1 CPU BE)
freq= 650
BE
CPU Freq: 638 639 639 639 634 636 638 636
RAM size: 28 MB, # CPU hardware threads: 1
RAM usage: 16 MB, # Benchmark threads: 1
Compressing | Decompressing
Dict Speed Usage R/U Rating | Speed Usage R/U Rating
KiB/s % MIPS MIPS | KiB/s % MIPS MIPS
18: 255 100 227 227 | 5558 100 450 449
19: 247 100 222 222 | 5485 100 450 449
20: 239 100 218 218 | 5413 100 448 449
---------------------------------- | ------------------------------
Avr: 100 222 222 | 100 449 449
Tot: 100 336 336
7-Zip (a) 16.02 : Copyright (c) 1999-2016 Igor Pavlov : 2016-05-21
p7zip Version 16.02 (locale=C.UTF-8,Utf16=on,HugeFiles=on,32 bits,1 CPU BE)
freq= 650
BE
CPU Freq: 629 639 639 639 634 639 636 637
RAM size: 28 MB, # CPU hardware threads: 1
Dictionary reduced to: 19
RAM usage: 16 MB, # Benchmark threads: 1
Method Speed Usage R/U Rating E/U Effec
KiB/s % MIPS MIPS % %
CPU 100 637 637 98 98
CPU 100 637 638 98 98
CPU 100 637 637 98 98
LZMA:x1 677 100 248 248 38 38
5426 100 438 438 67 67
LZMA:x5:mt1 248 100 310 310 48 48
5508 100 439 439 68 67
LZMA:x5:mt2 251 100 314 314 48 48
5508 100 438 439 67 67
Deflate:x1 1452 100 184 184 28 28
11208 100 348 348 54 54
Deflate:x5 760 100 293 293 45 45
11226 100 348 348 54 54
Deflate:x7 324 100 360 360 55 55
11280 100 350 350 54 54
Deflate64:x5 689 100 298 298 46 46
11057 100 344 344 53 53
BZip2:x1 348 100 211 211 32 32
1806 100 196 196 30 30
BZip2:x5 317 100 265 265 41 41
1635 100 316 316 49 49
BZip2:x5:mt2 316 100 264 264 41 41
1638 100 316 316 49 49
BZip2:x7 105 100 273 273 42 42
1646 100 317 318 49 49
PPMD:x1 243 100 252 252 39 39
206 100 243 243 37 37
PPMD:x5 183 100 310 310 48 48
163 100 307 307 47 47
Delta:4 24107 100 148 148 23 23
23787 100 146 146 22 22
BCJ 36209 100 148 148 23 23
37010 100 152 152 23 23
AES256CBC:1 5988 100 147 147 23 23
6105 100 150 150 23 23
AES256CBC:2
CRC32:1 50659 100 369 369 57 57
CRC32:4 64931 100 145 145 22 22
CRC32:8 68413 100 93 93 14 14
CRC64 53828 100 110 110 17 17
SHA256 12855 100 262 262 40 40
SHA1 25031 100 234 234 36 36
BLAKE2sp 14221 100 313 313 48 48
CPU 100 638 637 98 98
------------------------------------------------------
Tot: 100 321 321 49 49
# gcc-6 -O2 -march=24kc -mips16
7-Zip (a) 16.02 : Copyright (c) 1999-2016 Igor Pavlov : 2016-05-21
p7zip Version 16.02 (locale=C.UTF-8,Utf16=on,HugeFiles=on,32 bits,1 CPU BE)
freq= 650
BE
CPU Freq: 469 469 464 469 469 468 467
RAM size: 28 MB, # CPU hardware threads: 1
RAM usage: 16 MB, # Benchmark threads: 1
Compressing | Decompressing
Dict Speed Usage R/U Rating | Speed Usage R/U Rating
KiB/s % MIPS MIPS | KiB/s % MIPS MIPS
18: 160 100 143 143 | 2398 100 194 194
19: 157 100 141 141 | 2381 100 195 195
20: 153 100 139 140 | 2367 100 196 196
---------------------------------- | ------------------------------
Avr: 100 141 141 | 100 195 195
Tot: 100 168 168
7-Zip (a) 16.02 : Copyright (c) 1999-2016 Igor Pavlov : 2016-05-21
p7zip Version 16.02 (locale=C.UTF-8,Utf16=on,HugeFiles=on,32 bits,1 CPU BE)
freq= 650
BE
CPU Freq: 469 464 469 469 467 468 468
RAM size: 28 MB, # CPU hardware threads: 1
Dictionary reduced to: 19
RAM usage: 16 MB, # Benchmark threads: 1
Method Speed Usage R/U Rating E/U Effec
KiB/s % MIPS MIPS % %
CPU 100 468 468 72 72
CPU 100 468 468 72 72
CPU 100 469 468 72 72
LZMA:x1 460 100 168 168 26 26
2324 100 188 188 29 29
LZMA:x5:mt1 157 100 197 197 30 30
2388 100 190 190 29 29
LZMA:x5:mt2 171 100 214 214 33 33
2389 100 190 190 29 29
Deflate:x1 1136 100 144 144 22 22
6164 100 191 191 29 29
Deflate:x5 493 100 190 190 29 29
6171 100 192 191 29 29
Deflate:x7 189 100 210 210 32 32
6204 100 192 192 30 30
Deflate64:x5 459 100 199 198 31 31
6161 100 192 192 29 29
BZip2:x1 234 100 142 142 22 22
1113 100 121 121 19 19
BZip2:x5 217 100 182 182 28 28
1027 100 198 198 30 30
BZip2:x5:mt2 217 100 181 181 28 28
1024 100 198 198 30 30
BZip2:x7 63 100 165 165 25 25
1033 100 199 199 31 31
PPMD:x1 145 100 150 150 23 23
122 100 145 145 22 22
PPMD:x5 122 100 208 208 32 32
107 100 201 201 31 31
Delta:4 15731 100 97 97 15 15
16164 100 99 99 15 15
BCJ 23122 100 95 95 15 15
23446 100 96 96 15 15
AES256CBC:1 3236 100 80 80 12 12
2905 100 71 71 11 11
AES256CBC:2
CRC32:1 30741 100 224 224 34 34
CRC32:4 34787 100 78 78 12 12
CRC32:8 42049 100 57 57 9 9
CRC64 28638 100 59 59 9 9
SHA256 4488 100 92 92 14 14
SHA1 10408 100 97 97 15 15
BLAKE2sp 5431 100 120 119 18 18
CPU 100 468 468 72 72
------------------------------------------------------
Tot: 100 177 177 27 27