Absolute Core Clock Fluctuating

I have 3070 ti zotac trinity oc edition, having strange behaviour. It is keep core clock fluctuating, while other GPUs are normal, strict on it overclock parameter. Any one have same experiences and solution? Thanks

Is it thermal throttling?

Temperature still less than 60 C, so I think it is not thermal throtling.

Can you paste the output of this command: nvidia-smi -q

GPU 00000000:16:00.0
Product Name : NVIDIA GeForce RTX 3070 Ti
Product Brand : GeForce
Display Mode : Disabled
Display Active : Disabled
Persistence Mode : Enabled
MIG Mode
Current : N/A
Pending : N/A
Accounting Mode : Disabled
Accounting Mode Buffer Size : 4000
Driver Model
Current : N/A
Pending : N/A
Serial Number : N/A
GPU UUID : GPU-9d15195e-75ed-ce9c-dfd4-19e5739f3b58
Minor Number : 8
VBIOS Version : 94.04.5A.40.22
MultiGPU Board : No
Board ID : 0x1600
GPU Part Number : N/A
Module ID : 0
Inforom Version
Image Version : G001.0000.03.03
OEM Object : 2.0
ECC Object : N/A
Power Management Object : N/A
GPU Operation Mode
Current : N/A
Pending : N/A
GSP Firmware Version : N/A
GPU Virtualization Mode
Virtualization Mode : None
Host VGPU Mode : N/A
IBMNPU
Relaxed Ordering Mode : N/A
PCI
Bus : 0x16
Device : 0x00
Domain : 0x0000
Device Id : 0x248210DE
Bus Id : 00000000:16:00.0
Sub System Id : 0x165319DA
GPU Link Info
PCIe Generation
Max : 2
Current : 2
Link Width
Max : 16x
Current : 1x
Bridge Chip
Type : N/A
Firmware : N/A
Replays Since Reset : 0
Replay Number Rollovers : 0
Tx Throughput : 13000 KB/s
Rx Throughput : 36000 KB/s
Fan Speed : 65 %
Performance State : P2
Clocks Throttle Reasons
Idle : Not Active
Applications Clocks Setting : Not Active
SW Power Cap : Not Active
HW Slowdown : Not Active
HW Thermal Slowdown : Not Active
HW Power Brake Slowdown : Not Active
Sync Boost : Not Active
SW Thermal Slowdown : Active
Display Clock Setting : Not Active
FB Memory Usage
Total : 7982 MiB
Used : 4917 MiB
Free : 3065 MiB
BAR1 Memory Usage
Total : 256 MiB
Used : 5 MiB
Free : 251 MiB
Compute Mode : Default
Utilization
Gpu : 100 %
Memory : 100 %
Encoder : 0 %
Decoder : 0 %
Encoder Stats
Active Sessions : 0
Average FPS : 0
Average Latency : 0
FBC Stats
Active Sessions : 0
Average FPS : 0
Average Latency : 0
Ecc Mode
Current : N/A
Pending : N/A
ECC Errors
Volatile
SRAM Correctable : N/A
SRAM Uncorrectable : N/A
DRAM Correctable : N/A
DRAM Uncorrectable : N/A
Aggregate
SRAM Correctable : N/A
SRAM Uncorrectable : N/A
DRAM Correctable : N/A
DRAM Uncorrectable : N/A
Retired Pages
Single Bit ECC : N/A
Double Bit ECC : N/A
Pending Page Blacklist : N/A
Remapped Rows : N/A
Temperature
GPU Current Temp : 56 C
GPU Shutdown Temp : 98 C
GPU Slowdown Temp : 95 C
GPU Max Operating Temp : 93 C
GPU Target Temperature : 83 C
Memory Current Temp : N/A
Memory Max Operating Temp : N/A
Power Readings
Power Management : Supported
Power Draw : 204.18 W
Power Limit : 320.00 W
Default Power Limit : 310.00 W
Enforced Power Limit : 320.00 W
Min Power Limit : 100.00 W
Max Power Limit : 341.00 W
Clocks
Graphics : 825 MHz
SM : 825 MHz
Memory : 11001 MHz
Video : 1215 MHz
Applications Clocks
Graphics : N/A
Memory : N/A
Default Applications Clocks
Graphics : N/A
Memory : N/A
Max Clocks
Graphics : 2100 MHz
SM : 2100 MHz
Memory : 9501 MHz
Video : 1950 MHz
Max Customer Boost Clocks
Graphics : N/A
Clock Policy
Auto Boost : N/A
Auto Boost Default : N/A
Voltage
Graphics : 843.750 mV
Processes
GPU instance ID : N/A
Compute instance ID : N/A
Process ID : 14793
Type : G
Name : /usr/lib/xorg/Xorg
Used GPU Memory : 6 MiB
GPU instance ID : N/A
Compute instance ID : N/A
Process ID : 18230
Type : C
Name : /hive/miners/gminer/2.74/gminer
Used GPU Memory : 4907 MiB

image

Core clock set 1125 but actual fluctuating between 700 to 1125

Yep, it’s thermal throttling. Turn the fan to 100%

How much temperature target ideally?

Thermal throttling happens at 110c on the memory, so less than that. I run all my gddr6x cards at 100% fan, fans are the cheapest part imo.

isn’t it because of VRAM speed? 3500?.. GDDR6X is problematic… maybe if you change the thermal pads will sort it out

nvidia-smi -q | egrep “Thermal Slowdown”
is enough to write i think

So if thermal throttling normally happens at 110c, then why in my case, it is happens while still less than 60 C ?

because in hiveos it shows only core temp, not virtual memory temps…
put your gpu at 90% or even 100% FAN speed

core temp vs memory temp

Ok thanks all…

This topic was automatically closed 416 days after the last reply. New replies are no longer allowed.