Bartosz Taudul
f0ce7de193
Move callstack collection in mem events out of critical section.
2018-06-22 23:00:03 +02:00
Bartosz Taudul
b6088b908f
Callstack capture for ZoneBegin.
2018-06-22 00:56:30 +02:00
Bartosz Taudul
0c13fb818b
Initialize rpmalloc in Mem{Alloc,Free}Callstack().
...
rpmalloc may still be uninitialized here (i.e. if a memory allocation/free
is performed before any other tracy operation that would initialize
thread_local data). Since memory allocations use a serialized queue
(which is not held in the thread_local section) and obtaining a callstack
involves memory allocation, we need to initialize rpmalloc manually.
This won't be a problem once support for zone callstacks comes online,
because zones are stored in per-thread queues, which initialize
thread_local data before rpmalloc is needed in the Callstack() call.
2018-06-21 17:02:40 +02:00
Bartosz Taudul
909166daf7
Hide SendCallstackMemory().
2018-06-20 23:30:19 +02:00
Bartosz Taudul
8c46ad81d5
Extract common code.
2018-06-20 23:29:44 +02:00
Bartosz Taudul
5177a7b960
Callstack frame transfer.
2018-06-20 01:06:31 +02:00
Bartosz Taudul
9b1fb01e16
Disable Callstack() call if there's no callstack support.
2018-06-19 19:38:30 +02:00
Bartosz Taudul
51043ebc47
Callstack payload transfer.
2018-06-19 19:31:16 +02:00
Bartosz Taudul
55e6a4a484
No return status is needed here.
2018-06-19 19:00:57 +02:00
Bartosz Taudul
d0d3545988
Optional sending of callstack ptr in memory events.
2018-06-19 18:51:21 +02:00
Bartosz Taudul
9c11e0fc5b
Vulkan tracing.
2018-06-17 18:14:37 +02:00
Bartosz Taudul
5b6d9769af
Properly separate HW timer from MSVC rdtscp optimization.
2018-04-27 19:40:47 +02:00
Bartosz Taudul
237aee30a8
Test if HW timer can be used on arm.
2018-04-27 16:58:45 +02:00
Bartosz Taudul
6a2311a7b7
Arm64 also defines __ARM_ARCH.
2018-04-26 17:39:04 +02:00
Bartosz Taudul
a3f5003f88
Read time from timer register on armv6, armv7.
...
Same improvement as on aarch64.
2018-04-26 17:18:10 +02:00
Bartosz Taudul
69a50b04c1
Really don't care about cpu id.
2018-04-26 16:12:52 +02:00
Bartosz Taudul
1899066e36
Read time from timer register on arm64.
...
On ODROID C2 this change improves timer resolution from 250 ns to 41 ns.
2018-04-26 16:03:31 +02:00
Bartosz Taudul
3a20104882
No need for separate tracy_rdtscp() function.
2018-04-26 15:30:53 +02:00
Bartosz Taudul
48665cc09b
s/TRACY_RDTSCP_SUPPORTED/TRACY_HW_TIMER/
2018-04-26 15:25:54 +02:00
Bartosz Taudul
15219b1481
Support 4-byte size_t.
2018-04-14 16:08:39 +02:00
Bartosz Taudul
459890ef0e
Don't hold lock on serial queue during dequeue.
2018-04-14 15:46:11 +02:00
Bartosz Taudul
9c403d9cc2
GetTime() calls also must be serialized.
2018-04-01 21:07:33 +02:00
Bartosz Taudul
794f199bdc
Serial queue dequeuing.
2018-04-01 20:04:35 +02:00
Bartosz Taudul
860e0e1809
Store memory operations in the serial queue.
2018-04-01 19:53:24 +02:00
Bartosz Taudul
faeecdd773
Add serial queue to profiler.
2018-04-01 19:53:05 +02:00
Bartosz Taudul
991fc6bd95
Memory allocations tracker.
2018-03-31 21:56:05 +02:00
Bartosz Taudul
3b03e849f0
Harden client code against unaligned memory access.
...
There shouldn't be any changes in generated code on modern
architectures, as the memcpy will be reduced to a store/load operation
identical to the one generated with plain struct member access.
GetTime( cpu ) needs special handling, as the MSVC intrinsic for rdtscp
can't store the cpu identifier in a register. Using an intermediate
variable would cause a store to the stack, a read from the stack, and a
store to the destination address. Since rdtscp is only available on x86,
which handles unaligned stores without any problems, we can have one
place with direct struct member access.
2018-03-31 14:15:04 +02:00
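The memcpy approach described in commit 3b03e849f0 above can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: the helper names `MemWrite` and `MemRead`, the `WriteEvent` function, and the event layout (a 1-byte tag followed by an 8-byte timestamp) are assumptions made for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical packed event layout: a 1-byte tag followed by an 8-byte
// timestamp, so the timestamp lives at an odd (unaligned) offset.
constexpr std::size_t kTimeOffset = 1;
constexpr std::size_t kEventSize = kTimeOffset + sizeof(int64_t);

// Store a value at a possibly unaligned address. Going through memcpy
// avoids dereferencing a misaligned pointer; compilers for targets that
// allow unaligned access typically lower this to a single store.
template <typename T>
inline void MemWrite(void* dst, T value) {
    std::memcpy(dst, &value, sizeof(T));
}

// Load a value from a possibly unaligned address.
template <typename T>
inline T MemRead(const void* src) {
    T value;
    std::memcpy(&value, src, sizeof(T));
    return value;
}

// Fill an event buffer without any direct struct member access.
inline void WriteEvent(unsigned char* buf, uint8_t tag, int64_t time) {
    assert(buf != nullptr);
    buf[0] = tag;
    MemWrite(buf + kTimeOffset, time);
}
```

A caller would reserve `kEventSize` bytes, call `WriteEvent`, and later read the timestamp back with `MemRead<int64_t>(buf + kTimeOffset)`.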
Bartosz Taudul
2f669aea41
Workaround gcc issues.
2017-11-15 10:56:27 +01:00
Bartosz Taudul
c43eb29ce0
Don't send source location pointer in query reply.
...
Since the reply order is the same as the query order, the server already
knows which source location it receives. This observation allows placing
the zone name into the source location struct.
2017-11-14 23:06:45 +01:00
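The ordering argument in commit c43eb29ce0 above can be illustrated with a small server-side sketch. It assumes replies arrive in exactly the order the queries were sent; the class name `PendingSourceLocations`, its methods, and the string payload are hypothetical and are not taken from the Tracy sources.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <queue>
#include <string>
#include <unordered_map>

// Hypothetical server-side bookkeeping: the pointer is remembered when the
// query is sent, so the reply itself does not need to carry it.
class PendingSourceLocations {
public:
    // Record that a query for the given client-side pointer was sent.
    void Query(uint64_t ptr) { m_pending.push(ptr); }

    // Handle a reply. It belongs to the oldest outstanding query because
    // replies are assumed to arrive in query order.
    void Reply(const std::string& payload) {
        assert(!m_pending.empty());
        const uint64_t ptr = m_pending.front();
        m_pending.pop();
        m_resolved[ptr] = payload;
    }

    // Return the resolved payload, or an empty string if not yet known.
    std::string Lookup(uint64_t ptr) const {
        const auto it = m_resolved.find(ptr);
        return it == m_resolved.end() ? std::string() : it->second;
    }

    std::size_t PendingCount() const { return m_pending.size(); }

private:
    std::queue<uint64_t> m_pending;
    std::unordered_map<uint64_t, std::string> m_resolved;
};
```

The saving is on the wire: each reply shrinks by the size of the pointer, at the cost of the server keeping a FIFO of outstanding queries.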
Bartosz Taudul
3c00ce0958
GPU context registration.
2017-11-11 19:44:09 +01:00
Bartosz Taudul
c2797a4cc7
Data packets can't cross data buffer boundary.
2017-11-11 15:08:03 +01:00
Bartosz Taudul
fc4e31bb8f
Wait for send buffer to be full before sending data.
2017-11-11 14:16:37 +01:00
Bartosz Taudul
bc77aa8d26
Source location payload transfer. No storage yet.
2017-11-05 16:46:00 +01:00
Bartosz Taudul
3659afac2a
Move another big buffer out of stack.
2017-11-02 17:37:10 +01:00
Bartosz Taudul
cea7cdd716
Don't create big buffers on stack.
2017-11-02 12:58:38 +01:00
Bartosz Taudul
f861f89134
Disable rdtscp on android.
2017-10-30 16:23:23 +01:00
Bartosz Taudul
97880a89ae
Clobber ecx register.
2017-10-29 16:20:07 +01:00
Bartosz Taudul
a220043114
Add no-cpu GetTime() variant.
...
In this version the address of cpu output variable is const, so there's
no stack address calculation involved.
2017-10-29 16:12:16 +01:00
Bartosz Taudul
68f5a17bca
Use 32-bit registers for rdtscp output.
2017-10-29 13:15:43 +01:00
Bartosz Taudul
f1da7c1c85
Force TLS block creation on cygwin before malloc.
2017-10-20 18:28:25 +02:00
Bartosz Taudul
51013dc0e6
Manual allocation of socket memory.
2017-10-18 19:50:28 +02:00
Bartosz Taudul
fc94378e0c
Move TracyAlloc.hpp to common. Use rpmalloc only if TRACY_ENABLE.
2017-10-18 19:50:28 +02:00
Bartosz Taudul
7c47edc64f
Terminate connection handshake.
2017-10-18 18:48:51 +02:00
Bartosz Taudul
51f5ae4796
More precise profiler init end time measurement.
2017-10-17 01:07:54 +02:00
Bartosz Taudul
9f28205548
Use custom threading wrapper instead of std::thread.
...
std::thread may perform memory allocation when a thread is created (it
does so on MSVC). Tracy heap is managed by its own allocator and this
change prevents accessing application heap.
2017-10-16 21:17:58 +02:00
Bartosz Taudul
5b9fcddfb3
String literal message transfer.
2017-10-15 13:06:49 +02:00
Bartosz Taudul
40bc4c8015
Missing include.
2017-10-14 17:21:14 +02:00
Bartosz Taudul
c497966c7f
Use rpmalloc to allocate tracy client memory.
2017-10-14 17:15:18 +02:00
Bartosz Taudul
8c7b60fbe6
Allow sending text messages.
2017-10-14 13:23:13 +02:00
Bartosz Taudul
cb0011755d
Prevent type conversions.
2017-10-13 02:21:29 +02:00
Bartosz Taudul
f0484b50ca
Plot data transfer.
2017-10-13 02:07:03 +02:00
Bartosz Taudul
e23da05a65
Workaround gcc stupidity.
2017-10-11 01:44:35 +02:00
Bartosz Taudul
77dfefb5d0
Remove one stack address load.
2017-10-11 01:27:22 +02:00
Bartosz Taudul
af3773dc9a
Remove one level of indirection.
2017-10-11 01:04:21 +02:00
Bartosz Taudul
cc8b357f09
Avoid excessive stack operations for cpu query.
2017-10-10 23:21:30 +02:00
Bartosz Taudul
75457c1465
Remove +x flag from files.
2017-10-10 21:56:15 +02:00
Bartosz Taudul
3f0bd793fd
Send program start time, not connection time.
2017-10-04 00:34:05 +02:00
Bartosz Taudul
b1aa16763b
Prevent accessing TLS data twice on gcc.
2017-10-03 16:55:04 +02:00
Bartosz Taudul
d1edd30ca6
Zone ids are unnecessary.
2017-10-03 16:41:32 +02:00
Bartosz Taudul
2fb4c47491
Remember to calibrate timer.
2017-10-03 15:35:43 +02:00
Bartosz Taudul
7b1135239c
Use rdtscp when there's no intrinsic.
2017-10-03 15:28:31 +02:00
Bartosz Taudul
9cde85646a
Fix typo.
2017-10-03 15:16:48 +02:00
Bartosz Taudul
e01d378f52
More force inlining.
2017-10-03 15:10:25 +02:00
Bartosz Taudul
ba037e5798
Do not store tail index in memory.
2017-10-03 14:50:55 +02:00
Bartosz Taudul
353fda95a3
Expose profiler internals to make it easier for inlining.
...
concurrentqueue.h doesn't bring any poisonous includes, only STL.
2017-10-03 14:22:49 +02:00
Bartosz Taudul
a1abf1f015
Record CPU id.
2017-10-01 19:17:08 +02:00
Bartosz Taudul
f46781808c
Construct queue items directly in queue memory.
2017-10-01 17:49:45 +02:00
Bartosz Taudul
6a2cb2c14e
Calculate timer resolution.
2017-09-29 18:29:39 +02:00
Bartosz Taudul
8c1c395cec
Allow sending custom zone names.
2017-09-28 19:28:24 +02:00
Bartosz Taudul
d1bbb731fc
Zone text (custom string) transfer.
2017-09-27 02:18:17 +02:00
Bartosz Taudul
3cc7cc596e
Remove GetNewId() from Profiler interface.
2017-09-27 01:02:04 +02:00
Bartosz Taudul
f584bf76e8
Profiler ID can be static (one less instruction).
2017-09-27 00:30:02 +02:00
Bartosz Taudul
e076d1d475
Send source location answer in stream, not as separate packet.
2017-09-26 19:00:25 +02:00
Bartosz Taudul
e90a86e06e
Store zone color in source location struct.
2017-09-26 18:54:48 +02:00
Bartosz Taudul
7424077d70
Store source location in a single object.
...
Source file, function name and line number are now stored in a const
static container object. This has the following benefits:
- Slightly lighter profiling workload (3 fewer instructions).
- Profiling queue event size is significantly reduced, by 12 bytes. This
has an effect on all queue event types.
- Source location grouping now has no cost, as it's performed at the
compilation stage. This allows simplification of server code.
The downside is that the full source location resolution is now
performed in two steps, as the server has to query both source location
container and strings contained within. This has almost no real impact
on profiler operation.
2017-09-26 02:39:08 +02:00
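The single-object layout described in commit 7424077d70 above might look roughly like this. The struct and function names (`SourceLocation`, `ZoneBeginEvent`, `MakeZoneBegin`, `ExampleLocation`) and the exact field types are assumptions for illustration only; the commit message states just that file, function, and line live in one const static object.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical container for everything known at compile time.
struct SourceLocation {
    const char* function;
    const char* file;
    uint32_t line;
};

// Hypothetical queue event: it carries one address instead of separate
// file, function, and line fields, which is where the size saving comes from.
struct ZoneBeginEvent {
    int64_t time;
    uintptr_t srcloc;  // address of a static SourceLocation
};

// Build an event that refers to a source location by address only.
inline ZoneBeginEvent MakeZoneBegin(const SourceLocation* loc, int64_t now) {
    assert(loc != nullptr);
    return ZoneBeginEvent{ now, reinterpret_cast<uintptr_t>(loc) };
}

// One static object per instrumentation site. Every execution of the site
// yields the same address, so identical locations are grouped for free.
inline const SourceLocation* ExampleLocation() {
    static const SourceLocation loc{ __func__, __FILE__, __LINE__ };
    return &loc;
}
```

Because the event holds only an address, the receiving side has to ask for the struct first and then for the strings it points to, which matches the two-step resolution the commit message mentions as the downside.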
Bartosz Taudul
e5ad7d9ac4
GetTime() call can now be inlined.
...
No dependencies on either windows.h, or static instance of Profiler.
2017-09-26 00:42:09 +02:00
Bartosz Taudul
fce04c6215
Profiling delay calibration.
2017-09-24 16:02:09 +02:00
Bartosz Taudul
7770014844
Use rdtscp to measure time on windows.
2017-09-23 21:33:05 +02:00
Bartosz Taudul
bd9ffc16b5
Hide GetTime() in Profiler.
2017-09-23 21:10:26 +02:00
Bartosz Taudul
2faa1abb21
Store main thread id.
2017-09-23 01:37:07 +02:00
Bartosz Taudul
70ad3407c0
Rework client handling of server requests.
2017-09-22 01:54:04 +02:00
Bartosz Taudul
9d2fef2f11
Hide implementation details wrt concurrent queue.
2017-09-18 18:51:45 +02:00
Bartosz Taudul
d7914439e9
Use stream compression.
...
Previously each data packet was compressed independently. After this
change all new packets reference the previously sent data, which
achieves better compression.
2017-09-17 13:10:58 +02:00
Bartosz Taudul
03ece0ac48
Send frame markers.
2017-09-16 00:30:27 +02:00
Bartosz Taudul
f61f50385d
Add ability to send strings over network.
2017-09-14 19:24:35 +02:00
Bartosz Taudul
f3ce055568
Mirror TracyView::ShouldExit in TracyProfiler.
2017-09-14 19:23:50 +02:00
Bartosz Taudul
76df000467
Move sending data to a separate function.
2017-09-14 19:07:56 +02:00
Bartosz Taudul
d999f35dfa
Exchange time and id in queue header and data structs.
2017-09-14 01:14:40 +02:00
Bartosz Taudul
a31ab6a256
Move TracyQueue.hpp to common.
2017-09-13 22:56:08 +02:00
Bartosz Taudul
8d3aae24bf
Use producer tokens during event insertion.
2017-09-10 20:52:10 +02:00
Bartosz Taudul
6a7fdea6fd
Store profiling start time.
2017-09-10 20:14:16 +02:00
Bartosz Taudul
12a6306c0b
Allow queuing zones.
2017-09-10 20:09:14 +02:00
Bartosz Taudul
05486c8225
Add unique event identifier source.
2017-09-10 20:08:42 +02:00
Bartosz Taudul
a5d6039aea
Profiler worker thread skeleton.
2017-09-10 17:43:56 +02:00