Bartosz Taudul
a7d2ab4d4f
Send single string for lock name.
2020-07-26 01:47:49 +02:00
Bartosz Taudul
e91950f006
Send single string for messages.
2020-07-26 01:35:52 +02:00
Bartosz Taudul
5553761c02
Send single string for zone text and name.
2020-07-26 01:35:52 +02:00
Bartosz Taudul
309a151610
Symbol length no longer has to be packed.
2020-07-26 01:35:52 +02:00
Bartosz Taudul
88685440b6
Send single string for callstack frame.
2020-07-26 01:35:52 +02:00
Bartosz Taudul
61e3acde06
Send single string for callstack frame size.
2020-07-26 01:35:52 +02:00
Bartosz Taudul
06ca4e2d16
Send single string for symbol information.
2020-07-26 01:35:52 +02:00
Bartosz Taudul
14b180cd16
Send single string for code information.
2020-07-26 01:35:51 +02:00
Bartosz Taudul
81d5a8db5e
Implement transport of single string data.
...
In most cases only one string is sent per message and no pointer
tracking is needed.
This is only plumbing work, no changes to messages have been made yet.
2020-07-26 01:35:51 +02:00
Bartosz Taudul
02e7893c75
Preserve messages size.
2020-07-21 20:58:58 +02:00
Bartosz Taudul
9d01fa86ab
Preserve zone text, zone name string length.
2020-07-21 20:58:58 +02:00
Bartosz Taudul
5d748f3568
Don't calculate known strings length.
2020-07-21 20:58:58 +02:00
Bartosz Taudul
e4fbf60668
Add SendString() with length parameter.
2020-07-21 20:58:58 +02:00
Bartosz Taudul
346e5a426f
Mask out alpha channel on SSE and AVX2 paths.
2020-07-20 13:58:35 +02:00
Bartosz Taudul
7b00812fa7
Add missing rpmalloc include.
2020-07-19 13:04:51 +02:00
Bartosz Taudul
2bef3629b7
Merge pull request #74 from avoroshilov/manual-lifetime
...
Manual lifetime management for Multi-DLL
2020-07-19 12:06:11 +02:00
Andrey Voroshilov
175ec3e3d8
Moving ThreadExitHandler to a commonly accessible space, adding thread handler usage to SysTrace worker threads
2020-07-18 18:24:49 -07:00
Bartosz Taudul
b8df7a1302
Expose m_isConnected in non-on-demand builds.
2020-07-16 11:22:06 +02:00
Andrey Voroshilov
cbfb19816b
Merge remote-tracking branch 'tracy/master' into manual-lifetime
...
# Conflicts:
# AUTHORS
2020-07-13 01:49:11 -07:00
Andrey Voroshilov
4c397ebe1e
Fixing some of the copy-paste errors
2020-07-12 10:12:50 -07:00
Andrey Voroshilov
3ae84647c4
Moving GetProfiler back to the original place
2020-07-12 10:10:09 -07:00
Andrey Voroshilov
a758de2f03
Moving GetProfiler back to the original place
2020-07-12 10:06:09 -07:00
Andrey Voroshilov
6b790d778d
Removing spinlock that is not needed anymore, making TRACY_MANUAL_LIFETIME a sub-option of TRACY_DELAYED_INIT, and addressing feedback
2020-07-12 10:04:07 -07:00
Andrey Voroshilov
4d505f507b
Fixed rogue TAB indentation
2020-07-08 03:52:00 -07:00
Andrey Voroshilov
199dc46e7d
Fixing crash on exit due to RP malloc not being deinitialized
2020-07-08 03:50:25 -07:00
Andrey Voroshilov
71ca0683d6
Fixing preprocessor formatting
2020-07-08 03:48:37 -07:00
Bartosz Taudul
5e5bf928a5
Add QPC frequency query to API.
2020-07-07 21:25:35 +02:00
Andrey Voroshilov
6a72560989
Fixing functions case to match the source capitalization rules
2020-07-07 03:12:02 -07:00
Andrey Voroshilov
da5e58682f
Adding manual lifetime management to aid multi-DLL use case
2020-07-07 00:39:09 -07:00
Bartosz Taudul
384e2e3fa1
Reduce memory requirements of allocated call stacks.
2020-07-05 17:33:29 +02:00
Bartosz Taudul
f718761905
Reduce allocated source location size by 2 bytes.
2020-07-05 17:11:15 +02:00
Bartosz Taudul
4179e85029
Add missing parameters.
2020-07-02 17:17:01 +02:00
Bartosz Taudul
0da6342a86
Trace vsyncs.
2020-06-27 19:56:23 +02:00
Bartosz Taudul
1fff99ebb0
Fix memory leak.
2020-06-27 17:43:08 +02:00
Bartosz Taudul
4bbeb51e34
Add secure alloc/free macros.
2020-06-24 01:33:26 +02:00
Bartosz Taudul
530e464347
Add checker for profiler availability.
2020-06-24 01:32:57 +02:00
Bartosz Taudul
cf8620b956
Cosmetics.
2020-06-24 01:14:55 +02:00
Simonas Kazlauskas
b0d67064b5
Check for null FILE handles in ReportTopology
...
On Linux `/sys` is not guaranteed to exist. When it doesn't, tracy
would crash with a SIGSEGV due to a nullptr dereference. Not actually
sure how well tracy clients will handle the case where this information
is missing, though.
2020-06-21 20:14:05 +03:00
Bartosz Taudul
b353eb753b
Merge pull request #56 from nagisa/nagisa/fix-macos-build
...
Fix build on macos
2020-06-20 19:57:09 +02:00
Simonas Kazlauskas
b03da81afa
Fix build on macos
...
It was broken by a110b42011.
In addition, adjust the CI definition to test builds on macOS.
2020-06-20 20:44:55 +03:00
Simonas Kazlauskas
29886435b4
___tracy_alloc_* take pointer-size pairs
...
This enables better bindings in languages that do not have 0-terminated
strings for source/function name. It does not introduce any additional
overhead in languages that do use 0-terminated strings, either, but it
_is_ a breaking API change.
Fixes https://github.com/wolfpld/tracy/issues/53
2020-06-20 20:35:42 +03:00
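The pointer-size change in 29886435b4 can be illustrated with a minimal sketch. It is not Tracy's C API: `read_c_string` and `read_sized_string` are hypothetical helpers, and a Python `bytes` object stands in for raw memory. The point is only that an explicit size lets a caller pass a name that has no terminating zero byte.

```python
def read_c_string(buf: bytes, offset: int) -> bytes:
    """Read a 0-terminated string by scanning for the first NUL byte."""
    end = buf.index(b"\x00", offset)
    return buf[offset:end]


def read_sized_string(buf: bytes, offset: int, size: int) -> bytes:
    """Read a string given as an (offset, size) pair; no terminator is needed."""
    return buf[offset:offset + size]


# Hypothetical layout: two names stored back to back with no terminator,
# as a language with length-carrying strings might keep them.
packed = b"main.rsrender_frame"
source = read_sized_string(packed, 0, 7)
function = read_sized_string(packed, 7, 12)
```

A caller that already holds 0-terminated strings can still use the sized form by passing the known length, which matches the claim that no extra overhead is introduced for such languages.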
Simonas Kazlauskas
a110b42011
Add ___tracy_init_thread function
...
This function needs to be called for each non-main thread before calling
the `___tracy_alloc_` functions.
Alternative way to achieve this could be initializing the allocator
transparently in the `___tracy_alloc_*` calls.
2020-06-20 04:22:32 +03:00
Bartosz Taudul
aa3b0de1f5
Use proper cpuid flag.
2020-06-10 16:25:19 +02:00
Bartosz Taudul
284d49b34b
Change rdtscp check to rdtsc check.
2020-06-08 19:35:42 +02:00
Bartosz Taudul
65314e0c90
Set proper SymbolData contents on symbol retrieval error.
2020-06-04 17:46:39 +02:00
Graydon Hoare
28a29d071f
only write SysTime::used if fscanf succeeds
2020-06-03 19:54:49 -07:00
Graydon Hoare
93b7b5a8e7
ensure regs is initialized even if cpuid fails
2020-06-03 19:54:48 -07:00
Bartosz Taudul
54eb75b063
Report symbol entry address in inline function discovery.
2020-05-30 15:38:59 +02:00
Bartosz Taudul
2b304581cf
Implement transfer of integral values for zones.
2020-05-24 16:13:09 +02:00
Bartosz Taudul
1bcde1f2ff
Only one branch.
2020-05-24 13:41:41 +02:00
kudansam
1151ec1328
Fix defines when compiling with -Werror=undef
...
Some ARM defines fail when compiling with -Werror=undef as they rely on
the missing define mapping to 0.
2020-05-22 15:48:59 +02:00
Thales Sabino
c2c234cf5a
Fix crash when running Tracy from DLLs
...
Instantiating Tracy from within a DLL ties the lifetime of its internal threads to the DLL. Windows does not guarantee
that threads will be alive after the main function returns. This has implications for the Profiler dtor, since it will try to perform
some deallocations; however, _memory_deallocate_large will try to get the heap of the current thread, which can
be invalid at the point of shutdown, causing a crash. Checking the pointer here won't make TRACE_NO_EXIT
work, but it will prevent the Profiler from crashing.
2020-05-21 14:26:29 +01:00
Bartosz Taudul
4eb78f5c86
Auto-initialize profiler in delayed init scenario.
2020-05-19 13:55:54 +02:00
Bartosz Taudul
fad7e72fd4
Harden against uninitialized rpmalloc.
...
Initialize rpmalloc either by explicitly calling InitRPMallocThread(),
or by forcing initialization of thread local variables block.
2020-05-19 13:51:11 +02:00
Bartosz Taudul
665c6d6699
Don't check for allocation validity.
...
Will fail anyway right afterwards, if nullptr.
2020-05-16 16:40:25 +02:00
Bartosz Taudul
21c168156c
Restrict client to C++11.
2020-05-13 18:15:12 +02:00
Bartosz Taudul
e9f93f5bc7
Send lean frame images.
2020-05-10 20:16:08 +02:00
Bartosz Taudul
03b5dfacd6
Send lean callstack samples.
2020-05-10 20:00:51 +02:00
Bartosz Taudul
09388f3c99
Send lean callstack allocs.
2020-05-10 19:56:36 +02:00
Bartosz Taudul
5a774c82cc
Send lean callstacks.
2020-05-10 19:43:12 +02:00
Bartosz Taudul
f0ade07be8
Send lean memory callstacks.
2020-05-10 19:28:08 +02:00
Bartosz Taudul
2dc07fca0b
Send lean allocated source locations.
2020-05-10 19:20:59 +02:00
Bartosz Taudul
91bb392678
Avoid executing strlen() twice in assert-enabled builds.
2020-05-10 15:55:12 +02:00
txfx
412d252eea
Remove extra semicolons at the end of namespaces
2020-05-10 15:32:39 +02:00
Bartosz Taudul
a47c7d467f
Send x86 processor info in welcome message.
2020-05-06 19:18:17 +02:00
Bartosz Taudul
f13413922d
Use one cpuid implementation.
2020-05-06 18:52:36 +02:00
Rokas Kupstys
04eaf358d0
Fix linking error in some configurations. Unresolved CallTrace symbol was observed in static MSVC RelWithDebInfo build (but not in debug build).
2020-05-05 13:23:46 +03:00
ikrima
707117c04f
Build sanitization & Static analysis warning fixes
...
- Wrapping FORCEINLINE & WIN32_LEAN_AND_MEAN defines with ifndef because other libraries may define them and trigger redefinition warnings
- Possibly contentious given tone in the manual (:P) but removing variable shadowing in TracySysTrace.cpp
- Alternate Solution: Add #define TRACY_FORCE_SILENT_WARNINGS toggle-able flag. If flag is enabled, push/pop warning disables that have to be included in client code
2020-05-02 14:52:57 -07:00
Bartosz Taudul
14ec246659
Fix typo.
2020-04-24 00:55:57 +02:00
Bartosz Taudul
a5bff2f7e5
Sleep to force rescheduling main thread during init.
...
This fixes problems with first context switch data region possibly not being
available for the main thread, if no rescheduling was performed after sys
tracing has started.
2020-04-14 22:45:32 +02:00
Bartosz Taudul
3b85c51e5f
Search for free listen port, if default is occupied.
2020-04-13 21:40:52 +02:00
Bartosz Taudul
b389ccbb38
Issue just one read call when handling server queries.
2020-04-13 14:32:31 +02:00
Bartosz Taudul
1bbece649f
Implement socket read without exit check.
2020-04-13 14:22:58 +02:00
Bartosz Taudul
a2187565d1
Optimize non-native-size memcpy.
2020-04-13 13:45:21 +02:00
Bartosz Taudul
b69aaf04e9
Add support for QPC timer.
2020-04-07 22:01:31 +02:00
Bartosz Taudul
34b512d04b
Don't declare unused variables on cygwin.
2020-04-07 21:41:12 +02:00
Bartosz Taudul
8d9a611874
Get rid of unicode ifdefs.
2020-04-07 21:35:37 +02:00
Bartosz Taudul
69c5e667ae
Dynamically load Get/SetThreadDescription.
2020-04-07 21:33:03 +02:00
Bartosz Taudul
7fca642c3d
Compress full-quality DXT1 on AVX2 path.
2020-04-05 17:10:43 +02:00
Bartosz Taudul
a6468b6b6e
Sleep when clearing queues if listen port is occupied.
2020-04-04 21:08:13 +02:00
Bartosz Taudul
b2a8b53efa
Query source location of each assembly instruction.
2020-04-01 21:43:03 +02:00
Bartosz Taudul
0ba0125eb5
Cosmetics.
2020-04-01 21:42:14 +02:00
Bartosz Taudul
a8e8a4a167
Add code address to function, line decoder.
2020-04-01 21:41:33 +02:00
Bartosz Taudul
36ddd0b98b
Don't use new to allocate memory on the client.
2020-03-28 21:27:19 +01:00
Bartosz Taudul
9b8eb69886
Apparently sampled call stacks may be empty.
2020-03-28 16:09:44 +01:00
Bartosz Taudul
40281ce2a1
Add default no-op to switch.
2020-03-26 01:07:25 +01:00
Bartosz Taudul
add5b29d03
Report CPU architecture in welcome message.
2020-03-25 21:28:02 +01:00
Bartosz Taudul
ce449ac0e2
Notify server that parameter was handled.
2020-03-25 20:37:26 +01:00
Bartosz Taudul
f114ec3f80
Add code transfer from client to server.
2020-03-25 20:04:55 +01:00
Bartosz Taudul
3e0e120222
Add extra parameter to server queries.
2020-03-25 20:04:01 +01:00
Bartosz Taudul
c999a74d34
Symbol length transfer.
2020-03-25 18:32:36 +01:00
Bartosz Taudul
d47e6819a8
Collect symbol sizes.
2020-03-25 18:28:28 +01:00
Bartosz Taudul
6c0c508280
Ignore kernel-only stacks.
...
It is common to receive duplicate stack traces for the same timestamp
(and thread), one containing proper user-space stack, and the second one
containing only kernel frames. Discard the second one, as there's no
documentation how this should be interpreted and the kernel stack is
mostly useless.
2020-03-21 15:25:30 +01:00
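The filtering rule from 6c0c508280 can be sketched as follows. The commit does not say how kernel frames are recognized, so the test used here (top bit of a 64-bit address set) is an assumption, and `is_kernel_frame`, `drop_kernel_only_stacks` and the sample layout are hypothetical names for illustration only.

```python
KERNEL_BIT = 1 << 63  # assumption: upper-half 64-bit addresses are kernel space


def is_kernel_frame(addr: int) -> bool:
    """Return True if the address falls in the assumed kernel range."""
    return (addr & KERNEL_BIT) != 0


def drop_kernel_only_stacks(samples):
    """Keep only samples that contain at least one user-space frame.

    A stack with no frames at all is dropped as well in this sketch.
    """
    return [s for s in samples
            if any(not is_kernel_frame(a) for a in s["frames"])]


# Two samples for the same timestamp and thread: the first holds
# user-space frames, the second holds kernel frames only.
samples = [
    {"time": 100, "thread": 1, "frames": [0x401000, 0x401200]},
    {"time": 100, "thread": 1, "frames": [0xFFFFFFFF81000000, 0xFFFFFFFF81000ABC]},
]
kept = drop_kernel_only_stacks(samples)
```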
Bartosz Taudul
c7afda2562
Exit processing loops when trace has stopped.
2020-03-10 18:56:49 +01:00
Bartosz Taudul
c6bb08355c
Allow specification of port through env variable.
2020-03-08 16:14:36 +01:00
Bartosz Taudul
1da62c2190
Send deferred lock names.
2020-03-08 15:05:35 +01:00
Bartosz Taudul
127224acc6
Send listen port in broadcast message.
2020-03-08 14:37:59 +01:00
Bartosz Taudul
14c896573d
Separate config for data and broadcast port.
2020-03-08 14:34:09 +01:00
Bartosz Taudul
2ffaa88c9e
Fix typo.
2020-03-08 14:19:08 +01:00
Bartosz Taudul
e7240cb77d
Custom lock name transfer.
2020-03-08 13:47:38 +01:00
Bartosz Taudul
f945278959
Fix rpmalloc on android.
2020-03-02 17:10:47 +01:00
Bartosz Taudul
c36ed4b8b8
Boring warning fixes.
2020-03-01 01:48:20 +01:00
Bartosz Taudul
c23984dd6a
Fix static assert in rpmalloc.
2020-03-01 01:31:31 +01:00
Bartosz Taudul
e9a32d5dc7
Greatly increase queue block size.
...
Previous block size could hold only 256 elements (8KB), which stressed
out the memory allocator. Storing 65536 elements (2MB) per block almost
completely eliminates the allocator pressure.
2020-03-01 01:15:13 +01:00
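The figures quoted in e9a32d5dc7 are consistent with each other, as a short calculation shows. The per-element size is not stated in the message; it is derived here from the quoted 256 elements per 8 KB block, and the variable names are illustrative.

```python
OLD_ELEMENTS = 256
OLD_BLOCK_BYTES = 8 * 1024      # "8KB" from the commit message
NEW_ELEMENTS = 65536

# Element size implied by the old block: 8192 bytes / 256 elements.
item_size = OLD_BLOCK_BYTES // OLD_ELEMENTS

# New block size at the same element size, and the growth factor.
new_block_bytes = NEW_ELEMENTS * item_size
growth = NEW_ELEMENTS // OLD_ELEMENTS

print(item_size, new_block_bytes // (1024 * 1024), growth)  # prints "32 2 256"
```

Each block therefore holds 256 times as many elements, so, all else being equal, a new block has to be requested from the allocator 256 times less often.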
Bartosz Taudul
82f463724c
Update rpmalloc to 1.4.0.
...
Notable changes: use C++11 atomics everywhere.
2020-03-01 01:02:25 +01:00
Bartosz Taudul
710a2a64e4
Fix copy pasta.
2020-02-27 14:08:56 +01:00
Bartosz Taudul
4346620afa
No need to copy module name.
2020-02-27 13:45:39 +01:00
Bartosz Taudul
fd8a9465d4
Cosmetics.
2020-02-27 13:40:41 +01:00
Bartosz Taudul
9ae71ac4ee
Dl_info doesn't destroy data.
2020-02-27 13:28:45 +01:00
Bartosz Taudul
5f6b3d2cd5
No need for module name intermediate buffer.
2020-02-27 13:24:36 +01:00
Bartosz Taudul
474383b656
Only copy symbol strings, if needed.
2020-02-27 13:17:26 +01:00
Bartosz Taudul
2df6f9068a
Don't retrieve symbol name for address.
2020-02-27 12:58:01 +01:00
Bartosz Taudul
be5793987e
Don't send symbol name.
2020-02-27 12:49:48 +01:00
Bartosz Taudul
56dce646cc
Symbol address decoding on unix.
2020-02-26 23:38:04 +01:00
Bartosz Taudul
4ddafdeeaf
Symbol address decoding for old androids.
2020-02-26 23:24:18 +01:00
Bartosz Taudul
7c506d5426
Remove unused variables.
2020-02-26 23:24:11 +01:00
Bartosz Taudul
26cee8acf0
Perform symbol information queries.
2020-02-26 22:35:15 +01:00
Bartosz Taudul
ef05570540
Symbol address decoding (win32 implementation).
2020-02-26 22:32:42 +01:00
Bartosz Taudul
03ff08a934
Increase max name size.
2020-02-26 22:32:09 +01:00
Bartosz Taudul
d1fcf80c2d
Move definition of max symbol name size to one place.
2020-02-26 22:30:11 +01:00
Bartosz Taudul
c0f49c648b
Validate size.
2020-02-26 22:27:10 +01:00
Bartosz Taudul
890cec9872
Retrieve symbol addresses on unix.
2020-02-26 02:25:45 +01:00
Bartosz Taudul
9231261d73
Retrieve image name on unix.
2020-02-26 02:11:51 +01:00
Bartosz Taudul
fe80a7ed46
Retrieve symbol address on old androids.
2020-02-26 02:06:44 +01:00
Bartosz Taudul
abf8c42a7c
Send module name.
2020-02-26 00:33:09 +01:00
Bartosz Taudul
7d0dac9ae2
Store callstack frame module name.
2020-02-26 00:32:47 +01:00
Bartosz Taudul
4cf520db93
Unify copying symbol strings.
2020-02-26 00:02:30 +01:00
Bartosz Taudul
c5b2d14f8c
Send sampling period in welcome message.
2020-02-25 23:12:31 +01:00
Bartosz Taudul
2b7f5091f1
Store sampling period.
2020-02-25 23:08:52 +01:00
Bartosz Taudul
3402d16548
Send symbol base address.
2020-02-25 23:03:40 +01:00
Bartosz Taudul
85ffe0ea04
Don't search module list for kernel addresses.
2020-02-24 23:04:53 +01:00
Bartosz Taudul
ece32b47df
Zero capacity is invalid.
2020-02-24 23:04:53 +01:00
Bartosz Taudul
9c9e854005
Replace list with vector.
...
Maybe next time let's not forget that there's already a custom
allocating vector available.
2020-02-24 23:04:53 +01:00
Bartosz Taudul
24cd73e366
Fix linux tracing with long pids.
2020-02-23 18:23:53 +01:00
Bartosz Taudul
0fa1d25d98
Disable trace annotations.
2020-02-23 18:20:48 +01:00
Bartosz Taudul
02d200878d
Process queue data in-place.
2020-02-23 15:18:24 +01:00
Bartosz Taudul
96034bca3e
Force inline AppendData(), NeedDataSize().
2020-02-23 14:44:19 +01:00
Bartosz Taudul
bd34c24b84
Increase block size.
2020-02-23 12:35:30 +01:00
Bartosz Taudul
26b13abac8
Pre-fill module cache.
2020-02-22 21:32:18 +01:00
Bartosz Taudul
0a02cf32be
Add module name cache.
2020-02-22 21:32:10 +01:00
Bartosz Taudul
096e8cd8ae
Retrieve module name if symbol name cannot be found.
2020-02-22 21:06:32 +01:00
Bartosz Taudul
d0930e9053
Use maximum possible sampling rate.
2020-02-22 19:08:15 +01:00
Bartosz Taudul
4502858407
Use maximum possible etw buffer size (1MB).
2020-02-22 18:52:38 +01:00
Bartosz Taudul
e270603117
Don't write reference time to memory in each iteration.
2020-02-22 18:52:37 +01:00
Bartosz Taudul
054a6f8563
Send time deltas in callstack sample data packets.
2020-02-22 16:42:47 +01:00
Bartosz Taudul
1ee80e0df5
Send/free callstack sample payloads.
2020-02-22 16:20:43 +01:00
Bartosz Taudul
3b0ed5337b
Provide TraceSetInformation() definition for cygwin.
2020-02-22 16:03:07 +01:00
Bartosz Taudul
baf8e6fe80
No support for sampling on 32-bit windows.
...
Note that 32-bit applications running on 64-bit windows will perform
sampling.
2020-02-22 14:16:04 +01:00
Bartosz Taudul
23fe3e623d
64-bit only version of callstack payload sender.
2020-02-22 14:05:01 +01:00
Bartosz Taudul
9e9c7db5b1
Send sampled call stacks.
2020-02-22 13:42:09 +01:00
Bartosz Taudul
f186540c4f
Fix callstack pointers in 32-bit builds.
2020-02-22 13:38:09 +01:00
Bartosz Taudul
9b9474ada1
Request stack traces for execution sampling events.
2020-02-22 13:13:49 +01:00
Bartosz Taudul
28d0f387ea
Setup execution sampling profiling.
2020-02-22 13:13:32 +01:00
Bartosz Taudul
ad77b4f73b
Store current process id.
2020-02-22 13:11:16 +01:00
Bartosz Taudul
1f671fbacc
Keep sys trace variables local.
2020-02-22 13:08:35 +01:00
Bartosz Taudul
539ccf5a61
Check provider id in etw callback.
2020-02-22 12:56:33 +01:00
Bartosz Taudul
0b82902618
Optimize scalar DXT1 compression.
2020-02-15 13:43:40 +01:00
Bartosz Taudul
838c0aaaa9
Check if BUS_MCEERR_AR and BUS_MCEERR_AO are defined.
2020-02-12 01:27:03 +01:00
Bartosz Taudul
2c8d519d70
Fix typo.
2020-02-11 15:12:06 +01:00
Bartosz Taudul
abfa4c65df
Update fun list of iDevices.
2020-02-10 16:13:32 +01:00
Bartosz Taudul
8d5f4d7363
Always use init once to initialize rpmalloc.
2020-01-30 20:08:34 +01:00
Bartosz Taudul
885fa16373
Don't retrieve connection id, if zone is not active.
2020-01-25 17:21:30 +01:00
Bartosz Taudul
aa94df0845
Replace rpmalloc_thread_initialize with InitRPMallocThread().
2020-01-25 17:16:08 +01:00
Bartosz Taudul
ab2fbd6164
Move ParamaterSetup() implementation to header.
2020-01-25 16:51:17 +01:00
Bartosz Taudul
13370dc01c
Hide RtlWalkFrameChain inside library.
2020-01-25 16:49:29 +01:00
Bartosz Taudul
a90004b983
Move Set/GetThreadName() to Tracy API.
2020-01-25 16:36:58 +01:00
Bartosz Taudul
6f31eb2a9d
Disable MSVC idiocy.
2020-01-20 22:49:03 +01:00
Bartosz Taudul
55d03cb03e
Hide async queue setup/commit behind macros.
2020-01-19 15:06:11 +01:00
Bartosz Taudul
25082b2bec
Don't report CPU topology if delayed init is active.
...
Reporting topology requires producer to be available, which creates a
deadlock during delayed init data structures construction.
Calling GetProducer() results in a call to GetProfilerThreadData(),
which in turn calls GetProfilerData() to construct its thread local
variable. However, at this point we already are calling
GetProfilerData() (to construct the profiler itself). This would result
in an incorrect double construction of data, but the code already
prevents this by allowing init code to be entered only once. Hence the
deadlock.
Currently this is a non-issue, as no platform which can report CPU
topology needs to use delayed init.
2020-01-14 19:41:34 +01:00
Bartosz Taudul
4f8eb53e8b
Capture exact tid to pid mapping on windows.
2020-01-14 02:06:22 +01:00
Bartosz Taudul
4ef2ce4622
Fix _mm256_cvtsi256_si32 on gcc.
2019-12-12 02:13:12 +01:00
Bartosz Taudul
129b80ef0f
Free source location, if zone is not active.
2019-12-06 00:42:42 +01:00
Bartosz Taudul
b9cdf2cbb7
Expose srcloc allocation in C API.
2019-12-06 00:25:52 +01:00
Bartosz Taudul
399b87fecc
Add allocated srcloc zone begin emit functions to C API.
2019-12-06 00:22:49 +01:00
Bartosz Taudul
68ff33d0ba
Extract source location allocation functionality.
2019-12-06 00:15:46 +01:00
Bartosz Taudul
e8fcc250a1
Report CPU topology on Linux.
2019-11-30 01:51:29 +01:00
Bartosz Taudul
712403e9fd
Transfer, display, save CPU topology data.
2019-11-29 22:41:41 +01:00
Bartosz Taudul
59371eef5a
Obtain CPU topology on windows.
2019-11-29 18:29:31 +01:00
thedmd
a1e2c533f6
libbacktrace: Add support for Mach-O (dSYM)
...
`macho.cpp` was backported from official libbacktrace repository.
2019-11-29 12:04:47 +01:00
Bartosz Taudul
a7d2d5f08b
Fix DeferItem() call.
2019-11-26 01:10:50 +01:00
Bartosz Taudul
4551553eb4
Implement setting client parameters from server.
2019-11-25 23:59:48 +01:00
Bartosz Taudul
c5c9dfb0c9
Native callstacks are now optional in allocated callstack messages.
2019-11-25 22:54:10 +01:00
Bartosz Taudul
37eef59d54
Implement reading sys time on BSD.
2019-11-21 20:41:57 +01:00
Bartosz Taudul
c7a22cc1ff
Use libbacktrace on BSD.
2019-11-21 20:41:57 +01:00
Bartosz Taudul
bd7b0a8197
Support callstack capture on BSD.
2019-11-21 02:34:42 +01:00
Bartosz Taudul
c79449a6a1
Get proper program name on BSD.
2019-11-21 02:16:12 +01:00
Bartosz Taudul
7940977dba
Report physical memory size on BSD.
2019-11-21 02:14:08 +01:00
Bartosz Taudul
3854ae11b2
Revert "Remove dead code."
...
This reverts commit a36b73f745.
2019-11-17 17:38:02 +01:00
Bartosz Taudul
a36b73f745
Remove dead code.
2019-11-16 18:34:05 +01:00
Bartosz Taudul
8286b0b72f
Plumbing for message call stacks.
2019-11-14 23:40:41 +01:00
Bartosz Taudul
0befc75f83
Fix conflicts with X.h.
2019-11-14 18:24:29 +01:00
Bartosz Taudul
655864eb7c
Enable crash handler on cygwin.
...
Crash is properly recorded, but the profiler hangs while waiting for
shutdown finish.
2019-11-07 19:20:13 +01:00
Bartosz Taudul
3fd74a92f9
Native threads are used on mingw.
2019-11-07 19:02:54 +01:00
Bartosz Taudul
351e220d30
Don't calculate queue delay if delayed init is used.
...
Queue calibration requires queue access during profiler construction. This
in turn requires construction of profiler data block, *which at this point
is underway*, because the profiler is being constructed.
2019-06-19 17:29:04 +02:00
Bartosz Taudul
c98f1f0b6b
Make sure profiler is initialized only once in delayed init scenario.
2019-06-19 17:28:18 +02:00
Bartosz Taudul
d4f58ddaf3
Use native windows threads on cygwin, mingw.
2019-11-06 01:42:14 +01:00
Bartosz Taudul
ca198e44d3
Remove dead code from concurrentqueue.
2019-11-05 21:40:52 +01:00
Bartosz Taudul
b5590ed197
Include <mutex> for std::once.
2019-11-05 21:40:35 +01:00
Bartosz Taudul
3e9bb80217
More header cleanup.
2019-11-05 20:15:53 +01:00
Bartosz Taudul
6bbf273581
Partial header inclusion cleanup.
2019-11-05 20:09:40 +01:00
Bartosz Taudul
907574e637
Allow remote plot configuration.
2019-11-05 17:45:19 +01:00
Bartosz Taudul
f34609fd9b
Set per-cpu kernel buffer size to 512 KB.
...
The default setting was causing events to be lost on Android.
2019-11-03 21:52:20 +01:00
Bartosz Taudul
b8d459d48b
Use proper string size (for consistency).
...
On Android code path this value is ignored.
2019-11-03 21:51:49 +01:00
Bartosz Taudul
ca0fae33d1
Remove obsolete assert.
...
Before-terminate-events now include events that have time delta
processing, with no memory to free.
2019-11-01 20:10:24 +01:00
Bartosz Taudul
1f0c18882c
Don't collect sys time after application has exited.
2019-10-29 23:05:14 +01:00
Bartosz Taudul
0f2503d334
Send time deltas in GPU time events.
2019-10-25 19:52:01 +02:00
Bartosz Taudul
8fa5188176
Send delta times for context switches.
2019-10-25 19:13:11 +02:00
Bartosz Taudul
25b3cdc1ee
Send thread wakeups when handling disconnect request.
2019-10-25 18:22:42 +02:00
Bartosz Taudul
04b132b6e2
Check if requested data size doesn't overflow buffer.
2019-10-24 21:22:22 +02:00
Bartosz Taudul
ba61a9ed84
Transfer time deltas, not absolute times.
...
This change significantly reduces network bandwidth requirements.
Implemented for:
- CPU zones,
- GPU zones,
- locks,
- plots,
- memory events.
2019-10-24 00:06:41 +02:00
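The idea behind ba61a9ed84 can be shown with a minimal delta-encoding sketch. It is not Tracy's wire format: `encode_deltas` and `decode_deltas` are hypothetical helpers, the timestamps are made up, and the first delta is taken against an assumed reference time of 0.

```python
def encode_deltas(timestamps):
    """Replace each absolute time with its difference from the previous one."""
    prev = 0
    out = []
    for t in timestamps:
        out.append(t - prev)
        prev = t
    return out


def decode_deltas(deltas):
    """Rebuild absolute times by accumulating the deltas."""
    t = 0
    out = []
    for d in deltas:
        t += d
        out.append(t)
    return out


timestamps = [1_000_000_000_000, 1_000_000_000_250,
              1_000_000_000_900, 1_000_000_002_000]
deltas = encode_deltas(timestamps)

# After the first value, the deltas need far fewer significant bits
# than the absolute timestamps they replace.
bits_absolute = max(t.bit_length() for t in timestamps[1:])
bits_delta = max(d.bit_length() for d in deltas[1:])
```

Small values like these are cheaper to pack or compress than large, slowly changing absolute values. How the stream actually benefits from this is not described in the commit message.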
Bartosz Taudul
cf88265304
Full 64-bit register is set by rdtsc.
2019-10-21 01:13:55 +02:00
Bartosz Taudul
07b66cd4ab
Move fake source location out of loop.
2019-10-20 22:18:05 +02:00
Bartosz Taudul
909503403b
Simplify delay calibration.
2019-10-20 22:13:29 +02:00
Bartosz Taudul
c774534b47
Use rdtsc instead of rdtscp.
...
But rdtscp is serializing!
No, it's not. Quoting the Intel Instruction Set Reference:
"The RDTSCP instruction is not a serializing instruction, but it does
wait until all previous instructions have executed and all previous
loads are globally visible. But it does not wait for previous stores to
be globally visible, and subsequent instructions may begin execution
before the read operation is performed.",
"The RDTSC instruction is not a serializing instruction. It does not
necessarily wait until all previous instructions have been executed
before reading the counter. Similarly, subsequent instructions may begin
execution before the read operation is performed."
So, the difference is in waiting for prior instructions to finish
executing. Notice that even in the rdtscp case, execution of the
following instructions may commence before time measurement is finished
and data stores may be still pending.
But, you may say, Intel in its "How to Benchmark Code Execution Times"
document shows that using rdtscp is superior to rdtsc. Well, not
exactly. What they do show is that when a *single function* is
considered, there are ways to measure its execution time with little to
no error.
This is not what Tracy is doing.
In our case there is no way to determine absolute "this is before" and
"this is after" points of a zone, as we probably already are inside
another zone. Stopping the CPU execution, so that a deeply nested zone
may be measured with great precision, will skew the measurements of all
parent zones.
And this is not what we want to measure, anyway. We are not interested
in how a *single function* behaves, but how a *whole program* behaves.
The out-of-order CPU behavior may influence the measurements? Good! We
are interested in that. We want to see *how* the code is really
executed. How is *stopping* the CPU to make a timer read an appropriate
thing to do, when we want to see how a program is performing?
At least that's the theory.
And besides all that, the profiling overhead is now reduced.
2019-10-20 20:52:33 +02:00
Bartosz Taudul
30fc2f02ab
Omit calculation of on-stack variable address.
2019-10-20 19:42:29 +02:00
Bartosz Taudul
c3870f8837
Use proper type.
2019-10-10 20:30:08 +02:00
Bartosz Taudul
707f113bda
Add missing NOMINMAX definitions.
2019-10-10 20:29:06 +02:00
Bartosz Taudul
7cf3608493
Avoid unused variables.
2019-10-05 02:11:45 +02:00
Bartosz Taudul
e481b5ba22
Add missing thread sent indication.
2019-10-04 19:18:47 +02:00
Bartosz Taudul
9e1935f070
Make C API symbols visible across dlls.
2019-10-03 22:39:26 +02:00
Bartosz Taudul
130365f4ff
Inject tracy_systrace into filesystem and use instead of cat.
...
Statistics for a one-minute trace:
Capture tool   | Running time | Running regions
---------------+--------------+-----------------
cat            | 25.11 s      | 392,300
tracy_systrace | 10.41 s      | 12,249
2019-09-27 15:51:29 +02:00
Bartosz Taudul
3dba4088ee
Embed precompiled tracy_systrace for android.
2019-09-27 15:50:58 +02:00
Bartosz Taudul
e13cbf52fd
Allow changing tracy port in client.
2019-09-21 15:11:15 +02:00
Bartosz Taudul
a221f121ba
Extract lock state handling to a separate context class.
2019-09-21 14:55:14 +02:00
Bartosz Taudul
37661fd2ee
Fix 32 bit NEON version of DXT1 compression.
...
This reverts commit b32e8fa24e.
Apparently it is possible to receive non-uniform data in alpha channel, which
breaks the original assumption about not needing the mask. This seemed to be a
problem only on 32 bit NEON implementation of DXT1 compression. Other
implementations handle such data without degradation of visual output.
2019-09-03 21:37:07 +02:00
Bartosz Taudul
7a6564feae
Only recycle producers, if there's no data in queue.
...
("The queue" is per-thread partial queue here.)
This fixes a problem where one thread writes to the queue, then is
terminated, making the (partially filled) queue available for other
threads to recycle. If another thread re-owns the queue, it will change
the associated thread id, while part of the queue was filled by the
original thread. This obviously created invalid data during dequeue.
The fix makes the recycling process check not only for queue inactivity
(which is marked when the original thread terminates), but also if the
queue is empty, preventing mixing data from different threads.
2019-08-30 14:28:44 +02:00
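The recycling rule described in 7a6564feae can be sketched as below. This is a simplified single-threaded model, not the concurrent queue used by Tracy: `Producer`, `can_recycle` and `acquire` are hypothetical names, and a plain list stands in for the per-thread partial queue.

```python
class Producer:
    """Per-thread partial queue in a simplified, single-threaded model."""

    def __init__(self, thread_id):
        self.thread_id = thread_id
        self.items = []
        self.inactive = False   # set when the owning thread terminates


def can_recycle(p):
    # Inactivity alone is not enough: the queue must also be empty,
    # otherwise data from two threads would share one thread id.
    return p.inactive and not p.items


def acquire(pool, thread_id):
    """Reuse a recyclable producer if one exists, else create a new one."""
    for p in pool:
        if can_recycle(p):
            p.thread_id = thread_id
            p.inactive = False
            return p
    p = Producer(thread_id)
    pool.append(p)
    return p


pool = []
a = acquire(pool, 1)
a.items.append("zone")     # thread 1 writes, then terminates
a.inactive = True
b = acquire(pool, 2)       # must not take over a: it still holds data
a.items.clear()            # data has been dequeued
c = acquire(pool, 3)       # now a can be reused
```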
Bartosz Taudul
00b26c1acf
Fix TRACY_NO_SYSTEM_TRACING.
2019-08-26 18:02:10 +02:00
Bartosz Taudul
fbeee3cf61
Fix (?) invalid function pointer signature.
2019-08-26 17:59:58 +02:00
Bartosz Taudul
78127dc357
System threads only allow limited information queries.
2019-08-25 00:33:22 +02:00
Bartosz Taudul
deb59b4c38
Somehow fix event ordering.
2019-08-24 01:43:55 +02:00
Bartosz Taudul
1e74a89924
Check if there's data to read from kernel.
...
Reading from kernel pipe, while being a blocking operation, spin locks the
thread.
2019-08-24 01:06:21 +02:00
Bartosz Taudul
8f6e94d75c
Sleep if sys trace pipe buffer underruns.
2019-08-24 00:42:00 +02:00
Bartosz Taudul
2d50d07438
Allow completely disabling system tracing.
2019-08-21 01:16:25 +02:00
Bartosz Taudul
0cbb853945
Add missing SetThreadName() calls.
2019-08-20 16:23:00 +02:00
Bartosz Taudul
332262dd84
Shorter thread names.
2019-08-20 16:22:54 +02:00
Bartosz Taudul
247acd03ee
Kernel tracing on android.
2019-08-20 15:49:40 +02:00
Bartosz Taudul
e427d67347
Don't bail out if unimportant variables are not available.
2019-08-20 12:19:05 +02:00
Bartosz Taudul
bfda30be0b
Use su on android to set tracing variables.
2019-08-20 12:18:46 +02:00
Bartosz Taudul
9d87a8394d
Add missing getline() implementation for android API < 18.
2019-08-19 15:26:09 +02:00
Bartosz Taudul
9be6f4a414
Fix typo.
2019-08-19 13:03:37 +02:00
Bartosz Taudul
d209bb4d01
Add missing function pointer checks.
2019-08-19 12:47:27 +02:00
Bartosz Taudul
20e8a5ecc8
Create tid to pid mapping.
2019-08-17 22:32:41 +02:00
Bartosz Taudul
678e942e9f
Transfer PID of profiled program.
2019-08-17 22:19:04 +02:00
Bartosz Taudul
77c636c3fd
Retrieve module name for threads with no names on windows.
2019-08-17 21:24:40 +02:00
Bartosz Taudul
f7589bde02
Trace thread wakeups on linux.
2019-08-17 17:18:11 +02:00
Bartosz Taudul
414f903cc5
Collect thread wakeup data.
2019-08-17 17:05:29 +02:00
Bartosz Taudul
e9080bdbcd
Hardcode windows PID 4 as "System".
2019-08-17 03:44:47 +02:00
Bartosz Taudul
40eb8a5a03
Proper check for invalid handle.
2019-08-17 03:44:11 +02:00
Bartosz Taudul
6c1dd8eaec
Cast thread handle to DWORD.
2019-08-16 21:21:37 +02:00
Bartosz Taudul
d7104c752a
Cygwin compat layer.
2019-08-16 21:16:04 +02:00
Bartosz Taudul
819ef2a82b
External process/thread name retrieval on linux.
2019-08-16 21:00:42 +02:00
Bartosz Taudul
e975c4d7bf
Also retrieve external thread names.
2019-08-16 19:49:16 +02:00
Bartosz Taudul
fe7f56b022
Implement retrieval of external process names.
2019-08-16 19:22:23 +02:00
Bartosz Taudul
83fddd9aa6
Fix unicode builds.
2019-08-16 13:09:27 +02:00
Bartosz Taudul
9d5240c597
Mutable char array is required here due to shit API design.
2019-08-16 13:03:20 +02:00
Bartosz Taudul
14a373a3b8
Add number of CPU cores to host info.
2019-08-15 02:28:35 +02:00
Bartosz Taudul
69077e4e6f
Finish sending context switches during disconnect.
2019-08-14 23:06:13 +02:00
Bartosz Taudul
6dc79cf14e
Cosmetics.
2019-08-14 23:05:58 +02:00
Bartosz Taudul
c0b524d8de
Add a separate method for clearing serial queue.
2019-08-14 22:39:12 +02:00
Bartosz Taudul
71b54dd48a
Always collect thread names.
...
This fixes an issue when a thread was destroyed before its name could be
retrieved.
2019-08-14 16:52:04 +02:00
Bartosz Taudul
5e199d1ab3
Support ftrace on ARM.
2019-08-14 16:28:54 +02:00
Bartosz Taudul
5fbb811f5d
Degrade ARM timer to monotonic raw clock.
...
The monotonic raw clock has the same accuracy as reading cntvct registers, but
using clock_gettime() has a measurable impact on queueing time (135 us vs
83 us).
This change is needed to enable ftrace time readings on ARM linux, which
doesn't provide any way to get raw cntvct readings, like x86-tsc on x86.
2019-08-14 16:19:02 +02:00
Bartosz Taudul
42865d7c7b
Don't set x86-tsc clock on non-x86 platforms.
2019-08-14 15:14:36 +02:00
Bartosz Taudul
54a9132bb5
Skip context switch events in on demand mode, if no connection.
2019-08-14 15:09:33 +02:00
Bartosz Taudul
602c38c6c0
Allow checking timer implementation.
2019-08-14 14:35:44 +02:00
Bartosz Taudul
3988b56c92
Capture context switches on linux.
2019-08-14 13:56:15 +02:00
Bartosz Taudul
92b6da7cc2
SetThreadName() only works on the current thread.
...
This breaking change is required, because kernel trace facilities use
kernel thread ids, which are inaccessible from the pthread_t level.
2019-08-14 02:22:45 +02:00
Bartosz Taudul
73cbf2eead
Use windows thread ids on cygwin.
2019-08-13 16:22:58 +02:00
Bartosz Taudul
b313e46139
Keep event trace properties to terminate trace on exit.
2019-08-13 13:10:37 +02:00
Bartosz Taudul
90d26cb1b6
Collect and send context switch events.
2019-08-13 02:35:32 +02:00
Bartosz Taudul
fe0f1aea07
Add system tracing skeleton.
2019-08-12 23:05:34 +02:00
Bartosz Taudul
8aa0be39d5
Drop support for CPU id queries.
2019-08-12 23:05:34 +02:00
Bartosz Taudul
d6f32a0839
Serialize lock processing.
...
This makes it much easier to process on the server and opens new
optimization possibilities. It also fixes theoretical problems, which
may be caused by invalid ordering of events with the same timestamp.
2019-08-12 13:51:01 +02:00
Bartosz Taudul
0431c03556
Add serial queue interface.
2019-08-12 13:27:15 +02:00
Bartosz Taudul
4d2c7899ab
Allow skipping invariant TSC check.
2019-08-08 19:21:39 +02:00
Bartosz Taudul
3a221dafde
Display error messages on console, if available.
2019-08-08 19:18:05 +02:00
Bartosz Taudul
aada588129
Proper buffer reset.
2019-08-04 17:48:19 +02:00
Rokas Kupstys
b391e4c21a
Fix multiple build errors when compiling with MinGW.
2019-08-04 15:49:46 +03:00
Bartosz Taudul
12969ee497
Track thread context.
...
This change exploits the fact that events are processed in batches
originating from a single thread. A single message changing thread
context is enough to handle multiple messages, as opposed to inclusion
of thread identifier in each message.
2019-08-02 20:18:08 +02:00
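The batching idea in the commit above can be illustrated with a small sketch. This is not Tracy's wire format or code; the message tuples and the `encode`/`decode` helpers are hypothetical names, and the only assumption taken from the commit message is that events arrive in batches from a single thread, so one context-switching message per batch can replace a thread id in every message.

```python
# Illustrative sketch only: hypothetical message tuples, not Tracy's protocol.

def encode(batches):
    """Emit one thread-context message per batch, then events without thread ids."""
    stream = []
    for tid, events in batches:
        stream.append(("thread_context", tid))
        stream.extend(("event", e) for e in events)
    return stream

def decode(stream):
    """Re-attach the thread id by tracking the most recent thread-context message."""
    current_tid = None
    out = []
    for kind, payload in stream:
        if kind == "thread_context":
            current_tid = payload
        else:
            out.append((current_tid, payload))
    return out

# Two batches, each originating from a single thread.
batches = [(1, ["a", "b"]), (2, ["c"])]
stream = encode(batches)
decoded = decode(stream)
```

With this layout the per-event messages carry no thread identifier at all; the cost is one extra message per batch rather than one field per event.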
Bartosz Taudul
a4e7a341c0
Proper handling of disconnect request.
2019-08-01 23:14:09 +02:00
Bartosz Taudul
ca3571fd2b
Still more.
2019-07-30 01:30:31 +02:00
Bartosz Taudul
47423e6263
And more.
2019-07-30 01:29:13 +02:00
Bartosz Taudul
d3783ae359
Remove magic template syntax.
2019-07-30 01:28:21 +02:00
Bartosz Taudul
9c28b82954
RPMallocInit and RPMallocThreadInit are identical.
2019-07-30 01:15:14 +02:00
Bartosz Taudul
a6a3f45810
Fill in thread id during dequeue, not during enqueue.
2019-07-30 01:15:14 +02:00
Bartosz Taudul
142ef53b42
Dequeue items from a single thread.
2019-07-29 23:44:08 +02:00
Bartosz Taudul
c7f769c52b
Allow dequeuing from a single producer, retrieving thread id.
2019-07-29 23:29:30 +02:00
Bartosz Taudul
6cad76ae67
Store thread id in queue producer.
2019-07-29 23:13:06 +02:00
Bartosz Taudul
7ae9a28e32
Drop BlockingConcurrentQueue.
2019-07-29 22:58:13 +02:00
Bartosz Taudul
480a427e07
No need to hash thread ids anymore.
2019-07-29 22:36:04 +02:00
Bartosz Taudul
c60af95053
Remove unused const.
2019-07-29 22:33:32 +02:00
Bartosz Taudul
2d42abf552
Remove CannotAlloc functions.
2019-07-29 22:31:32 +02:00
Bartosz Taudul
b142860c8d
More implicit producer removal.
2019-07-29 22:29:39 +02:00
Bartosz Taudul
db6eceb1a6
Producers must be explicit.
2019-07-29 22:25:28 +02:00
Bartosz Taudul
89928fde7b
Queue must be always able to alloc.
2019-07-29 22:13:16 +02:00
Bartosz Taudul
a03734afa6
Remove more debug code.
2019-07-29 22:01:06 +02:00
Bartosz Taudul
e9a0145cd5
Remove MCDBGQ_NOLOCKFREE_IMPLICITPRODBLOCKINDEX.
2019-07-29 21:56:53 +02:00
Bartosz Taudul
b496f1ff90
Remove MOODYCAMEL_QUEUE_INTERNAL_DEBUG.
2019-07-29 21:52:49 +02:00
Bartosz Taudul
beaadc3a56
Remove always disabled MCDBGQ_TRACKMEM code.
2019-07-29 21:51:29 +02:00
Bartosz Taudul
82a4a6d9cc
Add tracy_ prefix to concurrentqueue.h file name.
2019-07-29 21:47:50 +02:00
Bartosz Taudul
276d764141
Fix cygwin.
2019-07-26 00:02:57 +02:00
Bartosz Taudul
36de7b2cc7
Fix incomplete headers.
2019-07-25 23:41:42 +02:00
Bartosz Taudul
e659220602
Use generic std::call_once() on other platforms.
2019-07-25 23:30:47 +02:00
Bartosz Taudul
d31d1f5946
Detect and report clang-cl.
2019-07-25 19:03:58 +02:00
Bartosz Taudul
092e830264
Use shifts instead of const vector and.
2019-07-22 19:56:47 +02:00
Bartosz Taudul
178dc9eba7
Combine block data directly in AVX registers.
2019-07-20 14:52:34 +02:00
Bartosz Taudul
a6300ef7d1
Ditto on ARM.
2019-07-19 22:13:56 +02:00
Bartosz Taudul
dc49f2f76a
Move DXT1 index conversion to server.
2019-07-19 21:46:58 +02:00
Bartosz Taudul
11ba77ced5
Use pthread_once() to initialize rpmalloc on linux.
2019-07-19 20:15:56 +02:00
Bartosz Taudul
4c28593031
Fix races in rpmalloc initialization.
...
Ensure rpmalloc_thread_initialize() in worker threads is called only after
rpmalloc_initialize() was called on the main profiler thread.
2019-07-19 19:25:27 +02:00
Bartosz Taudul
cef8124247
Replace or with addition to enable usra instruction.
2019-07-19 01:40:27 +02:00
Bartosz Taudul
fd4689a6e2
Don't perform unnecessary ands.
2019-07-19 01:19:52 +02:00
Bartosz Taudul
f65373ece7
Replace two packs with one shuffle.
2019-07-13 20:01:12 +02:00
Bartosz Taudul
fc83f97ad3
Same for AVX/SSE.
2019-07-13 19:34:08 +02:00
Bartosz Taudul
62a167541c
No need to mask out indices.
2019-07-13 19:07:25 +02:00
Alex
0c5ea710b0
Merged in z33ky/tracy/const-frame-image (pull request #37)
...
Constify frame-image pointer in API.
2019-07-13 13:09:21 +00:00
Bartosz Taudul
7bb9549e84
ARM64 specific NEON implementation of DXT1 compression.
2019-07-13 14:31:33 +02:00
Alexander 'z33ky' Hirsch
c6e8dc8d63
Constify frame-image pointer in API.
2019-07-13 12:33:55 +02:00
Bartosz Taudul
60d2384a6a
Allow sending application information messages.
2019-07-12 18:34:46 +02:00
Bartosz Taudul
a1ce5fc1f6
Add include for built-in __get_cpuid() on gcc/clang.
2019-07-10 02:09:19 +02:00
Bartosz Taudul
c164a70b9d
Check for rdtscp/invariant tsc support.
2019-07-10 02:04:14 +02:00
Bartosz Taudul
c0670848d2
Reuse variable.
2019-07-08 02:08:06 +02:00
Bartosz Taudul
17dbbe67de
Remove dependency on range subtraction.
2019-07-08 00:14:36 +02:00
Bartosz Taudul
af1bd3e1fa
Faster horizontal add.
2019-07-07 23:57:23 +02:00
Bartosz Taudul
b32e8fa24e
Ditto for NEON.
2019-07-06 00:18:53 +02:00
Bartosz Taudul
d236d4b70f
Ditto for AVX2.
2019-07-06 00:05:32 +02:00
Bartosz Taudul
f62b21c21d
Masking alpha out is not needed.
...
We assume that alpha value is constant for the whole image. The range
calculation is max - min, so alpha zeroes out. The color normalization
to range is color - min, so alpha also zeroes out here.
2019-07-05 23:58:19 +02:00
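The arithmetic behind the commit above can be checked with a small worked example. This is a sketch under the commit's stated assumption that alpha is constant across the image; the pixel values and helper names are made up for illustration and do not come from Tracy's compressor.

```python
# Illustrative sketch only: a hypothetical 4x4 block of RGBA pixels.
# Alpha is constant, as the commit message assumes.
ALPHA = 255
block = [((i * 16) % 256, (i * 7) % 256, (i * 3) % 256, ALPHA) for i in range(16)]

def channel_min(pixels):
    """Per-channel minimum over all pixels."""
    return tuple(min(p[c] for p in pixels) for c in range(4))

def channel_max(pixels):
    """Per-channel maximum over all pixels."""
    return tuple(max(p[c] for p in pixels) for c in range(4))

lo = channel_min(block)
hi = channel_max(block)

# Range is max - min: a constant alpha gives ALPHA - ALPHA == 0.
rng = tuple(h - l for h, l in zip(hi, lo))

# Normalization is color - min: a constant alpha again gives 0.
normalized = [tuple(v - l for v, l in zip(p, lo)) for p in block]
```

Because both subtractions already cancel the alpha channel, an explicit mask would be redundant as long as the constant-alpha assumption holds.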
Bartosz Taudul
03189a30b8
Two ands less in NEON DXT1 compression.
2019-07-05 18:37:25 +02:00
Bartosz Taudul
275d992cb1
Two ands less in AVX2 DXT1 compression.
2019-07-05 18:22:42 +02:00
Bartosz Taudul
c89358d6b9
Two ands less in SSE DXT1 compression.
2019-07-05 18:17:50 +02:00
Bartosz Taudul
5bfc62f1bf
iOS device name decoding.
2019-06-19 09:59:46 +02:00
Bartosz Taudul
59b4f84ce5
Display unknown implementer, part as hex values.
2019-07-03 21:18:17 +02:00
Bartosz Taudul
c6f6c368b2
Decode ARM CPU names.
2019-07-03 21:01:34 +02:00
Bartosz Taudul
e26ab8e9f6
Make forwarding functions more compact.
2019-07-03 18:05:38 +02:00
Bartosz Taudul
bdfb568742
Fix div tables for max range on all channels.
2019-07-01 12:31:06 +02:00
Bartosz Taudul
684a119a2c
Fix order of checks for including intrinsics.
2019-07-01 11:45:16 +02:00
Bartosz Taudul
983c48994b
Write block data directly to memory.
2019-06-30 11:44:32 +02:00
Bartosz Taudul
9b8c18f99e
Improve readability.
2019-06-30 11:44:00 +02:00
Bartosz Taudul
52b6bdb55a
Force inline ProcessRGB functions.
2019-06-30 03:33:14 +02:00
Bartosz Taudul
8c06f7288c
AVX2 DXT1 compression.
2019-06-30 03:20:58 +02:00
Bartosz Taudul
2e893bba91
Use division tables.
2019-06-29 12:16:49 +02:00
Bartosz Taudul
ab9f036f5e
Integrate CheckSolid into ProcessRGB.
2019-06-29 02:04:08 +02:00
Bartosz Taudul
faf6bb97a4
DXT1 NEON color index packing.
2019-06-28 22:36:44 +02:00
Bartosz Taudul
2df1eaaa7e
Pack color indices using SSE.
2019-06-28 21:58:10 +02:00
Bartosz Taudul
fcb5b4b888
NEON DXT1 compression.
2019-06-28 14:24:16 +02:00
Bartosz Taudul
e8d4ba492b
Unify shifts.
2019-06-28 13:05:32 +02:00
Bartosz Taudul
be4900c822
NEON CheckSolid.
2019-06-28 01:47:04 +02:00
Bartosz Taudul
3c066f1527
Simplify code.
2019-06-27 22:40:03 +02:00
Bartosz Taudul
72a0d4c2ab
Rest of SSE DXTC compression.
2019-06-27 22:29:44 +02:00
Bartosz Taudul
137b28e110
SSE CheckSolid.
2019-06-27 22:29:44 +02:00
Bartosz Taudul
3d590b6b8c
Initialize rpmalloc in compression thread.
2019-06-27 19:14:51 +02:00
Bartosz Taudul
1939c31165
Experimental DXT1 compressor.
2019-06-27 19:14:51 +02:00
Bartosz Taudul
79eb1b9029
Swap queue and dequeue only if queue has contents.
2019-06-27 13:37:09 +02:00
Bartosz Taudul
bb35f9a897
Compress frame images in a separate thread.
2019-06-27 13:24:35 +02:00
Bartosz Taudul
7ebd2162c6
Add ETC1 compression thread.
2019-06-26 22:57:24 +02:00
Bartosz Taudul
f565e11976
Store frame images in queue.
2019-06-26 22:52:24 +02:00
Bartosz Taudul
281dcf7c1f
Cast to proper types.
2019-06-26 19:33:37 +02:00
Bartosz Taudul
8ce41b3543
Proper init order of thread local thread handle.
2019-06-26 19:32:52 +02:00
Bartosz Taudul
bc7f2c49c8
GetThreadHandle() might be used by application's code.
2019-06-25 15:44:49 +02:00
Bartosz Taudul
c749a2e3fe
Add C API for plots and messages.
2019-06-24 21:03:39 +02:00
Bartosz Taudul
48e08acb62
Add C API for frame markup.
2019-06-24 21:03:39 +02:00
Bartosz Taudul
ee99ce833c
Implement memory allocation tracking for C API.
2019-06-24 21:03:39 +02:00
Bartosz Taudul
281477f7f9
Tokens must be retrieved for each enqueue.
2019-06-24 20:12:14 +02:00
Bartosz Taudul
06a41708a7
Move TLS accesses close together.
2019-06-24 19:38:44 +02:00
Bartosz Taudul
c4f0965851
Don't use cached thread id to retrieve main thread id.
2019-06-24 19:38:07 +02:00
Bartosz Taudul
a56c47a6a0
Store thread handle in a thread local variable.
...
This saves us a non-inlineable function call. Thread local block is
accessed anyway, since we need to get the token, so we already have the
pointer and don't need to get it a second time (which is done inside
Windows' GetCurrentThreadId()). We also don't need to store the thread
id in ScopedZone anymore, as it was a micro-optimization to save us the
second GetThreadHandle() call.
This change has a measurable effect of reducing enqueue time from ~10 to
~8 ns.
A further optimization would be to completely skip thread handle
retrieval during zone capture and do it instead on retrieval of data
from the queue. Since each thread has its own producer ("token"), the
thread handle should be accessible during the dequeue operation. This is
a much more invasive change that would require a) modification of the
queue, b) additional processing of dequeued data to inject the thread
handle.
2019-06-24 19:19:47 +02:00
Bartosz Taudul
fd9fc880a6
Send current time in on-demand welcome message.
2019-06-21 19:39:41 +02:00
Bartosz Taudul
5309e6d94a
Broadcast client activity time.
2019-06-18 20:46:12 +02:00
Bartosz Taudul
aa5259b20a
Use the same port (8086) for both TCP and UDP traffic.
2019-06-18 20:28:03 +02:00
Bartosz Taudul
0e5a7263d9
Define broadcast message, add versioning.
2019-06-18 20:26:40 +02:00
Bartosz Taudul
0b394c3f53
Don't need to keep last broadcast time in Profiler class.
2019-06-18 20:15:09 +02:00
Bartosz Taudul
11dc8e67e5
Change broadcast rate from 5s to 3s.
2019-06-17 19:57:17 +02:00
Bartosz Taudul
6bf8081f5b
Remove debug leftovers.
2019-06-17 19:52:44 +02:00
Bartosz Taudul
de058d2a0d
Don't hardcode broadcast port.
2019-06-17 18:37:34 +02:00
Bartosz Taudul
1b3b3a94a2
Broadcast protocol version and process name.
2019-06-17 18:34:35 +02:00
Bartosz Taudul
0b9ef7e514
Disable broadcast if TRACY_NO_BROADCAST is defined.
2019-06-17 18:18:58 +02:00
Bartosz Taudul
e609c0fdce
UDP broadcast loop.
2019-06-17 02:25:09 +02:00
Bartosz Taudul
014c3ed63b
Use non-reference, optimized NEON ETC1 compression.
2019-06-15 15:35:57 +02:00
Bartosz Taudul
ab4e99229d
Indicate whether client is running on apple shitware.
2019-06-13 14:05:15 +02:00
Bartosz Taudul
e5d5abf59a
Add NEON path for ETC1 compression.
2019-06-13 02:04:19 +02:00
Bartosz Taudul
d3e0163dd4
Add byteswap for apple.
2019-06-12 16:54:44 +02:00
Bartosz Taudul
37d1457b44
Frame image may need flipping.
2019-06-12 15:28:32 +02:00
Bartosz Taudul
04dd33f5c4
Fix mismatched linkage.
2019-06-11 23:51:12 +02:00
Rokas K. (rku)
c4e05b6264
Merged in rokups/tracy/dllimport-cleanup (pull request #36)
...
Clean up imported functions in multi-dll projects.
Approved-by: Till Rathmann <till.rathmann@gmx.de>
2019-06-11 15:04:34 +00:00
Bartosz Taudul
57b8b425ba
Discard send buffer data after disconnect.
2019-06-10 02:11:29 +02:00
Bartosz Taudul
80dff1ede1
Add connection id for on-demand mode.
...
Long-lived zones could send their end events without begin events in the
following scenario:
1. On-demand connection is made.
2. Zone begin is emitted, m_active is set to true.
3. Connection is terminated.
4. A new connection is made.
5. Zone end is emitted, because m_active is true.
Until now it was assumed that all zone end events would happen before
a new connection is made, but that is not necessarily true.
2019-06-09 17:15:47 +02:00
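The scenario in the commit above can be reproduced with a small model. This is a sketch, not Tracy's implementation: the `Profiler` and `Zone` classes and their fields are hypothetical, and the only details taken from the commit message are the per-zone active flag and the idea of tagging a zone with the id of the connection it began on.

```python
# Illustrative sketch only: hypothetical classes modelling the five-step scenario.

class Profiler:
    def __init__(self):
        self.connected = False
        self.connection_id = 0
        self.events = []

    def connect(self):
        # Each new connection gets a fresh id and a fresh event stream.
        self.connection_id += 1
        self.connected = True
        self.events = []

    def disconnect(self):
        self.connected = False

class Zone:
    def __init__(self, profiler, name):
        self.profiler = profiler
        self.name = name
        self.active = profiler.connected
        # Remember which connection the begin event was sent on.
        self.connection_id = profiler.connection_id
        if self.active:
            profiler.events.append(("begin", name))

    def end(self):
        if not self.active:
            return
        # Skip the end event if the begin went to a different connection.
        if self.profiler.connection_id != self.connection_id:
            return
        self.profiler.events.append(("end", self.name))

def long_lived_zone_scenario():
    p = Profiler()
    p.connect()            # 1. on-demand connection is made
    z = Zone(p, "long")    # 2. zone begin is emitted, zone is active
    p.disconnect()         # 3. connection is terminated
    p.connect()            # 4. a new connection is made
    z.end()                # 5. zone end is suppressed by the id check
    return p.events

def single_connection_scenario():
    p = Profiler()
    p.connect()
    z = Zone(p, "short")
    z.end()
    return p.events
```

Without the connection id comparison in `end()`, the second connection in `long_lived_zone_scenario()` would receive an end event with no matching begin.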
Bartosz Taudul
0db9c73d76
Immediately react to connection termination.
2019-06-09 16:51:39 +02:00
Bartosz Taudul
cc5bad294a
More strict memory ordering for on-demand connection status.
2019-06-09 16:48:00 +02:00
Bartosz Taudul
e2d42fae2f
We're done here, don't try to send termination request.
2019-06-09 16:25:52 +02:00
Bartosz Taudul
496f866add
Don't send data when connection is terminated.
...
There are only two cases in which HandleServerQuery() returns false:
either data can't be read from the socket (which is checked by the HasData()
call before calling HandleServerQuery()), or the server sent a
termination query. In both cases there's no need to send data
anymore.
2019-06-09 16:19:40 +02:00
Bartosz Taudul
23e7850162
Make DequeueStatus enum class.
2019-06-09 16:14:30 +02:00
Bartosz Taudul
34d89d39a1
Prevent double freeing of socket.
2019-06-09 16:10:49 +02:00
Bartosz Taudul
139299389b
Add comments to client connection handling.
2019-06-09 16:10:49 +02:00
Bartosz Taudul
4c2ff80ac8
Restore frame counting for on-demand mode.
2019-06-09 15:23:01 +02:00
Bartosz Taudul
00a468162d
Fix signed/unsigned comparison.
2019-06-08 00:57:25 +02:00
Bartosz Taudul
9ef128995a
Add AVX2 version of etcpak.
2019-06-08 00:50:39 +02:00
Bartosz Taudul
7e9539ef2d
AVX implies SSE 4.1.
2019-06-08 00:39:19 +02:00
Bartosz Taudul
784c4da53a
Include frame offset in frame image message.
2019-06-07 20:09:29 +02:00
Rokas Kupstys
9bd1037347
Clean up imported functions in multi-dll projects.
2019-06-07 19:50:08 +03:00
Bartosz Taudul
d271634a95
Keep one ETC1 compression buffer.
2019-06-07 01:29:24 +02:00