Bartosz Taudul
83fddd9aa6
Fix unicode builds.
2019-08-16 13:09:27 +02:00
Bartosz Taudul
9d5240c597
Mutable char array is required here due to shit API design.
2019-08-16 13:03:20 +02:00
Bartosz Taudul
14a373a3b8
Add number of CPU cores to host info.
2019-08-15 02:28:35 +02:00
Bartosz Taudul
69077e4e6f
Finish sending context switches during disconnect.
2019-08-14 23:06:13 +02:00
Bartosz Taudul
6dc79cf14e
Cosmetics.
2019-08-14 23:05:58 +02:00
Bartosz Taudul
c0b524d8de
Add a separate method for clearing serial queue.
2019-08-14 22:39:12 +02:00
Bartosz Taudul
71b54dd48a
Always collect thread names.
...
This fixes an issue when a thread was destroyed before its name could be
retrieved.
2019-08-14 16:52:04 +02:00
Bartosz Taudul
5e199d1ab3
Support ftrace on ARM.
2019-08-14 16:28:54 +02:00
Bartosz Taudul
5fbb811f5d
Degrade ARM timer to monotonic raw clock.
...
The monotonic raw clock has the same accuracy as reading cntvct registers, but
using clock_gettime() has a measurable impact on queueing time (135 us vs
83 us).
This change is needed to enable ftrace time readings on ARM linux, which
doesn't provide any way to get raw cntvct readings, like x86-tsc on x86.
2019-08-14 16:19:02 +02:00
Bartosz Taudul
42865d7c7b
Don't set x86-tsc clock on non-x86 platforms.
2019-08-14 15:14:36 +02:00
Bartosz Taudul
54a9132bb5
Skip context switch events in on demand mode, if no connection.
2019-08-14 15:09:33 +02:00
Bartosz Taudul
602c38c6c0
Allow checking timer implementation.
2019-08-14 14:35:44 +02:00
Bartosz Taudul
3988b56c92
Capture context switches on linux.
2019-08-14 13:56:15 +02:00
Bartosz Taudul
92b6da7cc2
SetThreadName() only works on the current thread.
...
This breaking change is required, because kernel trace facilities use
kernel thread ids, which are inaccessible from the pthread_t level.
2019-08-14 02:22:45 +02:00
Bartosz Taudul
73cbf2eead
Use windows thread ids on cygwin.
2019-08-13 16:22:58 +02:00
Bartosz Taudul
b313e46139
Keep event trace properties to terminate trace on exit.
2019-08-13 13:10:37 +02:00
Bartosz Taudul
90d26cb1b6
Collect and send context switch events.
2019-08-13 02:35:32 +02:00
Bartosz Taudul
fe0f1aea07
Add system tracing skeleton.
2019-08-12 23:05:34 +02:00
Bartosz Taudul
8aa0be39d5
Drop support for CPU id queries.
2019-08-12 23:05:34 +02:00
Bartosz Taudul
d6f32a0839
Serialize lock processing.
...
This makes is much easier to process on the server and opens new
optimization possibilities. It also fixes theoretical problems, which
may be caused by invalid ordering of events with the same timestamp.
2019-08-12 13:51:01 +02:00
Bartosz Taudul
0431c03556
Add serial queue interface.
2019-08-12 13:27:15 +02:00
Bartosz Taudul
4d2c7899ab
Allow skipping invariant TSC check.
2019-08-08 19:21:39 +02:00
Bartosz Taudul
3a221dafde
Display error messages on console, if available.
2019-08-08 19:18:05 +02:00
Bartosz Taudul
aada588129
Proper buffer reset.
2019-08-04 17:48:19 +02:00
Rokas Kupstys
b391e4c21a
Fix multiple build errors when compiling with MinGW.
2019-08-04 15:49:46 +03:00
Bartosz Taudul
12969ee497
Track thread context.
...
This change exploits the fact that events are processed in batches
originating from a single thread. A single message changing thread
context is enough to handle multiple messages, as opposed to inclusion
of thread identifier in each message.
2019-08-02 20:18:08 +02:00
Bartosz Taudul
a4e7a341c0
Proper handling of disconnect request.
2019-08-01 23:14:09 +02:00
Bartosz Taudul
ca3571fd2b
Still more.
2019-07-30 01:30:31 +02:00
Bartosz Taudul
47423e6263
And more.
2019-07-30 01:29:13 +02:00
Bartosz Taudul
d3783ae359
Remove magic template syntax.
2019-07-30 01:28:21 +02:00
Bartosz Taudul
9c28b82954
RPMallocInit and RPMallocThreadInit are identical.
2019-07-30 01:15:14 +02:00
Bartosz Taudul
a6a3f45810
Fill in thread id during dequeue, not during enqueue.
2019-07-30 01:15:14 +02:00
Bartosz Taudul
142ef53b42
Dequeue items from a single thread.
2019-07-29 23:44:08 +02:00
Bartosz Taudul
c7f769c52b
Allow dequeuing from a single producer, retrieving thread id.
2019-07-29 23:29:30 +02:00
Bartosz Taudul
6cad76ae67
Store thread id in queue producer.
2019-07-29 23:13:06 +02:00
Bartosz Taudul
7ae9a28e32
Drop BlockingConcurrentQueue.
2019-07-29 22:58:13 +02:00
Bartosz Taudul
480a427e07
No need to hash thread ids anymore.
2019-07-29 22:36:04 +02:00
Bartosz Taudul
c60af95053
Remove unused const.
2019-07-29 22:33:32 +02:00
Bartosz Taudul
2d42abf552
Remove CannoAlloc functions.
2019-07-29 22:31:32 +02:00
Bartosz Taudul
b142860c8d
More implicit producer removal.
2019-07-29 22:29:39 +02:00
Bartosz Taudul
db6eceb1a6
Producers must be explicit.
2019-07-29 22:25:28 +02:00
Bartosz Taudul
89928fde7b
Queue must be always able to alloc.
2019-07-29 22:13:16 +02:00
Bartosz Taudul
a03734afa6
Remove more debug code.
2019-07-29 22:01:06 +02:00
Bartosz Taudul
e9a0145cd5
Remove MCDBGQ_NOLOCKFREE_IMPLICITPRODBLOCKINDEX.
2019-07-29 21:56:53 +02:00
Bartosz Taudul
b496f1ff90
Remove MOODYCAMEL_QUEUE_INTERNAL_DEBUG.
2019-07-29 21:52:49 +02:00
Bartosz Taudul
beaadc3a56
Remove always disabled MCDBGQ_TRACKMEM code.
2019-07-29 21:51:29 +02:00
Bartosz Taudul
82a4a6d9cc
Add tracy_ prefix to concurrentqueue.h file name.
2019-07-29 21:47:50 +02:00
Bartosz Taudul
276d764141
Fix cygwin.
2019-07-26 00:02:57 +02:00
Bartosz Taudul
36de7b2cc7
Fix incomplete headers.
2019-07-25 23:41:42 +02:00
Bartosz Taudul
e659220602
Use generic std::call_once() on other platforms.
2019-07-25 23:30:47 +02:00
Bartosz Taudul
d31d1f5946
Detect and report clang-cl.
2019-07-25 19:03:58 +02:00
Bartosz Taudul
092e830264
Use shifts instead of const vector and.
2019-07-22 19:56:47 +02:00
Bartosz Taudul
178dc9eba7
Combine block data directly in AVX registers.
2019-07-20 14:52:34 +02:00
Bartosz Taudul
a6300ef7d1
Ditto on ARM.
2019-07-19 22:13:56 +02:00
Bartosz Taudul
dc49f2f76a
Move DXT1 index conversion to server.
2019-07-19 21:46:58 +02:00
Bartosz Taudul
11ba77ced5
Use pthread_once() to initialize rpmalloc on linux.
2019-07-19 20:15:56 +02:00
Bartosz Taudul
4c28593031
Fix races in rpmalloc initialization.
...
Ensure rpmalloc_thread_initialize() int worker threads is called only after
rpmalloc_initialize() was called on the main profiler thread.
2019-07-19 19:25:27 +02:00
Bartosz Taudul
cef8124247
Replace or with addition to enable usra instruction.
2019-07-19 01:40:27 +02:00
Bartosz Taudul
fd4689a6e2
Don't perform unnecessary ands.
2019-07-19 01:19:52 +02:00
Bartosz Taudul
f65373ece7
Replace two packs with one shuffle.
2019-07-13 20:01:12 +02:00
Bartosz Taudul
fc83f97ad3
Same for AVX/SSE.
2019-07-13 19:34:08 +02:00
Bartosz Taudul
62a167541c
No need to mask out indices.
2019-07-13 19:07:25 +02:00
Alex
0c5ea710b0
Merged in z33ky/tracy/const-frame-image (pull request #37 )
...
Constify frame-image pointer in API.
2019-07-13 13:09:21 +00:00
Bartosz Taudul
7bb9549e84
ARM64 specific NEON implementation of DXT1 compression.
2019-07-13 14:31:33 +02:00
Alexander 'z33ky' Hirsch
c6e8dc8d63
Constify frame-image pointer in API.
2019-07-13 12:33:55 +02:00
Bartosz Taudul
60d2384a6a
Allow sending application information messages.
2019-07-12 18:34:46 +02:00
Bartosz Taudul
a1ce5fc1f6
Add include for built-in __get_cpuid() on gcc/clang.
2019-07-10 02:09:19 +02:00
Bartosz Taudul
c164a70b9d
Check for rdstcp/invariant tsc support.
2019-07-10 02:04:14 +02:00
Bartosz Taudul
c0670848d2
Reuse variable.
2019-07-08 02:08:06 +02:00
Bartosz Taudul
17dbbe67de
Remove dependency on range subtraction.
2019-07-08 00:14:36 +02:00
Bartosz Taudul
af1bd3e1fa
Faster horizontal add.
2019-07-07 23:57:23 +02:00
Bartosz Taudul
b32e8fa24e
Ditto for NEON.
2019-07-06 00:18:53 +02:00
Bartosz Taudul
d236d4b70f
Ditto for AVX2.
2019-07-06 00:05:32 +02:00
Bartosz Taudul
f62b21c21d
Masking alpha out is not needed.
...
We assume that alpha value is constant for the whole image. The range
calculation is max - min, so alpha zeroes out. The color normalization
to range is color - min, so alpha also zeroes out here.
2019-07-05 23:58:19 +02:00
Bartosz Taudul
03189a30b8
Two ands less in NEON DXT1 compression.
2019-07-05 18:37:25 +02:00
Bartosz Taudul
275d992cb1
Two ands less in AVX2 DXT1 compression.
2019-07-05 18:22:42 +02:00
Bartosz Taudul
c89358d6b9
Two ands less in SSE DXT1 compression.
2019-07-05 18:17:50 +02:00
Bartosz Taudul
5bfc62f1bf
iOS device name decoding.
2019-06-19 09:59:46 +02:00
Bartosz Taudul
59b4f84ce5
Display unknown implementer, part as hex values.
2019-07-03 21:18:17 +02:00
Bartosz Taudul
c6f6c368b2
Decode ARM CPU names.
2019-07-03 21:01:34 +02:00
Bartosz Taudul
e26ab8e9f6
Make forwarding functions more compact.
2019-07-03 18:05:38 +02:00
Bartosz Taudul
bdfb568742
Fix div tables for max range on all channels.
2019-07-01 12:31:06 +02:00
Bartosz Taudul
684a119a2c
Fix order of checks for including intrinsics.
2019-07-01 11:45:16 +02:00
Bartosz Taudul
983c48994b
Write block data directly to memory.
2019-06-30 11:44:32 +02:00
Bartosz Taudul
9b8c18f99e
Improve readability.
2019-06-30 11:44:00 +02:00
Bartosz Taudul
52b6bdb55a
Force inline ProcessRGB functions.
2019-06-30 03:33:14 +02:00
Bartosz Taudul
8c06f7288c
AVX2 DXT1 compression.
2019-06-30 03:20:58 +02:00
Bartosz Taudul
2e893bba91
Use division tables.
2019-06-29 12:16:49 +02:00
Bartosz Taudul
ab9f036f5e
Integrate CheckSolid into ProcessRGB.
2019-06-29 02:04:08 +02:00
Bartosz Taudul
faf6bb97a4
DXT1 NEON color index packing.
2019-06-28 22:36:44 +02:00
Bartosz Taudul
2df1eaaa7e
Pack color indices using SSE.
2019-06-28 21:58:10 +02:00
Bartosz Taudul
fcb5b4b888
NEON DXT1 compression.
2019-06-28 14:24:16 +02:00
Bartosz Taudul
e8d4ba492b
Unify shifts.
2019-06-28 13:05:32 +02:00
Bartosz Taudul
be4900c822
NEON CheckSolid.
2019-06-28 01:47:04 +02:00
Bartosz Taudul
3c066f1527
Simplify code.
2019-06-27 22:40:03 +02:00
Bartosz Taudul
72a0d4c2ab
Rest of SSE DXTC compression.
2019-06-27 22:29:44 +02:00
Bartosz Taudul
137b28e110
SSE CheckSolid.
2019-06-27 22:29:44 +02:00
Bartosz Taudul
3d590b6b8c
Initialize rpmalloc in compression thread.
2019-06-27 19:14:51 +02:00
Bartosz Taudul
1939c31165
Experimental DXT1 compressor.
2019-06-27 19:14:51 +02:00
Bartosz Taudul
79eb1b9029
Swap queue and dequeue only if queue has contents.
2019-06-27 13:37:09 +02:00