StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures, Concurr. Comput. : Pract. Exper, vol.23, pp.187-198, 2011. ,
URL : https://hal.archives-ouvertes.fr/inria-00384363
A Black-box Approach to Energy-aware Scheduling on Integrated CPU-GPU Systems, CGO. ACM, pp.70-81, 2016. ,
JetsonLeap: A Framework to Measure Energy-Aware Code Optimizations in Embedded and Heterogeneous Systems, pp.16-30, 2016. ,
The PARSEC Benchmark Suite: Characterization and Architectural Implications, PACT. ACM, pp.72-81, 2008. ,
Power-Aware Microarchitecture: Design and Modeling Challenges for Next-Generation Microprocessors, IEEE Micro, vol.20, pp.26-44, 2000. ,
Rodinia: A Benchmark Suite for Heterogeneous Computing, pp.44-54, 2009. ,
Heterogeneous Multi-Processing Solution of Exynos 5 Octa with ARM big.LITTLE Technology, 2012. ,
Energy-efficient Scheduling on Heterogeneous Multi-core Architectures, ISLPED. ACM, pp.345-350, 2012. ,
The program dependence graph and its use in optimization, TOPLAS, vol.9, pp.319-349, 1987. ,
A Framework for Application-Guided Task Management on Heterogeneous Embedded Systems, ACM Trans. Archit. Code Optim, vol.12, p.25, 2015. ,
A static task partitioning approach for heterogeneous systems using OpenCL, Compiler Construction, pp.286-305, 2011. ,
Continuous shape shifting: Enabling loop co-optimization via near-free dynamic code rewriting, MICRO, pp.1-12, 2016. ,
Bottleneck Identification and Scheduling in Multithreaded Applications, ASPLOS. ACM, pp.223-234, 2012. ,
LLVM: A Compilation Framework for Lifelong Program Analysis & Transformation, CGO. IEEE, pp.75-88, 2004. ,
Dynamic Voltage and Frequency Scaling: The Laws of Diminishing Returns, HotPower. USENIX Association, pp.1-8, 2010. ,
Qilin: Exploiting Parallelism on Heterogeneous Multiprocessors with Adaptive Mapping, MICRO. ACM, pp.45-55, 2009. ,
Exploring Fine-Grained Heterogeneity with Composite Cores, IEEE Trans. Comput, vol.65, pp.535-547, 2016. ,
Portable and Transparent Software Managed Scheduling on Accelerators for Fair Resource Sharing, CGO, pp.82-93, 2016. ,
DawnCC: Automatic Annotation for Data Parallelism and Offloading, TACO, vol.14, p.25, 2017. ,
Hipster: Hybrid Task Manager for Latency-Critical Cloud Workloads, HPCA. IEEE, pp.409-420, 2017. ,
Bones: An Automatic Skeleton-Based C-to-CUDA Compiler for GPUs, vol.11, p.25, 2014. ,
Octopus-man: QoSdriven task management for heterogeneous multicores in warehousescale computers, pp.246-258, 2015. ,
Compiler Support for Selective Page Migration in NUMA Architectures, PACT, pp.369-380, 2014. ,
Static Placement of Computation on Heterogeneous Devices, OOPSLA. ACM, pp.1-18, 2017. ,
Thread Motion: Fine-grained Power Management for Multi-core Systems, ISCA. ACM, pp.302-313, 2009. ,
Dandelion: A Compiler and Runtime for Heterogeneous Systems, SOSP. ACM, pp.49-68, 2013. ,
Price Theory Based Power Management for Heterogeneous Multi-cores, ASPLOS. ACM, pp.161-176, 2014. ,
Introduction to Reinforcement Learning, 1998. ,
ReQoS: Reactive Static/Dynamic Compilation for QoS in Warehouse Scale Computers, ASPLOS. ACM, pp.89-100, 2013. ,
Scheduling Heterogeneous Multi-cores Through Performance Impact Estimation (PIE), ISCA. IEEE, pp.213-224, 2012. ,
José Ignacio Gómez, Christian Tenllado, and Francky Catthoor, vol.9, p.23, 2013. ,
Accurate and Stable Run-Time Power Modeling for Mobile and Embedded CPUs, TCAD, vol.36, pp.106-119, 2016. ,
Neural acceleration for GPU throughput processors, pp.482-493, 2015. ,
Maximizing Performance Under a Power Cap: A Comparison of Hardware, Software, and Hybrid Techniques, ASPLOS. ACM, pp.545-559, 2016. ,