S. Mittal and J. Vetter, A Survey of CPU-GPU Heterogeneous Computing Techniques, ACM Computing Surveys, vol.47, issue.4, pp.1-6935, 2015.
DOI : 10.1145/2788396

D. Tullsen, S. Eggers, and H. Levy, Simultaneous multithreading: Maximizing on-chip parallelism. 25 Years of the International Symposia on Computer Architecture, ISCA '98, pp.533-544, 1998.

R. Keller, Look-Ahead Processors, ACM Computing Surveys, vol.7, issue.4, pp.177-195, 1975.
DOI : 10.1145/356654.356657

W. Gropp, R. Thakur, and E. Lusk, Using MPI-2: Advanced Features of the Message Passing Interface, 1999.

P. Hudak, Conception, evolution, and application of functional programming languages, ACM Computing Surveys, vol.21, issue.3, pp.359-411, 1989.
DOI : 10.1145/72551.72554

P. Hartel, H. Muller, and H. Glaser, The Functional C experience, Journal of Functional Programming, vol.14, issue.2, pp.129-135, 2004.
DOI : 10.1017/S0956796803004817

P. Hudak, J. Hughes, P. Jones, S. Wadler, and P. , A history of Haskell, Proceedings of the third ACM SIGPLAN conference on History of programming languages , HOPL III, pp.12-13, 2007.
DOI : 10.1145/1238844.1238856

J. Dean and G. S. Mapreduce, Simplified data processing on large clusters, Proceedings of the 6th Conference on Symposium on Opearting Systems Design & Implementation, pp.10-10, 2004.

C. Luk, R. Cohn, R. Muth, H. Patil, A. Klauser et al., Pin, ACM SIGPLAN Notices, vol.40, issue.6, pp.190-200, 2005.
DOI : 10.1145/1064978.1065034

J. Diaz, C. Munoz-caro, and A. Nino, A survey of parallel programming models and tools in the multi and many-core era. Parallel and Distributed Systems, IEEE Transactions on, vol.23, issue.8, pp.1369-1386, 2012.

P. Feautrier, Dataflow analysis of array and scalar references, International Journal of Parallel Programming, vol.24, issue.4, pp.23-53, 1991.
DOI : 10.1007/BF01407931

M. Benabderrahmane, L. Pouchet, A. Cohen, and C. Bastoul, The Polyhedral Model Is More Widely Applicable Than You Think, Proceedings of the 19th Joint European Conference on Theory and Practice of Software, International Conference on Compiler Construction, pp.283-303, 2010.
DOI : 10.1007/978-3-642-11970-5_16

URL : https://hal.archives-ouvertes.fr/inria-00551087

B. Chapman, G. Jost, and P. Rvd, Using OpenMP: Portable Shared Memory Parallel Programming (Scientific and Engineering Computation), 2007.

D. Butenhof, Programming with POSIX Threads, 1997.

E. Lee, The Problem with Threads, Computer, vol.39, issue.5, pp.33-42, 2006.
DOI : 10.1109/MC.2006.180

H. Kim and J. Smith, An instruction set and microarchitecture for instruction level distributed processing, Proceedings. 29th Annual International Symposium on, pp.71-81, 2002.
DOI : 10.1145/545214.545224

M. Mehrara, J. Hao, P. Hsu, and S. Mahlke, Parallelizing sequential applications on commodity hardware using a low-cost software transactional memory, Proceedings of the 30th ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI '09, pp.166-176, 2009.

R. Ranjan, F. Latorre, P. Marcuello, and A. Gonzalez, Fg-STP: Fine-Grain Single Thread Partitioning on Multicores, 2011 IEEE 17th International Symposium on High Performance Computer Architecture, pp.15-24, 2011.
DOI : 10.1109/HPCA.2011.5749713

M. Sharafeddine, K. Jothi, and H. Akkary, Disjoint out-of-order execution processor. Transactions on Architecture and Code Optimization (TACO) sept 2012, pp.1-1932

D. Wall, Limits of instruction-level parallelism, Proceedings of the Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, pp.176-188, 1991.

B. Goossens, D. Parello, K. Porada, and D. Rahmoune, Toward a Core Design to Distribute an Execution on a Manycore Processor, Parallel Computing Technologies Lecture Notes in Computer Science, vol.9251, pp.390-404, 2015.
DOI : 10.1007/978-3-319-21909-7_38

URL : https://hal.archives-ouvertes.fr/hal-01152664

B. Goossens and D. Parello, Limits of Instruction-Level Parallelism Capture, 2013 International Conference on Computational Science, pp.1664-1673, 2013.
DOI : 10.1016/j.procs.2013.05.334

J. Shun, G. Blelloch, J. Fineman, P. Gibbons, A. Kyrola et al., Brief announcement, Proceedinbgs of the 24th ACM symposium on Parallelism in algorithms and architectures, SPAA '12, pp.68-70, 2012.
DOI : 10.1145/2312005.2312018

S. Marlow, P. Jones, S. Singh, and S. , Runtime support for multicore haskell. SIGPLAN Not, Aug, vol.44, issue.9, pp.65-78, 2009.

B. Goossens, D. Parello, K. Porada, and D. Rahmoune, Parallel Locality and Parallelization Quality, Proceedings of the 7th International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM'16, 2016.
DOI : 10.1145/2883404.2883410

URL : https://hal.archives-ouvertes.fr/hal-01252007