Carlos Couder-Castañeda

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Carlos Couder-Castañeda is active.

Explore More

Publication

Featured researches published by Carlos Couder-Castañeda.

Scientific Programming | 2015

Performance of a code migration for the simulation of supersonic ejector flow to SMP, MIC, and GPU using OpenMP, OpenMP+LEO, and OpenACC directives

Carlos Couder-Castañeda; Hector Barrios-Piña; Isidoro Gitler; Maricela Arroyo

A serial source code for simulating a supersonic ejector flow is accelerated using parallelization based on OpenMP and OpenACC directives. The purpose is to reduce the development costs and to simplify the maintenance of the application due to the complexity of the FORTRAN source code. This research follows well-proven strategies in order to obtain the best performance in both OpenMP and OpenACC. OpenMP has become the programming standard for scientific multicore software and OpenACC is one true alternative for graphics accelerators without the need of programming low level kernels. The strategies using OpenMP are oriented towards reducing the creation of parallel regions, tasks creation to handle boundary conditions, and a nested control of the loop time for the programming in offload mode specifically for the Xeon Phi. In OpenACC, the strategy focuses on maintaining the data regions among the executions of the kernels. Experiments for performance and validation are conducted here on a 12-core Xeon CPU, Xeon Phi 5110p, and Tesla C2070, obtaining the best performance from the latter. The Tesla C2070 presented an acceleration factor of 9.86X, 1.6X, and 4.5X compared against the serial version on CPU, 12-core Xeon CPU, and Xeon Phi, respectively.

Journal of Applied Mathematics | 2013

TESLA GPUs versus MPI with OpenMP for the Forward Modeling of Gravity and Gravity Gradient of Large Prisms Ensemble

Carlos Couder-Castañeda; Carlos Ortiz-Aleman; Mauricio Gabriel Orozco-del-Castillo; Mauricio Nava-Flores

An implementation with the CUDA technology in a single and in several graphics processing units (GPUs) is presented for the calculation of the forward modeling of gravitational fields from a tridimensional volumetric ensemble composed by unitary prisms of constant density. We compared the performance results obtained with the GPUs against a previous version coded in OpenMP with MPI, and we analyzed the results on both platforms. Today, the use of GPUs represents a breakthrough in parallel computing, which has led to the development of several applications with various applications. Nevertheless, in some applications the decomposition of the tasks is not trivial, as can be appreciated in this paper. Unlike a trivial decomposition of the domain, we proposed to decompose the problem by sets of prisms and use different memory spaces per processing CUDA core, avoiding the performance decay as a result of the constant calls to kernels functions which would be needed in a parallelization by observations points. The design and implementation created are the main contributions of this work, because the parallelization scheme implemented is not trivial. The performance results obtained are comparable to those of a small processing cluster.

Journal of Applied Mathematics | 2009

Simulation of Supersonic Flow in an Ejector Diffuser Using the JPVM

Carlos Couder-Castañeda

The ejectors are used commonly to extract gases in the petroleum industry where it is not possible to use an electric bomb due the explosion risk because the gases are flammable. The steam ejector is important in creating and holding a vacuum system. The goal of this job is to develop an object oriented parallel numerical code to investigate the unsteady behavior of the supersonic flow in the ejector diffuser to have an efficient computational tool that allows modeling different diffuser designs. The first step is the construction of a proper transformation of the solution space to generate a computational regular space to apply an explicit scheme. The second step, consists in developing the numerical code with an-object-oriented parallel methodology. Finally, the results obtained about the flux are satisfactory compared with the physical sensors, and the parallel paradigm used not only reduces the computational time but also shows a better maintainability, reusability, and extensibility accuracy of the code.

Modelling and Simulation in Engineering | 2010

A case study of nearshore wave transformation processes along the coast of mexico near the laguna verde nuclear power plant using a fast simulation method

Israel-Enrique Herrera-Díaz; Carlos Couder-Castañeda; Hermilo Ramírez-León

A numerical model based on themild-slope equation of water wave propagation over complicated bathymetry, taking into account the combined effects of refraction, diffraction, and reflection due to breakwater, is presented. The numericalmethod was developed using a split proposed version of the mild-slope equation and solved by an implicit method in a finite volume grid; this technique easily allows model the wave effects caused by the breakwater building in coastal waters, where industrial and other economic activities take place. Controlled case studies have been made and the results match very well with the reference solution. The capability and utility of the model for real coastal areas are illustrated by application to the breakwater of the Laguna Verde Nuclear Power Plant (LVNPP).

Scientific Programming | 2015

A performance study of a dual Xeon-Phi cluster for the forward modelling of gravitational fields

Maricela Arroyo; Carlos Couder-Castañeda; Alfredo Trujillo-Alcantara; Israel Enrique Herrera-Díaz; Nain Vera-Chavez

With at least 60 processing cores, the Xeon-Phi coprocessor is a truly multicore architecture, which consists of an interconnection speed among cores of 240 GB/s, two levels of cache memory, a theoretical performance of 1.01 Tflops, and programming flexibility, all making the Xeon-Phi an excellent coprocessor for parallelizing applications that seek to reduce computational times. The objective of this work is to migrate a geophysical application designed to directly calculate the gravimetric tensor components and their derivatives and in this way research the performance of one and two Xeon-Phi coprocessors integrated on the same node and distributed in various nodes. This application allows the analysis of the design factors that drive good performance and compare the results against a conventional multicore CPU. This research shows an efficient strategy based on nested parallelism using Open MP, a design that in its outer structure acts as a controller of interconnected Xeon-Phi coprocessors while its interior is used for parallelyzing the loops. MPI is subsequently used to reduce the information among the nodes of the cluster.

Journal of Applied Mathematics | 2014

Modelling Shallow Water Wakes Using a Hybrid Turbulence Model

Clemente Rodríguez-Cuevas; Carlos Couder-Castañeda; Esteban Flores-Mendez; Israel Enrique Herrera-Díaz; Rodolfo Cisneros-Almazan

A numerical research with different turbulence models for shallow water equations was carried out. This was done in order to investigate which model has the ability to reproduce more accurately the wakes produced by the shock of the water hitting a submerged island inside a canal. The study of this phenomenon is important for the numerical methods application advancement in the simulation of free surface flows since these models involve a number of simplifications and assumptions that can have a significant impact on the numerical solutions quality and thus can not reproduce correctly the physical phenomenon. The numerical experiments were carried out on an experimental case under controlled conditions, consisting of a channel with a submerged conical island. The numerical scheme is based on the Eulerian-Lagrangian finite volume method with four turbulence models, three mixing lengths (ml), and one joining on the horizontal axis with a mixing-length model (ml) on the vertical axis. The experimental results show that a with ml turbulence model makes it possible to approach the experimental results in a more qualitative manner. We found that when using only a model in the vertical and horizontal direction, the numerical results overestimate the experimental data. Additionally the computing time is reduced by simplifying the turbulence model.

Mathematical Problems in Engineering | 2018

Error Assessment Model for the Inverse Kinematics Problem for Stewart Parallel Mechanisms for Accurate Aerospace Optical Linkage

J. J. Hernández-Gómez; I. Medina; C. R. Torres-San Miguel; A. Solís-Santomé; Carlos Couder-Castañeda; J. C. Ortiz-Alemán; J.I. Grageda-Arellano

In this work we develop a mathematical model to estimate the error for inverse kinematics problem for Gough-Stewart parallel mechanisms. We propose the estimation error method to include manufacture, assembly, backlash, and sensing errors. We provide the error transmission matrices for the length of each leg of the hexapod, which permits evaluation of the accuracy error in the position of each one, given a desired position and orientation of the mobile platform. We also present numerical modelling in order to estimate the accuracy of the methodology herein proposed, for specific attitude operations corresponding to performing a successful ground-LEO nanosatellite optical link. In such a case, we were able to provide the required tolerances for the actuators in order to guarantee an orientation precision requirement of the order of milliradians.

International Journal of Antennas and Propagation | 2018

Analysis of Electromagnetic Propagation from MHz to THz with a Memory-Optimised CPML-FDTD Algorithm

A. Rodríguez-Sánchez; Carlos Couder-Castañeda; J. J. Hernández-Gómez; I. Medina; S. Peña-Ruiz; J. Sosa-Pedroza; M. A. Enciso-Aguilar

FDTD method opened a fertile research area on the numerical analysis of electromagnetic phenomena under a wide range of media and propagation conditions, providing an extensive analysis of electromagnetic behaviour like propagation, reflection, refraction, and multitrajectory phenomena. In this paper, we present an optimised FDTD-CPML algorithm, focused in saving memory while increasing the performance of the algorithm. We particularly implement FDTD-CPML method at high frequency bands, used in several telecommunications applications as well as in nanoelectromagnetism. We show an analysis of the performance of the algorithm in single and double precision, as well as a stability of the algorithm analysis, from where we conclude that the implemented CPML ABC constitutes a robust choice in terms of precision and accuracy for the high frequencies herein considered. It is important to recall that the CPML ABC parameters provided in this paper are fixed for the tested range of frequencies, that is, from MHz to THz.

Mathematical Problems in Engineering | 2017

Light Particle Tracking Model for Simulating Bed Sediment Transport Load in River Areas

Israel Enrique Herrera-Díaz; Franklin Torres-Bejarano; Jatziri Y. Moreno-Martínez; Clemente Rodríguez-Cuevas; Carlos Couder-Castañeda

In this work a fast computational particles tracer model is developed based on Particle-In-Cell method to estimate the sediment transport in the access zone of a river port area. To apply the particles tracer method, first it is necessary to calculate the hydrodynamic fields of the study zone to determine the velocity fields in the three directions. The particle transport is governed mainly by the velocity fields and the turbulent dispersion. The mechanisms of dispersion and resuspension of particles are based in stochastic models, which describes the movement through a probability function. The developed code was validated using two well known cases with a discrete transformation obtaining a max relative error around 4.8% in both cases. The simulations were carried out with 350,000 particles allowing us to determine under certain circumstances different hydrodynamic scenarios where the zones are susceptible to present erosion and siltation at the entrance of the port.

International Journal of Modern Physics C | 2017

High solar activity predictions through an artificial neural network

Mauricio Gabriel Orozco-del-Castillo; José Carlos Ortiz-Alemán; Carlos Couder-Castañeda; J. J. Hernández-Gómez; A. Solís-Santomé

The effects of high-energy particles coming from the Sun on human health as well as in the integrity of outer space electronics make the prediction of periods of high solar activity (HSA) a task of significant importance. Since periodicities in solar indexes have been identified, long-term predictions can be achieved. In this paper, we present a method based on an artificial neural network to find a pattern in some harmonics which represent such periodicities. We used data from 1973 to 2010 to train the neural network, and different historical data for its validation. We also used the neural network along with a statistical analysis of its performance with known data to predict periods of HSA with different confidence intervals according to the three-sigma rule associated with solar cycles 24–26, which we found to occur before 2040.

Explore More