Context Navigation

← Previous Changeset
Next Changeset →

Changeset 313

Timestamp:

Oct 10, 2011, 12:03:17 PM (14 years ago)

Author:

acolaitis

Message:

* AC 10/10/2011 *

*
This commit aims at increasing the thermals speed. Using these corrections, gcm performances in 64x48x32 using 1 tracer goes from 27.9% elapsed time in thermals to 18.76%.

*
Additional work needs to be done in tracer advection to gain speed in high tracer number configuration. (tracer advection (but not momentum nor temperature) could be decoupled from sub-timestep, as they do not act on the thermals scheme (water vapor is neglected as we use theta and not theta_v, and radiative effect of dust is not computed in the thermals.))

*
=> TOP 5 of routine contributions to gcm runtime :

Each sample counts as 0.01 seconds.

% cumulative self self total

time seconds seconds calls s/call s/call name
18.76 6.33 6.33 960 0.01 0.01 thermcell_main_mars_
17.19 12.13 5.80 svml_powf4.A
13.72 16.76 4.63 10369 0.00 0.00 filtreg_

3.94 18.09 1.33 intel_new_memset
3.73 19.35 1.26 2880 0.00 0.00 thermcell_dqupdown_

note: thermcell_main_mars_ does call quite a lot power computations (see svml_powf4.A), but this number will not increase with tracer numbers.

*
=> LOG:

M 312 libf/phymars/thermcell_main_mars.F90
^{------------------- removed (commented) computations on buoyancy which is purely diagnostic}

tuned internal convergence loop and added convergence criterion

M 312 libf/phymars/thermcell_dqupdown.F90
^{------------------- removed (commented) downdraft-related if-loops (as we do not advect tracers and momentum in downdrafts for now)}

M 312 libf/phymars/calltherm_mars.F90
^{------------------- removed (commented) diagnostic-related computations}

changed default thermals spliting and aspect ratio
corrected a bug where maximum height was not correctly computed and could result in convective adjustment used in place of thermals
when using certains sets of nsplit and r_aspect (was not happening with the baseline version, so that this correction is transparent to
users)

Location:

trunk/LMDZ.MARS

Files:

: 4 edited

README (modified) (1 diff)
libf/phymars/calltherm_mars.F90 (modified) (11 diffs)
libf/phymars/thermcell_dqupdown.F90 (modified) (6 diffs)
libf/phymars/thermcell_main_mars.F90 (modified) (7 diffs)

Legend:

: Unmodified
: Added
: Removed

trunk/LMDZ.MARS/README

-                      r311
+                      r313
 >> Bug correction in lect_start_archive.F ; in some cases layer(:) was not
    initialized.
+== 10/10/2011 == AC
+***********
+This commit aims at increasing the thermals speed. Using these corrections, gcm performances in 64x48x32 using 1 tracer goes from 27.9% elapsed time in thermals to 18.76%.
+***********
+Additional work needs to be done in tracer advection to gain speed in high tracer number configuration. (tracer advection (but not momentum nor temperature) could be decoupled from sub-timestep, as they do not act on the thermals scheme (water vapor is neglected as we use theta and not theta_v, and radiative effect of dust is not computed in the thermals.))
+***********
+=> TOP 5 of routine contributions to gcm runtime :
+Each sample counts as 0.01 seconds.
+  %   cumulative   self              self     total
+ time   seconds   seconds    calls   s/call   s/call  name
+.76      6.33     6.33      960     0.01     0.01  thermcell_main_mars_
+.19     12.13     5.80                             __svml_powf4.A
+.72     16.76     4.63    10369     0.00     0.00  filtreg_
+.94     18.09     1.33                             __intel_new_memset
+.73     19.35     1.26     2880     0.00     0.00  thermcell_dqupdown_
+note: thermcell_main_mars_ does call quite a lot power computations (see __svml_powf4.A), but this number will not increase with tracer numbers.
+***********
+=> LOG:
+M             312   libf/phymars/thermcell_main_mars.F90
+^------------------- removed (commented) computations on buoyancy which is purely diagnostic
+                     tuned internal convergence loop and added convergence criterion
+M             312   libf/phymars/thermcell_dqupdown.F90
+^------------------- removed (commented) downdraft-related if-loops (as we do not advect tracers and momentum in downdrafts for now)
+M             312   libf/phymars/calltherm_mars.F90
+^------------------- removed (commented) diagnostic-related computations
+                     changed default thermals spliting and aspect ratio
+                     corrected a bug where maximum height was not correctly computed and could result in convective adjustment used in place of thermals
+                     when using certains sets of nsplit and r_aspect (was not happening with the baseline version, so that this correction is transparent to
+                     users)
+********************

trunk/LMDZ.MARS/libf/phymars/calltherm_mars.F90

-                      r300
+                      r313
      &      ,u_seri,v_seri,t_seri,pq_therm,q2_therm  &
      &      ,d_u_ajs,d_v_ajs,d_t_ajs,d_q_ajs,dq2_therm  &
      &      ,fm_therm,entr_therm,detr_therm,lmax,zmax,&
+     &      ,fm_therm,entr_therm,detr_therm,lmax,zmaxth,&
      &   zw2,fraca,zpopsk,ztla,heatFlux,heatFlux_down,&
      &     buoyancyOut,buoyancyEst,hfmax,wmax)
 …
       real hfmax(ngridmx)
       integer lmax(ngridmx)
+      real zmax(ngridmx)
+      real lmax_real(ngridmx)
+      real zmax(ngridmx),zmaxth(ngridmx)
 !nouvelles variables pour la convection
 …
          r_aspect_thermals=0.7
 #else
          nsplit_thermals=50
          r_aspect_thermals=1.5
+         nsplit_thermals=20
+         r_aspect_thermals=2.
 #endif
          call getin("nsplit_thermals",nsplit_thermals)
+         fm_therm(:,:)=0.
+         detr_therm(:,:)=0.
+         entr_therm(:,:)=0.
+         call getin("r_aspect_thermals",r_aspect_thermals)
+!         fm_therm(:,:)=0.
+!         detr_therm(:,:)=0.
+!         entr_therm(:,:)=0.
          heatFlux(:,:)=0.
          heatFlux_down(:,:)=0.
          buoyancyOut(:,:)=0.
          buoyancyEst(:,:)=0.
+!         buoyancyOut(:,:)=0.
+!         buoyancyEst(:,:)=0.
          zw2(:,:)=0.
+         zmaxth(:)=0.
+         lmax_real(:)=0.
          zdt=dtime/REAL(nsplit_thermals)
 …
 ! cas de splitting
          zfm_therm(:,:)=0.
          zentr_therm(:,:)=0.
          zdetr_therm(:,:)=0.
+!         zfm_therm(:,:)=0.
+!         zentr_therm(:,:)=0.
+!         zdetr_therm(:,:)=0.
+!
          zheatFlux(:,:)=0.
          zheatFlux_down(:,:)=0.
          zbuoyancyOut(:,:)=0.
          zbuoyancyEst(:,:)=0.
+!         zbuoyancyOut(:,:)=0.
+!         zbuoyancyEst(:,:)=0.
          zzw2(:,:)=0.
+         zmax(:)=0.
+         lmax(:)=0.
          d_t_the(:,:)=0.
          d_u_the(:,:)=0.
          d_v_the(:,:)=0.
          dq2_the(:,:)=0.
+!         dq2_the(:,:)=0.
          if (nqmx .ne. 0) then
             d_q_the(:,:,:)=0.
 …
              CALL thermcell_main_mars(zdt  &
+!             CALL thermcell_main_mars_coupled_v2(zdt  &
      &      ,pplay,paprs,pphi,zzlev,zzlay  &
      &      ,u_seri,v_seri,t_seri,pq_therm,q2_therm  &
 …
             d_u_the(:,:)=d_u_the(:,:)*fact
             d_v_the(:,:)=d_v_the(:,:)*fact
             dq2_the(:,:)=dq2_the(:,:)*fact
+!            dq2_the(:,:)=dq2_the(:,:)*fact
             if (nqmx .ne. 0) then
 …
             endif
+            fm_therm(:,:)=fm_therm(:,:)  &
+     &      +zfm_therm(:,:)*fact
+            entr_therm(:,:)=entr_therm(:,:)  &
+     &       +zentr_therm(:,:)*fact
+            detr_therm(:,:)=detr_therm(:,:)  &
+     &       +zdetr_therm(:,:)*fact
+             zmaxth(:)=zmaxth(:)+zmax(:)*fact
+             lmax_real(:)=lmax_real(:)+float(lmax(:))*fact
+!            fm_therm(:,:)=fm_therm(:,:)  &
+!     &      +zfm_therm(:,:)*fact
+!            entr_therm(:,:)=entr_therm(:,:)  &
+!     &       +zentr_therm(:,:)*fact
+!            detr_therm(:,:)=detr_therm(:,:)  &
+!     &       +zdetr_therm(:,:)*fact
             heatFlux(:,:)=heatFlux(:,:) &
 …
             heatFlux_down(:,:)=heatFlux_down(:,:) &
      &       +zheatFlux_down(:,:)*fact
             buoyancyOut(:,:)=buoyancyOut(:,:) &
      &       +zbuoyancyOut(:,:)*fact
             buoyancyEst(:,:)=buoyancyEst(:,:) &
      &       +zbuoyancyEst(:,:)*fact
+!            buoyancyOut(:,:)=buoyancyOut(:,:) &
+!     &       +zbuoyancyOut(:,:)*fact
+!            buoyancyEst(:,:)=buoyancyEst(:,:) &
+!     &       +zbuoyancyEst(:,:)*fact
             zw2(:,:)=zw2(:,:) + zzw2(:,:)*fact
 …
             d_v_ajs(:,:)=d_v_ajs(:,:)+d_v_the(:,:)
             d_q_ajs(:,:,:)=d_q_ajs(:,:,:)+d_q_the(:,:,:)
             dq2_therm(:,:)=dq2_therm(:,:)+dq2_the(:,:)
+!            dq2_therm(:,:)=dq2_therm(:,:)+dq2_the(:,:)
 !  incrementation des variables meteo
 …
             v_seri(:,:) = v_seri(:,:) + d_v_the(:,:)
             pq_therm(:,:,:) = pq_therm(:,:,:) + d_q_the(:,:,:)
             q2_therm(:,:) = q2_therm(:,:) + dq2_therm(:,:)
+!            q2_therm(:,:) = q2_therm(:,:) + dq2_therm(:,:)
 …
             wmax(i)=MAXVAL(zw2(i,:))
           ENDDO
+         lmax(:)=nint(lmax_real(:))
       return

trunk/LMDZ.MARS/libf/phymars/thermcell_dqupdown.F90

-                      r278
+                      r313
       qa(:,:)=q_therm(:,:)
       q(:,:)=q_therm(:,:)
       qd(:,:)=q_therm(:,:)
       active(:,:)=.false.
+!      qd(:,:)=q_therm(:,:)
+!      active(:,:)=.false.
 ! previous calculation of zdthl_down uses the divergence of fmd
 …
 ! of fmd in the equations, so it has to be positive
+!
       fmd(:,:)=-fm_down(:,:)
+!      fmd(:,:)=-fm_down(:,:)
+!
 !! ========== Entrainment, Detrainement and Mass =================
 …
       do ig=1,ngridmx
          if(active(ig,1)) then
          dq_therm(ig,1)=(detr0(ig,1)*qa(ig,1)+detrd(ig,1)*qd(ig,1) &
       &               +fm0(ig,2)*q(ig,2)   &
       &               -entr0(ig,1)*q(ig,1)-entrd(ig,1)*q(ig,1)   &
       &               -fmd(ig,2)*q(ig,1)) &
       &               *ptimestep/masse0(ig,1)
          else
+!         if(active(ig,1)) then
+!
+!         dq_therm(ig,1)=(detr0(ig,1)*qa(ig,1)+detrd(ig,1)*qd(ig,1) &
+!      &               +fm0(ig,2)*q(ig,2)   &
+!      &               -entr0(ig,1)*q(ig,1)-entrd(ig,1)*q(ig,1)   &
+!      &               -fmd(ig,2)*q(ig,1)) &
+!      &               *ptimestep/masse0(ig,1)
+!
+!         else
          dq_therm(ig,1)=(detr0(ig,1)*qa(ig,1)+fm0(ig,2)*q(ig,2) &
       &               -entr0(ig,1)*q(ig,1)) &
       &               *ptimestep/masse0(ig,1)
          endif
+!         endif
        enddo
 …
          do ig=1, ngridmx
          if(active(ig,k)) then
          dq_therm(ig,k)=(detr0(ig,k)*qa(ig,k)+detrd(ig,k)*qd(ig,k) &
       &               +fm0(ig,k+1)*q(ig,k+1)+fmd(ig,k)*q(ig,k-1)   &
       &               -entr0(ig,k)*q(ig,k)-entrd(ig,k)*q(ig,k)   &
       &               -fm0(ig,k)*q(ig,k)-fmd(ig,k+1)*q(ig,k))      &
       &               *ptimestep/masse0(ig,k)
          else
+!         if(active(ig,k)) then
+!
+!         dq_therm(ig,k)=(detr0(ig,k)*qa(ig,k)+detrd(ig,k)*qd(ig,k) &
+!      &               +fm0(ig,k+1)*q(ig,k+1)+fmd(ig,k)*q(ig,k-1)   &
+!      &               -entr0(ig,k)*q(ig,k)-entrd(ig,k)*q(ig,k)   &
+!      &               -fm0(ig,k)*q(ig,k)-fmd(ig,k+1)*q(ig,k))      &
+!     &               *ptimestep/masse0(ig,k)
+!         else
          dq_therm(ig,k)=(detr0(ig,k)*qa(ig,k)+fm0(ig,k+1)*q(ig,k+1) &
       &               -entr0(ig,k)*q(ig,k)-fm0(ig,k)*q(ig,k))  &
 …
          endif
+!         endif
          enddo
 …
          do ig=1, ngridmx
          if(active(ig,nlayermx)) then
          dq_therm(ig,nlayermx)=(detr0(ig,nlayermx)*qa(ig,nlayermx)+detrd(ig,nlayermx)*qd(ig,nlayermx) &
       &               +fmd(ig,nlayermx)*q(ig,nlayermx-1)   &
       &         -entr0(ig,nlayermx)*q(ig,nlayermx)-entrd(ig,nlayermx)*q(ig,nlayermx)   &
       &               -fm0(ig,nlayermx)*q(ig,nlayermx)) &
       &               *ptimestep/masse0(ig,nlayermx)
          else
+!         if(active(ig,nlayermx)) then
+!
+!         dq_therm(ig,nlayermx)=(detr0(ig,nlayermx)*qa(ig,nlayermx)+detrd(ig,nlayermx)*qd(ig,nlayermx) &
+!      &               +fmd(ig,nlayermx)*q(ig,nlayermx-1)   &
+!      &         -entr0(ig,nlayermx)*q(ig,nlayermx)-entrd(ig,nlayermx)*q(ig,nlayermx)   &
+!      &               -fm0(ig,nlayermx)*q(ig,nlayermx)) &
+!      &               *ptimestep/masse0(ig,nlayermx)
+!         else
          dq_therm(ig,nlayermx)=(detr0(ig,nlayermx)*qa(ig,nlayermx) &
       &             -entr0(ig,nlayermx)*q(ig,nlayermx)-fm0(ig,nlayermx)*q(ig,nlayermx)) &
       &               *ptimestep/masse0(ig,nlayermx)
          endif
+!         endif
          enddo

trunk/LMDZ.MARS/libf/phymars/thermcell_main_mars.F90

-                      r300
+                      r313
       REAL, INTENT(OUT) :: pdvadj(ngridmx,nlayermx)
       REAL, INTENT(OUT) :: pdqadj(ngridmx,nlayermx,nqmx)
+      REAL, INTENT(OUT) :: pdq2adj(ngridmx,nlayermx)
+!      REAL, INTENT(OUT) :: pdq2adj(ngridmx,nlayermx)
+      REAL :: pdq2adj(ngridmx,nlayermx)
       REAL, INTENT(OUT) :: zw2(ngridmx,nlayermx+1)
 ! Diagnostics
       REAL, INTENT(OUT) :: heatFlux(ngridmx,nlayermx)   ! interface heatflux
       REAL, INTENT(OUT) :: heatFlux_down(ngridmx,nlayermx) ! interface heat flux from downdraft
       REAL, INTENT(OUT) :: buoyancyOut(ngridmx,nlayermx)  ! interlayer buoyancy term
       REAL, INTENT(OUT) :: buoyancyEst(ngridmx,nlayermx)  ! interlayer estimated buoyancy term
+     REAL, INTENT(OUT) :: heatFlux_down(ngridmx,nlayermx) ! interface heat flux from downdraft
+!      REAL, INTENT(OUT) :: buoyancyOut(ngridmx,nlayermx)  ! interlayer buoyancy term
+!      REAL, INTENT(OUT) :: buoyancyEst(ngridmx,nlayermx)  ! interlayer estimated buoyancy term
 ! dummy variables when output not needed :
 …
 !      REAL :: heatFlux(ngridmx,nlayermx)   ! interface heatflux
 !      REAL :: heatFlux_down(ngridmx,nlayermx) ! interface heat flux from downdraft
 !      REAL :: buoyancyOut(ngridmx,nlayermx)  ! interlayer buoyancy term
 !      REAL :: buoyancyEst(ngridmx,nlayermx)  ! interlayer estimated buoyancy term
+      REAL :: buoyancyOut(ngridmx,nlayermx)  ! interlayer buoyancy term
+      REAL :: buoyancyEst(ngridmx,nlayermx)  ! interlayer estimated buoyancy term
 …
 !---------------------------------------------------------------------------
+      DO tic=0,3
+      DO tic=0,6  ! internal convergence loop
       activetmp(:)=active(:) .and. f_star(:,l+1)>1.e-10
       do ig=1,ngridmx
 …
         endif
       enddo
+      activetmp(:)=activetmp(:).and.(abs(ztla(:,l)-ztva(:,l)).gt.0.01)
       do ig=1,ngridmx
 …
       enddo
+      ENDDO
+      ENDDO   ! of tic
 !---------------------------------------------------------------------------
 !initialisations pour le calcul de la hauteur du thermique, de l'inversion et de la vitesse verticale max
 …
 !------------------------------------------------------------------
       modname='tke'
       call thermcell_dqupdown(ngridmx,nlayermx,ptimestep,fm,entr,detr,  &
       &      masse,pq2,pdq2adj,ztvd,fm_down,ztv,modname,lmax)
+!      modname='tke'
+!      call thermcell_dqupdown(ngridmx,nlayermx,ptimestep,fm,entr,detr,  &
+!      &      masse,pq2,pdq2adj,ztvd,fm_down,ztv,modname,lmax)
 ! ===========================================================================
 …
        do ig=1,ngridmx
         heatFlux(ig,l)=fm(ig,l)*(teta_th_int(ig,l)-teta_env_int(ig,l))/(rhobarz(ig,l))
         buoyancyOut(ig,l)=g*(ztva(ig,l)-ztv(ig,l))/ztv(ig,l)
         buoyancyEst(ig,l)=g*(ztva_est(ig,l)-ztv(ig,l))/ztv(ig,l)
+!        buoyancyOut(ig,l)=g*(ztva(ig,l)-ztv(ig,l))/ztv(ig,l)
+!        buoyancyEst(ig,l)=g*(ztva_est(ig,l)-ztv(ig,l))/ztv(ig,l)
         heatFlux_down(ig,l)=fm_down(ig,l)*(teta_down_int(ig,l)-teta_env_int(ig,l))/rhobarz(ig,l)
        enddo

Note: See TracChangeset for help on using the changeset viewer.