yt 2.6 documentation

Derived Fields and Profiles


One of the most powerful features in yt is the ability to create derived fields that act and look exactly like fields that exist on disk. This means that they are generated on demand and can be used anywhere an on-disk field could be used. Additionally, you can create them simply by writing Python functions.

In [1]:
from yt.imods import *

Derived Fields

This is an example of the simplest possible way to create a derived field. All derived fields are defined by a function and some metadata; that metadata can include units, LaTeX-friendly names, conversion factors, and so on. The next cell defines a field by writing a function that accepts two arguments and decorating it with the units for that field. In this case, our field is Dinosaurs and our units are Trex/s. The function itself can access any field in the simulation by requesting it from the object called data.

In [2]:
@derived_field(units = "Trex/s")
def Dinosaurs(field, data):
    return data["Density"]**(2.0/3.0) * data["VelocityMagnitude"]

One important thing to note is that derived fields must be defined before any datasets are loaded. Let's load up our data and take a look at some quantities.

In [3]:
pf = load("IsolatedGalaxy/galaxy0030/galaxy0030")
dd = pf.h.all_data()
print dd.quantities.keys()
['MinLocation', 'StarAngularMomentumVector', 'WeightedVariance', 'TotalMass', 'AngularMomentumVector', 'TotalQuantity', 'IsBound', 'WeightedAverageQuantity', 'CenterOfMass', 'BulkVelocity', 'ParticleSpinParameter', 'Action', 'Extrema', 'MaxLocation', 'BaryonSpinParameter']

One interesting question is: what are the minimum and maximum values of the dinosaur production rate in our isolated galaxy? We can answer that with the Extrema quantity -- in exactly the same way we would for Density, Temperature, and so on.

In [4]:
print dd.quantities["Extrema"]("Dinosaurs")
[(2.2146366774504352e-20, 9.1573883828992124e-09)]

We can do the same for the average quantities as well.

In [5]:
print dd.quantities["WeightedAverageQuantity"]("Dinosaurs", weight="Temperature")
5.29302670063e-11

A Few Other Quantities

We can compute other quantities from our data as well. For instance, this sequence of operations finds the most dense point, centers a sphere on it, calculates the bulk velocity of that sphere, calculates the baryonic angular momentum vector, and then computes the density extrema. All of this is done in a memory-conservative way: if you have an absolutely enormous dataset, yt will split it into pieces, apply intermediate reductions, and then perform a final reduction to calculate your quantity.
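The chunked-reduction strategy can be sketched in plain NumPy. This is an illustration of the idea, not yt's internal code; the function name, chunk count, and data are made up for the example:

```python
import numpy as np

def chunked_extrema(data, n_chunks=4):
    """Compute a global (min, max) by reducing each chunk first and then
    reducing the per-chunk results -- so only one chunk's worth of data
    plus a tiny list of intermediates is ever held at once."""
    partial = [(c.min(), c.max()) for c in np.array_split(data, n_chunks)]
    mins, maxs = zip(*partial)
    return min(mins), max(maxs)

rho = np.array([3.0, 1.0, 4.0, 1.0, 5.0, 9.0, 2.0, 6.0])
print(chunked_extrema(rho))  # (1.0, 9.0)
```

The same intermediate-then-final pattern applies to sums, weighted averages, and the other quantities listed above.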

In [6]:
sp = pf.h.sphere("max", (10.0, 'kpc'))
bv = sp.quantities["BulkVelocity"]()
L = sp.quantities["AngularMomentumVector"]()
(rho_min, rho_max), = sp.quantities["Extrema"]("Density")
print bv, L, rho_min, rho_max
[ -892738.71870531  1107085.74853252   620737.94861487] [ 0.00175796 -0.06228019 -0.99805716] 2.21478238124e-28 7.73426503924e-24

Profiles

yt provides the ability to bin in 1, 2, and 3 dimensions. This means discretizing along one or more dimensions of phase space (density, temperature, etc.) and then calculating either the total value of a field in each bin or its average value in each bin.

We do this using the objects BinnedProfile1D, BinnedProfile2D, and BinnedProfile3D. The first two are the most common since they are the easiest to visualize.
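The core operation of a 1D profile -- a weighted average of one field in bins of another -- can be written in a few lines of NumPy. This is a conceptual sketch with made-up data, not the BinnedProfile1D implementation:

```python
import numpy as np

def binned_weighted_average(bin_field, value_field, weight_field, bins):
    """Weighted average of value_field in each bin of bin_field."""
    idx = np.digitize(bin_field, bins) - 1      # which bin each cell falls in
    nb = len(bins) - 1
    wsum = np.bincount(idx, weights=weight_field, minlength=nb)[:nb]
    vsum = np.bincount(idx, weights=weight_field * value_field,
                       minlength=nb)[:nb]
    with np.errstate(invalid='ignore'):
        return vsum / wsum                      # NaN where a bin is empty

rho = np.array([1.0, 2.0, 3.0, 4.0])            # binning field
T = np.array([10.0, 20.0, 30.0, 40.0])          # field to average
# Density-weighted temperature in two density bins:
print(binned_weighted_average(rho, T, rho, bins=np.array([0.0, 2.5, 5.0])))
```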

This first set of commands manually creates a BinnedProfile1D from the sphere we created earlier, binned into 32 density bins between rho_min and rho_max, and then takes the Density-weighted average of the fields Temperature and (previously-defined) Dinosaurs. We then display it as a log-log plot.

In [7]:
prof = BinnedProfile1D(sp, 32, "Density", rho_min, rho_max)
prof.add_fields(["Temperature", "Dinosaurs"], weight="Density")
pylab.loglog(prof["Density"], prof["Temperature"], "-x")
Out[7]:
[<matplotlib.lines.Line2D at 0x4b9bc90>]

Now we plot the Dinosaurs field.

In [8]:
pylab.loglog(prof["Density"], prof["Dinosaurs"], '-x')
Out[8]:
[<matplotlib.lines.Line2D at 0x781f7d0>]

If we want to see the total mass in every bin, we add the CellMassMsun field with no weight. Specifying weight=None sums the field's values in each bin rather than averaging them.

In [9]:
prof.add_fields(["CellMassMsun"], weight=None)
pylab.loglog(prof["Density"], prof["CellMassMsun"])
Out[9]:
[<matplotlib.lines.Line2D at 0x7847d10>]
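The weight=None case reduces to a plain weighted histogram: each bin accumulates the sum of the field falling into it. A NumPy equivalent, with illustrative data:

```python
import numpy as np

rho = np.array([1.0, 2.0, 3.0, 4.0])    # binning field
mass = np.array([5.0, 5.0, 1.0, 1.0])   # field to total per bin

# np.histogram with weights= sums the weights landing in each bin,
# which is exactly the weight=None (total) behaviour.
totals, edges = np.histogram(rho, bins=[0.0, 2.5, 5.0], weights=mass)
print(totals)  # [10.  2.]
```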

We can also specify accumulation, which turns the per-bin values into a running sum from left to right. Note that for 2D and 3D profiles, accumulation needs to be a tuple of length 2 or 3.

In [10]:
prof.add_fields(["CellMassMsun"], weight=None, accumulation=True)
pylab.loglog(prof["Density"], prof["CellMassMsun"])
Out[10]:
[<matplotlib.lines.Line2D at 0x4a37a10>]
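Conceptually, accumulation applied to the mass totals above is just a cumulative sum over the bins -- so the rightmost bin holds the total mass enclosed up to that density. A one-line NumPy illustration with made-up per-bin totals:

```python
import numpy as np

mass_per_bin = np.array([10.0, 2.0, 7.0, 1.0])

# A left-to-right running total over the bins, as accumulation=True produces.
print(np.cumsum(mass_per_bin))  # [10. 12. 19. 20.]
```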

Advanced Derived Fields

This section can be skipped!

You can also define fields that require extra zones (ghost zones). This is useful, for instance, if you want to take a local average or apply a stencil; yt provides fields like DivV that do this internally. The example below is busy, but it shows the full pattern: you specify the validator ValidateSpatial with the number of extra zones you need on each side of the grid, and inside your function you return a field with those zones stripped off. By necessity, then, the arrays returned by data[something] have a larger spatial extent than the array the function itself returns. If you specify that you need 0 extra zones, this also works and simply supplies a grid object for the field.

In [11]:
@derived_field(name = "AveragedTemperature",
               validators = [ValidateSpatial(1)],
               units = r"K")
def _AveragedTemperature(field, data):
    nx, ny, nz = data["Temperature"].shape
    new_field = na.zeros((nx-2,ny-2,nz-2), dtype='float64')
    weight_field = na.zeros((nx-2,ny-2,nz-2), dtype='float64')
    i_i, j_i, k_i = na.mgrid[0:3,0:3,0:3]
    for i,j,k in zip(i_i.ravel(),j_i.ravel(),k_i.ravel()):
        sl = [slice(i,nx-(2-i)),slice(j,ny-(2-j)),slice(k,nz-(2-k))]
        new_field += data["Temperature"][sl] * data["CellMass"][sl]
        weight_field += data["CellMass"][sl]
    # Re-embed the averaged interior into a full-size array (ghost zones zeroed)
    new_field2 = na.zeros((nx,ny,nz))
    new_field2[1:-1,1:-1,1:-1] = new_field/weight_field
    return new_field2
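The ghost-zone bookkeeping is easier to follow on a plain NumPy array. This standalone sketch applies the same 3x3x3 mass-weighted moving average and strips the one-cell ghost layer, mirroring the loop in the function above (the input arrays here are illustrative, not simulation data):

```python
import numpy as np

def averaged_field(temp, mass):
    """3x3x3 mass-weighted moving average. `temp` and `mass` include one
    ghost zone on every side; the ghost layer is stripped on return."""
    nx, ny, nz = temp.shape
    num = np.zeros((nx - 2, ny - 2, nz - 2))
    den = np.zeros_like(num)
    # Visit all 27 offsets of the stencil and accumulate weighted sums.
    for i in range(3):
        for j in range(3):
            for k in range(3):
                sl = (slice(i, nx - (2 - i)),
                      slice(j, ny - (2 - j)),
                      slice(k, nz - (2 - k)))
                num += temp[sl] * mass[sl]
                den += mass[sl]
    return num / den

temp = np.arange(27.0).reshape(3, 3, 3)  # 3x3x3 grid -> one interior cell
mass = np.ones_like(temp)                # uniform weights -> plain mean
print(averaged_field(temp, mass))        # [[[13.]]] -- the mean of 0..26
```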

Now, once again, we can access AveragedTemperature just like any other field. Note that because it requires ghost zones, this will be a much slower process!

In [12]:
pf = load("IsolatedGalaxy/galaxy0030/galaxy0030")
dd = pf.h.all_data()
(tmin, tmax), (atmin, atmax) = dd.quantities["Extrema"](["Temperature", "AveragedTemperature"])
print tmin, tmax, atmin, atmax
print tmin / atmin, tmax / atmax
20.8445072174 24826104.0 168.973351715 14529859.2676
0.123359731022 1.70862659733

Field Parameters

Field parameters are a mechanism for passing information to derived fields. For instance, you might pass in a vector to use as the basis for a coordinate transformation. yt often uses field parameters like bulk_velocity to identify velocities that should be subtracted off. Here we show how that works:

In [13]:
sp_small = pf.h.sphere("max", (1.0, 'kpc'))
bv = sp_small.quantities["BulkVelocity"]()

sp = pf.h.sphere("max", (0.1, 'mpc'))
rv1 = sp.quantities["Extrema"]("RadialVelocity")

sp.clear_data()
sp.set_field_parameter("bulk_velocity", bv)
rv2 = sp.quantities["Extrema"]("RadialVelocity")

print bv
print rv1
print rv2
[  1850985.29258994  17682121.43107695     37849.37460756]
[(-69582744.411569089, 32330360.014973201)]
[(-73515036.314695209, 25835696.435245823)]
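The effect of bulk_velocity can be reproduced without yt: the radial velocity of a cell is the component of its (bulk-subtracted) velocity along the outward direction from the center. This is a conceptual sketch with made-up positions and velocities, not yt's RadialVelocity field definition:

```python
import numpy as np

def radial_velocity(pos, vel, center, bulk_velocity=np.zeros(3)):
    """Component of (vel - bulk_velocity) along the outward radial
    direction from `center` -- the quantity the bulk_velocity field
    parameter changes."""
    r = pos - center
    rhat = r / np.linalg.norm(r, axis=-1, keepdims=True)
    return np.sum((vel - bulk_velocity) * rhat, axis=-1)

pos = np.array([[1.0, 0.0, 0.0]])
vel = np.array([[10.0, 0.0, 0.0]])
center = np.zeros(3)
print(radial_velocity(pos, vel, center))                            # [10.]
print(radial_velocity(pos, vel, center, np.array([10.0, 0.0, 0.0])))  # [0.]
```

Subtracting the bulk velocity shifts every radial velocity, which is why the extrema in rv1 and rv2 above differ.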

Downloads: 5)_Derived_Fields_and_Profiles.ipynb; 5)_Derived_Fields_and_Profiles_evaluated.ipynb; 5)_Derived_Fields_and_Profiles.py