MultiAgentDecisionProcess
|
DecPOMDP is a simple implementation of DecPOMDPInterface. More...
#include <DecPOMDP.h>
Public Member Functions | |
DecPOMDP () | |
Default constructor. sets RewardType to REWARD and discount to 1.0. More... | |
double | GetDiscount () const |
Returns the discount parameter. More... | |
double | GetDiscountForAgent (Index agentI) const |
Returns the discount parameter. More... | |
reward_t | GetRewardType () const |
Returns the reward type. More... | |
reward_t | GetRewardTypeForAgent (Index agentI) const |
Returns the reward type. More... | |
void | SetDiscount (double d) |
Sets the discount parameter to d. More... | |
void | SetDiscountForAgent (Index agentI, double d) |
Functions needed for POSGInterface: More... | |
void | SetRewardType (reward_t r) |
Sets the reward type to reward_t r. More... | |
void | SetRewardTypeForAgent (Index agentI, reward_t r) |
Sets the reward type to reward_t r. More... | |
std::string | SoftPrint () const |
SoftPrints some information on the DecPOMDP. More... | |
Public Member Functions inherited from DecPOMDPInterface | |
virtual DecPOMDPInterface * | Clone () const =0 |
Returns a pointer to a copy of this class. More... | |
virtual double | GetReward (State *s, JointAction *ja) const =0 |
Function that returns the reward for a state and joint action. More... | |
virtual void | SetReward (State *s, JointAction *ja, double r)=0 |
Function that sets the reward for a state and joint action. More... | |
virtual | ~DecPOMDPInterface () |
Virtual destructor. More... | |
Public Member Functions inherited from POSGInterface | |
virtual double | GetRewardForAgent (Index agentI, State *s, JointAction *ja) const =0 |
Function that returns the reward for a state and joint action. More... | |
virtual void | SetRewardForAgent (Index agentI, State *s, JointAction *ja, double r)=0 |
Function that sets the reward for an agent, state and joint action. More... | |
virtual | ~POSGInterface () |
Virtual destructor. More... | |
Public Member Functions inherited from MultiAgentDecisionProcessInterface | |
virtual const Scope & | GetAllAgentScope () const =0 |
virtual size_t | GetNrAgents () const =0 |
Return the number of agents. More... | |
virtual std::string | GetUnixName () const =0 |
Returns the base part of the problem filename. More... | |
virtual | ~MultiAgentDecisionProcessInterface () |
Destructor. More... | |
Private Attributes | |
double | _m_discount |
The discount parameter. More... | |
reward_t | _m_rewardType |
Do the agents get rewards or costs? More... | |
DecPOMDP is a simple implementation of DecPOMDPInterface.
It defines a couple of functions that relate to the (types of) rewards and discount factor.
Conceptually an MultiAgentDecisionProcess that implements this interface, is a Dec-POMDP: the system is cooperative and there is only 1 reward function.
DecPOMDP::DecPOMDP | ( | ) |
Default constructor. sets RewardType to REWARD and discount to 1.0.
References Globals::REWARD.
|
inlinevirtual |
Returns the discount parameter.
Implements DecPOMDPInterface.
References _m_discount.
Referenced by FactoredDecPOMDPDiscrete::ExportSpuddFile(), and GetDiscountForAgent().
|
inlinevirtual |
|
inlinevirtual |
Returns the reward type.
Implements DecPOMDPInterface.
References _m_rewardType.
Referenced by GetRewardTypeForAgent().
|
virtual |
Sets the discount parameter to d.
Implements DecPOMDPInterface.
Referenced by ProblemFOBSFireFightingFactored::InitializePFFF(), ProblemFireFightingFactored::InitializePFFF(), POMDPFormatParsing::ParserPOMDPFormat_Spirit::dp_SetDiscountParam::operator()(), DPOMDPFormatParsing::ParserDPOMDPFormat_Spirit::dp_SetDiscountParam::operator()(), ParserPOMDPDiscrete::Parse(), ParserTOIDecPOMDPDiscrete::ParseBase(), ProblemDecTiger::ProblemDecTiger(), ProblemDecTigerWithCreaks::ProblemDecTigerWithCreaks(), ProblemFireFighting::ProblemFireFighting(), and SetDiscountForAgent().
|
inlinevirtual |
|
virtual |
Sets the reward type to reward_t r.
At the moment only REWARD is supported.
Implements DecPOMDPInterface.
References Globals::REWARD.
Referenced by DPOMDPFormatParsing::ParserDPOMDPFormat_Spirit::vt_REWARDTOK::operator()(), POMDPFormatParsing::ParserPOMDPFormat_Spirit::vt_REWARDTOK::operator()(), DPOMDPFormatParsing::ParserDPOMDPFormat_Spirit::vt_COSTTOK::operator()(), POMDPFormatParsing::ParserPOMDPFormat_Spirit::vt_COSTTOK::operator()(), ParserPOMDPDiscrete::Parse(), and SetRewardTypeForAgent().
string DecPOMDP::SoftPrint | ( | ) | const |
SoftPrints some information on the DecPOMDP.
Referenced by DecPOMDPDiscrete::SoftPrint(), FactoredDecPOMDPDiscrete::SoftPrint(), and TOIDecPOMDPDiscrete::SoftPrint().
|
private |
The discount parameter.
When agents have different interests (the POSG setting), they may also have different discount factors. For a Dec-POMDP, however, we have one global discount factor (which typically is 1.0 in the finite horizon case).
Referenced by GetDiscount().
|
private |
Do the agents get rewards or costs?
Referenced by GetRewardType().